Quantcast
Channel: Active questions tagged utf-8 - Stack Overflow
Browsing all 1142 articles
Browse latest View live

kotlin android bad conversion of UTF-8 characters

I am encountering a basic issue with kotlin android (API 29)import kotlin.text.Charsets.UTF_8var buf = byteArrayOf(0xF0.toByte(), 0xa9.toByte(), 0xbd.toByte(), 0xbe.toByte())var s = String( buf,...

View Article


Generate UTF-8 MD5 hash for Chinese in SQL Server

SELECT LOWER(CONVERT(VARCHAR(32), HASHBYTES('MD5', 'a'), 2));Result : 0cc175b9c0f1b6a831c399e269772661This is OK.SELECT LOWER(CONVERT(VARCHAR(32), HASHBYTES('MD5', '啊'), 2));Result :...

View Article


How can I read a CSV file with UTF-8 code page in SQL bulk insert?

I have a Persian CSV file and I need to read that with SQL bulk into the SQL server:I wrote this bulk:BULK INSERT TEMPFROM 'D:\t1.csv'WITH(FIELDTERMINATOR = ',',ROWTERMINATOR = '\n',CODEPAGE =...

View Article

Download blob to a path using azure blob java sdk with UTF-8 encoded file name

I have blobs in a folder location and sometimes the blob name has some Chinese characters in it. For example, I have blob name as 项目.xlsx. Currently I am using downloadToFile method where am trying to...

View Article

Image may be NSFW.
Clik here to view.

Fatal error: Uncaught mysqli_sql_exception: Incorrect string value: '\xF6'

My hosting provider changed the mysql server default value of the character_set_server and character_set_database system variables from latin1 to utf8mb4.Since then, the file upload functions failes...

View Article


How to convert UTF-8 to US-Ascii in Java

We have a system where customers, mainly European enter texts (in UTF-8) that has to be distributed to different systems, most of them accepting UTF-8, but now we must also distribute the texts to a US...

View Article

String in utf8 format, i need to make it normal text

I have llama model on deepinfra, and its resposes me utf8 string in answer, idk how to parse it into normal format.im tryingFuture<String> getContent(String style, String size, String keywords,...

View Article

PHP multibyte regex not working with UTF-8 [duplicate]

I have UTF-8 string that I want to search for all occurrences of img_(\d+).I have tried original$pattern = '/img_(\d+)/u';preg_match_all($pattern, $text, $matches, PREG_OFFSET_CAPTURE);but it gives me...

View Article


UnicodeDecodeError: 'utf-8' when debugging Python files in PyCharm Community

Current conclusion:The encoding of the converted file is utf-8->utf-8 big->ansi -> utf-8. Reopen the file after each conversion.After observing for a period of time, there is no such...

View Article


Get multibyte character count before match with preg_match()...

I'm trying to search a UTF8-encoded string using preg_match.preg_match('/H/u', "\xC2\xA1Hola!", $a_matches, PREG_OFFSET_CAPTURE);echo $a_matches[0][1];This should print 1, since "H" is at index 1 in...

View Article

Image may be NSFW.
Clik here to view.

How to read .csv file that contains utf-8 values by pandas dataframe

I'm trying to read .csv file that contains utf-8 data in some of its columns. The method of reading is by using pandas dataframe. The code is as following:df = pd.read_csv('Cancer_training.csv',...

View Article

How to find the character which is throwing UnicodeDecodeError when reading...

When reading a file from pandas read_csv , got UnicodeDecodeError.Syntax:df = pd.read_csv("file_name.csv", sep='|')UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 0: invalid start...

View Article

How to decode the UTF-8 response into text in JMeter

I have been trying to do performance testing in JMeter for websocket.I have sent request data in Websocket request-response Sampler and received a UTF-8 encoded response.I want to know a way to decode...

View Article


Convert accented characters to their plain ascii equivalents

I have to convert french characters into english on my php. I've used the following code:iconv("utf-8", "ascii//TRANSLIT", $string);But the result for ËËË was "E"E"E.I don't need that double quote and...

View Article

How to obtain UTF-8 binary representation of an emoji character?

I am trying to print bits of of UTf-8 character. In that case an emoji.According to the the following video: https://www.youtube.com/watch?v=ut74oHojxqothe following code script.py (both from the...

View Article


Anchor mailto subject using Hebrew

Basically my problem is when I’m trying to use mail to in anchor tag, when adding a subject in Hebrew, some of the computers can see it correctly when it opens the outlook mail, but some will see the...

View Article

unicode error when using dask.dataframe.read_csv

I am runnning into the he error UnicodeDecodeError: 'utf-8' codec can't decode byte 0xac in position 0: invalid start byte 2023-09-19 13:04:11,361 - distributed.core - ERROR - Exception while handling...

View Article


Google Cloud - SQL - utf8/ utf8_general_ci flags not working

For testing purposes I am trying to recreate a copy of a live database, this live instance is set up with the flag 'character_set_server utf8'. The databases 'information_schema', 'mysql' and our...

View Article

classic asp character encoding

I'm having a problem with Spanish characters in a classic asp site. A user is able to submit their name/address in a form on an aspx page. The aspx page then does an ajax post to a classic asp page...

View Article

Is UTF8 injective mapping?

We write a C++ application and need to know this:Is UTF8 text encoding an injective mapping from bytes to characters, meaning that every single character (letter...) is encoded in only one way? So,...

View Article
Browsing all 1142 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>