String.length() doesn't work for "Rolling On the Floor Laughing" : 🤣?

I'm trying to print the first 30 characters of some UTF-8 strings, and notice that Java's String.substring() is returning some funky strings. I've boiled it down to:

I'm expecting "🤣" to be String with length 1, and String.substring to not try to cut it over in the middle. Why is my expectation not met? Java thinks it has length 2.

I'm pretty sure (1 2) the UTF-8 encoding for 🤣 (U+1F923) "Rolling On the Floor Laughing" is:

0xF0 0x9F 0xA4 0xA3

And so I expect this tiny program:

import java.nio.charset.StandardCharsets;public class Foo {  public static void main(String[] args){    String str = "🤣";    // These are the UTF-8 bytes for "ROLLING ON THE FLOOR LAUGHING"    byte[] raw = {(byte)0xf0, (byte)0x9f, (byte)0xa4, (byte)0xa3};    String str2 = new String(raw, StandardCharsets.UTF_8);    System.out.println(str.equals(str2));    System.out.println(str.length());    System.out.println(str.substring(0,1));  }}

To print out:

true1🤣

But in fact it prints out:

true2?

Am I doing something wrong?

I've tried an custom java 11.0.20.1 build and these standard Ubuntu packages with the same results:

$ javac -versionjavac 19.0.2$ java -versionopenjdk version "19.0.2" 2023-01-17OpenJDK Runtime Environment (build 19.0.2+7-Ubuntu-0ubuntu322.04)OpenJDK 64-Bit Server VM (build 19.0.2+7-Ubuntu-0ubuntu322.04, mixed mode, sharing)

python3 does what I expect:

$ python3 -c 'print(len("🤣"))'1$ python3 -c 'print("🤣"[0])'🤣

String.length() doesn't work for "Rolling On the Floor Laughing" : 🤣?

Trending Articles

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Practice Sheet of Right form of verbs for HSC Students

Woman stabbed 12 times and dumped in ditch

リッチテキストフォーマットのメッセージに返信すると NDR が配信される

Windows Update / Microsoft Update の接続先 URL について

Shopping Status

Installation Joomla! 5.x • Fatal error 4.4.3 to 5

Diddy’s son Quincy Brown reveals truth about being adopted by hip hop legend

Gracie Abrams – I miss you, I’m sorry – Single [iTunes Plus M4A]

Azure Adhoc RMS licenses User is not able to open right protected messages

Download:Sergeant B ft Bon’zee Baby – Never believe any one(Prod by Doco)

Neem Baba Extra Questions Answer Class 6 English Poorvi

Bureau of Internal Revenue: Regional Offices (Directory)

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

[GET] Michael J. Gelb – Genius Mastery ($497.00)

Seven offenders dealt with by magistrates in Grimsby

Man wanted in Assault/Extortion investigation Siaka Camara, 25

update not allowed for field 'Party ID(PartyNumber) in D365 AX

Ra Rakumara lyrucs and translation | GAV / Govindhudu andhari vadele (2014)

Isle of Man property sales, September 3, 2015