What character encoding does the debugger use for the text that appears when you hover the mouse pointer over a variable?
I'm trying to read in a file written in UTF-8 (the text is Japanese). I have already read in a Shift-JIS file and could confirm with the debugger that the right characters were read. (So I suppose I should assume the debugger is using Shift-JIS.) My understanding is that UTF-8 uses 3 bytes for a character, so when I read the file I call fgetc() 3 times and concatenate the 3 bytes (or integers) to make up one UTF-8 character. Am I wrong here? I'm not sure. When I watch the variable in the debugger, I see garbled text. (Can I just set the debugger's character encoding to UTF-8? If so, please let me know how.)
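For reference, here is a minimal sketch of reading one UTF-8 character at a time. It does not assume a fixed 3 bytes: a UTF-8 character takes 1 to 4 bytes, and the first byte tells how many follow (most Japanese characters happen to take 3). The names utf8_len and read_utf8_char are hypothetical helpers made up for this illustration, not library functions.

```c
#include <stdio.h>

/* Number of bytes in a UTF-8 sequence, judged from its first byte.
   Returns 0 if the byte cannot start a sequence. */
static int utf8_len(unsigned char lead)
{
    if (lead < 0x80) return 1;            /* 0xxxxxxx: ASCII */
    if ((lead & 0xE0) == 0xC0) return 2;  /* 110xxxxx */
    if ((lead & 0xF0) == 0xE0) return 3;  /* 1110xxxx: most Japanese characters */
    if ((lead & 0xF8) == 0xF0) return 4;  /* 11110xxx */
    return 0;                             /* continuation byte or invalid */
}

/* Hypothetical helper: reads one UTF-8 character from fp into buf
   (buf must hold at least 5 bytes) and NUL-terminates it.
   Returns the byte count, 0 at end of file, -1 on malformed input. */
static int read_utf8_char(FILE *fp, char *buf)
{
    int c = fgetc(fp);
    if (c == EOF) return 0;

    int n = utf8_len((unsigned char)c);
    if (n == 0) return -1;
    buf[0] = (char)c;

    for (int i = 1; i < n; i++) {
        c = fgetc(fp);
        /* every following byte must look like 10xxxxxx */
        if (c == EOF || (c & 0xC0) != 0x80) return -1;
        buf[i] = (char)c;
    }
    buf[n] = '\0';
    return n;
}
```

Calling read_utf8_char(fp, buf) in a loop until it returns 0 would walk the file one character at a time. The bytes in buf stay UTF-8 throughout, so a debugger that interprets char arrays as Shift-JIS would probably still display them as garbled text even when they were read correctly.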
I have no plans to handle anything other than UTF-8. Do I still have to convert the UTF-8 values to Shift-JIS ones?
I've written a few things here, but I'm not quite sure about them, so please tell me if I'm saying something strange.
All I want to do is read in UTF-8 characters and then concatenate, compare, and split them, etc.
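As a rough sketch of those operations, assuming the text is kept as ordinary NUL-terminated UTF-8 byte strings: concatenation and equality comparison can be done on the bytes with strcat and strcmp, while splitting has to land on a character boundary. The function utf8_offset below is a hypothetical helper written for this illustration; it finds the byte offset at which the n-th character starts.

```c
#include <string.h>

/* Byte offset of the n-th character (0-based) in the UTF-8 string s,
   or the string length if s has n or fewer characters.
   Continuation bytes (10xxxxxx) never start a character, so they are
   skipped without being counted. */
static size_t utf8_offset(const char *s, size_t n)
{
    size_t i = 0;
    while (s[i] != '\0') {
        if (((unsigned char)s[i] & 0xC0) != 0x80) {
            if (n == 0) break;  /* reached the start of the n-th character */
            n--;
        }
        i++;
    }
    return i;
}
```

To split a string after its first character, for example, one could copy the first utf8_offset(s, 1) bytes into one buffer and the rest into another. Note that strcmp only tells whether two byte sequences are identical; it does not give a Japanese dictionary ordering.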
Sorry, I'm on a smartphone, so I can't post the code.
I concatenated 3 bytes read in from a UTF-8 file and expected to see a Japanese character in the debugger's variable watch.