This entry is part 1 of 4 in the series Text Encoding and Unicode. Later posts include Unicode vs. UTF-8, When Good Unicode Encoding Goes Bad, and PHP and Unicode. I’ve started to really enjoy Node.js’s Buffer object for byte-level examination of files. For example, if you create a text file with a bit of Unicode in it [...]
astorm
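As a rough illustration of the kind of byte-level inspection the Buffer excerpt describes, here is a minimal sketch. It assumes an in-memory string rather than a file on disk, uses the series' sample word Hyvä as input, and the helper name `hexBytes` is made up for this example; it is not taken from the original post.

```javascript
// Sketch: look at the UTF-8 bytes behind a short string.
// The in-memory sample and the helper name `hexBytes` are illustrative assumptions.
const { Buffer } = require('node:buffer');

function hexBytes(text) {
  // Encode the string as UTF-8 and wrap the resulting bytes in a Buffer
  const buf = Buffer.from(text, 'utf8');
  // Render each byte as a two-digit hexadecimal string
  return Array.from(buf, (byte) => byte.toString(16).padStart(2, '0'));
}

const sample = 'Hyv\u00e4'; // "Hyvä", with ä as the single code point U+00E4

console.log(hexBytes(sample).join(' '));         // 48 79 76 c3 a4
console.log(sample.length);                      // 4 (UTF-16 code units)
console.log(Buffer.byteLength(sample, 'utf8'));  // 5 (bytes)
```

The four-character string takes five bytes because ä falls outside ASCII and UTF-8 encodes it as two bytes. To inspect a real file instead, `fs.readFileSync(path)` called without an encoding argument returns a Buffer that can be examined the same way.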
This entry is part 2 of 4 in the series Text Encoding and Unicode. Earlier posts include Inspecting Bytes with Node.js Buffer Objects. Later posts include When Good Unicode Encoding Goes Bad, and PHP and Unicode. In my last quick tips post I mentioned examining the bytes of a text file that contained the text Hyvä, and getting back the [...]
astorm
This entry is part 3 of 4 in the series Text Encoding and Unicode. Earlier posts include Inspecting Bytes with Node.js Buffer Objects, and Unicode vs. UTF-8. Later posts include PHP and Unicode. So why am I posting so much about Unicode and the word Hyvä? That’s a long story. Whenever I work on a longer piece of writing, part of my [...]
astorm
This entry is part 4 of 4 in the series Text Encoding and Unicode. Earlier posts include Inspecting Bytes with Node.js Buffer Objects, Unicode vs. UTF-8, and When Good Unicode Encoding Goes Bad. This is the most recent post in the series. PHP’s Unicode story is not great. PHP’s strings don’t know anything about [...]
astorm