Text Encoding and Unicode

Below you'll find all the Text Encoding and Unicode articles on the site.

Inspecting Bytes with Node.js Buffer Objects

This entry is part 1 of 4 in the series Text Encoding and Unicode. Later posts include Unicode vs. UTF-8, When Good Unicode Encoding Goes Bad, and PHP and Unicode.

I’ve started to really enjoy Node.js’s Buffer object for byte-level examination of files. For example, if you create a text file with a bit of Unicode in it [...]

astorm

Unicode vs. UTF-8

This entry is part 2 of 4 in the series Text Encoding and Unicode. Earlier posts include Inspecting Bytes with Node.js Buffer Objects. Later posts include When Good Unicode Encoding Goes Bad and PHP and Unicode.

In my last quick tips post I mentioned examining the bytes of a text file that contained the text Hyvä, and getting back the [...]

astorm

When Good Unicode Encoding Goes Bad

This entry is part 3 of 4 in the series Text Encoding and Unicode. Earlier posts include Inspecting Bytes with Node.js Buffer Objects and Unicode vs. UTF-8. Later posts include PHP and Unicode.

So why am I posting so much about Unicode and the word Hyvä? That’s a long story. Whenever I work on a longer piece of writing, part of my [...]

astorm

PHP and Unicode

This entry is part 4 of 4 in the series Text Encoding and Unicode. Earlier posts include Inspecting Bytes with Node.js Buffer Objects, Unicode vs. UTF-8, and When Good Unicode Encoding Goes Bad. This is the most recent post in the series.

PHP’s Unicode story is not great. PHP’s strings don’t know anything about [...]

astorm
