Charsets, Encodings… How does it work?
Oct 16, 2018
I’ve curated the following really good reads on the concepts of charsets[Unicode], encodings[UTFs(8,16,32), UCS(2,4)], etc.
After reading all these articles (you’ll have to spend a little bit of time :P), you should be in a position where you understand these concepts pretty decently.
(Read in order of mention — you’ll enjoy :P)
- The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)
- Code Points
- Unicode Plane — Read the initial intro.
- Unicode Plane (Basic Multilingual Plane)
- Surrogates Characters
- UTF-8(Intro &Description)
- Difference b/w Unicode & UTF — 8 (part 1)
- Difference b/w Unicode & UTF — 8 (part 2)
How does the Unicode charset & encoding gets affected while doing string manipulations — And an application of all above