Charsets, Encodings… How does it work?

Pankaj Singhal
Oct 16, 2018

I’ve curated the following really good reads on the concepts of charsets (Unicode), encodings (UTF-8, UTF-16, UTF-32, UCS-2, UCS-4), etc.

After reading all these articles (you’ll have to spend a little bit of time :P), you should understand these concepts pretty well.

(Read them in the order listed; you’ll enjoy it :P)

  1. The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)
  2. Code Points
  3. Unicode Plane — Read the initial intro.
  4. Unicode Plane (Basic Multilingual Plane)
  5. Surrogate Characters
  6. UTF-8 (Intro & Description)
  7. Difference between Unicode & UTF-8 (part 1)
  8. Difference between Unicode & UTF-8 (part 2)
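
If you want to poke at these ideas while reading, here is a minimal Python sketch of how one character maps to a code point, a plane, and its UTF-8/UTF-16/UTF-32 forms. The function name `describe` and the shape of the returned dict are my own choices for illustration; they don't come from any of the articles above.

```python
def describe(ch: str) -> dict:
    """Summarize how a single character is represented (illustrative helper)."""
    cp = ord(ch)                     # the Unicode code point as an integer
    utf16 = ch.encode("utf-16-be")   # big-endian, so no byte order mark is added
    return {
        "code_point": f"U+{cp:04X}",
        "plane": cp >> 16,           # plane 0 is the Basic Multilingual Plane
        "utf8_bytes": ch.encode("utf-8").hex(),
        # Split the UTF-16 bytes into 16-bit code units; two units means a surrogate pair.
        "utf16_units": [utf16[i:i + 2].hex() for i in range(0, len(utf16), 2)],
        "utf32_bytes": ch.encode("utf-32-be").hex(),
    }


if __name__ == "__main__":
    # "A", the euro sign (U+20AC), and an emoji outside the BMP (U+1F600).
    for ch in ["A", "\u20ac", "\U0001F600"]:
        print(describe(ch))
```

Characters in plane 0 fit in one UTF-16 unit, while anything above U+FFFF shows up as two units, which is the surrogate pair that reads 4 and 5 talk about.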

How do the Unicode charset & encoding get affected while doing string manipulations? An application of all of the above
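
As a rough illustration of why this matters, here is a small Python sketch showing that the "length" of a string depends on what you count, and that cutting a string at a byte boundary can split a character. The helper names `lengths` and `truncate_utf8` are hypothetical, made up for this example, and Python's `str` being a sequence of code points is specific to Python; other languages count differently.

```python
def lengths(text: str) -> dict:
    """Count the same string three ways (illustrative helper)."""
    return {
        "code_points": len(text),                           # Python str indexes by code point
        "utf8_bytes": len(text.encode("utf-8")),            # 1 to 4 bytes per code point
        "utf16_units": len(text.encode("utf-16-le")) // 2,  # 1 or 2 units per code point
    }


def truncate_utf8(text: str, max_bytes: int) -> str:
    """Cut text to at most max_bytes of UTF-8 without leaving a broken character."""
    data = text.encode("utf-8")[:max_bytes]
    # errors="ignore" drops a partial multi-byte sequence left at the cut point.
    return data.decode("utf-8", errors="ignore")


if __name__ == "__main__":
    s = "h\u00e9llo \U0001F600"     # "héllo" followed by a space and an emoji
    print(lengths(s))
    print(truncate_utf8("h\u00e9llo", 2))   # the cut lands in the middle of "é"
```

The same string has three different sizes here, so any code that slices, pads, or limits strings needs to be clear about which unit it is working in.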
