Some notes on odd characters

Strange characters appear on your screen? It’s mostly a character encoding issue.

First case: you try to display a single-byte character encoding in a multibyte environment Second case: vice versa (a multi-byte encoding in a single-byte environment) First case:

 

Büttermölker

 

the string "Büttermölker" is UTF-8 encoded and displayed on a Iso-8859-1 environment (Iso-8859-1, also known as Latin-1, is pretty much the same as ASCII plus some characters, used in western european languages). The two bytes of "ü" and "ö" are interpreted as "ü" and "ö". check the unicode codepoints here now: Büttermölker

Second case:

 

B��rm��r

 

the string "Büttermölker" is Iso-8859-1 encoded and displayed on a UTF-8 environment. check the unicode codepoints here now: B��rm��r