A variable-length, byte-oriented encoding for multi-byte character sets such as Unicode, in which ASCII text encodes to itself and POSIX string manipulation routines work largely as expected.
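A 16-bit Unicode string that encodes ASCII text has every other data byte set to zero, so routines that treat a zero byte as the end of a string stop after the first character. The UTF-8 encoding of the same text contains a zero byte only at the end, as the string terminator.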
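The difference can be seen by comparing the raw bytes of the two encodings. The following is a minimal sketch, not part of the original entry: the sample text "Hello", the choice of little-endian byte order for the 16-bit form, and the helper c_strlen (a hypothetical stand-in for the C strlen routine) are all assumptions made for illustration.

```python
text = "Hello"  # sample ASCII-only text (an arbitrary choice)

# 16-bit encoding: little-endian byte order assumed, no byte order mark.
utf16 = text.encode("utf-16-le")
utf8 = text.encode("utf-8")

# In the 16-bit form, the high byte of every ASCII character is zero.
utf16_zero_count = utf16.count(0)

# ASCII text encodes to itself in UTF-8, so the data holds no zero bytes.
utf8_is_ascii = utf8 == text.encode("ascii")
utf8_zero_count = utf8.count(0)


def c_strlen(buf: bytes) -> int:
    """Hypothetical stand-in for C strlen: bytes before the first zero byte."""
    idx = buf.find(0)
    return len(buf) if idx == -1 else idx


print(list(utf16))               # every second byte is 0
print(list(utf8))                # same byte values as the ASCII text
print(c_strlen(utf16))           # stops after the first character
print(c_strlen(utf8 + b"\x00"))  # reaches the terminator at the end
```

With a terminating zero appended, as a C string would carry, the UTF-8 bytes are measured in full, while the 16-bit form appears to be one byte long.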
Context Categories:
General