UTF-8

An encoding for multi-byte character schemes such as Unicode, in which ASCII text encodes to itself and POSIX string manipulation routines work largely as expected.

A 16-bit Unicode string that encodes ASCII text will have every other data byte within it set to zero. The UTF-8 encoding of this text will have a zero only at the end.

Context Categories:

General

Back to Dictionary

Subscribe to UTF-8