UTF-1
| MIME / IANA | ISO-10646-UTF-1 |
|---|---|
| Language(s) | International |
| Current status | Obscure, of mainly historical interest. |
| Classification | Unicode Transformation Format, extended ASCII, variable-width encoding |
| Extends | US-ASCII |
| Transforms / Encodes | ISO/IEC 10646 (Unicode) |
| Succeeded by | UTF-8 |
UTF-1 is an obsolete method of transforming ISO/IEC 10646/Unicode into a stream of bytes. Its design does not provide self-synchronization, which makes searching for substrings and error recovery difficult. It reuses the ASCII printing characters for multi-byte encodings, making it unsuited for some uses (for instance Unix filenames cannot contain the byte value used for forward slash). UTF-1 is also slow to encode or decode due to its use of division and multiplication by a number which is not a power of 2. Due to these issues, it did not gain acceptance and was quickly replaced by UTF-8.