This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: Blatantly false statement in the Unicode section
Type: Stage:
Components: Documentation Versions:
process
Status: closed Resolution: not a bug
Dependencies: Superseder:
Assigned To: Nosy List: chtito, hyeshik.chang
Priority: normal Keywords:

Created on 2004-11-01 19:47 by chtito, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Messages (2)
msg22977 - (view) Author: Olivier Verdier (chtito) Date: 2004-11-01 19:47
The statement is "you will find it convenient that the lower 256 
characters of Unicode are the same as the 256 characters of Latin
-1". 

That is obviously false. Only the first 128 characters (those 
encoded on the first 7 bits) are the same in usual one byte 
encodings (latin-x, for instance) and unicode. The eighth bit is used 
by unicode to trigger the multi-byte mechanism, whereas it is used 
to encode further 128 characters in one byte encodings.
msg22978 - (view) Author: Hyeshik Chang (hyeshik.chang) * (Python committer) Date: 2004-11-01 20:22
Logged In: YES 
user_id=55188

No. Latin-1 exactly overlaps over Unicode for 0~255.
You seem to confuse Unicode encodings and Unicode itself. 
In the context, we're saying about not encodings such as
UTF-8 but Python's "unicode" built-in type which is based on
Unicode character set.
History
Date User Action Args
2022-04-11 14:56:07adminsetgithub: 41114
2004-11-01 19:47:18chtitocreate