Issue 1058351: Blatantly false statement in the Unicode section

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in Python's Developer Guide.

This issue has been migrated to GitHub: https://github.com/python/cpython/issues/41114

classification

Title:	Blatantly false statement in the Unicode section
Type:		Stage:
Components:	Documentation	Versions:

process

Created on 2004-11-01 19:47 by chtito, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Messages (2)
msg22977 - (view)	Author: Olivier Verdier (chtito)	Date: 2004-11-01 19:47
The statement is "you will find it convenient that the lower 256 characters of Unicode are the same as the 256 characters of Latin-1". That is obviously false. Only the first 128 characters (those encoded in the first 7 bits) are the same in the usual one-byte encodings (Latin-x, for instance) and Unicode. The eighth bit is used by Unicode to trigger the multi-byte mechanism, whereas one-byte encodings use it to encode a further 128 characters.
msg22978 - (view)	Author: Hyeshik Chang (hyeshik.chang) *	Date: 2004-11-01 20:22
No. Latin-1 coincides exactly with Unicode for code points 0 through 255. You seem to be confusing Unicode encodings with Unicode itself. In this context, we are talking not about encodings such as UTF-8 but about Python's built-in "unicode" type, which is based on the Unicode character set.
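
The distinction drawn in the reply above can be checked with a short sketch. It assumes Python 3, where the built-in `str` type plays the role that the "unicode" type played when this issue was filed. The variable names are illustrative only and do not come from the documentation under discussion.

```python
# Character set: each Latin-1 byte value maps to the Unicode code point
# with the same number, for all of 0 through 255.
all_bytes = bytes(range(256))
decoded = all_bytes.decode("latin-1")
latin1_matches = all(ord(ch) == i for i, ch in enumerate(decoded))

# Encoding: UTF-8 is one way to serialize code points. Code points of 128
# and above take more than one byte in UTF-8, but still one byte in Latin-1.
e_acute = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE, code point 233
latin1_form = e_acute.encode("latin-1")
utf8_form = e_acute.encode("utf-8")

print(latin1_matches)      # True
print(len(latin1_form))    # 1
print(len(utf8_form))      # 2
```

The multi-byte behaviour described in the original report is a property of UTF-8, not of the Unicode character set, which is why both statements can hold at once.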