This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: old UnicodeData.txt
Type: Stage:
Components: Unicode Versions: Python 2.3
process
Status: closed Resolution:
Dependencies: Superseder:
Assigned To: lemburg Nosy List: doko, effbot, lemburg, loewis
Priority: normal Keywords:

Created on 2002-12-01 12:54 by doko, last changed 2022-04-10 16:05 by admin. This issue is now closed.

Messages (5)
msg13499 - (view) Author: Matthias Klose (doko) * (Python committer) Date: 2002-12-01 12:54
[submitted at http://bugs.debian.org/171061]

The version mentioned is CVS 021121 HEAD

unicodedata.so is obviously built with older
UnicodeData.txt file,
and does not match new characters introduced in Unicode
3.2.
To fix it, I copied UnicodeData.txt (conveniently
provided by perl-modules
in /usr/share/perl/5.8.0/unicore/UnicodeData.txt) to
the top of
unpacked python2.3 source package, renamed it to
UnicodeData-Latest.txt,
ran "python Tools/unicode/makeunicodedata.py" and then
recompiled
python2.3 package. This should probably be addressed
upstream as well.

before:
>>> import unicodedata
>>> unicodedata.name(u'\u20b0')
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  ValueError: no such name

after:
>>> import unicodedata
>>> unicodedata.name(u'\u20b0')
'GERMAN PENNY SIGN'



msg13500 - (view) Author: Fredrik Lundh (effbot) * (Python committer) Date: 2002-12-01 13:18
Logged In: YES 
user_id=38376

Why is Debian shipping a "python2.3", when
Python 2.3 hasn't been released yet (it's not
even in alpha)?

(the Unicode database in the current Python
CVS has already been updated...)

</F>
msg13501 - (view) Author: Martin v. Löwis (loewis) * (Python committer) Date: 2002-12-01 13:30
Logged In: YES 
user_id=21627

Are you sure you are using the current CVS? In my copy of 
the CVS, renaming to UnicodeData-Latest is not necessary, 
and GERMANY PENNY SIGN is included in the database.

To verify that you use the current CVS, please report the 
value of unicodedata.unidata_version.

Fredrik, Debian has the Python 2.3 package only in 
its "unstable" (and "testing") distribution, see

http://packages.debian.org/unstable/interpreters/python2.3.ht
ml

It is common to provide Debian packages for CVS versions of 
software in "unstable", so that the Debian developers can 
analyse effects of upcoming versions on their software.
msg13502 - (view) Author: Matthias Klose (doko) * (Python committer) Date: 2002-12-01 19:37
Logged In: YES 
user_id=60903

The CVS version (as mentioned) was 021121 HEAD. Closing the
report, as this has changed six days ago.

Frederik: Debian currently has 1.5, 2.1, 2.2 and 2.3, 2.1
beeing the default in the released (stable) distribution,
and 2.2 the default in the "unstable" distribution. I did
put 2.3 in unstable to ease building third party modules
using the new version.
msg13503 - (view) Author: Martin v. Löwis (loewis) * (Python committer) Date: 2002-12-06 10:58
Logged In: YES 
user_id=21627

It may not matter anymore, but...

In the Debian package 2.2.94-1exp1, the changelog indicates
that it was from CVS 021120. In that version, GERMANY PENNY
SIGN is already in the Unicode database.
History
Date User Action Args
2022-04-10 16:05:57adminsetgithub: 37558
2002-12-01 12:54:56dokocreate