This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title: pyexpat produces fals parsing results in CharacterDataHandle
Type: Stage:
Components: XML Versions: Python 2.5
process
Status: closed Resolution: not a bug
Dependencies: Superseder:
Assigned To: Nosy List: einsteinmg, loewis
Priority: normal Keywords:

Created on 2006-09-26 20:34 by einsteinmg, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Files
File name Uploaded Description Edit
test2.xml.bz2 einsteinmg, 2006-09-26 20:34 xml data file to reproduce this bug
xmltest.py einsteinmg, 2006-09-26 20:36 python test program
Messages (2)
msg30011 - (view) Author: Michael Gebetsroither (einsteinmg) Date: 2006-09-26 20:34
hi,

with bigger files pyexpat begins to split up some 
things parsed through CharacterDataHandler.

c:       "root-menu"
pyexpat: "root-me"
         "nu"

c:       "TopLeft"
pyexpat: "TopL"
         "eft"


that strange results are also reproduseable on 
python2.4

greets
msg30012 - (view) Author: Martin v. Löwis (loewis) * (Python committer) Date: 2006-09-27 07:49
Logged In: YES 
user_id=21627

This is not a bug. Instead, applications are expected to be
aware of it, and deal with it accordingly. In general, it is
not possible to provide all character data in a single
callback, since that may exhaust the available address space.

The actual splitting of character data depends on the
internal buffering that Expat performs. If the buffer is
exhausted in the middle of character data, those data are
sent to the application before reading more input.
History
Date User Action Args
2022-04-11 14:56:20adminsetgithub: 44035
2006-09-26 20:34:21einsteinmgcreate