HTMLParser.HTMLParser and derived classes crash when
two tag attributes are glued like:
WIDTH="512"HEIGHT="83"
Example from IDLE:
>>> import HTMLParser
>>> BUGTEXT = """
<table CELLSPACING="0" CELLPADDING="0" BORDER="0"
WIDTH="100%">
<tr><td align="center">
<table CELLSPACING="0" CELLPADDING="0" BORDER="0"
WIDTH="600" BGCOLOR="white">
<tr><td WIDTH="512">
<img SRC="/pict/img83.jpg" WIDTH="512"HEIGHT="83"
BORDER="0" VSPACE="0" HSPACE="0">
</td><td WIDTH="2"></td>
</tr></table></td></tr></table>
"""
>>> parser = HTMLParser.HTMLParser()
>>> parser.feed(BUGTEXT)
>>> parser.close()
Traceback (most recent call last):
File "<pyshell#19>", line 1, in ?
parser.close()
File "C:\Python22\lib\HTMLParser.py", line 112, in close
self.goahead(1)
File "C:\Python22\lib\HTMLParser.py", line 166, in
goahead
self.error("EOF in middle of construct")
File "C:\Python22\lib\HTMLParser.py", line 115, in error
raise HTMLParseError(message, self.getpos())
HTMLParseError: EOF in middle of construct, at line 6,
column 1
|