getattr/setattr still ASCII-only, not Unicode - blows up SGMLlib from BeautifulSoup

  • John Nagle

    getattr/setattr still ASCII-only, not Unicode - blows up SGMLlib from BeautifulSoup

    Just noticed, again, that getattr/setattr are ASCII-only, and don't support
    Unicode.

    SGMLlib blows up because of this when faced with a Unicode end tag:

    File "/usr/local/lib/python2.5/sgmllib.py", line 353, in finish_endtag
    method = getattr(self, 'end_' + tag)
    UnicodeEncodeError: 'ascii' codec can't encode character u'\xae'
    in position 46: ordinal not in range(128)

    Should attributes be restricted to ASCII, or is this a bug?

    John Nagle
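
One possible workaround, offered only as a sketch: check that the tag name is pure ASCII before handing it to getattr, and treat anything else as an unknown tag. The names find_end_handler and DemoParser below are hypothetical and are not part of sgmllib or BeautifulSoup; the sketch assumes the handler naming scheme visible in the traceback ('end_' + tag).

```python
def find_end_handler(parser, tag):
    """Return parser.end_<tag> if it exists, else None.

    Hypothetical helper for illustration; sgmllib has no such function.
    """
    try:
        # Python 2 encodes attribute names with the default (ASCII) codec,
        # so screen out non-ASCII tags before getattr can raise.
        tag.encode('ascii')
    except UnicodeError:
        return None
    # A default of None avoids AttributeError for tags with no handler.
    return getattr(parser, 'end_' + tag, None)


class DemoParser(object):
    """Stand-in for a parser subclass with one end-tag handler."""

    def end_b(self):
        return 'closed b'


parser = DemoParser()
known = find_end_handler(parser, 'b')
unknown = find_end_handler(parser, 'i')
non_ascii = find_end_handler(parser, u'b\xae')
```

Here known is the bound end_b method, while unknown and non_ascii are both None, so the caller can fall back to whatever it does for unrecognized end tags instead of failing with UnicodeEncodeError.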