Opened 11 years ago

Closed 11 years ago

#3597 closed (fixed)

UnicodeDecode error when ChoiceField(w=RadioSelect) has Unicode choices

Reported by: Georgi Stanojevski <glisha gmail com> Owned by: Malcolm Tredinnick
Component: Forms Version: master
Severity: Keywords: UnicodeDecodeError RadioSelect
Cc: Triage Stage: Accepted
Has patch: yes Needs documentation: no
Needs tests: no Patch needs improvement: no
Easy pickings: UI/UX:


When I have a form like this, note the choices with unicode data I get an UnicodeDecodeError.

I attached a patch which just replaces the str method with unicode for RadioSelect widgets in django/newforms/ +232.

It seems to work.

class aform(forms.Form):

    def __init__(self, *args, **kwargs):
        super(aform, self).__init__(*args, **kwargs)
        self.fields['aradio'].choices = [('1',u'јуникод'),('2','latin')]

    aradio = forms.ChoiceField(choices=(), widget=RadioSelect())

  f = aform()

I get an UnicodeDecodeError.

exceptions.UnicodeDecodeError                        Traceback (most recent call last)

/home/webstrana/django/kajmakot/<ipython console>

/home/webstrana/django/django_src/django/newforms/ in as_ul(self)
    146     def as_ul(self):
    147         "Returns this form rendered as HTML <li>s -- excluding the <ul></ul>."
--> 148         return self._html_output(u'<li>%(errors)s%(label)s %(field)s%(help_text)s</li>', u'<li>%s</li>', '</li>', u' %s', False)
    150     def as_p(self):

/home/webstrana/django/django_src/django/newforms/ in _html_output(self, normal_row, error_row, row_ender, help_text_html, errors_on_separate_row)
    127                 else:
    128                     help_text = u''
--> 129                 output.append(normal_row % {'errors': bf_errors, 'label': label, 'field': unicode(bf), 'help_text': help_text})
    130         if top_errors:
    131             output.insert(0, error_row % top_errors)

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 80: ordinal not in range(128)

Attachments (1)

unicodedecodenewormsforms.diff (684 bytes) - added by Georgi Stanojevski <glisha gmail com> 11 years ago.
It replaces str with unicode when called on RadioSelect widgets

Download all attachments as: .zip

Change History (5)

Changed 11 years ago by Georgi Stanojevski <glisha gmail com>

It replaces str with unicode when called on RadioSelect widgets

comment:1 Changed 11 years ago by Michael Radziej <mir@…>

#3904 marked as duplicate

comment:2 Changed 11 years ago by Malcolm Tredinnick

Owner: changed from Adrian Holovaty to Malcolm Tredinnick
Triage Stage: UnreviewedAccepted

Since the object returned by RadioSelect.render() is a StrAndUnicode subclass, the change in the proposed patch shouldn't be necessary. Which suggests there is a deeper bug at work. I'm looking into it.

comment:3 Changed 11 years ago by Malcolm Tredinnick

Wow... this did turn out to be the right patch (thanks, Georgi), for a tricky reason: Python treats whatever is returned from a __unicode__ method as a unicode object. So it subsequently was trying to re-encode the result into a string, rather than using the string object we were returning. Forcing the method to return a pure unicode object cures the problem.

comment:4 Changed 11 years ago by Malcolm Tredinnick

Resolution: fixed
Status: newclosed

(In [4924]) Fixed #3597 -- Fixed unicode encoding problem in form rendering. Thanks,
Georgi Stanojevski and Ville S?\195?\164?\195?\164vuori.

Note: See TracTickets for help on using tickets.
Back to Top