Opened 10 years ago
Last modified 10 years ago
#24985 closed Cleanup/optimization
Rss201rev2Feed invalid characters in character data for RSS — at Version 1
| Reported by: | Michael Wood | Owned by: | nobody |
|---|---|---|---|
| Component: | Documentation | Version: | 1.7 |
| Severity: | Normal | Keywords: | |
| Cc: | Triage Stage: | Ready for checkin | |
| Has patch: | no | Needs documentation: | no |
| Needs tests: | no | Patch needs improvement: | no |
| Easy pickings: | no | UI/UX: | no |
Description (last modified by )
I have some data which comes from log files that I'd like to put into a RSS feed, unfortunately due to the nature of this data it sometimes contains control characters e.g. \0001 \0003 , this causes it to fail RSS feed reader validation due to these characters (although valid utf-8) are not allowed (1).
I'm not sure if this is something that should be fixed in this module, perhaps in sax/saxutils or in somewhere like django.utils.encoding force_text ?
At the moment I'm working around this issue with a regex which replaces this range of chars.