Opened 10 months ago

Last modified 10 months ago

#27778 assigned Cleanup/optimization

Update unicode documentation for Python 3

Reported by: Aymeric Augustin
Owned by: ChillarAnand
Component: Documentation
Version: master
Severity: Normal
Keywords:
Cc:
Triage Stage: Accepted
Has patch: yes
Needs documentation: no
Needs tests: no
Patch needs improvement: yes
Easy pickings: no
UI/UX: no

Description (last modified by Aymeric Augustin)

The "Unicode data" page doesn't make sense for people who started on Python 3 and who have always had the luxury of a str type that works.

Also, it assumes that Python's default encoding is ASCII, whereas it is now UTF-8.

This page should be renamed to "Encoded data" and, instead of explaining how unicode strings are different from (Python 2) regular strings, it should explain how encoded strings are different from (Python 3) regular strings and how external systems deal with them.

There's also a handful of references to "Unicode strings" in other documentation pages that should be changed to just "strings".
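
To illustrate the distinction drawn in this description, here is a minimal sketch using only the Python 3 standard library. The sample text and variable names are made up for illustration and are not taken from the Django documentation or from any patch on this ticket.

```python
import sys

# A regular Python 3 string (str): a sequence of Unicode code points.
text = "caf\u00e9"

# An encoded string (bytes): the form that files, sockets, and other
# external systems actually exchange.
encoded_utf8 = text.encode("utf-8")
encoded_latin1 = text.encode("latin-1")

# The same text has different lengths depending on the representation.
print(len(text))            # number of code points
print(len(encoded_utf8))    # number of bytes in UTF-8
print(len(encoded_latin1))  # number of bytes in Latin-1

# Decoding with the matching encoding recovers the original string.
round_trip = encoded_utf8.decode("utf-8")
print(round_trip == text)

# Decoding with the wrong encoding fails (or silently yields wrong text).
try:
    encoded_latin1.decode("utf-8")
    wrong_decode_failed = False
except UnicodeDecodeError:
    wrong_decode_failed = True
print(wrong_decode_failed)

# Python 3's default string encoding is UTF-8, not ASCII.
default_encoding = sys.getdefaultencoding()
print(default_encoding)
```

The point of the sketch is that a `str` carries no encoding at all; an encoding only comes into play at the boundary where text is turned into `bytes` for an external system, or read back from one.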

Change History (6)

comment:1 Changed 10 months ago by Aymeric Augustin

Description: modified (diff)

comment:2 Changed 10 months ago by Claude Paroz

Triage Stage: Unreviewed → Accepted

This PR replaces the term "unicode" with "string". But as I commented on the PR, the "Unicode data" page does indeed need to be rewritten.

comment:3 Changed 10 months ago by Tim Graham <timograham@…>

In d1bab24:

Refs #23919, #27778 -- Removed obsolete mentions of unicode.

comment:4 Changed 10 months ago by Tim Graham

Summary: Update unicode documentation → Update unicode documentation for Python 3

comment:5 Changed 10 months ago by ChillarAnand

Owner: changed from nobody to ChillarAnand
Status: new → assigned

comment:6 Changed 10 months ago by Tim Graham

Has patch: set
Patch needs improvement: set

A PR starts this but hasn't done much rewriting yet.
