upstream/mercurial-mirror Files · mercurial/pure

encoding: handle UTF-16 internal limit with fromutf8b (issue5031)...

encoding: handle UTF-16 internal limit with fromutf8b (issue5031) Default builds of Python have a Unicode type that isn't actually full Unicode but UTF-16, which encodes non-BMP codepoints to a pair of BMP codepoints with surrogate escaping. Since our UTF-8b hack escaping uses a plane that overlaps with the UTF-16 escaping system, this gets extra complicated. In addition, unichr() for codepoints greater than U+FFFF may not work either. This changes the code to reuse getutf8char to walk the byte string, so we only rely on Python for unpacking our U+DCxx characters.

Matt Mackall -


            r27699:c8d3392f

default

Name	Size	Modified	Last Commit	Author
/ mercurial / pure
__init__.py	Loading ...
base85.py	Loading ...
bdiff.py	Loading ...
diffhelpers.py	Loading ...
mpatch.py	Loading ...
osutil.py	Loading ...
parsers.py	Loading ...

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages