upstream/mercurial-mirror Commit - r26878:d7e83f10

encoding: use getutf8char in toutf8b...

Matt Mackall -

r26878:d7e83f10 default

parent child

mercurial/encoding.py

0 +10 -7

                      s.decode('utf-8')
                      return s
                  except UnicodeDecodeError:
-                     # surrogate-encode any characters that don't round-trip
-                     s2 = s.decode('utf-8', 'ignore').encode('utf-8')
+                     pass
-                     r = ""
-                     pos = 0
-                     for c in s:
-                         if s2[pos:pos + 1] == c:
+                 l = len(s)
+                 while pos < l:
+                     try:
+                         c = getutf8char(s, pos)
+                         pos += len(c)
+                     except UnicodeDecodeError:
+                         c = unichr(0xdc00 + ord(s[pos])).encode('utf-8')
+                         pos += 1
-                             r += c
-                             pos += 1
-                         else:
-                             r += unichr(0xdc00 + ord(c)).encode('utf-8')
-                     return r
              def fromutf8b(s):

General Comments 0

You need to be logged in to leave comments. Login now

No TODOs yet

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages