##// END OF EJS Templates
replace Python standard textwrap by MBCS sensitive one for i18n text...
replace Python standard textwrap by MBCS sensitive one for i18n text Mercurial has problem around text wrapping/filling in MBCS encoding environment, because standard 'textwrap' module of Python can not treat it correctly. It splits byte sequence for one character into two lines. According to unicode specification, "east asian width" classifies characters into: W(ide), N(arrow), F(ull-width), H(alf-width), A(mbiguous) W/N/F/H can be always recognized as 2/1/2/1 bytes in byte sequence, but 'A' can not. Size of 'A' depends on language in which it is used. Unicode specification says: If the context(= language) cannot be established reliably they should be treated as narrow characters by default but many of class 'A' characters are full-width, at least, in Japanese environment. So, this patch treats class 'A' characters as full-width always for safety wrapping. This patch focuses only on MBCS safe-ness, not on writing/printing rule strict wrapping for each languages MBCS sensitive textwrap class is originally implemented by ITO Nobuaki <daydream.trippers@gmail.com>.

File last commit:

r11297:d320e704 default
r11297:d320e704 default
Show More
test-encoding.out
174 lines | 4.8 KiB | text/plain | TextLexer
/ tests / test-encoding.out
Matt Mackall
Add output for encoding test
r3778 adding changesets
adding manifests
adding file changes
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 added 2 changesets with 2 changes to 1 files
Matt Mackall
Add output for encoding test
r3778 (run 'hg update' to get a working copy)
1 files updated, 0 files merged, 0 files removed, 0 files unresolved
% should fail with encoding error
M a
Matt Mackall
Make quoting in test-encoding simpler
r3783 ? latin-1
? latin-1-tag
? utf-8
Matt Mackall
Add output for encoding test
r3778 transaction abort!
rollback completed
Matt Mackall
transactions: avoid late tear-down (issue641)...
r4970 abort: decoding near ' encoded: �': 'ascii' codec can't decode byte 0xe9 in position 20: ordinal not in range(128)!
Matt Mackall
Add output for encoding test
r3778 % these should work
Brendan Cully
Fix up tests for new branch message
r4602 marked working directory as branch �
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg log (ascii)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 5:db5520b4645f
Alexis S. L. Carvalho
log: convert branch names to the local encoding
r3827 branch: ?
tag: tip
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Alexis S. L. Carvalho
log: convert branch names to the local encoding
r3827 summary: latin1 branch
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 4:9cff3c980b58
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: Added tag ? for changeset 770b9b11621d
Matt Mackall
Add output for encoding test
r3778
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 3:770b9b11621d
Matt Mackall
Add output for encoding test
r3778 tag: ?
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Matt Mackall
Make quoting in test-encoding simpler
r3783 summary: utf-8 e' encoded: ?
Matt Mackall
Add output for encoding test
r3778
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 2:0572af48b948
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Matt Mackall
Add output for encoding test
r3778 summary: latin-1 e' encoded: ?
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 1:0e5b7e3f9c4a
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: koi8-r: ????? = u'\u0440\u0442\u0443\u0442\u044c'
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 0:1e78a93102a3
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: latin-1 e': ? = u'\xe9'
Matt Mackall
Add output for encoding test
r3778
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg log (latin-1)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 5:db5520b4645f
Alexis S. L. Carvalho
log: convert branch names to the local encoding
r3827 branch: �
tag: tip
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Alexis S. L. Carvalho
log: convert branch names to the local encoding
r3827 summary: latin1 branch
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 4:9cff3c980b58
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: Added tag � for changeset 770b9b11621d
Matt Mackall
Add output for encoding test
r3778
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 3:770b9b11621d
Matt Mackall
Add output for encoding test
r3778 tag: �
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Matt Mackall
Make quoting in test-encoding simpler
r3783 summary: utf-8 e' encoded: �
Matt Mackall
Add output for encoding test
r3778
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 2:0572af48b948
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Matt Mackall
Add output for encoding test
r3778 summary: latin-1 e' encoded: �
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 1:0e5b7e3f9c4a
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: koi8-r: ����� = u'\u0440\u0442\u0443\u0442\u044c'
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 0:1e78a93102a3
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: latin-1 e': � = u'\xe9'
Matt Mackall
Add output for encoding test
r3778
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg log (utf-8)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 5:db5520b4645f
Alexis S. L. Carvalho
log: convert branch names to the local encoding
r3827 branch: é
tag: tip
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Alexis S. L. Carvalho
log: convert branch names to the local encoding
r3827 summary: latin1 branch
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 4:9cff3c980b58
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: Added tag é for changeset 770b9b11621d
Matt Mackall
Add output for encoding test
r3778
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 3:770b9b11621d
Matt Mackall
Add output for encoding test
r3778 tag: é
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Matt Mackall
Make quoting in test-encoding simpler
r3783 summary: utf-8 e' encoded: é
Matt Mackall
Add output for encoding test
r3778
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 2:0572af48b948
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Matt Mackall
Add output for encoding test
r3778 summary: latin-1 e' encoded: é
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 1:0e5b7e3f9c4a
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: koi8-r: ÒÔÕÔØ = u'\u0440\u0442\u0443\u0442\u044c'
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 0:1e78a93102a3
Matt Mackall
Add output for encoding test
r3778 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: latin-1 e': é = u'\xe9'
Matt Mackall
Add output for encoding test
r3778
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg tags (ascii)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 tip 5:db5520b4645f
? 3:770b9b11621d
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg tags (latin-1)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 tip 5:db5520b4645f
FUJIWARA Katsunori
replace Python standard textwrap by MBCS sensitive one for i18n text...
r11297 � 3:770b9b11621d
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg tags (utf-8)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 tip 5:db5520b4645f
FUJIWARA Katsunori
replace Python standard textwrap by MBCS sensitive one for i18n text...
r11297 é 3:770b9b11621d
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg branches (ascii)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 ? 5:db5520b4645f
Eric Hopper
Change branches to sort 'active' branches first, and add an option to show only active branches.
r4675 default 4:9cff3c980b58 (inactive)
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg branches (latin-1)
FUJIWARA Katsunori
replace Python standard textwrap by MBCS sensitive one for i18n text...
r11297 � 5:db5520b4645f
Eric Hopper
Change branches to sort 'active' branches first, and add an option to show only active branches.
r4675 default 4:9cff3c980b58 (inactive)
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg branches (utf-8)
FUJIWARA Katsunori
replace Python standard textwrap by MBCS sensitive one for i18n text...
r11297 é 5:db5520b4645f
Eric Hopper
Change branches to sort 'active' branches first, and add an option to show only active branches.
r4675 default 4:9cff3c980b58 (inactive)
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg log (utf-8)
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 5:db5520b4645f
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 branch: é
tag: tip
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 summary: latin1 branch
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 4:9cff3c980b58
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: Added tag é for changeset 770b9b11621d
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 3:770b9b11621d
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 tag: é
user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 summary: utf-8 e' encoded: é
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 2:0572af48b948
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 summary: latin-1 e' encoded: é
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 1:0e5b7e3f9c4a
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: koi8-r: ртуть = u'\u0440\u0442\u0443\u0442\u044c'
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 changeset: 0:1e78a93102a3
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835 user: test
Thomas Arendsen Hein
Fixes test-encoding for python2.3 and minor cleanups:...
r3839 date: Mon Jan 12 13:46:40 1970 +0000
summary: latin-1 e': И = u'\xe9'
Alexis S. L. Carvalho
Allow the user to specify the fallback encoding for the changelog...
r3835
Martin Geisler
test-encoding: make test output easier to follow
r9483 % hg log (dolphin)
Matt Mackall
make transcoding more robust...
r3843 abort: unknown encoding: dolphin, please check your locale settings
Alexis S. L. Carvalho
Use UTF-8 in .hg/branch
r3862 abort: decoding near '�': 'ascii' codec can't decode byte 0xe9 in position 0: ordinal not in range(128)!
abort: branch name not in UTF-8!