upstream/ipython Commit - r3291:6aff23d2

BUG: when given unicode inputs, arg_split should return unicode outputs. Always use utf-8 to encode the string instead of relying on sys.stdin.encoding, which may not be able to accept the full range of Unicode characters. When given unicode strings, arg_split is probably not receiving input from a terminal.

Robert Kern -

r3291:6aff23d2

parent child

Collapse all files

IPython/utils/process.py

0 +8 -2

                 # http://bugs.python.org/issue1170
                 # At least encoding the input when it's unicode seems to help, but there
                 # may be more problems lurking.  Apparently this is fixed in python3.
+                is_unicode = False
                 if isinstance(s, unicode):
-                    s = s.encode(sys.stdin.encoding)
+                    is_unicode = True
+                    s = s.encode('utf-8')
                 lex = shlex.shlex(s, posix=posix)
                 lex.whitespace_split = True
-                return list(lex)
+                tokens = list(lex)
+                if is_unicode:
+                    # Convert the tokens back to unicode.
+                    tokens = [x.decode('utf-8') for x in tokens]
+                return tokens
             def abbrev_cwd():

IPython/utils/tests/test_process.py

0 +3 0

                 """Ensure that argument lines are correctly split like in a shell."""
                 tests = [['hi', ['hi']],
                          [u'hi', [u'hi']],
+                         ['hello there', ['hello', 'there']],
+                         [u'h\N{LATIN SMALL LETTER A WITH CARON}llo', [u'h\N{LATIN SMALL LETTER A WITH CARON}llo']],
+                         ['something "with quotes"', ['something', '"with quotes"']],
                          ]
                 for argstr, argv in tests:
                     nt.assert_equal(arg_split(argstr), argv)

General Comments 0

Write
Preview

You need to be logged in to leave comments. Login now

No TODOs yet

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages