upstream/ipython Commit - r5752:faab0f8c

Add encoding explanatory comment

Fernando Perez


IPython/core/magic.py

0 +9 0

             # Used for exception handling in magic_edit
             class MacroToEdit(ValueError): pass
+            # Taken from PEP 263, this is the official encoding regexp.
             _encoding_declaration_re = re.compile(r"^#.*coding[:=]\s*([-\w.]+)")
             #***************************************************************************
                     if remote_url:
                         import urllib2
                         fileobj = urllib2.urlopen(arg_s)
+                        # While responses have a .info().getencoding() way of asking for
+                        # their encoding, in *many* cases the return value is bogus.  In
+                        # the wild, servers serving utf-8 but declaring latin-1 are
+                        # extremely common, as the old HTTP standards specify latin-1 as
+                        # the default but many modern filesystems use utf-8.  So we can NOT
+                        # rely on the headers.  Short of building complex encoding-guessing
+                        # logic, going with utf-8 is a simple solution likely to be right
+                        # in most real-world cases.
                         linesource = fileobj.read().decode('utf-8', 'replace').splitlines()
                     else:
                         fileobj = linesource = open(arg_s)
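
The decoding choice described in the comment above can be illustrated with a small standalone sketch. It is not part of the commit: it assumes Python 3 byte strings rather than the Python 2 `urllib2` response used in the diff, and `decode_remote` and the sample bodies are hypothetical names introduced only for illustration. The regexp is copied from the diff.

```python
import re

# Same pattern as in the diff (the PEP 263 style encoding declaration).
_encoding_declaration_re = re.compile(r"^#.*coding[:=]\s*([-\w.]+)")


def decode_remote(raw):
    """Hypothetical helper: decode bytes as UTF-8 and split into lines.

    Undecodable bytes become U+FFFD instead of raising UnicodeDecodeError.
    """
    return raw.decode('utf-8', 'replace').splitlines()


# Sample payloads (assumptions for illustration, not taken from the commit).
utf8_body = "# -*- coding: utf-8 -*-\nname = 'José'\n".encode('utf-8')
latin1_body = "name = 'José'\n".encode('latin-1')

# Common case: the server really sends UTF-8, so the text comes out intact.
lines = decode_remote(utf8_body)

# The regexp picks the declared encoding out of the first line, if present.
match = _encoding_declaration_re.match(lines[0])
declared = match.group(1) if match else None

# Trusting a bogus latin-1 header would silently produce mojibake instead.
mislabeled = utf8_body.decode('latin-1')

# Rare case: real latin-1 content degrades to a replacement character
# rather than aborting the whole read.
fallback = decode_remote(latin1_body)
```

The trade-off is that genuine latin-1 content loses its non-ASCII characters, which the commit comment accepts in exchange for avoiding encoding-guessing logic.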
