upstream/ipython Files · IPython/utils/encoding.py

Reset the interactive namespace __warningregistry__ before executing code

Fixes .

Idea:

Right now, people often don't see important warnings when running code in IPython, because (to a first approximation) any given warning will only issue once per session. Blink and you'll miss it! This is a very common contributor to confused emails to numpy-discussion. E.g.:

    In [5]: 1 / my_array_with_random_contents
    /home/njs/.user-python2.7-64bit-3/bin/ipython:1: RuntimeWarning: divide by zero encountered in divide
      #!/home/njs/.user-python2.7-64bit-3/bin/python
    Out[5]:
    array([ 1.77073316, -2.29765021, -2.01800811, ...,  1.13871243,
           -1.08302964, -8.6185091 ])

Oo, right, guess I gotta be careful of those zeros -- thanks, numpy, for giving me that warning!

A few days later:

    In [592]: 1 / some_other_array
    Out[592]:
    array([ 3.07735763,  0.50769289,  0.83984078, ..., -0.67563917,
           -0.85736257, -1.36511271])

Oops, it turns out that this array had a zero in it too, and that's going to bite me later. But no warning this time!

The effect of this commit is to make it so that warnings triggered by the code in cell 5 do *not* suppress warnings triggered by the code in cell 592. Note that this only applies to warnings triggered *directly* by code entered interactively -- if somepkg.foo() calls anotherpkg.bad_func() which issues a warning, then this warning will still only be displayed once, even if multiple cells call somepkg.foo(). But if cell 5 and cell 592 both call anotherpkg.bad_func() directly, then both will get warnings.

(Important exception: if foo() is defined *interactively*, and calls anotherpkg.bad_func(), then every cell that calls foo() will display the warning again. This is unavoidable without fixes to CPython upstream.)

Explanation:

Python's warning system has some weird quirks. By default, it tries to suppress duplicate warnings, where "duplicate" means the same warning message triggered twice by the same line of code. This requires determining which line of code is responsible for triggering a warning, and this is controlled by the stacklevel= argument to warnings.warn. Basically, though, the idea is that if foo() calls bar() which calls baz() which calls some_deprecated_api(), then baz() will get counted as being "responsible", and the warning system will make a note that the usage of some_deprecated_api() inside baz() has already been warned about and doesn't need to be warned about again. So far so good.

To accomplish this, obviously, there has to be a record somewhere of which line this was. You might think that this would be done by recording the filename:linenumber pair in a dict inside the warnings module, or something like that. You would be wrong.

What actually happens is that the warnings module will use stack introspection to reach into baz()'s execution environment, create a global (module-level) variable there named __warningregistry__, and then, inside this dictionary, record just the line number. Basically, it assumes that any given module contains only one line 1, only one line 2, etc., so storing the filename is irrelevant.

Obviously for interactive code this is totally wrong -- all cells share the same execution environment and global namespace, and they all contain a new line 1. Currently the warnings module treats these as if they were all the same line. In fact they are not the same line; once we have executed a given chunk of code, we will never see those particular lines again. As soon as a given chunk of code finishes executing, its line number labels become meaningless, and the corresponding warning registry entries become meaningless as well. Therefore, with this patch we delete the __warningregistry__ each time we execute a new block of code.
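The registry behaviour described above can be reproduced outside IPython. The following is a minimal sketch, not the code from this commit: `run_cells`, `CELLS`, and the `"<cell>"` filename are hypothetical names chosen for illustration, and each "cell" is simply a string executed in one shared namespace. The filter is installed once, before any cell runs, because current CPython versions discard a registry whenever the warning filters change.

```python
import warnings

# Two different "cells" that both issue the same message from their line 2.
CELLS = [
    "import warnings\nwarnings.warn('divide by zero')",
    "import warnings as w\nw.warn('divide by zero')",
]


def run_cells(cells, reset_registry):
    """Run each cell in one shared namespace; return warnings seen per cell.

    Hypothetical helper: ``reset_registry`` mimics the idea of this commit by
    dropping ``__warningregistry__`` from the namespace before every cell.
    """
    namespace = {"__name__": "__main__"}
    counts = []
    with warnings.catch_warnings(record=True) as caught:
        # "default" means: show each (message, category, line number) once
        # per registry. Set it once so the registry is not invalidated later.
        warnings.simplefilter("default")
        for source in cells:
            if reset_registry:
                namespace.pop("__warningregistry__", None)
            seen_before = len(caught)
            exec(compile(source, "<cell>", "exec"), namespace)
            counts.append(len(caught) - seen_before)
    return counts


print(run_cells(CELLS, reset_registry=False))
print(run_cells(CELLS, reset_registry=True))
```

Without the reset, the second cell's warning is treated as a duplicate of the first because both come from "line 2" of the same namespace; with the reset, each cell reports its own warning.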

MinRK

File last commit: r14695:bda2ad7d

r18548:61431d7d

encoding.py | 71 lines | 2.8 KiB | text/x-python | PythonLexer

Annotated revisions in this file:

  r6655   Brandon Parsons  Feedback from pull request #1245
  r6716   Brandon Parsons  saner default encoding mechanism
  r9244   Thomas Kluyver   Miscellaneous docs fixes
  r12444  dhirschfeld      Fix for incorrect default encoding on Windows.
  r14695  MinRK            add prefer_stream arg to utils.encoding.getdefaultencoding...

      # coding: utf-8
      """
      Utilities for dealing with text encodings
      """

      #-----------------------------------------------------------------------------
      #  Copyright (C) 2008-2012  The IPython Development Team
      #
      #  Distributed under the terms of the BSD License.  The full license is in
      #  the file COPYING, distributed as part of this software.
      #-----------------------------------------------------------------------------

      #-----------------------------------------------------------------------------
      # Imports
      #-----------------------------------------------------------------------------
      import sys
      import locale
      import warnings

      # to deal with the possibility of sys.std* not being a stream at all
      def get_stream_enc(stream, default=None):
          """Return the given stream's encoding or a default.

          There are cases where ``sys.std*`` might not actually be a stream, so
          check for the encoding attribute prior to returning it, and return
          a default if it doesn't exist or evaluates as False. ``default``
          is None if not provided.
          """
          if not hasattr(stream, 'encoding') or not stream.encoding:
              return default
          else:
              return stream.encoding

      # Less conservative replacement for sys.getdefaultencoding, that will try
      # to match the environment.
      # Defined here as central function, so if we find better choices, we
      # won't need to make changes all over IPython.
      def getdefaultencoding(prefer_stream=True):
          """Return IPython's guess for the default encoding for bytes as text.

          If prefer_stream is True (default), asks for stdin.encoding first,
          to match the calling Terminal, but that is often None for subprocesses.

          Then fall back on locale.getpreferredencoding(),
          which should be a sensible platform default (that respects LANG environment),
          and finally to sys.getdefaultencoding() which is the most conservative option,
          and usually ASCII on Python 2 or UTF-8 on Python 3.
          """
          enc = None
          if prefer_stream:
              enc = get_stream_enc(sys.stdin)
          if not enc or enc == 'ascii':
              try:
                  # There are reports of getpreferredencoding raising errors
                  # in some cases, which may well be fixed, but let's be conservative here.
                  enc = locale.getpreferredencoding()
              except Exception:
                  pass
          enc = enc or sys.getdefaultencoding()
          # On Windows `cp0` can be returned to indicate that there is no code page.
          # Since cp0 is an invalid encoding return instead cp1252 which is the
          # Western European default.
          if enc == 'cp0':
              # Each fragment ends with a space so the joined message reads correctly.
              warnings.warn(
                  "Invalid code page cp0 detected - using cp1252 instead. "
                  "If cp1252 is incorrect please ensure a valid code page "
                  "is defined for the process.", RuntimeWarning)
              return 'cp1252'
          return enc

      DEFAULT_ENCODING = getdefaultencoding()
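To make the fallback behaviour of `get_stream_enc` and the `cp0` special case more concrete, here is a small sketch that runs without IPython installed. It carries its own copy of `get_stream_enc`; `NoEncodingStream`, `EmptyEncodingStream`, and `is_known_codec` are hypothetical names introduced only for this illustration and are not part of the module.

```python
import codecs
import io


def get_stream_enc(stream, default=None):
    """Standalone copy of the helper above, so the sketch runs without IPython."""
    if not hasattr(stream, 'encoding') or not stream.encoding:
        return default
    return stream.encoding


class NoEncodingStream(object):
    """Hypothetical stand-in for a replaced sys.stdin with no ``encoding``."""


class EmptyEncodingStream(object):
    """Hypothetical stand-in whose ``encoding`` exists but is None."""
    encoding = None


def is_known_codec(name):
    """Return True if Python can resolve ``name`` to a registered codec."""
    try:
        codecs.lookup(name)
    except LookupError:
        return False
    return True


# A real text stream that reports its encoding.
utf8_stream = io.TextIOWrapper(io.BytesIO(), encoding='utf-8')

print(get_stream_enc(NoEncodingStream(), default='fallback'))     # missing attribute
print(get_stream_enc(EmptyEncodingStream(), default='fallback'))  # falsy attribute
print(get_stream_enc(utf8_stream, default='fallback'))            # usable encoding

# 'cp0' cannot be used for encoding or decoding, which is why the module
# substitutes 'cp1252' when it sees that value.
print(is_known_codec('cp0'), is_known_codec('cp1252'))
```

The first two calls fall through to the supplied default, which is the situation `getdefaultencoding` handles by moving on to `locale.getpreferredencoding()` and then `sys.getdefaultencoding()`.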
