upstream/ipython Files · IPython/utils/_process_posix.py

Reset the interactive namespace __warningregistry__ before executing code...

Reset the interactive namespace __warningregistry__ before executing code Fixes . Idea: Right now, people often don't see important warnings when running code in IPython, because (to a first approximation) any given warning will only issue once per session. Blink and you'll miss it! This is a very common contributor to confused emails to numpy-discussion. E.g.: In [5]: 1 / my_array_with_random_contents /home/njs/.user-python2.7-64bit-3/bin/ipython:1: RuntimeWarning: divide by zero encountered in divide #!/home/njs/.user-python2.7-64bit-3/bin/python Out[5]: array([ 1.77073316, -2.29765021, -2.01800811, ..., 1.13871243, -1.08302964, -8.6185091 ]) Oo, right, guess I gotta be careful of those zeros -- thanks, numpy, for giving me that warning! A few days later: In [592]: 1 / some_other_array Out[592]: array([ 3.07735763, 0.50769289, 0.83984078, ..., -0.67563917, -0.85736257, -1.36511271]) Oops, it turns out that this array had a zero in it too, and that's going to bite me later. But no warning this time! The effect of this commit is to make it so that warnings triggered by the code in cell 5 do *not* suppress warnings triggered by the code in cell 592. Note that this only applies to warnings triggered *directly* by code entered interactively -- if somepkg.foo() calls anotherpkg.bad_func() which issues a warning, then this warning will still only be displayed once, even if multiple cells call somepkg.foo(). But if cell 5 and cell 592 both call anotherpkg.bad_func() directly, then both will get warnings. (Important exception: if foo() is defined *interactively*, and calls anotherpkg.bad_func(), then every cell that calls foo() will display the warning again. This is unavoidable without fixes to CPython upstream.) Explanation: Python's warning system has some weird quirks. By default, it tries to suppress duplicate warnings, where "duplicate" means the same warning message triggered twice by the same line of code. This requires determining which line of code is responsible for triggering a warning, and this is controlled by the stacklevel= argument to warnings.warn. Basically, though, the idea is that if foo() calls bar() which calls baz() which calls some_deprecated_api(), then baz() will get counted as being "responsible", and the warning system will make a note that the usage of some_deprecated_api() inside baz() has already been warned about and doesn't need to be warned about again. So far so good. To accomplish this, obviously, there has to be a record of somewhere which line this was. You might think that this would be done by recording the filename:linenumber pair in a dict inside the warnings module, or something like that. You would be wrong. What actually happens is that the warnings module will use stack introspection to reach into baz()'s execution environment, create a global (module-level) variable there named __warningregistry__, and then, inside this dictionary, record just the line number. Basically, it assumes that any given module contains only one line 1, only one line 2, etc., so storing the filename is irrelevant. Obviously for interactive code this is totally wrong -- all cells share the same execution environment and global namespace, and they all contain a new line 1. Currently the warnings module treats these as if they were all the same line. In fact they are not the same line; once we have executed a given chunk of code, we will never see those particular lines again. As soon as a given chunk of code finishes executing, its line number labels become meaningless, and the corresponding warning registry entries become meaningless as well. Therefore, with this patch we delete the __warningregistry__ each time we execute a new block of code.

Thomas Kluyver - - Load All Authors

File last commit:

r17181:c8507215


                r18548:61431d7d

Download file

             _process_posix.py
        
                    225 lines
            
             | 8.7 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / IPython / utils / _process_posix.py
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      """Posix-specific implementation of process utilities.

      This file is only meant to be imported by process.py, not by end-users.

      """

      #-----------------------------------------------------------------------------

      #  Copyright (C) 2010-2011  The IPython Development Team

      #

      #  Distributed under the terms of the BSD License.  The full license is in

      #  the file COPYING, distributed as part of this software.

      #-----------------------------------------------------------------------------

      #-----------------------------------------------------------------------------

      # Imports

      #-----------------------------------------------------------------------------

      from __future__ import print_function

      # Stdlib

      import errno

      import os

      import subprocess as sp

      import sys

      from IPython.external import pexpect

      # Our own

      from ._process_common import getoutput, arg_split

      from IPython.utils import py3compat

      from IPython.utils.encoding import DEFAULT_ENCODING

      #-----------------------------------------------------------------------------

      # Function definitions

      #-----------------------------------------------------------------------------

      def _find_cmd(cmd):

          """Find the full path to a command using which."""

          path = sp.Popen(['/usr/bin/env', 'which', cmd],

                          stdout=sp.PIPE, stderr=sp.PIPE).communicate()[0]

          return py3compat.bytes_to_str(path)

      class ProcessHandler(object):

          """Execute subprocesses under the control of pexpect.

          """

          # Timeout in seconds to wait on each reading of the subprocess' output.

          # This should not be set too low to avoid cpu overusage from our side,

          # since we read in a loop whose period is controlled by this timeout.

          read_timeout = 0.05

          # Timeout to give a process if we receive SIGINT, between sending the

          # SIGINT to the process and forcefully terminating it.

          terminate_timeout = 0.2

          # File object where stdout and stderr of the subprocess will be written

          logfile = None

          # Shell to call for subprocesses to execute

          _sh = None

          @property

          def sh(self):

              if self._sh is None:        

                  self._sh = pexpect.which('sh')

                  if self._sh is None:

                      raise OSError('"sh" shell not found')

              return self._sh

          def __init__(self, logfile=None, read_timeout=None, terminate_timeout=None):

              """Arguments are used for pexpect calls."""

              self.read_timeout = (ProcessHandler.read_timeout if read_timeout is

                                   None else read_timeout)

              self.terminate_timeout = (ProcessHandler.terminate_timeout if

                                        terminate_timeout is None else

                                        terminate_timeout)

              self.logfile = sys.stdout if logfile is None else logfile

          def getoutput(self, cmd):

              """Run a command and return its stdout/stderr as a string.

              Parameters

              ----------

              cmd : str

                A command to be executed in the system shell.

              Returns

              -------

              output : str

                A string containing the combination of stdout and stderr from the

              subprocess, in whatever order the subprocess originally wrote to its

              file descriptors (so the order of the information in this string is the

              correct order as would be seen if running the command in a terminal).

              """

              try:

                  return pexpect.run(self.sh, args=['-c', cmd]).replace('\r\n', '\n')

              except KeyboardInterrupt:

                  print('^C', file=sys.stderr, end='')

          def getoutput_pexpect(self, cmd):

              """Run a command and return its stdout/stderr as a string.

              Parameters

              ----------

              cmd : str

                A command to be executed in the system shell.

              Returns

              -------

              output : str

                A string containing the combination of stdout and stderr from the

              subprocess, in whatever order the subprocess originally wrote to its

              file descriptors (so the order of the information in this string is the

              correct order as would be seen if running the command in a terminal).

              """

              try:

                  return pexpect.run(self.sh, args=['-c', cmd]).replace('\r\n', '\n')

              except KeyboardInterrupt:

                  print('^C', file=sys.stderr, end='')

          def system(self, cmd):

              """Execute a command in a subshell.

              Parameters

              ----------

              cmd : str

                A command to be executed in the system shell.

              Returns

              -------

              int : child's exitstatus

              """

              # Get likely encoding for the output.

              enc = DEFAULT_ENCODING

              # Patterns to match on the output, for pexpect.  We read input and

              # allow either a short timeout or EOF

              patterns = [pexpect.TIMEOUT, pexpect.EOF]

              # the index of the EOF pattern in the list.

              # even though we know it's 1, this call means we don't have to worry if

              # we change the above list, and forget to change this value:

              EOF_index = patterns.index(pexpect.EOF)

              # The size of the output stored so far in the process output buffer.

              # Since pexpect only appends to this buffer, each time we print we

              # record how far we've printed, so that next time we only print *new*

              # content from the buffer.

              out_size = 0

              try:

                  # Since we're not really searching the buffer for text patterns, we

                  # can set pexpect's search window to be tiny and it won't matter.

                  # We only search for the 'patterns' timeout or EOF, which aren't in

                  # the text itself.

                  #child = pexpect.spawn(pcmd, searchwindowsize=1)

                  if hasattr(pexpect, 'spawnb'):

                      child = pexpect.spawnb(self.sh, args=['-c', cmd]) # Pexpect-U

                  else:

                      child = pexpect.spawn(self.sh, args=['-c', cmd])  # Vanilla Pexpect

                  flush = sys.stdout.flush

                  while True:

                      # res is the index of the pattern that caused the match, so we

                      # know whether we've finished (if we matched EOF) or not

                      res_idx = child.expect_list(patterns, self.read_timeout)

                      print(child.before[out_size:].decode(enc, 'replace'), end='')

                      flush()

                      if res_idx==EOF_index:

                          break

                      # Update the pointer to what we've already printed

                      out_size = len(child.before)

              except KeyboardInterrupt:

                  # We need to send ^C to the process.  The ascii code for '^C' is 3

                  # (the character is known as ETX for 'End of Text', see

                  # curses.ascii.ETX).

                  child.sendline(chr(3))

                  # Read and print any more output the program might produce on its

                  # way out.

                  try:

                      out_size = len(child.before)

                      child.expect_list(patterns, self.terminate_timeout)

                      print(child.before[out_size:].decode(enc, 'replace'), end='')

                      sys.stdout.flush()

                  except KeyboardInterrupt:

                      # Impatient users tend to type it multiple times

                      pass

                  finally:

                      # Ensure the subprocess really is terminated

                      child.terminate(force=True)

              # add isalive check, to ensure exitstatus is set:

              child.isalive()

              # We follow the subprocess pattern, returning either the exit status

              # as a positive number, or the terminating signal as a negative

              # number.

              # on Linux, sh returns 128+n for signals terminating child processes on Linux

              # on BSD (OS X), the signal code is set instead

              if child.exitstatus is None:

                  # on WIFSIGNALED, pexpect sets signalstatus, leaving exitstatus=None

                  if child.signalstatus is None:

                      # this condition may never occur,

                      # but let's be certain we always return an integer.

                      return 0

                  return -child.signalstatus

              if child.exitstatus > 128:

                  return -(child.exitstatus - 128)

              return child.exitstatus

      # Make system() with a functional interface for outside use.  Note that we use

      # getoutput() from the _common utils, which is built on top of popen(). Using

      # pexpect to get subprocess output produces difficult to parse output, since

      # programs think they are talking to a tty and produce highly formatted output

      # (ls is a good example) that makes them hard.

      system = ProcessHandler().system

      def check_pid(pid):

          try:

              os.kill(pid, 0)

          except OSError as err:

              if err.errno == errno.ESRCH:

                  return False

              elif err.errno == errno.EPERM:

                  # Don't have permission to signal the process - probably means it exists

                  return True

              raise

          else:

              return True

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				"""Posix-specific implementation of process utilities.

				This file is only meant to be imported by process.py, not by end-users.
				"""

				#-----------------------------------------------------------------------------
				# Copyright (C) 2010-2011 The IPython Development Team
				#
				# Distributed under the terms of the BSD License. The full license is in
				# the file COPYING, distributed as part of this software.
				#-----------------------------------------------------------------------------

				#-----------------------------------------------------------------------------
				# Imports
				#-----------------------------------------------------------------------------
				from __future__ import print_function

				# Stdlib
				import errno
				import os
				import subprocess as sp
				import sys

				from IPython.external import pexpect

				# Our own
				from ._process_common import getoutput, arg_split
				from IPython.utils import py3compat
				from IPython.utils.encoding import DEFAULT_ENCODING

				#-----------------------------------------------------------------------------
				# Function definitions
				#-----------------------------------------------------------------------------

				def _find_cmd(cmd):
				"""Find the full path to a command using which."""

				path = sp.Popen(['/usr/bin/env', 'which', cmd],
				stdout=sp.PIPE, stderr=sp.PIPE).communicate()[0]
				return py3compat.bytes_to_str(path)


				class ProcessHandler(object):
				"""Execute subprocesses under the control of pexpect.
				"""
				# Timeout in seconds to wait on each reading of the subprocess' output.
				# This should not be set too low to avoid cpu overusage from our side,
				# since we read in a loop whose period is controlled by this timeout.
				read_timeout = 0.05

				# Timeout to give a process if we receive SIGINT, between sending the
				# SIGINT to the process and forcefully terminating it.
				terminate_timeout = 0.2

				# File object where stdout and stderr of the subprocess will be written
				logfile = None

				# Shell to call for subprocesses to execute
				_sh = None

				@property
				def sh(self):
				if self._sh is None:
				self._sh = pexpect.which('sh')
				if self._sh is None:
				raise OSError('"sh" shell not found')

				return self._sh

				def __init__(self, logfile=None, read_timeout=None, terminate_timeout=None):
				"""Arguments are used for pexpect calls."""
				self.read_timeout = (ProcessHandler.read_timeout if read_timeout is
				None else read_timeout)
				self.terminate_timeout = (ProcessHandler.terminate_timeout if
				terminate_timeout is None else
				terminate_timeout)
				self.logfile = sys.stdout if logfile is None else logfile

				def getoutput(self, cmd):
				"""Run a command and return its stdout/stderr as a string.

				Parameters
				----------
				cmd : str
				A command to be executed in the system shell.

				Returns
				-------
				output : str
				A string containing the combination of stdout and stderr from the
				subprocess, in whatever order the subprocess originally wrote to its
				file descriptors (so the order of the information in this string is the
				correct order as would be seen if running the command in a terminal).
				"""
				try:
				return pexpect.run(self.sh, args=['-c', cmd]).replace('\r\n', '\n')
				except KeyboardInterrupt:
				print('^C', file=sys.stderr, end='')

				def getoutput_pexpect(self, cmd):
				"""Run a command and return its stdout/stderr as a string.

				Parameters
				----------
				cmd : str
				A command to be executed in the system shell.

				Returns
				-------
				output : str
				A string containing the combination of stdout and stderr from the
				subprocess, in whatever order the subprocess originally wrote to its
				file descriptors (so the order of the information in this string is the
				correct order as would be seen if running the command in a terminal).
				"""
				try:
				return pexpect.run(self.sh, args=['-c', cmd]).replace('\r\n', '\n')
				except KeyboardInterrupt:
				print('^C', file=sys.stderr, end='')

				def system(self, cmd):
				"""Execute a command in a subshell.

				Parameters
				----------
				cmd : str
				A command to be executed in the system shell.

				Returns
				-------
				int : child's exitstatus
				"""
				# Get likely encoding for the output.
				enc = DEFAULT_ENCODING

				# Patterns to match on the output, for pexpect. We read input and
				# allow either a short timeout or EOF
				patterns = [pexpect.TIMEOUT, pexpect.EOF]
				# the index of the EOF pattern in the list.
				# even though we know it's 1, this call means we don't have to worry if
				# we change the above list, and forget to change this value:
				EOF_index = patterns.index(pexpect.EOF)
				# The size of the output stored so far in the process output buffer.
				# Since pexpect only appends to this buffer, each time we print we
				# record how far we've printed, so that next time we only print new
				# content from the buffer.
				out_size = 0
				try:
				# Since we're not really searching the buffer for text patterns, we
				# can set pexpect's search window to be tiny and it won't matter.
				# We only search for the 'patterns' timeout or EOF, which aren't in
				# the text itself.
				#child = pexpect.spawn(pcmd, searchwindowsize=1)
				if hasattr(pexpect, 'spawnb'):
				child = pexpect.spawnb(self.sh, args=['-c', cmd]) # Pexpect-U
				else:
				child = pexpect.spawn(self.sh, args=['-c', cmd]) # Vanilla Pexpect
				flush = sys.stdout.flush
				while True:
				# res is the index of the pattern that caused the match, so we
				# know whether we've finished (if we matched EOF) or not
				res_idx = child.expect_list(patterns, self.read_timeout)
				print(child.before[out_size:].decode(enc, 'replace'), end='')
				flush()
				if res_idx==EOF_index:
				break
				# Update the pointer to what we've already printed
				out_size = len(child.before)
				except KeyboardInterrupt:
				# We need to send ^C to the process. The ascii code for '^C' is 3
				# (the character is known as ETX for 'End of Text', see
				# curses.ascii.ETX).
				child.sendline(chr(3))
				# Read and print any more output the program might produce on its
				# way out.
				try:
				out_size = len(child.before)
				child.expect_list(patterns, self.terminate_timeout)
				print(child.before[out_size:].decode(enc, 'replace'), end='')
				sys.stdout.flush()
				except KeyboardInterrupt:
				# Impatient users tend to type it multiple times
				pass
				finally:
				# Ensure the subprocess really is terminated
				child.terminate(force=True)
				# add isalive check, to ensure exitstatus is set:
				child.isalive()

				# We follow the subprocess pattern, returning either the exit status
				# as a positive number, or the terminating signal as a negative
				# number.
				# on Linux, sh returns 128+n for signals terminating child processes on Linux
				# on BSD (OS X), the signal code is set instead
				if child.exitstatus is None:
				# on WIFSIGNALED, pexpect sets signalstatus, leaving exitstatus=None
				if child.signalstatus is None:
				# this condition may never occur,
				# but let's be certain we always return an integer.
				return 0
				return -child.signalstatus
				if child.exitstatus > 128:
				return -(child.exitstatus - 128)
				return child.exitstatus


				# Make system() with a functional interface for outside use. Note that we use
				# getoutput() from the _common utils, which is built on top of popen(). Using
				# pexpect to get subprocess output produces difficult to parse output, since
				# programs think they are talking to a tty and produce highly formatted output
				# (ls is a good example) that makes them hard.
				system = ProcessHandler().system

				def check_pid(pid):
				try:
				os.kill(pid, 0)
				except OSError as err:
				if err.errno == errno.ESRCH:
				return False
				elif err.errno == errno.EPERM:
				# Don't have permission to signal the process - probably means it exists
				return True
				raise
				else:
				return True