upstream/ipython Files · IPython/utils/openpy.py

Improve typing and MIME hook API for inspector (#14342)...

Improve typing and MIME hook API for inspector (#14342) Fixes https://github.com/ipython/ipython/issues/14339 ### Additions Adds `InfoDict` type to improve the typing of `info()` result. Adds missing `"subclasses"` to `info_fields` list (these were added to the field list in https://github.com/ipython/ipython/pull/11486 but we forgot to update `info_fields` variable at the time) - the newly added `InfoDict` type will ensure that this won't happen again. Adds `InspectorHookData` dataclass which is passed to the MIME hooks which now should expect a single argument. Having a single dataclass argument enables us to deprecate individual fields, or add new fields without breaking the existing hooks. The old hooks will still work (if any are out there since this mechanism got just added in the previous point version). ### Deletions A comment over `info_fields` gets deleted: - Contrarily to the comment (which is getting deleted in this PR), `info_fields` were not defining the order of display since at least 2015 (https://github.com/ipython/ipython/pull/7903 - I did not feel the need to go further in the history to find when exactly it happened). - Also contrarily to this comment, current Jupyter messaging spec does not define the contents of `info_fields` (I guess this was lost during IPython/Jupyter split), but the newly added `InfoDict` at least properly annotates their type (if you know where I can find the old IPython messaging spec with the descriptions I can add these as doc comments). Unused `cast_unicode` import gets deleted. If someone imported it from here... well they really should not have. ### Deprecations - mime hooks taking two arguments (`obj, info`)

Matthias Bussonnier - - Load All Authors

File last commit:

r26419:7663c521


                r28661:2084e7f3

Download file

             openpy.py
        
                    105 lines
            
             | 3.3 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / IPython / utils / openpy.py
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      """

      Tools to open .py files as Unicode, using the encoding specified within the file,

      as per PEP 263.

      Much of the code is taken from the tokenize module in Python 3.2.

      """

      import io

      from io import TextIOWrapper, BytesIO

      from pathlib import Path

      import re

      from tokenize import open, detect_encoding

      cookie_re = re.compile(r"coding[:=]\s*([-\w.]+)", re.UNICODE)

      cookie_comment_re = re.compile(r"^\s*#.*coding[:=]\s*([-\w.]+)", re.UNICODE)

      def source_to_unicode(txt, errors='replace', skip_encoding_cookie=True):

          """Converts a bytes string with python source code to unicode.

          Unicode strings are passed through unchanged. Byte strings are checked

          for the python source file encoding cookie to determine encoding.

          txt can be either a bytes buffer or a string containing the source

          code.

          """

          if isinstance(txt, str):

              return txt

          if isinstance(txt, bytes):

              buffer = BytesIO(txt)

          else:

              buffer = txt

          try:

              encoding, _ = detect_encoding(buffer.readline)

          except SyntaxError:

              encoding = "ascii"

          buffer.seek(0)

          with TextIOWrapper(buffer, encoding, errors=errors, line_buffering=True) as text:

              text.mode = 'r'

              if skip_encoding_cookie:

                  return u"".join(strip_encoding_cookie(text))

              else:

                  return text.read()

      def strip_encoding_cookie(filelike):

          """Generator to pull lines from a text-mode file, skipping the encoding

          cookie if it is found in the first two lines.

          """

          it = iter(filelike)

          try:

              first = next(it)

              if not cookie_comment_re.match(first):

                  yield first

              second = next(it)

              if not cookie_comment_re.match(second):

                  yield second

          except StopIteration:

              return

          for line in it:

              yield line

      def read_py_file(filename, skip_encoding_cookie=True):

          """Read a Python file, using the encoding declared inside the file.

          Parameters

          ----------

          filename : str

              The path to the file to read.

          skip_encoding_cookie : bool

              If True (the default), and the encoding declaration is found in the first

              two lines, that line will be excluded from the output.

          Returns

          -------

          A unicode string containing the contents of the file.

          """

          filepath = Path(filename)

          with open(filepath) as f:  # the open function defined in this module.

              if skip_encoding_cookie:

                  return "".join(strip_encoding_cookie(f))

              else:

                  return f.read()

      def read_py_url(url, errors='replace', skip_encoding_cookie=True):

          """Read a Python file from a URL, using the encoding declared inside the file.

          Parameters

          ----------

          url : str

              The URL from which to fetch the file.

          errors : str

              How to handle decoding errors in the file. Options are the same as for

              bytes.decode(), but here 'replace' is the default.

          skip_encoding_cookie : bool

              If True (the default), and the encoding declaration is found in the first

              two lines, that line will be excluded from the output.

          Returns

          -------

          A unicode string containing the contents of the file.

          """

          # Deferred import for faster start

          from urllib.request import urlopen 

          response = urlopen(url)

          buffer = io.BytesIO(response.read())

          return source_to_unicode(buffer, errors, skip_encoding_cookie)

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				"""
				Tools to open .py files as Unicode, using the encoding specified within the file,
				as per PEP 263.

				Much of the code is taken from the tokenize module in Python 3.2.
				"""

				import io
				from io import TextIOWrapper, BytesIO
				from pathlib import Path
				import re
				from tokenize import open, detect_encoding

				cookie_re = re.compile(r"coding[:=]\s*([-\w.]+)", re.UNICODE)
				cookie_comment_re = re.compile(r"^\s#.coding[:=]\s*([-\w.]+)", re.UNICODE)

				def source_to_unicode(txt, errors='replace', skip_encoding_cookie=True):
				"""Converts a bytes string with python source code to unicode.

				Unicode strings are passed through unchanged. Byte strings are checked
				for the python source file encoding cookie to determine encoding.
				txt can be either a bytes buffer or a string containing the source
				code.
				"""
				if isinstance(txt, str):
				return txt
				if isinstance(txt, bytes):
				buffer = BytesIO(txt)
				else:
				buffer = txt
				try:
				encoding, _ = detect_encoding(buffer.readline)
				except SyntaxError:
				encoding = "ascii"
				buffer.seek(0)
				with TextIOWrapper(buffer, encoding, errors=errors, line_buffering=True) as text:
				text.mode = 'r'
				if skip_encoding_cookie:
				return u"".join(strip_encoding_cookie(text))
				else:
				return text.read()

				def strip_encoding_cookie(filelike):
				"""Generator to pull lines from a text-mode file, skipping the encoding
				cookie if it is found in the first two lines.
				"""
				it = iter(filelike)
				try:
				first = next(it)
				if not cookie_comment_re.match(first):
				yield first
				second = next(it)
				if not cookie_comment_re.match(second):
				yield second
				except StopIteration:
				return

				for line in it:
				yield line

				def read_py_file(filename, skip_encoding_cookie=True):
				"""Read a Python file, using the encoding declared inside the file.

				Parameters
				----------
				filename : str
				The path to the file to read.
				skip_encoding_cookie : bool
				If True (the default), and the encoding declaration is found in the first
				two lines, that line will be excluded from the output.

				Returns
				-------
				A unicode string containing the contents of the file.
				"""
				filepath = Path(filename)
				with open(filepath) as f: # the open function defined in this module.
				if skip_encoding_cookie:
				return "".join(strip_encoding_cookie(f))
				else:
				return f.read()

				def read_py_url(url, errors='replace', skip_encoding_cookie=True):
				"""Read a Python file from a URL, using the encoding declared inside the file.

				Parameters
				----------
				url : str
				The URL from which to fetch the file.
				errors : str
				How to handle decoding errors in the file. Options are the same as for
				bytes.decode(), but here 'replace' is the default.
				skip_encoding_cookie : bool
				If True (the default), and the encoding declaration is found in the first
				two lines, that line will be excluded from the output.

				Returns
				-------
				A unicode string containing the contents of the file.
				"""
				# Deferred import for faster start
				from urllib.request import urlopen
				response = urlopen(url)
				buffer = io.BytesIO(response.read())
				return source_to_unicode(buffer, errors, skip_encoding_cookie)