File last commit:

r27290:af9d4242
r28025:73fd29a1
inputsplitter.py
772 lines | 27.5 KiB | text/x-python | PythonLexer
Thomas Kluyver
Mark inputsplitter & inputtransformer as deprecated
r24177 """DEPRECATED: Input handling and transformation machinery.
This module was deprecated in IPython 7.0, in favour of inputtransformer2.
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Thomas Kluyver
Update inputsplitter docstring
r13890 The first class in this module, :class:`InputSplitter`, is designed to tell when
input from a line-oriented frontend is complete and should be executed, and when
the user should be prompted for another line of code instead. The name 'input
splitter' is largely for historical reasons.
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Fernando Perez
Final cleanups responding to Brian's code review....
r2782 A companion, :class:`IPythonInputSplitter`, provides the same functionality but
with full support for the extended IPython syntax (magics, system calls, etc).
Thomas Kluyver
Update inputsplitter docstring
r13890 The code to actually do these transformations is in :mod:`IPython.core.inputtransformer`.
:class:`IPythonInputSplitter` feeds the raw code to the transformers in order
and stores the results.
Fernando Perez
Final cleanups responding to Brian's code review....
r2782
Thomas Kluyver
Update inputsplitter docstring
r13890 For more details, see the class docstrings below.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 """
Matthias Bussonnier
add docstring, and emit deprecation warnings
r24406 from warnings import warn
warn('IPython.core.inputsplitter is deprecated since IPython 7 in favor of `IPython.core.inputtransformer2`',
DeprecationWarning)
MinRK
always pass single lines to transform.push...
r17813 # Copyright (c) IPython Development Team.
# Distributed under the terms of the Modified BSD License.
Thomas Kluyver
Fix up so tests pass again. input_splitter now uses ast module instead of compiler, bringing it closer to the Python 3 implementation.
r3454 import ast
Fernando Perez
Created tool for handling interactive input blocks....
r2628 import codeop
Thomas Kluyver
Calculate indentation based on tokens, not regexes...
r23331 import io
Fernando Perez
Created tool for handling interactive input blocks....
r2628 import re
import sys
Thomas Kluyver
Calculate indentation based on tokens, not regexes...
r23331 import tokenize
Min RK
SyntaxWarning in compile indicates invalid input
r18850 import warnings
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Thomas Kluyver
Update inputsplitter to use new input transformers
r10093 from IPython.core.inputtransformer import (leading_indent,
classic_prompt,
ipy_prompt,
cellmagic,
Thomas Kluyver
Prototype transformer to assemble logical lines
r10105 assemble_logical_lines,
Thomas Kluyver
Update inputsplitter to use new input transformers
r10093 help_end,
Thomas Kluyver
Simplify input transformers...
r10107 escaped_commands,
Thomas Kluyver
Update inputsplitter to use new input transformers
r10093 assign_from_magic,
assign_from_system,
Thomas Kluyver
Revised input transformation framework.
r10106 assemble_python_lines,
Thomas Kluyver
Update inputsplitter to use new input transformers
r10093 )
Thomas Kluyver
Remove unused imports from IPython.core
r11124 # These are available in this module for backwards compatibility.
Thomas Kluyver
Update inputsplitter to use new input transformers
r10093 from IPython.core.inputtransformer import (ESC_SHELL, ESC_SH_CAP, ESC_HELP,
ESC_HELP2, ESC_MAGIC, ESC_MAGIC2,
ESC_QUOTE, ESC_QUOTE2, ESC_PAREN, ESC_SEQUENCES)
Matthias BUSSONNIER
take #!%... prefix into account for completion...
r7554
Fernando Perez
Completed first pass of inputsplitter with IPython syntax....
r2780 #-----------------------------------------------------------------------------
Fernando Perez
Created tool for handling interactive input blocks....
r2628 # Utilities
#-----------------------------------------------------------------------------
Fernando Perez
Completed first pass of inputsplitter with IPython syntax....
r2780 # FIXME: These are general-purpose utilities that later can be moved to the
# general ward. Kept here for now because we're being very strict about test
# coverage with this code, and this lets us ensure that we keep 100% coverage
# while developing.
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633
Fernando Perez
Created tool for handling interactive input blocks....
r2628 # compiled regexps for autoindent management
David Warde-Farley
Fix behaviour of dedent triggering. Closes gh-142.
r3704 dedent_re = re.compile('|'.join([
r'^\s+raise(\s.*)?$', # raise statement (+ space + other stuff, maybe)
r'^\s+raise\([^\)]*\).*$', # wacky raise with immediate open paren
r'^\s+return(\s.*)?$', # normal return (+ space + other stuff, maybe)
r'^\s+return\([^\)]*\).*$', # wacky return with immediate open paren
Aaron Meurer
`yield`, `break`, and `continue` automatically dedent...
r7824 r'^\s+pass\s*$', # pass (optionally followed by trailing spaces)
r'^\s+break\s*$', # break (optionally followed by trailing spaces)
r'^\s+continue\s*$', # continue (optionally followed by trailing spaces)
David Warde-Farley
Fix behaviour of dedent triggering. Closes gh-142.
r3704 ]))
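As a rough illustration of what the dedent pattern above accepts, the sketch below rebuilds the same regular expression in a standalone script and probes it with a few lines. The helper name `triggers_dedent` and the sample inputs are assumptions made for illustration; they are not part of the module or its test suite.

```python
import re

# Same alternation as dedent_re above, rebuilt so the snippet runs on its own.
dedent_re = re.compile('|'.join([
    r'^\s+raise(\s.*)?$',
    r'^\s+raise\([^\)]*\).*$',
    r'^\s+return(\s.*)?$',
    r'^\s+return\([^\)]*\).*$',
    r'^\s+pass\s*$',
    r'^\s+break\s*$',
    r'^\s+continue\s*$',
]))


def triggers_dedent(line):
    """Hypothetical helper: True if the line matches the dedent pattern."""
    return dedent_re.match(line) is not None


print(triggers_dedent('    return x'))      # True: indented return
print(triggers_dedent('    returned = 1'))  # False: 'returned' is just a name
print(triggers_dedent('return x'))          # False: the pattern needs leading whitespace
```

Every alternative starts with `^\s+`, so a flush-left statement never matches: there is no enclosing block to dedent from.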
Fernando Perez
Created tool for handling interactive input blocks....
r2628 ini_spaces_re = re.compile(r'^([ \t\r\f\v]+)')
Fernando Perez
Fix bug where 'if 1:' was being added to comment-only code....
r2979 # regexp to match pure comment lines so we don't accidentally insert 'if 1:'
# before pure comments
Matthias Bussonnier
Fix more escape sequences
r24452 comment_line_re = re.compile(r'^\s*\#')
Fernando Perez
Fix bug where 'if 1:' was being added to comment-only code....
r2979
Fernando Perez
Created tool for handling interactive input blocks....
r2628
def num_ini_spaces(s):
"""Return the number of initial spaces in a string.
Note that tabs are counted as a single space. For now, we do *not* support
mixing of tabs and spaces in the user's input.
Parameters
----------
s : string
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663
Returns
-------
n : int
Fernando Perez
Created tool for handling interactive input blocks....
r2628 """
ini_spaces = ini_spaces_re.match(s)
if ini_spaces:
return ini_spaces.end()
else:
return 0
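The note about tabs is easy to misread, so here is a small standalone sketch that copies the regular expression and function above and shows that each leading whitespace character, tab or space, counts as one. The sample strings are illustrative assumptions.

```python
import re

# Copied from the module so the snippet is self-contained.
ini_spaces_re = re.compile(r'^([ \t\r\f\v]+)')


def num_ini_spaces(s):
    """Count leading whitespace characters; a tab counts as one."""
    match = ini_spaces_re.match(s)
    return match.end() if match else 0


print(num_ini_spaces('    x = 1'))  # 4
print(num_ini_spaces('\tx = 1'))    # 1 (the tab is not expanded)
print(num_ini_spaces('x = 1'))      # 0
```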
Thomas Kluyver
Calculate indentation based on tokens, not regexes...
r23331 # Fake token types for partial_tokenize:
INCOMPLETE_STRING = tokenize.N_TOKENS
IN_MULTILINE_STATEMENT = tokenize.N_TOKENS + 1
# The 2 classes below have the same API as TokenInfo, but don't try to look up
# a token type name that they won't find.
class IncompleteString:
type = exact_type = INCOMPLETE_STRING
def __init__(self, s, start, end, line):
self.s = s
self.start = start
self.end = end
self.line = line
class InMultilineStatement:
type = exact_type = IN_MULTILINE_STATEMENT
def __init__(self, pos, line):
self.s = ''
self.start = self.end = pos
self.line = line
def partial_tokens(s):
"""Iterate over tokens from a possibly-incomplete string of code.
This adds two special token types: INCOMPLETE_STRING and
IN_MULTILINE_STATEMENT. These can only occur as the last token yielded, and
represent the two main ways for code to be incomplete.
"""
readline = io.StringIO(s).readline
token = tokenize.TokenInfo(tokenize.NEWLINE, '', (1, 0), (1, 0), '')
try:
for token in tokenize.generate_tokens(readline):
yield token
except tokenize.TokenError as e:
# catch EOF error
lines = s.splitlines(keepends=True)
end = len(lines), len(lines[-1])
if 'multi-line string' in e.args[0]:
l, c = start = token.end
s = lines[l-1][c:] + ''.join(lines[l:])
yield IncompleteString(s, start, end, lines[-1])
elif 'multi-line statement' in e.args[0]:
yield InMultilineStatement(end, lines[-1])
else:
raise
def find_next_indent(code):
"""Find the number of spaces for the next line of indentation"""
tokens = list(partial_tokens(code))
if tokens[-1].type == tokenize.ENDMARKER:
tokens.pop()
if not tokens:
return 0
Matthias Bussonnier
Fix detection of indentation in nested context....
r23344 while (tokens[-1].type in {tokenize.DEDENT, tokenize.NEWLINE, tokenize.COMMENT}):
Thomas Kluyver
Calculate indentation based on tokens, not regexes...
r23331 tokens.pop()
if tokens[-1].type == INCOMPLETE_STRING:
# Inside a multiline string
return 0
# Find the indents used before
prev_indents = [0]
def _add_indent(n):
if n != prev_indents[-1]:
prev_indents.append(n)
tokiter = iter(tokens)
for tok in tokiter:
if tok.type in {tokenize.INDENT, tokenize.DEDENT}:
_add_indent(tok.end[1])
elif (tok.type == tokenize.NL):
try:
_add_indent(next(tokiter).start[1])
except StopIteration:
break
last_indent = prev_indents.pop()
Thomas Kluyver
Add comment
r23333 # If we've just opened a multiline statement (e.g. 'a = ['), indent more
Thomas Kluyver
Calculate indentation based on tokens, not regexes...
r23331 if tokens[-1].type == IN_MULTILINE_STATEMENT:
if tokens[-2].exact_type in {tokenize.LPAR, tokenize.LSQB, tokenize.LBRACE}:
return last_indent + 4
return last_indent
if tokens[-1].exact_type == tokenize.COLON:
# Line ends with colon - indent
return last_indent + 4
if last_indent:
# Examine the last line for dedent cues - statements like return or
# raise which normally end a block of code.
last_line_starts = 0
for i, tok in enumerate(tokens):
if tok.type == tokenize.NEWLINE:
last_line_starts = i + 1
last_line_tokens = tokens[last_line_starts:]
names = [t.string for t in last_line_tokens if t.type == tokenize.NAME]
if names and names[0] in {'raise', 'return', 'pass', 'break', 'continue'}:
# Find the most recent indentation less than the current level
for indent in reversed(prev_indents):
if indent < last_indent:
return indent
return last_indent
Fernando Perez
First working version of cell magics in inputsplitter in line mode....
r6978 def last_blank(src):
"""Determine if the input source ends in a blank.
A blank is either a newline or a line consisting of whitespace.
Parameters
----------
src : string
Matthias Bussonnier
reformat all of core
r27290 A single or multiline string.
Fernando Perez
First working version of cell magics in inputsplitter in line mode....
r6978 """
Fernando Perez
Add a few more fixes to cell/line input code, switch approaches....
r6981 if not src: return False
ll = src.splitlines()[-1]
return (ll == '') or ll.isspace()
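A short standalone sketch of the behaviour of `last_blank`, copying the function body above. The inputs are assumptions chosen to show one detail that is easy to miss: a single trailing newline after code does not count as a blank, because `splitlines()` drops it.

```python
def last_blank(src):
    """Copy of the function above, reformatted to run on its own."""
    if not src:
        return False
    ll = src.splitlines()[-1]
    return (ll == '') or ll.isspace()


print(last_blank('x = 1\n'))      # False: the last line is 'x = 1'
print(last_blank('x = 1\n\n'))    # True: the last line is empty
print(last_blank('x = 1\n   '))   # True: the last line is whitespace only
```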
Fernando Perez
Working implementation of cell mode with regular expressions - cleaner.
r6979
Fernando Perez
Add a few more fixes to cell/line input code, switch approaches....
r6981 last_two_blanks_re = re.compile(r'\n\s*\n\s*$', re.MULTILINE)
last_two_blanks_re2 = re.compile(r'.+\n\s*\n\s+$', re.MULTILINE)
Fernando Perez
Working implementation of cell mode with regular expressions - cleaner.
r6979
def last_two_blanks(src):
"""Determine if the input source ends in two blanks.
A blank is either a newline or a line consisting of whitespace.
Parameters
----------
src : string
Matthias Bussonnier
reformat all of core
r27290 A single or multiline string.
Fernando Perez
Working implementation of cell mode with regular expressions - cleaner.
r6979 """
Fernando Perez
Add a few more fixes to cell/line input code, switch approaches....
r6981 if not src: return False
# The logic here is tricky: I couldn't get a regexp to work and pass all
# the tests, so I took a different approach: split the source by lines,
# grab the last two and prepend '###\n' as a stand-in for whatever was in
# the body before the last two lines. Then, with that structure, it's
# possible to analyze with two regexps. Not the most elegant solution, but
# it works. If anyone tries to change this logic, make sure to validate
# the whole test suite first!
new_src = '\n'.join(['###\n'] + src.splitlines()[-2:])
return (bool(last_two_blanks_re.match(new_src)) or
bool(last_two_blanks_re2.match(new_src)) )
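Because the comment above warns that this logic is fragile, a standalone sketch may help when experimenting with it. It copies the two patterns and the function as written; the sample inputs are assumptions for illustration, not cases from the IPython test suite.

```python
import re

# Copied from the module so the snippet is self-contained.
last_two_blanks_re = re.compile(r'\n\s*\n\s*$', re.MULTILINE)
last_two_blanks_re2 = re.compile(r'.+\n\s*\n\s+$', re.MULTILINE)


def last_two_blanks(src):
    if not src:
        return False
    # '###\n' stands in for whatever preceded the last two lines.
    new_src = '\n'.join(['###\n'] + src.splitlines()[-2:])
    return (bool(last_two_blanks_re.match(new_src)) or
            bool(last_two_blanks_re2.match(new_src)))


print(last_two_blanks('x = 1\n'))      # False: no blank line
print(last_two_blanks('x = 1\n\n'))    # False: only one blank line
print(last_two_blanks('x = 1\n\n\n'))  # True: two blank lines
```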
Fernando Perez
First working version of cell magics in inputsplitter in line mode....
r6978
Fernando Perez
Created tool for handling interactive input blocks....
r2628 def remove_comments(src):
"""Remove all comments from input source.
Note: string literals are NOT recognized, so a '#' inside a string is treated as the start of a comment!
Parameters
----------
src : string
Matthias Bussonnier
reformat all of core
r27290 A single or multiline input string.
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Returns
-------
String with all Python comments removed.
"""
return re.sub('#.*', '', src)
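The caveat in the docstring can be seen directly with a standalone copy of the one-line implementation. The inputs below are illustrative assumptions.

```python
import re


def remove_comments(src):
    """Copy of the function above: strip everything from '#' to end of line."""
    return re.sub('#.*', '', src)


print(repr(remove_comments('x = 1  # set x')))  # 'x = 1  '
print(repr(remove_comments("s = 'a#b'")))       # "s = 'a" (string literal is truncated)
```

The second call shows why this function should not be used on code where `#` may appear inside a string literal; a tokenizer-based approach would be needed for that.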
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Fernando Perez
Created tool for handling interactive input blocks....
r2628
def get_input_encoding():
Fernando Perez
Add test for missing input encoding. Back to 100% coverage.
r2718 """Return the default standard input encoding.
If sys.stdin has no encoding, 'ascii' is returned."""
epatters
Made blockbreakers' input encoding detection more robust to strange...
r2674 # There are strange environments for which sys.stdin.encoding is None. We
# ensure that a valid encoding is returned.
encoding = getattr(sys.stdin, 'encoding', None)
if encoding is None:
encoding = 'ascii'
return encoding
Fernando Perez
Created tool for handling interactive input blocks....
r2628
#-----------------------------------------------------------------------------
Fernando Perez
Completed first pass of inputsplitter with IPython syntax....
r2780 # Classes and functions for normal Python syntax handling
Fernando Perez
Created tool for handling interactive input blocks....
r2628 #-----------------------------------------------------------------------------
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 class InputSplitter(object):
Thomas Kluyver
Miscellaneous docs fixes
r13597 r"""An object that can accumulate lines of Python source before execution.
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663
Thomas Kluyver
Update docstring for InputSplitter.
r4077 This object is designed to be fed python source line-by-line, using
Thomas Kluyver
Miscellaneous docs fixes
r13597 :meth:`push`. It will return on each push whether the currently pushed
code could be executed already. In addition, it provides a method called
:meth:`push_accepts_more` that can be used to query whether more input
can be pushed into a single interactive block.
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663
This is a simple example of how an interactive terminal-based client can use
this tool::
isp = InputSplitter()
while isp.push_accepts_more():
indent = ' '*isp.indent_spaces
prompt = '>>> ' + indent
line = indent + input(prompt)
isp.push(line)
print('Input source was:\n', isp.source_reset(), end='')
"""
Thomas Kluyver
Clarify comment per @willingc's suggestion
r24050 # A cache for storing the current indentation
# The first value stores the most recently processed source input
# The second value is the number of spaces for the current indentation
# If self.source matches the first value, the second value is a valid
# current indentation. Otherwise, the cache is invalid and the indentation
# must be recalculated.
Thomas Kluyver
Avoid unnecessary calculation of indent on every line
r24048 _indent_spaces_cache = None, None
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 # String, indicating the default input encoding. It is computed by default
# at initialization time via get_input_encoding(), but it can be reset by a
# client with specific knowledge of the encoding.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 encoding = ''
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 # String where the current full source input is stored, properly encoded.
# Reading this attribute is the normal way of querying the currently pushed
# source code, that has been properly encoded.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 source = ''
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 # Code object corresponding to the current source. It is automatically
# synced to the source, so it can be queried at any time to obtain the code
# object; it will be None if the source doesn't compile to valid Python.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 code = None
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 # Private attributes
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 # List with lines of input accumulated so far
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 _buffer = None
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 # Command compiler
_compile = None
# Boolean indicating whether the current block is complete
_is_complete = None
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 # Boolean indicating whether the current block has an unrecoverable syntax error
_is_invalid = False
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 def __init__(self):
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 """Create a new InputSplitter instance.
Fernando Perez
Add support for append/replace mode after discussion with Evan....
r2634 """
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 self._buffer = []
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 self._compile = codeop.CommandCompiler()
Fernando Perez
Created tool for handling interactive input blocks....
r2628 self.encoding = get_input_encoding()
def reset(self):
"""Reset the input buffer and associated state."""
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 self._buffer[:] = []
Fernando Perez
Created tool for handling interactive input blocks....
r2628 self.source = ''
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 self.code = None
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 self._is_complete = False
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 self._is_invalid = False
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Fernando Perez
Change get_source() api as per code review, to source_reset()....
r2636 def source_reset(self):
"""Return the input source and perform a full reset.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 """
out = self.source
Fernando Perez
Change get_source() api as per code review, to source_reset()....
r2636 self.reset()
Fernando Perez
Created tool for handling interactive input blocks....
r2628 return out
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 def check_complete(self, source):
Thomas Kluyver
Expose IPython machinery for testing code completeness
r17624 """Return whether a block of code is ready to execute, or should be continued
Matthias Bussonnier
reformat all of core
r27290
Thomas Kluyver
Expose IPython machinery for testing code completeness
r17624 This is a non-stateful API, and will reset the state of this InputSplitter.
Matthias Bussonnier
reformat all of core
r27290
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 Parameters
----------
source : string
Matthias Bussonnier
reformat all of core
r27290 Python input code, which can be multiline.
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 Returns
-------
status : str
Matthias Bussonnier
reformat all of core
r27290 One of 'complete', 'incomplete', or 'invalid' if source is not a
prefix of valid code.
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 indent_spaces : int or None
Matthias Bussonnier
reformat all of core
r27290 The number of spaces by which to indent the next line of code. If
status is not 'incomplete', this is None.
Thomas Kluyver
Expose IPython machinery for testing code completeness
r17624 """
self.reset()
try:
self.push(source)
except SyntaxError:
# Transformers in IPythonInputSplitter can raise SyntaxError,
# which push() will not catch.
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 return 'invalid', None
else:
if self._is_invalid:
return 'invalid', None
elif self.push_accepts_more():
Thomas Kluyver
Avoid unnecessary calculation of indent on every line
r24048 return 'incomplete', self.get_indent_spaces()
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 else:
return 'complete', None
Thomas Kluyver
Expose IPython machinery for testing code completeness
r17624 finally:
self.reset()
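The three status strings map closely onto what `codeop.CommandCompiler` reports, which is what `push()` relies on. The sketch below is a simplified standalone approximation, not the method above: the helper name `classify` is hypothetical, and it deliberately omits the extra `push_accepts_more()` check and the indentation hint.

```python
import codeop


def classify(source):
    """Hypothetical helper: return 'complete', 'incomplete' or 'invalid'.

    CommandCompiler returns a code object for complete input, None for
    input that could still be continued, and raises for invalid input.
    """
    compiler = codeop.CommandCompiler()
    try:
        code = compiler(source, symbol='exec')
    except (SyntaxError, OverflowError, ValueError):
        return 'invalid'
    return 'incomplete' if code is None else 'complete'


print(classify('a = 1\n'))     # complete
print(classify('if True:\n'))  # incomplete
print(classify('a = = 1\n'))   # invalid
```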
Matthias Bussonnier
remove cast_unicode and add some typings
r25343 def push(self, lines:str) -> bool:
Robert Kern
Fix bug where bare strings would be silently ignored in input....
r3293 """Push one or more lines of input.
Fernando Perez
Created tool for handling interactive input blocks....
r2628
This stores the given lines and returns a status code indicating
whether the code forms a complete Python block or not.
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 Any exceptions generated in compilation are swallowed, but if an
exception was produced, the method returns True.
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Parameters
----------
lines : string
Matthias Bussonnier
reformat all of core
r27290 One or more lines of Python input.
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Fernando Perez
Created tool for handling interactive input blocks....
r2628 Returns
-------
is_complete : boolean
Matthias Bussonnier
reformat all of core
r27290 True if the current input source (the result of the current input
plus prior inputs) forms a complete Python execution block. Note that
this value is also stored as a private attribute (``_is_complete``), so it
can be queried at any time.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 """
Matthias Bussonnier
remove cast_unicode and add some typings
r25343 assert isinstance(lines, str)
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 self._store(lines)
Fernando Perez
Created tool for handling interactive input blocks....
r2628 source = self.source
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 # Before calling _compile(), reset the code object to None so that if an
Fernando Perez
Created tool for handling interactive input blocks....
r2628 # exception is raised in compilation, we don't mislead by having
# inconsistent code/source attributes.
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 self.code, self._is_complete = None, None
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 self._is_invalid = False
Fernando Perez
Completed full block splitting for block-based frontends.
r2645
Fernando Perez
Fix bug with lines ending in continuation markers (\)....
r3013 # Honor termination lines properly
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823 if source.endswith('\\\n'):
Fernando Perez
Fix bug with lines ending in continuation markers (\)....
r3013 return False
Fernando Perez
push() now swallows syntax errors and immediately produces a 'ready'...
r2635 try:
Min RK
SyntaxWarning in compile indicates invalid input
r18850 with warnings.catch_warnings():
warnings.simplefilter('error', SyntaxWarning)
self.code = self._compile(source, symbol="exec")
Fernando Perez
push() now swallows syntax errors and immediately produces a 'ready'...
r2635 # Invalid syntax can produce any of a number of different errors from
# inside the compiler, so we have to catch them all. Syntax errors
# immediately produce a 'ready' block, so the invalid Python can be
# sent to the kernel for evaluation with possible ipython
# special-syntax conversion.
Fernando Perez
Completed full block splitting for block-based frontends.
r2645 except (SyntaxError, OverflowError, ValueError, TypeError,
Min RK
SyntaxWarning in compile indicates invalid input
r18850 MemoryError, SyntaxWarning):
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 self._is_complete = True
Thomas Kluyver
Four possible states for completion reply, & indent hint
r17804 self._is_invalid = True
Fernando Perez
push() now swallows syntax errors and immediately produces a 'ready'...
r2635 else:
# Compilation didn't produce any exceptions (though it may not have
# given a complete code object)
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 self._is_complete = self.code is not None
Fernando Perez
push() now swallows syntax errors and immediately produces a 'ready'...
r2635
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 return self._is_complete
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 def push_accepts_more(self):
"""Return whether a block of interactive input can accept more input.
Fernando Perez
Created tool for handling interactive input blocks....
r2628
This method is meant to be used by line-oriented frontends, which need to
guess whether a block is complete or not based solely on prior and
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 current input lines. The InputSplitter considers it has a complete
Thomas Kluyver
Fix test failure in IPython.lib
r10255 interactive block and will not accept more input when either:
Matthias Bussonnier
reformat all of core
r27290
Thomas Kluyver
Fix test failure in IPython.lib
r10255 * A SyntaxError is raised
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Thomas Kluyver
Fix test failure in IPython.lib
r10255 * The code is complete and consists of a single line or a single
non-compound statement
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Thomas Kluyver
Fix test failure in IPython.lib
r10255 * The code is complete and has a blank line at the end
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 If the current input produces a syntax error, this method immediately
returns False but does *not* raise the syntax error exception, as
typically clients will want to send invalid syntax to an execution
backend which might convert the invalid syntax into valid Python via
one of the dynamic IPython mechanisms.
Fernando Perez
Created tool for handling interactive input blocks....
r2628 """
Fernando Perez
Implement support for 'cell' mode with Ctrl-Enter....
r3004
# With incomplete input, unconditionally accept more
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 # A syntax error also sets _is_complete to True - see push()
Fernando Perez
Renamed to inputsplitter, added more tests and examples....
r2663 if not self._is_complete:
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 #print("Not complete") # debug
Fernando Perez
Created tool for handling interactive input blocks....
r2628 return True
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251
# The user can make any (complete) input execute by leaving a blank line
last_line = self.source.splitlines()[-1]
if (not last_line) or last_line.isspace():
#print("Blank line") # debug
return False
Thomas Kluyver
Fix test failure in IPython.lib
r10255 # If there's just a single line or AST node, and we're flush left, as is
# the case after a simple statement such as 'a=1', we want to execute it
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 # straight away.
Thomas Kluyver
Avoid unnecessary calculation of indent on every line
r24048 if self.get_indent_spaces() == 0:
Thomas Kluyver
Fix test failure in IPython.lib
r10255 if len(self.source.splitlines()) <= 1:
return False
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 try:
code_ast = ast.parse(u''.join(self._buffer))
except Exception:
#print("Can't parse AST") # debug
return False
Fernando Perez
Implement support for 'cell' mode with Ctrl-Enter....
r3004 else:
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 if len(code_ast.body) == 1 and \
not hasattr(code_ast.body[0], 'body'):
#print("Simple statement") # debug
Fernando Perez
Implement support for 'cell' mode with Ctrl-Enter....
r3004 return False
Thomas Kluyver
Simplify InputSplitter by stripping out input_mode distinction
r10251 # General fallback - accept more code
return True
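The "single AST node" test in `push_accepts_more` can be tried in isolation. The following standalone sketch uses a hypothetical helper, `is_single_simple_statement`, that applies the same two conditions; unlike the method above, it simply returns False when the source cannot be parsed.

```python
import ast


def is_single_simple_statement(source):
    """Hypothetical helper mirroring the AST check in push_accepts_more.

    True when the source parses to exactly one top-level node that has no
    'body' attribute, i.e. it is not a compound statement such as
    if/for/while/def/class.
    """
    try:
        tree = ast.parse(source)
    except Exception:
        # Assumption for this sketch: unparsable input is simply "not simple".
        return False
    return len(tree.body) == 1 and not hasattr(tree.body[0], 'body')


print(is_single_simple_statement('a = 1\n'))                        # True
print(is_single_simple_statement('for i in range(3):\n    pass\n'))  # False
print(is_single_simple_statement('a = 1\nb = 2\n'))                  # False
```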
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Thomas Kluyver
Avoid unnecessary calculation of indent on every line
r24048 def get_indent_spaces(self):
sourcefor, n = self._indent_spaces_cache
if sourcefor == self.source:
return n
Thomas Kluyver
Calculate indentation based on tokens, not regexes...
r23331 # self.source always has a trailing newline
Thomas Kluyver
Avoid unnecessary calculation of indent on every line
r24048 n = find_next_indent(self.source[:-1])
self._indent_spaces_cache = (self.source, n)
return n
Fernando Perez
Created tool for handling interactive input blocks....
r2628
Thomas Kluyver
Leave inputsplitter.indent_spaces for backwards compatibility
r24051 # Backwards compatibility. I think all code that used .indent_spaces was
# inside IPython, but we can leave this here until IPython 7 in case any
# other modules are using it. -TK, November 2017
indent_spaces = property(get_indent_spaces)
Fernando Perez
Add support for accessing raw data to inputsplitter....
r3080 def _store(self, lines, buffer=None, store='source'):
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 """Store one or more lines of input.
If input lines are not newline-terminated, a newline is automatically
appended."""
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Fernando Perez
Add support for accessing raw data to inputsplitter....
r3080 if buffer is None:
buffer = self._buffer
Aaron Meurer
Line continuations now terminate after one blank line (#2108)...
r7823
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 if lines.endswith('\n'):
Fernando Perez
Add support for accessing raw data to inputsplitter....
r3080 buffer.append(lines)
Fernando Perez
Split blockbreaker tests into a separate file and clean up api....
r2633 else:
Fernando Perez
Add support for accessing raw data to inputsplitter....
r3080 buffer.append(lines+'\n')
setattr(self, store, self._set_source(buffer))
Fernando Perez
Completed full block splitting for block-based frontends.
r2645
Fernando Perez
Add support for accessing raw data to inputsplitter....
r3080 def _set_source(self, buffer):
Thomas Kluyver
Further fixes and tweaks for inputsplitter.
r3455 return u''.join(buffer)
Fernando Perez
First pass of input syntax transformation support
r2719
class IPythonInputSplitter(InputSplitter):
    """An input splitter that recognizes all of IPython's special syntax."""

    # String with raw, untransformed input.
    source_raw = ''

    # Flag to track when a transformer has stored input that it hasn't given
    # back yet.
    transformer_accumulating = False

    # Flag to track when assemble_python_lines has stored input that it hasn't
    # given back yet.
    within_python_line = False

    # Private attributes

    # List with lines of raw input accumulated so far.
    _buffer_raw = None

    def __init__(self, line_input_checker=True, physical_line_transforms=None,
                 logical_line_transforms=None, python_line_transforms=None):
        super(IPythonInputSplitter, self).__init__()
        self._buffer_raw = []
        self._validate = True

        if physical_line_transforms is not None:
            self.physical_line_transforms = physical_line_transforms
        else:
            self.physical_line_transforms = [
                leading_indent(),
                classic_prompt(),
                ipy_prompt(),
                cellmagic(end_on_blank_line=line_input_checker),
            ]

        self.assemble_logical_lines = assemble_logical_lines()
        if logical_line_transforms is not None:
            self.logical_line_transforms = logical_line_transforms
        else:
            self.logical_line_transforms = [
                help_end(),
                escaped_commands(),
                assign_from_magic(),
                assign_from_system(),
            ]

        self.assemble_python_lines = assemble_python_lines()
        if python_line_transforms is not None:
            self.python_line_transforms = python_line_transforms
        else:
            # We don't use any of these at present
            self.python_line_transforms = []

    @property
    def transforms(self):
        "Quick access to all transformers."
        return self.physical_line_transforms + \
            [self.assemble_logical_lines] + self.logical_line_transforms + \
            [self.assemble_python_lines] + self.python_line_transforms

    @property
    def transforms_in_use(self):
        """Transformers, excluding logical line transformers if we're in a
        Python line."""
        t = self.physical_line_transforms[:]
        if not self.within_python_line:
            t += [self.assemble_logical_lines] + self.logical_line_transforms
        return t + [self.assemble_python_lines] + self.python_line_transforms

    def reset(self):
        """Reset the input buffer and associated state."""
        super(IPythonInputSplitter, self).reset()
        self._buffer_raw[:] = []
        self.source_raw = ''
        self.transformer_accumulating = False
        self.within_python_line = False

        for t in self.transforms:
            try:
                t.reset()
            except SyntaxError:
                # Nothing that calls reset() expects to handle transformer
                # errors
                pass

    def flush_transformers(self):
        def _flush(transform, outs):
            """yield transformed lines

            always strings, never None

            transform: the current transform
            outs: an iterable of previously transformed inputs.
                 Each may be multiline, which will be passed
                 one line at a time to transform.
            """
            for out in outs:
                for line in out.splitlines():
                    # push one line at a time
                    tmp = transform.push(line)
                    if tmp is not None:
                        yield tmp

            # reset the transform
            tmp = transform.reset()
            if tmp is not None:
                yield tmp

        out = []
        for t in self.transforms_in_use:
            out = _flush(t, out)

        out = list(out)
        if out:
            self._store('\n'.join(out))

    def raw_reset(self):
        """Return raw input only and perform a full reset.
        """
        out = self.source_raw
        self.reset()
        return out

    def source_reset(self):
        try:
            self.flush_transformers()
            return self.source
        finally:
            self.reset()

    def push_accepts_more(self):
        if self.transformer_accumulating:
            return True
        else:
            return super(IPythonInputSplitter, self).push_accepts_more()

    def transform_cell(self, cell):
        """Process and translate a cell of input.
        """
        self.reset()
        try:
            self.push(cell)
            self.flush_transformers()
            return self.source
        finally:
            self.reset()

    def push(self, lines:str) -> bool:
        """Push one or more lines of IPython input.

        This stores the given lines and returns a status code indicating
        whether the code forms a complete Python block or not, after processing
        all input lines for special IPython syntax.

        Any exceptions generated in compilation are swallowed, but if an
        exception was produced, the method returns True.

        Parameters
        ----------
        lines : string
            One or more lines of Python input.

        Returns
        -------
        is_complete : boolean
            True if the current input source (the result of the current input
            plus prior inputs) forms a complete Python execution block. Note that
            this value is also stored as a private attribute (_is_complete), so it
            can be queried at any time.
        """
        assert isinstance(lines, str)
        # We must ensure all input is pure unicode
        # ''.splitlines() --> [], but we need to push the empty line to transformers
        lines_list = lines.splitlines()
        if not lines_list:
            lines_list = ['']

        # Store raw source before applying any transformations to it. Note
        # that this must be done *after* the reset() call that would otherwise
        # flush the buffer.
        self._store(lines, self._buffer_raw, 'source_raw')

        transformed_lines_list = []
        for line in lines_list:
            transformed = self._transform_line(line)
            if transformed is not None:
                transformed_lines_list.append(transformed)

        if transformed_lines_list:
            transformed_lines = '\n'.join(transformed_lines_list)
            return super(IPythonInputSplitter, self).push(transformed_lines)
        else:
            # Got nothing back from transformers - they must be waiting for
            # more input.
            return False

    def _transform_line(self, line):
        """Push a line of input code through the various transformers.

        Returns any output from the transformers, or None if a transformer
        is accumulating lines.

        Sets self.transformer_accumulating as a side effect.
        """
        def _accumulating(dbg):
            #print(dbg)
            self.transformer_accumulating = True
            return None

        for transformer in self.physical_line_transforms:
            line = transformer.push(line)
            if line is None:
                return _accumulating(transformer)

        if not self.within_python_line:
            line = self.assemble_logical_lines.push(line)
            if line is None:
                return _accumulating('acc logical line')

            for transformer in self.logical_line_transforms:
                line = transformer.push(line)
                if line is None:
                    return _accumulating(transformer)

        line = self.assemble_python_lines.push(line)
        if line is None:
            self.within_python_line = True
            return _accumulating('acc python line')
        else:
            self.within_python_line = False

        for transformer in self.python_line_transforms:
            line = transformer.push(line)
            if line is None:
                return _accumulating(transformer)

        #print("transformers clear") #debug
        self.transformer_accumulating = False
        return line
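

# Illustrative sketch only (not part of IPython): the loop in _transform_line
# relies on each transformer's push() returning None while it is still
# accumulating input, and on reset() handing back anything left over. The
# JoinBackslash class and run_pipeline helper below are hypothetical names
# that mimic that contract with a toy backslash-continuation transformer;
# they assume nothing about the real transformers beyond push() and reset().

```python
class JoinBackslash:
    """Toy transformer (hypothetical): joins lines that end in a backslash."""

    def __init__(self):
        self._parts = []

    def push(self, line):
        # Return None while accumulating, otherwise the assembled line.
        if line.endswith("\\"):
            self._parts.append(line[:-1])
            return None
        self._parts.append(line)
        return self.reset()

    def reset(self):
        # Hand back whatever is stored (or None) and clear the state.
        if not self._parts:
            return None
        out = "".join(self._parts)
        self._parts = []
        return out


def run_pipeline(transformers, line):
    """Push one line through each transformer in order.

    A None result means some transformer is still waiting for more input,
    which is the case _transform_line reports via transformer_accumulating.
    """
    for t in transformers:
        line = t.push(line)
        if line is None:
            return None
    return line


pipeline = [JoinBackslash()]
first = run_pipeline(pipeline, "x = 1 + \\")   # continuation: nothing emitted yet
second = run_pipeline(pipeline, "2")           # completes the logical line
```

# Here `first` is None because the toy transformer is still accumulating, and
# `second` is the joined line; the real splitter would only hand the joined
# text on to the base class push() at that point.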