upstream/mercurial-mirror Files · mercurial/dagparser.py

xdiff: add a preprocessing step that trims files...

xdiff: add a preprocessing step that trims files xdiff has a `xdl_trim_ends` step that removes common lines, unmatchable lines. That is in theory good, but happens too late - after splitting, hashing, and adjusting the hash values so they are unique. Those splitting, hashing and adjusting hash values steps could have noticeable overhead. Diffing two large files with minor (one-line-ish) changes are not uncommon. In that case, the raw performance of those preparation steps seriously matter. Even allocating an O(N) array and storing line offsets to it is expensive. Therefore my previous attempts [1] [2] cannot be good enough since they do not remove the O(N) array assignment. This patch adds a preprocessing step - `xdl_trim_files` that runs before other preprocessing steps. It counts common prefix and suffix and lines in them (needed for displaying line number), without doing anything else. Testing with a crafted large (169MB) file, with minor change: ``` open('a','w').write(''.join('%s\n' % (i % 100000) for i in xrange(30000000) if i != 6000000)) open('b','w').write(''.join('%s\n' % (i % 100000) for i in xrange(30000000) if i != 6003000)) ``` Running xdiff by a simple binary [3], this patch improves the xdiff perf by more than 10x for the above case: ``` # xdiff before this patch 2.41s user 1.13s system 98% cpu 3.592 total # xdiff after this patch 0.14s user 0.16s system 98% cpu 0.309 total # gnu diffutils 0.12s user 0.15s system 98% cpu 0.272 total # (best of 20 runs) ``` It's still slightly slower than GNU diffutils. But it's pretty close now. Testing with real repo data: For the whole repo, this patch makes xdiff 25% faster: ``` # hg perfbdiff --count 100 --alldata -c --blocks [--xdiff] # xdiff, after ! wall 0.058861 comb 0.050000 user 0.050000 sys 0.000000 (best of 100) # xdiff, before ! wall 0.077816 comb 0.080000 user 0.080000 sys 0.000000 (best of 91) # bdiff ! wall 0.117473 comb 0.120000 user 0.120000 sys 0.000000 (best of 67) ``` For files that are long (ex. commands.py), the speedup is more than 3x, very significant: ``` # hg perfbdiff --count 3000 --blocks commands.py.i 1 [--xdiff] # xdiff, after ! wall 0.690583 comb 0.690000 user 0.690000 sys 0.000000 (best of 12) # xdiff, before ! wall 2.240361 comb 2.210000 user 2.210000 sys 0.000000 (best of 4) # bdiff ! wall 2.469852 comb 2.440000 user 2.440000 sys 0.000000 (best of 4) ``` [1]: https://phab.mercurial-scm.org/D2631 [2]: https://phab.mercurial-scm.org/D2634 [3]: ``` // Code to run xdiff from command line. No proper error handling. #include <stdlib.h> #include <unistd.h> #include <sys/types.h> #include <sys/stat.h> #include <fcntl.h> #include "mercurial/thirdparty/xdiff/xdiff.h" #define ensure(x) if (!(x)) exit(255); mmfile_t readfile(const char *path) { struct stat st; int fd = open(path, O_RDONLY); fstat(fd, &st); mmfile_t file = { malloc(st.st_size), st.st_size }; ensure(read(fd, file.ptr, st.st_size) == st.st_size); close(fd); return file; } int main(int argc, char const *argv[]) { mmfile_t a = readfile(argv[1]), b = readfile(argv[2]); xpparam_t xpp = {0}; xdemitconf_t xecfg = {0}; xdemitcb_t ecb = {0}; xdl_diff(&a, &b, &xpp, &xecfg, &ecb); return 0; } ``` Differential Revision: https://phab.mercurial-scm.org/D2686

Yuya Nishihara - - Load All Authors

File last commit:

r34209:dfd009e5 default


                r36838:f33a87cf

default

Download file

             dagparser.py
        
                    488 lines
            
             | 14.7 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / mercurial / dagparser.py
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      # dagparser.py - parser and generator for concise description of DAGs

      #

      # Copyright 2010 Peter Arrenbrecht <peter@arrenbrecht.ch>

      #

      # This software may be used and distributed according to the terms of the

      # GNU General Public License version 2 or any later version.

      from __future__ import absolute_import

      import re

      import string

      from .i18n import _

      from . import (

          error,

          pycompat,

          util,

      )

      def parsedag(desc):

          '''parses a DAG from a concise textual description; generates events

          "+n" is a linear run of n nodes based on the current default parent

          "." is a single node based on the current default parent

          "$" resets the default parent to -1 (implied at the start);

              otherwise the default parent is always the last node created

          "<p" sets the default parent to the backref p

          "*p" is a fork at parent p, where p is a backref

          "*p1/p2/.../pn" is a merge of parents p1..pn, where the pi are backrefs

          "/p2/.../pn" is a merge of the preceding node and p2..pn

          ":name" defines a label for the preceding node; labels can be redefined

          "@text" emits an annotation event for text

          "!command" emits an action event for the current node

          "!!my command\n" is like "!", but to the end of the line

          "#...\n" is a comment up to the end of the line

          Whitespace between the above elements is ignored.

          A backref is either

           * a number n, which references the node curr-n, where curr is the current

             node, or

           * the name of a label you placed earlier using ":name", or

           * empty to denote the default parent.

          All string valued-elements are either strictly alphanumeric, or must

          be enclosed in double quotes ("..."), with "\" as escape character.

          Generates sequence of

            ('n', (id, [parentids])) for node creation

            ('l', (id, labelname)) for labels on nodes

            ('a', text) for annotations

            ('c', command) for actions (!)

            ('C', command) for line actions (!!)

          Examples

          --------

          Example of a complex graph (output not shown for brevity):

              >>> len(list(parsedag(b"""

              ...

              ... +3         # 3 nodes in linear run

              ... :forkhere  # a label for the last of the 3 nodes from above

              ... +5         # 5 more nodes on one branch

              ... :mergethis # label again

              ... <forkhere  # set default parent to labeled fork node

              ... +10        # 10 more nodes on a parallel branch

              ... @stable    # following nodes will be annotated as "stable"

              ... +5         # 5 nodes in stable

              ... !addfile   # custom command; could trigger new file in next node

              ... +2         # two more nodes

              ... /mergethis # merge last node with labeled node

              ... +4         # 4 more nodes descending from merge node

              ...

              ... """)))

              34

          Empty list:

              >>> list(parsedag(b""))

              []

          A simple linear run:

              >>> list(parsedag(b"+3"))

              [('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

          Some non-standard ways to define such runs:

              >>> list(parsedag(b"+1+2"))

              [('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

              >>> list(parsedag(b"+1*1*"))

              [('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

              >>> list(parsedag(b"*"))

              [('n', (0, [-1]))]

              >>> list(parsedag(b"..."))

              [('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

          A fork and a join, using numeric back references:

              >>> list(parsedag(b"+2*2*/2"))

              [('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [0])), ('n', (3, [2, 1]))]

              >>> list(parsedag(b"+2<2+1/2"))

              [('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [0])), ('n', (3, [2, 1]))]

          Placing a label:

              >>> list(parsedag(b"+1 :mylabel +1"))

              [('n', (0, [-1])), ('l', (0, 'mylabel')), ('n', (1, [0]))]

          An empty label (silly, really):

              >>> list(parsedag(b"+1:+1"))

              [('n', (0, [-1])), ('l', (0, '')), ('n', (1, [0]))]

          Fork and join, but with labels instead of numeric back references:

              >>> list(parsedag(b"+1:f +1:p2 *f */p2"))

              [('n', (0, [-1])), ('l', (0, 'f')), ('n', (1, [0])), ('l', (1, 'p2')),

               ('n', (2, [0])), ('n', (3, [2, 1]))]

              >>> list(parsedag(b"+1:f +1:p2 <f +1 /p2"))

              [('n', (0, [-1])), ('l', (0, 'f')), ('n', (1, [0])), ('l', (1, 'p2')),

               ('n', (2, [0])), ('n', (3, [2, 1]))]

          Restarting from the root:

              >>> list(parsedag(b"+1 $ +1"))

              [('n', (0, [-1])), ('n', (1, [-1]))]

          Annotations, which are meant to introduce sticky state for subsequent nodes:

              >>> list(parsedag(b"+1 @ann +1"))

              [('n', (0, [-1])), ('a', 'ann'), ('n', (1, [0]))]

              >>> list(parsedag(b'+1 @"my annotation" +1'))

              [('n', (0, [-1])), ('a', 'my annotation'), ('n', (1, [0]))]

          Commands, which are meant to operate on the most recently created node:

              >>> list(parsedag(b"+1 !cmd +1"))

              [('n', (0, [-1])), ('c', 'cmd'), ('n', (1, [0]))]

              >>> list(parsedag(b'+1 !"my command" +1'))

              [('n', (0, [-1])), ('c', 'my command'), ('n', (1, [0]))]

              >>> list(parsedag(b'+1 !!my command line\\n +1'))

              [('n', (0, [-1])), ('C', 'my command line'), ('n', (1, [0]))]

          Comments, which extend to the end of the line:

              >>> list(parsedag(b'+1 # comment\\n+1'))

              [('n', (0, [-1])), ('n', (1, [0]))]

          Error:

              >>> try: list(parsedag(b'+1 bad'))

              ... except Exception as e: print(pycompat.sysstr(bytes(e)))

              invalid character in dag description: bad...

          '''

          if not desc:

              return

          wordchars = pycompat.bytestr(string.ascii_letters + string.digits)

          labels = {}

          p1 = -1

          r = 0

          def resolve(ref):

              if not ref:

                  return p1

              elif ref[0] in pycompat.bytestr(string.digits):

                  return r - int(ref)

              else:

                  return labels[ref]

          chiter = pycompat.iterbytestr(desc)

          def nextch():

              return next(chiter, '\0')

          def nextrun(c, allow):

              s = ''

              while c in allow:

                  s += c

                  c = nextch()

              return c, s

          def nextdelimited(c, limit, escape):

              s = ''

              while c != limit:

                  if c == escape:

                      c = nextch()

                  s += c

                  c = nextch()

              return nextch(), s

          def nextstring(c):

              if c == '"':

                  return nextdelimited(nextch(), '"', '\\')

              else:

                  return nextrun(c, wordchars)

          c = nextch()

          while c != '\0':

              while c in pycompat.bytestr(string.whitespace):

                  c = nextch()

              if c == '.':

                  yield 'n', (r, [p1])

                  p1 = r

                  r += 1

                  c = nextch()

              elif c == '+':

                  c, digs = nextrun(nextch(), pycompat.bytestr(string.digits))

                  n = int(digs)

                  for i in xrange(0, n):

                      yield 'n', (r, [p1])

                      p1 = r

                      r += 1

              elif c in '*/':

                  if c == '*':

                      c = nextch()

                  c, pref = nextstring(c)

                  prefs = [pref]

                  while c == '/':

                      c, pref = nextstring(nextch())

                      prefs.append(pref)

                  ps = [resolve(ref) for ref in prefs]

                  yield 'n', (r, ps)

                  p1 = r

                  r += 1

              elif c == '<':

                  c, ref = nextstring(nextch())

                  p1 = resolve(ref)

              elif c == ':':

                  c, name = nextstring(nextch())

                  labels[name] = p1

                  yield 'l', (p1, name)

              elif c == '@':

                  c, text = nextstring(nextch())

                  yield 'a', text

              elif c == '!':

                  c = nextch()

                  if c == '!':

                      cmd = ''

                      c = nextch()

                      while c not in '\n\r\0':

                          cmd += c

                          c = nextch()

                      yield 'C', cmd

                  else:

                      c, cmd = nextstring(c)

                      yield 'c', cmd

              elif c == '#':

                  while c not in '\n\r\0':

                      c = nextch()

              elif c == '$':

                  p1 = -1

                  c = nextch()

              elif c == '\0':

                  return # in case it was preceded by whitespace

              else:

                  s = ''

                  i = 0

                  while c != '\0' and i < 10:

                      s += c

                      i += 1

                      c = nextch()

                  raise error.Abort(_('invalid character in dag description: '

                                     '%s...') % s)

      def dagtextlines(events,

                       addspaces=True,

                       wraplabels=False,

                       wrapannotations=False,

                       wrapcommands=False,

                       wrapnonlinear=False,

                       usedots=False,

                       maxlinewidth=70):

          '''generates single lines for dagtext()'''

          def wrapstring(text):

              if re.match("^[0-9a-z]*$", text):

                  return text

              return '"' + text.replace('\\', '\\\\').replace('"', '\"') + '"'

          def gen():

              labels = {}

              run = 0

              wantr = 0

              needroot = False

              for kind, data in events:

                  if kind == 'n':

                      r, ps = data

                      # sanity check

                      if r != wantr:

                          raise error.Abort(_("expected id %i, got %i") % (wantr, r))

                      if not ps:

                          ps = [-1]

                      else:

                          for p in ps:

                              if p >= r:

                                  raise error.Abort(_("parent id %i is larger than "

                                                     "current id %i") % (p, r))

                      wantr += 1

                      # new root?

                      p1 = r - 1

                      if len(ps) == 1 and ps[0] == -1:

                          if needroot:

                              if run:

                                  yield '+%d' % run

                                  run = 0

                              if wrapnonlinear:

                                  yield '\n'

                              yield '$'

                              p1 = -1

                          else:

                              needroot = True

                      if len(ps) == 1 and ps[0] == p1:

                          if usedots:

                              yield "."

                          else:

                              run += 1

                      else:

                          if run:

                              yield '+%d' % run

                              run = 0

                          if wrapnonlinear:

                              yield '\n'

                          prefs = []

                          for p in ps:

                              if p == p1:

                                  prefs.append('')

                              elif p in labels:

                                  prefs.append(labels[p])

                              else:

                                  prefs.append('%d' % (r - p))

                          yield '*' + '/'.join(prefs)

                  else:

                      if run:

                          yield '+%d' % run

                          run = 0

                      if kind == 'l':

                          rid, name = data

                          labels[rid] = name

                          yield ':' + name

                          if wraplabels:

                              yield '\n'

                      elif kind == 'c':

                          yield '!' + wrapstring(data)

                          if wrapcommands:

                              yield '\n'

                      elif kind == 'C':

                          yield '!!' + data

                          yield '\n'

                      elif kind == 'a':

                          if wrapannotations:

                              yield '\n'

                          yield '@' + wrapstring(data)

                      elif kind == '#':

                          yield '#' + data

                          yield '\n'

                      else:

                          raise error.Abort(_("invalid event type in dag: "

                                              "('%s', '%s')")

                                            % (util.escapestr(kind),

                                               util.escapestr(data)))

              if run:

                  yield '+%d' % run

          line = ''

          for part in gen():

              if part == '\n':

                  if line:

                      yield line

                      line = ''

              else:

                  if len(line) + len(part) >= maxlinewidth:

                      yield line

                      line = ''

                  elif addspaces and line and part != '.':

                      line += ' '

                  line += part

          if line:

              yield line

      def dagtext(dag,

                  addspaces=True,

                  wraplabels=False,

                  wrapannotations=False,

                  wrapcommands=False,

                  wrapnonlinear=False,

                  usedots=False,

                  maxlinewidth=70):

          '''generates lines of a textual representation for a dag event stream

          events should generate what parsedag() does, so:

            ('n', (id, [parentids])) for node creation

            ('l', (id, labelname)) for labels on nodes

            ('a', text) for annotations

            ('c', text) for commands

            ('C', text) for line commands ('!!')

            ('#', text) for comment lines

          Parent nodes must come before child nodes.

          Examples

          --------

          Linear run:

              >>> dagtext([(b'n', (0, [-1])), (b'n', (1, [0]))])

              '+2'

          Two roots:

              >>> dagtext([(b'n', (0, [-1])), (b'n', (1, [-1]))])

              '+1 $ +1'

          Fork and join:

              >>> dagtext([(b'n', (0, [-1])), (b'n', (1, [0])), (b'n', (2, [0])),

              ...          (b'n', (3, [2, 1]))])

              '+2 *2 */2'

          Fork and join with labels:

              >>> dagtext([(b'n', (0, [-1])), (b'l', (0, b'f')), (b'n', (1, [0])),

              ...          (b'l', (1, b'p2')), (b'n', (2, [0])), (b'n', (3, [2, 1]))])

              '+1 :f +1 :p2 *f */p2'

          Annotations:

              >>> dagtext([(b'n', (0, [-1])), (b'a', b'ann'), (b'n', (1, [0]))])

              '+1 @ann +1'

              >>> dagtext([(b'n', (0, [-1])),

              ...          (b'a', b'my annotation'),

              ...          (b'n', (1, [0]))])

              '+1 @"my annotation" +1'

          Commands:

              >>> dagtext([(b'n', (0, [-1])), (b'c', b'cmd'), (b'n', (1, [0]))])

              '+1 !cmd +1'

              >>> dagtext([(b'n', (0, [-1])),

              ...          (b'c', b'my command'),

              ...          (b'n', (1, [0]))])

              '+1 !"my command" +1'

              >>> dagtext([(b'n', (0, [-1])),

              ...          (b'C', b'my command line'),

              ...          (b'n', (1, [0]))])

              '+1 !!my command line\\n+1'

          Comments:

              >>> dagtext([(b'n', (0, [-1])), (b'#', b' comment'), (b'n', (1, [0]))])

              '+1 # comment\\n+1'

              >>> dagtext([])

              ''

          Combining parsedag and dagtext:

              >>> dagtext(parsedag(b'+1 :f +1 :p2 *f */p2'))

              '+1 :f +1 :p2 *f */p2'

          '''

          return "\n".join(dagtextlines(dag,

                                        addspaces,

                                        wraplabels,

                                        wrapannotations,

                                        wrapcommands,

                                        wrapnonlinear,

                                        usedots,

                                        maxlinewidth))

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				# dagparser.py - parser and generator for concise description of DAGs
				#
				# Copyright 2010 Peter Arrenbrecht <peter@arrenbrecht.ch>
				#
				# This software may be used and distributed according to the terms of the
				# GNU General Public License version 2 or any later version.

				from __future__ import absolute_import

				import re
				import string

				from .i18n import _
				from . import (
				error,
				pycompat,
				util,
				)

				def parsedag(desc):
				'''parses a DAG from a concise textual description; generates events

				"+n" is a linear run of n nodes based on the current default parent
				"." is a single node based on the current default parent
				"$" resets the default parent to -1 (implied at the start);
				otherwise the default parent is always the last node created
				"<p" sets the default parent to the backref p
				"*p" is a fork at parent p, where p is a backref
				"*p1/p2/.../pn" is a merge of parents p1..pn, where the pi are backrefs
				"/p2/.../pn" is a merge of the preceding node and p2..pn
				":name" defines a label for the preceding node; labels can be redefined
				"@text" emits an annotation event for text
				"!command" emits an action event for the current node
				"!!my command\n" is like "!", but to the end of the line
				"#...\n" is a comment up to the end of the line

				Whitespace between the above elements is ignored.

				A backref is either
				* a number n, which references the node curr-n, where curr is the current
				node, or
				* the name of a label you placed earlier using ":name", or
				* empty to denote the default parent.

				All string valued-elements are either strictly alphanumeric, or must
				be enclosed in double quotes ("..."), with "\" as escape character.

				Generates sequence of

				('n', (id, [parentids])) for node creation
				('l', (id, labelname)) for labels on nodes
				('a', text) for annotations
				('c', command) for actions (!)
				('C', command) for line actions (!!)

				Examples
				--------

				Example of a complex graph (output not shown for brevity):

				>>> len(list(parsedag(b"""
				...
				... +3 # 3 nodes in linear run
				... :forkhere # a label for the last of the 3 nodes from above
				... +5 # 5 more nodes on one branch
				... :mergethis # label again
				... <forkhere # set default parent to labeled fork node
				... +10 # 10 more nodes on a parallel branch
				... @stable # following nodes will be annotated as "stable"
				... +5 # 5 nodes in stable
				... !addfile # custom command; could trigger new file in next node
				... +2 # two more nodes
				... /mergethis # merge last node with labeled node
				... +4 # 4 more nodes descending from merge node
				...
				... """)))
				34

				Empty list:

				>>> list(parsedag(b""))
				[]

				A simple linear run:

				>>> list(parsedag(b"+3"))
				[('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

				Some non-standard ways to define such runs:

				>>> list(parsedag(b"+1+2"))
				[('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

				>>> list(parsedag(b"+11"))
				[('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

				>>> list(parsedag(b"*"))
				[('n', (0, [-1]))]

				>>> list(parsedag(b"..."))
				[('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [1]))]

				A fork and a join, using numeric back references:

				>>> list(parsedag(b"+22/2"))
				[('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [0])), ('n', (3, [2, 1]))]

				>>> list(parsedag(b"+2<2+1/2"))
				[('n', (0, [-1])), ('n', (1, [0])), ('n', (2, [0])), ('n', (3, [2, 1]))]

				Placing a label:

				>>> list(parsedag(b"+1 :mylabel +1"))
				[('n', (0, [-1])), ('l', (0, 'mylabel')), ('n', (1, [0]))]

				An empty label (silly, really):

				>>> list(parsedag(b"+1:+1"))
				[('n', (0, [-1])), ('l', (0, '')), ('n', (1, [0]))]

				Fork and join, but with labels instead of numeric back references:

				>>> list(parsedag(b"+1:f +1:p2 f /p2"))
				[('n', (0, [-1])), ('l', (0, 'f')), ('n', (1, [0])), ('l', (1, 'p2')),
				('n', (2, [0])), ('n', (3, [2, 1]))]

				>>> list(parsedag(b"+1:f +1:p2 <f +1 /p2"))
				[('n', (0, [-1])), ('l', (0, 'f')), ('n', (1, [0])), ('l', (1, 'p2')),
				('n', (2, [0])), ('n', (3, [2, 1]))]

				Restarting from the root:

				>>> list(parsedag(b"+1 $ +1"))
				[('n', (0, [-1])), ('n', (1, [-1]))]

				Annotations, which are meant to introduce sticky state for subsequent nodes:

				>>> list(parsedag(b"+1 @ann +1"))
				[('n', (0, [-1])), ('a', 'ann'), ('n', (1, [0]))]

				>>> list(parsedag(b'+1 @"my annotation" +1'))
				[('n', (0, [-1])), ('a', 'my annotation'), ('n', (1, [0]))]

				Commands, which are meant to operate on the most recently created node:

				>>> list(parsedag(b"+1 !cmd +1"))
				[('n', (0, [-1])), ('c', 'cmd'), ('n', (1, [0]))]

				>>> list(parsedag(b'+1 !"my command" +1'))
				[('n', (0, [-1])), ('c', 'my command'), ('n', (1, [0]))]

				>>> list(parsedag(b'+1 !!my command line\\n +1'))
				[('n', (0, [-1])), ('C', 'my command line'), ('n', (1, [0]))]

				Comments, which extend to the end of the line:

				>>> list(parsedag(b'+1 # comment\\n+1'))
				[('n', (0, [-1])), ('n', (1, [0]))]

				Error:

				>>> try: list(parsedag(b'+1 bad'))
				... except Exception as e: print(pycompat.sysstr(bytes(e)))
				invalid character in dag description: bad...

				'''
				if not desc:
				return

				wordchars = pycompat.bytestr(string.ascii_letters + string.digits)

				labels = {}
				p1 = -1
				r = 0

				def resolve(ref):
				if not ref:
				return p1
				elif ref[0] in pycompat.bytestr(string.digits):
				return r - int(ref)
				else:
				return labels[ref]

				chiter = pycompat.iterbytestr(desc)

				def nextch():
				return next(chiter, '\0')

				def nextrun(c, allow):
				s = ''
				while c in allow:
				s += c
				c = nextch()
				return c, s

				def nextdelimited(c, limit, escape):
				s = ''
				while c != limit:
				if c == escape:
				c = nextch()
				s += c
				c = nextch()
				return nextch(), s

				def nextstring(c):
				if c == '"':
				return nextdelimited(nextch(), '"', '\\')
				else:
				return nextrun(c, wordchars)

				c = nextch()
				while c != '\0':
				while c in pycompat.bytestr(string.whitespace):
				c = nextch()
				if c == '.':
				yield 'n', (r, [p1])
				p1 = r
				r += 1
				c = nextch()
				elif c == '+':
				c, digs = nextrun(nextch(), pycompat.bytestr(string.digits))
				n = int(digs)
				for i in xrange(0, n):
				yield 'n', (r, [p1])
				p1 = r
				r += 1
				elif c in '*/':
				if c == '*':
				c = nextch()
				c, pref = nextstring(c)
				prefs = [pref]
				while c == '/':
				c, pref = nextstring(nextch())
				prefs.append(pref)
				ps = [resolve(ref) for ref in prefs]
				yield 'n', (r, ps)
				p1 = r
				r += 1
				elif c == '<':
				c, ref = nextstring(nextch())
				p1 = resolve(ref)
				elif c == ':':
				c, name = nextstring(nextch())
				labels[name] = p1
				yield 'l', (p1, name)
				elif c == '@':
				c, text = nextstring(nextch())
				yield 'a', text
				elif c == '!':
				c = nextch()
				if c == '!':
				cmd = ''
				c = nextch()
				while c not in '\n\r\0':
				cmd += c
				c = nextch()
				yield 'C', cmd
				else:
				c, cmd = nextstring(c)
				yield 'c', cmd
				elif c == '#':
				while c not in '\n\r\0':
				c = nextch()
				elif c == '$':
				p1 = -1
				c = nextch()
				elif c == '\0':
				return # in case it was preceded by whitespace
				else:
				s = ''
				i = 0
				while c != '\0' and i < 10:
				s += c
				i += 1
				c = nextch()
				raise error.Abort(_('invalid character in dag description: '
				'%s...') % s)

				def dagtextlines(events,
				addspaces=True,
				wraplabels=False,
				wrapannotations=False,
				wrapcommands=False,
				wrapnonlinear=False,
				usedots=False,
				maxlinewidth=70):
				'''generates single lines for dagtext()'''

				def wrapstring(text):
				if re.match("^[0-9a-z]*$", text):
				return text
				return '"' + text.replace('\\', '\\\\').replace('"', '\"') + '"'

				def gen():
				labels = {}
				run = 0
				wantr = 0
				needroot = False
				for kind, data in events:
				if kind == 'n':
				r, ps = data

				# sanity check
				if r != wantr:
				raise error.Abort(_("expected id %i, got %i") % (wantr, r))
				if not ps:
				ps = [-1]
				else:
				for p in ps:
				if p >= r:
				raise error.Abort(_("parent id %i is larger than "
				"current id %i") % (p, r))
				wantr += 1

				# new root?
				p1 = r - 1
				if len(ps) == 1 and ps[0] == -1:
				if needroot:
				if run:
				yield '+%d' % run
				run = 0
				if wrapnonlinear:
				yield '\n'
				yield '$'
				p1 = -1
				else:
				needroot = True
				if len(ps) == 1 and ps[0] == p1:
				if usedots:
				yield "."
				else:
				run += 1
				else:
				if run:
				yield '+%d' % run
				run = 0
				if wrapnonlinear:
				yield '\n'
				prefs = []
				for p in ps:
				if p == p1:
				prefs.append('')
				elif p in labels:
				prefs.append(labels[p])
				else:
				prefs.append('%d' % (r - p))
				yield '*' + '/'.join(prefs)
				else:
				if run:
				yield '+%d' % run
				run = 0
				if kind == 'l':
				rid, name = data
				labels[rid] = name
				yield ':' + name
				if wraplabels:
				yield '\n'
				elif kind == 'c':
				yield '!' + wrapstring(data)
				if wrapcommands:
				yield '\n'
				elif kind == 'C':
				yield '!!' + data
				yield '\n'
				elif kind == 'a':
				if wrapannotations:
				yield '\n'
				yield '@' + wrapstring(data)
				elif kind == '#':
				yield '#' + data
				yield '\n'
				else:
				raise error.Abort(_("invalid event type in dag: "
				"('%s', '%s')")
				% (util.escapestr(kind),
				util.escapestr(data)))
				if run:
				yield '+%d' % run

				line = ''
				for part in gen():
				if part == '\n':
				if line:
				yield line
				line = ''
				else:
				if len(line) + len(part) >= maxlinewidth:
				yield line
				line = ''
				elif addspaces and line and part != '.':
				line += ' '
				line += part
				if line:
				yield line

				def dagtext(dag,
				addspaces=True,
				wraplabels=False,
				wrapannotations=False,
				wrapcommands=False,
				wrapnonlinear=False,
				usedots=False,
				maxlinewidth=70):
				'''generates lines of a textual representation for a dag event stream

				events should generate what parsedag() does, so:

				('n', (id, [parentids])) for node creation
				('l', (id, labelname)) for labels on nodes
				('a', text) for annotations
				('c', text) for commands
				('C', text) for line commands ('!!')
				('#', text) for comment lines

				Parent nodes must come before child nodes.

				Examples
				--------

				Linear run:

				>>> dagtext([(b'n', (0, [-1])), (b'n', (1, [0]))])
				'+2'

				Two roots:

				>>> dagtext([(b'n', (0, [-1])), (b'n', (1, [-1]))])
				'+1 $ +1'

				Fork and join:

				>>> dagtext([(b'n', (0, [-1])), (b'n', (1, [0])), (b'n', (2, [0])),
				... (b'n', (3, [2, 1]))])
				'+2 2 /2'

				Fork and join with labels:

				>>> dagtext([(b'n', (0, [-1])), (b'l', (0, b'f')), (b'n', (1, [0])),
				... (b'l', (1, b'p2')), (b'n', (2, [0])), (b'n', (3, [2, 1]))])
				'+1 :f +1 :p2 f /p2'

				Annotations:

				>>> dagtext([(b'n', (0, [-1])), (b'a', b'ann'), (b'n', (1, [0]))])
				'+1 @ann +1'

				>>> dagtext([(b'n', (0, [-1])),
				... (b'a', b'my annotation'),
				... (b'n', (1, [0]))])
				'+1 @"my annotation" +1'

				Commands:

				>>> dagtext([(b'n', (0, [-1])), (b'c', b'cmd'), (b'n', (1, [0]))])
				'+1 !cmd +1'

				>>> dagtext([(b'n', (0, [-1])),
				... (b'c', b'my command'),
				... (b'n', (1, [0]))])
				'+1 !"my command" +1'

				>>> dagtext([(b'n', (0, [-1])),
				... (b'C', b'my command line'),
				... (b'n', (1, [0]))])
				'+1 !!my command line\\n+1'

				Comments:

				>>> dagtext([(b'n', (0, [-1])), (b'#', b' comment'), (b'n', (1, [0]))])
				'+1 # comment\\n+1'

				>>> dagtext([])
				''

				Combining parsedag and dagtext:

				>>> dagtext(parsedag(b'+1 :f +1 :p2 f /p2'))
				'+1 :f +1 :p2 f /p2'

				'''
				return "\n".join(dagtextlines(dag,
				addspaces,
				wraplabels,
				wrapannotations,
				wrapcommands,
				wrapnonlinear,
				usedots,
				maxlinewidth))