upstream/mercurial-mirror Files · mercurial/ancestor.py

sqlitestore: support for storing revisions without their parents...

sqlitestore: support for storing revisions without their parents This commit kinda/sorta implements the equivalent of ellipsis nodes for the SQLite storage backend. Without implementing full blown ellipsis nodes (and the necessary support for them in the wire protocol), we instead teach the store to rewrite the p1 and p2 nodes to nullid when the incoming parent isn't in the local store. This allows servers to remain dumb and send the real parent and have the clients deal with the missing parent problem. This obviously isn't ideal because a benefit of ellipsis nodes is we can insert a fake parent to ellide missing changesets. But neither solution is ideal because it drops the original parent from storage. We could probably teach the SQLite store to retain the original parent and handle missing parents at read time. However, parent revisions are stored as integers and it isn't trivial to store an "empty" revision in the store yet, which would be necessary to represent the "missing" parent. The store is somewhat intelligent in trying to remove the missing parents metadata when the revision is re-added. But, revision numbers will be all messed up in that case, so I'm not sure it is worth it. At some point we'll likely want to remove the concept of revision numbers from the database and have the store invent them at index generation time. Or even better, we can do away with revision numbers from the file storage interface completely. We'll get there eventually... Differential Revision: https://phab.mercurial-scm.org/D5168

Georges Racinet - - Load All Authors

File last commit:

r40336:72b94f94 default


                r40428:595641bd

default

Download file

             ancestor.py
        
                    420 lines
            
             | 13.8 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / mercurial / ancestor.py
          
                    History
                
                 |
                  Source
                 | Raw
                 |Copy content
                 |Copy permalink

        Matt Mackall
    
Abstract ancestor algorithm into generic function...

              r3135
            
      # ancestor.py - generic DAG ancestor algorithm for mercurial

      #

      # Copyright 2006 Matt Mackall <mpm@selenic.com>

      #

        Martin Geisler
    
updated license to be explicit about GPL version 2

              r8225
            
      # This software may be used and distributed according to the terms of the

        Matt Mackall
    
Update license to GPLv2+

              r10263
            
      # GNU General Public License version 2 or any later version.

        Matt Mackall
    
Abstract ancestor algorithm into generic function...

              r3135
            
        Gregory Szorc
    
ancestor: use absolute_import...

              r25915
            
      from __future__ import absolute_import

        Augie Fackler
    
cleanup: move stdlib imports to their own import statement...

              r20034
            
      import heapq

        Gregory Szorc
    
ancestor: use absolute_import...

              r25915
            
      from .node import nullrev

        Gregory Szorc
    
global: use pycompat.xrange()...

              r38806
            
      from . import (

        Georges Racinet
    
rust: hooking into Python code...

              r40334
            
          policy,

        Gregory Szorc
    
global: use pycompat.xrange()...

              r38806
            
          pycompat,

      )

        Matt Mackall
    
Abstract ancestor algorithm into generic function...

              r3135
            
        Georges Racinet
    
rust: hooking into Python code...

              r40334
            
      parsers = policy.importmod(r'parsers')

        Mads Kiilerich
    
ancestors: extract candidates function as commonancestorsheads

              r21101
            
      def commonancestorsheads(pfunc, *nodes):

          """Returns a set with the heads of all common ancestors of all nodes,

          heads(::nodes[0] and ::nodes[1] and ...) .

          pfunc must return a list of parent vertices for a given vertex.

          """

          if not isinstance(nodes, set):

              nodes = set(nodes)

          if nullrev in nodes:

              return set()

          if len(nodes) <= 1:

              return nodes

          allseen = (1 << len(nodes)) - 1

          seen = [0] * (max(nodes) + 1)

          for i, n in enumerate(nodes):

              seen[n] = 1 << i

          poison = 1 << (i + 1)

          gca = set()

          interesting = len(nodes)

          nv = len(seen) - 1

          while nv >= 0 and interesting:

              v = nv

              nv -= 1

              if not seen[v]:

                  continue

              sv = seen[v]

              if sv < poison:

                  interesting -= 1

                  if sv == allseen:

                      gca.add(v)

                      sv |= poison

                      if v in nodes:

                          # history is linear

        Martin von Zweigbergk
    
cleanup: use set literals...

              r32291
            
                          return {v}

        Mads Kiilerich
    
ancestors: extract candidates function as commonancestorsheads

              r21101
            
              if sv < poison:

                  for p in pfunc(v):

                      sp = seen[p]

                      if p == nullrev:

                          continue

                      if sp == 0:

                          seen[p] = sv

                          interesting += 1

                      elif sp != sv:

                          seen[p] |= sv

              else:

                  for p in pfunc(v):

                      if p == nullrev:

                          continue

                      sp = seen[p]

                      if sp and sp < poison:

                          interesting -= 1

                      seen[p] = sv

          return gca

        Bryan O'Sullivan
    
ancestor: a new algorithm that is faster for nodes near tip...

              r18986
            
      def ancestors(pfunc, *orignodes):

          """

          Returns the common ancestors of a and b that are furthest from a

          root (as measured by longest path).

          pfunc must return a list of parent vertices for a given vertex.

          """

          def deepest(nodes):

              interesting = {}

              count = max(nodes) + 1

              depth = [0] * count

              seen = [0] * count

              mapping = []

              for (i, n) in enumerate(sorted(nodes)):

                  depth[n] = 1

                  b = 1 << i

                  seen[n] = b

                  interesting[b] = 1

                  mapping.append((b, n))

              nv = count - 1

              while nv >= 0 and len(interesting) > 1:

                  v = nv

                  nv -= 1

                  dv = depth[v]

                  if dv == 0:

                      continue

                  sv = seen[v]

                  for p in pfunc(v):

                      if p == nullrev:

                          continue

                      dp = depth[p]

                      nsp = sp = seen[p]

                      if dp <= dv:

                          depth[p] = dv + 1

                          if sp != sv:

                              interesting[sv] += 1

                              nsp = seen[p] = sv

                              if sp:

                                  interesting[sp] -= 1

                                  if interesting[sp] == 0:

                                      del interesting[sp]

                      elif dv == dp - 1:

                          nsp = sp | sv

                          if nsp == sp:

                              continue

                          seen[p] = nsp

                          interesting.setdefault(nsp, 0)

                          interesting[nsp] += 1

                          interesting[sp] -= 1

                          if interesting[sp] == 0:

                              del interesting[sp]

                  interesting[sv] -= 1

                  if interesting[sv] == 0:

                      del interesting[sv]

              if len(interesting) != 1:

                  return []

              k = 0

              for i in interesting:

                  k |= i

              return set(n for (i, n) in mapping if k & i)

        Mads Kiilerich
    
ancestors: extract candidates function as commonancestorsheads

              r21101
            
          gca = commonancestorsheads(pfunc, *orignodes)

        Bryan O'Sullivan
    
ancestor: a new algorithm that is faster for nodes near tip...

              r18986
            
          if len(gca) <= 1:

              return gca

          return deepest(gca)

        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
      class incrementalmissingancestors(object):

          '''persistent state used to calculate missing ancestors incrementally

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
          Although similar in spirit to lazyancestors below, this is a separate class

          because trying to support contains and missingancestors operations with the

          same internal data structures adds needless complexity.'''

          def __init__(self, pfunc, bases):

              self.bases = set(bases)

              if not self.bases:

                  self.bases.add(nullrev)

              self.pfunc = pfunc

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor: add a way to test whether a missing ancestor object has bases...

              r23340
            
          def hasbases(self):

              '''whether the common set has any non-trivial bases'''

        Martin von Zweigbergk
    
cleanup: use set literals...

              r32291
            
              return self.bases and self.bases != {nullrev}

        Siddharth Agarwal
    
ancestor: add a way to test whether a missing ancestor object has bases...

              r23340
            
        Siddharth Agarwal
    
ancestor: add a way to add to bases of a missing ancestor object...

              r23341
            
          def addbases(self, newbases):

              '''grow the ancestor set by adding new bases'''

              self.bases.update(newbases)

        Siddharth Agarwal
    
ancestor: add a way to remove ancestors of bases from a given set...

              r23342
            
          def removeancestorsfrom(self, revs):

              '''remove all ancestors of bases from the set revs (in place)'''

              bases = self.bases

              pfunc = self.pfunc

              revs.difference_update(bases)

              # nullrev is always an ancestor

              revs.discard(nullrev)

              if not revs:

                  return

              # anything in revs > start is definitely not an ancestor of bases

              # revs <= start needs to be investigated

              start = max(bases)

              keepcount = sum(1 for r in revs if r > start)

              if len(revs) == keepcount:

                  # no revs to consider

                  return

        Gregory Szorc
    
global: use pycompat.xrange()...

              r38806
            
              for curr in pycompat.xrange(start, min(revs) - 1, -1):

        Siddharth Agarwal
    
ancestor: add a way to remove ancestors of bases from a given set...

              r23342
            
                  if curr not in bases:

                      continue

                  revs.discard(curr)

                  bases.update(pfunc(curr))

                  if len(revs) == keepcount:

                      # no more potential revs to discard

                      break

        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
          def missingancestors(self, revs):

              '''return all the ancestors of revs that are not ancestors of self.bases

              This may include elements from revs.

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
              Equivalent to the revset (::revs - ::self.bases). Revs are returned in

              revision number order, which is a topological order.'''

              revsvisit = set(revs)

              basesvisit = self.bases

              pfunc = self.pfunc

              bothvisit = revsvisit.intersection(basesvisit)

              revsvisit.difference_update(bothvisit)

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
              if not revsvisit:

        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
                  return []

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
              start = max(max(revsvisit), max(basesvisit))

              # At this point, we hold the invariants that:

              # - revsvisit is the set of nodes we know are an ancestor of at least

              #   one of the nodes in revs

              # - basesvisit is the same for bases

              # - bothvisit is the set of nodes we know are ancestors of at least one

              #   of the nodes in revs and one of the nodes in bases. bothvisit and

              #   revsvisit are mutually exclusive, but bothvisit is a subset of

              #   basesvisit.

              # Now we walk down in reverse topo order, adding parents of nodes

              # already visited to the sets while maintaining the invariants. When a

              # node is found in both revsvisit and basesvisit, it is removed from

              # revsvisit and added to bothvisit. When revsvisit becomes empty, there

              # are no more ancestors of revs that aren't also ancestors of bases, so

              # exit.

              missing = []

        Gregory Szorc
    
global: use pycompat.xrange()...

              r38806
            
              for curr in pycompat.xrange(start, nullrev, -1):

        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
                  if not revsvisit:

                      break

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
                  if curr in bothvisit:

                      bothvisit.remove(curr)

                      # curr's parents might have made it into revsvisit through

                      # another path

                      for p in pfunc(curr):

                          revsvisit.discard(p)

                          basesvisit.add(p)

                          bothvisit.add(p)

                      continue

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
                  if curr in revsvisit:

                      missing.append(curr)

                      revsvisit.remove(curr)

                      thisvisit = revsvisit

                      othervisit = basesvisit

                  elif curr in basesvisit:

                      thisvisit = basesvisit

                      othervisit = revsvisit

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
                  else:

        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
                      # not an ancestor of revs or bases: ignore

                      continue

        Siddharth Agarwal
    
ancestor: faster algorithm for difference of ancestor sets...

              r17970
            
        Siddharth Agarwal
    
ancestor.missingancestors: turn into a state-keeping class...

              r23334
            
                  for p in pfunc(curr):

                      if p == nullrev:

                          pass

                      elif p in othervisit or p in bothvisit:

                          # p is implicitly in thisvisit. This means p is or should be

                          # in bothvisit

                          revsvisit.discard(p)

                          basesvisit.add(p)

                          bothvisit.add(p)

                      else:

                          # visit later

                          thisvisit.add(p)

              missing.reverse()

              return missing

        Martin von Zweigbergk
    
lazyancestors: extract __iter__ to free function...

              r39517
            
      # Extracted from lazyancestors.__iter__ to avoid a reference cycle

      def _lazyancestorsiter(parentrevs, initrevs, stoprev, inclusive):

          seen = {nullrev}

        Yuya Nishihara
    
ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...

              r39573
            
          heappush = heapq.heappush

          heappop = heapq.heappop

        Yuya Nishihara
    
ancestor: use heapreplace() in place of heappop/heappush()...

              r39574
            
          heapreplace = heapq.heapreplace

        Martin von Zweigbergk
    
lazyancestors: extract __iter__ to free function...

              r39517
            
          see = seen.add

          if inclusive:

        Yuya Nishihara
    
ancestor: remove alias of initrevs from _lazyancestorsiter()...

              r39569
            
              visit = [-r for r in initrevs]

              seen.update(initrevs)

        Martin von Zweigbergk
    
lazyancestors: extract __iter__ to free function...

              r39517
            
              heapq.heapify(visit)

          else:

              visit = []

              heapq.heapify(visit)

        Yuya Nishihara
    
ancestor: remove alias of initrevs from _lazyancestorsiter()...

              r39569
            
              for r in initrevs:

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
                  p1, p2 = parentrevs(r)

                  if p1 not in seen:

        Yuya Nishihara
    
ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...

              r39573
            
                      heappush(visit, -p1)

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
                      see(p1)

                  if p2 not in seen:

        Yuya Nishihara
    
ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...

              r39573
            
                      heappush(visit, -p2)

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
                      see(p2)

        Martin von Zweigbergk
    
lazyancestors: extract __iter__ to free function...

              r39517
            
          while visit:

        Yuya Nishihara
    
ancestor: optimize _lazyancestorsiter() for contiguous chains...

              r39572
            
              current = -visit[0]

        Yuya Nishihara
    
ancestor: return early from _lazyancestorsiter() when reached to stoprev...

              r39570
            
              if current < stoprev:

                  break

              yield current

        Yuya Nishihara
    
ancestor: optimize _lazyancestorsiter() for contiguous chains...

              r39572
            
              # optimize out heapq operation if p1 is known to be the next highest

              # revision, which is quite common in linear history.

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
              p1, p2 = parentrevs(current)

              if p1 not in seen:

        Yuya Nishihara
    
ancestor: optimize _lazyancestorsiter() for contiguous chains...

              r39572
            
                  if current - p1 == 1:

                      visit[0] = -p1

                  else:

        Yuya Nishihara
    
ancestor: use heapreplace() in place of heappop/heappush()...

              r39574
            
                      heapreplace(visit, -p1)

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
                  see(p1)

        Yuya Nishihara
    
ancestor: optimize _lazyancestorsiter() for contiguous chains...

              r39572
            
              else:

        Yuya Nishihara
    
ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...

              r39573
            
                  heappop(visit)

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
              if p2 not in seen:

        Yuya Nishihara
    
ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...

              r39573
            
                  heappush(visit, -p2)

        Yuya Nishihara
    
ancestor: unroll loop of parents in _lazyancestorsiter()...

              r39571
            
                  see(p2)

        Martin von Zweigbergk
    
lazyancestors: extract __iter__ to free function...

              r39517
            
        Siddharth Agarwal
    
revlog: move ancestor generation out to a new class...

              r18090
            
      class lazyancestors(object):

        Siddharth Agarwal
    
ancestor.lazyancestors: take parentrevs function rather than changelog...

              r23328
            
          def __init__(self, pfunc, revs, stoprev=0, inclusive=False):

        Siddharth Agarwal
    
revlog: move ancestor generation out to a new class...

              r18090
            
              """Create a new object generating ancestors for the given revs. Does

              not generate revs lower than stoprev.

        Siddharth Agarwal
    
ancestor: add lazy membership testing to lazyancestors...

              r18091
            
              This is computed lazily starting from revs. The object supports

              iteration and membership.

        Siddharth Agarwal
    
revlog: move ancestor generation out to a new class...

              r18090
            
              cl should be a changelog and revs should be an iterable. inclusive is

              a boolean that indicates whether revs should be included. Revs lower

              than stoprev will not be generated.

              Result does not include the null revision."""

        Siddharth Agarwal
    
ancestor.lazyancestors: take parentrevs function rather than changelog...

              r23328
            
              self._parentrevs = pfunc

        Yuya Nishihara
    
ancestor: filter out initial revisions lower than stoprev

              r39512
            
              self._initrevs = revs = [r for r in revs if r >= stoprev]

        Siddharth Agarwal
    
revlog: move ancestor generation out to a new class...

              r18090
            
              self._stoprev = stoprev

              self._inclusive = inclusive

        Martin von Zweigbergk
    
lazyancestors: reuse __iter__ implementation in __contains__...

              r39518
            
              self._containsseen = set()

              self._containsiter = _lazyancestorsiter(self._parentrevs,

                                                      self._initrevs,

                                                      self._stoprev,

                                                      self._inclusive)

        Siddharth Agarwal
    
ancestor: add lazy membership testing to lazyancestors...

              r18091
            
        Pierre-Yves David
    
ancestors: add a __nonzero__ method...

              r22225
            
          def __nonzero__(self):

              """False if the set is empty, True otherwise."""

              try:

        timeless
    
py3: convert to next() function...

              r29216
            
                  next(iter(self))

        Pierre-Yves David
    
ancestors: add a __nonzero__ method...

              r22225
            
                  return True

              except StopIteration:

                  return False

        Gregory Szorc
    
py3: add __bool__ to every class defining __nonzero__...

              r31476
            
          __bool__ = __nonzero__

        Siddharth Agarwal
    
revlog: move ancestor generation out to a new class...

              r18090
            
          def __iter__(self):

              """Generate the ancestors of _initrevs in reverse topological order.

              If inclusive is False, yield a sequence of revision numbers starting

        Boris Feld
    
ancestors: actually iterate over ancestors in topological order (issue5979)...

              r39509
            
              with the parents of each revision in revs, i.e., each revision is

              *not* considered an ancestor of itself. Results are emitted in reverse

              revision number order. That order is also topological: a child is

              always emitted before its parent.

        Siddharth Agarwal
    
revlog: move ancestor generation out to a new class...

              r18090
            
        Boris Feld
    
ancestors: ensure a consistent order even in the "inclusive" case...

              r39510
            
              If inclusive is True, the source revisions are also yielded. The

              reverse revision number order is still enforced."""

        Yuya Nishihara
    
ancestor: remove extra generator from lazyancestors.__iter__()

              r39617
            
              return _lazyancestorsiter(self._parentrevs, self._initrevs,

                                        self._stoprev, self._inclusive)

        Siddharth Agarwal
    
ancestor: add lazy membership testing to lazyancestors...

              r18091
            
          def __contains__(self, target):

              """Test whether target is an ancestor of self._initrevs."""

              seen = self._containsseen

              if target in seen:

                  return True

        Martin von Zweigbergk
    
lazyancestors: reuse __iter__ implementation in __contains__...

              r39518
            
              iter = self._containsiter

              if iter is None:

                  # Iterator exhausted

                  return False

        Yuya Nishihara
    
py3: make 'None in lazyancestors' not crash...

              r38614
            
              # Only integer target is valid, but some callers expect 'None in self'

              # to be False. So we explicitly allow it.

              if target is None:

                  return False

        Siddharth Agarwal
    
ancestor: add lazy membership testing to lazyancestors...

              r18091
            
        Pierre-Yves David
    
ancestors: prefetch method outside of the loop...

              r25639
            
              see = seen.add

        Martin von Zweigbergk
    
lazyancestors: reuse __iter__ implementation in __contains__...

              r39518
            
              try:

                  while True:

                      rev = next(iter)

                      see(rev)

                      if rev == target:

                          return True

                      if rev < target:

                          return False

              except StopIteration:

                  # Set to None to indicate fast-path can be used next time, and to

                  # free up memory.

                  self._containsiter = None

                  return False

        Georges Racinet
    
rust: hooking into Python code...

              r40334
            
        Georges Racinet
    
rust: rustlazyancestors.__contains__...

              r40336
            
      class rustlazyancestors(object):

        Georges Racinet
    
rust: hooking into Python code...

              r40334
            
          def __init__(self, index, revs, stoprev=0, inclusive=False):

              self._index = index

              self._stoprev = stoprev

              self._inclusive = inclusive

              # no need to prefilter out init revs that are smaller than stoprev,

              # it's done by rustlazyancestors constructor.

              # we need to convert to a list, because our ruslazyancestors

              # constructor (from C code) doesn't understand anything else yet

              self._initrevs = initrevs = list(revs)

              self._containsiter = parsers.rustlazyancestors(

                  index, initrevs, stoprev, inclusive)

        Georges Racinet
    
rust: rustlazyancestors.__contains__...

              r40336
            
          def __nonzero__(self):

              """False if the set is empty, True otherwise.

              It's better to duplicate this essentially trivial method than

              to subclass lazyancestors

              """

              try:

                  next(iter(self))

                  return True

              except StopIteration:

                  return False

        Georges Racinet
    
rust: hooking into Python code...

              r40334
            
          def __iter__(self):

              return parsers.rustlazyancestors(self._index,

                                               self._initrevs,

                                               self._stoprev,

                                               self._inclusive)

        Georges Racinet
    
rust: rustlazyancestors.__contains__...

              r40336
            
          def __contains__(self, target):

              return target in self._containsiter

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

Matt Mackall Abstract ancestor algorithm into generic function...	r3135	# ancestor.py - generic DAG ancestor algorithm for mercurial
		#
		# Copyright 2006 Matt Mackall <mpm@selenic.com>
		#
Martin Geisler updated license to be explicit about GPL version 2	r8225	# This software may be used and distributed according to the terms of the
Matt Mackall Update license to GPLv2+	r10263	# GNU General Public License version 2 or any later version.
Matt Mackall Abstract ancestor algorithm into generic function...	r3135
Gregory Szorc ancestor: use absolute_import...	r25915	from __future__ import absolute_import

Augie Fackler cleanup: move stdlib imports to their own import statement...	r20034	import heapq
Gregory Szorc ancestor: use absolute_import...	r25915
		from .node import nullrev
Gregory Szorc global: use pycompat.xrange()...	r38806	from . import (
Georges Racinet rust: hooking into Python code...	r40334	policy,
Gregory Szorc global: use pycompat.xrange()...	r38806	pycompat,
		)
Matt Mackall Abstract ancestor algorithm into generic function...	r3135
Georges Racinet rust: hooking into Python code...	r40334	parsers = policy.importmod(r'parsers')

Mads Kiilerich ancestors: extract candidates function as commonancestorsheads	r21101	def commonancestorsheads(pfunc, *nodes):
		"""Returns a set with the heads of all common ancestors of all nodes,
		heads(::nodes[0] and ::nodes[1] and ...) .

		pfunc must return a list of parent vertices for a given vertex.
		"""
		if not isinstance(nodes, set):
		nodes = set(nodes)
		if nullrev in nodes:
		return set()
		if len(nodes) <= 1:
		return nodes

		allseen = (1 << len(nodes)) - 1
		seen = [0] * (max(nodes) + 1)
		for i, n in enumerate(nodes):
		seen[n] = 1 << i
		poison = 1 << (i + 1)

		gca = set()
		interesting = len(nodes)
		nv = len(seen) - 1
		while nv >= 0 and interesting:
		v = nv
		nv -= 1
		if not seen[v]:
		continue
		sv = seen[v]
		if sv < poison:
		interesting -= 1
		if sv == allseen:
		gca.add(v)
		sv \|= poison
		if v in nodes:
		# history is linear
Martin von Zweigbergk cleanup: use set literals...	r32291	return {v}
Mads Kiilerich ancestors: extract candidates function as commonancestorsheads	r21101	if sv < poison:
		for p in pfunc(v):
		sp = seen[p]
		if p == nullrev:
		continue
		if sp == 0:
		seen[p] = sv
		interesting += 1
		elif sp != sv:
		seen[p] \|= sv
		else:
		for p in pfunc(v):
		if p == nullrev:
		continue
		sp = seen[p]
		if sp and sp < poison:
		interesting -= 1
		seen[p] = sv
		return gca

Bryan O'Sullivan ancestor: a new algorithm that is faster for nodes near tip...	r18986	def ancestors(pfunc, *orignodes):
		"""
		Returns the common ancestors of a and b that are furthest from a
		root (as measured by longest path).

		pfunc must return a list of parent vertices for a given vertex.
		"""
		def deepest(nodes):
		interesting = {}
		count = max(nodes) + 1
		depth = [0] * count
		seen = [0] * count
		mapping = []
		for (i, n) in enumerate(sorted(nodes)):
		depth[n] = 1
		b = 1 << i
		seen[n] = b
		interesting[b] = 1
		mapping.append((b, n))
		nv = count - 1
		while nv >= 0 and len(interesting) > 1:
		v = nv
		nv -= 1
		dv = depth[v]
		if dv == 0:
		continue
		sv = seen[v]
		for p in pfunc(v):
		if p == nullrev:
		continue
		dp = depth[p]
		nsp = sp = seen[p]
		if dp <= dv:
		depth[p] = dv + 1
		if sp != sv:
		interesting[sv] += 1
		nsp = seen[p] = sv
		if sp:
		interesting[sp] -= 1
		if interesting[sp] == 0:
		del interesting[sp]
		elif dv == dp - 1:
		nsp = sp \| sv
		if nsp == sp:
		continue
		seen[p] = nsp
		interesting.setdefault(nsp, 0)
		interesting[nsp] += 1
		interesting[sp] -= 1
		if interesting[sp] == 0:
		del interesting[sp]
		interesting[sv] -= 1
		if interesting[sv] == 0:
		del interesting[sv]

		if len(interesting) != 1:
		return []

		k = 0
		for i in interesting:
		k \|= i
		return set(n for (i, n) in mapping if k & i)

Mads Kiilerich ancestors: extract candidates function as commonancestorsheads	r21101	gca = commonancestorsheads(pfunc, *orignodes)
Bryan O'Sullivan ancestor: a new algorithm that is faster for nodes near tip...	r18986
		if len(gca) <= 1:
		return gca
		return deepest(gca)

Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	class incrementalmissingancestors(object):
		'''persistent state used to calculate missing ancestors incrementally
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	Although similar in spirit to lazyancestors below, this is a separate class
		because trying to support contains and missingancestors operations with the
		same internal data structures adds needless complexity.'''
		def __init__(self, pfunc, bases):
		self.bases = set(bases)
		if not self.bases:
		self.bases.add(nullrev)
		self.pfunc = pfunc
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor: add a way to test whether a missing ancestor object has bases...	r23340	def hasbases(self):
		'''whether the common set has any non-trivial bases'''
Martin von Zweigbergk cleanup: use set literals...	r32291	return self.bases and self.bases != {nullrev}
Siddharth Agarwal ancestor: add a way to test whether a missing ancestor object has bases...	r23340
Siddharth Agarwal ancestor: add a way to add to bases of a missing ancestor object...	r23341	def addbases(self, newbases):
		'''grow the ancestor set by adding new bases'''
		self.bases.update(newbases)

Siddharth Agarwal ancestor: add a way to remove ancestors of bases from a given set...	r23342	def removeancestorsfrom(self, revs):
		'''remove all ancestors of bases from the set revs (in place)'''
		bases = self.bases
		pfunc = self.pfunc
		revs.difference_update(bases)
		# nullrev is always an ancestor
		revs.discard(nullrev)
		if not revs:
		return
		# anything in revs > start is definitely not an ancestor of bases
		# revs <= start needs to be investigated
		start = max(bases)
		keepcount = sum(1 for r in revs if r > start)
		if len(revs) == keepcount:
		# no revs to consider
		return

Gregory Szorc global: use pycompat.xrange()...	r38806	for curr in pycompat.xrange(start, min(revs) - 1, -1):
Siddharth Agarwal ancestor: add a way to remove ancestors of bases from a given set...	r23342	if curr not in bases:
		continue
		revs.discard(curr)
		bases.update(pfunc(curr))
		if len(revs) == keepcount:
		# no more potential revs to discard
		break

Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	def missingancestors(self, revs):
		'''return all the ancestors of revs that are not ancestors of self.bases

		This may include elements from revs.
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	Equivalent to the revset (::revs - ::self.bases). Revs are returned in
		revision number order, which is a topological order.'''
		revsvisit = set(revs)
		basesvisit = self.bases
		pfunc = self.pfunc
		bothvisit = revsvisit.intersection(basesvisit)
		revsvisit.difference_update(bothvisit)
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970	if not revsvisit:
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	return []
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	start = max(max(revsvisit), max(basesvisit))
		# At this point, we hold the invariants that:
		# - revsvisit is the set of nodes we know are an ancestor of at least
		# one of the nodes in revs
		# - basesvisit is the same for bases
		# - bothvisit is the set of nodes we know are ancestors of at least one
		# of the nodes in revs and one of the nodes in bases. bothvisit and
		# revsvisit are mutually exclusive, but bothvisit is a subset of
		# basesvisit.
		# Now we walk down in reverse topo order, adding parents of nodes
		# already visited to the sets while maintaining the invariants. When a
		# node is found in both revsvisit and basesvisit, it is removed from
		# revsvisit and added to bothvisit. When revsvisit becomes empty, there
		# are no more ancestors of revs that aren't also ancestors of bases, so
		# exit.

		missing = []
Gregory Szorc global: use pycompat.xrange()...	r38806	for curr in pycompat.xrange(start, nullrev, -1):
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	if not revsvisit:
		break
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	if curr in bothvisit:
		bothvisit.remove(curr)
		# curr's parents might have made it into revsvisit through
		# another path
		for p in pfunc(curr):
		revsvisit.discard(p)
		basesvisit.add(p)
		bothvisit.add(p)
		continue
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	if curr in revsvisit:
		missing.append(curr)
		revsvisit.remove(curr)
		thisvisit = revsvisit
		othervisit = basesvisit
		elif curr in basesvisit:
		thisvisit = basesvisit
		othervisit = revsvisit
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970	else:
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	# not an ancestor of revs or bases: ignore
		continue
Siddharth Agarwal ancestor: faster algorithm for difference of ancestor sets...	r17970
Siddharth Agarwal ancestor.missingancestors: turn into a state-keeping class...	r23334	for p in pfunc(curr):
		if p == nullrev:
		pass
		elif p in othervisit or p in bothvisit:
		# p is implicitly in thisvisit. This means p is or should be
		# in bothvisit
		revsvisit.discard(p)
		basesvisit.add(p)
		bothvisit.add(p)
		else:
		# visit later
		thisvisit.add(p)

		missing.reverse()
		return missing

Martin von Zweigbergk lazyancestors: extract __iter__ to free function...	r39517	# Extracted from lazyancestors.__iter__ to avoid a reference cycle
		def _lazyancestorsiter(parentrevs, initrevs, stoprev, inclusive):
		seen = {nullrev}
Yuya Nishihara ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...	r39573	heappush = heapq.heappush
		heappop = heapq.heappop
Yuya Nishihara ancestor: use heapreplace() in place of heappop/heappush()...	r39574	heapreplace = heapq.heapreplace
Martin von Zweigbergk lazyancestors: extract __iter__ to free function...	r39517	see = seen.add

		if inclusive:
Yuya Nishihara ancestor: remove alias of initrevs from _lazyancestorsiter()...	r39569	visit = [-r for r in initrevs]
		seen.update(initrevs)
Martin von Zweigbergk lazyancestors: extract __iter__ to free function...	r39517	heapq.heapify(visit)
		else:
		visit = []
		heapq.heapify(visit)
Yuya Nishihara ancestor: remove alias of initrevs from _lazyancestorsiter()...	r39569	for r in initrevs:
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	p1, p2 = parentrevs(r)
		if p1 not in seen:
Yuya Nishihara ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...	r39573	heappush(visit, -p1)
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	see(p1)
		if p2 not in seen:
Yuya Nishihara ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...	r39573	heappush(visit, -p2)
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	see(p2)
Martin von Zweigbergk lazyancestors: extract __iter__ to free function...	r39517
		while visit:
Yuya Nishihara ancestor: optimize _lazyancestorsiter() for contiguous chains...	r39572	current = -visit[0]
Yuya Nishihara ancestor: return early from _lazyancestorsiter() when reached to stoprev...	r39570	if current < stoprev:
		break
		yield current
Yuya Nishihara ancestor: optimize _lazyancestorsiter() for contiguous chains...	r39572	# optimize out heapq operation if p1 is known to be the next highest
		# revision, which is quite common in linear history.
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	p1, p2 = parentrevs(current)
		if p1 not in seen:
Yuya Nishihara ancestor: optimize _lazyancestorsiter() for contiguous chains...	r39572	if current - p1 == 1:
		visit[0] = -p1
		else:
Yuya Nishihara ancestor: use heapreplace() in place of heappop/heappush()...	r39574	heapreplace(visit, -p1)
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	see(p1)
Yuya Nishihara ancestor: optimize _lazyancestorsiter() for contiguous chains...	r39572	else:
Yuya Nishihara ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...	r39573	heappop(visit)
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	if p2 not in seen:
Yuya Nishihara ancestor: rename local aliases of heapq functions in _lazyancestorsiter()...	r39573	heappush(visit, -p2)
Yuya Nishihara ancestor: unroll loop of parents in _lazyancestorsiter()...	r39571	see(p2)
Martin von Zweigbergk lazyancestors: extract __iter__ to free function...	r39517
Siddharth Agarwal revlog: move ancestor generation out to a new class...	r18090	class lazyancestors(object):
Siddharth Agarwal ancestor.lazyancestors: take parentrevs function rather than changelog...	r23328	def __init__(self, pfunc, revs, stoprev=0, inclusive=False):
Siddharth Agarwal revlog: move ancestor generation out to a new class...	r18090	"""Create a new object generating ancestors for the given revs. Does
		not generate revs lower than stoprev.

Siddharth Agarwal ancestor: add lazy membership testing to lazyancestors...	r18091	This is computed lazily starting from revs. The object supports
		iteration and membership.
Siddharth Agarwal revlog: move ancestor generation out to a new class...	r18090
		cl should be a changelog and revs should be an iterable. inclusive is
		a boolean that indicates whether revs should be included. Revs lower
		than stoprev will not be generated.

		Result does not include the null revision."""
Siddharth Agarwal ancestor.lazyancestors: take parentrevs function rather than changelog...	r23328	self._parentrevs = pfunc
Yuya Nishihara ancestor: filter out initial revisions lower than stoprev	r39512	self._initrevs = revs = [r for r in revs if r >= stoprev]
Siddharth Agarwal revlog: move ancestor generation out to a new class...	r18090	self._stoprev = stoprev
		self._inclusive = inclusive

Martin von Zweigbergk lazyancestors: reuse __iter__ implementation in __contains__...	r39518	self._containsseen = set()
		self._containsiter = _lazyancestorsiter(self._parentrevs,
		self._initrevs,
		self._stoprev,
		self._inclusive)
Siddharth Agarwal ancestor: add lazy membership testing to lazyancestors...	r18091
Pierre-Yves David ancestors: add a __nonzero__ method...	r22225	def __nonzero__(self):
		"""False if the set is empty, True otherwise."""
		try:
timeless py3: convert to next() function...	r29216	next(iter(self))
Pierre-Yves David ancestors: add a __nonzero__ method...	r22225	return True
		except StopIteration:
		return False

Gregory Szorc py3: add __bool__ to every class defining __nonzero__...	r31476	__bool__ = __nonzero__

Siddharth Agarwal revlog: move ancestor generation out to a new class...	r18090	def __iter__(self):
		"""Generate the ancestors of _initrevs in reverse topological order.

		If inclusive is False, yield a sequence of revision numbers starting
Boris Feld ancestors: actually iterate over ancestors in topological order (issue5979)...	r39509	with the parents of each revision in revs, i.e., each revision is
		not considered an ancestor of itself. Results are emitted in reverse
		revision number order. That order is also topological: a child is
		always emitted before its parent.
Siddharth Agarwal revlog: move ancestor generation out to a new class...	r18090
Boris Feld ancestors: ensure a consistent order even in the "inclusive" case...	r39510	If inclusive is True, the source revisions are also yielded. The
		reverse revision number order is still enforced."""
Yuya Nishihara ancestor: remove extra generator from lazyancestors.__iter__()	r39617	return _lazyancestorsiter(self._parentrevs, self._initrevs,
		self._stoprev, self._inclusive)
Siddharth Agarwal ancestor: add lazy membership testing to lazyancestors...	r18091
		def __contains__(self, target):
		"""Test whether target is an ancestor of self._initrevs."""
		seen = self._containsseen
		if target in seen:
		return True
Martin von Zweigbergk lazyancestors: reuse __iter__ implementation in __contains__...	r39518	iter = self._containsiter
		if iter is None:
		# Iterator exhausted
		return False
Yuya Nishihara py3: make 'None in lazyancestors' not crash...	r38614	# Only integer target is valid, but some callers expect 'None in self'
		# to be False. So we explicitly allow it.
		if target is None:
		return False
Siddharth Agarwal ancestor: add lazy membership testing to lazyancestors...	r18091
Pierre-Yves David ancestors: prefetch method outside of the loop...	r25639	see = seen.add
Martin von Zweigbergk lazyancestors: reuse __iter__ implementation in __contains__...	r39518	try:
		while True:
		rev = next(iter)
		see(rev)
		if rev == target:
		return True
		if rev < target:
		return False
		except StopIteration:
		# Set to None to indicate fast-path can be used next time, and to
		# free up memory.
		self._containsiter = None
		return False
Georges Racinet rust: hooking into Python code...	r40334
Georges Racinet rust: rustlazyancestors.__contains__...	r40336	class rustlazyancestors(object):
Georges Racinet rust: hooking into Python code...	r40334
		def __init__(self, index, revs, stoprev=0, inclusive=False):
		self._index = index
		self._stoprev = stoprev
		self._inclusive = inclusive
		# no need to prefilter out init revs that are smaller than stoprev,
		# it's done by rustlazyancestors constructor.
		# we need to convert to a list, because our ruslazyancestors
		# constructor (from C code) doesn't understand anything else yet
		self._initrevs = initrevs = list(revs)

		self._containsiter = parsers.rustlazyancestors(
		index, initrevs, stoprev, inclusive)

Georges Racinet rust: rustlazyancestors.__contains__...	r40336	def __nonzero__(self):
		"""False if the set is empty, True otherwise.

		It's better to duplicate this essentially trivial method than
		to subclass lazyancestors
		"""
		try:
		next(iter(self))
		return True
		except StopIteration:
		return False

Georges Racinet rust: hooking into Python code...	r40334	def __iter__(self):
		return parsers.rustlazyancestors(self._index,
		self._initrevs,
		self._stoprev,
		self._inclusive)
Georges Racinet rust: rustlazyancestors.__contains__...	r40336
		def __contains__(self, target):
		return target in self._containsiter