# setdiscovery.py - improved discovery of common nodeset for mercurial
#
# Copyright 2010 Benoit Boissinot <bboissin@gmail.com>
# and Peter Arrenbrecht <peter@arrenbrecht.ch>
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.
"""
The algorithm works in the following way. You have two repositories: local
and remote. They both contain a DAG of changelists.

The goal of the discovery protocol is to find one set of nodes, *common*,
the set of nodes shared by local and remote.

One of the issues with the original protocol was latency: it could
potentially require lots of roundtrips to discover that the local repo was a
subset of remote (which is a very common case, you usually have few changes
compared to upstream, while upstream probably had lots of development).

The new protocol only requires one interface for the remote repo: `known()`,
which given a set of changelists tells you if they are present in the DAG.

The algorithm then works as follows:

 - We will be using three sets, `common`, `missing`, `unknown`. Originally
 all nodes are in `unknown`.
 - Take a sample from `unknown`, call `remote.known(sample)`
   - For each node that remote knows, move it and all its ancestors to `common`
   - For each node that remote doesn't know, move it and all its descendants
   to `missing`
 - Iterate until `unknown` is empty

There are a couple of optimizations. The first is, instead of starting with a
random sample of missing, to start by sending all heads: in the case where
the local repo is a subset, you have computed the answer in one round trip.

Then you can do something similar to the bisecting strategy used when
finding faulty changesets. Instead of random samples, you can try picking
nodes that will maximize the number of nodes that will be
classified with it (since all ancestors or descendants will be marked as well).
"""

from __future__ import annotations
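The basic loop described in the module docstring can be sketched on a toy DAG. This is a minimal illustration, not Mercurial's implementation: the names `discover`, `ancestors`, `descendants`, `LOCAL`, `REMOTE_HAS` and `remote_known` are hypothetical, the DAG is a plain dict, and sampling is purely random (none of the head-first or bisection-like optimizations are applied).

```python
import random


def ancestors(parents, node):
    """Return `node` plus all its ancestors in a {node: [parents]} DAG."""
    seen, stack = set(), [node]
    while stack:
        n = stack.pop()
        if n not in seen:
            seen.add(n)
            stack.extend(parents[n])
    return seen


def descendants(parents, node):
    """Return `node` plus all its descendants (naive, fine for a toy DAG)."""
    return {n for n in parents if node in ancestors(parents, n)}


def discover(parents, remote_known, sample_size=2, seed=0):
    """Classify every local node as common or missing; count round trips."""
    rng = random.Random(seed)
    unknown = set(parents)
    common, missing = set(), set()
    roundtrips = 0
    while unknown:
        size = min(sample_size, len(unknown))
        sample = rng.sample(sorted(unknown), size)
        roundtrips += 1  # one `known` query per iteration
        for node, known in zip(sample, remote_known(sample)):
            if known:
                moved = ancestors(parents, node)
                common |= moved
            else:
                moved = descendants(parents, node)
                missing |= moved
            unknown -= moved
    return common, missing, roundtrips


# Hypothetical local DAG: 0 <- 1 <- 2 <- 3, plus a branch 1 <- 4.
LOCAL = {0: [], 1: [0], 2: [1], 3: [2], 4: [1]}
# Hypothetical remote content (closed under ancestors, as in a real repo).
REMOTE_HAS = {0, 1, 2}


def remote_known(sample):
    """Stand-in for the remote `known()` call: one boolean per node."""
    return [n in REMOTE_HAS for n in sample]


common, missing, roundtrips = discover(LOCAL, remote_known)
```

Whatever the random samples are, the classification converges to the same `common` and `missing` sets; only the number of round trips varies, which is what the sampling strategies in this module try to minimize.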

import collections
import random

from .i18n import _
from .node import nullrev
from . import (
    error,
    policy,
    util,
)


def _updatesample(revs, heads, sample, parentfn, quicksamplesize=0):
    """update an existing sample to match the expected size

    The sample is updated with revs exponentially distant from each head of the
    <revs> set. (H~1, H~2, H~4, H~8, etc).

    If a target size is specified, the sampling will stop once this size is
    reached. Otherwise sampling will happen until roots of the <revs> set are
    reached.

    :revs:  set of revs we want to discover (if None, assume the whole dag)
    :heads: set of DAG head revs
    :sample: a sample to update
    :parentfn: a callable to resolve parents for a revision
    :quicksamplesize: optional target size of the sample"""
    dist = {}
    visit = collections.deque(heads)
    seen = set()
    factor = 1
    while visit:
        curr = visit.popleft()
        if curr in seen:
            continue
        d = dist.setdefault(curr, 1)
        if d > factor:
            factor *= 2
        if d == factor:
            sample.add(curr)
            if quicksamplesize and (len(sample) >= quicksamplesize):
                return
        seen.add(curr)

        for p in parentfn(curr):
            if p != nullrev and (not revs or p in revs):
                dist.setdefault(p, d + 1)
                visit.append(p)
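To see which revisions the exponential walk picks, the same loop can be run on a linear history. The sketch below restates the walk as a standalone function so it runs outside Mercurial; `exponential_sample`, `NULLREV` and `linear_parents` are hypothetical names, and the 16-revision chain is an assumption chosen for readability.

```python
import collections

NULLREV = -1  # stand-in for mercurial.node.nullrev


def exponential_sample(heads, parentfn, revs=None, quicksamplesize=0):
    """Collect revs at distance 1, 2, 4, 8, ... from the given heads."""
    sample = set()
    dist = {}
    visit = collections.deque(heads)
    seen = set()
    factor = 1
    while visit:
        curr = visit.popleft()
        if curr in seen:
            continue
        d = dist.setdefault(curr, 1)  # a head is at distance 1
        if d > factor:
            factor *= 2
        if d == factor:
            sample.add(curr)
            if quicksamplesize and len(sample) >= quicksamplesize:
                return sample
        seen.add(curr)
        for p in parentfn(curr):
            if p != NULLREV and (not revs or p in revs):
                dist.setdefault(p, d + 1)
                visit.append(p)
    return sample


def linear_parents(rev):
    """Parents in a linear history 0 <- 1 <- ... <- 15 (no merges)."""
    return (rev - 1, NULLREV)


full = exponential_sample([15], linear_parents)
quick = exponential_sample([15], linear_parents, quicksamplesize=3)
```

On this chain the full walk samples the head and the revisions at distance 2, 4, 8 and 16 from it, so the sample grows logarithmically with the length of the history. With `quicksamplesize=3` the walk stops as soon as three revisions have been collected.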


def _limitsample(sample, desiredlen, randomize=True):
    """return a random subset of sample of at most desiredlen items.

    If randomize is False, though, a deterministic subset is returned.
    This is meant for integration tests.
    """
    if len(sample) <= desiredlen:
        return sample
    sample = list(sample)
    if randomize:
        return set(random.sample(sample, desiredlen))
    sample.sort()
    return set(sample[:desiredlen])
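The difference between the randomized and the deterministic mode can be illustrated with a standalone copy of the function. `limit_sample` and `revs` are hypothetical names used only for this sketch. The conversion to a list matters because `random.sample` rejects sets since Python 3.11.

```python
import random


def limit_sample(sample, desiredlen, randomize=True):
    """Return at most `desiredlen` items of `sample`."""
    if len(sample) <= desiredlen:
        return sample  # already small enough, returned unchanged
    items = list(sample)  # random.sample() needs a sequence, not a set
    if randomize:
        return set(random.sample(items, desiredlen))
    items.sort()
    return set(items[:desiredlen])  # deterministic: the smallest revs


revs = {9, 3, 7, 1, 5}
deterministic = limit_sample(revs, 3, randomize=False)
randomized = limit_sample(revs, 3)
untouched = limit_sample(revs, 10)
```

In deterministic mode the result is always the `desiredlen` lowest revision numbers, which keeps test output stable. In randomized mode only the size and membership are guaranteed.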


class partialdiscovery:
    """an object representing ongoing discovery

    Fed with data from the remote repository, this object keeps track of the
    current set of changesets in various states:

    - common:    revs also known remotely
    - undecided: revs we don't have information on yet
    - missing:   revs missing remotely
    (all tracked revisions are known locally)
    """

    def __init__(self, repo, targetheads, respectsize, randomize=True):
        self._repo = repo
        self._targetheads = targetheads
        self._common = repo.changelog.incrementalmissingrevs()
        self._undecided = None
        self.missing = set()
        self._childrenmap = None
        self._respectsize = respectsize
        self.randomize = randomize

    def addcommons(self, commons):
        """register nodes known as common"""
        self._common.addbases(commons)
        if self._undecided is not None:
            self._common.removeancestorsfrom(self._undecided)

    def addmissings(self, missings):
        """register some nodes as missing"""
        newmissing = self._repo.revs(b'%ld::%ld', missings, self.undecided)
        if newmissing:
            self.missing.update(newmissing)
            self.undecided.difference_update(newmissing)

    def addinfo(self, sample):
        """consume an iterable of (rev, known) tuples"""
        common = set()
        missing = set()
        for rev, known in sample:
            if known:
                common.add(rev)
            else:
                missing.add(rev)
        if common:
            self.addcommons(common)
        if missing:
            self.addmissings(missing)

    def hasinfo(self):
        """return True if we have any clue about the remote state"""
        return self._common.hasbases()

    def iscomplete(self):
        """True if all the necessary data have been gathered"""
        return self._undecided is not None and not self._undecided

    @property
    def undecided(self):
        if self._undecided is not None:
            return self._undecided
        self._undecided = set(self._common.missingancestors(self._targetheads))
        return self._undecided

    def stats(self):
        return {
            'undecided': len(self.undecided),
        }

    def commonheads(self):
        """the heads of the known common set"""
        # heads(common) == heads(common.bases) since common represents
        # common.bases and all its ancestors
        return self._common.basesheads()

    def _parentsgetter(self):
        getrev = self._repo.changelog.index.__getitem__

        def getparents(r):
            return getrev(r)[5:7]

        return getparents

    def _childrengetter(self):
        if self._childrenmap is not None:
            # During discovery, the `undecided` set keeps shrinking.
            # Therefore, the map computed for an iteration N will be
            # valid for iteration N+1. Instead of computing the same
            # data over and over, we cache it the first time.
            return self._childrenmap.__getitem__

        # _updatesample() essentially iterates over revisions to look
        # up their children. This lookup is expensive and doing it in a loop is
        # quadratic. We precompute the children for all relevant revisions and
        # make the lookup in _updatesample() a simple dict lookup.
        self._childrenmap = children = {}

        parentrevs = self._parentsgetter()
        revs = self.undecided

        for rev in sorted(revs):
            # Always ensure revision has an entry so we don't need to worry
            # about missing keys.
            children[rev] = []
            for prev in parentrevs(rev):
                if prev == nullrev:
                    continue
                c = children.get(prev)
                if c is not None:
                    c.append(rev)
        return children.__getitem__
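The children map built by `_childrengetter()` can be reproduced with plain dicts. In this sketch `build_children`, `NULLREV`, `PARENTS` and `undecided` are hypothetical names, and the parent table stands in for the changelog index. It relies on the assumption that a parent always has a lower revision number than its children, so iterating in sorted order guarantees a parent's entry exists before any child tries to append to it.

```python
NULLREV = -1  # stand-in for mercurial.node.nullrev


def build_children(revs, parentrevs):
    """Map each rev in `revs` to the list of its children within `revs`."""
    children = {}
    for rev in sorted(revs):
        # Every tracked rev gets an entry, even if it has no children.
        children[rev] = []
        for prev in parentrevs(rev):
            if prev == NULLREV:
                continue
            c = children.get(prev)
            if c is not None:  # parents outside `revs` are ignored
                c.append(rev)
    return children


# Hypothetical history: 0 <- 1, then 1 <- 2 and 1 <- 3, merged in 4.
PARENTS = {
    0: (NULLREV, NULLREV),
    1: (0, NULLREV),
    2: (1, NULLREV),
    3: (1, NULLREV),
    4: (2, 3),
}
undecided = {1, 2, 3, 4}  # rev 0 is already classified

children = build_children(undecided, PARENTS.__getitem__)
```

Because the map only ever holds revisions from the `undecided` set, and that set shrinks over time, a map computed in one round remains usable in later rounds, which is why the method caches it.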

    def takequicksample(self, headrevs, size):
        """takes a quick sample of size <size>

        It is meant for initial sampling and focuses on querying heads and close
        ancestors of heads.

        :headrevs: set of head revisions in local DAG to consider
        :size: the maximum size of the sample"""
        revs = self.undecided
        if len(revs) <= size:
            return list(revs)
        sample = set(self._repo.revs(b'heads(%ld)', revs))

        if len(sample) >= size:
            return _limitsample(sample, size, randomize=self.randomize)

        _updatesample(
            None, headrevs, sample, self._parentsgetter(), quicksamplesize=size
        )
        return sample

    def takefullsample(self, headrevs, size):
        revs = self.undecided
        if len(revs) <= size:
            return list(revs)
        repo = self._repo
        sample = set(repo.revs(b'heads(%ld)', revs))
        parentrevs = self._parentsgetter()

        # update from heads
        revsheads = sample.copy()
        _updatesample(revs, revsheads, sample, parentrevs)

        # update from roots
        revsroots = set(repo.revs(b'roots(%ld)', revs))
        childrenrevs = self._childrengetter()
        _updatesample(revs, revsroots, sample, childrenrevs)
        assert sample

        if not self._respectsize:
            size = max(size, min(len(revsroots), len(revsheads)))

        sample = _limitsample(sample, size, randomize=self.randomize)
        if len(sample) < size:
            more = size - len(sample)
            takefrom = list(revs - sample)
            if self.randomize:
                sample.update(random.sample(takefrom, more))
            else:
                takefrom.sort()
                sample.update(takefrom[:more])
        return sample


pure_partialdiscovery = partialdiscovery

partialdiscovery = policy.importrust(
    'discovery', member='PartialDiscovery', default=partialdiscovery
)


def findcommonheads(
    ui,
    local,
    remote,
    abortwhenunrelated=True,
    ancestorsof=None,
    audit=None,
):
    """Return a tuple (common, anyincoming, remoteheads) used to identify
    missing nodes from or in remote.

    The audit argument is an optional dictionary that a caller can pass. It
    will be updated with extra data about the discovery; this is useful for
    debugging.
    """

    samplegrowth = float(ui.config(b'devel', b'discovery.grow-sample.rate'))

    if audit is not None:
        audit[b'total-queries'] = 0

    start = util.timer()

    roundtrips = 0
    cl = local.changelog
    clnode = cl.node
    clrev = cl.rev

    if ancestorsof is not None:
        ownheads = [clrev(n) for n in ancestorsof]
    else:
        ownheads = [rev for rev in cl.headrevs() if rev != nullrev]

    initial_head_exchange = ui.configbool(b'devel', b'discovery.exchange-heads')
    initialsamplesize = ui.configint(b'devel', b'discovery.sample-size.initial')
    fullsamplesize = ui.configint(b'devel', b'discovery.sample-size')

    # We also ask remote about all the local heads. That set can be arbitrarily
    # large, so we used to limit its size to `initialsamplesize`. We no longer
    # do, as it proved counterproductive. The skipped heads could lead to a
    # large "undecided" set, slower to be clarified than if we asked the
    # question for all heads right away.
    #
    # We are already fetching all server heads using the `heads` command,
    # sending an equivalent number of heads the other way should not have a
    # significant impact. In addition, it is very likely that we are going to
    # have to issue "known" requests for an equivalent number of revisions in
    # order to decide if these heads are common or missing.
    #
    # Find a detailed analysis below.
    #
    # Case A: local and server both have few heads
    #
    #     Ownheads is below initialsamplesize, limit would not have any effect.
    #
    # Case B: local has few heads and server has many
    #
    #     Ownheads is below initialsamplesize, limit would not have any effect.
    #
    # Case C: local and server both have many heads
    #
    #     We now transfer some more data, but not significantly more than is
    #     already transferred to carry the server heads.
    #
    # Case D: local has many heads, server has few
    #
    #   D.1 local heads are mostly known remotely
    #
    #     All the known heads will have to be part of a `known` request at some
    #     point for the discovery to finish. Sending them all earlier is
    #     actually helping.
    #
    #     (This case is fairly unlikely, it requires the numerous heads to all
    #     be merged server side in only a few heads)
    #
    #   D.2 local heads are mostly missing remotely
    #
    #     To determine that the heads are missing, we'll have to issue `known`
    #     requests for them or one of their ancestors. The number of `known`
    #     requests will likely be of the same order of magnitude as the number
    #     of local heads.
    #
    #     The only case where we can be more efficient using `known` requests
    #     on ancestors is when all the "missing" local heads are based on a
    #     few changesets, also "missing". This means we would have a "complex"
    #     graph (with many heads) attached to, but very independent of, the
    #     "simple" graph on the server. This is a fairly unusual case and has
    #     not been met in the wild so far.

    if initial_head_exchange:
        if remote.limitedarguments:
            sample = _limitsample(ownheads, initialsamplesize)
            # indices between sample and externalized version must match
            sample = list(sample)
        else:
            sample = ownheads

        ui.debug(b"query 1; heads\n")
        roundtrips += 1
        with remote.commandexecutor() as e:
            fheads = e.callcommand(b'heads', {})
            if audit is not None:
                audit[b'total-queries'] += len(sample)
            fknown = e.callcommand(
                b'known',
                {
                    b'nodes': [clnode(r) for r in sample],
                },
            )

        srvheadhashes, yesno = fheads.result(), fknown.result()

        if audit is not None:
            audit[b'total-roundtrips'] = 1

        if cl.tiprev() == nullrev:
            if srvheadhashes != [cl.nullid]:
                return [cl.nullid], True, srvheadhashes
            return [cl.nullid], False, []
    else:
        # we still need the remote head for the function return
        with remote.commandexecutor() as e:
            fheads = e.callcommand(b'heads', {})
        srvheadhashes = fheads.result()

    # start actual discovery (we note this before the next "if" for
    # compatibility reasons)
    ui.status(_(b"searching for changes\n"))

    knownsrvheads = []  # revnos of remote heads that are known locally
    for node in srvheadhashes:
        if node == cl.nullid:
            continue

        try:
            knownsrvheads.append(clrev(node))
        # Catches unknown and filtered nodes.
        except error.LookupError:
            continue

    if initial_head_exchange:
        # early exit if we know all the specified remote heads already
        if len(knownsrvheads) == len(srvheadhashes):
            ui.debug(b"all remote heads known locally\n")
            return srvheadhashes, False, srvheadhashes

        if len(sample) == len(ownheads) and all(yesno):
            ui.note(_(b"all local changesets known remotely\n"))
            ownheadhashes = [clnode(r) for r in ownheads]
            return ownheadhashes, True, srvheadhashes

    # full blown discovery

    # if the server has a limit to its arguments size, we can't grow the sample.
    configbool = local.ui.configbool
    grow_sample = configbool(b'devel', b'discovery.grow-sample')
    grow_sample = grow_sample and not remote.limitedarguments

    dynamic_sample = configbool(b'devel', b'discovery.grow-sample.dynamic')
    hard_limit_sample = not (dynamic_sample or remote.limitedarguments)

    randomize = ui.configbool(b'devel', b'discovery.randomize')
    if cl.index.rust_ext_compat:
        pd = partialdiscovery
    else:
        pd = pure_partialdiscovery
    disco = pd(local, ownheads, hard_limit_sample, randomize=randomize)
    if initial_head_exchange:
        # treat remote heads (and maybe own heads) as a first implicit sample
        # response
        disco.addcommons(knownsrvheads)
        disco.addinfo(zip(sample, yesno))

    full = not initial_head_exchange
    progress = ui.makeprogress(_(b'searching'), unit=_(b'queries'))
    while not disco.iscomplete():
        if full or disco.hasinfo():
            if full:
                ui.note(_(b"sampling from both directions\n"))
            else:
                ui.debug(b"taking initial sample\n")
            samplefunc = disco.takefullsample
            targetsize = fullsamplesize
            if grow_sample:
                fullsamplesize = int(fullsamplesize * samplegrowth)
        else:
            # use even cheaper initial sample
            ui.debug(b"taking quick initial sample\n")
            samplefunc = disco.takequicksample
            targetsize = initialsamplesize
        sample = samplefunc(ownheads, targetsize)

        roundtrips += 1
        progress.update(roundtrips)
        stats = disco.stats()
        ui.debug(
            b"query %i; still undecided: %i, sample size is: %i\n"
            % (roundtrips, stats['undecided'], len(sample))
        )

        # indices between sample and externalized version must match
        sample = list(sample)

        with remote.commandexecutor() as e:
            if audit is not None:
                audit[b'total-queries'] += len(sample)
            yesno = e.callcommand(
                b'known',
                {
                    b'nodes': [clnode(r) for r in sample],
                },
            ).result()

        full = True

        disco.addinfo(zip(sample, yesno))

    result = disco.commonheads()
    elapsed = util.timer() - start
    progress.complete()
    ui.debug(b"%d total queries in %.4fs\n" % (roundtrips, elapsed))
    msg = (
        b'found %d common and %d unknown server heads,'
        b' %d roundtrips in %.4fs\n'
    )
    missing = set(result) - set(knownsrvheads)
    ui.log(b'discovery', msg, len(result), len(missing), roundtrips, elapsed)

    if audit is not None:
        audit[b'total-roundtrips'] = roundtrips

    if not result and srvheadhashes != [cl.nullid]:
        if abortwhenunrelated:
            raise error.Abort(_(b"repository is unrelated"))
        else:
            ui.warn(_(b"warning: repository is unrelated\n"))
        return (
            {cl.nullid},
            True,
            srvheadhashes,
        )

    anyincoming = srvheadhashes != [cl.nullid]
    result = {clnode(r) for r in result}
    return result, anyincoming, srvheadhashes