##// END OF EJS Templates
branchmap-v3: filter topo heads using node for performance reason...
branchmap-v3: filter topo heads using node for performance reason The branchmap currently contains heads as nodeid. If we build a set of revnum with the topological heads, we need to turn the nodeid in the branchmap to revnum to be able to check if they are topo-heads. That nodeid → revnum lookup is "expensive" and adds up to something noticeable if you do it hundreds of thousand of time. Instead we turn all the topo-heads revnums into nodes and build a set. So we can directly test membership of the nodeids stored in the branchmap. That is much faster. Ideally we would have revnum in the branchmap and could directly test revnum against a revnum set and that would be even faster. However that's an adventure for another time. Without this change, the branchmap format "v3" was significantly slower than the "v2" format. With this changes, some of that gap is recovered With rust + persistent nodemap, this overhead was smaller because the extra lookup did not had to to build the nodemap from scratch. In addition the mozilla-unified repository is able to use the "pure_top" mode of branchmap v3, so it was not really affected by this. Future changeset will work of the remaining of the performance gap. ### benchmark.name = hg.command.unbundle # bin-env-vars.hg.py-re2-module = default # benchmark.variants.issue6528 = disabled # benchmark.variants.resource-usage = default # benchmark.variants.reuse-external-delta-parent = yes # benchmark.variants.revs = any-1-extra-rev # benchmark.variants.source = unbundle # benchmark.variants.validate = default # benchmark.variants.verbosity = quiet ## data-env-vars.name = netbeans-2018-08-01-zstd-sparse-revlog # bin-env-vars.hg.flavor = default branch-v2: 0.233711 ~~~~~ branch-v3 before: 0.380994 (+63.02%, +0.15) branch-v3 after: 0.368769 (+57.79%, +0.14) # bin-env-vars.hg.flavor = rust branch-v2: 0.235230 ~~~~~ branch-v3 before: 0.385060 (+63.70%, +0.15) branch-v3 after: 0.372460 (+58.34%, +0.14) ## data-env-vars.name = netbeans-2018-08-01-ds2-pnm # bin-env-vars.hg.flavor = rust branch-v2: 0.255586 ~~~~~ branch-v3 before: 0.317524 (+24.23%, +0.06) branch-v3 after: 0.318907 (+24.78%, +0.06) ## data-env-vars.name = mozilla-central-2024-03-22-zstd-sparse-revlog # bin-env-vars.hg.flavor = default branch-v2: 0.339010 ~~~~~ branch-v3 before: 0.410007 (+20.94%, +0.07) branch-v3 after: 0.349752 (+3.17%, +0.01) # bin-env-vars.hg.flavor = rust branch-v2: 0.346525 ~~~~~ branch-v3 before: 0.410428 (+18.44%, +0.06) branch-v3 after: 0.354300 (+2.24%, +0.01) ## data-env-vars.name = mozilla-central-2024-03-22-ds2-pnm # bin-env-vars.hg.flavor = rust branch-v2: 0.380202 ~~~~~ branch-v3 before: 0.393871 (+3.60%, +0.01) branch-v3 after: 0.396293 (+4.23%, +0.02) ## data-env-vars.name = mozilla-unified-2024-03-22-zstd-sparse-revlog # bin-env-vars.hg.flavor = default branch-v2: 0.412165 ~~~~~ branch-v3 before: 0.438105 (+6.29%, +0.03) branch-v3 after: 0.424769 (+3.06%, +0.01) # bin-env-vars.hg.flavor = rust branch-v2: 0.412397 ~~~~~ branch-v3 before: 0.438405 (+6.31%, +0.03) branch-v3 after: 0.421796 (+2.28%, +0.01) ## data-env-vars.name = mozilla-unified-2024-03-22-ds2-pnm # bin-env-vars.hg.flavor = rust branch-v2: 0.429501 ~~~~~ branch-v3 before: 0.452692 (+5.40%, +0.02) branch-v3 after: 0.443849 (+3.34%, +0.01) ## data-env-vars.name = mozilla-try-2024-03-26-zstd-sparse-revlog # bin-env-vars.hg.flavor = default branch-v2: 3.403171 ~~~~~ branch-v3 before: 6.562345 (+92.83%, +3.16) branch-v3 after: 6.234055 (+83.18%, +2.83) # bin-env-vars.hg.flavor = rust branch-v2: 3.454876 ~~~~~ branch-v3 before: 6.160248 (+78.31%, +2.71) branch-v3 after: 6.307813 (+82.58%, +2.85) ## data-env-vars.name = mozilla-try-2024-03-26-ds2-pnm # bin-env-vars.hg.flavor = rust branch-v2: 3.465435 ~~~~~ branch-v3 before: 5.381648 (+55.30%, +1.92) branch-v3 after: 5.176076 (+49.36%, +1.71)

File last commit:

r52822:93d872a0 default
r52869:41b8892a default
Show More
dirstate.py
334 lines | 10.4 KiB | text/x-python | PythonLexer
Matt Harbison
typing: add `from __future__ import annotations` to most files...
r52756 from __future__ import annotations
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 import contextlib
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 import os
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816 import typing
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
interfaces: convert the dirstate zope interface to a Protocol class...
r52814 from typing import (
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 Any,
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816 Callable,
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 Dict,
Iterable,
Iterator,
List,
Optional,
Matt Harbison
interfaces: convert the dirstate zope interface to a Protocol class...
r52814 Protocol,
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 Tuple,
Matt Harbison
interfaces: convert the dirstate zope interface to a Protocol class...
r52814 )
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816 if typing.TYPE_CHECKING:
# Almost all mercurial modules are only imported in the type checking phase
# to avoid circular imports
from .. import (
match as matchmod,
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 scmutil,
transaction as txnmod,
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816 )
Augie Fackler
formatting: blacken the codebase...
r43346
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 # TODO: finish adding type hints
AddParentChangeCallbackT = Callable[
["idirstate", Tuple[Any, Any], Tuple[Any, Any]], Any
]
"""The callback type for dirstate.addparentchangecallback()."""
# TODO: add a Protocol for dirstatemap.DirStateItem? (It is
# conditionalized with python or rust implementations. Also,
# git.dirstate needs to yield non-None from ``items()``.)
DirstateItemT = Any # dirstatemap.DirstateItem
IgnoreFileAndLineT = Tuple[Optional[bytes], int, bytes]
"""The return type of dirstate._ignorefileandline(), which holds
``(file, lineno, originalline)``.
"""
FlagFuncFallbackT = Callable[[], "FlagFuncReturnT"]
"""The type for the dirstate.flagfunc() fallback function."""
FlagFuncReturnT = Callable[[bytes], bytes]
"""The return type of dirstate.flagfunc()."""
# TODO: verify and complete this- it came from a pytype *.pyi file
StatusReturnT = Tuple[Any, scmutil.status, Any]
"""The return type of dirstate.status()."""
# TODO: probably doesn't belong here.
TransactionT = txnmod.transaction
"""The type for a transaction used with dirstate.
This is meant to help callers avoid having to remember to delay the import
of the transaction module.
"""
# TODO: The value can also be mercurial.osutil.stat
WalkReturnT = Dict[bytes, Optional[os.stat_result]]
"""The return type of dirstate.walk().
The matched files are keyed in the dictionary, mapped to a stat-like object
if the file exists.
"""
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
interfaces: convert the dirstate zope interface to a Protocol class...
r52814 class idirstate(Protocol):
# TODO: convert these constructor args to fields?
# def __init__(
# self,
# opener,
# ui,
# root,
# validate,
# sparsematchfn,
# nodeconstants,
# use_dirstate_v2,
# use_tracked_hint=False,
# ):
# """Create a new dirstate object.
#
# opener is an open()-like callable that can be used to open the
# dirstate file; root is the root of the directory tracked by
# the dirstate.
# """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Augie Fackler
idirstate: group private methods and attrs that are in the interface...
r43199 # TODO: all these private methods and attributes should be made
# public or removed from the interface.
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816
# TODO: decorate with `@rootcache(b'.hgignore')` like dirstate class?
Matt Harbison
interfaces: change a couple of dirstate fields to `@property`...
r52821 @property
def _ignore(self) -> matchmod.basematcher:
"""Matcher for ignored files."""
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816
@property
def is_changing_any(self) -> bool:
dirstate: introduce a `is_changing_any` property...
r50918 """True if any changes in progress."""
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816
@property
def is_changing_parents(self) -> bool:
dirstate: rename `pendingparentchange` to `is_changing_parents`...
r50917 """True if parents changes in progress."""
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816
@property
def is_changing_files(self) -> bool:
dirstate: add a context for tracking files change...
r50921 """True if file tracking changes in progress."""
Augie Fackler
idirstate: group private methods and attrs that are in the interface...
r43199
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def _ignorefiles(self) -> List[bytes]:
Augie Fackler
idirstate: group private methods and attrs that are in the interface...
r43199 """Return a list of files containing patterns to ignore."""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def _ignorefileandline(self, f: bytes) -> IgnoreFileAndLineT:
Matt Harbison
cleanup: fix docstring formatting...
r44226 """Given a file `f`, return the ignore file and line that ignores it."""
Augie Fackler
idirstate: group private methods and attrs that are in the interface...
r43199
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816 # TODO: decorate with `@util.propertycache` like dirstate class?
# (can't because circular import)
Matt Harbison
interfaces: change a couple of dirstate fields to `@property`...
r52821 @property
def _checklink(self) -> bool:
"""Callable for checking symlinks.""" # TODO: this comment looks stale
Matt Harbison
interfaces: convert the zope `Attribute` attrs to regular fields...
r52816
# TODO: decorate with `@util.propertycache` like dirstate class?
# (can't because circular import)
Matt Harbison
interfaces: change a couple of dirstate fields to `@property`...
r52821 @property
def _checkexec(self) -> bool:
"""Callable for checking exec bits.""" # TODO: this comment looks stale
Augie Fackler
idirstate: group private methods and attrs that are in the interface...
r43199
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 @contextlib.contextmanager
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def changing_parents(self, repo) -> Iterator: # TODO: typehint this
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """Context manager for handling dirstate parents.
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
If an exception occurs in the scope of the context manager,
the incoherent dirstate won't be written when wlock is
released.
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
dirstate: add a context for tracking files change...
r50921 @contextlib.contextmanager
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def changing_files(self, repo) -> Iterator: # TODO: typehint this
dirstate: add a context for tracking files change...
r50921 """Context manager for handling dirstate files.
If an exception occurs in the scope of the context manager,
the incoherent dirstate won't be written when wlock is
released.
"""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def hasdir(self, d: bytes) -> bool:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def flagfunc(self, buildfallback: FlagFuncFallbackT) -> FlagFuncReturnT:
av6
dirstate: update docstrings in idirstate from the current dirstate
r50775 """build a callable that returns flags associated with a filename
The information is extracted from three possible layers:
1. the file system if it supports the information
2. the "fallback" information stored in the dirstate if any
3. a more expensive mechanism inferring the flags from the parents.
"""
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def getcwd(self) -> bytes:
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """Return the path from which a canonical path is calculated.
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
This path should be used to resolve file patterns or to convert
canonical paths back to file paths for display. It shouldn't be
used to get real file paths. Use vfs functions instead.
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def pathto(self, f: bytes, cwd: Optional[bytes] = None) -> bytes:
av6
dirstate: swap pathto() and get_entry() in idirstate...
r50776 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def get_entry(self, path: bytes) -> DirstateItemT:
Matt Harbison
idirstate: add missing get_entry() method...
r49967 """return a DirstateItem for the associated path"""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def __contains__(self, key: Any) -> bool:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 """Check if bytestring `key` is known to the dirstate."""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def __iter__(self) -> Iterator[bytes]:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 """Iterate the dirstate's contained filenames as bytestrings."""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def items(self) -> Iterator[Tuple[bytes, DirstateItemT]]:
dirstate-item: rename the class to DirstateItem...
r48328 """Iterate the dirstate's entries as (filename, DirstateItem.
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
As usual, filename is a bytestring.
"""
iteritems = items
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def parents(self) -> List[bytes]:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def p1(self) -> bytes:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def p2(self) -> bytes:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def branch(self) -> bytes:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 # TODO: typehint the return. It's a copies Map of some sort.
def setparents(self, p1: bytes, p2: Optional[bytes] = None):
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 """Set dirstate parents to p1 and p2.
av6
dirstate: update docstrings in idirstate from the current dirstate
r50775 When moving from two parents to one, "merged" entries a
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 adjusted to normal and previous copy records discarded and
returned by the call.
See localrepo.setparents()
"""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def setbranch(
self, branch: bytes, transaction: Optional[TransactionT]
) -> None:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def invalidate(self) -> None:
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """Causes the next access to reread the dirstate.
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
This is different from localrepo.invalidatedirstate() because it always
rereads the dirstate. Use localrepo.invalidatedirstate() if you want to
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 check whether the dirstate has changed before rereading it."""
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def copy(self, source: Optional[bytes], dest: bytes) -> None:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 """Mark dest as a copy of source. Unmark dest if source is None."""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def copied(self, file: bytes) -> Optional[bytes]:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def copies(self) -> Dict[bytes, bytes]:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def normalize(
self, path: bytes, isknown: bool = False, ignoremissing: bool = False
) -> bytes:
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 normalize the case of a pathname when on a casefolding filesystem
isknown specifies whether the filename came from walking the
disk, to avoid extra filesystem access.
If ignoremissing is True, missing path are returned
unchanged. Otherwise, we try harder to normalize possibly
existing path components.
The normalized case is determined based on the following precedence:
- version of name already stored in the dirstate
- version of name stored on disk
- version provided via command arguments
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def clear(self) -> None:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def rebuild(
self,
parent: bytes,
allfiles: Iterable[bytes], # TODO: more than iterable? (uses len())
changedfiles: Optional[Iterable[bytes]] = None,
) -> None:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def write(self, tr: Optional[TransactionT]) -> None:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 pass
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def addparentchangecallback(
self, category: bytes, callback: AddParentChangeCallbackT
) -> None:
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 """add a callback to be called when the wd parents are changed
Callback will be called with the following arguments:
dirstate, (oldp1, oldp2), (newp1, newp2)
Category is a unique identifier to allow overwriting an old callback
with a newer callback.
"""
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def walk(
self,
match: matchmod.basematcher,
subrepos: Any, # TODO: figure out what this is
unknown: bool,
ignored: bool,
full: bool = True,
) -> WalkReturnT:
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 Walk recursively through the directory tree, finding all files
matched by match.
If full is False, maybe skip some known-clean files.
Return a dict mapping filename to stat-like object (either
mercurial.osutil.stat instance or return value of os.stat()).
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 def status(
self,
match: matchmod.basematcher,
subrepos: bool,
ignored: bool,
clean: bool,
unknown: bool,
) -> StatusReturnT:
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """Determine the status of the working copy relative to the
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 dirstate and return a pair of (unsure, status), where status is of type
scmutil.status and:
unsure:
files that might have been modified since the dirstate was
written, but need to be read to be sure (size is the same
but mtime differs)
status.modified:
files that have definitely been modified since the dirstate
was written (different size or mode)
status.clean:
files that have definitely not been modified since the
dirstate was written
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 # TODO: could return a list, except git.dirstate is a generator
def matches(self, match: matchmod.basematcher) -> Iterable[bytes]:
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197 return files in the dirstate (in whatever state) filtered by match
Augie Fackler
formating: upgrade to black 20.8b1...
r46554 """
Augie Fackler
interfaces: introduce an interface for dirstate implementations...
r43197
Matt Harbison
typing: add type annotations to the dirstate classes...
r52822 # TODO: finish adding typehints here, and to subclasses
def verify(
self, m1, m2, p1: bytes, narrow_matcher: Optional[Any] = None
) -> Iterator[bytes]:
av6
dirstate: add missing methods and kwargs to idirstate interface
r50777 """
check the dirstate contents against the parent manifest and yield errors
"""