upstream/mercurial-mirror Commit - r44742:c443b9ba

py3: __repr__ needs to return str, not bytes...

Kyle Lippincott -

r44742:c443b9ba stable

parent child

mercurial/bundle2.py

0 +1 0

             # bundle2.py - generic container format to transmit arbitrary data.
             #
             # Copyright 2013 Facebook, Inc.
             #
             # This software may be used and distributed according to the terms of the
             # GNU General Public License version 2 or any later version.
             """Handling of the new bundle2 format
             The goal of bundle2 is to act as an atomically packet to transmit a set of
             payloads in an application agnostic way. It consist in a sequence of "parts"
             that will be handed to and processed by the application layer.
             General format architecture
             ===========================
             The format is architectured as follow
              - magic string
              - stream level parameters
              - payload parts (any number)
              - end of stream marker.
             the Binary format
             ============================
             All numbers are unsigned and big-endian.
             stream level parameters
             ------------------------
             Binary format is as follow
             :params size: int32
               The total number of Bytes used by the parameters
             :params value: arbitrary number of Bytes
               A blob of `params size` containing the serialized version of all stream level
               parameters.
               The blob contains a space separated list of parameters. Parameters with value
               are stored in the form `<name>=<value>`. Both name and value are urlquoted.
               Empty name are obviously forbidden.
               Name MUST start with a letter. If this first letter is lower case, the
               parameter is advisory and can be safely ignored. However when the first
               letter is capital, the parameter is mandatory and the bundling process MUST
               stop if he is not able to proceed it.
               Stream parameters use a simple textual format for two main reasons:
               - Stream level parameters should remain simple and we want to discourage any
                 crazy usage.
               - Textual data allow easy human inspection of a bundle2 header in case of
                 troubles.
               Any Applicative level options MUST go into a bundle2 part instead.
             Payload part
             ------------------------
             Binary format is as follow
             :header size: int32
               The total number of Bytes used by the part header. When the header is empty
               (size = 0) this is interpreted as the end of stream marker.
             :header:
                 The header defines how to interpret the part. It contains two piece of
                 data: the part type, and the part parameters.
                 The part type is used to route an application level handler, that can
                 interpret payload.
                 Part parameters are passed to the application level handler.  They are
                 meant to convey information that will help the application level object to
                 interpret the part payload.
                 The binary format of the header is has follow
                 :typesize: (one byte)
                 :parttype: alphanumerical part name (restricted to [a-zA-Z0-9_:-]*)
                 :partid: A 32bits integer (unique in the bundle) that can be used to refer
                          to this part.
                 :parameters:
                     Part's parameter may have arbitrary content, the binary structure is::
                         <mandatory-count><advisory-count><param-sizes><param-data>
                     :mandatory-count: 1 byte, number of mandatory parameters
                     :advisory-count:  1 byte, number of advisory parameters
                     :param-sizes:
                         N couple of bytes, where N is the total number of parameters. Each
                         couple contains (<size-of-key>, <size-of-value) for one parameter.
                     :param-data:
                         A blob of bytes from which each parameter key and value can be
                         retrieved using the list of size couples stored in the previous
                         field.
                         Mandatory parameters comes first, then the advisory ones.
                         Each parameter's key MUST be unique within the part.
             :payload:
                 payload is a series of `<chunksize><chunkdata>`.
                 `chunksize` is an int32, `chunkdata` are plain bytes (as much as
                 `chunksize` says)` The payload part is concluded by a zero size chunk.
                 The current implementation always produces either zero or one chunk.
                 This is an implementation limitation that will ultimately be lifted.
                 `chunksize` can be negative to trigger special case processing. No such
                 processing is in place yet.
             Bundle processing
             ============================
             Each part is processed in order using a "part handler". Handler are registered
             for a certain part type.
             The matching of a part to its handler is case insensitive. The case of the
             part type is used to know if a part is mandatory or advisory. If the Part type
             contains any uppercase char it is considered mandatory. When no handler is
             known for a Mandatory part, the process is aborted and an exception is raised.
             If the part is advisory and no handler is known, the part is ignored. When the
             process is aborted, the full bundle is still read from the stream to keep the
             channel usable. But none of the part read from an abort are processed. In the
             future, dropping the stream may become an option for channel we do not care to
             preserve.
             """
             from __future__ import absolute_import, division
             import collections
             import errno
             import os
             import re
             import string
             import struct
             import sys
             from .i18n import _
             from . import (
                 bookmarks,
                 changegroup,
                 encoding,
                 error,
                 node as nodemod,
                 obsolete,
                 phases,
                 pushkey,
                 pycompat,
                 streamclone,
                 tags,
                 url,
                 util,
             )
             from .utils import stringutil
             urlerr = util.urlerr
             urlreq = util.urlreq
             _pack = struct.pack
             _unpack = struct.unpack
             _fstreamparamsize = b'>i'
             _fpartheadersize = b'>i'
             _fparttypesize = b'>B'
             _fpartid = b'>I'
             _fpayloadsize = b'>i'
             _fpartparamcount = b'>BB'
             preferedchunksize = 32768
             _parttypeforbidden = re.compile(b'[^a-zA-Z0-9_:-]')
             def outdebug(ui, message):
                 """debug regarding output stream (bundling)"""
                 if ui.configbool(b'devel', b'bundle2.debug'):
                     ui.debug(b'bundle2-output: %s\n' % message)
             def indebug(ui, message):
                 """debug on input stream (unbundling)"""
                 if ui.configbool(b'devel', b'bundle2.debug'):
                     ui.debug(b'bundle2-input: %s\n' % message)
             def validateparttype(parttype):
                 """raise ValueError if a parttype contains invalid character"""
                 if _parttypeforbidden.search(parttype):
                     raise ValueError(parttype)
             def _makefpartparamsizes(nbparams):
                 """return a struct format to read part parameter sizes
                 The number parameters is variable so we need to build that format
                 dynamically.
                 """
                 return b'>' + (b'BB' * nbparams)
             parthandlermapping = {}
             def parthandler(parttype, params=()):
                 """decorator that register a function as a bundle2 part handler
                 eg::
                     @parthandler('myparttype', ('mandatory', 'param', 'handled'))
                     def myparttypehandler(...):
                         '''process a part of type "my part".'''
                         ...
                 """
                 validateparttype(parttype)
                 def _decorator(func):
                     lparttype = parttype.lower()  # enforce lower case matching.
                     assert lparttype not in parthandlermapping
                     parthandlermapping[lparttype] = func
                     func.params = frozenset(params)
                     return func
                 return _decorator
             class unbundlerecords(object):
                 """keep record of what happens during and unbundle
                 New records are added using `records.add('cat', obj)`. Where 'cat' is a
                 category of record and obj is an arbitrary object.
                 `records['cat']` will return all entries of this category 'cat'.
                 Iterating on the object itself will yield `('category', obj)` tuples
                 for all entries.
                 All iterations happens in chronological order.
                 """
                 def __init__(self):
                     self._categories = {}
                     self._sequences = []
                     self._replies = {}
                 def add(self, category, entry, inreplyto=None):
                     """add a new record of a given category.
                     The entry can then be retrieved in the list returned by
                     self['category']."""
                     self._categories.setdefault(category, []).append(entry)
                     self._sequences.append((category, entry))
                     if inreplyto is not None:
                         self.getreplies(inreplyto).add(category, entry)
                 def getreplies(self, partid):
                     """get the records that are replies to a specific part"""
                     return self._replies.setdefault(partid, unbundlerecords())
                 def __getitem__(self, cat):
                     return tuple(self._categories.get(cat, ()))
                 def __iter__(self):
                     return iter(self._sequences)
                 def __len__(self):
                     return len(self._sequences)
                 def __nonzero__(self):
                     return bool(self._sequences)
                 __bool__ = __nonzero__
             class bundleoperation(object):
                 """an object that represents a single bundling process
                 Its purpose is to carry unbundle-related objects and states.
                 A new object should be created at the beginning of each bundle processing.
                 The object is to be returned by the processing function.
                 The object has very little content now it will ultimately contain:
                 * an access to the repo the bundle is applied to,
                 * a ui object,
                 * a way to retrieve a transaction to add changes to the repo,
                 * a way to record the result of processing each part,
                 * a way to construct a bundle response when applicable.
                 """
                 def __init__(self, repo, transactiongetter, captureoutput=True, source=b''):
                     self.repo = repo
                     self.ui = repo.ui
                     self.records = unbundlerecords()
                     self.reply = None
                     self.captureoutput = captureoutput
                     self.hookargs = {}
                     self._gettransaction = transactiongetter
                     # carries value that can modify part behavior
                     self.modes = {}
                     self.source = source
                 def gettransaction(self):
                     transaction = self._gettransaction()
                     if self.hookargs:
                         # the ones added to the transaction supercede those added
                         # to the operation.
                         self.hookargs.update(transaction.hookargs)
                         transaction.hookargs = self.hookargs
                     # mark the hookargs as flushed.  further attempts to add to
                     # hookargs will result in an abort.
                     self.hookargs = None
                     return transaction
                 def addhookargs(self, hookargs):
                     if self.hookargs is None:
                         raise error.ProgrammingError(
                             b'attempted to add hookargs to '
                             b'operation after transaction started'
                         )
                     self.hookargs.update(hookargs)
             class TransactionUnavailable(RuntimeError):
                 pass
             def _notransaction():
                 """default method to get a transaction while processing a bundle
                 Raise an exception to highlight the fact that no transaction was expected
                 to be created"""
                 raise TransactionUnavailable()
             def applybundle(repo, unbundler, tr, source, url=None, **kwargs):
                 # transform me into unbundler.apply() as soon as the freeze is lifted
                 if isinstance(unbundler, unbundle20):
                     tr.hookargs[b'bundle2'] = b'1'
                     if source is not None and b'source' not in tr.hookargs:
                         tr.hookargs[b'source'] = source
                     if url is not None and b'url' not in tr.hookargs:
                         tr.hookargs[b'url'] = url
                     return processbundle(repo, unbundler, lambda: tr, source=source)
                 else:
                     # the transactiongetter won't be used, but we might as well set it
                     op = bundleoperation(repo, lambda: tr, source=source)
                     _processchangegroup(op, unbundler, tr, source, url, **kwargs)
                     return op
             class partiterator(object):
                 def __init__(self, repo, op, unbundler):
                     self.repo = repo
                     self.op = op
                     self.unbundler = unbundler
                     self.iterator = None
                     self.count = 0
                     self.current = None
                 def __enter__(self):
                     def func():
                         itr = enumerate(self.unbundler.iterparts(), 1)
                         for count, p in itr:
                             self.count = count
                             self.current = p
                             yield p
                             p.consume()
                             self.current = None
                     self.iterator = func()
                     return self.iterator
                 def __exit__(self, type, exc, tb):
                     if not self.iterator:
                         return
                     # Only gracefully abort in a normal exception situation. User aborts
                     # like Ctrl+C throw a KeyboardInterrupt which is not a base Exception,
                     # and should not gracefully cleanup.
                     if isinstance(exc, Exception):
                         # Any exceptions seeking to the end of the bundle at this point are
                         # almost certainly related to the underlying stream being bad.
                         # And, chances are that the exception we're handling is related to
                         # getting in that bad state. So, we swallow the seeking error and
                         # re-raise the original error.
                         seekerror = False
                         try:
                             if self.current:
                                 # consume the part content to not corrupt the stream.
                                 self.current.consume()
                             for part in self.iterator:
                                 # consume the bundle content
                                 part.consume()
                         except Exception:
                             seekerror = True
                         # Small hack to let caller code distinguish exceptions from bundle2
                         # processing from processing the old format. This is mostly needed
                         # to handle different return codes to unbundle according to the type
                         # of bundle. We should probably clean up or drop this return code
                         # craziness in a future version.
                         exc.duringunbundle2 = True
                         salvaged = []
                         replycaps = None
                         if self.op.reply is not None:
                             salvaged = self.op.reply.salvageoutput()
                             replycaps = self.op.reply.capabilities
                         exc._replycaps = replycaps
                         exc._bundle2salvagedoutput = salvaged
                         # Re-raising from a variable loses the original stack. So only use
                         # that form if we need to.
                         if seekerror:
                             raise exc
                     self.repo.ui.debug(
                         b'bundle2-input-bundle: %i parts total\n' % self.count
                     )
             def processbundle(repo, unbundler, transactiongetter=None, op=None, source=b''):
                 """This function process a bundle, apply effect to/from a repo
                 It iterates over each part then searches for and uses the proper handling
                 code to process the part. Parts are processed in order.
                 Unknown Mandatory part will abort the process.
                 It is temporarily possible to provide a prebuilt bundleoperation to the
                 function. This is used to ensure output is properly propagated in case of
                 an error during the unbundling. This output capturing part will likely be
                 reworked and this ability will probably go away in the process.
                 """
                 if op is None:
                     if transactiongetter is None:
                         transactiongetter = _notransaction
                     op = bundleoperation(repo, transactiongetter, source=source)
                 # todo:
                 # - replace this is a init function soon.
                 # - exception catching
                 unbundler.params
                 if repo.ui.debugflag:
                     msg = [b'bundle2-input-bundle:']
                     if unbundler.params:
                         msg.append(b' %i params' % len(unbundler.params))
                     if op._gettransaction is None or op._gettransaction is _notransaction:
                         msg.append(b' no-transaction')
                     else:
                         msg.append(b' with-transaction')
                     msg.append(b'\n')
                     repo.ui.debug(b''.join(msg))
                 processparts(repo, op, unbundler)
                 return op
             def processparts(repo, op, unbundler):
                 with partiterator(repo, op, unbundler) as parts:
                     for part in parts:
                         _processpart(op, part)
             def _processchangegroup(op, cg, tr, source, url, **kwargs):
                 ret = cg.apply(op.repo, tr, source, url, **kwargs)
                 op.records.add(b'changegroup', {b'return': ret,})
                 return ret
             def _gethandler(op, part):
                 status = b'unknown'  # used by debug output
                 try:
                     handler = parthandlermapping.get(part.type)
                     if handler is None:
                         status = b'unsupported-type'
                         raise error.BundleUnknownFeatureError(parttype=part.type)
                     indebug(op.ui, b'found a handler for part %s' % part.type)
                     unknownparams = part.mandatorykeys - handler.params
                     if unknownparams:
                         unknownparams = list(unknownparams)
                         unknownparams.sort()
                         status = b'unsupported-params (%s)' % b', '.join(unknownparams)
                         raise error.BundleUnknownFeatureError(
                             parttype=part.type, params=unknownparams
                         )
                     status = b'supported'
                 except error.BundleUnknownFeatureError as exc:
                     if part.mandatory:  # mandatory parts
                         raise
                     indebug(op.ui, b'ignoring unsupported advisory part %s' % exc)
                     return  # skip to part processing
                 finally:
                     if op.ui.debugflag:
                         msg = [b'bundle2-input-part: "%s"' % part.type]
                         if not part.mandatory:
                             msg.append(b' (advisory)')
                         nbmp = len(part.mandatorykeys)
                         nbap = len(part.params) - nbmp
                         if nbmp or nbap:
                             msg.append(b' (params:')
                             if nbmp:
                                 msg.append(b' %i mandatory' % nbmp)
                             if nbap:
                                 msg.append(b' %i advisory' % nbmp)
                             msg.append(b')')
                         msg.append(b' %s\n' % status)
                         op.ui.debug(b''.join(msg))
                 return handler
             def _processpart(op, part):
                 """process a single part from a bundle
                 The part is guaranteed to have been fully consumed when the function exits
                 (even if an exception is raised)."""
                 handler = _gethandler(op, part)
                 if handler is None:
                     return
                 # handler is called outside the above try block so that we don't
                 # risk catching KeyErrors from anything other than the
                 # parthandlermapping lookup (any KeyError raised by handler()
                 # itself represents a defect of a different variety).
                 output = None
                 if op.captureoutput and op.reply is not None:
                     op.ui.pushbuffer(error=True, subproc=True)
                     output = b''
                 try:
                     handler(op, part)
                 finally:
                     if output is not None:
                         output = op.ui.popbuffer()
                     if output:
                         outpart = op.reply.newpart(b'output', data=output, mandatory=False)
                         outpart.addparam(
                             b'in-reply-to', pycompat.bytestr(part.id), mandatory=False
                         )
             def decodecaps(blob):
                 """decode a bundle2 caps bytes blob into a dictionary
                 The blob is a list of capabilities (one per line)
                 Capabilities may have values using a line of the form::
                     capability=value1,value2,value3
                 The values are always a list."""
                 caps = {}
                 for line in blob.splitlines():
                     if not line:
                         continue
                     if b'=' not in line:
                         key, vals = line, ()
                     else:
                         key, vals = line.split(b'=', 1)
                         vals = vals.split(b',')
                     key = urlreq.unquote(key)
                     vals = [urlreq.unquote(v) for v in vals]
                     caps[key] = vals
                 return caps
             def encodecaps(caps):
                 """encode a bundle2 caps dictionary into a bytes blob"""
                 chunks = []
                 for ca in sorted(caps):
                     vals = caps[ca]
                     ca = urlreq.quote(ca)
                     vals = [urlreq.quote(v) for v in vals]
                     if vals:
                         ca = b"%s=%s" % (ca, b','.join(vals))
                     chunks.append(ca)
                 return b'\n'.join(chunks)
             bundletypes = {
                 b"": (b"", b'UN'),  # only when using unbundle on ssh and old http servers
                 # since the unification ssh accepts a header but there
                 # is no capability signaling it.
                 b"HG20": (),  # special-cased below
                 b"HG10UN": (b"HG10UN", b'UN'),
                 b"HG10BZ": (b"HG10", b'BZ'),
                 b"HG10GZ": (b"HG10GZ", b'GZ'),
             }
             # hgweb uses this list to communicate its preferred type
             bundlepriority = [b'HG10GZ', b'HG10BZ', b'HG10UN']
             class bundle20(object):
                 """represent an outgoing bundle2 container
                 Use the `addparam` method to add stream level parameter. and `newpart` to
                 populate it. Then call `getchunks` to retrieve all the binary chunks of
                 data that compose the bundle2 container."""
                 _magicstring = b'HG20'
                 def __init__(self, ui, capabilities=()):
                     self.ui = ui
                     self._params = []
                     self._parts = []
                     self.capabilities = dict(capabilities)
                     self._compengine = util.compengines.forbundletype(b'UN')
                     self._compopts = None
                     # If compression is being handled by a consumer of the raw
                     # data (e.g. the wire protocol), unsetting this flag tells
                     # consumers that the bundle is best left uncompressed.
                     self.prefercompressed = True
                 def setcompression(self, alg, compopts=None):
                     """setup core part compression to <alg>"""
                     if alg in (None, b'UN'):
                         return
                     assert not any(n.lower() == b'compression' for n, v in self._params)
                     self.addparam(b'Compression', alg)
                     self._compengine = util.compengines.forbundletype(alg)
                     self._compopts = compopts
                 @property
                 def nbparts(self):
                     """total number of parts added to the bundler"""
                     return len(self._parts)
                 # methods used to defines the bundle2 content
                 def addparam(self, name, value=None):
                     """add a stream level parameter"""
                     if not name:
                         raise error.ProgrammingError(b'empty parameter name')
                     if name[0:1] not in pycompat.bytestr(
                         string.ascii_letters  # pytype: disable=wrong-arg-types
                     ):
                         raise error.ProgrammingError(
                             b'non letter first character: %s' % name
                         )
                     self._params.append((name, value))
                 def addpart(self, part):
                     """add a new part to the bundle2 container
                     Parts contains the actual applicative payload."""
                     assert part.id is None
                     part.id = len(self._parts)  # very cheap counter
                     self._parts.append(part)
                 def newpart(self, typeid, *args, **kwargs):
                     """create a new part and add it to the containers
                     As the part is directly added to the containers. For now, this means
                     that any failure to properly initialize the part after calling
                     ``newpart`` should result in a failure of the whole bundling process.
                     You can still fall back to manually create and add if you need better
                     control."""
                     part = bundlepart(typeid, *args, **kwargs)
                     self.addpart(part)
                     return part
                 # methods used to generate the bundle2 stream
                 def getchunks(self):
                     if self.ui.debugflag:
                         msg = [b'bundle2-output-bundle: "%s",' % self._magicstring]
                         if self._params:
                             msg.append(b' (%i params)' % len(self._params))
                         msg.append(b' %i parts total\n' % len(self._parts))
                         self.ui.debug(b''.join(msg))
                     outdebug(self.ui, b'start emission of %s stream' % self._magicstring)
                     yield self._magicstring
                     param = self._paramchunk()
                     outdebug(self.ui, b'bundle parameter: %s' % param)
                     yield _pack(_fstreamparamsize, len(param))
                     if param:
                         yield param
                     for chunk in self._compengine.compressstream(
                         self._getcorechunk(), self._compopts
                     ):
                         yield chunk
                 def _paramchunk(self):
                     """return a encoded version of all stream parameters"""
                     blocks = []
                     for par, value in self._params:
                         par = urlreq.quote(par)
                         if value is not None:
                             value = urlreq.quote(value)
                             par = b'%s=%s' % (par, value)
                         blocks.append(par)
                     return b' '.join(blocks)
                 def _getcorechunk(self):
                     """yield chunk for the core part of the bundle
                     (all but headers and parameters)"""
                     outdebug(self.ui, b'start of parts')
                     for part in self._parts:
                         outdebug(self.ui, b'bundle part: "%s"' % part.type)
                         for chunk in part.getchunks(ui=self.ui):
                             yield chunk
                     outdebug(self.ui, b'end of bundle')
                     yield _pack(_fpartheadersize, 0)
                 def salvageoutput(self):
                     """return a list with a copy of all output parts in the bundle
                     This is meant to be used during error handling to make sure we preserve
                     server output"""
                     salvaged = []
                     for part in self._parts:
                         if part.type.startswith(b'output'):
                             salvaged.append(part.copy())
                     return salvaged
             class unpackermixin(object):
                 """A mixin to extract bytes and struct data from a stream"""
                 def __init__(self, fp):
                     self._fp = fp
                 def _unpack(self, format):
                     """unpack this struct format from the stream
                     This method is meant for internal usage by the bundle2 protocol only.
                     They directly manipulate the low level stream including bundle2 level
                     instruction.
                     Do not use it to implement higher-level logic or methods."""
                     data = self._readexact(struct.calcsize(format))
                     return _unpack(format, data)
                 def _readexact(self, size):
                     """read exactly <size> bytes from the stream
                     This method is meant for internal usage by the bundle2 protocol only.
                     They directly manipulate the low level stream including bundle2 level
                     instruction.
                     Do not use it to implement higher-level logic or methods."""
                     return changegroup.readexactly(self._fp, size)
             def getunbundler(ui, fp, magicstring=None):
                 """return a valid unbundler object for a given magicstring"""
                 if magicstring is None:
                     magicstring = changegroup.readexactly(fp, 4)
                 magic, version = magicstring[0:2], magicstring[2:4]
                 if magic != b'HG':
                     ui.debug(
                         b"error: invalid magic: %r (version %r), should be 'HG'\n"
                         % (magic, version)
                     )
                     raise error.Abort(_(b'not a Mercurial bundle'))
                 unbundlerclass = formatmap.get(version)
                 if unbundlerclass is None:
                     raise error.Abort(_(b'unknown bundle version %s') % version)
                 unbundler = unbundlerclass(ui, fp)
                 indebug(ui, b'start processing of %s stream' % magicstring)
                 return unbundler
             class unbundle20(unpackermixin):
                 """interpret a bundle2 stream
                 This class is fed with a binary stream and yields parts through its
                 `iterparts` methods."""
                 _magicstring = b'HG20'
                 def __init__(self, ui, fp):
                     """If header is specified, we do not read it out of the stream."""
                     self.ui = ui
                     self._compengine = util.compengines.forbundletype(b'UN')
                     self._compressed = None
                     super(unbundle20, self).__init__(fp)
                 @util.propertycache
                 def params(self):
                     """dictionary of stream level parameters"""
                     indebug(self.ui, b'reading bundle2 stream parameters')
                     params = {}
                     paramssize = self._unpack(_fstreamparamsize)[0]
                     if paramssize < 0:
                         raise error.BundleValueError(
                             b'negative bundle param size: %i' % paramssize
                         )
                     if paramssize:
                         params = self._readexact(paramssize)
                         params = self._processallparams(params)
                     return params
                 def _processallparams(self, paramsblock):
                     """"""
                     params = util.sortdict()
                     for p in paramsblock.split(b' '):
                         p = p.split(b'=', 1)
                         p = [urlreq.unquote(i) for i in p]
                         if len(p) < 2:
                             p.append(None)
                         self._processparam(*p)
                         params[p[0]] = p[1]
                     return params
                 def _processparam(self, name, value):
                     """process a parameter, applying its effect if needed
                     Parameter starting with a lower case letter are advisory and will be
                     ignored when unknown.  Those starting with an upper case letter are
                     mandatory and will this function will raise a KeyError when unknown.
                     Note: no option are currently supported. Any input will be either
                           ignored or failing.
                     """
                     if not name:
                         raise ValueError('empty parameter name')
                     if name[0:1] not in pycompat.bytestr(
                         string.ascii_letters  # pytype: disable=wrong-arg-types
                     ):
                         raise ValueError('non letter first character: %s' % name)
                     try:
                         handler = b2streamparamsmap[name.lower()]
                     except KeyError:
                         if name[0:1].islower():
                             indebug(self.ui, b"ignoring unknown parameter %s" % name)
                         else:
                             raise error.BundleUnknownFeatureError(params=(name,))
                     else:
                         handler(self, name, value)
                 def _forwardchunks(self):
                     """utility to transfer a bundle2 as binary
                     This is made necessary by the fact the 'getbundle' command over 'ssh'
                     have no way to know then the reply end, relying on the bundle to be
                     interpreted to know its end. This is terrible and we are sorry, but we
                     needed to move forward to get general delta enabled.
                     """
                     yield self._magicstring
                     assert 'params' not in vars(self)
                     paramssize = self._unpack(_fstreamparamsize)[0]
                     if paramssize < 0:
                         raise error.BundleValueError(
                             b'negative bundle param size: %i' % paramssize
                         )
                     if paramssize:
                         params = self._readexact(paramssize)
                         self._processallparams(params)
                         # The payload itself is decompressed below, so drop
                         # the compression parameter passed down to compensate.
                         outparams = []
                         for p in params.split(b' '):
                             k, v = p.split(b'=', 1)
                             if k.lower() != b'compression':
                                 outparams.append(p)
                         outparams = b' '.join(outparams)
                         yield _pack(_fstreamparamsize, len(outparams))
                         yield outparams
                     else:
                         yield _pack(_fstreamparamsize, paramssize)
                     # From there, payload might need to be decompressed
                     self._fp = self._compengine.decompressorreader(self._fp)
                     emptycount = 0
                     while emptycount < 2:
                         # so we can brainlessly loop
                         assert _fpartheadersize == _fpayloadsize
                         size = self._unpack(_fpartheadersize)[0]
                         yield _pack(_fpartheadersize, size)
                         if size:
                             emptycount = 0
                         else:
                             emptycount += 1
                             continue
                         if size == flaginterrupt:
                             continue
                         elif size < 0:
                             raise error.BundleValueError(b'negative chunk size: %i')
                         yield self._readexact(size)
                 def iterparts(self, seekable=False):
                     """yield all parts contained in the stream"""
                     cls = seekableunbundlepart if seekable else unbundlepart
                     # make sure param have been loaded
                     self.params
                     # From there, payload need to be decompressed
                     self._fp = self._compengine.decompressorreader(self._fp)
                     indebug(self.ui, b'start extraction of bundle2 parts')
                     headerblock = self._readpartheader()
                     while headerblock is not None:
                         part = cls(self.ui, headerblock, self._fp)
                         yield part
                         # Ensure part is fully consumed so we can start reading the next
                         # part.
                         part.consume()
                         headerblock = self._readpartheader()
                     indebug(self.ui, b'end of bundle2 stream')
                 def _readpartheader(self):
                     """reads a part header size and return the bytes blob
                     returns None if empty"""
                     headersize = self._unpack(_fpartheadersize)[0]
                     if headersize < 0:
                         raise error.BundleValueError(
                             b'negative part header size: %i' % headersize
                         )
                     indebug(self.ui, b'part header size: %i' % headersize)
                     if headersize:
                         return self._readexact(headersize)
                     return None
                 def compressed(self):
                     self.params  # load params
                     return self._compressed
                 def close(self):
                     """close underlying file"""
                     if util.safehasattr(self._fp, 'close'):
                         return self._fp.close()
             formatmap = {b'20': unbundle20}
             b2streamparamsmap = {}
             def b2streamparamhandler(name):
                 """register a handler for a stream level parameter"""
                 def decorator(func):
                     assert name not in formatmap
                     b2streamparamsmap[name] = func
                     return func
                 return decorator
             @b2streamparamhandler(b'compression')
             def processcompression(unbundler, param, value):
                 """read compression parameter and install payload decompression"""
                 if value not in util.compengines.supportedbundletypes:
                     raise error.BundleUnknownFeatureError(params=(param,), values=(value,))
                 unbundler._compengine = util.compengines.forbundletype(value)
                 if value is not None:
                     unbundler._compressed = True
             class bundlepart(object):
                 """A bundle2 part contains application level payload
                 The part `type` is used to route the part to the application level
                 handler.
                 The part payload is contained in ``part.data``. It could be raw bytes or a
                 generator of byte chunks.
                 You can add parameters to the part using the ``addparam`` method.
                 Parameters can be either mandatory (default) or advisory. Remote side
                 should be able to safely ignore the advisory ones.
                 Both data and parameters cannot be modified after the generation has begun.
                 """
                 def __init__(
                     self,
                     parttype,
                     mandatoryparams=(),
                     advisoryparams=(),
                     data=b'',
                     mandatory=True,
                 ):
                     validateparttype(parttype)
                     self.id = None
                     self.type = parttype
                     self._data = data
                     self._mandatoryparams = list(mandatoryparams)
                     self._advisoryparams = list(advisoryparams)
                     # checking for duplicated entries
                     self._seenparams = set()
                     for pname, __ in self._mandatoryparams + self._advisoryparams:
                         if pname in self._seenparams:
                             raise error.ProgrammingError(b'duplicated params: %s' % pname)
                         self._seenparams.add(pname)
                     # status of the part's generation:
                     # - None: not started,
                     # - False: currently generated,
                     # - True: generation done.
                     self._generated = None
                     self.mandatory = mandatory
+                @encoding.strmethod
                 def __repr__(self):
                     cls = b"%s.%s" % (self.__class__.__module__, self.__class__.__name__)
                     return b'<%s object at %x; id: %s; type: %s; mandatory: %s>' % (
                         cls,
                         id(self),
                         self.id,
                         self.type,
                         self.mandatory,
                     )
                 def copy(self):
                     """return a copy of the part
                     The new part have the very same content but no partid assigned yet.
                     Parts with generated data cannot be copied."""
                     assert not util.safehasattr(self.data, 'next')
                     return self.__class__(
                         self.type,
                         self._mandatoryparams,
                         self._advisoryparams,
                         self._data,
                         self.mandatory,
                     )
                 # methods used to defines the part content
                 @property
                 def data(self):
                     return self._data
                 @data.setter
                 def data(self, data):
                     if self._generated is not None:
                         raise error.ReadOnlyPartError(b'part is being generated')
                     self._data = data
                 @property
                 def mandatoryparams(self):
                     # make it an immutable tuple to force people through ``addparam``
                     return tuple(self._mandatoryparams)
                 @property
                 def advisoryparams(self):
                     # make it an immutable tuple to force people through ``addparam``
                     return tuple(self._advisoryparams)
                 def addparam(self, name, value=b'', mandatory=True):
                     """add a parameter to the part
                     If 'mandatory' is set to True, the remote handler must claim support
                     for this parameter or the unbundling will be aborted.
                     The 'name' and 'value' cannot exceed 255 bytes each.
                     """
                     if self._generated is not None:
                         raise error.ReadOnlyPartError(b'part is being generated')
                     if name in self._seenparams:
                         raise ValueError(b'duplicated params: %s' % name)
                     self._seenparams.add(name)
                     params = self._advisoryparams
                     if mandatory:
                         params = self._mandatoryparams
                     params.append((name, value))
                 # methods used to generates the bundle2 stream
                 def getchunks(self, ui):
                     if self._generated is not None:
                         raise error.ProgrammingError(b'part can only be consumed once')
                     self._generated = False
                     if ui.debugflag:
                         msg = [b'bundle2-output-part: "%s"' % self.type]
                         if not self.mandatory:
                             msg.append(b' (advisory)')
                         nbmp = len(self.mandatoryparams)
                         nbap = len(self.advisoryparams)
                         if nbmp or nbap:
                             msg.append(b' (params:')
                             if nbmp:
                                 msg.append(b' %i mandatory' % nbmp)
                             if nbap:
                                 msg.append(b' %i advisory' % nbmp)
                             msg.append(b')')
                         if not self.data:
                             msg.append(b' empty payload')
                         elif util.safehasattr(self.data, 'next') or util.safehasattr(
                             self.data, b'__next__'
                         ):
                             msg.append(b' streamed payload')
                         else:
                             msg.append(b' %i bytes payload' % len(self.data))
                         msg.append(b'\n')
                         ui.debug(b''.join(msg))
                     #### header
                     if self.mandatory:
                         parttype = self.type.upper()
                     else:
                         parttype = self.type.lower()
                     outdebug(ui, b'part %s: "%s"' % (pycompat.bytestr(self.id), parttype))
                     ## parttype
                     header = [
                         _pack(_fparttypesize, len(parttype)),
                         parttype,
                         _pack(_fpartid, self.id),
                     ]
                     ## parameters
                     # count
                     manpar = self.mandatoryparams
                     advpar = self.advisoryparams
                     header.append(_pack(_fpartparamcount, len(manpar), len(advpar)))
                     # size
                     parsizes = []
                     for key, value in manpar:
                         parsizes.append(len(key))
                         parsizes.append(len(value))
                     for key, value in advpar:
                         parsizes.append(len(key))
                         parsizes.append(len(value))
                     paramsizes = _pack(_makefpartparamsizes(len(parsizes) // 2), *parsizes)
                     header.append(paramsizes)
                     # key, value
                     for key, value in manpar:
                         header.append(key)
                         header.append(value)
                     for key, value in advpar:
                         header.append(key)
                         header.append(value)
                     ## finalize header
                     try:
                         headerchunk = b''.join(header)
                     except TypeError:
                         raise TypeError(
                             'Found a non-bytes trying to '
                             'build bundle part header: %r' % header
                         )
                     outdebug(ui, b'header chunk size: %i' % len(headerchunk))
                     yield _pack(_fpartheadersize, len(headerchunk))
                     yield headerchunk
                     ## payload
                     try:
                         for chunk in self._payloadchunks():
                             outdebug(ui, b'payload chunk size: %i' % len(chunk))
                             yield _pack(_fpayloadsize, len(chunk))
                             yield chunk
                     except GeneratorExit:
                         # GeneratorExit means that nobody is listening for our
                         # results anyway, so just bail quickly rather than trying
                         # to produce an error part.
                         ui.debug(b'bundle2-generatorexit\n')
                         raise
                     except BaseException as exc:
                         bexc = stringutil.forcebytestr(exc)
                         # backup exception data for later
                         ui.debug(
                             b'bundle2-input-stream-interrupt: encoding exception %s' % bexc
                         )
                         tb = sys.exc_info()[2]
                         msg = b'unexpected error: %s' % bexc
                         interpart = bundlepart(
                             b'error:abort', [(b'message', msg)], mandatory=False
                         )
                         interpart.id = 0
                         yield _pack(_fpayloadsize, -1)
                         for chunk in interpart.getchunks(ui=ui):
                             yield chunk
                         outdebug(ui, b'closing payload chunk')
                         # abort current part payload
                         yield _pack(_fpayloadsize, 0)
                         pycompat.raisewithtb(exc, tb)
                     # end of payload
                     outdebug(ui, b'closing payload chunk')
                     yield _pack(_fpayloadsize, 0)
                     self._generated = True
                 def _payloadchunks(self):
                     """yield chunks of a the part payload
                     Exists to handle the different methods to provide data to a part."""
                     # we only support fixed size data now.
                     # This will be improved in the future.
                     if util.safehasattr(self.data, 'next') or util.safehasattr(
                         self.data, b'__next__'
                     ):
                         buff = util.chunkbuffer(self.data)
                         chunk = buff.read(preferedchunksize)
                         while chunk:
                             yield chunk
                             chunk = buff.read(preferedchunksize)
                     elif len(self.data):
                         yield self.data
             flaginterrupt = -1
             class interrupthandler(unpackermixin):
                 """read one part and process it with restricted capability
                 This allows to transmit exception raised on the producer size during part
                 iteration while the consumer is reading a part.
                 Part processed in this manner only have access to a ui object,"""
                 def __init__(self, ui, fp):
                     super(interrupthandler, self).__init__(fp)
                     self.ui = ui
                 def _readpartheader(self):
                     """reads a part header size and return the bytes blob
                     returns None if empty"""
                     headersize = self._unpack(_fpartheadersize)[0]
                     if headersize < 0:
                         raise error.BundleValueError(
                             b'negative part header size: %i' % headersize
                         )
                     indebug(self.ui, b'part header size: %i\n' % headersize)
                     if headersize:
                         return self._readexact(headersize)
                     return None
                 def __call__(self):
                     self.ui.debug(
                         b'bundle2-input-stream-interrupt: opening out of band context\n'
                     )
                     indebug(self.ui, b'bundle2 stream interruption, looking for a part.')
                     headerblock = self._readpartheader()
                     if headerblock is None:
                         indebug(self.ui, b'no part found during interruption.')
                         return
                     part = unbundlepart(self.ui, headerblock, self._fp)
                     op = interruptoperation(self.ui)
                     hardabort = False
                     try:
                         _processpart(op, part)
                     except (SystemExit, KeyboardInterrupt):
                         hardabort = True
                         raise
                     finally:
                         if not hardabort:
                             part.consume()
                     self.ui.debug(
                         b'bundle2-input-stream-interrupt: closing out of band context\n'
                     )
             class interruptoperation(object):
                 """A limited operation to be use by part handler during interruption
                 It only have access to an ui object.
                 """
                 def __init__(self, ui):
                     self.ui = ui
                     self.reply = None
                     self.captureoutput = False
                 @property
                 def repo(self):
                     raise error.ProgrammingError(b'no repo access from stream interruption')
                 def gettransaction(self):
                     raise TransactionUnavailable(b'no repo access from stream interruption')
             def decodepayloadchunks(ui, fh):
                 """Reads bundle2 part payload data into chunks.
                 Part payload data consists of framed chunks. This function takes
                 a file handle and emits those chunks.
                 """
                 dolog = ui.configbool(b'devel', b'bundle2.debug')
                 debug = ui.debug
                 headerstruct = struct.Struct(_fpayloadsize)
                 headersize = headerstruct.size
                 unpack = headerstruct.unpack
                 readexactly = changegroup.readexactly
                 read = fh.read
                 chunksize = unpack(readexactly(fh, headersize))[0]
                 indebug(ui, b'payload chunk size: %i' % chunksize)
                 # changegroup.readexactly() is inlined below for performance.
                 while chunksize:
                     if chunksize >= 0:
                         s = read(chunksize)
                         if len(s) < chunksize:
                             raise error.Abort(
                                 _(
                                     b'stream ended unexpectedly '
                                     b' (got %d bytes, expected %d)'
                                 )
                                 % (len(s), chunksize)
                             )
                         yield s
                     elif chunksize == flaginterrupt:
                         # Interrupt "signal" detected. The regular stream is interrupted
                         # and a bundle2 part follows. Consume it.
                         interrupthandler(ui, fh)()
                     else:
                         raise error.BundleValueError(
                             b'negative payload chunk size: %s' % chunksize
                         )
                     s = read(headersize)
                     if len(s) < headersize:
                         raise error.Abort(
                             _(b'stream ended unexpectedly  (got %d bytes, expected %d)')
                             % (len(s), chunksize)
                         )
                     chunksize = unpack(s)[0]
                     # indebug() inlined for performance.
                     if dolog:
                         debug(b'bundle2-input: payload chunk size: %i\n' % chunksize)
             class unbundlepart(unpackermixin):
                 """a bundle part read from a bundle"""
                 def __init__(self, ui, header, fp):
                     super(unbundlepart, self).__init__(fp)
                     self._seekable = util.safehasattr(fp, 'seek') and util.safehasattr(
                         fp, b'tell'
                     )
                     self.ui = ui
                     # unbundle state attr
                     self._headerdata = header
                     self._headeroffset = 0
                     self._initialized = False
                     self.consumed = False
                     # part data
                     self.id = None
                     self.type = None
                     self.mandatoryparams = None
                     self.advisoryparams = None
                     self.params = None
                     self.mandatorykeys = ()
                     self._readheader()
                     self._mandatory = None
                     self._pos = 0
                 def _fromheader(self, size):
                     """return the next <size> byte from the header"""
                     offset = self._headeroffset
                     data = self._headerdata[offset : (offset + size)]
                     self._headeroffset = offset + size
                     return data
                 def _unpackheader(self, format):
                     """read given format from header
                     This automatically compute the size of the format to read."""
                     data = self._fromheader(struct.calcsize(format))
                     return _unpack(format, data)
                 def _initparams(self, mandatoryparams, advisoryparams):
                     """internal function to setup all logic related parameters"""
                     # make it read only to prevent people touching it by mistake.
                     self.mandatoryparams = tuple(mandatoryparams)
                     self.advisoryparams = tuple(advisoryparams)
                     # user friendly UI
                     self.params = util.sortdict(self.mandatoryparams)
                     self.params.update(self.advisoryparams)
                     self.mandatorykeys = frozenset(p[0] for p in mandatoryparams)
                 def _readheader(self):
                     """read the header and setup the object"""
                     typesize = self._unpackheader(_fparttypesize)[0]
                     self.type = self._fromheader(typesize)
                     indebug(self.ui, b'part type: "%s"' % self.type)
                     self.id = self._unpackheader(_fpartid)[0]
                     indebug(self.ui, b'part id: "%s"' % pycompat.bytestr(self.id))
                     # extract mandatory bit from type
                     self.mandatory = self.type != self.type.lower()
                     self.type = self.type.lower()
                     ## reading parameters
                     # param count
                     mancount, advcount = self._unpackheader(_fpartparamcount)
                     indebug(self.ui, b'part parameters: %i' % (mancount + advcount))
                     # param size
                     fparamsizes = _makefpartparamsizes(mancount + advcount)
                     paramsizes = self._unpackheader(fparamsizes)
                     # make it a list of couple again
                     paramsizes = list(zip(paramsizes[::2], paramsizes[1::2]))
                     # split mandatory from advisory
                     mansizes = paramsizes[:mancount]
                     advsizes = paramsizes[mancount:]
                     # retrieve param value
                     manparams = []
                     for key, value in mansizes:
                         manparams.append((self._fromheader(key), self._fromheader(value)))
                     advparams = []
                     for key, value in advsizes:
                         advparams.append((self._fromheader(key), self._fromheader(value)))
                     self._initparams(manparams, advparams)
                     ## part payload
                     self._payloadstream = util.chunkbuffer(self._payloadchunks())
                     # we read the data, tell it
                     self._initialized = True
                 def _payloadchunks(self):
                     """Generator of decoded chunks in the payload."""
                     return decodepayloadchunks(self.ui, self._fp)
                 def consume(self):
                     """Read the part payload until completion.
                     By consuming the part data, the underlying stream read offset will
                     be advanced to the next part (or end of stream).
                     """
                     if self.consumed:
                         return
                     chunk = self.read(32768)
                     while chunk:
                         self._pos += len(chunk)
                         chunk = self.read(32768)
                 def read(self, size=None):
                     """read payload data"""
                     if not self._initialized:
                         self._readheader()
                     if size is None:
                         data = self._payloadstream.read()
                     else:
                         data = self._payloadstream.read(size)
                     self._pos += len(data)
                     if size is None or len(data) < size:
                         if not self.consumed and self._pos:
                             self.ui.debug(
                                 b'bundle2-input-part: total payload size %i\n' % self._pos
                             )
                         self.consumed = True
                     return data
             class seekableunbundlepart(unbundlepart):
                 """A bundle2 part in a bundle that is seekable.
                 Regular ``unbundlepart`` instances can only be read once. This class
                 extends ``unbundlepart`` to enable bi-directional seeking within the
                 part.
                 Bundle2 part data consists of framed chunks. Offsets when seeking
                 refer to the decoded data, not the offsets in the underlying bundle2
                 stream.
                 To facilitate quickly seeking within the decoded data, instances of this
                 class maintain a mapping between offsets in the underlying stream and
                 the decoded payload. This mapping will consume memory in proportion
                 to the number of chunks within the payload (which almost certainly
                 increases in proportion with the size of the part).
                 """
                 def __init__(self, ui, header, fp):
                     # (payload, file) offsets for chunk starts.
                     self._chunkindex = []
                     super(seekableunbundlepart, self).__init__(ui, header, fp)
                 def _payloadchunks(self, chunknum=0):
                     '''seek to specified chunk and start yielding data'''
                     if len(self._chunkindex) == 0:
                         assert chunknum == 0, b'Must start with chunk 0'
                         self._chunkindex.append((0, self._tellfp()))
                     else:
                         assert chunknum < len(self._chunkindex), (
                             b'Unknown chunk %d' % chunknum
                         )
                         self._seekfp(self._chunkindex[chunknum][1])
                     pos = self._chunkindex[chunknum][0]
                     for chunk in decodepayloadchunks(self.ui, self._fp):
                         chunknum += 1
                         pos += len(chunk)
                         if chunknum == len(self._chunkindex):
                             self._chunkindex.append((pos, self._tellfp()))
                         yield chunk
                 def _findchunk(self, pos):
                     '''for a given payload position, return a chunk number and offset'''
                     for chunk, (ppos, fpos) in enumerate(self._chunkindex):
                         if ppos == pos:
                             return chunk, 0
                         elif ppos > pos:
                             return chunk - 1, pos - self._chunkindex[chunk - 1][0]
                     raise ValueError(b'Unknown chunk')
                 def tell(self):
                     return self._pos
                 def seek(self, offset, whence=os.SEEK_SET):
                     if whence == os.SEEK_SET:
                         newpos = offset
                     elif whence == os.SEEK_CUR:
                         newpos = self._pos + offset
                     elif whence == os.SEEK_END:
                         if not self.consumed:
                             # Can't use self.consume() here because it advances self._pos.
                             chunk = self.read(32768)
                             while chunk:
                                 chunk = self.read(32768)
                         newpos = self._chunkindex[-1][0] - offset
                     else:
                         raise ValueError(b'Unknown whence value: %r' % (whence,))
                     if newpos > self._chunkindex[-1][0] and not self.consumed:
                         # Can't use self.consume() here because it advances self._pos.
                         chunk = self.read(32768)
                         while chunk:
                             chunk = self.read(32668)
                     if not 0 <= newpos <= self._chunkindex[-1][0]:
                         raise ValueError(b'Offset out of range')
                     if self._pos != newpos:
                         chunk, internaloffset = self._findchunk(newpos)
                         self._payloadstream = util.chunkbuffer(self._payloadchunks(chunk))
                         adjust = self.read(internaloffset)
                         if len(adjust) != internaloffset:
                             raise error.Abort(_(b'Seek failed\n'))
                         self._pos = newpos
                 def _seekfp(self, offset, whence=0):
                     """move the underlying file pointer
                     This method is meant for internal usage by the bundle2 protocol only.
                     They directly manipulate the low level stream including bundle2 level
                     instruction.
                     Do not use it to implement higher-level logic or methods."""
                     if self._seekable:
                         return self._fp.seek(offset, whence)
                     else:
                         raise NotImplementedError(_(b'File pointer is not seekable'))
                 def _tellfp(self):
                     """return the file offset, or None if file is not seekable
                     This method is meant for internal usage by the bundle2 protocol only.
                     They directly manipulate the low level stream including bundle2 level
                     instruction.
                     Do not use it to implement higher-level logic or methods."""
                     if self._seekable:
                         try:
                             return self._fp.tell()
                         except IOError as e:
                             if e.errno == errno.ESPIPE:
                                 self._seekable = False
                             else:
                                 raise
                     return None
             # These are only the static capabilities.
             # Check the 'getrepocaps' function for the rest.
             capabilities = {
                 b'HG20': (),
                 b'bookmarks': (),
                 b'error': (b'abort', b'unsupportedcontent', b'pushraced', b'pushkey'),
                 b'listkeys': (),
                 b'pushkey': (),
                 b'digests': tuple(sorted(util.DIGESTS.keys())),
                 b'remote-changegroup': (b'http', b'https'),
                 b'hgtagsfnodes': (),
                 b'rev-branch-cache': (),
                 b'phases': (b'heads',),
                 b'stream': (b'v2',),
             }
             def getrepocaps(repo, allowpushback=False, role=None):
                 """return the bundle2 capabilities for a given repo
                 Exists to allow extensions (like evolution) to mutate the capabilities.
                 The returned value is used for servers advertising their capabilities as
                 well as clients advertising their capabilities to servers as part of
                 bundle2 requests. The ``role`` argument specifies which is which.
                 """
                 if role not in (b'client', b'server'):
                     raise error.ProgrammingError(b'role argument must be client or server')
                 caps = capabilities.copy()
                 caps[b'changegroup'] = tuple(
                     sorted(changegroup.supportedincomingversions(repo))
                 )
                 if obsolete.isenabled(repo, obsolete.exchangeopt):
                     supportedformat = tuple(b'V%i' % v for v in obsolete.formats)
                     caps[b'obsmarkers'] = supportedformat
                 if allowpushback:
                     caps[b'pushback'] = ()
                 cpmode = repo.ui.config(b'server', b'concurrent-push-mode')
                 if cpmode == b'check-related':
                     caps[b'checkheads'] = (b'related',)
                 if b'phases' in repo.ui.configlist(b'devel', b'legacy.exchange'):
                     caps.pop(b'phases')
                 # Don't advertise stream clone support in server mode if not configured.
                 if role == b'server':
                     streamsupported = repo.ui.configbool(
                         b'server', b'uncompressed', untrusted=True
                     )
                     featuresupported = repo.ui.configbool(b'server', b'bundle2.stream')
                     if not streamsupported or not featuresupported:
                         caps.pop(b'stream')
                 # Else always advertise support on client, because payload support
                 # should always be advertised.
                 return caps
             def bundle2caps(remote):
                 """return the bundle capabilities of a peer as dict"""
                 raw = remote.capable(b'bundle2')
                 if not raw and raw != b'':
                     return {}
                 capsblob = urlreq.unquote(remote.capable(b'bundle2'))
                 return decodecaps(capsblob)
             def obsmarkersversion(caps):
                 """extract the list of supported obsmarkers versions from a bundle2caps dict
                 """
                 obscaps = caps.get(b'obsmarkers', ())
                 return [int(c[1:]) for c in obscaps if c.startswith(b'V')]
             def writenewbundle(
                 ui,
                 repo,
                 source,
                 filename,
                 bundletype,
                 outgoing,
                 opts,
                 vfs=None,
                 compression=None,
                 compopts=None,
             ):
                 if bundletype.startswith(b'HG10'):
                     cg = changegroup.makechangegroup(repo, outgoing, b'01', source)
                     return writebundle(
                         ui,
                         cg,
                         filename,
                         bundletype,
                         vfs=vfs,
                         compression=compression,
                         compopts=compopts,
                     )
                 elif not bundletype.startswith(b'HG20'):
                     raise error.ProgrammingError(b'unknown bundle type: %s' % bundletype)
                 caps = {}
                 if b'obsolescence' in opts:
                     caps[b'obsmarkers'] = (b'V1',)
                 bundle = bundle20(ui, caps)
                 bundle.setcompression(compression, compopts)
                 _addpartsfromopts(ui, repo, bundle, source, outgoing, opts)
                 chunkiter = bundle.getchunks()
                 return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)
             def _addpartsfromopts(ui, repo, bundler, source, outgoing, opts):
                 # We should eventually reconcile this logic with the one behind
                 # 'exchange.getbundle2partsgenerator'.
                 #
                 # The type of input from 'getbundle' and 'writenewbundle' are a bit
                 # different right now. So we keep them separated for now for the sake of
                 # simplicity.
                 # we might not always want a changegroup in such bundle, for example in
                 # stream bundles
                 if opts.get(b'changegroup', True):
                     cgversion = opts.get(b'cg.version')
                     if cgversion is None:
                         cgversion = changegroup.safeversion(repo)
                     cg = changegroup.makechangegroup(repo, outgoing, cgversion, source)
                     part = bundler.newpart(b'changegroup', data=cg.getchunks())
                     part.addparam(b'version', cg.version)
                     if b'clcount' in cg.extras:
                         part.addparam(
                             b'nbchanges', b'%d' % cg.extras[b'clcount'], mandatory=False
                         )
                     if opts.get(b'phases') and repo.revs(
                         b'%ln and secret()', outgoing.missingheads
                     ):
                         part.addparam(
                             b'targetphase', b'%d' % phases.secret, mandatory=False
                         )
                     if b'exp-sidedata-flag' in repo.requirements:
                         part.addparam(b'exp-sidedata', b'1')
                 if opts.get(b'streamv2', False):
                     addpartbundlestream2(bundler, repo, stream=True)
                 if opts.get(b'tagsfnodescache', True):
                     addparttagsfnodescache(repo, bundler, outgoing)
                 if opts.get(b'revbranchcache', True):
                     addpartrevbranchcache(repo, bundler, outgoing)
                 if opts.get(b'obsolescence', False):
                     obsmarkers = repo.obsstore.relevantmarkers(outgoing.missing)
                     buildobsmarkerspart(bundler, obsmarkers)
                 if opts.get(b'phases', False):
                     headsbyphase = phases.subsetphaseheads(repo, outgoing.missing)
                     phasedata = phases.binaryencode(headsbyphase)
                     bundler.newpart(b'phase-heads', data=phasedata)
             def addparttagsfnodescache(repo, bundler, outgoing):
                 # we include the tags fnode cache for the bundle changeset
                 # (as an optional parts)
                 cache = tags.hgtagsfnodescache(repo.unfiltered())
                 chunks = []
                 # .hgtags fnodes are only relevant for head changesets. While we could
                 # transfer values for all known nodes, there will likely be little to
                 # no benefit.
                 #
                 # We don't bother using a generator to produce output data because
                 # a) we only have 40 bytes per head and even esoteric numbers of heads
                 # consume little memory (1M heads is 40MB) b) we don't want to send the
                 # part if we don't have entries and knowing if we have entries requires
                 # cache lookups.
                 for node in outgoing.missingheads:
                     # Don't compute missing, as this may slow down serving.
                     fnode = cache.getfnode(node, computemissing=False)
                     if fnode is not None:
                         chunks.extend([node, fnode])
                 if chunks:
                     bundler.newpart(b'hgtagsfnodes', data=b''.join(chunks))
             def addpartrevbranchcache(repo, bundler, outgoing):
                 # we include the rev branch cache for the bundle changeset
                 # (as an optional parts)
                 cache = repo.revbranchcache()
                 cl = repo.unfiltered().changelog
                 branchesdata = collections.defaultdict(lambda: (set(), set()))
                 for node in outgoing.missing:
                     branch, close = cache.branchinfo(cl.rev(node))
                     branchesdata[branch][close].add(node)
                 def generate():
                     for branch, (nodes, closed) in sorted(branchesdata.items()):
                         utf8branch = encoding.fromlocal(branch)
                         yield rbcstruct.pack(len(utf8branch), len(nodes), len(closed))
                         yield utf8branch
                         for n in sorted(nodes):
                             yield n
                         for n in sorted(closed):
                             yield n
                 bundler.newpart(b'cache:rev-branch-cache', data=generate(), mandatory=False)
             def _formatrequirementsspec(requirements):
                 requirements = [req for req in requirements if req != b"shared"]
                 return urlreq.quote(b','.join(sorted(requirements)))
             def _formatrequirementsparams(requirements):
                 requirements = _formatrequirementsspec(requirements)
                 params = b"%s%s" % (urlreq.quote(b"requirements="), requirements)
                 return params
             def addpartbundlestream2(bundler, repo, **kwargs):
                 if not kwargs.get('stream', False):
                     return
                 if not streamclone.allowservergeneration(repo):
                     raise error.Abort(
                         _(
                             b'stream data requested but server does not allow '
                             b'this feature'
                         ),
                         hint=_(
                             b'well-behaved clients should not be '
                             b'requesting stream data from servers not '
                             b'advertising it; the client may be buggy'
                         ),
                     )
                 # Stream clones don't compress well. And compression undermines a
                 # goal of stream clones, which is to be fast. Communicate the desire
                 # to avoid compression to consumers of the bundle.
                 bundler.prefercompressed = False
                 # get the includes and excludes
                 includepats = kwargs.get('includepats')
                 excludepats = kwargs.get('excludepats')
                 narrowstream = repo.ui.configbool(
                     b'experimental', b'server.stream-narrow-clones'
                 )
                 if (includepats or excludepats) and not narrowstream:
                     raise error.Abort(_(b'server does not support narrow stream clones'))
                 includeobsmarkers = False
                 if repo.obsstore:
                     remoteversions = obsmarkersversion(bundler.capabilities)
                     if not remoteversions:
                         raise error.Abort(
                             _(
                                 b'server has obsolescence markers, but client '
                                 b'cannot receive them via stream clone'
                             )
                         )
                     elif repo.obsstore._version in remoteversions:
                         includeobsmarkers = True
                 filecount, bytecount, it = streamclone.generatev2(
                     repo, includepats, excludepats, includeobsmarkers
                 )
                 requirements = _formatrequirementsspec(repo.requirements)
                 part = bundler.newpart(b'stream2', data=it)
                 part.addparam(b'bytecount', b'%d' % bytecount, mandatory=True)
                 part.addparam(b'filecount', b'%d' % filecount, mandatory=True)
                 part.addparam(b'requirements', requirements, mandatory=True)
             def buildobsmarkerspart(bundler, markers):
                 """add an obsmarker part to the bundler with <markers>
                 No part is created if markers is empty.
                 Raises ValueError if the bundler doesn't support any known obsmarker format.
                 """
                 if not markers:
                     return None
                 remoteversions = obsmarkersversion(bundler.capabilities)
                 version = obsolete.commonversion(remoteversions)
                 if version is None:
                     raise ValueError(b'bundler does not support common obsmarker format')
                 stream = obsolete.encodemarkers(markers, True, version=version)
                 return bundler.newpart(b'obsmarkers', data=stream)
             def writebundle(
                 ui, cg, filename, bundletype, vfs=None, compression=None, compopts=None
             ):
                 """Write a bundle file and return its filename.
                 Existing files will not be overwritten.
                 If no filename is specified, a temporary file is created.
                 bz2 compression can be turned off.
                 The bundle file will be deleted in case of errors.
                 """
                 if bundletype == b"HG20":
                     bundle = bundle20(ui)
                     bundle.setcompression(compression, compopts)
                     part = bundle.newpart(b'changegroup', data=cg.getchunks())
                     part.addparam(b'version', cg.version)
                     if b'clcount' in cg.extras:
                         part.addparam(
                             b'nbchanges', b'%d' % cg.extras[b'clcount'], mandatory=False
                         )
                     chunkiter = bundle.getchunks()
                 else:
                     # compression argument is only for the bundle2 case
                     assert compression is None
                     if cg.version != b'01':
                         raise error.Abort(
                             _(b'old bundle types only supports v1 changegroups')
                         )
                     header, comp = bundletypes[bundletype]
                     if comp not in util.compengines.supportedbundletypes:
                         raise error.Abort(_(b'unknown stream compression type: %s') % comp)
                     compengine = util.compengines.forbundletype(comp)
                     def chunkiter():
                         yield header
                         for chunk in compengine.compressstream(cg.getchunks(), compopts):
                             yield chunk
                     chunkiter = chunkiter()
                 # parse the changegroup data, otherwise we will block
                 # in case of sshrepo because we don't know the end of the stream
                 return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)
             def combinechangegroupresults(op):
                 """logic to combine 0 or more addchangegroup results into one"""
                 results = [r.get(b'return', 0) for r in op.records[b'changegroup']]
                 changedheads = 0
                 result = 1
                 for ret in results:
                     # If any changegroup result is 0, return 0
                     if ret == 0:
                         result = 0
                         break
                     if ret < -1:
                         changedheads += ret + 1
                     elif ret > 1:
                         changedheads += ret - 1
                 if changedheads > 0:
                     result = 1 + changedheads
                 elif changedheads < 0:
                     result = -1 + changedheads
                 return result
             @parthandler(
                 b'changegroup',
                 (
                     b'version',
                     b'nbchanges',
                     b'exp-sidedata',
                     b'treemanifest',
                     b'targetphase',
                 ),
             )
             def handlechangegroup(op, inpart):
                 """apply a changegroup part on the repo
                 This is a very early implementation that will massive rework before being
                 inflicted to any end-user.
                 """
                 from . import localrepo
                 tr = op.gettransaction()
                 unpackerversion = inpart.params.get(b'version', b'01')
                 # We should raise an appropriate exception here
                 cg = changegroup.getunbundler(unpackerversion, inpart, None)
                 # the source and url passed here are overwritten by the one contained in
                 # the transaction.hookargs argument. So 'bundle2' is a placeholder
                 nbchangesets = None
                 if b'nbchanges' in inpart.params:
                     nbchangesets = int(inpart.params.get(b'nbchanges'))
                 if (
                     b'treemanifest' in inpart.params
                     and b'treemanifest' not in op.repo.requirements
                 ):
                     if len(op.repo.changelog) != 0:
                         raise error.Abort(
                             _(
                                 b"bundle contains tree manifests, but local repo is "
                                 b"non-empty and does not use tree manifests"
                             )
                         )
                     op.repo.requirements.add(b'treemanifest')
                     op.repo.svfs.options = localrepo.resolvestorevfsoptions(
                         op.repo.ui, op.repo.requirements, op.repo.features
                     )
                     op.repo._writerequirements()
                 bundlesidedata = bool(b'exp-sidedata' in inpart.params)
                 reposidedata = bool(b'exp-sidedata-flag' in op.repo.requirements)
                 if reposidedata and not bundlesidedata:
                     msg = b"repository is using sidedata but the bundle source do not"
                     hint = b'this is currently unsupported'
                     raise error.Abort(msg, hint=hint)
                 extrakwargs = {}
                 targetphase = inpart.params.get(b'targetphase')
                 if targetphase is not None:
                     extrakwargs['targetphase'] = int(targetphase)
                 ret = _processchangegroup(
                     op,
                     cg,
                     tr,
                     b'bundle2',
                     b'bundle2',
                     expectedtotal=nbchangesets,
                     **extrakwargs
                 )
                 if op.reply is not None:
                     # This is definitely not the final form of this
                     # return. But one need to start somewhere.
                     part = op.reply.newpart(b'reply:changegroup', mandatory=False)
                     part.addparam(
                         b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
                     )
                     part.addparam(b'return', b'%i' % ret, mandatory=False)
                 assert not inpart.read()
             _remotechangegroupparams = tuple(
                 [b'url', b'size', b'digests']
                 + [b'digest:%s' % k for k in util.DIGESTS.keys()]
             )
             @parthandler(b'remote-changegroup', _remotechangegroupparams)
             def handleremotechangegroup(op, inpart):
                 """apply a bundle10 on the repo, given an url and validation information
                 All the information about the remote bundle to import are given as
                 parameters. The parameters include:
                   - url: the url to the bundle10.
                   - size: the bundle10 file size. It is used to validate what was
                     retrieved by the client matches the server knowledge about the bundle.
                   - digests: a space separated list of the digest types provided as
                     parameters.
                   - digest:<digest-type>: the hexadecimal representation of the digest with
                     that name. Like the size, it is used to validate what was retrieved by
                     the client matches what the server knows about the bundle.
                 When multiple digest types are given, all of them are checked.
                 """
                 try:
                     raw_url = inpart.params[b'url']
                 except KeyError:
                     raise error.Abort(_(b'remote-changegroup: missing "%s" param') % b'url')
                 parsed_url = util.url(raw_url)
                 if parsed_url.scheme not in capabilities[b'remote-changegroup']:
                     raise error.Abort(
                         _(b'remote-changegroup does not support %s urls')
                         % parsed_url.scheme
                     )
                 try:
                     size = int(inpart.params[b'size'])
                 except ValueError:
                     raise error.Abort(
                         _(b'remote-changegroup: invalid value for param "%s"') % b'size'
                     )
                 except KeyError:
                     raise error.Abort(
                         _(b'remote-changegroup: missing "%s" param') % b'size'
                     )
                 digests = {}
                 for typ in inpart.params.get(b'digests', b'').split():
                     param = b'digest:%s' % typ
                     try:
                         value = inpart.params[param]
                     except KeyError:
                         raise error.Abort(
                             _(b'remote-changegroup: missing "%s" param') % param
                         )
                     digests[typ] = value
                 real_part = util.digestchecker(url.open(op.ui, raw_url), size, digests)
                 tr = op.gettransaction()
                 from . import exchange
                 cg = exchange.readbundle(op.repo.ui, real_part, raw_url)
                 if not isinstance(cg, changegroup.cg1unpacker):
                     raise error.Abort(
                         _(b'%s: not a bundle version 1.0') % util.hidepassword(raw_url)
                     )
                 ret = _processchangegroup(op, cg, tr, b'bundle2', b'bundle2')
                 if op.reply is not None:
                     # This is definitely not the final form of this
                     # return. But one need to start somewhere.
                     part = op.reply.newpart(b'reply:changegroup')
                     part.addparam(
                         b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
                     )
                     part.addparam(b'return', b'%i' % ret, mandatory=False)
                 try:
                     real_part.validate()
                 except error.Abort as e:
                     raise error.Abort(
                         _(b'bundle at %s is corrupted:\n%s')
                         % (util.hidepassword(raw_url), bytes(e))
                     )
                 assert not inpart.read()
             @parthandler(b'reply:changegroup', (b'return', b'in-reply-to'))
             def handlereplychangegroup(op, inpart):
                 ret = int(inpart.params[b'return'])
                 replyto = int(inpart.params[b'in-reply-to'])
                 op.records.add(b'changegroup', {b'return': ret}, replyto)
             @parthandler(b'check:bookmarks')
             def handlecheckbookmarks(op, inpart):
                 """check location of bookmarks
                 This part is to be used to detect push race regarding bookmark, it
                 contains binary encoded (bookmark, node) tuple. If the local state does
                 not marks the one in the part, a PushRaced exception is raised
                 """
                 bookdata = bookmarks.binarydecode(inpart)
                 msgstandard = (
                     b'remote repository changed while pushing - please try again '
                     b'(bookmark "%s" move from %s to %s)'
                 )
                 msgmissing = (
                     b'remote repository changed while pushing - please try again '
                     b'(bookmark "%s" is missing, expected %s)'
                 )
                 msgexist = (
                     b'remote repository changed while pushing - please try again '
                     b'(bookmark "%s" set on %s, expected missing)'
                 )
                 for book, node in bookdata:
                     currentnode = op.repo._bookmarks.get(book)
                     if currentnode != node:
                         if node is None:
                             finalmsg = msgexist % (book, nodemod.short(currentnode))
                         elif currentnode is None:
                             finalmsg = msgmissing % (book, nodemod.short(node))
                         else:
                             finalmsg = msgstandard % (
                                 book,
                                 nodemod.short(node),
                                 nodemod.short(currentnode),
                             )
                         raise error.PushRaced(finalmsg)
             @parthandler(b'check:heads')
             def handlecheckheads(op, inpart):
                 """check that head of the repo did not change
                 This is used to detect a push race when using unbundle.
                 This replaces the "heads" argument of unbundle."""
                 h = inpart.read(20)
                 heads = []
                 while len(h) == 20:
                     heads.append(h)
                     h = inpart.read(20)
                 assert not h
                 # Trigger a transaction so that we are guaranteed to have the lock now.
                 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
                     op.gettransaction()
                 if sorted(heads) != sorted(op.repo.heads()):
                     raise error.PushRaced(
                         b'remote repository changed while pushing - please try again'
                     )
             @parthandler(b'check:updated-heads')
             def handlecheckupdatedheads(op, inpart):
                 """check for race on the heads touched by a push
                 This is similar to 'check:heads' but focus on the heads actually updated
                 during the push. If other activities happen on unrelated heads, it is
                 ignored.
                 This allow server with high traffic to avoid push contention as long as
                 unrelated parts of the graph are involved."""
                 h = inpart.read(20)
                 heads = []
                 while len(h) == 20:
                     heads.append(h)
                     h = inpart.read(20)
                 assert not h
                 # trigger a transaction so that we are guaranteed to have the lock now.
                 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
                     op.gettransaction()
                 currentheads = set()
                 for ls in op.repo.branchmap().iterheads():
                     currentheads.update(ls)
                 for h in heads:
                     if h not in currentheads:
                         raise error.PushRaced(
                             b'remote repository changed while pushing - '
                             b'please try again'
                         )
             @parthandler(b'check:phases')
             def handlecheckphases(op, inpart):
                 """check that phase boundaries of the repository did not change
                 This is used to detect a push race.
                 """
                 phasetonodes = phases.binarydecode(inpart)
                 unfi = op.repo.unfiltered()
                 cl = unfi.changelog
                 phasecache = unfi._phasecache
                 msg = (
                     b'remote repository changed while pushing - please try again '
                     b'(%s is %s expected %s)'
                 )
                 for expectedphase, nodes in enumerate(phasetonodes):
                     for n in nodes:
                         actualphase = phasecache.phase(unfi, cl.rev(n))
                         if actualphase != expectedphase:
                             finalmsg = msg % (
                                 nodemod.short(n),
                                 phases.phasenames[actualphase],
                                 phases.phasenames[expectedphase],
                             )
                             raise error.PushRaced(finalmsg)
             @parthandler(b'output')
             def handleoutput(op, inpart):
                 """forward output captured on the server to the client"""
                 for line in inpart.read().splitlines():
                     op.ui.status(_(b'remote: %s\n') % line)
             @parthandler(b'replycaps')
             def handlereplycaps(op, inpart):
                 """Notify that a reply bundle should be created
                 The payload contains the capabilities information for the reply"""
                 caps = decodecaps(inpart.read())
                 if op.reply is None:
                     op.reply = bundle20(op.ui, caps)
             class AbortFromPart(error.Abort):
                 """Sub-class of Abort that denotes an error from a bundle2 part."""
             @parthandler(b'error:abort', (b'message', b'hint'))
             def handleerrorabort(op, inpart):
                 """Used to transmit abort error over the wire"""
                 raise AbortFromPart(
                     inpart.params[b'message'], hint=inpart.params.get(b'hint')
                 )
             @parthandler(
                 b'error:pushkey',
                 (b'namespace', b'key', b'new', b'old', b'ret', b'in-reply-to'),
             )
             def handleerrorpushkey(op, inpart):
                 """Used to transmit failure of a mandatory pushkey over the wire"""
                 kwargs = {}
                 for name in (b'namespace', b'key', b'new', b'old', b'ret'):
                     value = inpart.params.get(name)
                     if value is not None:
                         kwargs[name] = value
                 raise error.PushkeyFailed(
                     inpart.params[b'in-reply-to'], **pycompat.strkwargs(kwargs)
                 )
             @parthandler(b'error:unsupportedcontent', (b'parttype', b'params'))
             def handleerrorunsupportedcontent(op, inpart):
                 """Used to transmit unknown content error over the wire"""
                 kwargs = {}
                 parttype = inpart.params.get(b'parttype')
                 if parttype is not None:
                     kwargs[b'parttype'] = parttype
                 params = inpart.params.get(b'params')
                 if params is not None:
                     kwargs[b'params'] = params.split(b'\0')
                 raise error.BundleUnknownFeatureError(**pycompat.strkwargs(kwargs))
             @parthandler(b'error:pushraced', (b'message',))
             def handleerrorpushraced(op, inpart):
                 """Used to transmit push race error over the wire"""
                 raise error.ResponseError(_(b'push failed:'), inpart.params[b'message'])
             @parthandler(b'listkeys', (b'namespace',))
             def handlelistkeys(op, inpart):
                 """retrieve pushkey namespace content stored in a bundle2"""
                 namespace = inpart.params[b'namespace']
                 r = pushkey.decodekeys(inpart.read())
                 op.records.add(b'listkeys', (namespace, r))
             @parthandler(b'pushkey', (b'namespace', b'key', b'old', b'new'))
             def handlepushkey(op, inpart):
                 """process a pushkey request"""
                 dec = pushkey.decode
                 namespace = dec(inpart.params[b'namespace'])
                 key = dec(inpart.params[b'key'])
                 old = dec(inpart.params[b'old'])
                 new = dec(inpart.params[b'new'])
                 # Grab the transaction to ensure that we have the lock before performing the
                 # pushkey.
                 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
                     op.gettransaction()
                 ret = op.repo.pushkey(namespace, key, old, new)
                 record = {b'namespace': namespace, b'key': key, b'old': old, b'new': new}
                 op.records.add(b'pushkey', record)
                 if op.reply is not None:
                     rpart = op.reply.newpart(b'reply:pushkey')
                     rpart.addparam(
                         b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
                     )
                     rpart.addparam(b'return', b'%i' % ret, mandatory=False)
                 if inpart.mandatory and not ret:
                     kwargs = {}
                     for key in (b'namespace', b'key', b'new', b'old', b'ret'):
                         if key in inpart.params:
                             kwargs[key] = inpart.params[key]
                     raise error.PushkeyFailed(
                         partid=b'%d' % inpart.id, **pycompat.strkwargs(kwargs)
                     )
             @parthandler(b'bookmarks')
             def handlebookmark(op, inpart):
                 """transmit bookmark information
                 The part contains binary encoded bookmark information.
                 The exact behavior of this part can be controlled by the 'bookmarks' mode
                 on the bundle operation.
                 When mode is 'apply' (the default) the bookmark information is applied as
                 is to the unbundling repository. Make sure a 'check:bookmarks' part is
                 issued earlier to check for push races in such update. This behavior is
                 suitable for pushing.
                 When mode is 'records', the information is recorded into the 'bookmarks'
                 records of the bundle operation. This behavior is suitable for pulling.
                 """
                 changes = bookmarks.binarydecode(inpart)
                 pushkeycompat = op.repo.ui.configbool(
                     b'server', b'bookmarks-pushkey-compat'
                 )
                 bookmarksmode = op.modes.get(b'bookmarks', b'apply')
                 if bookmarksmode == b'apply':
                     tr = op.gettransaction()
                     bookstore = op.repo._bookmarks
                     if pushkeycompat:
                         allhooks = []
                         for book, node in changes:
                             hookargs = tr.hookargs.copy()
                             hookargs[b'pushkeycompat'] = b'1'
                             hookargs[b'namespace'] = b'bookmarks'
                             hookargs[b'key'] = book
                             hookargs[b'old'] = nodemod.hex(bookstore.get(book, b''))
                             hookargs[b'new'] = nodemod.hex(
                                 node if node is not None else b''
                             )
                             allhooks.append(hookargs)
                         for hookargs in allhooks:
                             op.repo.hook(
                                 b'prepushkey', throw=True, **pycompat.strkwargs(hookargs)
                             )
                     bookstore.applychanges(op.repo, op.gettransaction(), changes)
                     if pushkeycompat:
                         def runhook(unused_success):
                             for hookargs in allhooks:
                                 op.repo.hook(b'pushkey', **pycompat.strkwargs(hookargs))
                         op.repo._afterlock(runhook)
                 elif bookmarksmode == b'records':
                     for book, node in changes:
                         record = {b'bookmark': book, b'node': node}
                         op.records.add(b'bookmarks', record)
                 else:
                     raise error.ProgrammingError(
                         b'unkown bookmark mode: %s' % bookmarksmode
                     )
             @parthandler(b'phase-heads')
             def handlephases(op, inpart):
                 """apply phases from bundle part to repo"""
                 headsbyphase = phases.binarydecode(inpart)
                 phases.updatephases(op.repo.unfiltered(), op.gettransaction, headsbyphase)
             @parthandler(b'reply:pushkey', (b'return', b'in-reply-to'))
             def handlepushkeyreply(op, inpart):
                 """retrieve the result of a pushkey request"""
                 ret = int(inpart.params[b'return'])
                 partid = int(inpart.params[b'in-reply-to'])
                 op.records.add(b'pushkey', {b'return': ret}, partid)
             @parthandler(b'obsmarkers')
             def handleobsmarker(op, inpart):
                 """add a stream of obsmarkers to the repo"""
                 tr = op.gettransaction()
                 markerdata = inpart.read()
                 if op.ui.config(b'experimental', b'obsmarkers-exchange-debug'):
                     op.ui.writenoi18n(
                         b'obsmarker-exchange: %i bytes received\n' % len(markerdata)
                     )
                 # The mergemarkers call will crash if marker creation is not enabled.
                 # we want to avoid this if the part is advisory.
                 if not inpart.mandatory and op.repo.obsstore.readonly:
                     op.repo.ui.debug(
                         b'ignoring obsolescence markers, feature not enabled\n'
                     )
                     return
                 new = op.repo.obsstore.mergemarkers(tr, markerdata)
                 op.repo.invalidatevolatilesets()
                 op.records.add(b'obsmarkers', {b'new': new})
                 if op.reply is not None:
                     rpart = op.reply.newpart(b'reply:obsmarkers')
                     rpart.addparam(
                         b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
                     )
                     rpart.addparam(b'new', b'%i' % new, mandatory=False)
             @parthandler(b'reply:obsmarkers', (b'new', b'in-reply-to'))
             def handleobsmarkerreply(op, inpart):
                 """retrieve the result of a pushkey request"""
                 ret = int(inpart.params[b'new'])
                 partid = int(inpart.params[b'in-reply-to'])
                 op.records.add(b'obsmarkers', {b'new': ret}, partid)
             @parthandler(b'hgtagsfnodes')
             def handlehgtagsfnodes(op, inpart):
                 """Applies .hgtags fnodes cache entries to the local repo.
                 Payload is pairs of 20 byte changeset nodes and filenodes.
                 """
                 # Grab the transaction so we ensure that we have the lock at this point.
                 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
                     op.gettransaction()
                 cache = tags.hgtagsfnodescache(op.repo.unfiltered())
                 count = 0
                 while True:
                     node = inpart.read(20)
                     fnode = inpart.read(20)
                     if len(node) < 20 or len(fnode) < 20:
                         op.ui.debug(b'ignoring incomplete received .hgtags fnodes data\n')
                         break
                     cache.setfnode(node, fnode)
                     count += 1
                 cache.write()
                 op.ui.debug(b'applied %i hgtags fnodes cache entries\n' % count)
             rbcstruct = struct.Struct(b'>III')
             @parthandler(b'cache:rev-branch-cache')
             def handlerbc(op, inpart):
                 """receive a rev-branch-cache payload and update the local cache
                 The payload is a series of data related to each branch
 ) branch name length
 ) number of open heads
 ) number of closed heads
 ) open heads nodes
 ) closed heads nodes
                 """
                 total = 0
                 rawheader = inpart.read(rbcstruct.size)
                 cache = op.repo.revbranchcache()
                 cl = op.repo.unfiltered().changelog
                 while rawheader:
                     header = rbcstruct.unpack(rawheader)
                     total += header[1] + header[2]
                     utf8branch = inpart.read(header[0])
                     branch = encoding.tolocal(utf8branch)
                     for x in pycompat.xrange(header[1]):
                         node = inpart.read(20)
                         rev = cl.rev(node)
                         cache.setdata(branch, rev, node, False)
                     for x in pycompat.xrange(header[2]):
                         node = inpart.read(20)
                         rev = cl.rev(node)
                         cache.setdata(branch, rev, node, True)
                     rawheader = inpart.read(rbcstruct.size)
                 cache.write()
             @parthandler(b'pushvars')
             def bundle2getvars(op, part):
                 '''unbundle a bundle2 containing shellvars on the server'''
                 # An option to disable unbundling on server-side for security reasons
                 if op.ui.configbool(b'push', b'pushvars.server'):
                     hookargs = {}
                     for key, value in part.advisoryparams:
                         key = key.upper()
                         # We want pushed variables to have USERVAR_ prepended so we know
                         # they came from the --pushvar flag.
                         key = b"USERVAR_" + key
                         hookargs[key] = value
                     op.addhookargs(hookargs)
             @parthandler(b'stream2', (b'requirements', b'filecount', b'bytecount'))
             def handlestreamv2bundle(op, part):
                 requirements = urlreq.unquote(part.params[b'requirements']).split(b',')
                 filecount = int(part.params[b'filecount'])
                 bytecount = int(part.params[b'bytecount'])
                 repo = op.repo
                 if len(repo):
                     msg = _(b'cannot apply stream clone to non empty repository')
                     raise error.Abort(msg)
                 repo.ui.debug(b'applying stream bundle\n')
                 streamclone.applybundlev2(repo, part, filecount, bytecount, requirements)
             def widen_bundle(
                 bundler, repo, oldmatcher, newmatcher, common, known, cgversion, ellipses
             ):
                 """generates bundle2 for widening a narrow clone
                 bundler is the bundle to which data should be added
                 repo is the localrepository instance
                 oldmatcher matches what the client already has
                 newmatcher matches what the client needs (including what it already has)
                 common is set of common heads between server and client
                 known is a set of revs known on the client side (used in ellipses)
                 cgversion is the changegroup version to send
                 ellipses is boolean value telling whether to send ellipses data or not
                 returns bundle2 of the data required for extending
                 """
                 commonnodes = set()
                 cl = repo.changelog
                 for r in repo.revs(b"::%ln", common):
                     commonnodes.add(cl.node(r))
                 if commonnodes:
                     # XXX: we should only send the filelogs (and treemanifest). user
                     # already has the changelog and manifest
                     packer = changegroup.getbundler(
                         cgversion,
                         repo,
                         oldmatcher=oldmatcher,
                         matcher=newmatcher,
                         fullnodes=commonnodes,
                     )
                     cgdata = packer.generate(
                         {nodemod.nullid},
                         list(commonnodes),
                         False,
                         b'narrow_widen',
                         changelog=False,
                     )
                     part = bundler.newpart(b'changegroup', data=cgdata)
                     part.addparam(b'version', cgversion)
                     if b'treemanifest' in repo.requirements:
                         part.addparam(b'treemanifest', b'1')
                     if b'exp-sidedata-flag' in repo.requirements:
                         part.addparam(b'exp-sidedata', b'1')
                 return bundler

mercurial/linelog.py

0 +1 -1

             # linelog - efficient cache for annotate data
             #
             # Copyright 2018 Google LLC.
             #
             # This software may be used and distributed according to the terms of the
             # GNU General Public License version 2 or any later version.
             """linelog is an efficient cache for annotate data inspired by SCCS Weaves.
             SCCS Weaves are an implementation of
             https://en.wikipedia.org/wiki/Interleaved_deltas. See
             mercurial/helptext/internals/linelog.txt for an exploration of SCCS weaves
             and how linelog works in detail.
             Here's a hacker's summary: a linelog is a program which is executed in
             the context of a revision. Executing the program emits information
             about lines, including the revision that introduced them and the line
             number in the file at the introducing revision. When an insertion or
             deletion is performed on the file, a jump instruction is used to patch
             in a new body of annotate information.
             """
             from __future__ import absolute_import, print_function
             import abc
             import struct
             from .thirdparty import attr
             from . import pycompat
             _llentry = struct.Struct(b'>II')
             class LineLogError(Exception):
                 """Error raised when something bad happens internally in linelog."""
             @attr.s
             class lineinfo(object):
                 # Introducing revision of this line.
                 rev = attr.ib()
                 # Line number for this line in its introducing revision.
                 linenum = attr.ib()
                 # Private. Offset in the linelog program of this line. Used internally.
                 _offset = attr.ib()
             @attr.s
             class annotateresult(object):
                 rev = attr.ib()
                 lines = attr.ib()
                 _eof = attr.ib()
                 def __iter__(self):
                     return iter(self.lines)
             class _llinstruction(object):  # pytype: disable=ignored-metaclass
                 __metaclass__ = abc.ABCMeta
                 @abc.abstractmethod
                 def __init__(self, op1, op2):
                     pass
                 @abc.abstractmethod
                 def __str__(self):
                     pass
                 def __repr__(self):
                     return str(self)
                 @abc.abstractmethod
                 def __eq__(self, other):
                     pass
                 @abc.abstractmethod
                 def encode(self):
                     """Encode this instruction to the binary linelog format."""
                 @abc.abstractmethod
                 def execute(self, rev, pc, emit):
                     """Execute this instruction.
                     Args:
                       rev: The revision we're annotating.
                       pc: The current offset in the linelog program.
                       emit: A function that accepts a single lineinfo object.
                     Returns:
                       The new value of pc. Returns None if exeuction should stop
                       (that is, we've found the end of the file.)
                     """
             class _jge(_llinstruction):
                 """If the current rev is greater than or equal to op1, jump to op2."""
                 def __init__(self, op1, op2):
                     self._cmprev = op1
                     self._target = op2
                 def __str__(self):
                     return 'JGE %d %d' % (self._cmprev, self._target)
                 def __eq__(self, other):
                     return (
                         type(self) == type(other)
                         and self._cmprev == other._cmprev
                         and self._target == other._target
                     )
                 def encode(self):
                     return _llentry.pack(self._cmprev << 2, self._target)
                 def execute(self, rev, pc, emit):
                     if rev >= self._cmprev:
                         return self._target
                     return pc + 1
             class _jump(_llinstruction):
                 """Unconditional jumps are expressed as a JGE with op1 set to 0."""
                 def __init__(self, op1, op2):
                     if op1 != 0:
                         raise LineLogError(b"malformed JUMP, op1 must be 0, got %d" % op1)
                     self._target = op2
                 def __str__(self):
                     return 'JUMP %d' % (self._target)
                 def __eq__(self, other):
                     return type(self) == type(other) and self._target == other._target
                 def encode(self):
                     return _llentry.pack(0, self._target)
                 def execute(self, rev, pc, emit):
                     return self._target
             class _eof(_llinstruction):
                 """EOF is expressed as a JGE that always jumps to 0."""
                 def __init__(self, op1, op2):
                     if op1 != 0:
                         raise LineLogError(b"malformed EOF, op1 must be 0, got %d" % op1)
                     if op2 != 0:
                         raise LineLogError(b"malformed EOF, op2 must be 0, got %d" % op2)
                 def __str__(self):
                     return r'EOF'
                 def __eq__(self, other):
                     return type(self) == type(other)
                 def encode(self):
                     return _llentry.pack(0, 0)
                 def execute(self, rev, pc, emit):
                     return None
             class _jl(_llinstruction):
                 """If the current rev is less than op1, jump to op2."""
                 def __init__(self, op1, op2):
                     self._cmprev = op1
                     self._target = op2
                 def __str__(self):
                     return 'JL %d %d' % (self._cmprev, self._target)
                 def __eq__(self, other):
                     return (
                         type(self) == type(other)
                         and self._cmprev == other._cmprev
                         and self._target == other._target
                     )
                 def encode(self):
                     return _llentry.pack(1 | (self._cmprev << 2), self._target)
                 def execute(self, rev, pc, emit):
                     if rev < self._cmprev:
                         return self._target
                     return pc + 1
             class _line(_llinstruction):
                 """Emit a line."""
                 def __init__(self, op1, op2):
                     # This line was introduced by this revision number.
                     self._rev = op1
                     # This line had the specified line number in the introducing revision.
                     self._origlineno = op2
                 def __str__(self):
                     return 'LINE %d %d' % (self._rev, self._origlineno)
                 def __eq__(self, other):
                     return (
                         type(self) == type(other)
                         and self._rev == other._rev
                         and self._origlineno == other._origlineno
                     )
                 def encode(self):
                     return _llentry.pack(2 | (self._rev << 2), self._origlineno)
                 def execute(self, rev, pc, emit):
                     emit(lineinfo(self._rev, self._origlineno, pc))
                     return pc + 1
             def _decodeone(data, offset):
                 """Decode a single linelog instruction from an offset in a buffer."""
                 try:
                     op1, op2 = _llentry.unpack_from(data, offset)
                 except struct.error as e:
                     raise LineLogError(b'reading an instruction failed: %r' % e)
                 opcode = op1 & 0b11
                 op1 = op1 >> 2
                 if opcode == 0:
                     if op1 == 0:
                         if op2 == 0:
                             return _eof(op1, op2)
                         return _jump(op1, op2)
                     return _jge(op1, op2)
                 elif opcode == 1:
                     return _jl(op1, op2)
                 elif opcode == 2:
                     return _line(op1, op2)
                 raise NotImplementedError(b'Unimplemented opcode %r' % opcode)
             class linelog(object):
                 """Efficient cache for per-line history information."""
                 def __init__(self, program=None, maxrev=0):
                     if program is None:
                         # We pad the program with an extra leading EOF so that our
                         # offsets will match the C code exactly. This means we can
                         # interoperate with the C code.
                         program = [_eof(0, 0), _eof(0, 0)]
                     self._program = program
                     self._lastannotate = None
                     self._maxrev = maxrev
                 def __eq__(self, other):
                     return (
                         type(self) == type(other)
                         and self._program == other._program
                         and self._maxrev == other._maxrev
                     )
                 def __repr__(self):
-                    return b'<linelog at %s: maxrev=%d size=%d>' % (
+                    return '<linelog at %s: maxrev=%d size=%d>' % (
                         hex(id(self)),
                         self._maxrev,
                         len(self._program),
                     )
                 def debugstr(self):
                     fmt = '%%%dd %%s' % len(str(len(self._program)))
                     return pycompat.sysstr(b'\n').join(
                         fmt % (idx, i) for idx, i in enumerate(self._program[1:], 1)
                     )
                 @classmethod
                 def fromdata(cls, buf):
                     if len(buf) % _llentry.size != 0:
                         raise LineLogError(
                             b"invalid linelog buffer size %d (must be a multiple of %d)"
                             % (len(buf), _llentry.size)
                         )
                     expected = len(buf) / _llentry.size
                     fakejge = _decodeone(buf, 0)
                     if isinstance(fakejge, _jump):
                         maxrev = 0
                     elif isinstance(fakejge, (_jge, _jl)):
                         maxrev = fakejge._cmprev
                     else:
                         raise LineLogError(
                             'Expected one of _jump, _jge, or _jl. Got %s.'
                             % type(fakejge).__name__
                         )
                     assert isinstance(fakejge, (_jump, _jge, _jl))  # help pytype
                     numentries = fakejge._target
                     if expected != numentries:
                         raise LineLogError(
                             b"corrupt linelog data: claimed"
                             b" %d entries but given data for %d entries"
                             % (expected, numentries)
                         )
                     instructions = [_eof(0, 0)]
                     for offset in pycompat.xrange(1, numentries):
                         instructions.append(_decodeone(buf, offset * _llentry.size))
                     return cls(instructions, maxrev=maxrev)
                 def encode(self):
                     hdr = _jge(self._maxrev, len(self._program)).encode()
                     return hdr + b''.join(i.encode() for i in self._program[1:])
                 def clear(self):
                     self._program = []
                     self._maxrev = 0
                     self._lastannotate = None
                 def replacelines_vec(self, rev, a1, a2, blines):
                     return self.replacelines(
                         rev, a1, a2, 0, len(blines), _internal_blines=blines
                     )
                 def replacelines(self, rev, a1, a2, b1, b2, _internal_blines=None):
                     """Replace lines [a1, a2) with lines [b1, b2)."""
                     if self._lastannotate:
                         # TODO(augie): make replacelines() accept a revision at
                         # which we're editing as well as a revision to mark
                         # responsible for the edits. In hg-experimental it's
                         # stateful like this, so we're doing the same thing to
                         # retain compatibility with absorb until that's imported.
                         ar = self._lastannotate
                     else:
                         ar = self.annotate(rev)
                         #        ar = self.annotate(self._maxrev)
                     if a1 > len(ar.lines):
                         raise LineLogError(
                             b'%d contains %d lines, tried to access line %d'
                             % (rev, len(ar.lines), a1)
                         )
                     elif a1 == len(ar.lines):
                         # Simulated EOF instruction since we're at EOF, which
                         # doesn't have a "real" line.
                         a1inst = _eof(0, 0)
                         a1info = lineinfo(0, 0, ar._eof)
                     else:
                         a1info = ar.lines[a1]
                         a1inst = self._program[a1info._offset]
                     programlen = self._program.__len__
                     oldproglen = programlen()
                     appendinst = self._program.append
                     # insert
                     blineinfos = []
                     bappend = blineinfos.append
                     if b1 < b2:
                         # Determine the jump target for the JGE at the start of
                         # the new block.
                         tgt = oldproglen + (b2 - b1 + 1)
                         # Jump to skip the insert if we're at an older revision.
                         appendinst(_jl(rev, tgt))
                         for linenum in pycompat.xrange(b1, b2):
                             if _internal_blines is None:
                                 bappend(lineinfo(rev, linenum, programlen()))
                                 appendinst(_line(rev, linenum))
                             else:
                                 newrev, newlinenum = _internal_blines[linenum]
                                 bappend(lineinfo(newrev, newlinenum, programlen()))
                                 appendinst(_line(newrev, newlinenum))
                     # delete
                     if a1 < a2:
                         if a2 > len(ar.lines):
                             raise LineLogError(
                                 b'%d contains %d lines, tried to access line %d'
                                 % (rev, len(ar.lines), a2)
                             )
                         elif a2 == len(ar.lines):
                             endaddr = ar._eof
                         else:
                             endaddr = ar.lines[a2]._offset
                         if a2 > 0 and rev < self._maxrev:
                             # If we're here, we're deleting a chunk of an old
                             # commit, so we need to be careful and not touch
                             # invisible lines between a2-1 and a2 (IOW, lines that
                             # are added later).
                             endaddr = ar.lines[a2 - 1]._offset + 1
                         appendinst(_jge(rev, endaddr))
                     # copy instruction from a1
                     a1instpc = programlen()
                     appendinst(a1inst)
                     # if a1inst isn't a jump or EOF, then we need to add an unconditional
                     # jump back into the program here.
                     if not isinstance(a1inst, (_jump, _eof)):
                         appendinst(_jump(0, a1info._offset + 1))
                     # Patch instruction at a1, which makes our patch live.
                     self._program[a1info._offset] = _jump(0, oldproglen)
                     # Update self._lastannotate in place. This serves as a cache to avoid
                     # expensive "self.annotate" in this function, when "replacelines" is
                     # used continuously.
                     if len(self._lastannotate.lines) > a1:
                         self._lastannotate.lines[a1]._offset = a1instpc
                     else:
                         assert isinstance(a1inst, _eof)
                         self._lastannotate._eof = a1instpc
                     self._lastannotate.lines[a1:a2] = blineinfos
                     self._lastannotate.rev = max(self._lastannotate.rev, rev)
                     if rev > self._maxrev:
                         self._maxrev = rev
                 def annotate(self, rev):
                     pc = 1
                     lines = []
                     executed = 0
                     # Sanity check: if instructions executed exceeds len(program), we
                     # hit an infinite loop in the linelog program somehow and we
                     # should stop.
                     while pc is not None and executed < len(self._program):
                         inst = self._program[pc]
                         lastpc = pc
                         pc = inst.execute(rev, pc, lines.append)
                         executed += 1
                     if pc is not None:
                         raise LineLogError(
                             r'Probably hit an infinite loop in linelog. Program:\n'
                             + self.debugstr()
                         )
                     ar = annotateresult(rev, lines, lastpc)
                     self._lastannotate = ar
                     return ar
                 @property
                 def maxrev(self):
                     return self._maxrev
                 # Stateful methods which depend on the value of the last
                 # annotation run. This API is for compatiblity with the original
                 # linelog, and we should probably consider refactoring it.
                 @property
                 def annotateresult(self):
                     """Return the last annotation result. C linelog code exposed this."""
                     return [(l.rev, l.linenum) for l in self._lastannotate.lines]
                 def getoffset(self, line):
                     return self._lastannotate.lines[line]._offset
                 def getalllines(self, start=0, end=0):
                     """Get all lines that ever occurred in [start, end).
                     Passing start == end == 0 means "all lines ever".
                     This works in terms of *internal* program offsets, not line numbers.
                     """
                     pc = start or 1
                     lines = []
                     # only take as many steps as there are instructions in the
                     # program - if we don't find an EOF or our stop-line before
                     # then, something is badly broken.
                     for step in pycompat.xrange(len(self._program)):
                         inst = self._program[pc]
                         nextpc = pc + 1
                         if isinstance(inst, _jump):
                             nextpc = inst._target
                         elif isinstance(inst, _eof):
                             return lines
                         elif isinstance(inst, (_jl, _jge)):
                             pass
                         elif isinstance(inst, _line):
                             lines.append((inst._rev, inst._origlineno))
                         else:
                             raise LineLogError(b"Illegal instruction %r" % inst)
                         if nextpc == end:
                             return lines
                         pc = nextpc
                     raise LineLogError(b"Failed to perform getalllines")

mercurial/manifest.py

0 +3 -1

             # manifest.py - manifest revision class for mercurial
             #
             # Copyright 2005-2007 Matt Mackall <mpm@selenic.com>
             #
             # This software may be used and distributed according to the terms of the
             # GNU General Public License version 2 or any later version.
             from __future__ import absolute_import
             import heapq
             import itertools
             import struct
             import weakref
             from .i18n import _
             from .node import (
                 bin,
                 hex,
                 nullid,
                 nullrev,
             )
             from .pycompat import getattr
             from . import (
+                encoding,
                 error,
                 mdiff,
                 pathutil,
                 policy,
                 pycompat,
                 revlog,
                 util,
             )
             from .interfaces import (
                 repository,
                 util as interfaceutil,
             )
             parsers = policy.importmod('parsers')
             propertycache = util.propertycache
             # Allow tests to more easily test the alternate path in manifestdict.fastdelta()
             FASTDELTA_TEXTDIFF_THRESHOLD = 1000
             def _parse(data):
                 # This method does a little bit of excessive-looking
                 # precondition checking. This is so that the behavior of this
                 # class exactly matches its C counterpart to try and help
                 # prevent surprise breakage for anyone that develops against
                 # the pure version.
                 if data and data[-1:] != b'\n':
                     raise ValueError(b'Manifest did not end in a newline.')
                 prev = None
                 for l in data.splitlines():
                     if prev is not None and prev > l:
                         raise ValueError(b'Manifest lines not in sorted order.')
                     prev = l
                     f, n = l.split(b'\0')
                     if len(n) > 40:
                         yield f, bin(n[:40]), n[40:]
                     else:
                         yield f, bin(n), b''
             def _text(it):
                 files = []
                 lines = []
                 for f, n, fl in it:
                     files.append(f)
                     # if this is changed to support newlines in filenames,
                     # be sure to check the templates/ dir again (especially *-raw.tmpl)
                     lines.append(b"%s\0%s%s\n" % (f, hex(n), fl))
                 _checkforbidden(files)
                 return b''.join(lines)
             class lazymanifestiter(object):
                 def __init__(self, lm):
                     self.pos = 0
                     self.lm = lm
                 def __iter__(self):
                     return self
                 def next(self):
                     try:
                         data, pos = self.lm._get(self.pos)
                     except IndexError:
                         raise StopIteration
                     if pos == -1:
                         self.pos += 1
                         return data[0]
                     self.pos += 1
                     zeropos = data.find(b'\x00', pos)
                     return data[pos:zeropos]
                 __next__ = next
             class lazymanifestiterentries(object):
                 def __init__(self, lm):
                     self.lm = lm
                     self.pos = 0
                 def __iter__(self):
                     return self
                 def next(self):
                     try:
                         data, pos = self.lm._get(self.pos)
                     except IndexError:
                         raise StopIteration
                     if pos == -1:
                         self.pos += 1
                         return data
                     zeropos = data.find(b'\x00', pos)
                     hashval = unhexlify(data, self.lm.extrainfo[self.pos], zeropos + 1, 40)
                     flags = self.lm._getflags(data, self.pos, zeropos)
                     self.pos += 1
                     return (data[pos:zeropos], hashval, flags)
                 __next__ = next
             def unhexlify(data, extra, pos, length):
                 s = bin(data[pos : pos + length])
                 if extra:
                     s += chr(extra & 0xFF)
                 return s
             def _cmp(a, b):
                 return (a > b) - (a < b)
             class _lazymanifest(object):
                 """A pure python manifest backed by a byte string.  It is supplimented with
                 internal lists as it is modified, until it is compacted back to a pure byte
                 string.
                 ``data`` is the initial manifest data.
                 ``positions`` is a list of offsets, one per manifest entry.  Positive
                 values are offsets into ``data``, negative values are offsets into the
                 ``extradata`` list.  When an entry is removed, its entry is dropped from
                 ``positions``.  The values are encoded such that when walking the list and
                 indexing into ``data`` or ``extradata`` as appropriate, the entries are
                 sorted by filename.
                 ``extradata`` is a list of (key, hash, flags) for entries that were added or
                 modified since the manifest was created or compacted.
                 """
                 def __init__(
                     self,
                     data,
                     positions=None,
                     extrainfo=None,
                     extradata=None,
                     hasremovals=False,
                 ):
                     if positions is None:
                         self.positions = self.findlines(data)
                         self.extrainfo = [0] * len(self.positions)
                         self.data = data
                         self.extradata = []
                         self.hasremovals = False
                     else:
                         self.positions = positions[:]
                         self.extrainfo = extrainfo[:]
                         self.extradata = extradata[:]
                         self.data = data
                         self.hasremovals = hasremovals
                 def findlines(self, data):
                     if not data:
                         return []
                     pos = data.find(b"\n")
                     if pos == -1 or data[-1:] != b'\n':
                         raise ValueError(b"Manifest did not end in a newline.")
                     positions = [0]
                     prev = data[: data.find(b'\x00')]
                     while pos < len(data) - 1 and pos != -1:
                         positions.append(pos + 1)
                         nexts = data[pos + 1 : data.find(b'\x00', pos + 1)]
                         if nexts < prev:
                             raise ValueError(b"Manifest lines not in sorted order.")
                         prev = nexts
                         pos = data.find(b"\n", pos + 1)
                     return positions
                 def _get(self, index):
                     # get the position encoded in pos:
                     #   positive number is an index in 'data'
                     #   negative number is in extrapieces
                     pos = self.positions[index]
                     if pos >= 0:
                         return self.data, pos
                     return self.extradata[-pos - 1], -1
                 def _getkey(self, pos):
                     if pos >= 0:
                         return self.data[pos : self.data.find(b'\x00', pos + 1)]
                     return self.extradata[-pos - 1][0]
                 def bsearch(self, key):
                     first = 0
                     last = len(self.positions) - 1
                     while first <= last:
                         midpoint = (first + last) // 2
                         nextpos = self.positions[midpoint]
                         candidate = self._getkey(nextpos)
                         r = _cmp(key, candidate)
                         if r == 0:
                             return midpoint
                         else:
                             if r < 0:
                                 last = midpoint - 1
                             else:
                                 first = midpoint + 1
                     return -1
                 def bsearch2(self, key):
                     # same as the above, but will always return the position
                     # done for performance reasons
                     first = 0
                     last = len(self.positions) - 1
                     while first <= last:
                         midpoint = (first + last) // 2
                         nextpos = self.positions[midpoint]
                         candidate = self._getkey(nextpos)
                         r = _cmp(key, candidate)
                         if r == 0:
                             return (midpoint, True)
                         else:
                             if r < 0:
                                 last = midpoint - 1
                             else:
                                 first = midpoint + 1
                     return (first, False)
                 def __contains__(self, key):
                     return self.bsearch(key) != -1
                 def _getflags(self, data, needle, pos):
                     start = pos + 41
                     end = data.find(b"\n", start)
                     if end == -1:
                         end = len(data) - 1
                     if start == end:
                         return b''
                     return self.data[start:end]
                 def __getitem__(self, key):
                     if not isinstance(key, bytes):
                         raise TypeError(b"getitem: manifest keys must be a bytes.")
                     needle = self.bsearch(key)
                     if needle == -1:
                         raise KeyError
                     data, pos = self._get(needle)
                     if pos == -1:
                         return (data[1], data[2])
                     zeropos = data.find(b'\x00', pos)
                     assert 0 <= needle <= len(self.positions)
                     assert len(self.extrainfo) == len(self.positions)
                     hashval = unhexlify(data, self.extrainfo[needle], zeropos + 1, 40)
                     flags = self._getflags(data, needle, zeropos)
                     return (hashval, flags)
                 def __delitem__(self, key):
                     needle, found = self.bsearch2(key)
                     if not found:
                         raise KeyError
                     cur = self.positions[needle]
                     self.positions = self.positions[:needle] + self.positions[needle + 1 :]
                     self.extrainfo = self.extrainfo[:needle] + self.extrainfo[needle + 1 :]
                     if cur >= 0:
                         # This does NOT unsort the list as far as the search functions are
                         # concerned, as they only examine lines mapped by self.positions.
                         self.data = self.data[:cur] + b'\x00' + self.data[cur + 1 :]
                         self.hasremovals = True
                 def __setitem__(self, key, value):
                     if not isinstance(key, bytes):
                         raise TypeError(b"setitem: manifest keys must be a byte string.")
                     if not isinstance(value, tuple) or len(value) != 2:
                         raise TypeError(
                             b"Manifest values must be a tuple of (node, flags)."
                         )
                     hashval = value[0]
                     if not isinstance(hashval, bytes) or not 20 <= len(hashval) <= 22:
                         raise TypeError(b"node must be a 20-byte byte string")
                     flags = value[1]
                     if len(hashval) == 22:
                         hashval = hashval[:-1]
                     if not isinstance(flags, bytes) or len(flags) > 1:
                         raise TypeError(b"flags must a 0 or 1 byte string, got %r", flags)
                     needle, found = self.bsearch2(key)
                     if found:
                         # put the item
                         pos = self.positions[needle]
                         if pos < 0:
                             self.extradata[-pos - 1] = (key, hashval, value[1])
                         else:
                             # just don't bother
                             self.extradata.append((key, hashval, value[1]))
                             self.positions[needle] = -len(self.extradata)
                     else:
                         # not found, put it in with extra positions
                         self.extradata.append((key, hashval, value[1]))
                         self.positions = (
                             self.positions[:needle]
                             + [-len(self.extradata)]
                             + self.positions[needle:]
                         )
                         self.extrainfo = (
                             self.extrainfo[:needle] + [0] + self.extrainfo[needle:]
                         )
                 def copy(self):
                     # XXX call _compact like in C?
                     return _lazymanifest(
                         self.data,
                         self.positions,
                         self.extrainfo,
                         self.extradata,
                         self.hasremovals,
                     )
                 def _compact(self):
                     # hopefully not called TOO often
                     if len(self.extradata) == 0 and not self.hasremovals:
                         return
                     l = []
                     i = 0
                     offset = 0
                     self.extrainfo = [0] * len(self.positions)
                     while i < len(self.positions):
                         if self.positions[i] >= 0:
                             cur = self.positions[i]
                             last_cut = cur
                             # Collect all contiguous entries in the buffer at the current
                             # offset, breaking out only for added/modified items held in
                             # extradata, or a deleted line prior to the next position.
                             while True:
                                 self.positions[i] = offset
                                 i += 1
                                 if i == len(self.positions) or self.positions[i] < 0:
                                     break
                                 # A removed file has no positions[] entry, but does have an
                                 # overwritten first byte.  Break out and find the end of the
                                 # current good entry/entries if there is a removed file
                                 # before the next position.
                                 if (
                                     self.hasremovals
                                     and self.data.find(b'\n\x00', cur, self.positions[i])
                                     != -1
                                 ):
                                     break
                                 offset += self.positions[i] - cur
                                 cur = self.positions[i]
                             end_cut = self.data.find(b'\n', cur)
                             if end_cut != -1:
                                 end_cut += 1
                             offset += end_cut - cur
                             l.append(self.data[last_cut:end_cut])
                         else:
                             while i < len(self.positions) and self.positions[i] < 0:
                                 cur = self.positions[i]
                                 t = self.extradata[-cur - 1]
                                 l.append(self._pack(t))
                                 self.positions[i] = offset
                                 if len(t[1]) > 20:
                                     self.extrainfo[i] = ord(t[1][21])
                                 offset += len(l[-1])
                                 i += 1
                     self.data = b''.join(l)
                     self.hasremovals = False
                     self.extradata = []
                 def _pack(self, d):
                     return d[0] + b'\x00' + hex(d[1][:20]) + d[2] + b'\n'
                 def text(self):
                     self._compact()
                     return self.data
                 def diff(self, m2, clean=False):
                     '''Finds changes between the current manifest and m2.'''
                     # XXX think whether efficiency matters here
                     diff = {}
                     for fn, e1, flags in self.iterentries():
                         if fn not in m2:
                             diff[fn] = (e1, flags), (None, b'')
                         else:
                             e2 = m2[fn]
                             if (e1, flags) != e2:
                                 diff[fn] = (e1, flags), e2
                             elif clean:
                                 diff[fn] = None
                     for fn, e2, flags in m2.iterentries():
                         if fn not in self:
                             diff[fn] = (None, b''), (e2, flags)
                     return diff
                 def iterentries(self):
                     return lazymanifestiterentries(self)
                 def iterkeys(self):
                     return lazymanifestiter(self)
                 def __iter__(self):
                     return lazymanifestiter(self)
                 def __len__(self):
                     return len(self.positions)
                 def filtercopy(self, filterfn):
                     # XXX should be optimized
                     c = _lazymanifest(b'')
                     for f, n, fl in self.iterentries():
                         if filterfn(f):
                             c[f] = n, fl
                     return c
             try:
                 _lazymanifest = parsers.lazymanifest
             except AttributeError:
                 pass
             @interfaceutil.implementer(repository.imanifestdict)
             class manifestdict(object):
                 def __init__(self, data=b''):
                     self._lm = _lazymanifest(data)
                 def __getitem__(self, key):
                     return self._lm[key][0]
                 def find(self, key):
                     return self._lm[key]
                 def __len__(self):
                     return len(self._lm)
                 def __nonzero__(self):
                     # nonzero is covered by the __len__ function, but implementing it here
                     # makes it easier for extensions to override.
                     return len(self._lm) != 0
                 __bool__ = __nonzero__
                 def __setitem__(self, key, node):
                     self._lm[key] = node, self.flags(key, b'')
                 def __contains__(self, key):
                     if key is None:
                         return False
                     return key in self._lm
                 def __delitem__(self, key):
                     del self._lm[key]
                 def __iter__(self):
                     return self._lm.__iter__()
                 def iterkeys(self):
                     return self._lm.iterkeys()
                 def keys(self):
                     return list(self.iterkeys())
                 def filesnotin(self, m2, match=None):
                     '''Set of files in this manifest that are not in the other'''
                     if match:
                         m1 = self.matches(match)
                         m2 = m2.matches(match)
                         return m1.filesnotin(m2)
                     diff = self.diff(m2)
                     files = set(
                         filepath
                         for filepath, hashflags in pycompat.iteritems(diff)
                         if hashflags[1][0] is None
                     )
                     return files
                 @propertycache
                 def _dirs(self):
                     return pathutil.dirs(self)
                 def dirs(self):
                     return self._dirs
                 def hasdir(self, dir):
                     return dir in self._dirs
                 def _filesfastpath(self, match):
                     '''Checks whether we can correctly and quickly iterate over matcher
                     files instead of over manifest files.'''
                     files = match.files()
                     return len(files) < 100 and (
                         match.isexact()
                         or (match.prefix() and all(fn in self for fn in files))
                     )
                 def walk(self, match):
                     '''Generates matching file names.
                     Equivalent to manifest.matches(match).iterkeys(), but without creating
                     an entirely new manifest.
                     It also reports nonexistent files by marking them bad with match.bad().
                     '''
                     if match.always():
                         for f in iter(self):
                             yield f
                         return
                     fset = set(match.files())
                     # avoid the entire walk if we're only looking for specific files
                     if self._filesfastpath(match):
                         for fn in sorted(fset):
                             yield fn
                         return
                     for fn in self:
                         if fn in fset:
                             # specified pattern is the exact name
                             fset.remove(fn)
                         if match(fn):
                             yield fn
                     # for dirstate.walk, files=[''] means "walk the whole tree".
                     # follow that here, too
                     fset.discard(b'')
                     for fn in sorted(fset):
                         if not self.hasdir(fn):
                             match.bad(fn, None)
                 def matches(self, match):
                     '''generate a new manifest filtered by the match argument'''
                     if match.always():
                         return self.copy()
                     if self._filesfastpath(match):
                         m = manifestdict()
                         lm = self._lm
                         for fn in match.files():
                             if fn in lm:
                                 m._lm[fn] = lm[fn]
                         return m
                     m = manifestdict()
                     m._lm = self._lm.filtercopy(match)
                     return m
                 def diff(self, m2, match=None, clean=False):
                     '''Finds changes between the current manifest and m2.
                     Args:
                       m2: the manifest to which this manifest should be compared.
                       clean: if true, include files unchanged between these manifests
                              with a None value in the returned dictionary.
                     The result is returned as a dict with filename as key and
                     values of the form ((n1,fl1),(n2,fl2)), where n1/n2 is the
                     nodeid in the current/other manifest and fl1/fl2 is the flag
                     in the current/other manifest. Where the file does not exist,
                     the nodeid will be None and the flags will be the empty
                     string.
                     '''
                     if match:
                         m1 = self.matches(match)
                         m2 = m2.matches(match)
                         return m1.diff(m2, clean=clean)
                     return self._lm.diff(m2._lm, clean)
                 def setflag(self, key, flag):
                     self._lm[key] = self[key], flag
                 def get(self, key, default=None):
                     try:
                         return self._lm[key][0]
                     except KeyError:
                         return default
                 def flags(self, key, default=b''):
                     try:
                         return self._lm[key][1]
                     except KeyError:
                         return default
                 def copy(self):
                     c = manifestdict()
                     c._lm = self._lm.copy()
                     return c
                 def items(self):
                     return (x[:2] for x in self._lm.iterentries())
                 def iteritems(self):
                     return (x[:2] for x in self._lm.iterentries())
                 def iterentries(self):
                     return self._lm.iterentries()
                 def text(self):
                     # most likely uses native version
                     return self._lm.text()
                 def fastdelta(self, base, changes):
                     """Given a base manifest text as a bytearray and a list of changes
                     relative to that text, compute a delta that can be used by revlog.
                     """
                     delta = []
                     dstart = None
                     dend = None
                     dline = [b""]
                     start = 0
                     # zero copy representation of base as a buffer
                     addbuf = util.buffer(base)
                     changes = list(changes)
                     if len(changes) < FASTDELTA_TEXTDIFF_THRESHOLD:
                         # start with a readonly loop that finds the offset of
                         # each line and creates the deltas
                         for f, todelete in changes:
                             # bs will either be the index of the item or the insert point
                             start, end = _msearch(addbuf, f, start)
                             if not todelete:
                                 h, fl = self._lm[f]
                                 l = b"%s\0%s%s\n" % (f, hex(h), fl)
                             else:
                                 if start == end:
                                     # item we want to delete was not found, error out
                                     raise AssertionError(
                                         _(b"failed to remove %s from manifest") % f
                                     )
                                 l = b""
                             if dstart is not None and dstart <= start and dend >= start:
                                 if dend < end:
                                     dend = end
                                 if l:
                                     dline.append(l)
                             else:
                                 if dstart is not None:
                                     delta.append([dstart, dend, b"".join(dline)])
                                 dstart = start
                                 dend = end
                                 dline = [l]
                         if dstart is not None:
                             delta.append([dstart, dend, b"".join(dline)])
                         # apply the delta to the base, and get a delta for addrevision
                         deltatext, arraytext = _addlistdelta(base, delta)
                     else:
                         # For large changes, it's much cheaper to just build the text and
                         # diff it.
                         arraytext = bytearray(self.text())
                         deltatext = mdiff.textdiff(
                             util.buffer(base), util.buffer(arraytext)
                         )
                     return arraytext, deltatext
             def _msearch(m, s, lo=0, hi=None):
                 '''return a tuple (start, end) that says where to find s within m.
                 If the string is found m[start:end] are the line containing
                 that string.  If start == end the string was not found and
                 they indicate the proper sorted insertion point.
                 m should be a buffer, a memoryview or a byte string.
                 s is a byte string'''
                 def advance(i, c):
                     while i < lenm and m[i : i + 1] != c:
                         i += 1
                     return i
                 if not s:
                     return (lo, lo)
                 lenm = len(m)
                 if not hi:
                     hi = lenm
                 while lo < hi:
                     mid = (lo + hi) // 2
                     start = mid
                     while start > 0 and m[start - 1 : start] != b'\n':
                         start -= 1
                     end = advance(start, b'\0')
                     if bytes(m[start:end]) < s:
                         # we know that after the null there are 40 bytes of sha1
                         # this translates to the bisect lo = mid + 1
                         lo = advance(end + 40, b'\n') + 1
                     else:
                         # this translates to the bisect hi = mid
                         hi = start
                 end = advance(lo, b'\0')
                 found = m[lo:end]
                 if s == found:
                     # we know that after the null there are 40 bytes of sha1
                     end = advance(end + 40, b'\n')
                     return (lo, end + 1)
                 else:
                     return (lo, lo)
             def _checkforbidden(l):
                 """Check filenames for illegal characters."""
                 for f in l:
                     if b'\n' in f or b'\r' in f:
                         raise error.StorageError(
                             _(b"'\\n' and '\\r' disallowed in filenames: %r")
                             % pycompat.bytestr(f)
                         )
             # apply the changes collected during the bisect loop to our addlist
             # return a delta suitable for addrevision
             def _addlistdelta(addlist, x):
                 # for large addlist arrays, building a new array is cheaper
                 # than repeatedly modifying the existing one
                 currentposition = 0
                 newaddlist = bytearray()
                 for start, end, content in x:
                     newaddlist += addlist[currentposition:start]
                     if content:
                         newaddlist += bytearray(content)
                     currentposition = end
                 newaddlist += addlist[currentposition:]
                 deltatext = b"".join(
                     struct.pack(b">lll", start, end, len(content)) + content
                     for start, end, content in x
                 )
                 return deltatext, newaddlist
             def _splittopdir(f):
                 if b'/' in f:
                     dir, subpath = f.split(b'/', 1)
                     return dir + b'/', subpath
                 else:
                     return b'', f
             _noop = lambda s: None
             class treemanifest(object):
                 def __init__(self, dir=b'', text=b''):
                     self._dir = dir
                     self._node = nullid
                     self._loadfunc = _noop
                     self._copyfunc = _noop
                     self._dirty = False
                     self._dirs = {}
                     self._lazydirs = {}
                     # Using _lazymanifest here is a little slower than plain old dicts
                     self._files = {}
                     self._flags = {}
                     if text:
                         def readsubtree(subdir, subm):
                             raise AssertionError(
                                 b'treemanifest constructor only accepts flat manifests'
                             )
                         self.parse(text, readsubtree)
                         self._dirty = True  # Mark flat manifest dirty after parsing
                 def _subpath(self, path):
                     return self._dir + path
                 def _loadalllazy(self):
                     selfdirs = self._dirs
                     for d, (path, node, readsubtree, docopy) in pycompat.iteritems(
                         self._lazydirs
                     ):
                         if docopy:
                             selfdirs[d] = readsubtree(path, node).copy()
                         else:
                             selfdirs[d] = readsubtree(path, node)
                     self._lazydirs = {}
                 def _loadlazy(self, d):
                     v = self._lazydirs.get(d)
                     if v:
                         path, node, readsubtree, docopy = v
                         if docopy:
                             self._dirs[d] = readsubtree(path, node).copy()
                         else:
                             self._dirs[d] = readsubtree(path, node)
                         del self._lazydirs[d]
                 def _loadchildrensetlazy(self, visit):
                     if not visit:
                         return None
                     if visit == b'all' or visit == b'this':
                         self._loadalllazy()
                         return None
                     loadlazy = self._loadlazy
                     for k in visit:
                         loadlazy(k + b'/')
                     return visit
                 def _loaddifflazy(self, t1, t2):
                     """load items in t1 and t2 if they're needed for diffing.
                     The criteria currently is:
                     - if it's not present in _lazydirs in either t1 or t2, load it in the
                       other (it may already be loaded or it may not exist, doesn't matter)
                     - if it's present in _lazydirs in both, compare the nodeid; if it
                       differs, load it in both
                     """
                     toloadlazy = []
                     for d, v1 in pycompat.iteritems(t1._lazydirs):
                         v2 = t2._lazydirs.get(d)
                         if not v2 or v2[1] != v1[1]:
                             toloadlazy.append(d)
                     for d, v1 in pycompat.iteritems(t2._lazydirs):
                         if d not in t1._lazydirs:
                             toloadlazy.append(d)
                     for d in toloadlazy:
                         t1._loadlazy(d)
                         t2._loadlazy(d)
                 def __len__(self):
                     self._load()
                     size = len(self._files)
                     self._loadalllazy()
                     for m in self._dirs.values():
                         size += m.__len__()
                     return size
                 def __nonzero__(self):
                     # Faster than "__len() != 0" since it avoids loading sub-manifests
                     return not self._isempty()
                 __bool__ = __nonzero__
                 def _isempty(self):
                     self._load()  # for consistency; already loaded by all callers
                     # See if we can skip loading everything.
                     if self._files or (
                         self._dirs and any(not m._isempty() for m in self._dirs.values())
                     ):
                         return False
                     self._loadalllazy()
                     return not self._dirs or all(m._isempty() for m in self._dirs.values())
+                @encoding.strmethod
                 def __repr__(self):
                     return (
-                        b'<treemanifest dir=%s, node=%s, loaded=%s, dirty=%s at 0x%x>'
+                        b'<treemanifest dir=%s, node=%s, loaded=%r, dirty=%r at 0x%x>'
                         % (
                             self._dir,
                             hex(self._node),
                             bool(self._loadfunc is _noop),
                             self._dirty,
                             id(self),
                         )
                     )
                 def dir(self):
                     '''The directory that this tree manifest represents, including a
                     trailing '/'. Empty string for the repo root directory.'''
                     return self._dir
                 def node(self):
                     '''This node of this instance. nullid for unsaved instances. Should
                     be updated when the instance is read or written from a revlog.
                     '''
                     assert not self._dirty
                     return self._node
                 def setnode(self, node):
                     self._node = node
                     self._dirty = False
                 def iterentries(self):
                     self._load()
                     self._loadalllazy()
                     for p, n in sorted(
                         itertools.chain(self._dirs.items(), self._files.items())
                     ):
                         if p in self._files:
                             yield self._subpath(p), n, self._flags.get(p, b'')
                         else:
                             for x in n.iterentries():
                                 yield x
                 def items(self):
                     self._load()
                     self._loadalllazy()
                     for p, n in sorted(
                         itertools.chain(self._dirs.items(), self._files.items())
                     ):
                         if p in self._files:
                             yield self._subpath(p), n
                         else:
                             for f, sn in pycompat.iteritems(n):
                                 yield f, sn
                 iteritems = items
                 def iterkeys(self):
                     self._load()
                     self._loadalllazy()
                     for p in sorted(itertools.chain(self._dirs, self._files)):
                         if p in self._files:
                             yield self._subpath(p)
                         else:
                             for f in self._dirs[p]:
                                 yield f
                 def keys(self):
                     return list(self.iterkeys())
                 def __iter__(self):
                     return self.iterkeys()
                 def __contains__(self, f):
                     if f is None:
                         return False
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         if dir not in self._dirs:
                             return False
                         return self._dirs[dir].__contains__(subpath)
                     else:
                         return f in self._files
                 def get(self, f, default=None):
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         if dir not in self._dirs:
                             return default
                         return self._dirs[dir].get(subpath, default)
                     else:
                         return self._files.get(f, default)
                 def __getitem__(self, f):
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         return self._dirs[dir].__getitem__(subpath)
                     else:
                         return self._files[f]
                 def flags(self, f):
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         if dir not in self._dirs:
                             return b''
                         return self._dirs[dir].flags(subpath)
                     else:
                         if f in self._lazydirs or f in self._dirs:
                             return b''
                         return self._flags.get(f, b'')
                 def find(self, f):
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         return self._dirs[dir].find(subpath)
                     else:
                         return self._files[f], self._flags.get(f, b'')
                 def __delitem__(self, f):
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         self._dirs[dir].__delitem__(subpath)
                         # If the directory is now empty, remove it
                         if self._dirs[dir]._isempty():
                             del self._dirs[dir]
                     else:
                         del self._files[f]
                         if f in self._flags:
                             del self._flags[f]
                     self._dirty = True
                 def __setitem__(self, f, n):
                     assert n is not None
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         if dir not in self._dirs:
                             self._dirs[dir] = treemanifest(self._subpath(dir))
                         self._dirs[dir].__setitem__(subpath, n)
                     else:
                         self._files[f] = n[:21]  # to match manifestdict's behavior
                     self._dirty = True
                 def _load(self):
                     if self._loadfunc is not _noop:
                         lf, self._loadfunc = self._loadfunc, _noop
                         lf(self)
                     elif self._copyfunc is not _noop:
                         cf, self._copyfunc = self._copyfunc, _noop
                         cf(self)
                 def setflag(self, f, flags):
                     """Set the flags (symlink, executable) for path f."""
                     self._load()
                     dir, subpath = _splittopdir(f)
                     if dir:
                         self._loadlazy(dir)
                         if dir not in self._dirs:
                             self._dirs[dir] = treemanifest(self._subpath(dir))
                         self._dirs[dir].setflag(subpath, flags)
                     else:
                         self._flags[f] = flags
                     self._dirty = True
                 def copy(self):
                     copy = treemanifest(self._dir)
                     copy._node = self._node
                     copy._dirty = self._dirty
                     if self._copyfunc is _noop:
                         def _copyfunc(s):
                             self._load()
                             s._lazydirs = {
                                 d: (p, n, r, True)
                                 for d, (p, n, r, c) in pycompat.iteritems(self._lazydirs)
                             }
                             sdirs = s._dirs
                             for d, v in pycompat.iteritems(self._dirs):
                                 sdirs[d] = v.copy()
                             s._files = dict.copy(self._files)
                             s._flags = dict.copy(self._flags)
                         if self._loadfunc is _noop:
                             _copyfunc(copy)
                         else:
                             copy._copyfunc = _copyfunc
                     else:
                         copy._copyfunc = self._copyfunc
                     return copy
                 def filesnotin(self, m2, match=None):
                     '''Set of files in this manifest that are not in the other'''
                     if match and not match.always():
                         m1 = self.matches(match)
                         m2 = m2.matches(match)
                         return m1.filesnotin(m2)
                     files = set()
                     def _filesnotin(t1, t2):
                         if t1._node == t2._node and not t1._dirty and not t2._dirty:
                             return
                         t1._load()
                         t2._load()
                         self._loaddifflazy(t1, t2)
                         for d, m1 in pycompat.iteritems(t1._dirs):
                             if d in t2._dirs:
                                 m2 = t2._dirs[d]
                                 _filesnotin(m1, m2)
                             else:
                                 files.update(m1.iterkeys())
                         for fn in t1._files:
                             if fn not in t2._files:
                                 files.add(t1._subpath(fn))
                     _filesnotin(self, m2)
                     return files
                 @propertycache
                 def _alldirs(self):
                     return pathutil.dirs(self)
                 def dirs(self):
                     return self._alldirs
                 def hasdir(self, dir):
                     self._load()
                     topdir, subdir = _splittopdir(dir)
                     if topdir:
                         self._loadlazy(topdir)
                         if topdir in self._dirs:
                             return self._dirs[topdir].hasdir(subdir)
                         return False
                     dirslash = dir + b'/'
                     return dirslash in self._dirs or dirslash in self._lazydirs
                 def walk(self, match):
                     '''Generates matching file names.
                     Equivalent to manifest.matches(match).iterkeys(), but without creating
                     an entirely new manifest.
                     It also reports nonexistent files by marking them bad with match.bad().
                     '''
                     if match.always():
                         for f in iter(self):
                             yield f
                         return
                     fset = set(match.files())
                     for fn in self._walk(match):
                         if fn in fset:
                             # specified pattern is the exact name
                             fset.remove(fn)
                         yield fn
                     # for dirstate.walk, files=[''] means "walk the whole tree".
                     # follow that here, too
                     fset.discard(b'')
                     for fn in sorted(fset):
                         if not self.hasdir(fn):
                             match.bad(fn, None)
                 def _walk(self, match):
                     '''Recursively generates matching file names for walk().'''
                     visit = match.visitchildrenset(self._dir[:-1])
                     if not visit:
                         return
                     # yield this dir's files and walk its submanifests
                     self._load()
                     visit = self._loadchildrensetlazy(visit)
                     for p in sorted(list(self._dirs) + list(self._files)):
                         if p in self._files:
                             fullp = self._subpath(p)
                             if match(fullp):
                                 yield fullp
                         else:
                             if not visit or p[:-1] in visit:
                                 for f in self._dirs[p]._walk(match):
                                     yield f
                 def matches(self, match):
                     '''generate a new manifest filtered by the match argument'''
                     if match.always():
                         return self.copy()
                     return self._matches(match)
                 def _matches(self, match):
                     '''recursively generate a new manifest filtered by the match argument.
                     '''
                     visit = match.visitchildrenset(self._dir[:-1])
                     if visit == b'all':
                         return self.copy()
                     ret = treemanifest(self._dir)
                     if not visit:
                         return ret
                     self._load()
                     for fn in self._files:
                         # While visitchildrenset *usually* lists only subdirs, this is
                         # actually up to the matcher and may have some files in the set().
                         # If visit == 'this', we should obviously look at the files in this
                         # directory; if visit is a set, and fn is in it, we should inspect
                         # fn (but no need to inspect things not in the set).
                         if visit != b'this' and fn not in visit:
                             continue
                         fullp = self._subpath(fn)
                         # visitchildrenset isn't perfect, we still need to call the regular
                         # matcher code to further filter results.
                         if not match(fullp):
                             continue
                         ret._files[fn] = self._files[fn]
                         if fn in self._flags:
                             ret._flags[fn] = self._flags[fn]
                     visit = self._loadchildrensetlazy(visit)
                     for dir, subm in pycompat.iteritems(self._dirs):
                         if visit and dir[:-1] not in visit:
                             continue
                         m = subm._matches(match)
                         if not m._isempty():
                             ret._dirs[dir] = m
                     if not ret._isempty():
                         ret._dirty = True
                     return ret
                 def diff(self, m2, match=None, clean=False):
                     '''Finds changes between the current manifest and m2.
                     Args:
                       m2: the manifest to which this manifest should be compared.
                       clean: if true, include files unchanged between these manifests
                              with a None value in the returned dictionary.
                     The result is returned as a dict with filename as key and
                     values of the form ((n1,fl1),(n2,fl2)), where n1/n2 is the
                     nodeid in the current/other manifest and fl1/fl2 is the flag
                     in the current/other manifest. Where the file does not exist,
                     the nodeid will be None and the flags will be the empty
                     string.
                     '''
                     if match and not match.always():
                         m1 = self.matches(match)
                         m2 = m2.matches(match)
                         return m1.diff(m2, clean=clean)
                     result = {}
                     emptytree = treemanifest()
                     def _iterativediff(t1, t2, stack):
                         """compares two tree manifests and append new tree-manifests which
                         needs to be compared to stack"""
                         if t1._node == t2._node and not t1._dirty and not t2._dirty:
                             return
                         t1._load()
                         t2._load()
                         self._loaddifflazy(t1, t2)
                         for d, m1 in pycompat.iteritems(t1._dirs):
                             m2 = t2._dirs.get(d, emptytree)
                             stack.append((m1, m2))
                         for d, m2 in pycompat.iteritems(t2._dirs):
                             if d not in t1._dirs:
                                 stack.append((emptytree, m2))
                         for fn, n1 in pycompat.iteritems(t1._files):
                             fl1 = t1._flags.get(fn, b'')
                             n2 = t2._files.get(fn, None)
                             fl2 = t2._flags.get(fn, b'')
                             if n1 != n2 or fl1 != fl2:
                                 result[t1._subpath(fn)] = ((n1, fl1), (n2, fl2))
                             elif clean:
                                 result[t1._subpath(fn)] = None
                         for fn, n2 in pycompat.iteritems(t2._files):
                             if fn not in t1._files:
                                 fl2 = t2._flags.get(fn, b'')
                                 result[t2._subpath(fn)] = ((None, b''), (n2, fl2))
                     stackls = []
                     _iterativediff(self, m2, stackls)
                     while stackls:
                         t1, t2 = stackls.pop()
                         # stackls is populated in the function call
                         _iterativediff(t1, t2, stackls)
                     return result
                 def unmodifiedsince(self, m2):
                     return not self._dirty and not m2._dirty and self._node == m2._node
                 def parse(self, text, readsubtree):
                     selflazy = self._lazydirs
                     subpath = self._subpath
                     for f, n, fl in _parse(text):
                         if fl == b't':
                             f = f + b'/'
                             # False below means "doesn't need to be copied" and can use the
                             # cached value from readsubtree directly.
                             selflazy[f] = (subpath(f), n, readsubtree, False)
                         elif b'/' in f:
                             # This is a flat manifest, so use __setitem__ and setflag rather
                             # than assigning directly to _files and _flags, so we can
                             # assign a path in a subdirectory, and to mark dirty (compared
                             # to nullid).
                             self[f] = n
                             if fl:
                                 self.setflag(f, fl)
                         else:
                             # Assigning to _files and _flags avoids marking as dirty,
                             # and should be a little faster.
                             self._files[f] = n
                             if fl:
                                 self._flags[f] = fl
                 def text(self):
                     """Get the full data of this manifest as a bytestring."""
                     self._load()
                     return _text(self.iterentries())
                 def dirtext(self):
                     """Get the full data of this directory as a bytestring. Make sure that
                     any submanifests have been written first, so their nodeids are correct.
                     """
                     self._load()
                     flags = self.flags
                     lazydirs = [
                         (d[:-1], v[1], b't') for d, v in pycompat.iteritems(self._lazydirs)
                     ]
                     dirs = [(d[:-1], self._dirs[d]._node, b't') for d in self._dirs]
                     files = [(f, self._files[f], flags(f)) for f in self._files]
                     return _text(sorted(dirs + files + lazydirs))
                 def read(self, gettext, readsubtree):
                     def _load_for_read(s):
                         s.parse(gettext(), readsubtree)
                         s._dirty = False
                     self._loadfunc = _load_for_read
                 def writesubtrees(self, m1, m2, writesubtree, match):
                     self._load()  # for consistency; should never have any effect here
                     m1._load()
                     m2._load()
                     emptytree = treemanifest()
                     def getnode(m, d):
                         ld = m._lazydirs.get(d)
                         if ld:
                             return ld[1]
                         return m._dirs.get(d, emptytree)._node
                     # let's skip investigating things that `match` says we do not need.
                     visit = match.visitchildrenset(self._dir[:-1])
                     visit = self._loadchildrensetlazy(visit)
                     if visit == b'this' or visit == b'all':
                         visit = None
                     for d, subm in pycompat.iteritems(self._dirs):
                         if visit and d[:-1] not in visit:
                             continue
                         subp1 = getnode(m1, d)
                         subp2 = getnode(m2, d)
                         if subp1 == nullid:
                             subp1, subp2 = subp2, subp1
                         writesubtree(subm, subp1, subp2, match)
                 def walksubtrees(self, matcher=None):
                     """Returns an iterator of the subtrees of this manifest, including this
                     manifest itself.
                     If `matcher` is provided, it only returns subtrees that match.
                     """
                     if matcher and not matcher.visitdir(self._dir[:-1]):
                         return
                     if not matcher or matcher(self._dir[:-1]):
                         yield self
                     self._load()
                     # OPT: use visitchildrenset to avoid loading everything.
                     self._loadalllazy()
                     for d, subm in pycompat.iteritems(self._dirs):
                         for subtree in subm.walksubtrees(matcher=matcher):
                             yield subtree
             class manifestfulltextcache(util.lrucachedict):
                 """File-backed LRU cache for the manifest cache
                 File consists of entries, up to EOF:
                 - 20 bytes node, 4 bytes length, <length> manifest data
                 These are written in reverse cache order (oldest to newest).
                 """
                 _file = b'manifestfulltextcache'
                 def __init__(self, max):
                     super(manifestfulltextcache, self).__init__(max)
                     self._dirty = False
                     self._read = False
                     self._opener = None
                 def read(self):
                     if self._read or self._opener is None:
                         return
                     try:
                         with self._opener(self._file) as fp:
                             set = super(manifestfulltextcache, self).__setitem__
                             # ignore trailing data, this is a cache, corruption is skipped
                             while True:
                                 node = fp.read(20)
                                 if len(node) < 20:
                                     break
                                 try:
                                     size = struct.unpack(b'>L', fp.read(4))[0]
                                 except struct.error:
                                     break
                                 value = bytearray(fp.read(size))
                                 if len(value) != size:
                                     break
                                 set(node, value)
                     except IOError:
                         # the file is allowed to be missing
                         pass
                     self._read = True
                     self._dirty = False
                 def write(self):
                     if not self._dirty or self._opener is None:
                         return
                     # rotate backwards to the first used node
                     with self._opener(
                         self._file, b'w', atomictemp=True, checkambig=True
                     ) as fp:
                         node = self._head.prev
                         while True:
                             if node.key in self._cache:
                                 fp.write(node.key)
                                 fp.write(struct.pack(b'>L', len(node.value)))
                                 fp.write(node.value)
                             if node is self._head:
                                 break
                             node = node.prev
                 def __len__(self):
                     if not self._read:
                         self.read()
                     return super(manifestfulltextcache, self).__len__()
                 def __contains__(self, k):
                     if not self._read:
                         self.read()
                     return super(manifestfulltextcache, self).__contains__(k)
                 def __iter__(self):
                     if not self._read:
                         self.read()
                     return super(manifestfulltextcache, self).__iter__()
                 def __getitem__(self, k):
                     if not self._read:
                         self.read()
                     # the cache lru order can change on read
                     setdirty = self._cache.get(k) is not self._head
                     value = super(manifestfulltextcache, self).__getitem__(k)
                     if setdirty:
                         self._dirty = True
                     return value
                 def __setitem__(self, k, v):
                     if not self._read:
                         self.read()
                     super(manifestfulltextcache, self).__setitem__(k, v)
                     self._dirty = True
                 def __delitem__(self, k):
                     if not self._read:
                         self.read()
                     super(manifestfulltextcache, self).__delitem__(k)
                     self._dirty = True
                 def get(self, k, default=None):
                     if not self._read:
                         self.read()
                     return super(manifestfulltextcache, self).get(k, default=default)
                 def clear(self, clear_persisted_data=False):
                     super(manifestfulltextcache, self).clear()
                     if clear_persisted_data:
                         self._dirty = True
                         self.write()
                     self._read = False
             # and upper bound of what we expect from compression
             # (real live value seems to be "3")
             MAXCOMPRESSION = 3
             @interfaceutil.implementer(repository.imanifeststorage)
             class manifestrevlog(object):
                 '''A revlog that stores manifest texts. This is responsible for caching the
                 full-text manifest contents.
                 '''
                 def __init__(
                     self,
                     opener,
                     tree=b'',
                     dirlogcache=None,
                     indexfile=None,
                     treemanifest=False,
                 ):
                     """Constructs a new manifest revlog
                     `indexfile` - used by extensions to have two manifests at once, like
                     when transitioning between flatmanifeset and treemanifests.
                     `treemanifest` - used to indicate this is a tree manifest revlog. Opener
                     options can also be used to make this a tree manifest revlog. The opener
                     option takes precedence, so if it is set to True, we ignore whatever
                     value is passed in to the constructor.
                     """
                     # During normal operations, we expect to deal with not more than four
                     # revs at a time (such as during commit --amend). When rebasing large
                     # stacks of commits, the number can go up, hence the config knob below.
                     cachesize = 4
                     optiontreemanifest = False
                     opts = getattr(opener, 'options', None)
                     if opts is not None:
                         cachesize = opts.get(b'manifestcachesize', cachesize)
                         optiontreemanifest = opts.get(b'treemanifest', False)
                     self._treeondisk = optiontreemanifest or treemanifest
                     self._fulltextcache = manifestfulltextcache(cachesize)
                     if tree:
                         assert self._treeondisk, b'opts is %r' % opts
                     if indexfile is None:
                         indexfile = b'00manifest.i'
                         if tree:
                             indexfile = b"meta/" + tree + indexfile
                     self.tree = tree
                     # The dirlogcache is kept on the root manifest log
                     if tree:
                         self._dirlogcache = dirlogcache
                     else:
                         self._dirlogcache = {b'': self}
                     self._revlog = revlog.revlog(
                         opener,
                         indexfile,
                         # only root indexfile is cached
                         checkambig=not bool(tree),
                         mmaplargeindex=True,
                         upperboundcomp=MAXCOMPRESSION,
                     )
                     self.index = self._revlog.index
                     self.version = self._revlog.version
                     self._generaldelta = self._revlog._generaldelta
                 def _setupmanifestcachehooks(self, repo):
                     """Persist the manifestfulltextcache on lock release"""
                     if not util.safehasattr(repo, b'_wlockref'):
                         return
                     self._fulltextcache._opener = repo.wcachevfs
                     if repo._currentlock(repo._wlockref) is None:
                         return
                     reporef = weakref.ref(repo)
                     manifestrevlogref = weakref.ref(self)
                     def persistmanifestcache(success):
                         # Repo is in an unknown state, do not persist.
                         if not success:
                             return
                         repo = reporef()
                         self = manifestrevlogref()
                         if repo is None or self is None:
                             return
                         if repo.manifestlog.getstorage(b'') is not self:
                             # there's a different manifest in play now, abort
                             return
                         self._fulltextcache.write()
                     repo._afterlock(persistmanifestcache)
                 @property
                 def fulltextcache(self):
                     return self._fulltextcache
                 def clearcaches(self, clear_persisted_data=False):
                     self._revlog.clearcaches()
                     self._fulltextcache.clear(clear_persisted_data=clear_persisted_data)
                     self._dirlogcache = {self.tree: self}
                 def dirlog(self, d):
                     if d:
                         assert self._treeondisk
                     if d not in self._dirlogcache:
                         mfrevlog = manifestrevlog(
                             self.opener, d, self._dirlogcache, treemanifest=self._treeondisk
                         )
                         self._dirlogcache[d] = mfrevlog
                     return self._dirlogcache[d]
                 def add(
                     self,
                     m,
                     transaction,
                     link,
                     p1,
                     p2,
                     added,
                     removed,
                     readtree=None,
                     match=None,
                 ):
                     if p1 in self.fulltextcache and util.safehasattr(m, b'fastdelta'):
                         # If our first parent is in the manifest cache, we can
                         # compute a delta here using properties we know about the
                         # manifest up-front, which may save time later for the
                         # revlog layer.
                         _checkforbidden(added)
                         # combine the changed lists into one sorted iterator
                         work = heapq.merge(
                             [(x, False) for x in sorted(added)],
                             [(x, True) for x in sorted(removed)],
                         )
                         arraytext, deltatext = m.fastdelta(self.fulltextcache[p1], work)
                         cachedelta = self._revlog.rev(p1), deltatext
                         text = util.buffer(arraytext)
                         n = self._revlog.addrevision(
                             text, transaction, link, p1, p2, cachedelta
                         )
                     else:
                         # The first parent manifest isn't already loaded, so we'll
                         # just encode a fulltext of the manifest and pass that
                         # through to the revlog layer, and let it handle the delta
                         # process.
                         if self._treeondisk:
                             assert readtree, b"readtree must be set for treemanifest writes"
                             assert match, b"match must be specified for treemanifest writes"
                             m1 = readtree(self.tree, p1)
                             m2 = readtree(self.tree, p2)
                             n = self._addtree(
                                 m, transaction, link, m1, m2, readtree, match=match
                             )
                             arraytext = None
                         else:
                             text = m.text()
                             n = self._revlog.addrevision(text, transaction, link, p1, p2)
                             arraytext = bytearray(text)
                     if arraytext is not None:
                         self.fulltextcache[n] = arraytext
                     return n
                 def _addtree(self, m, transaction, link, m1, m2, readtree, match):
                     # If the manifest is unchanged compared to one parent,
                     # don't write a new revision
                     if self.tree != b'' and (
                         m.unmodifiedsince(m1) or m.unmodifiedsince(m2)
                     ):
                         return m.node()
                     def writesubtree(subm, subp1, subp2, match):
                         sublog = self.dirlog(subm.dir())
                         sublog.add(
                             subm,
                             transaction,
                             link,
                             subp1,
                             subp2,
                             None,
                             None,
                             readtree=readtree,
                             match=match,
                         )
                     m.writesubtrees(m1, m2, writesubtree, match)
                     text = m.dirtext()
                     n = None
                     if self.tree != b'':
                         # Double-check whether contents are unchanged to one parent
                         if text == m1.dirtext():
                             n = m1.node()
                         elif text == m2.dirtext():
                             n = m2.node()
                     if not n:
                         n = self._revlog.addrevision(
                             text, transaction, link, m1.node(), m2.node()
                         )
                     # Save nodeid so parent manifest can calculate its nodeid
                     m.setnode(n)
                     return n
                 def __len__(self):
                     return len(self._revlog)
                 def __iter__(self):
                     return self._revlog.__iter__()
                 def rev(self, node):
                     return self._revlog.rev(node)
                 def node(self, rev):
                     return self._revlog.node(rev)
                 def lookup(self, value):
                     return self._revlog.lookup(value)
                 def parentrevs(self, rev):
                     return self._revlog.parentrevs(rev)
                 def parents(self, node):
                     return self._revlog.parents(node)
                 def linkrev(self, rev):
                     return self._revlog.linkrev(rev)
                 def checksize(self):
                     return self._revlog.checksize()
                 def revision(self, node, _df=None, raw=False):
                     return self._revlog.revision(node, _df=_df, raw=raw)
                 def rawdata(self, node, _df=None):
                     return self._revlog.rawdata(node, _df=_df)
                 def revdiff(self, rev1, rev2):
                     return self._revlog.revdiff(rev1, rev2)
                 def cmp(self, node, text):
                     return self._revlog.cmp(node, text)
                 def deltaparent(self, rev):
                     return self._revlog.deltaparent(rev)
                 def emitrevisions(
                     self,
                     nodes,
                     nodesorder=None,
                     revisiondata=False,
                     assumehaveparentrevisions=False,
                     deltamode=repository.CG_DELTAMODE_STD,
                 ):
                     return self._revlog.emitrevisions(
                         nodes,
                         nodesorder=nodesorder,
                         revisiondata=revisiondata,
                         assumehaveparentrevisions=assumehaveparentrevisions,
                         deltamode=deltamode,
                     )
                 def addgroup(self, deltas, linkmapper, transaction, addrevisioncb=None):
                     return self._revlog.addgroup(
                         deltas, linkmapper, transaction, addrevisioncb=addrevisioncb
                     )
                 def rawsize(self, rev):
                     return self._revlog.rawsize(rev)
                 def getstrippoint(self, minlink):
                     return self._revlog.getstrippoint(minlink)
                 def strip(self, minlink, transaction):
                     return self._revlog.strip(minlink, transaction)
                 def files(self):
                     return self._revlog.files()
                 def clone(self, tr, destrevlog, **kwargs):
                     if not isinstance(destrevlog, manifestrevlog):
                         raise error.ProgrammingError(b'expected manifestrevlog to clone()')
                     return self._revlog.clone(tr, destrevlog._revlog, **kwargs)
                 def storageinfo(
                     self,
                     exclusivefiles=False,
                     sharedfiles=False,
                     revisionscount=False,
                     trackedsize=False,
                     storedsize=False,
                 ):
                     return self._revlog.storageinfo(
                         exclusivefiles=exclusivefiles,
                         sharedfiles=sharedfiles,
                         revisionscount=revisionscount,
                         trackedsize=trackedsize,
                         storedsize=storedsize,
                     )
                 @property
                 def indexfile(self):
                     return self._revlog.indexfile
                 @indexfile.setter
                 def indexfile(self, value):
                     self._revlog.indexfile = value
                 @property
                 def opener(self):
                     return self._revlog.opener
                 @opener.setter
                 def opener(self, value):
                     self._revlog.opener = value
             @interfaceutil.implementer(repository.imanifestlog)
             class manifestlog(object):
                 """A collection class representing the collection of manifest snapshots
                 referenced by commits in the repository.
                 In this situation, 'manifest' refers to the abstract concept of a snapshot
                 of the list of files in the given commit. Consumers of the output of this
                 class do not care about the implementation details of the actual manifests
                 they receive (i.e. tree or flat or lazily loaded, etc)."""
                 def __init__(self, opener, repo, rootstore, narrowmatch):
                     usetreemanifest = False
                     cachesize = 4
                     opts = getattr(opener, 'options', None)
                     if opts is not None:
                         usetreemanifest = opts.get(b'treemanifest', usetreemanifest)
                         cachesize = opts.get(b'manifestcachesize', cachesize)
                     self._treemanifests = usetreemanifest
                     self._rootstore = rootstore
                     self._rootstore._setupmanifestcachehooks(repo)
                     self._narrowmatch = narrowmatch
                     # A cache of the manifestctx or treemanifestctx for each directory
                     self._dirmancache = {}
                     self._dirmancache[b''] = util.lrucachedict(cachesize)
                     self._cachesize = cachesize
                 def __getitem__(self, node):
                     """Retrieves the manifest instance for the given node. Throws a
                     LookupError if not found.
                     """
                     return self.get(b'', node)
                 def get(self, tree, node, verify=True):
                     """Retrieves the manifest instance for the given node. Throws a
                     LookupError if not found.
                     `verify` - if True an exception will be thrown if the node is not in
                                the revlog
                     """
                     if node in self._dirmancache.get(tree, ()):
                         return self._dirmancache[tree][node]
                     if not self._narrowmatch.always():
                         if not self._narrowmatch.visitdir(tree[:-1]):
                             return excludeddirmanifestctx(tree, node)
                     if tree:
                         if self._rootstore._treeondisk:
                             if verify:
                                 # Side-effect is LookupError is raised if node doesn't
                                 # exist.
                                 self.getstorage(tree).rev(node)
                             m = treemanifestctx(self, tree, node)
                         else:
                             raise error.Abort(
                                 _(
                                     b"cannot ask for manifest directory '%s' in a flat "
                                     b"manifest"
                                 )
                                 % tree
                             )
                     else:
                         if verify:
                             # Side-effect is LookupError is raised if node doesn't exist.
                             self._rootstore.rev(node)
                         if self._treemanifests:
                             m = treemanifestctx(self, b'', node)
                         else:
                             m = manifestctx(self, node)
                     if node != nullid:
                         mancache = self._dirmancache.get(tree)
                         if not mancache:
                             mancache = util.lrucachedict(self._cachesize)
                             self._dirmancache[tree] = mancache
                         mancache[node] = m
                     return m
                 def getstorage(self, tree):
                     return self._rootstore.dirlog(tree)
                 def clearcaches(self, clear_persisted_data=False):
                     self._dirmancache.clear()
                     self._rootstore.clearcaches(clear_persisted_data=clear_persisted_data)
                 def rev(self, node):
                     return self._rootstore.rev(node)
             @interfaceutil.implementer(repository.imanifestrevisionwritable)
             class memmanifestctx(object):
                 def __init__(self, manifestlog):
                     self._manifestlog = manifestlog
                     self._manifestdict = manifestdict()
                 def _storage(self):
                     return self._manifestlog.getstorage(b'')
                 def new(self):
                     return memmanifestctx(self._manifestlog)
                 def copy(self):
                     memmf = memmanifestctx(self._manifestlog)
                     memmf._manifestdict = self.read().copy()
                     return memmf
                 def read(self):
                     return self._manifestdict
                 def write(self, transaction, link, p1, p2, added, removed, match=None):
                     return self._storage().add(
                         self._manifestdict,
                         transaction,
                         link,
                         p1,
                         p2,
                         added,
                         removed,
                         match=match,
                     )
             @interfaceutil.implementer(repository.imanifestrevisionstored)
             class manifestctx(object):
                 """A class representing a single revision of a manifest, including its
                 contents, its parent revs, and its linkrev.
                 """
                 def __init__(self, manifestlog, node):
                     self._manifestlog = manifestlog
                     self._data = None
                     self._node = node
                     # TODO: We eventually want p1, p2, and linkrev exposed on this class,
                     # but let's add it later when something needs it and we can load it
                     # lazily.
                     # self.p1, self.p2 = store.parents(node)
                     # rev = store.rev(node)
                     # self.linkrev = store.linkrev(rev)
                 def _storage(self):
                     return self._manifestlog.getstorage(b'')
                 def node(self):
                     return self._node
                 def new(self):
                     return memmanifestctx(self._manifestlog)
                 def copy(self):
                     memmf = memmanifestctx(self._manifestlog)
                     memmf._manifestdict = self.read().copy()
                     return memmf
                 @propertycache
                 def parents(self):
                     return self._storage().parents(self._node)
                 def read(self):
                     if self._data is None:
                         if self._node == nullid:
                             self._data = manifestdict()
                         else:
                             store = self._storage()
                             if self._node in store.fulltextcache:
                                 text = pycompat.bytestr(store.fulltextcache[self._node])
                             else:
                                 text = store.revision(self._node)
                                 arraytext = bytearray(text)
                                 store.fulltextcache[self._node] = arraytext
                             self._data = manifestdict(text)
                     return self._data
                 def readfast(self, shallow=False):
                     '''Calls either readdelta or read, based on which would be less work.
                     readdelta is called if the delta is against the p1, and therefore can be
                     read quickly.
                     If `shallow` is True, nothing changes since this is a flat manifest.
                     '''
                     store = self._storage()
                     r = store.rev(self._node)
                     deltaparent = store.deltaparent(r)
                     if deltaparent != nullrev and deltaparent in store.parentrevs(r):
                         return self.readdelta()
                     return self.read()
                 def readdelta(self, shallow=False):
                     '''Returns a manifest containing just the entries that are present
                     in this manifest, but not in its p1 manifest. This is efficient to read
                     if the revlog delta is already p1.
                     Changing the value of `shallow` has no effect on flat manifests.
                     '''
                     store = self._storage()
                     r = store.rev(self._node)
                     d = mdiff.patchtext(store.revdiff(store.deltaparent(r), r))
                     return manifestdict(d)
                 def find(self, key):
                     return self.read().find(key)
             @interfaceutil.implementer(repository.imanifestrevisionwritable)
             class memtreemanifestctx(object):
                 def __init__(self, manifestlog, dir=b''):
                     self._manifestlog = manifestlog
                     self._dir = dir
                     self._treemanifest = treemanifest()
                 def _storage(self):
                     return self._manifestlog.getstorage(b'')
                 def new(self, dir=b''):
                     return memtreemanifestctx(self._manifestlog, dir=dir)
                 def copy(self):
                     memmf = memtreemanifestctx(self._manifestlog, dir=self._dir)
                     memmf._treemanifest = self._treemanifest.copy()
                     return memmf
                 def read(self):
                     return self._treemanifest
                 def write(self, transaction, link, p1, p2, added, removed, match=None):
                     def readtree(dir, node):
                         return self._manifestlog.get(dir, node).read()
                     return self._storage().add(
                         self._treemanifest,
                         transaction,
                         link,
                         p1,
                         p2,
                         added,
                         removed,
                         readtree=readtree,
                         match=match,
                     )
             @interfaceutil.implementer(repository.imanifestrevisionstored)
             class treemanifestctx(object):
                 def __init__(self, manifestlog, dir, node):
                     self._manifestlog = manifestlog
                     self._dir = dir
                     self._data = None
                     self._node = node
                     # TODO: Load p1/p2/linkrev lazily. They need to be lazily loaded so that
                     # we can instantiate treemanifestctx objects for directories we don't
                     # have on disk.
                     # self.p1, self.p2 = store.parents(node)
                     # rev = store.rev(node)
                     # self.linkrev = store.linkrev(rev)
                 def _storage(self):
                     narrowmatch = self._manifestlog._narrowmatch
                     if not narrowmatch.always():
                         if not narrowmatch.visitdir(self._dir[:-1]):
                             return excludedmanifestrevlog(self._dir)
                     return self._manifestlog.getstorage(self._dir)
                 def read(self):
                     if self._data is None:
                         store = self._storage()
                         if self._node == nullid:
                             self._data = treemanifest()
                         # TODO accessing non-public API
                         elif store._treeondisk:
                             m = treemanifest(dir=self._dir)
                             def gettext():
                                 return store.revision(self._node)
                             def readsubtree(dir, subm):
                                 # Set verify to False since we need to be able to create
                                 # subtrees for trees that don't exist on disk.
                                 return self._manifestlog.get(dir, subm, verify=False).read()
                             m.read(gettext, readsubtree)
                             m.setnode(self._node)
                             self._data = m
                         else:
                             if self._node in store.fulltextcache:
                                 text = pycompat.bytestr(store.fulltextcache[self._node])
                             else:
                                 text = store.revision(self._node)
                                 arraytext = bytearray(text)
                                 store.fulltextcache[self._node] = arraytext
                             self._data = treemanifest(dir=self._dir, text=text)
                     return self._data
                 def node(self):
                     return self._node
                 def new(self, dir=b''):
                     return memtreemanifestctx(self._manifestlog, dir=dir)
                 def copy(self):
                     memmf = memtreemanifestctx(self._manifestlog, dir=self._dir)
                     memmf._treemanifest = self.read().copy()
                     return memmf
                 @propertycache
                 def parents(self):
                     return self._storage().parents(self._node)
                 def readdelta(self, shallow=False):
                     '''Returns a manifest containing just the entries that are present
                     in this manifest, but not in its p1 manifest. This is efficient to read
                     if the revlog delta is already p1.
                     If `shallow` is True, this will read the delta for this directory,
                     without recursively reading subdirectory manifests. Instead, any
                     subdirectory entry will be reported as it appears in the manifest, i.e.
                     the subdirectory will be reported among files and distinguished only by
                     its 't' flag.
                     '''
                     store = self._storage()
                     if shallow:
                         r = store.rev(self._node)
                         d = mdiff.patchtext(store.revdiff(store.deltaparent(r), r))
                         return manifestdict(d)
                     else:
                         # Need to perform a slow delta
                         r0 = store.deltaparent(store.rev(self._node))
                         m0 = self._manifestlog.get(self._dir, store.node(r0)).read()
                         m1 = self.read()
                         md = treemanifest(dir=self._dir)
                         for f, ((n0, fl0), (n1, fl1)) in pycompat.iteritems(m0.diff(m1)):
                             if n1:
                                 md[f] = n1
                                 if fl1:
                                     md.setflag(f, fl1)
                         return md
                 def readfast(self, shallow=False):
                     '''Calls either readdelta or read, based on which would be less work.
                     readdelta is called if the delta is against the p1, and therefore can be
                     read quickly.
                     If `shallow` is True, it only returns the entries from this manifest,
                     and not any submanifests.
                     '''
                     store = self._storage()
                     r = store.rev(self._node)
                     deltaparent = store.deltaparent(r)
                     if deltaparent != nullrev and deltaparent in store.parentrevs(r):
                         return self.readdelta(shallow=shallow)
                     if shallow:
                         return manifestdict(store.revision(self._node))
                     else:
                         return self.read()
                 def find(self, key):
                     return self.read().find(key)
             class excludeddir(treemanifest):
                 """Stand-in for a directory that is excluded from the repository.
                 With narrowing active on a repository that uses treemanifests,
                 some of the directory revlogs will be excluded from the resulting
                 clone. This is a huge storage win for clients, but means we need
                 some sort of pseudo-manifest to surface to internals so we can
                 detect a merge conflict outside the narrowspec. That's what this
                 class is: it stands in for a directory whose node is known, but
                 whose contents are unknown.
                 """
                 def __init__(self, dir, node):
                     super(excludeddir, self).__init__(dir)
                     self._node = node
                     # Add an empty file, which will be included by iterators and such,
                     # appearing as the directory itself (i.e. something like "dir/")
                     self._files[b''] = node
                     self._flags[b''] = b't'
                 # Manifests outside the narrowspec should never be modified, so avoid
                 # copying. This makes a noticeable difference when there are very many
                 # directories outside the narrowspec. Also, it makes sense for the copy to
                 # be of the same type as the original, which would not happen with the
                 # super type's copy().
                 def copy(self):
                     return self
             class excludeddirmanifestctx(treemanifestctx):
                 """context wrapper for excludeddir - see that docstring for rationale"""
                 def __init__(self, dir, node):
                     self._dir = dir
                     self._node = node
                 def read(self):
                     return excludeddir(self._dir, self._node)
                 def write(self, *args):
                     raise error.ProgrammingError(
                         b'attempt to write manifest from excluded dir %s' % self._dir
                     )
             class excludedmanifestrevlog(manifestrevlog):
                 """Stand-in for excluded treemanifest revlogs.
                 When narrowing is active on a treemanifest repository, we'll have
                 references to directories we can't see due to the revlog being
                 skipped. This class exists to conform to the manifestrevlog
                 interface for those directories and proactively prevent writes to
                 outside the narrowspec.
                 """
                 def __init__(self, dir):
                     self._dir = dir
                 def __len__(self):
                     raise error.ProgrammingError(
                         b'attempt to get length of excluded dir %s' % self._dir
                     )
                 def rev(self, node):
                     raise error.ProgrammingError(
                         b'attempt to get rev from excluded dir %s' % self._dir
                     )
                 def linkrev(self, node):
                     raise error.ProgrammingError(
                         b'attempt to get linkrev from excluded dir %s' % self._dir
                     )
                 def node(self, rev):
                     raise error.ProgrammingError(
                         b'attempt to get node from excluded dir %s' % self._dir
                     )
                 def add(self, *args, **kwargs):
                     # We should never write entries in dirlogs outside the narrow clone.
                     # However, the method still gets called from writesubtree() in
                     # _addtree(), so we need to handle it. We should possibly make that
                     # avoid calling add() with a clean manifest (_dirty is always False
                     # in excludeddir instances).
                     pass

mercurial/patch.py

0 +1 0

             # patch.py - patch file parsing routines
             #
             # Copyright 2006 Brendan Cully <brendan@kublai.com>
             # Copyright 2007 Chris Mason <chris.mason@oracle.com>
             #
             # This software may be used and distributed according to the terms of the
             # GNU General Public License version 2 or any later version.
             from __future__ import absolute_import, print_function
             import collections
             import contextlib
             import copy
             import errno
             import os
             import re
             import shutil
             import zlib
             from .i18n import _
             from .node import (
                 hex,
                 short,
             )
             from .pycompat import open
             from . import (
                 copies,
                 diffhelper,
                 diffutil,
                 encoding,
                 error,
                 mail,
                 mdiff,
                 pathutil,
                 pycompat,
                 scmutil,
                 similar,
                 util,
                 vfs as vfsmod,
             )
             from .utils import (
                 dateutil,
                 hashutil,
                 procutil,
                 stringutil,
             )
             stringio = util.stringio
             gitre = re.compile(br'diff --git a/(.*) b/(.*)')
             tabsplitter = re.compile(br'(\t+|[^\t]+)')
             wordsplitter = re.compile(
                 br'(\t+| +|[a-zA-Z0-9_\x80-\xff]+|[^ \ta-zA-Z0-9_\x80-\xff])'
             )
             PatchError = error.PatchError
             # public functions
             def split(stream):
                 '''return an iterator of individual patches from a stream'''
                 def isheader(line, inheader):
                     if inheader and line.startswith((b' ', b'\t')):
                         # continuation
                         return True
                     if line.startswith((b' ', b'-', b'+')):
                         # diff line - don't check for header pattern in there
                         return False
                     l = line.split(b': ', 1)
                     return len(l) == 2 and b' ' not in l[0]
                 def chunk(lines):
                     return stringio(b''.join(lines))
                 def hgsplit(stream, cur):
                     inheader = True
                     for line in stream:
                         if not line.strip():
                             inheader = False
                         if not inheader and line.startswith(b'# HG changeset patch'):
                             yield chunk(cur)
                             cur = []
                             inheader = True
                         cur.append(line)
                     if cur:
                         yield chunk(cur)
                 def mboxsplit(stream, cur):
                     for line in stream:
                         if line.startswith(b'From '):
                             for c in split(chunk(cur[1:])):
                                 yield c
                             cur = []
                         cur.append(line)
                     if cur:
                         for c in split(chunk(cur[1:])):
                             yield c
                 def mimesplit(stream, cur):
                     def msgfp(m):
                         fp = stringio()
                         g = mail.Generator(fp, mangle_from_=False)
                         g.flatten(m)
                         fp.seek(0)
                         return fp
                     for line in stream:
                         cur.append(line)
                     c = chunk(cur)
                     m = mail.parse(c)
                     if not m.is_multipart():
                         yield msgfp(m)
                     else:
                         ok_types = (b'text/plain', b'text/x-diff', b'text/x-patch')
                         for part in m.walk():
                             ct = part.get_content_type()
                             if ct not in ok_types:
                                 continue
                             yield msgfp(part)
                 def headersplit(stream, cur):
                     inheader = False
                     for line in stream:
                         if not inheader and isheader(line, inheader):
                             yield chunk(cur)
                             cur = []
                             inheader = True
                         if inheader and not isheader(line, inheader):
                             inheader = False
                         cur.append(line)
                     if cur:
                         yield chunk(cur)
                 def remainder(cur):
                     yield chunk(cur)
                 class fiter(object):
                     def __init__(self, fp):
                         self.fp = fp
                     def __iter__(self):
                         return self
                     def next(self):
                         l = self.fp.readline()
                         if not l:
                             raise StopIteration
                         return l
                     __next__ = next
                 inheader = False
                 cur = []
                 mimeheaders = [b'content-type']
                 if not util.safehasattr(stream, b'next'):
                     # http responses, for example, have readline but not next
                     stream = fiter(stream)
                 for line in stream:
                     cur.append(line)
                     if line.startswith(b'# HG changeset patch'):
                         return hgsplit(stream, cur)
                     elif line.startswith(b'From '):
                         return mboxsplit(stream, cur)
                     elif isheader(line, inheader):
                         inheader = True
                         if line.split(b':', 1)[0].lower() in mimeheaders:
                             # let email parser handle this
                             return mimesplit(stream, cur)
                     elif line.startswith(b'--- ') and inheader:
                         # No evil headers seen by diff start, split by hand
                         return headersplit(stream, cur)
                     # Not enough info, keep reading
                 # if we are here, we have a very plain patch
                 return remainder(cur)
             ## Some facility for extensible patch parsing:
             # list of pairs ("header to match", "data key")
             patchheadermap = [
                 (b'Date', b'date'),
                 (b'Branch', b'branch'),
                 (b'Node ID', b'nodeid'),
             ]
             @contextlib.contextmanager
             def extract(ui, fileobj):
                 '''extract patch from data read from fileobj.
                 patch can be a normal patch or contained in an email message.
                 return a dictionary. Standard keys are:
                   - filename,
                   - message,
                   - user,
                   - date,
                   - branch,
                   - node,
                   - p1,
                   - p2.
                 Any item can be missing from the dictionary. If filename is missing,
                 fileobj did not contain a patch. Caller must unlink filename when done.'''
                 fd, tmpname = pycompat.mkstemp(prefix=b'hg-patch-')
                 tmpfp = os.fdopen(fd, 'wb')
                 try:
                     yield _extract(ui, fileobj, tmpname, tmpfp)
                 finally:
                     tmpfp.close()
                     os.unlink(tmpname)
             def _extract(ui, fileobj, tmpname, tmpfp):
                 # attempt to detect the start of a patch
                 # (this heuristic is borrowed from quilt)
                 diffre = re.compile(
                     br'^(?:Index:[ \t]|diff[ \t]-|RCS file: |'
                     br'retrieving revision [0-9]+(\.[0-9]+)*$|'
                     br'---[ \t].*?^\+\+\+[ \t]|'
                     br'\*\*\*[ \t].*?^---[ \t])',
                     re.MULTILINE | re.DOTALL,
                 )
                 data = {}
                 msg = mail.parse(fileobj)
                 subject = msg['Subject'] and mail.headdecode(msg['Subject'])
                 data[b'user'] = msg['From'] and mail.headdecode(msg['From'])
                 if not subject and not data[b'user']:
                     # Not an email, restore parsed headers if any
                     subject = (
                         b'\n'.join(
                             b': '.join(map(encoding.strtolocal, h)) for h in msg.items()
                         )
                         + b'\n'
                     )
                 # should try to parse msg['Date']
                 parents = []
                 nodeid = msg['X-Mercurial-Node']
                 if nodeid:
                     data[b'nodeid'] = nodeid = mail.headdecode(nodeid)
                     ui.debug(b'Node ID: %s\n' % nodeid)
                 if subject:
                     if subject.startswith(b'[PATCH'):
                         pend = subject.find(b']')
                         if pend >= 0:
                             subject = subject[pend + 1 :].lstrip()
                     subject = re.sub(br'\n[ \t]+', b' ', subject)
                     ui.debug(b'Subject: %s\n' % subject)
                 if data[b'user']:
                     ui.debug(b'From: %s\n' % data[b'user'])
                 diffs_seen = 0
                 ok_types = (b'text/plain', b'text/x-diff', b'text/x-patch')
                 message = b''
                 for part in msg.walk():
                     content_type = pycompat.bytestr(part.get_content_type())
                     ui.debug(b'Content-Type: %s\n' % content_type)
                     if content_type not in ok_types:
                         continue
                     payload = part.get_payload(decode=True)
                     m = diffre.search(payload)
                     if m:
                         hgpatch = False
                         hgpatchheader = False
                         ignoretext = False
                         ui.debug(b'found patch at byte %d\n' % m.start(0))
                         diffs_seen += 1
                         cfp = stringio()
                         for line in payload[: m.start(0)].splitlines():
                             if line.startswith(b'# HG changeset patch') and not hgpatch:
                                 ui.debug(b'patch generated by hg export\n')
                                 hgpatch = True
                                 hgpatchheader = True
                                 # drop earlier commit message content
                                 cfp.seek(0)
                                 cfp.truncate()
                                 subject = None
                             elif hgpatchheader:
                                 if line.startswith(b'# User '):
                                     data[b'user'] = line[7:]
                                     ui.debug(b'From: %s\n' % data[b'user'])
                                 elif line.startswith(b"# Parent "):
                                     parents.append(line[9:].lstrip())
                                 elif line.startswith(b"# "):
                                     for header, key in patchheadermap:
                                         prefix = b'# %s ' % header
                                         if line.startswith(prefix):
                                             data[key] = line[len(prefix) :]
                                             ui.debug(b'%s: %s\n' % (header, data[key]))
                                 else:
                                     hgpatchheader = False
                             elif line == b'---':
                                 ignoretext = True
                             if not hgpatchheader and not ignoretext:
                                 cfp.write(line)
                                 cfp.write(b'\n')
                         message = cfp.getvalue()
                         if tmpfp:
                             tmpfp.write(payload)
                             if not payload.endswith(b'\n'):
                                 tmpfp.write(b'\n')
                     elif not diffs_seen and message and content_type == b'text/plain':
                         message += b'\n' + payload
                 if subject and not message.startswith(subject):
                     message = b'%s\n%s' % (subject, message)
                 data[b'message'] = message
                 tmpfp.close()
                 if parents:
                     data[b'p1'] = parents.pop(0)
                     if parents:
                         data[b'p2'] = parents.pop(0)
                 if diffs_seen:
                     data[b'filename'] = tmpname
                 return data
             class patchmeta(object):
                 """Patched file metadata
                 'op' is the performed operation within ADD, DELETE, RENAME, MODIFY
                 or COPY.  'path' is patched file path. 'oldpath' is set to the
                 origin file when 'op' is either COPY or RENAME, None otherwise. If
                 file mode is changed, 'mode' is a tuple (islink, isexec) where
                 'islink' is True if the file is a symlink and 'isexec' is True if
                 the file is executable. Otherwise, 'mode' is None.
                 """
                 def __init__(self, path):
                     self.path = path
                     self.oldpath = None
                     self.mode = None
                     self.op = b'MODIFY'
                     self.binary = False
                 def setmode(self, mode):
                     islink = mode & 0o20000
                     isexec = mode & 0o100
                     self.mode = (islink, isexec)
                 def copy(self):
                     other = patchmeta(self.path)
                     other.oldpath = self.oldpath
                     other.mode = self.mode
                     other.op = self.op
                     other.binary = self.binary
                     return other
                 def _ispatchinga(self, afile):
                     if afile == b'/dev/null':
                         return self.op == b'ADD'
                     return afile == b'a/' + (self.oldpath or self.path)
                 def _ispatchingb(self, bfile):
                     if bfile == b'/dev/null':
                         return self.op == b'DELETE'
                     return bfile == b'b/' + self.path
                 def ispatching(self, afile, bfile):
                     return self._ispatchinga(afile) and self._ispatchingb(bfile)
                 def __repr__(self):
                     return "<patchmeta %s %r>" % (self.op, self.path)
             def readgitpatch(lr):
                 """extract git-style metadata about patches from <patchname>"""
                 # Filter patch for git information
                 gp = None
                 gitpatches = []
                 for line in lr:
                     line = line.rstrip(b' \r\n')
                     if line.startswith(b'diff --git a/'):
                         m = gitre.match(line)
                         if m:
                             if gp:
                                 gitpatches.append(gp)
                             dst = m.group(2)
                             gp = patchmeta(dst)
                     elif gp:
                         if line.startswith(b'--- '):
                             gitpatches.append(gp)
                             gp = None
                             continue
                         if line.startswith(b'rename from '):
                             gp.op = b'RENAME'
                             gp.oldpath = line[12:]
                         elif line.startswith(b'rename to '):
                             gp.path = line[10:]
                         elif line.startswith(b'copy from '):
                             gp.op = b'COPY'
                             gp.oldpath = line[10:]
                         elif line.startswith(b'copy to '):
                             gp.path = line[8:]
                         elif line.startswith(b'deleted file'):
                             gp.op = b'DELETE'
                         elif line.startswith(b'new file mode '):
                             gp.op = b'ADD'
                             gp.setmode(int(line[-6:], 8))
                         elif line.startswith(b'new mode '):
                             gp.setmode(int(line[-6:], 8))
                         elif line.startswith(b'GIT binary patch'):
                             gp.binary = True
                 if gp:
                     gitpatches.append(gp)
                 return gitpatches
             class linereader(object):
                 # simple class to allow pushing lines back into the input stream
                 def __init__(self, fp):
                     self.fp = fp
                     self.buf = []
                 def push(self, line):
                     if line is not None:
                         self.buf.append(line)
                 def readline(self):
                     if self.buf:
                         l = self.buf[0]
                         del self.buf[0]
                         return l
                     return self.fp.readline()
                 def __iter__(self):
                     return iter(self.readline, b'')
             class abstractbackend(object):
                 def __init__(self, ui):
                     self.ui = ui
                 def getfile(self, fname):
                     """Return target file data and flags as a (data, (islink,
                     isexec)) tuple. Data is None if file is missing/deleted.
                     """
                     raise NotImplementedError
                 def setfile(self, fname, data, mode, copysource):
                     """Write data to target file fname and set its mode. mode is a
                     (islink, isexec) tuple. If data is None, the file content should
                     be left unchanged. If the file is modified after being copied,
                     copysource is set to the original file name.
                     """
                     raise NotImplementedError
                 def unlink(self, fname):
                     """Unlink target file."""
                     raise NotImplementedError
                 def writerej(self, fname, failed, total, lines):
                     """Write rejected lines for fname. total is the number of hunks
                     which failed to apply and total the total number of hunks for this
                     files.
                     """
                 def exists(self, fname):
                     raise NotImplementedError
                 def close(self):
                     raise NotImplementedError
             class fsbackend(abstractbackend):
                 def __init__(self, ui, basedir):
                     super(fsbackend, self).__init__(ui)
                     self.opener = vfsmod.vfs(basedir)
                 def getfile(self, fname):
                     if self.opener.islink(fname):
                         return (self.opener.readlink(fname), (True, False))
                     isexec = False
                     try:
                         isexec = self.opener.lstat(fname).st_mode & 0o100 != 0
                     except OSError as e:
                         if e.errno != errno.ENOENT:
                             raise
                     try:
                         return (self.opener.read(fname), (False, isexec))
                     except IOError as e:
                         if e.errno != errno.ENOENT:
                             raise
                         return None, None
                 def setfile(self, fname, data, mode, copysource):
                     islink, isexec = mode
                     if data is None:
                         self.opener.setflags(fname, islink, isexec)
                         return
                     if islink:
                         self.opener.symlink(data, fname)
                     else:
                         self.opener.write(fname, data)
                         if isexec:
                             self.opener.setflags(fname, False, True)
                 def unlink(self, fname):
                     rmdir = self.ui.configbool(b'experimental', b'removeemptydirs')
                     self.opener.unlinkpath(fname, ignoremissing=True, rmdir=rmdir)
                 def writerej(self, fname, failed, total, lines):
                     fname = fname + b".rej"
                     self.ui.warn(
                         _(b"%d out of %d hunks FAILED -- saving rejects to file %s\n")
                         % (failed, total, fname)
                     )
                     fp = self.opener(fname, b'w')
                     fp.writelines(lines)
                     fp.close()
                 def exists(self, fname):
                     return self.opener.lexists(fname)
             class workingbackend(fsbackend):
                 def __init__(self, ui, repo, similarity):
                     super(workingbackend, self).__init__(ui, repo.root)
                     self.repo = repo
                     self.similarity = similarity
                     self.removed = set()
                     self.changed = set()
                     self.copied = []
                 def _checkknown(self, fname):
                     if self.repo.dirstate[fname] == b'?' and self.exists(fname):
                         raise PatchError(_(b'cannot patch %s: file is not tracked') % fname)
                 def setfile(self, fname, data, mode, copysource):
                     self._checkknown(fname)
                     super(workingbackend, self).setfile(fname, data, mode, copysource)
                     if copysource is not None:
                         self.copied.append((copysource, fname))
                     self.changed.add(fname)
                 def unlink(self, fname):
                     self._checkknown(fname)
                     super(workingbackend, self).unlink(fname)
                     self.removed.add(fname)
                     self.changed.add(fname)
                 def close(self):
                     wctx = self.repo[None]
                     changed = set(self.changed)
                     for src, dst in self.copied:
                         scmutil.dirstatecopy(self.ui, self.repo, wctx, src, dst)
                     if self.removed:
                         wctx.forget(sorted(self.removed))
                         for f in self.removed:
                             if f not in self.repo.dirstate:
                                 # File was deleted and no longer belongs to the
                                 # dirstate, it was probably marked added then
                                 # deleted, and should not be considered by
                                 # marktouched().
                                 changed.discard(f)
                     if changed:
                         scmutil.marktouched(self.repo, changed, self.similarity)
                     return sorted(self.changed)
             class filestore(object):
                 def __init__(self, maxsize=None):
                     self.opener = None
                     self.files = {}
                     self.created = 0
                     self.maxsize = maxsize
                     if self.maxsize is None:
                         self.maxsize = 4 * (2 ** 20)
                     self.size = 0
                     self.data = {}
                 def setfile(self, fname, data, mode, copied=None):
                     if self.maxsize < 0 or (len(data) + self.size) <= self.maxsize:
                         self.data[fname] = (data, mode, copied)
                         self.size += len(data)
                     else:
                         if self.opener is None:
                             root = pycompat.mkdtemp(prefix=b'hg-patch-')
                             self.opener = vfsmod.vfs(root)
                         # Avoid filename issues with these simple names
                         fn = b'%d' % self.created
                         self.opener.write(fn, data)
                         self.created += 1
                         self.files[fname] = (fn, mode, copied)
                 def getfile(self, fname):
                     if fname in self.data:
                         return self.data[fname]
                     if not self.opener or fname not in self.files:
                         return None, None, None
                     fn, mode, copied = self.files[fname]
                     return self.opener.read(fn), mode, copied
                 def close(self):
                     if self.opener:
                         shutil.rmtree(self.opener.base)
             class repobackend(abstractbackend):
                 def __init__(self, ui, repo, ctx, store):
                     super(repobackend, self).__init__(ui)
                     self.repo = repo
                     self.ctx = ctx
                     self.store = store
                     self.changed = set()
                     self.removed = set()
                     self.copied = {}
                 def _checkknown(self, fname):
                     if fname not in self.ctx:
                         raise PatchError(_(b'cannot patch %s: file is not tracked') % fname)
                 def getfile(self, fname):
                     try:
                         fctx = self.ctx[fname]
                     except error.LookupError:
                         return None, None
                     flags = fctx.flags()
                     return fctx.data(), (b'l' in flags, b'x' in flags)
                 def setfile(self, fname, data, mode, copysource):
                     if copysource:
                         self._checkknown(copysource)
                     if data is None:
                         data = self.ctx[fname].data()
                     self.store.setfile(fname, data, mode, copysource)
                     self.changed.add(fname)
                     if copysource:
                         self.copied[fname] = copysource
                 def unlink(self, fname):
                     self._checkknown(fname)
                     self.removed.add(fname)
                 def exists(self, fname):
                     return fname in self.ctx
                 def close(self):
                     return self.changed | self.removed
             # @@ -start,len +start,len @@ or @@ -start +start @@ if len is 1
             unidesc = re.compile(br'@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@')
             contextdesc = re.compile(br'(?:---|\*\*\*) (\d+)(?:,(\d+))? (?:---|\*\*\*)')
             eolmodes = [b'strict', b'crlf', b'lf', b'auto']
             class patchfile(object):
                 def __init__(self, ui, gp, backend, store, eolmode=b'strict'):
                     self.fname = gp.path
                     self.eolmode = eolmode
                     self.eol = None
                     self.backend = backend
                     self.ui = ui
                     self.lines = []
                     self.exists = False
                     self.missing = True
                     self.mode = gp.mode
                     self.copysource = gp.oldpath
                     self.create = gp.op in (b'ADD', b'COPY', b'RENAME')
                     self.remove = gp.op == b'DELETE'
                     if self.copysource is None:
                         data, mode = backend.getfile(self.fname)
                     else:
                         data, mode = store.getfile(self.copysource)[:2]
                     if data is not None:
                         self.exists = self.copysource is None or backend.exists(self.fname)
                         self.missing = False
                         if data:
                             self.lines = mdiff.splitnewlines(data)
                         if self.mode is None:
                             self.mode = mode
                         if self.lines:
                             # Normalize line endings
                             if self.lines[0].endswith(b'\r\n'):
                                 self.eol = b'\r\n'
                             elif self.lines[0].endswith(b'\n'):
                                 self.eol = b'\n'
                             if eolmode != b'strict':
                                 nlines = []
                                 for l in self.lines:
                                     if l.endswith(b'\r\n'):
                                         l = l[:-2] + b'\n'
                                     nlines.append(l)
                                 self.lines = nlines
                     else:
                         if self.create:
                             self.missing = False
                         if self.mode is None:
                             self.mode = (False, False)
                     if self.missing:
                         self.ui.warn(_(b"unable to find '%s' for patching\n") % self.fname)
                         self.ui.warn(
                             _(
                                 b"(use '--prefix' to apply patch relative to the "
                                 b"current directory)\n"
                             )
                         )
                     self.hash = {}
                     self.dirty = 0
                     self.offset = 0
                     self.skew = 0
                     self.rej = []
                     self.fileprinted = False
                     self.printfile(False)
                     self.hunks = 0
                 def writelines(self, fname, lines, mode):
                     if self.eolmode == b'auto':
                         eol = self.eol
                     elif self.eolmode == b'crlf':
                         eol = b'\r\n'
                     else:
                         eol = b'\n'
                     if self.eolmode != b'strict' and eol and eol != b'\n':
                         rawlines = []
                         for l in lines:
                             if l and l.endswith(b'\n'):
                                 l = l[:-1] + eol
                             rawlines.append(l)
                         lines = rawlines
                     self.backend.setfile(fname, b''.join(lines), mode, self.copysource)
                 def printfile(self, warn):
                     if self.fileprinted:
                         return
                     if warn or self.ui.verbose:
                         self.fileprinted = True
                     s = _(b"patching file %s\n") % self.fname
                     if warn:
                         self.ui.warn(s)
                     else:
                         self.ui.note(s)
                 def findlines(self, l, linenum):
                     # looks through the hash and finds candidate lines.  The
                     # result is a list of line numbers sorted based on distance
                     # from linenum
                     cand = self.hash.get(l, [])
                     if len(cand) > 1:
                         # resort our list of potentials forward then back.
                         cand.sort(key=lambda x: abs(x - linenum))
                     return cand
                 def write_rej(self):
                     # our rejects are a little different from patch(1).  This always
                     # creates rejects in the same form as the original patch.  A file
                     # header is inserted so that you can run the reject through patch again
                     # without having to type the filename.
                     if not self.rej:
                         return
                     base = os.path.basename(self.fname)
                     lines = [b"--- %s\n+++ %s\n" % (base, base)]
                     for x in self.rej:
                         for l in x.hunk:
                             lines.append(l)
                             if l[-1:] != b'\n':
                                 lines.append(b"\n\\ No newline at end of file\n")
                     self.backend.writerej(self.fname, len(self.rej), self.hunks, lines)
                 def apply(self, h):
                     if not h.complete():
                         raise PatchError(
                             _(b"bad hunk #%d %s (%d %d %d %d)")
                             % (h.number, h.desc, len(h.a), h.lena, len(h.b), h.lenb)
                         )
                     self.hunks += 1
                     if self.missing:
                         self.rej.append(h)
                         return -1
                     if self.exists and self.create:
                         if self.copysource:
                             self.ui.warn(
                                 _(b"cannot create %s: destination already exists\n")
                                 % self.fname
                             )
                         else:
                             self.ui.warn(_(b"file %s already exists\n") % self.fname)
                         self.rej.append(h)
                         return -1
                     if isinstance(h, binhunk):
                         if self.remove:
                             self.backend.unlink(self.fname)
                         else:
                             l = h.new(self.lines)
                             self.lines[:] = l
                             self.offset += len(l)
                             self.dirty = True
                         return 0
                     horig = h
                     if (
                         self.eolmode in (b'crlf', b'lf')
                         or self.eolmode == b'auto'
                         and self.eol
                     ):
                         # If new eols are going to be normalized, then normalize
                         # hunk data before patching. Otherwise, preserve input
                         # line-endings.
                         h = h.getnormalized()
                     # fast case first, no offsets, no fuzz
                     old, oldstart, new, newstart = h.fuzzit(0, False)
                     oldstart += self.offset
                     orig_start = oldstart
                     # if there's skew we want to emit the "(offset %d lines)" even
                     # when the hunk cleanly applies at start + skew, so skip the
                     # fast case code
                     if self.skew == 0 and diffhelper.testhunk(old, self.lines, oldstart):
                         if self.remove:
                             self.backend.unlink(self.fname)
                         else:
                             self.lines[oldstart : oldstart + len(old)] = new
                             self.offset += len(new) - len(old)
                             self.dirty = True
                         return 0
                     # ok, we couldn't match the hunk. Lets look for offsets and fuzz it
                     self.hash = {}
                     for x, s in enumerate(self.lines):
                         self.hash.setdefault(s, []).append(x)
                     for fuzzlen in pycompat.xrange(
                         self.ui.configint(b"patch", b"fuzz") + 1
                     ):
                         for toponly in [True, False]:
                             old, oldstart, new, newstart = h.fuzzit(fuzzlen, toponly)
                             oldstart = oldstart + self.offset + self.skew
                             oldstart = min(oldstart, len(self.lines))
                             if old:
                                 cand = self.findlines(old[0][1:], oldstart)
                             else:
                                 # Only adding lines with no or fuzzed context, just
                                 # take the skew in account
                                 cand = [oldstart]
                             for l in cand:
                                 if not old or diffhelper.testhunk(old, self.lines, l):
                                     self.lines[l : l + len(old)] = new
                                     self.offset += len(new) - len(old)
                                     self.skew = l - orig_start
                                     self.dirty = True
                                     offset = l - orig_start - fuzzlen
                                     if fuzzlen:
                                         msg = _(
                                             b"Hunk #%d succeeded at %d "
                                             b"with fuzz %d "
                                             b"(offset %d lines).\n"
                                         )
                                         self.printfile(True)
                                         self.ui.warn(
                                             msg % (h.number, l + 1, fuzzlen, offset)
                                         )
                                     else:
                                         msg = _(
                                             b"Hunk #%d succeeded at %d "
                                             b"(offset %d lines).\n"
                                         )
                                         self.ui.note(msg % (h.number, l + 1, offset))
                                     return fuzzlen
                     self.printfile(True)
                     self.ui.warn(_(b"Hunk #%d FAILED at %d\n") % (h.number, orig_start))
                     self.rej.append(horig)
                     return -1
                 def close(self):
                     if self.dirty:
                         self.writelines(self.fname, self.lines, self.mode)
                     self.write_rej()
                     return len(self.rej)
             class header(object):
                 """patch header
                 """
                 diffgit_re = re.compile(b'diff --git a/(.*) b/(.*)$')
                 diff_re = re.compile(b'diff -r .* (.*)$')
                 allhunks_re = re.compile(b'(?:index|deleted file) ')
                 pretty_re = re.compile(b'(?:new file|deleted file) ')
                 special_re = re.compile(b'(?:index|deleted|copy|rename|new mode) ')
                 newfile_re = re.compile(b'(?:new file|copy to|rename to)')
                 def __init__(self, header):
                     self.header = header
                     self.hunks = []
                 def binary(self):
                     return any(h.startswith(b'index ') for h in self.header)
                 def pretty(self, fp):
                     for h in self.header:
                         if h.startswith(b'index '):
                             fp.write(_(b'this modifies a binary file (all or nothing)\n'))
                             break
                         if self.pretty_re.match(h):
                             fp.write(h)
                             if self.binary():
                                 fp.write(_(b'this is a binary file\n'))
                             break
                         if h.startswith(b'---'):
                             fp.write(
                                 _(b'%d hunks, %d lines changed\n')
                                 % (
                                     len(self.hunks),
                                     sum([max(h.added, h.removed) for h in self.hunks]),
                                 )
                             )
                             break
                         fp.write(h)
                 def write(self, fp):
                     fp.write(b''.join(self.header))
                 def allhunks(self):
                     return any(self.allhunks_re.match(h) for h in self.header)
                 def files(self):
                     match = self.diffgit_re.match(self.header[0])
                     if match:
                         fromfile, tofile = match.groups()
                         if fromfile == tofile:
                             return [fromfile]
                         return [fromfile, tofile]
                     else:
                         return self.diff_re.match(self.header[0]).groups()
                 def filename(self):
                     return self.files()[-1]
                 def __repr__(self):
                     return '<header %s>' % (
                         ' '.join(pycompat.rapply(pycompat.fsdecode, self.files()))
                     )
                 def isnewfile(self):
                     return any(self.newfile_re.match(h) for h in self.header)
                 def special(self):
                     # Special files are shown only at the header level and not at the hunk
                     # level for example a file that has been deleted is a special file.
                     # The user cannot change the content of the operation, in the case of
                     # the deleted file he has to take the deletion or not take it, he
                     # cannot take some of it.
                     # Newly added files are special if they are empty, they are not special
                     # if they have some content as we want to be able to change it
                     nocontent = len(self.header) == 2
                     emptynewfile = self.isnewfile() and nocontent
                     return emptynewfile or any(
                         self.special_re.match(h) for h in self.header
                     )
             class recordhunk(object):
                 """patch hunk
                 XXX shouldn't we merge this with the other hunk class?
                 """
                 def __init__(
                     self,
                     header,
                     fromline,
                     toline,
                     proc,
                     before,
                     hunk,
                     after,
                     maxcontext=None,
                 ):
                     def trimcontext(lines, reverse=False):
                         if maxcontext is not None:
                             delta = len(lines) - maxcontext
                             if delta > 0:
                                 if reverse:
                                     return delta, lines[delta:]
                                 else:
                                     return delta, lines[:maxcontext]
                         return 0, lines
                     self.header = header
                     trimedbefore, self.before = trimcontext(before, True)
                     self.fromline = fromline + trimedbefore
                     self.toline = toline + trimedbefore
                     _trimedafter, self.after = trimcontext(after, False)
                     self.proc = proc
                     self.hunk = hunk
                     self.added, self.removed = self.countchanges(self.hunk)
                 def __eq__(self, v):
                     if not isinstance(v, recordhunk):
                         return False
                     return (
                         (v.hunk == self.hunk)
                         and (v.proc == self.proc)
                         and (self.fromline == v.fromline)
                         and (self.header.files() == v.header.files())
                     )
                 def __hash__(self):
                     return hash(
                         (
                             tuple(self.hunk),
                             tuple(self.header.files()),
                             self.fromline,
                             self.proc,
                         )
                     )
                 def countchanges(self, hunk):
                     """hunk -> (n+,n-)"""
                     add = len([h for h in hunk if h.startswith(b'+')])
                     rem = len([h for h in hunk if h.startswith(b'-')])
                     return add, rem
                 def reversehunk(self):
                     """return another recordhunk which is the reverse of the hunk
                     If this hunk is diff(A, B), the returned hunk is diff(B, A). To do
                     that, swap fromline/toline and +/- signs while keep other things
                     unchanged.
                     """
                     m = {b'+': b'-', b'-': b'+', b'\\': b'\\'}
                     hunk = [b'%s%s' % (m[l[0:1]], l[1:]) for l in self.hunk]
                     return recordhunk(
                         self.header,
                         self.toline,
                         self.fromline,
                         self.proc,
                         self.before,
                         hunk,
                         self.after,
                     )
                 def write(self, fp):
                     delta = len(self.before) + len(self.after)
                     if self.after and self.after[-1] == b'\\ No newline at end of file\n':
                         delta -= 1
                     fromlen = delta + self.removed
                     tolen = delta + self.added
                     fp.write(
                         b'@@ -%d,%d +%d,%d @@%s\n'
                         % (
                             self.fromline,
                             fromlen,
                             self.toline,
                             tolen,
                             self.proc and (b' ' + self.proc),
                         )
                     )
                     fp.write(b''.join(self.before + self.hunk + self.after))
                 pretty = write
                 def filename(self):
                     return self.header.filename()
+                @encoding.strmethod
                 def __repr__(self):
                     return b'<hunk %r@%d>' % (self.filename(), self.fromline)
             def getmessages():
                 return {
                     b'multiple': {
                         b'apply': _(b"apply change %d/%d to '%s'?"),
                         b'discard': _(b"discard change %d/%d to '%s'?"),
                         b'keep': _(b"keep change %d/%d to '%s'?"),
                         b'record': _(b"record change %d/%d to '%s'?"),
                     },
                     b'single': {
                         b'apply': _(b"apply this change to '%s'?"),
                         b'discard': _(b"discard this change to '%s'?"),
                         b'keep': _(b"keep this change to '%s'?"),
                         b'record': _(b"record this change to '%s'?"),
                     },
                     b'help': {
                         b'apply': _(
                             b'[Ynesfdaq?]'
                             b'$$ &Yes, apply this change'
                             b'$$ &No, skip this change'
                             b'$$ &Edit this change manually'
                             b'$$ &Skip remaining changes to this file'
                             b'$$ Apply remaining changes to this &file'
                             b'$$ &Done, skip remaining changes and files'
                             b'$$ Apply &all changes to all remaining files'
                             b'$$ &Quit, applying no changes'
                             b'$$ &? (display help)'
                         ),
                         b'discard': _(
                             b'[Ynesfdaq?]'
                             b'$$ &Yes, discard this change'
                             b'$$ &No, skip this change'
                             b'$$ &Edit this change manually'
                             b'$$ &Skip remaining changes to this file'
                             b'$$ Discard remaining changes to this &file'
                             b'$$ &Done, skip remaining changes and files'
                             b'$$ Discard &all changes to all remaining files'
                             b'$$ &Quit, discarding no changes'
                             b'$$ &? (display help)'
                         ),
                         b'keep': _(
                             b'[Ynesfdaq?]'
                             b'$$ &Yes, keep this change'
                             b'$$ &No, skip this change'
                             b'$$ &Edit this change manually'
                             b'$$ &Skip remaining changes to this file'
                             b'$$ Keep remaining changes to this &file'
                             b'$$ &Done, skip remaining changes and files'
                             b'$$ Keep &all changes to all remaining files'
                             b'$$ &Quit, keeping all changes'
                             b'$$ &? (display help)'
                         ),
                         b'record': _(
                             b'[Ynesfdaq?]'
                             b'$$ &Yes, record this change'
                             b'$$ &No, skip this change'
                             b'$$ &Edit this change manually'
                             b'$$ &Skip remaining changes to this file'
                             b'$$ Record remaining changes to this &file'
                             b'$$ &Done, skip remaining changes and files'
                             b'$$ Record &all changes to all remaining files'
                             b'$$ &Quit, recording no changes'
                             b'$$ &? (display help)'
                         ),
                     },
                 }
             def filterpatch(ui, headers, match, operation=None):
                 """Interactively filter patch chunks into applied-only chunks"""
                 messages = getmessages()
                 if operation is None:
                     operation = b'record'
                 def prompt(skipfile, skipall, query, chunk):
                     """prompt query, and process base inputs
                     - y/n for the rest of file
                     - y/n for the rest
                     - ? (help)
                     - q (quit)
                     Return True/False and possibly updated skipfile and skipall.
                     """
                     newpatches = None
                     if skipall is not None:
                         return skipall, skipfile, skipall, newpatches
                     if skipfile is not None:
                         return skipfile, skipfile, skipall, newpatches
                     while True:
                         resps = messages[b'help'][operation]
                         # IMPORTANT: keep the last line of this prompt short (<40 english
                         # chars is a good target) because of issue6158.
                         r = ui.promptchoice(b"%s\n(enter ? for help) %s" % (query, resps))
                         ui.write(b"\n")
                         if r == 8:  # ?
                             for c, t in ui.extractchoices(resps)[1]:
                                 ui.write(b'%s - %s\n' % (c, encoding.lower(t)))
                             continue
                         elif r == 0:  # yes
                             ret = True
                         elif r == 1:  # no
                             ret = False
                         elif r == 2:  # Edit patch
                             if chunk is None:
                                 ui.write(_(b'cannot edit patch for whole file'))
                                 ui.write(b"\n")
                                 continue
                             if chunk.header.binary():
                                 ui.write(_(b'cannot edit patch for binary file'))
                                 ui.write(b"\n")
                                 continue
                             # Patch comment based on the Git one (based on comment at end of
                             # https://mercurial-scm.org/wiki/RecordExtension)
                             phelp = b'---' + _(
                                 """
             To remove '-' lines, make them ' ' lines (context).
             To remove '+' lines, delete them.
             Lines starting with # will be removed from the patch.
             If the patch applies cleanly, the edited hunk will immediately be
             added to the record list. If it does not apply cleanly, a rejects
             file will be generated: you can use that when you try again. If
             all lines of the hunk are removed, then the edit is aborted and
             the hunk is left unchanged.
             """
                             )
                             (patchfd, patchfn) = pycompat.mkstemp(
                                 prefix=b"hg-editor-", suffix=b".diff"
                             )
                             ncpatchfp = None
                             try:
                                 # Write the initial patch
                                 f = util.nativeeolwriter(os.fdopen(patchfd, 'wb'))
                                 chunk.header.write(f)
                                 chunk.write(f)
                                 f.write(
                                     b''.join(
                                         [b'# ' + i + b'\n' for i in phelp.splitlines()]
                                     )
                                 )
                                 f.close()
                                 # Start the editor and wait for it to complete
                                 editor = ui.geteditor()
                                 ret = ui.system(
                                     b"%s \"%s\"" % (editor, patchfn),
                                     environ={b'HGUSER': ui.username()},
                                     blockedtag=b'filterpatch',
                                 )
                                 if ret != 0:
                                     ui.warn(_(b"editor exited with exit code %d\n") % ret)
                                     continue
                                 # Remove comment lines
                                 patchfp = open(patchfn, 'rb')
                                 ncpatchfp = stringio()
                                 for line in util.iterfile(patchfp):
                                     line = util.fromnativeeol(line)
                                     if not line.startswith(b'#'):
                                         ncpatchfp.write(line)
                                 patchfp.close()
                                 ncpatchfp.seek(0)
                                 newpatches = parsepatch(ncpatchfp)
                             finally:
                                 os.unlink(patchfn)
                                 del ncpatchfp
                             # Signal that the chunk shouldn't be applied as-is, but
                             # provide the new patch to be used instead.
                             ret = False
                         elif r == 3:  # Skip
                             ret = skipfile = False
                         elif r == 4:  # file (Record remaining)
                             ret = skipfile = True
                         elif r == 5:  # done, skip remaining
                             ret = skipall = False
                         elif r == 6:  # all
                             ret = skipall = True
                         elif r == 7:  # quit
                             raise error.Abort(_(b'user quit'))
                         return ret, skipfile, skipall, newpatches
                 seen = set()
                 applied = {}  # 'filename' -> [] of chunks
                 skipfile, skipall = None, None
                 pos, total = 1, sum(len(h.hunks) for h in headers)
                 for h in headers:
                     pos += len(h.hunks)
                     skipfile = None
                     fixoffset = 0
                     hdr = b''.join(h.header)
                     if hdr in seen:
                         continue
                     seen.add(hdr)
                     if skipall is None:
                         h.pretty(ui)
                     files = h.files()
                     msg = _(b'examine changes to %s?') % _(b' and ').join(
                         b"'%s'" % f for f in files
                     )
                     if all(match.exact(f) for f in files):
                         r, skipall, np = True, None, None
                     else:
                         r, skipfile, skipall, np = prompt(skipfile, skipall, msg, None)
                     if not r:
                         continue
                     applied[h.filename()] = [h]
                     if h.allhunks():
                         applied[h.filename()] += h.hunks
                         continue
                     for i, chunk in enumerate(h.hunks):
                         if skipfile is None and skipall is None:
                             chunk.pretty(ui)
                         if total == 1:
                             msg = messages[b'single'][operation] % chunk.filename()
                         else:
                             idx = pos - len(h.hunks) + i
                             msg = messages[b'multiple'][operation] % (
                                 idx,
                                 total,
                                 chunk.filename(),
                             )
                         r, skipfile, skipall, newpatches = prompt(
                             skipfile, skipall, msg, chunk
                         )
                         if r:
                             if fixoffset:
                                 chunk = copy.copy(chunk)
                                 chunk.toline += fixoffset
                             applied[chunk.filename()].append(chunk)
                         elif newpatches is not None:
                             for newpatch in newpatches:
                                 for newhunk in newpatch.hunks:
                                     if fixoffset:
                                         newhunk.toline += fixoffset
                                     applied[newhunk.filename()].append(newhunk)
                         else:
                             fixoffset += chunk.removed - chunk.added
                 return (
                     sum(
                         [
                             h
                             for h in pycompat.itervalues(applied)
                             if h[0].special() or len(h) > 1
                         ],
                         [],
                     ),
                     {},
                 )
             class hunk(object):
                 def __init__(self, desc, num, lr, context):
                     self.number = num
                     self.desc = desc
                     self.hunk = [desc]
                     self.a = []
                     self.b = []
                     self.starta = self.lena = None
                     self.startb = self.lenb = None
                     if lr is not None:
                         if context:
                             self.read_context_hunk(lr)
                         else:
                             self.read_unified_hunk(lr)
                 def getnormalized(self):
                     """Return a copy with line endings normalized to LF."""
                     def normalize(lines):
                         nlines = []
                         for line in lines:
                             if line.endswith(b'\r\n'):
                                 line = line[:-2] + b'\n'
                             nlines.append(line)
                         return nlines
                     # Dummy object, it is rebuilt manually
                     nh = hunk(self.desc, self.number, None, None)
                     nh.number = self.number
                     nh.desc = self.desc
                     nh.hunk = self.hunk
                     nh.a = normalize(self.a)
                     nh.b = normalize(self.b)
                     nh.starta = self.starta
                     nh.startb = self.startb
                     nh.lena = self.lena
                     nh.lenb = self.lenb
                     return nh
                 def read_unified_hunk(self, lr):
                     m = unidesc.match(self.desc)
                     if not m:
                         raise PatchError(_(b"bad hunk #%d") % self.number)
                     self.starta, self.lena, self.startb, self.lenb = m.groups()
                     if self.lena is None:
                         self.lena = 1
                     else:
                         self.lena = int(self.lena)
                     if self.lenb is None:
                         self.lenb = 1
                     else:
                         self.lenb = int(self.lenb)
                     self.starta = int(self.starta)
                     self.startb = int(self.startb)
                     try:
                         diffhelper.addlines(
                             lr, self.hunk, self.lena, self.lenb, self.a, self.b
                         )
                     except error.ParseError as e:
                         raise PatchError(_(b"bad hunk #%d: %s") % (self.number, e))
                     # if we hit eof before finishing out the hunk, the last line will
                     # be zero length.  Lets try to fix it up.
                     while len(self.hunk[-1]) == 0:
                         del self.hunk[-1]
                         del self.a[-1]
                         del self.b[-1]
                         self.lena -= 1
                         self.lenb -= 1
                     self._fixnewline(lr)
                 def read_context_hunk(self, lr):
                     self.desc = lr.readline()
                     m = contextdesc.match(self.desc)
                     if not m:
                         raise PatchError(_(b"bad hunk #%d") % self.number)
                     self.starta, aend = m.groups()
                     self.starta = int(self.starta)
                     if aend is None:
                         aend = self.starta
                     self.lena = int(aend) - self.starta
                     if self.starta:
                         self.lena += 1
                     for x in pycompat.xrange(self.lena):
                         l = lr.readline()
                         if l.startswith(b'---'):
                             # lines addition, old block is empty
                             lr.push(l)
                             break
                         s = l[2:]
                         if l.startswith(b'- ') or l.startswith(b'! '):
                             u = b'-' + s
                         elif l.startswith(b'  '):
                             u = b' ' + s
                         else:
                             raise PatchError(
                                 _(b"bad hunk #%d old text line %d") % (self.number, x)
                             )
                         self.a.append(u)
                         self.hunk.append(u)
                     l = lr.readline()
                     if l.startswith(br'\ '):
                         s = self.a[-1][:-1]
                         self.a[-1] = s
                         self.hunk[-1] = s
                         l = lr.readline()
                     m = contextdesc.match(l)
                     if not m:
                         raise PatchError(_(b"bad hunk #%d") % self.number)
                     self.startb, bend = m.groups()
                     self.startb = int(self.startb)
                     if bend is None:
                         bend = self.startb
                     self.lenb = int(bend) - self.startb
                     if self.startb:
                         self.lenb += 1
                     hunki = 1
                     for x in pycompat.xrange(self.lenb):
                         l = lr.readline()
                         if l.startswith(br'\ '):
                             # XXX: the only way to hit this is with an invalid line range.
                             # The no-eol marker is not counted in the line range, but I
                             # guess there are diff(1) out there which behave differently.
                             s = self.b[-1][:-1]
                             self.b[-1] = s
                             self.hunk[hunki - 1] = s
                             continue
                         if not l:
                             # line deletions, new block is empty and we hit EOF
                             lr.push(l)
                             break
                         s = l[2:]
                         if l.startswith(b'+ ') or l.startswith(b'! '):
                             u = b'+' + s
                         elif l.startswith(b'  '):
                             u = b' ' + s
                         elif len(self.b) == 0:
                             # line deletions, new block is empty
                             lr.push(l)
                             break
                         else:
                             raise PatchError(
                                 _(b"bad hunk #%d old text line %d") % (self.number, x)
                             )
                         self.b.append(s)
                         while True:
                             if hunki >= len(self.hunk):
                                 h = b""
                             else:
                                 h = self.hunk[hunki]
                             hunki += 1
                             if h == u:
                                 break
                             elif h.startswith(b'-'):
                                 continue
                             else:
                                 self.hunk.insert(hunki - 1, u)
                                 break
                     if not self.a:
                         # this happens when lines were only added to the hunk
                         for x in self.hunk:
                             if x.startswith(b'-') or x.startswith(b' '):
                                 self.a.append(x)
                     if not self.b:
                         # this happens when lines were only deleted from the hunk
                         for x in self.hunk:
                             if x.startswith(b'+') or x.startswith(b' '):
                                 self.b.append(x[1:])
                     # @@ -start,len +start,len @@
                     self.desc = b"@@ -%d,%d +%d,%d @@\n" % (
                         self.starta,
                         self.lena,
                         self.startb,
                         self.lenb,
                     )
                     self.hunk[0] = self.desc
                     self._fixnewline(lr)
                 def _fixnewline(self, lr):
                     l = lr.readline()
                     if l.startswith(br'\ '):
                         diffhelper.fixnewline(self.hunk, self.a, self.b)
                     else:
                         lr.push(l)
                 def complete(self):
                     return len(self.a) == self.lena and len(self.b) == self.lenb
                 def _fuzzit(self, old, new, fuzz, toponly):
                     # this removes context lines from the top and bottom of list 'l'.  It
                     # checks the hunk to make sure only context lines are removed, and then
                     # returns a new shortened list of lines.
                     fuzz = min(fuzz, len(old))
                     if fuzz:
                         top = 0
                         bot = 0
                         hlen = len(self.hunk)
                         for x in pycompat.xrange(hlen - 1):
                             # the hunk starts with the @@ line, so use x+1
                             if self.hunk[x + 1].startswith(b' '):
                                 top += 1
                             else:
                                 break
                         if not toponly:
                             for x in pycompat.xrange(hlen - 1):
                                 if self.hunk[hlen - bot - 1].startswith(b' '):
                                     bot += 1
                                 else:
                                     break
                         bot = min(fuzz, bot)
                         top = min(fuzz, top)
                         return old[top : len(old) - bot], new[top : len(new) - bot], top
                     return old, new, 0
                 def fuzzit(self, fuzz, toponly):
                     old, new, top = self._fuzzit(self.a, self.b, fuzz, toponly)
                     oldstart = self.starta + top
                     newstart = self.startb + top
                     # zero length hunk ranges already have their start decremented
                     if self.lena and oldstart > 0:
                         oldstart -= 1
                     if self.lenb and newstart > 0:
                         newstart -= 1
                     return old, oldstart, new, newstart
             class binhunk(object):
                 """A binary patch file."""
                 def __init__(self, lr, fname):
                     self.text = None
                     self.delta = False
                     self.hunk = [b'GIT binary patch\n']
                     self._fname = fname
                     self._read(lr)
                 def complete(self):
                     return self.text is not None
                 def new(self, lines):
                     if self.delta:
                         return [applybindelta(self.text, b''.join(lines))]
                     return [self.text]
                 def _read(self, lr):
                     def getline(lr, hunk):
                         l = lr.readline()
                         hunk.append(l)
                         return l.rstrip(b'\r\n')
                     while True:
                         line = getline(lr, self.hunk)
                         if not line:
                             raise PatchError(
                                 _(b'could not extract "%s" binary data') % self._fname
                             )
                         if line.startswith(b'literal '):
                             size = int(line[8:].rstrip())
                             break
                         if line.startswith(b'delta '):
                             size = int(line[6:].rstrip())
                             self.delta = True
                             break
                     dec = []
                     line = getline(lr, self.hunk)
                     while len(line) > 1:
                         l = line[0:1]
                         if l <= b'Z' and l >= b'A':
                             l = ord(l) - ord(b'A') + 1
                         else:
                             l = ord(l) - ord(b'a') + 27
                         try:
                             dec.append(util.b85decode(line[1:])[:l])
                         except ValueError as e:
                             raise PatchError(
                                 _(b'could not decode "%s" binary patch: %s')
                                 % (self._fname, stringutil.forcebytestr(e))
                             )
                         line = getline(lr, self.hunk)
                     text = zlib.decompress(b''.join(dec))
                     if len(text) != size:
                         raise PatchError(
                             _(b'"%s" length is %d bytes, should be %d')
                             % (self._fname, len(text), size)
                         )
                     self.text = text
             def parsefilename(str):
                 # --- filename \t|space stuff
                 s = str[4:].rstrip(b'\r\n')
                 i = s.find(b'\t')
                 if i < 0:
                     i = s.find(b' ')
                     if i < 0:
                         return s
                 return s[:i]
             def reversehunks(hunks):
                 '''reverse the signs in the hunks given as argument
                 This function operates on hunks coming out of patch.filterpatch, that is
                 a list of the form: [header1, hunk1, hunk2, header2...]. Example usage:
                 >>> rawpatch = b"""diff --git a/folder1/g b/folder1/g
                 ... --- a/folder1/g
                 ... +++ b/folder1/g
                 ... @@ -1,7 +1,7 @@
                 ... +firstline
                 ...  c
                 ...  1
                 ...  2
                 ... + 3
                 ... -4
                 ...  5
                 ...  d
                 ... +lastline"""
                 >>> hunks = parsepatch([rawpatch])
                 >>> hunkscomingfromfilterpatch = []
                 >>> for h in hunks:
                 ...     hunkscomingfromfilterpatch.append(h)
                 ...     hunkscomingfromfilterpatch.extend(h.hunks)
                 >>> reversedhunks = reversehunks(hunkscomingfromfilterpatch)
                 >>> from . import util
                 >>> fp = util.stringio()
                 >>> for c in reversedhunks:
                 ...      c.write(fp)
                 >>> fp.seek(0) or None
                 >>> reversedpatch = fp.read()
                 >>> print(pycompat.sysstr(reversedpatch))
                 diff --git a/folder1/g b/folder1/g
                 --- a/folder1/g
                 +++ b/folder1/g
                 @@ -1,4 +1,3 @@
                 -firstline
                  c
                 @@ -2,6 +1,6 @@
                  c
                 - 3
                 +4
                  d
                 @@ -6,3 +5,2 @@
                  d
                 -lastline
                 '''
                 newhunks = []
                 for c in hunks:
                     if util.safehasattr(c, b'reversehunk'):
                         c = c.reversehunk()
                     newhunks.append(c)
                 return newhunks
             def parsepatch(originalchunks, maxcontext=None):
                 """patch -> [] of headers -> [] of hunks
                 If maxcontext is not None, trim context lines if necessary.
                 >>> rawpatch = b'''diff --git a/folder1/g b/folder1/g
                 ... --- a/folder1/g
                 ... +++ b/folder1/g
                 ... @@ -1,8 +1,10 @@
                 ...  1
                 ...  2
                 ... -3
                 ...  4
                 ...  5
                 ...  6
                 ... +6.1
                 ... +6.2
                 ...  7
                 ...  8
                 ... +9'''
                 >>> out = util.stringio()
                 >>> headers = parsepatch([rawpatch], maxcontext=1)
                 >>> for header in headers:
                 ...     header.write(out)
                 ...     for hunk in header.hunks:
                 ...         hunk.write(out)
                 >>> print(pycompat.sysstr(out.getvalue()))
                 diff --git a/folder1/g b/folder1/g
                 --- a/folder1/g
                 +++ b/folder1/g
                 @@ -2,3 +2,2 @@
                 -3
                 @@ -6,2 +5,4 @@
                 +6.1
                 +6.2
                 @@ -8,1 +9,2 @@
                 +9
                 """
                 class parser(object):
                     """patch parsing state machine"""
                     def __init__(self):
                         self.fromline = 0
                         self.toline = 0
                         self.proc = b''
                         self.header = None
                         self.context = []
                         self.before = []
                         self.hunk = []
                         self.headers = []
                     def addrange(self, limits):
                         self.addcontext([])
                         fromstart, fromend, tostart, toend, proc = limits
                         self.fromline = int(fromstart)
                         self.toline = int(tostart)
                         self.proc = proc
                     def addcontext(self, context):
                         if self.hunk:
                             h = recordhunk(
                                 self.header,
                                 self.fromline,
                                 self.toline,
                                 self.proc,
                                 self.before,
                                 self.hunk,
                                 context,
                                 maxcontext,
                             )
                             self.header.hunks.append(h)
                             self.fromline += len(self.before) + h.removed
                             self.toline += len(self.before) + h.added
                             self.before = []
                             self.hunk = []
                         self.context = context
                     def addhunk(self, hunk):
                         if self.context:
                             self.before = self.context
                             self.context = []
                         if self.hunk:
                             self.addcontext([])
                         self.hunk = hunk
                     def newfile(self, hdr):
                         self.addcontext([])
                         h = header(hdr)
                         self.headers.append(h)
                         self.header = h
                     def addother(self, line):
                         pass  # 'other' lines are ignored
                     def finished(self):
                         self.addcontext([])
                         return self.headers
                     transitions = {
                         b'file': {
                             b'context': addcontext,
                             b'file': newfile,
                             b'hunk': addhunk,
                             b'range': addrange,
                         },
                         b'context': {
                             b'file': newfile,
                             b'hunk': addhunk,
                             b'range': addrange,
                             b'other': addother,
                         },
                         b'hunk': {
                             b'context': addcontext,
                             b'file': newfile,
                             b'range': addrange,
                         },
                         b'range': {b'context': addcontext, b'hunk': addhunk},
                         b'other': {b'other': addother},
                     }
                 p = parser()
                 fp = stringio()
                 fp.write(b''.join(originalchunks))
                 fp.seek(0)
                 state = b'context'
                 for newstate, data in scanpatch(fp):
                     try:
                         p.transitions[state][newstate](p, data)
                     except KeyError:
                         raise PatchError(
                             b'unhandled transition: %s -> %s' % (state, newstate)
                         )
                     state = newstate
                 del fp
                 return p.finished()
             def pathtransform(path, strip, prefix):
                 '''turn a path from a patch into a path suitable for the repository
                 prefix, if not empty, is expected to be normalized with a / at the end.
                 Returns (stripped components, path in repository).
                 >>> pathtransform(b'a/b/c', 0, b'')
                 ('', 'a/b/c')
                 >>> pathtransform(b'   a/b/c   ', 0, b'')
                 ('', '   a/b/c')
                 >>> pathtransform(b'   a/b/c   ', 2, b'')
                 ('a/b/', 'c')
                 >>> pathtransform(b'a/b/c', 0, b'd/e/')
                 ('', 'd/e/a/b/c')
                 >>> pathtransform(b'   a//b/c   ', 2, b'd/e/')
                 ('a//b/', 'd/e/c')
                 >>> pathtransform(b'a/b/c', 3, b'')
                 Traceback (most recent call last):
                 PatchError: unable to strip away 1 of 3 dirs from a/b/c
                 '''
                 pathlen = len(path)
                 i = 0
                 if strip == 0:
                     return b'', prefix + path.rstrip()
                 count = strip
                 while count > 0:
                     i = path.find(b'/', i)
                     if i == -1:
                         raise PatchError(
                             _(b"unable to strip away %d of %d dirs from %s")
                             % (count, strip, path)
                         )
                     i += 1
                     # consume '//' in the path
                     while i < pathlen - 1 and path[i : i + 1] == b'/':
                         i += 1
                     count -= 1
                 return path[:i].lstrip(), prefix + path[i:].rstrip()
             def makepatchmeta(backend, afile_orig, bfile_orig, hunk, strip, prefix):
                 nulla = afile_orig == b"/dev/null"
                 nullb = bfile_orig == b"/dev/null"
                 create = nulla and hunk.starta == 0 and hunk.lena == 0
                 remove = nullb and hunk.startb == 0 and hunk.lenb == 0
                 abase, afile = pathtransform(afile_orig, strip, prefix)
                 gooda = not nulla and backend.exists(afile)
                 bbase, bfile = pathtransform(bfile_orig, strip, prefix)
                 if afile == bfile:
                     goodb = gooda
                 else:
                     goodb = not nullb and backend.exists(bfile)
                 missing = not goodb and not gooda and not create
                 # some diff programs apparently produce patches where the afile is
                 # not /dev/null, but afile starts with bfile
                 abasedir = afile[: afile.rfind(b'/') + 1]
                 bbasedir = bfile[: bfile.rfind(b'/') + 1]
                 if (
                     missing
                     and abasedir == bbasedir
                     and afile.startswith(bfile)
                     and hunk.starta == 0
                     and hunk.lena == 0
                 ):
                     create = True
                     missing = False
                 # If afile is "a/b/foo" and bfile is "a/b/foo.orig" we assume the
                 # diff is between a file and its backup. In this case, the original
                 # file should be patched (see original mpatch code).
                 isbackup = abase == bbase and bfile.startswith(afile)
                 fname = None
                 if not missing:
                     if gooda and goodb:
                         if isbackup:
                             fname = afile
                         else:
                             fname = bfile
                     elif gooda:
                         fname = afile
                 if not fname:
                     if not nullb:
                         if isbackup:
                             fname = afile
                         else:
                             fname = bfile
                     elif not nulla:
                         fname = afile
                     else:
                         raise PatchError(_(b"undefined source and destination files"))
                 gp = patchmeta(fname)
                 if create:
                     gp.op = b'ADD'
                 elif remove:
                     gp.op = b'DELETE'
                 return gp
             def scanpatch(fp):
                 """like patch.iterhunks, but yield different events
                 - ('file',    [header_lines + fromfile + tofile])
                 - ('context', [context_lines])
                 - ('hunk',    [hunk_lines])
                 - ('range',   (-start,len, +start,len, proc))
                 """
                 lines_re = re.compile(br'@@ -(\d+),(\d+) \+(\d+),(\d+) @@\s*(.*)')
                 lr = linereader(fp)
                 def scanwhile(first, p):
                     """scan lr while predicate holds"""
                     lines = [first]
                     for line in iter(lr.readline, b''):
                         if p(line):
                             lines.append(line)
                         else:
                             lr.push(line)
                             break
                     return lines
                 for line in iter(lr.readline, b''):
                     if line.startswith(b'diff --git a/') or line.startswith(b'diff -r '):
                         def notheader(line):
                             s = line.split(None, 1)
                             return not s or s[0] not in (b'---', b'diff')
                         header = scanwhile(line, notheader)
                         fromfile = lr.readline()
                         if fromfile.startswith(b'---'):
                             tofile = lr.readline()
                             header += [fromfile, tofile]
                         else:
                             lr.push(fromfile)
                         yield b'file', header
                     elif line.startswith(b' '):
                         cs = (b' ', b'\\')
                         yield b'context', scanwhile(line, lambda l: l.startswith(cs))
                     elif line.startswith((b'-', b'+')):
                         cs = (b'-', b'+', b'\\')
                         yield b'hunk', scanwhile(line, lambda l: l.startswith(cs))
                     else:
                         m = lines_re.match(line)
                         if m:
                             yield b'range', m.groups()
                         else:
                             yield b'other', line
             def scangitpatch(lr, firstline):
                 """
                 Git patches can emit:
                 - rename a to b
                 - change b
                 - copy a to c
                 - change c
                 We cannot apply this sequence as-is, the renamed 'a' could not be
                 found for it would have been renamed already. And we cannot copy
                 from 'b' instead because 'b' would have been changed already. So
                 we scan the git patch for copy and rename commands so we can
                 perform the copies ahead of time.
                 """
                 pos = 0
                 try:
                     pos = lr.fp.tell()
                     fp = lr.fp
                 except IOError:
                     fp = stringio(lr.fp.read())
                 gitlr = linereader(fp)
                 gitlr.push(firstline)
                 gitpatches = readgitpatch(gitlr)
                 fp.seek(pos)
                 return gitpatches
             def iterhunks(fp):
                 """Read a patch and yield the following events:
                 - ("file", afile, bfile, firsthunk): select a new target file.
                 - ("hunk", hunk): a new hunk is ready to be applied, follows a
                 "file" event.
                 - ("git", gitchanges): current diff is in git format, gitchanges
                 maps filenames to gitpatch records. Unique event.
                 """
                 afile = b""
                 bfile = b""
                 state = None
                 hunknum = 0
                 emitfile = newfile = False
                 gitpatches = None
                 # our states
                 BFILE = 1
                 context = None
                 lr = linereader(fp)
                 for x in iter(lr.readline, b''):
                     if state == BFILE and (
                         (not context and x.startswith(b'@'))
                         or (context is not False and x.startswith(b'***************'))
                         or x.startswith(b'GIT binary patch')
                     ):
                         gp = None
                         if gitpatches and gitpatches[-1].ispatching(afile, bfile):
                             gp = gitpatches.pop()
                         if x.startswith(b'GIT binary patch'):
                             h = binhunk(lr, gp.path)
                         else:
                             if context is None and x.startswith(b'***************'):
                                 context = True
                             h = hunk(x, hunknum + 1, lr, context)
                         hunknum += 1
                         if emitfile:
                             emitfile = False
                             yield b'file', (afile, bfile, h, gp and gp.copy() or None)
                         yield b'hunk', h
                     elif x.startswith(b'diff --git a/'):
                         m = gitre.match(x.rstrip(b' \r\n'))
                         if not m:
                             continue
                         if gitpatches is None:
                             # scan whole input for git metadata
                             gitpatches = scangitpatch(lr, x)
                             yield b'git', [
                                 g.copy() for g in gitpatches if g.op in (b'COPY', b'RENAME')
                             ]
                             gitpatches.reverse()
                         afile = b'a/' + m.group(1)
                         bfile = b'b/' + m.group(2)
                         while gitpatches and not gitpatches[-1].ispatching(afile, bfile):
                             gp = gitpatches.pop()
                             yield b'file', (
                                 b'a/' + gp.path,
                                 b'b/' + gp.path,
                                 None,
                                 gp.copy(),
                             )
                         if not gitpatches:
                             raise PatchError(
                                 _(b'failed to synchronize metadata for "%s"') % afile[2:]
                             )
                         newfile = True
                     elif x.startswith(b'---'):
                         # check for a unified diff
                         l2 = lr.readline()
                         if not l2.startswith(b'+++'):
                             lr.push(l2)
                             continue
                         newfile = True
                         context = False
                         afile = parsefilename(x)
                         bfile = parsefilename(l2)
                     elif x.startswith(b'***'):
                         # check for a context diff
                         l2 = lr.readline()
                         if not l2.startswith(b'---'):
                             lr.push(l2)
                             continue
                         l3 = lr.readline()
                         lr.push(l3)
                         if not l3.startswith(b"***************"):
                             lr.push(l2)
                             continue
                         newfile = True
                         context = True
                         afile = parsefilename(x)
                         bfile = parsefilename(l2)
                     if newfile:
                         newfile = False
                         emitfile = True
                         state = BFILE
                         hunknum = 0
                 while gitpatches:
                     gp = gitpatches.pop()
                     yield b'file', (b'a/' + gp.path, b'b/' + gp.path, None, gp.copy())
             def applybindelta(binchunk, data):
                 """Apply a binary delta hunk
                 The algorithm used is the algorithm from git's patch-delta.c
                 """
                 def deltahead(binchunk):
                     i = 0
                     for c in pycompat.bytestr(binchunk):
                         i += 1
                         if not (ord(c) & 0x80):
                             return i
                     return i
                 out = b""
                 s = deltahead(binchunk)
                 binchunk = binchunk[s:]
                 s = deltahead(binchunk)
                 binchunk = binchunk[s:]
                 i = 0
                 while i < len(binchunk):
                     cmd = ord(binchunk[i : i + 1])
                     i += 1
                     if cmd & 0x80:
                         offset = 0
                         size = 0
                         if cmd & 0x01:
                             offset = ord(binchunk[i : i + 1])
                             i += 1
                         if cmd & 0x02:
                             offset |= ord(binchunk[i : i + 1]) << 8
                             i += 1
                         if cmd & 0x04:
                             offset |= ord(binchunk[i : i + 1]) << 16
                             i += 1
                         if cmd & 0x08:
                             offset |= ord(binchunk[i : i + 1]) << 24
                             i += 1
                         if cmd & 0x10:
                             size = ord(binchunk[i : i + 1])
                             i += 1
                         if cmd & 0x20:
                             size |= ord(binchunk[i : i + 1]) << 8
                             i += 1
                         if cmd & 0x40:
                             size |= ord(binchunk[i : i + 1]) << 16
                             i += 1
                         if size == 0:
                             size = 0x10000
                         offset_end = offset + size
                         out += data[offset:offset_end]
                     elif cmd != 0:
                         offset_end = i + cmd
                         out += binchunk[i:offset_end]
                         i += cmd
                     else:
                         raise PatchError(_(b'unexpected delta opcode 0'))
                 return out
             def applydiff(ui, fp, backend, store, strip=1, prefix=b'', eolmode=b'strict'):
                 """Reads a patch from fp and tries to apply it.
                 Returns 0 for a clean patch, -1 if any rejects were found and 1 if
                 there was any fuzz.
                 If 'eolmode' is 'strict', the patch content and patched file are
                 read in binary mode. Otherwise, line endings are ignored when
                 patching then normalized according to 'eolmode'.
                 """
                 return _applydiff(
                     ui,
                     fp,
                     patchfile,
                     backend,
                     store,
                     strip=strip,
                     prefix=prefix,
                     eolmode=eolmode,
                 )
             def _canonprefix(repo, prefix):
                 if prefix:
                     prefix = pathutil.canonpath(repo.root, repo.getcwd(), prefix)
                     if prefix != b'':
                         prefix += b'/'
                 return prefix
             def _applydiff(
                 ui, fp, patcher, backend, store, strip=1, prefix=b'', eolmode=b'strict'
             ):
                 prefix = _canonprefix(backend.repo, prefix)
                 def pstrip(p):
                     return pathtransform(p, strip - 1, prefix)[1]
                 rejects = 0
                 err = 0
                 current_file = None
                 for state, values in iterhunks(fp):
                     if state == b'hunk':
                         if not current_file:
                             continue
                         ret = current_file.apply(values)
                         if ret > 0:
                             err = 1
                     elif state == b'file':
                         if current_file:
                             rejects += current_file.close()
                             current_file = None
                         afile, bfile, first_hunk, gp = values
                         if gp:
                             gp.path = pstrip(gp.path)
                             if gp.oldpath:
                                 gp.oldpath = pstrip(gp.oldpath)
                         else:
                             gp = makepatchmeta(
                                 backend, afile, bfile, first_hunk, strip, prefix
                             )
                         if gp.op == b'RENAME':
                             backend.unlink(gp.oldpath)
                         if not first_hunk:
                             if gp.op == b'DELETE':
                                 backend.unlink(gp.path)
                                 continue
                             data, mode = None, None
                             if gp.op in (b'RENAME', b'COPY'):
                                 data, mode = store.getfile(gp.oldpath)[:2]
                                 if data is None:
                                     # This means that the old path does not exist
                                     raise PatchError(
                                         _(b"source file '%s' does not exist") % gp.oldpath
                                     )
                             if gp.mode:
                                 mode = gp.mode
                                 if gp.op == b'ADD':
                                     # Added files without content have no hunk and
                                     # must be created
                                     data = b''
                             if data or mode:
                                 if gp.op in (b'ADD', b'RENAME', b'COPY') and backend.exists(
                                     gp.path
                                 ):
                                     raise PatchError(
                                         _(
                                             b"cannot create %s: destination "
                                             b"already exists"
                                         )
                                         % gp.path
                                     )
                                 backend.setfile(gp.path, data, mode, gp.oldpath)
                             continue
                         try:
                             current_file = patcher(ui, gp, backend, store, eolmode=eolmode)
                         except PatchError as inst:
                             ui.warn(stringutil.forcebytestr(inst) + b'\n')
                             current_file = None
                             rejects += 1
                             continue
                     elif state == b'git':
                         for gp in values:
                             path = pstrip(gp.oldpath)
                             data, mode = backend.getfile(path)
                             if data is None:
                                 # The error ignored here will trigger a getfile()
                                 # error in a place more appropriate for error
                                 # handling, and will not interrupt the patching
                                 # process.
                                 pass
                             else:
                                 store.setfile(path, data, mode)
                     else:
                         raise error.Abort(_(b'unsupported parser state: %s') % state)
                 if current_file:
                     rejects += current_file.close()
                 if rejects:
                     return -1
                 return err
             def _externalpatch(ui, repo, patcher, patchname, strip, files, similarity):
                 """use <patcher> to apply <patchname> to the working directory.
                 returns whether patch was applied with fuzz factor."""
                 fuzz = False
                 args = []
                 cwd = repo.root
                 if cwd:
                     args.append(b'-d %s' % procutil.shellquote(cwd))
                 cmd = b'%s %s -p%d < %s' % (
                     patcher,
                     b' '.join(args),
                     strip,
                     procutil.shellquote(patchname),
                 )
                 ui.debug(b'Using external patch tool: %s\n' % cmd)
                 fp = procutil.popen(cmd, b'rb')
                 try:
                     for line in util.iterfile(fp):
                         line = line.rstrip()
                         ui.note(line + b'\n')
                         if line.startswith(b'patching file '):
                             pf = util.parsepatchoutput(line)
                             printed_file = False
                             files.add(pf)
                         elif line.find(b'with fuzz') >= 0:
                             fuzz = True
                             if not printed_file:
                                 ui.warn(pf + b'\n')
                                 printed_file = True
                             ui.warn(line + b'\n')
                         elif line.find(b'saving rejects to file') >= 0:
                             ui.warn(line + b'\n')
                         elif line.find(b'FAILED') >= 0:
                             if not printed_file:
                                 ui.warn(pf + b'\n')
                                 printed_file = True
                             ui.warn(line + b'\n')
                 finally:
                     if files:
                         scmutil.marktouched(repo, files, similarity)
                 code = fp.close()
                 if code:
                     raise PatchError(
                         _(b"patch command failed: %s") % procutil.explainexit(code)
                     )
                 return fuzz
             def patchbackend(
                 ui, backend, patchobj, strip, prefix, files=None, eolmode=b'strict'
             ):
                 if files is None:
                     files = set()
                 if eolmode is None:
                     eolmode = ui.config(b'patch', b'eol')
                 if eolmode.lower() not in eolmodes:
                     raise error.Abort(_(b'unsupported line endings type: %s') % eolmode)
                 eolmode = eolmode.lower()
                 store = filestore()
                 try:
                     fp = open(patchobj, b'rb')
                 except TypeError:
                     fp = patchobj
                 try:
                     ret = applydiff(
                         ui, fp, backend, store, strip=strip, prefix=prefix, eolmode=eolmode
                     )
                 finally:
                     if fp != patchobj:
                         fp.close()
                     files.update(backend.close())
                     store.close()
                 if ret < 0:
                     raise PatchError(_(b'patch failed to apply'))
                 return ret > 0
             def internalpatch(
                 ui,
                 repo,
                 patchobj,
                 strip,
                 prefix=b'',
                 files=None,
                 eolmode=b'strict',
                 similarity=0,
             ):
                 """use builtin patch to apply <patchobj> to the working directory.
                 returns whether patch was applied with fuzz factor."""
                 backend = workingbackend(ui, repo, similarity)
                 return patchbackend(ui, backend, patchobj, strip, prefix, files, eolmode)
             def patchrepo(
                 ui, repo, ctx, store, patchobj, strip, prefix, files=None, eolmode=b'strict'
             ):
                 backend = repobackend(ui, repo, ctx, store)
                 return patchbackend(ui, backend, patchobj, strip, prefix, files, eolmode)
             def patch(
                 ui,
                 repo,
                 patchname,
                 strip=1,
                 prefix=b'',
                 files=None,
                 eolmode=b'strict',
                 similarity=0,
             ):
                 """Apply <patchname> to the working directory.
                 'eolmode' specifies how end of lines should be handled. It can be:
                 - 'strict': inputs are read in binary mode, EOLs are preserved
                 - 'crlf': EOLs are ignored when patching and reset to CRLF
                 - 'lf': EOLs are ignored when patching and reset to LF
                 - None: get it from user settings, default to 'strict'
                 'eolmode' is ignored when using an external patcher program.
                 Returns whether patch was applied with fuzz factor.
                 """
                 patcher = ui.config(b'ui', b'patch')
                 if files is None:
                     files = set()
                 if patcher:
                     return _externalpatch(
                         ui, repo, patcher, patchname, strip, files, similarity
                     )
                 return internalpatch(
                     ui, repo, patchname, strip, prefix, files, eolmode, similarity
                 )
             def changedfiles(ui, repo, patchpath, strip=1, prefix=b''):
                 backend = fsbackend(ui, repo.root)
                 prefix = _canonprefix(repo, prefix)
                 with open(patchpath, b'rb') as fp:
                     changed = set()
                     for state, values in iterhunks(fp):
                         if state == b'file':
                             afile, bfile, first_hunk, gp = values
                             if gp:
                                 gp.path = pathtransform(gp.path, strip - 1, prefix)[1]
                                 if gp.oldpath:
                                     gp.oldpath = pathtransform(
                                         gp.oldpath, strip - 1, prefix
                                     )[1]
                             else:
                                 gp = makepatchmeta(
                                     backend, afile, bfile, first_hunk, strip, prefix
                                 )
                             changed.add(gp.path)
                             if gp.op == b'RENAME':
                                 changed.add(gp.oldpath)
                         elif state not in (b'hunk', b'git'):
                             raise error.Abort(_(b'unsupported parser state: %s') % state)
                     return changed
             class GitDiffRequired(Exception):
                 pass
             diffopts = diffutil.diffallopts
             diffallopts = diffutil.diffallopts
             difffeatureopts = diffutil.difffeatureopts
             def diff(
                 repo,
                 node1=None,
                 node2=None,
                 match=None,
                 changes=None,
                 opts=None,
                 losedatafn=None,
                 pathfn=None,
                 copy=None,
                 copysourcematch=None,
                 hunksfilterfn=None,
             ):
                 '''yields diff of changes to files between two nodes, or node and
                 working directory.
                 if node1 is None, use first dirstate parent instead.
                 if node2 is None, compare node1 with working directory.
                 losedatafn(**kwarg) is a callable run when opts.upgrade=True and
                 every time some change cannot be represented with the current
                 patch format. Return False to upgrade to git patch format, True to
                 accept the loss or raise an exception to abort the diff. It is
                 called with the name of current file being diffed as 'fn'. If set
                 to None, patches will always be upgraded to git format when
                 necessary.
                 prefix is a filename prefix that is prepended to all filenames on
                 display (used for subrepos).
                 relroot, if not empty, must be normalized with a trailing /. Any match
                 patterns that fall outside it will be ignored.
                 copy, if not empty, should contain mappings {dst@y: src@x} of copy
                 information.
                 if copysourcematch is not None, then copy sources will be filtered by this
                 matcher
                 hunksfilterfn, if not None, should be a function taking a filectx and
                 hunks generator that may yield filtered hunks.
                 '''
                 if not node1 and not node2:
                     node1 = repo.dirstate.p1()
                 ctx1 = repo[node1]
                 ctx2 = repo[node2]
                 for fctx1, fctx2, hdr, hunks in diffhunks(
                     repo,
                     ctx1=ctx1,
                     ctx2=ctx2,
                     match=match,
                     changes=changes,
                     opts=opts,
                     losedatafn=losedatafn,
                     pathfn=pathfn,
                     copy=copy,
                     copysourcematch=copysourcematch,
                 ):
                     if hunksfilterfn is not None:
                         # If the file has been removed, fctx2 is None; but this should
                         # not occur here since we catch removed files early in
                         # logcmdutil.getlinerangerevs() for 'hg log -L'.
                         assert (
                             fctx2 is not None
                         ), b'fctx2 unexpectly None in diff hunks filtering'
                         hunks = hunksfilterfn(fctx2, hunks)
                     text = b''.join(sum((list(hlines) for hrange, hlines in hunks), []))
                     if hdr and (text or len(hdr) > 1):
                         yield b'\n'.join(hdr) + b'\n'
                     if text:
                         yield text
             def diffhunks(
                 repo,
                 ctx1,
                 ctx2,
                 match=None,
                 changes=None,
                 opts=None,
                 losedatafn=None,
                 pathfn=None,
                 copy=None,
                 copysourcematch=None,
             ):
                 """Yield diff of changes to files in the form of (`header`, `hunks`) tuples
                 where `header` is a list of diff headers and `hunks` is an iterable of
                 (`hunkrange`, `hunklines`) tuples.
                 See diff() for the meaning of parameters.
                 """
                 if opts is None:
                     opts = mdiff.defaultopts
                 def lrugetfilectx():
                     cache = {}
                     order = collections.deque()
                     def getfilectx(f, ctx):
                         fctx = ctx.filectx(f, filelog=cache.get(f))
                         if f not in cache:
                             if len(cache) > 20:
                                 del cache[order.popleft()]
                             cache[f] = fctx.filelog()
                         else:
                             order.remove(f)
                         order.append(f)
                         return fctx
                     return getfilectx
                 getfilectx = lrugetfilectx()
                 if not changes:
                     changes = ctx1.status(ctx2, match=match)
                 if isinstance(changes, list):
                     modified, added, removed = changes[:3]
                 else:
                     modified, added, removed = (
                         changes.modified,
                         changes.added,
                         changes.removed,
                     )
                 if not modified and not added and not removed:
                     return []
                 if repo.ui.debugflag:
                     hexfunc = hex
                 else:
                     hexfunc = short
                 revs = [hexfunc(node) for node in [ctx1.node(), ctx2.node()] if node]
                 if copy is None:
                     copy = {}
                     if opts.git or opts.upgrade:
                         copy = copies.pathcopies(ctx1, ctx2, match=match)
                 if copysourcematch:
                     # filter out copies where source side isn't inside the matcher
                     # (copies.pathcopies() already filtered out the destination)
                     copy = {
                         dst: src
                         for dst, src in pycompat.iteritems(copy)
                         if copysourcematch(src)
                     }
                 modifiedset = set(modified)
                 addedset = set(added)
                 removedset = set(removed)
                 for f in modified:
                     if f not in ctx1:
                         # Fix up added, since merged-in additions appear as
                         # modifications during merges
                         modifiedset.remove(f)
                         addedset.add(f)
                 for f in removed:
                     if f not in ctx1:
                         # Merged-in additions that are then removed are reported as removed.
                         # They are not in ctx1, so We don't want to show them in the diff.
                         removedset.remove(f)
                 modified = sorted(modifiedset)
                 added = sorted(addedset)
                 removed = sorted(removedset)
                 for dst, src in list(copy.items()):
                     if src not in ctx1:
                         # Files merged in during a merge and then copied/renamed are
                         # reported as copies. We want to show them in the diff as additions.
                         del copy[dst]
                 prefetchmatch = scmutil.matchfiles(
                     repo, list(modifiedset | addedset | removedset)
                 )
                 scmutil.prefetchfiles(repo, [ctx1.rev(), ctx2.rev()], prefetchmatch)
                 def difffn(opts, losedata):
                     return trydiff(
                         repo,
                         revs,
                         ctx1,
                         ctx2,
                         modified,
                         added,
                         removed,
                         copy,
                         getfilectx,
                         opts,
                         losedata,
                         pathfn,
                     )
                 if opts.upgrade and not opts.git:
                     try:
                         def losedata(fn):
                             if not losedatafn or not losedatafn(fn=fn):
                                 raise GitDiffRequired
                         # Buffer the whole output until we are sure it can be generated
                         return list(difffn(opts.copy(git=False), losedata))
                     except GitDiffRequired:
                         return difffn(opts.copy(git=True), None)
                 else:
                     return difffn(opts, None)
             def diffsinglehunk(hunklines):
                 """yield tokens for a list of lines in a single hunk"""
                 for line in hunklines:
                     # chomp
                     chompline = line.rstrip(b'\r\n')
                     # highlight tabs and trailing whitespace
                     stripline = chompline.rstrip()
                     if line.startswith(b'-'):
                         label = b'diff.deleted'
                     elif line.startswith(b'+'):
                         label = b'diff.inserted'
                     else:
                         raise error.ProgrammingError(b'unexpected hunk line: %s' % line)
                     for token in tabsplitter.findall(stripline):
                         if token.startswith(b'\t'):
                             yield (token, b'diff.tab')
                         else:
                             yield (token, label)
                     if chompline != stripline:
                         yield (chompline[len(stripline) :], b'diff.trailingwhitespace')
                     if chompline != line:
                         yield (line[len(chompline) :], b'')
             def diffsinglehunkinline(hunklines):
                 """yield tokens for a list of lines in a single hunk, with inline colors"""
                 # prepare deleted, and inserted content
                 a = b''
                 b = b''
                 for line in hunklines:
                     if line[0:1] == b'-':
                         a += line[1:]
                     elif line[0:1] == b'+':
                         b += line[1:]
                     else:
                         raise error.ProgrammingError(b'unexpected hunk line: %s' % line)
                 # fast path: if either side is empty, use diffsinglehunk
                 if not a or not b:
                     for t in diffsinglehunk(hunklines):
                         yield t
                     return
                 # re-split the content into words
                 al = wordsplitter.findall(a)
                 bl = wordsplitter.findall(b)
                 # re-arrange the words to lines since the diff algorithm is line-based
                 aln = [s if s == b'\n' else s + b'\n' for s in al]
                 bln = [s if s == b'\n' else s + b'\n' for s in bl]
                 an = b''.join(aln)
                 bn = b''.join(bln)
                 # run the diff algorithm, prepare atokens and btokens
                 atokens = []
                 btokens = []
                 blocks = mdiff.allblocks(an, bn, lines1=aln, lines2=bln)
                 for (a1, a2, b1, b2), btype in blocks:
                     changed = btype == b'!'
                     for token in mdiff.splitnewlines(b''.join(al[a1:a2])):
                         atokens.append((changed, token))
                     for token in mdiff.splitnewlines(b''.join(bl[b1:b2])):
                         btokens.append((changed, token))
                 # yield deleted tokens, then inserted ones
                 for prefix, label, tokens in [
                     (b'-', b'diff.deleted', atokens),
                     (b'+', b'diff.inserted', btokens),
                 ]:
                     nextisnewline = True
                     for changed, token in tokens:
                         if nextisnewline:
                             yield (prefix, label)
                             nextisnewline = False
                         # special handling line end
                         isendofline = token.endswith(b'\n')
                         if isendofline:
                             chomp = token[:-1]  # chomp
                             if chomp.endswith(b'\r'):
                                 chomp = chomp[:-1]
                             endofline = token[len(chomp) :]
                             token = chomp.rstrip()  # detect spaces at the end
                             endspaces = chomp[len(token) :]
                         # scan tabs
                         for maybetab in tabsplitter.findall(token):
                             if b'\t' == maybetab[0:1]:
                                 currentlabel = b'diff.tab'
                             else:
                                 if changed:
                                     currentlabel = label + b'.changed'
                                 else:
                                     currentlabel = label + b'.unchanged'
                             yield (maybetab, currentlabel)
                         if isendofline:
                             if endspaces:
                                 yield (endspaces, b'diff.trailingwhitespace')
                             yield (endofline, b'')
                             nextisnewline = True
             def difflabel(func, *args, **kw):
                 '''yields 2-tuples of (output, label) based on the output of func()'''
                 if kw.get('opts') and kw['opts'].worddiff:
                     dodiffhunk = diffsinglehunkinline
                 else:
                     dodiffhunk = diffsinglehunk
                 headprefixes = [
                     (b'diff', b'diff.diffline'),
                     (b'copy', b'diff.extended'),
                     (b'rename', b'diff.extended'),
                     (b'old', b'diff.extended'),
                     (b'new', b'diff.extended'),
                     (b'deleted', b'diff.extended'),
                     (b'index', b'diff.extended'),
                     (b'similarity', b'diff.extended'),
                     (b'---', b'diff.file_a'),
                     (b'+++', b'diff.file_b'),
                 ]
                 textprefixes = [
                     (b'@', b'diff.hunk'),
                     # - and + are handled by diffsinglehunk
                 ]
                 head = False
                 # buffers a hunk, i.e. adjacent "-", "+" lines without other changes.
                 hunkbuffer = []
                 def consumehunkbuffer():
                     if hunkbuffer:
                         for token in dodiffhunk(hunkbuffer):
                             yield token
                         hunkbuffer[:] = []
                 for chunk in func(*args, **kw):
                     lines = chunk.split(b'\n')
                     linecount = len(lines)
                     for i, line in enumerate(lines):
                         if head:
                             if line.startswith(b'@'):
                                 head = False
                         else:
                             if line and not line.startswith(
                                 (b' ', b'+', b'-', b'@', b'\\')
                             ):
                                 head = True
                         diffline = False
                         if not head and line and line.startswith((b'+', b'-')):
                             diffline = True
                         prefixes = textprefixes
                         if head:
                             prefixes = headprefixes
                         if diffline:
                             # buffered
                             bufferedline = line
                             if i + 1 < linecount:
                                 bufferedline += b"\n"
                             hunkbuffer.append(bufferedline)
                         else:
                             # unbuffered
                             for token in consumehunkbuffer():
                                 yield token
                             stripline = line.rstrip()
                             for prefix, label in prefixes:
                                 if stripline.startswith(prefix):
                                     yield (stripline, label)
                                     if line != stripline:
                                         yield (
                                             line[len(stripline) :],
                                             b'diff.trailingwhitespace',
                                         )
                                     break
                             else:
                                 yield (line, b'')
                             if i + 1 < linecount:
                                 yield (b'\n', b'')
                     for token in consumehunkbuffer():
                         yield token
             def diffui(*args, **kw):
                 '''like diff(), but yields 2-tuples of (output, label) for ui.write()'''
                 return difflabel(diff, *args, **kw)
             def _filepairs(modified, added, removed, copy, opts):
                 '''generates tuples (f1, f2, copyop), where f1 is the name of the file
                 before and f2 is the the name after. For added files, f1 will be None,
                 and for removed files, f2 will be None. copyop may be set to None, 'copy'
                 or 'rename' (the latter two only if opts.git is set).'''
                 gone = set()
                 copyto = dict([(v, k) for k, v in copy.items()])
                 addedset, removedset = set(added), set(removed)
                 for f in sorted(modified + added + removed):
                     copyop = None
                     f1, f2 = f, f
                     if f in addedset:
                         f1 = None
                         if f in copy:
                             if opts.git:
                                 f1 = copy[f]
                                 if f1 in removedset and f1 not in gone:
                                     copyop = b'rename'
                                     gone.add(f1)
                                 else:
                                     copyop = b'copy'
                     elif f in removedset:
                         f2 = None
                         if opts.git:
                             # have we already reported a copy above?
                             if (
                                 f in copyto
                                 and copyto[f] in addedset
                                 and copy[copyto[f]] == f
                             ):
                                 continue
                     yield f1, f2, copyop
             def trydiff(
                 repo,
                 revs,
                 ctx1,
                 ctx2,
                 modified,
                 added,
                 removed,
                 copy,
                 getfilectx,
                 opts,
                 losedatafn,
                 pathfn,
             ):
                 '''given input data, generate a diff and yield it in blocks
                 If generating a diff would lose data like flags or binary data and
                 losedatafn is not None, it will be called.
                 pathfn is applied to every path in the diff output.
                 '''
                 def gitindex(text):
                     if not text:
                         text = b""
                     l = len(text)
                     s = hashutil.sha1(b'blob %d\0' % l)
                     s.update(text)
                     return hex(s.digest())
                 if opts.noprefix:
                     aprefix = bprefix = b''
                 else:
                     aprefix = b'a/'
                     bprefix = b'b/'
                 def diffline(f, revs):
                     revinfo = b' '.join([b"-r %s" % rev for rev in revs])
                     return b'diff %s %s' % (revinfo, f)
                 def isempty(fctx):
                     return fctx is None or fctx.size() == 0
                 date1 = dateutil.datestr(ctx1.date())
                 date2 = dateutil.datestr(ctx2.date())
                 gitmode = {b'l': b'120000', b'x': b'100755', b'': b'100644'}
                 if not pathfn:
                     pathfn = lambda f: f
                 for f1, f2, copyop in _filepairs(modified, added, removed, copy, opts):
                     content1 = None
                     content2 = None
                     fctx1 = None
                     fctx2 = None
                     flag1 = None
                     flag2 = None
                     if f1:
                         fctx1 = getfilectx(f1, ctx1)
                         if opts.git or losedatafn:
                             flag1 = ctx1.flags(f1)
                     if f2:
                         fctx2 = getfilectx(f2, ctx2)
                         if opts.git or losedatafn:
                             flag2 = ctx2.flags(f2)
                     # if binary is True, output "summary" or "base85", but not "text diff"
                     if opts.text:
                         binary = False
                     else:
                         binary = any(f.isbinary() for f in [fctx1, fctx2] if f is not None)
                     if losedatafn and not opts.git:
                         if (
                             binary
                             or
                             # copy/rename
                             f2 in copy
                             or
                             # empty file creation
                             (not f1 and isempty(fctx2))
                             or
                             # empty file deletion
                             (isempty(fctx1) and not f2)
                             or
                             # create with flags
                             (not f1 and flag2)
                             or
                             # change flags
                             (f1 and f2 and flag1 != flag2)
                         ):
                             losedatafn(f2 or f1)
                     path1 = pathfn(f1 or f2)
                     path2 = pathfn(f2 or f1)
                     header = []
                     if opts.git:
                         header.append(
                             b'diff --git %s%s %s%s' % (aprefix, path1, bprefix, path2)
                         )
                         if not f1:  # added
                             header.append(b'new file mode %s' % gitmode[flag2])
                         elif not f2:  # removed
                             header.append(b'deleted file mode %s' % gitmode[flag1])
                         else:  # modified/copied/renamed
                             mode1, mode2 = gitmode[flag1], gitmode[flag2]
                             if mode1 != mode2:
                                 header.append(b'old mode %s' % mode1)
                                 header.append(b'new mode %s' % mode2)
                             if copyop is not None:
                                 if opts.showsimilarity:
                                     sim = similar.score(ctx1[path1], ctx2[path2]) * 100
                                     header.append(b'similarity index %d%%' % sim)
                                 header.append(b'%s from %s' % (copyop, path1))
                                 header.append(b'%s to %s' % (copyop, path2))
                     elif revs:
                         header.append(diffline(path1, revs))
                     #  fctx.is  | diffopts                | what to   | is fctx.data()
                     #  binary() | text nobinary git index | output?   | outputted?
                     # ------------------------------------|----------------------------
                     #  yes      | no   no       no  *     | summary   | no
                     #  yes      | no   no       yes *     | base85    | yes
                     #  yes      | no   yes      no  *     | summary   | no
                     #  yes      | no   yes      yes 0     | summary   | no
                     #  yes      | no   yes      yes >0    | summary   | semi [1]
                     #  yes      | yes  *        *   *     | text diff | yes
                     #  no       | *    *        *   *     | text diff | yes
                     # [1]: hash(fctx.data()) is outputted. so fctx.data() cannot be faked
                     if binary and (
                         not opts.git or (opts.git and opts.nobinary and not opts.index)
                     ):
                         # fast path: no binary content will be displayed, content1 and
                         # content2 are only used for equivalent test. cmp() could have a
                         # fast path.
                         if fctx1 is not None:
                             content1 = b'\0'
                         if fctx2 is not None:
                             if fctx1 is not None and not fctx1.cmp(fctx2):
                                 content2 = b'\0'  # not different
                             else:
                                 content2 = b'\0\0'
                     else:
                         # normal path: load contents
                         if fctx1 is not None:
                             content1 = fctx1.data()
                         if fctx2 is not None:
                             content2 = fctx2.data()
                     if binary and opts.git and not opts.nobinary:
                         text = mdiff.b85diff(content1, content2)
                         if text:
                             header.append(
                                 b'index %s..%s' % (gitindex(content1), gitindex(content2))
                             )
                         hunks = ((None, [text]),)
                     else:
                         if opts.git and opts.index > 0:
                             flag = flag1
                             if flag is None:
                                 flag = flag2
                             header.append(
                                 b'index %s..%s %s'
                                 % (
                                     gitindex(content1)[0 : opts.index],
                                     gitindex(content2)[0 : opts.index],
                                     gitmode[flag],
                                 )
                             )
                         uheaders, hunks = mdiff.unidiff(
                             content1,
                             date1,
                             content2,
                             date2,
                             path1,
                             path2,
                             binary=binary,
                             opts=opts,
                         )
                         header.extend(uheaders)
                     yield fctx1, fctx2, header, hunks
             def diffstatsum(stats):
                 maxfile, maxtotal, addtotal, removetotal, binary = 0, 0, 0, 0, False
                 for f, a, r, b in stats:
                     maxfile = max(maxfile, encoding.colwidth(f))
                     maxtotal = max(maxtotal, a + r)
                     addtotal += a
                     removetotal += r
                     binary = binary or b
                 return maxfile, maxtotal, addtotal, removetotal, binary
             def diffstatdata(lines):
                 diffre = re.compile(br'^diff .*-r [a-z0-9]+\s(.*)$')
                 results = []
                 filename, adds, removes, isbinary = None, 0, 0, False
                 def addresult():
                     if filename:
                         results.append((filename, adds, removes, isbinary))
                 # inheader is used to track if a line is in the
                 # header portion of the diff.  This helps properly account
                 # for lines that start with '--' or '++'
                 inheader = False
                 for line in lines:
                     if line.startswith(b'diff'):
                         addresult()
                         # starting a new file diff
                         # set numbers to 0 and reset inheader
                         inheader = True
                         adds, removes, isbinary = 0, 0, False
                         if line.startswith(b'diff --git a/'):
                             filename = gitre.search(line).group(2)
                         elif line.startswith(b'diff -r'):
                             # format: "diff -r ... -r ... filename"
                             filename = diffre.search(line).group(1)
                     elif line.startswith(b'@@'):
                         inheader = False
                     elif line.startswith(b'+') and not inheader:
                         adds += 1
                     elif line.startswith(b'-') and not inheader:
                         removes += 1
                     elif line.startswith(b'GIT binary patch') or line.startswith(
                         b'Binary file'
                     ):
                         isbinary = True
                     elif line.startswith(b'rename from'):
                         filename = line[12:]
                     elif line.startswith(b'rename to'):
                         filename += b' => %s' % line[10:]
                 addresult()
                 return results
             def diffstat(lines, width=80):
                 output = []
                 stats = diffstatdata(lines)
                 maxname, maxtotal, totaladds, totalremoves, hasbinary = diffstatsum(stats)
                 countwidth = len(str(maxtotal))
                 if hasbinary and countwidth < 3:
                     countwidth = 3
                 graphwidth = width - countwidth - maxname - 6
                 if graphwidth < 10:
                     graphwidth = 10
                 def scale(i):
                     if maxtotal <= graphwidth:
                         return i
                     # If diffstat runs out of room it doesn't print anything,
                     # which isn't very useful, so always print at least one + or -
                     # if there were at least some changes.
                     return max(i * graphwidth // maxtotal, int(bool(i)))
                 for filename, adds, removes, isbinary in stats:
                     if isbinary:
                         count = b'Bin'
                     else:
                         count = b'%d' % (adds + removes)
                     pluses = b'+' * scale(adds)
                     minuses = b'-' * scale(removes)
                     output.append(
                         b' %s%s |  %*s %s%s\n'
                         % (
                             filename,
                             b' ' * (maxname - encoding.colwidth(filename)),
                             countwidth,
                             count,
                             pluses,
                             minuses,
                         )
                     )
                 if stats:
                     output.append(
                         _(b' %d files changed, %d insertions(+), %d deletions(-)\n')
                         % (len(stats), totaladds, totalremoves)
                     )
                 return b''.join(output)
             def diffstatui(*args, **kw):
                 '''like diffstat(), but yields 2-tuples of (output, label) for
                 ui.write()
                 '''
                 for line in diffstat(*args, **kw).splitlines():
                     if line and line[-1] in b'+-':
                         name, graph = line.rsplit(b' ', 1)
                         yield (name + b' ', b'')
                         m = re.search(br'\++', graph)
                         if m:
                             yield (m.group(0), b'diffstat.inserted')
                         m = re.search(br'-+', graph)
                         if m:
                             yield (m.group(0), b'diffstat.deleted')
                     else:
                         yield (line, b'')
                     yield (b'\n', b'')

General Comments 0

Write
Preview

You need to be logged in to leave comments. Login now

No TODOs yet

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages