pycompat: provide 'ispy3' constant...
Yuya Nishihara
r30030:0f6d6fdd default
@@ -1,1621 +1,1622 @@
# bundle2.py - generic container format to transmit arbitrary data.
#
# Copyright 2013 Facebook, Inc.
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.
"""Handling of the new bundle2 format

The goal of bundle2 is to act as an atomic packet to transmit a set of
payloads in an application agnostic way. It consists of a sequence of "parts"
that will be handed to and processed by the application layer.


General format architecture
===========================

The format is structured as follows

- magic string
- stream level parameters
- payload parts (any number)
- end of stream marker.

The binary format
============================

All numbers are unsigned and big-endian.

stream level parameters
------------------------

The binary format is as follows

:params size: int32

  The total number of bytes used by the parameters

:params value: arbitrary number of bytes

  A blob of `params size` bytes containing the serialized version of all
  stream level parameters.

  The blob contains a space separated list of parameters. Parameters with a
  value are stored in the form `<name>=<value>`. Both name and value are
  urlquoted.

  Empty names are forbidden.

  Names MUST start with a letter. If this first letter is lower case, the
  parameter is advisory and can be safely ignored. However, when the first
  letter is capital, the parameter is mandatory and the bundling process MUST
  stop if it is not able to process it.

  Stream parameters use a simple textual format for two main reasons:

  - Stream level parameters should remain simple and we want to discourage
    any crazy usage.
  - Textual data allows easy human inspection of a bundle2 header in case of
    trouble.

  Any application level options MUST go into a bundle2 part instead.

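For illustration, a blob carrying one advisory and one (hypothetical)
mandatory parameter could look like this::

    compression=GZ Checksum=sha1

`compression` starts with a lower case letter, so a reader may safely ignore
it; `Checksum` is capitalized, so a reader that does not know it MUST abort.
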
Payload part
------------------------

The binary format is as follows

:header size: int32

  The total number of bytes used by the part header. When the header is empty
  (size = 0) this is interpreted as the end of stream marker.

:header:

  The header defines how to interpret the part. It contains two pieces of
  data: the part type, and the part parameters.

  The part type is used to route to an application level handler that can
  interpret the payload.

  Part parameters are passed to the application level handler. They are
  meant to convey information that will help the application level object
  interpret the part payload.

  The binary format of the header is as follows

  :typesize: (one byte)

  :parttype: alphanumerical part name (restricted to [a-zA-Z0-9_:-]*)

  :partid: A 32-bit integer (unique in the bundle) that can be used to refer
           to this part.

  :parameters:

    A part's parameters may have arbitrary content, the binary structure is::

        <mandatory-count><advisory-count><param-sizes><param-data>

    :mandatory-count: 1 byte, number of mandatory parameters

    :advisory-count: 1 byte, number of advisory parameters

    :param-sizes:

      N couples of bytes, where N is the total number of parameters. Each
      couple contains (<size-of-key>, <size-of-value>) for one parameter.

    :param-data:

      A blob of bytes from which each parameter key and value can be
      retrieved using the list of size couples stored in the previous
      field.

      Mandatory parameters come first, then the advisory ones.

      Each parameter's key MUST be unique within the part.

:payload:

  payload is a series of `<chunksize><chunkdata>`.

  `chunksize` is an int32, `chunkdata` are plain bytes (as much as
  `chunksize` says). The payload part is concluded by a zero size chunk.

  The current implementation always produces either zero or one chunk.
  This is an implementation limitation that will ultimately be lifted.

  `chunksize` can be negative to trigger special case processing. No such
  processing is in place yet.

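For illustration, a header for a part of type ``output`` carrying a single
mandatory parameter ``in-reply-to=1`` would be laid out as::

    <typesize: 6> "output" <partid: int32>
    <mandatory-count: 1> <advisory-count: 0>
    <size-of-key: 11> <size-of-value: 1>
    "in-reply-to" "1"

(The part id value is hypothetical; it is assigned by the bundler.)
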
Bundle processing
============================

Each part is processed in order using a "part handler". Handlers are
registered for a certain part type.

The matching of a part to its handler is case insensitive. The case of the
part type is used to know if a part is mandatory or advisory. If the part
type contains any uppercase character it is considered mandatory. When no
handler is known for a mandatory part, the process is aborted and an
exception is raised. If the part is advisory and no handler is known, the
part is ignored. When the process is aborted, the full bundle is still read
from the stream to keep the channel usable. But none of the parts read after
an abort are processed. In the future, dropping the stream may become an
option for channels we do not care to preserve.
"""

from __future__ import absolute_import

import errno
import re
import string
import struct
import sys

from .i18n import _
from . import (
    changegroup,
    error,
    obsolete,
    pushkey,
    pycompat,
    tags,
    url,
    util,
)

urlerr = util.urlerr
urlreq = util.urlreq

_pack = struct.pack
_unpack = struct.unpack

_fstreamparamsize = '>i'
_fpartheadersize = '>i'
_fparttypesize = '>B'
_fpartid = '>I'
_fpayloadsize = '>i'
_fpartparamcount = '>BB'

preferedchunksize = 4096

_parttypeforbidden = re.compile('[^a-zA-Z0-9_:-]')

def outdebug(ui, message):
    """debug regarding output stream (bundling)"""
    if ui.configbool('devel', 'bundle2.debug', False):
        ui.debug('bundle2-output: %s\n' % message)

def indebug(ui, message):
    """debug on input stream (unbundling)"""
    if ui.configbool('devel', 'bundle2.debug', False):
        ui.debug('bundle2-input: %s\n' % message)

def validateparttype(parttype):
    """raise ValueError if a parttype contains invalid characters"""
    if _parttypeforbidden.search(parttype):
        raise ValueError(parttype)

def _makefpartparamsizes(nbparams):
    """return a struct format to read part parameter sizes

    The number of parameters is variable so we need to build that format
    dynamically.
    """
    return '>'+('BB'*nbparams)

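# For example, _makefpartparamsizes(2) returns '>BBBB'; a minimal sketch of
# how the size couples of two parameters would then be decoded:
#
#   >>> import struct
#   >>> struct.unpack('>BBBB', '\x02\x03\x01\x00')
#   (2, 3, 1, 0)
#
# i.e. a 2-byte key with a 3-byte value, then a 1-byte key with an empty
# value.
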
parthandlermapping = {}

def parthandler(parttype, params=()):
    """decorator that registers a function as a bundle2 part handler

    eg::

        @parthandler('myparttype', ('mandatory', 'param', 'handled'))
        def myparttypehandler(...):
            '''process a part of type "my part".'''
            ...
    """
    validateparttype(parttype)
    def _decorator(func):
        lparttype = parttype.lower() # enforce lower case matching.
        assert lparttype not in parthandlermapping
        parthandlermapping[lparttype] = func
        func.params = frozenset(params)
        return func
    return _decorator

class unbundlerecords(object):
    """keep record of what happens during an unbundle

    New records are added using `records.add('cat', obj)`, where 'cat' is a
    category of record and obj is an arbitrary object.

    `records['cat']` will return all entries of this category 'cat'.

    Iterating on the object itself will yield `('category', obj)` tuples
    for all entries.

    All iterations happen in chronological order.
    """

    def __init__(self):
        self._categories = {}
        self._sequences = []
        self._replies = {}

    def add(self, category, entry, inreplyto=None):
        """add a new record of a given category.

        The entry can then be retrieved in the list returned by
        self['category']."""
        self._categories.setdefault(category, []).append(entry)
        self._sequences.append((category, entry))
        if inreplyto is not None:
            self.getreplies(inreplyto).add(category, entry)

    def getreplies(self, partid):
        """get the records that are replies to a specific part"""
        return self._replies.setdefault(partid, unbundlerecords())

    def __getitem__(self, cat):
        return tuple(self._categories.get(cat, ()))

    def __iter__(self):
        return iter(self._sequences)

    def __len__(self):
        return len(self._sequences)

    def __nonzero__(self):
        return bool(self._sequences)

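# Illustrative use of the record object (hypothetical category names):
#
#   >>> recs = unbundlerecords()
#   >>> recs.add('changegroup', 'first')
#   >>> recs.add('changegroup', 'second', inreplyto=0)
#   >>> recs['changegroup']
#   ('first', 'second')
#   >>> recs.getreplies(0)['changegroup']
#   ('second',)
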
class bundleoperation(object):
    """an object that represents a single bundling process

    Its purpose is to carry unbundle-related objects and states.

    A new object should be created at the beginning of each bundle processing.
    The object is to be returned by the processing function.

    The object currently has very little content; it will ultimately contain:
    * an access to the repo the bundle is applied to,
    * a ui object,
    * a way to retrieve a transaction to add changes to the repo,
    * a way to record the result of processing each part,
    * a way to construct a bundle response when applicable.
    """

    def __init__(self, repo, transactiongetter, captureoutput=True):
        self.repo = repo
        self.ui = repo.ui
        self.records = unbundlerecords()
        self.gettransaction = transactiongetter
        self.reply = None
        self.captureoutput = captureoutput

class TransactionUnavailable(RuntimeError):
    pass

def _notransaction():
    """default method to get a transaction while processing a bundle

    Raise an exception to highlight the fact that no transaction was expected
    to be created"""
    raise TransactionUnavailable()

def applybundle(repo, unbundler, tr, source=None, url=None, op=None):
    # transform me into unbundler.apply() as soon as the freeze is lifted
    tr.hookargs['bundle2'] = '1'
    if source is not None and 'source' not in tr.hookargs:
        tr.hookargs['source'] = source
    if url is not None and 'url' not in tr.hookargs:
        tr.hookargs['url'] = url
    return processbundle(repo, unbundler, lambda: tr, op=op)

def processbundle(repo, unbundler, transactiongetter=None, op=None):
    """This function processes a bundle, applying its effects to/from a repo

    It iterates over each part then searches for and uses the proper handling
    code to process the part. Parts are processed in order.

    This is a very early version of this function that will be strongly
    reworked before final usage.

    An unknown mandatory part will abort the process.

    It is temporarily possible to provide a prebuilt bundleoperation to the
    function. This is used to ensure output is properly propagated in case of
    an error during the unbundling. This output capturing part will likely be
    reworked and this ability will probably go away in the process.
    """
    if op is None:
        if transactiongetter is None:
            transactiongetter = _notransaction
        op = bundleoperation(repo, transactiongetter)
    # todo:
    # - replace this with an init function soon.
    # - exception catching
    unbundler.params
    if repo.ui.debugflag:
        msg = ['bundle2-input-bundle:']
        if unbundler.params:
            msg.append(' %i params' % len(unbundler.params))
        if op.gettransaction is None:
            msg.append(' no-transaction')
        else:
            msg.append(' with-transaction')
        msg.append('\n')
        repo.ui.debug(''.join(msg))
    iterparts = enumerate(unbundler.iterparts())
    part = None
    nbpart = 0
    try:
        for nbpart, part in iterparts:
            _processpart(op, part)
    except Exception as exc:
        for nbpart, part in iterparts:
            # consume the bundle content
            part.seek(0, 2)
        # Small hack to let caller code distinguish exceptions from bundle2
        # processing from exceptions raised when processing the old format.
        # This is mostly needed to handle different return codes to unbundle
        # according to the type of bundle. We should probably clean up or
        # drop this return code craziness in a future version.
        exc.duringunbundle2 = True
        salvaged = []
        replycaps = None
        if op.reply is not None:
            salvaged = op.reply.salvageoutput()
            replycaps = op.reply.capabilities
        exc._replycaps = replycaps
        exc._bundle2salvagedoutput = salvaged
        raise
    finally:
        repo.ui.debug('bundle2-input-bundle: %i parts total\n' % nbpart)

    return op

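# Typical driving code, sketched under the assumption that 'tr' is an open
# transaction on 'repo' (this mirrors what applybundle() does above):
#
#   op = processbundle(repo, unbundler, transactiongetter=lambda: tr)
#   for record in op.records['changegroup']:
#       pass  # inspect the result of each changegroup part
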
def _processpart(op, part):
    """process a single part from a bundle

    The part is guaranteed to have been fully consumed when the function exits
    (even if an exception is raised)."""
    status = 'unknown' # used by debug output
    hardabort = False
    try:
        try:
            handler = parthandlermapping.get(part.type)
            if handler is None:
                status = 'unsupported-type'
                raise error.BundleUnknownFeatureError(parttype=part.type)
            indebug(op.ui, 'found a handler for part %r' % part.type)
            unknownparams = part.mandatorykeys - handler.params
            if unknownparams:
                unknownparams = list(unknownparams)
                unknownparams.sort()
                status = 'unsupported-params (%s)' % unknownparams
                raise error.BundleUnknownFeatureError(parttype=part.type,
                                                      params=unknownparams)
            status = 'supported'
        except error.BundleUnknownFeatureError as exc:
            if part.mandatory: # mandatory parts
                raise
            indebug(op.ui, 'ignoring unsupported advisory part %s' % exc)
            return # skip to part processing
        finally:
            if op.ui.debugflag:
                msg = ['bundle2-input-part: "%s"' % part.type]
                if not part.mandatory:
                    msg.append(' (advisory)')
                nbmp = len(part.mandatorykeys)
                nbap = len(part.params) - nbmp
                if nbmp or nbap:
                    msg.append(' (params:')
                    if nbmp:
                        msg.append(' %i mandatory' % nbmp)
                    if nbap:
                        msg.append(' %i advisory' % nbap)
                    msg.append(')')
                msg.append(' %s\n' % status)
                op.ui.debug(''.join(msg))

        # handler is called outside the above try block so that we don't
        # risk catching KeyErrors from anything other than the
        # parthandlermapping lookup (any KeyError raised by handler()
        # itself represents a defect of a different variety).
        output = None
        if op.captureoutput and op.reply is not None:
            op.ui.pushbuffer(error=True, subproc=True)
            output = ''
        try:
            handler(op, part)
        finally:
            if output is not None:
                output = op.ui.popbuffer()
            if output:
                outpart = op.reply.newpart('output', data=output,
                                           mandatory=False)
                outpart.addparam('in-reply-to', str(part.id), mandatory=False)
    # If exiting or interrupted, do not attempt to seek the stream in the
    # finally block below. This makes abort faster.
    except (SystemExit, KeyboardInterrupt):
        hardabort = True
        raise
    finally:
        # consume the part content to not corrupt the stream.
        if not hardabort:
            part.seek(0, 2)

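# A complete (hypothetical) part handler, following the contract enforced
# above: 'verses' is declared so the mandatory-parameter check passes, and
# the result is recorded for callers of processbundle():
#
#   @parthandler('test:song', ('verses',))
#   def songhandler(op, part):
#       op.ui.write('%s verses\n' % part.params['verses'])
#       op.records.add('test:song', part.params['verses'])
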

def decodecaps(blob):
    """decode a bundle2 caps bytes blob into a dictionary

    The blob is a list of capabilities (one per line)
    Capabilities may have values using a line of the form::

        capability=value1,value2,value3

    The values are always a list."""
    caps = {}
    for line in blob.splitlines():
        if not line:
            continue
        if '=' not in line:
            key, vals = line, ()
        else:
            key, vals = line.split('=', 1)
            vals = vals.split(',')
        key = urlreq.unquote(key)
        vals = [urlreq.unquote(v) for v in vals]
        caps[key] = vals
    return caps

def encodecaps(caps):
    """encode a bundle2 caps dictionary into a bytes blob"""
    chunks = []
    for ca in sorted(caps):
        vals = caps[ca]
        ca = urlreq.quote(ca)
        vals = [urlreq.quote(v) for v in vals]
        if vals:
            ca = "%s=%s" % (ca, ','.join(vals))
        chunks.append(ca)
    return '\n'.join(chunks)

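# Round-trip sketch of the two helpers above (hypothetical capabilities):
#
#   >>> blob = encodecaps({'HG20': (), 'changegroup': ['01', '02']})
#   >>> blob
#   'HG20\nchangegroup=01,02'
#   >>> decodecaps(blob) == {'HG20': [], 'changegroup': ['01', '02']}
#   True
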
bundletypes = {
    "": ("", None),       # only when using unbundle on ssh and old http servers
                          # since the unification ssh accepts a header but there
                          # is no capability signaling it.
    "HG20": (), # special-cased below
    "HG10UN": ("HG10UN", None),
    "HG10BZ": ("HG10", 'BZ'),
    "HG10GZ": ("HG10GZ", 'GZ'),
}

# hgweb uses this list to communicate its preferred type
bundlepriority = ['HG10GZ', 'HG10BZ', 'HG10UN']

class bundle20(object):
    """represent an outgoing bundle2 container

    Use the `addparam` method to add stream level parameters and `newpart` to
    populate it. Then call `getchunks` to retrieve all the binary chunks of
    data that compose the bundle2 container."""

    _magicstring = 'HG20'

    def __init__(self, ui, capabilities=()):
        self.ui = ui
        self._params = []
        self._parts = []
        self.capabilities = dict(capabilities)
        self._compressor = util.compressors[None]()

    def setcompression(self, alg):
        """setup core part compression to <alg>"""
        if alg is None:
            return
        assert not any(n.lower() == 'compression' for n, v in self._params)
        self.addparam('Compression', alg)
        self._compressor = util.compressors[alg]()

    @property
    def nbparts(self):
        """total number of parts added to the bundler"""
        return len(self._parts)

    # methods used to define the bundle2 content
    def addparam(self, name, value=None):
        """add a stream level parameter"""
        if not name:
            raise ValueError('empty parameter name')
        if name[0] not in string.letters:
            raise ValueError('non letter first character: %r' % name)
        self._params.append((name, value))

    def addpart(self, part):
        """add a new part to the bundle2 container

        Parts contain the actual application payload."""
        assert part.id is None
        part.id = len(self._parts) # very cheap counter
        self._parts.append(part)

    def newpart(self, typeid, *args, **kwargs):
        """create a new part and add it to the container

        The part is directly added to the container. For now, this means
        that any failure to properly initialize the part after calling
        ``newpart`` should result in a failure of the whole bundling process.

        You can still fall back to manually creating and adding one if you
        need better control."""
        part = bundlepart(typeid, *args, **kwargs)
        self.addpart(part)
        return part

    # methods used to generate the bundle2 stream
    def getchunks(self):
        if self.ui.debugflag:
            msg = ['bundle2-output-bundle: "%s",' % self._magicstring]
            if self._params:
                msg.append(' (%i params)' % len(self._params))
            msg.append(' %i parts total\n' % len(self._parts))
            self.ui.debug(''.join(msg))
        outdebug(self.ui, 'start emission of %s stream' % self._magicstring)
        yield self._magicstring
        param = self._paramchunk()
        outdebug(self.ui, 'bundle parameter: %s' % param)
        yield _pack(_fstreamparamsize, len(param))
        if param:
            yield param
        # starting compression
        for chunk in self._getcorechunk():
            yield self._compressor.compress(chunk)
        yield self._compressor.flush()

    def _paramchunk(self):
        """return an encoded version of all stream parameters"""
        blocks = []
        for par, value in self._params:
            par = urlreq.quote(par)
            if value is not None:
                value = urlreq.quote(value)
                par = '%s=%s' % (par, value)
            blocks.append(par)
        return ' '.join(blocks)

    def _getcorechunk(self):
        """yield chunks for the core part of the bundle

        (all but headers and parameters)"""
        outdebug(self.ui, 'start of parts')
        for part in self._parts:
            outdebug(self.ui, 'bundle part: "%s"' % part.type)
            for chunk in part.getchunks(ui=self.ui):
                yield chunk
        outdebug(self.ui, 'end of bundle')
        yield _pack(_fpartheadersize, 0)


    def salvageoutput(self):
        """return a list with a copy of all output parts in the bundle

        This is meant to be used during error handling to make sure we
        preserve server output"""
        salvaged = []
        for part in self._parts:
            if part.type.startswith('output'):
                salvaged.append(part.copy())
        return salvaged

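# Minimal bundling sketch (assumes 'ui' is a Mercurial ui object):
#
#   bundler = bundle20(ui)
#   part = bundler.newpart('output', data='hello', mandatory=False)
#   raw = ''.join(bundler.getchunks())
#
# 'raw' now starts with the 'HG20' magic string and can be fed back to
# getunbundler() below.
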

class unpackermixin(object):
    """A mixin to extract bytes and struct data from a stream"""

    def __init__(self, fp):
        self._fp = fp
        self._seekable = (util.safehasattr(fp, 'seek') and
                          util.safehasattr(fp, 'tell'))

    def _unpack(self, format):
        """unpack this struct format from the stream"""
        data = self._readexact(struct.calcsize(format))
        return _unpack(format, data)

    def _readexact(self, size):
        """read exactly <size> bytes from the stream"""
        return changegroup.readexactly(self._fp, size)

    def seek(self, offset, whence=0):
        """move the underlying file pointer"""
        if self._seekable:
            return self._fp.seek(offset, whence)
        else:
            raise NotImplementedError(_('File pointer is not seekable'))

    def tell(self):
        """return the file offset, or None if file is not seekable"""
        if self._seekable:
            try:
                return self._fp.tell()
            except IOError as e:
                if e.errno == errno.ESPIPE:
                    self._seekable = False
                else:
                    raise
        return None

    def close(self):
        """close underlying file"""
        if util.safehasattr(self._fp, 'close'):
            return self._fp.close()

def getunbundler(ui, fp, magicstring=None):
    """return a valid unbundler object for a given magicstring"""
    if magicstring is None:
        magicstring = changegroup.readexactly(fp, 4)
    magic, version = magicstring[0:2], magicstring[2:4]
    if magic != 'HG':
        raise error.Abort(_('not a Mercurial bundle'))
    unbundlerclass = formatmap.get(version)
    if unbundlerclass is None:
        raise error.Abort(_('unknown bundle version %s') % version)
    unbundler = unbundlerclass(ui, fp)
    indebug(ui, 'start processing of %s stream' % magicstring)
    return unbundler

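# Reading-side sketch (hypothetical file name, 'ui' assumed available):
#
#   fp = open('some-bundle.hg2', 'rb')
#   unbundler = getunbundler(ui, fp)
#   for part in unbundler.iterparts():
#       ui.write('part: %s\n' % part.type)
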
class unbundle20(unpackermixin):
    """interpret a bundle2 stream

    This class is fed with a binary stream and yields parts through its
    `iterparts` method."""

    _magicstring = 'HG20'

    def __init__(self, ui, fp):
        """If header is specified, we do not read it out of the stream."""
        self.ui = ui
        self._decompressor = util.decompressors[None]
        self._compressed = None
        super(unbundle20, self).__init__(fp)

    @util.propertycache
    def params(self):
        """dictionary of stream level parameters"""
        indebug(self.ui, 'reading bundle2 stream parameters')
        params = {}
        paramssize = self._unpack(_fstreamparamsize)[0]
        if paramssize < 0:
            raise error.BundleValueError('negative bundle param size: %i'
                                         % paramssize)
        if paramssize:
            params = self._readexact(paramssize)
            params = self._processallparams(params)
        return params

    def _processallparams(self, paramsblock):
        """process the parameter block and return a dict of all parameters"""
        params = util.sortdict()
        for p in paramsblock.split(' '):
            p = p.split('=', 1)
            p = [urlreq.unquote(i) for i in p]
            if len(p) < 2:
                p.append(None)
            self._processparam(*p)
            params[p[0]] = p[1]
        return params


    def _processparam(self, name, value):
        """process a parameter, applying its effect if needed

        Parameters starting with a lower case letter are advisory and will be
        ignored when unknown. For those starting with an upper case letter,
        this function will raise a BundleUnknownFeatureError when unknown.

        Note: no options are currently supported. Any input will either be
        ignored or fail.
        """
        if not name:
            raise ValueError('empty parameter name')
        if name[0] not in string.letters:
            raise ValueError('non letter first character: %r' % name)
        try:
            handler = b2streamparamsmap[name.lower()]
        except KeyError:
            if name[0].islower():
                indebug(self.ui, "ignoring unknown parameter %r" % name)
            else:
                raise error.BundleUnknownFeatureError(params=(name,))
        else:
            handler(self, name, value)

    def _forwardchunks(self):
        """utility to transfer a bundle2 as binary

        This is made necessary by the fact that the 'getbundle' command over
        'ssh' has no way to know when the reply ends, relying on the bundle
        being interpreted to find its end. This is terrible and we are sorry,
        but we needed to move forward to get general delta enabled.
        """
        yield self._magicstring
        assert 'params' not in vars(self)
        paramssize = self._unpack(_fstreamparamsize)[0]
        if paramssize < 0:
            raise error.BundleValueError('negative bundle param size: %i'
                                         % paramssize)
        yield _pack(_fstreamparamsize, paramssize)
        if paramssize:
            params = self._readexact(paramssize)
            self._processallparams(params)
            yield params
        assert self._decompressor is util.decompressors[None]
        # From there, payload might need to be decompressed
        self._fp = self._decompressor(self._fp)
        emptycount = 0
        while emptycount < 2:
            # so we can brainlessly loop
            assert _fpartheadersize == _fpayloadsize
            size = self._unpack(_fpartheadersize)[0]
            yield _pack(_fpartheadersize, size)
            if size:
                emptycount = 0
            else:
                emptycount += 1
                continue
            if size == flaginterrupt:
                continue
            elif size < 0:
                raise error.BundleValueError('negative chunk size: %i' % size)
            yield self._readexact(size)


    def iterparts(self):
        """yield all parts contained in the stream"""
        # make sure params have been loaded
        self.params
        # From there, the payload needs to be decompressed
        self._fp = self._decompressor(self._fp)
        indebug(self.ui, 'start extraction of bundle2 parts')
        headerblock = self._readpartheader()
        while headerblock is not None:
            part = unbundlepart(self.ui, headerblock, self._fp)
            yield part
            part.seek(0, 2)
            headerblock = self._readpartheader()
        indebug(self.ui, 'end of bundle2 stream')

    def _readpartheader(self):
        """reads a part header size and return the bytes blob

        returns None if empty"""
        headersize = self._unpack(_fpartheadersize)[0]
        if headersize < 0:
            raise error.BundleValueError('negative part header size: %i'
                                         % headersize)
        indebug(self.ui, 'part header size: %i' % headersize)
        if headersize:
            return self._readexact(headersize)
        return None

    def compressed(self):
        self.params # load params
        return self._compressed

formatmap = {'20': unbundle20}

b2streamparamsmap = {}

def b2streamparamhandler(name):
    """register a handler for a stream level parameter"""
    def decorator(func):
        assert name not in b2streamparamsmap
        b2streamparamsmap[name] = func
        return func
    return decorator

@b2streamparamhandler('compression')
def processcompression(unbundler, param, value):
    """read compression parameter and install payload decompression"""
    if value not in util.decompressors:
        raise error.BundleUnknownFeatureError(params=(param,),
                                              values=(value,))
    unbundler._decompressor = util.decompressors[value]
    if value is not None:
        unbundler._compressed = True

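# A hypothetical handler for an additional advisory stream parameter would
# follow the same pattern as processcompression above:
#
#   @b2streamparamhandler('origin')
#   def processorigin(unbundler, param, value):
#       indebug(unbundler.ui, 'bundle originates from %s' % value)
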

class bundlepart(object):
    """A bundle2 part contains application level payload

    The part `type` is used to route the part to the application level
    handler.

    The part payload is contained in ``part.data``. It could be raw bytes or a
    generator of byte chunks.

    You can add parameters to the part using the ``addparam`` method.
    Parameters can be either mandatory (default) or advisory. Remote side
    should be able to safely ignore the advisory ones.

    Neither data nor parameters can be modified after generation has begun.
    """

    def __init__(self, parttype, mandatoryparams=(), advisoryparams=(),
                 data='', mandatory=True):
        validateparttype(parttype)
        self.id = None
        self.type = parttype
        self._data = data
        self._mandatoryparams = list(mandatoryparams)
        self._advisoryparams = list(advisoryparams)
        # checking for duplicated entries
        self._seenparams = set()
        for pname, __ in self._mandatoryparams + self._advisoryparams:
            if pname in self._seenparams:
                raise RuntimeError('duplicated params: %s' % pname)
            self._seenparams.add(pname)
        # status of the part's generation:
        # - None: not started,
        # - False: currently generated,
        # - True: generation done.
        self._generated = None
        self.mandatory = mandatory

    def copy(self):
        """return a copy of the part

        The new part has the very same content but no partid assigned yet.
        Parts with generated data cannot be copied."""
        assert not util.safehasattr(self.data, 'next')
        return self.__class__(self.type, self._mandatoryparams,
                              self._advisoryparams, self._data, self.mandatory)

    # methods used to define the part content
    @property
    def data(self):
        return self._data

    @data.setter
    def data(self, data):
        if self._generated is not None:
            raise error.ReadOnlyPartError('part is being generated')
        self._data = data

    @property
    def mandatoryparams(self):
        # make it an immutable tuple to force people through ``addparam``
        return tuple(self._mandatoryparams)

    @property
    def advisoryparams(self):
        # make it an immutable tuple to force people through ``addparam``
        return tuple(self._advisoryparams)

    def addparam(self, name, value='', mandatory=True):
        if self._generated is not None:
            raise error.ReadOnlyPartError('part is being generated')
        if name in self._seenparams:
            raise ValueError('duplicated params: %s' % name)
        self._seenparams.add(name)
        params = self._advisoryparams
        if mandatory:
            params = self._mandatoryparams
        params.append((name, value))

907
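    # A minimal usage sketch (illustrative only, not executed anywhere; the
    # 'output' part type and the sink function are assumptions made for the
    # example):
    #
    #   part = bundlepart('output', data='hello')
    #   part.addparam('verbosity', 'debug', mandatory=False)
    #   for chunk in part.getchunks(ui=ui):
    #       sendtopeer(chunk)
    #
    # Once getchunks() has started, assigning to ``data`` or calling
    # ``addparam`` raises error.ReadOnlyPartError, per the guards above.
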
    # methods used to generate the bundle2 stream
    def getchunks(self, ui):
        if self._generated is not None:
            raise RuntimeError('part can only be consumed once')
        self._generated = False

        if ui.debugflag:
            msg = ['bundle2-output-part: "%s"' % self.type]
            if not self.mandatory:
                msg.append(' (advisory)')
            nbmp = len(self.mandatoryparams)
            nbap = len(self.advisoryparams)
            if nbmp or nbap:
                msg.append(' (params:')
                if nbmp:
                    msg.append(' %i mandatory' % nbmp)
                if nbap:
                    msg.append(' %i advisory' % nbap)
                msg.append(')')
            if not self.data:
                msg.append(' empty payload')
            elif util.safehasattr(self.data, 'next'):
                msg.append(' streamed payload')
            else:
                msg.append(' %i bytes payload' % len(self.data))
            msg.append('\n')
            ui.debug(''.join(msg))

        #### header
        if self.mandatory:
            parttype = self.type.upper()
        else:
            parttype = self.type.lower()
        outdebug(ui, 'part %s: "%s"' % (self.id, parttype))
        ## parttype
        header = [_pack(_fparttypesize, len(parttype)),
                  parttype, _pack(_fpartid, self.id),
                  ]
        ## parameters
        # count
        manpar = self.mandatoryparams
        advpar = self.advisoryparams
        header.append(_pack(_fpartparamcount, len(manpar), len(advpar)))
        # size
        parsizes = []
        for key, value in manpar:
            parsizes.append(len(key))
            parsizes.append(len(value))
        for key, value in advpar:
            parsizes.append(len(key))
            parsizes.append(len(value))
        paramsizes = _pack(_makefpartparamsizes(len(parsizes) // 2), *parsizes)
        header.append(paramsizes)
        # key, value
        for key, value in manpar:
            header.append(key)
            header.append(value)
        for key, value in advpar:
            header.append(key)
            header.append(value)
        ## finalize header
        headerchunk = ''.join(header)
        outdebug(ui, 'header chunk size: %i' % len(headerchunk))
        yield _pack(_fpartheadersize, len(headerchunk))
        yield headerchunk
        ## payload
        try:
            for chunk in self._payloadchunks():
                outdebug(ui, 'payload chunk size: %i' % len(chunk))
                yield _pack(_fpayloadsize, len(chunk))
                yield chunk
        except GeneratorExit:
            # GeneratorExit means that nobody is listening for our
            # results anyway, so just bail quickly rather than trying
            # to produce an error part.
            ui.debug('bundle2-generatorexit\n')
            raise
        except BaseException as exc:
            # backup exception data for later
            ui.debug('bundle2-input-stream-interrupt: encoding exception %s'
                     % exc)
            exc_info = sys.exc_info()
            msg = 'unexpected error: %s' % exc
            interpart = bundlepart('error:abort', [('message', msg)],
                                   mandatory=False)
            interpart.id = 0
            yield _pack(_fpayloadsize, -1)
            for chunk in interpart.getchunks(ui=ui):
                yield chunk
            outdebug(ui, 'closing payload chunk')
            # abort current part payload
            yield _pack(_fpayloadsize, 0)
            if pycompat.ispy3:
                raise exc_info[0](exc_info[1]).with_traceback(exc_info[2])
            else:
                exec("""raise exc_info[0], exc_info[1], exc_info[2]""")
        # end of payload
        outdebug(ui, 'closing payload chunk')
        yield _pack(_fpayloadsize, 0)
        self._generated = True
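    # For reference, the byte layout emitted above is (a sketch, with field
    # widths given by the struct formats defined earlier in this module):
    #
    #   header size | type size | type | part id | #mandatory #advisory
    #   | param sizes | param keys and values
    #
    # followed by zero or more (chunk size, chunk) payload records and a
    # terminating zero-size chunk; a negative size flags an interruption.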

    def _payloadchunks(self):
        """yield chunks of the part payload

        Exists to handle the different methods to provide data to a part."""
        # we only support fixed size data now.
        # This will be improved in the future.
        if util.safehasattr(self.data, 'next'):
            buff = util.chunkbuffer(self.data)
            chunk = buff.read(preferedchunksize)
            while chunk:
                yield chunk
                chunk = buff.read(preferedchunksize)
        elif len(self.data):
            yield self.data


flaginterrupt = -1

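# On the wire the interrupt marker is just a payload-size field equal to
# flaginterrupt. For instance, assuming the usual '>i' value of _fpayloadsize
# defined earlier in this file, a producer emits:
#
#   struct.pack('>i', -1)   # == '\xff\xff\xff\xff'
#
# then a complete out-of-band part, before resuming or aborting the
# interrupted payload. The consumer side reacts in
# unbundlepart._payloadchunks by invoking the handler below.
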
class interrupthandler(unpackermixin):
    """read one part and process it with restricted capability

    This allows an exception raised on the producer side during part
    iteration to be transmitted while the consumer is reading a part.

    Parts processed in this manner only have access to a ui object."""

    def __init__(self, ui, fp):
        super(interrupthandler, self).__init__(fp)
        self.ui = ui

    def _readpartheader(self):
        """read a part header size and return the bytes blob

        returns None if empty"""
        headersize = self._unpack(_fpartheadersize)[0]
        if headersize < 0:
            raise error.BundleValueError('negative part header size: %i'
                                         % headersize)
        indebug(self.ui, 'part header size: %i\n' % headersize)
        if headersize:
            return self._readexact(headersize)
        return None

    def __call__(self):
        self.ui.debug('bundle2-input-stream-interrupt:'
                      ' opening out of band context\n')
        indebug(self.ui, 'bundle2 stream interruption, looking for a part.')
        headerblock = self._readpartheader()
        if headerblock is None:
            indebug(self.ui, 'no part found during interruption.')
            return
        part = unbundlepart(self.ui, headerblock, self._fp)
        op = interruptoperation(self.ui)
        _processpart(op, part)
        self.ui.debug('bundle2-input-stream-interrupt:'
                      ' closing out of band context\n')

class interruptoperation(object):
    """A limited operation to be used by part handlers during interruption

    It only has access to a ui object.
    """

    def __init__(self, ui):
        self.ui = ui
        self.reply = None
        self.captureoutput = False

    @property
    def repo(self):
        raise RuntimeError('no repo access from stream interruption')

    def gettransaction(self):
        raise TransactionUnavailable('no repo access from stream interruption')

class unbundlepart(unpackermixin):
    """a bundle part read from a bundle"""

    def __init__(self, ui, header, fp):
        super(unbundlepart, self).__init__(fp)
        self.ui = ui
        # unbundle state attr
        self._headerdata = header
        self._headeroffset = 0
        self._initialized = False
        self.consumed = False
        # part data
        self.id = None
        self.type = None
        self.mandatoryparams = None
        self.advisoryparams = None
        self.params = None
        self.mandatorykeys = ()
        self._payloadstream = None
        self._readheader()
        self._mandatory = None
        self._chunkindex = [] # (payload, file) position tuples for chunk starts
        self._pos = 0

    def _fromheader(self, size):
        """return the next <size> bytes from the header"""
        offset = self._headeroffset
        data = self._headerdata[offset:(offset + size)]
        self._headeroffset = offset + size
        return data

    def _unpackheader(self, format):
        """read given format from header

        This automatically computes the size of the format to read."""
        data = self._fromheader(struct.calcsize(format))
        return _unpack(format, data)

    def _initparams(self, mandatoryparams, advisoryparams):
        """internal function to set up all logic related parameters"""
        # make it read only to prevent people touching it by mistake.
        self.mandatoryparams = tuple(mandatoryparams)
        self.advisoryparams = tuple(advisoryparams)
        # user friendly UI
        self.params = util.sortdict(self.mandatoryparams)
        self.params.update(self.advisoryparams)
        self.mandatorykeys = frozenset(p[0] for p in mandatoryparams)

    def _payloadchunks(self, chunknum=0):
        '''seek to specified chunk and start yielding data'''
        if len(self._chunkindex) == 0:
            assert chunknum == 0, 'Must start with chunk 0'
            self._chunkindex.append((0, super(unbundlepart, self).tell()))
        else:
            assert chunknum < len(self._chunkindex), \
                   'Unknown chunk %d' % chunknum
            super(unbundlepart, self).seek(self._chunkindex[chunknum][1])

        pos = self._chunkindex[chunknum][0]
        payloadsize = self._unpack(_fpayloadsize)[0]
        indebug(self.ui, 'payload chunk size: %i' % payloadsize)
        while payloadsize:
            if payloadsize == flaginterrupt:
                # interruption detection, the handler will now read a
                # single part and process it.
                interrupthandler(self.ui, self._fp)()
            elif payloadsize < 0:
                msg = 'negative payload chunk size: %i' % payloadsize
                raise error.BundleValueError(msg)
            else:
                result = self._readexact(payloadsize)
                chunknum += 1
                pos += payloadsize
                if chunknum == len(self._chunkindex):
                    self._chunkindex.append((pos,
                                             super(unbundlepart, self).tell()))
                yield result
            payloadsize = self._unpack(_fpayloadsize)[0]
            indebug(self.ui, 'payload chunk size: %i' % payloadsize)

    def _findchunk(self, pos):
        '''for a given payload position, return a chunk number and offset'''
        for chunk, (ppos, fpos) in enumerate(self._chunkindex):
            if ppos == pos:
                return chunk, 0
            elif ppos > pos:
                return chunk - 1, pos - self._chunkindex[chunk - 1][0]
        raise ValueError('Unknown chunk')

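    # Bookkeeping sketch: after two 10-byte chunks have been read, and with a
    # 4-byte '>i' size prefix per chunk (an assumption matching _fpayloadsize),
    # self._chunkindex would be
    #
    #   [(0, h), (10, h + 14), (20, h + 28)]
    #
    # where h is the file offset of the first size prefix. _findchunk(15)
    # then returns (1, 5): payload position 15 lives in chunk 1, offset 5.
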
    def _readheader(self):
        """read the header and setup the object"""
        typesize = self._unpackheader(_fparttypesize)[0]
        self.type = self._fromheader(typesize)
        indebug(self.ui, 'part type: "%s"' % self.type)
        self.id = self._unpackheader(_fpartid)[0]
        indebug(self.ui, 'part id: "%s"' % self.id)
        # extract mandatory bit from type
        self.mandatory = (self.type != self.type.lower())
        self.type = self.type.lower()
        ## reading parameters
        # param count
        mancount, advcount = self._unpackheader(_fpartparamcount)
        indebug(self.ui, 'part parameters: %i' % (mancount + advcount))
        # param size
        fparamsizes = _makefpartparamsizes(mancount + advcount)
        paramsizes = self._unpackheader(fparamsizes)
        # make it a list of pairs again
        paramsizes = zip(paramsizes[::2], paramsizes[1::2])
        # split mandatory from advisory
        mansizes = paramsizes[:mancount]
        advsizes = paramsizes[mancount:]
        # retrieve param value
        manparams = []
        for key, value in mansizes:
            manparams.append((self._fromheader(key), self._fromheader(value)))
        advparams = []
        for key, value in advsizes:
            advparams.append((self._fromheader(key), self._fromheader(value)))
        self._initparams(manparams, advparams)
        ## part payload
        self._payloadstream = util.chunkbuffer(self._payloadchunks())
        # we have read the header data, record it
        self._initialized = True

    def read(self, size=None):
        """read payload data"""
        if not self._initialized:
            self._readheader()
        if size is None:
            data = self._payloadstream.read()
        else:
            data = self._payloadstream.read(size)
        self._pos += len(data)
        if size is None or len(data) < size:
            if not self.consumed and self._pos:
                self.ui.debug('bundle2-input-part: total payload size %i\n'
                              % self._pos)
            self.consumed = True
        return data

    def tell(self):
        return self._pos

    def seek(self, offset, whence=0):
        if whence == 0:
            newpos = offset
        elif whence == 1:
            newpos = self._pos + offset
        elif whence == 2:
            if not self.consumed:
                self.read()
            newpos = self._chunkindex[-1][0] - offset
        else:
            raise ValueError('Unknown whence value: %r' % (whence,))

        if newpos > self._chunkindex[-1][0] and not self.consumed:
            self.read()
        if not 0 <= newpos <= self._chunkindex[-1][0]:
            raise ValueError('Offset out of range')

        if self._pos != newpos:
            chunk, internaloffset = self._findchunk(newpos)
            self._payloadstream = util.chunkbuffer(self._payloadchunks(chunk))
            adjust = self.read(internaloffset)
            if len(adjust) != internaloffset:
                raise error.Abort(_('seek failed'))
            self._pos = newpos

# These are only the static capabilities.
# Check the 'getrepocaps' function for the rest.
capabilities = {'HG20': (),
                'error': ('abort', 'unsupportedcontent', 'pushraced',
                          'pushkey'),
                'listkeys': (),
                'pushkey': (),
                'digests': tuple(sorted(util.DIGESTS.keys())),
                'remote-changegroup': ('http', 'https'),
                'hgtagsfnodes': (),
               }

def getrepocaps(repo, allowpushback=False):
    """return the bundle2 capabilities for a given repo

    Exists to allow extensions (like evolution) to mutate the capabilities.
    """
    caps = capabilities.copy()
    caps['changegroup'] = tuple(sorted(
        changegroup.supportedincomingversions(repo)))
    if obsolete.isenabled(repo, obsolete.exchangeopt):
        supportedformat = tuple('V%i' % v for v in obsolete.formats)
        caps['obsmarkers'] = supportedformat
    if allowpushback:
        caps['pushback'] = ()
    return caps

def bundle2caps(remote):
    """return the bundle capabilities of a peer as a dict"""
    raw = remote.capable('bundle2')
    if not raw and raw != '':
        return {}
    capsblob = urlreq.unquote(remote.capable('bundle2'))
    return decodecaps(capsblob)

def obsmarkersversion(caps):
    """extract the list of supported obsmarkers versions from a bundle2caps
    dict"""
    obscaps = caps.get('obsmarkers', ())
    return [int(c[1:]) for c in obscaps if c.startswith('V')]

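# A hedged example of what a decoded capabilities dict can look like and how
# the helper above consumes it (the values are illustrative, not canonical):
def _obsmarkerscapsexample():
    caps = {'HG20': (),
            'changegroup': ('01', '02'),
            'obsmarkers': ('V0', 'V1')}
    assert obsmarkersversion(caps) == [0, 1]
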
def writebundle(ui, cg, filename, bundletype, vfs=None, compression=None):
    """Write a bundle file and return its filename.

    Existing files will not be overwritten.
    If no filename is specified, a temporary file is created.
    bz2 compression can be turned off.
    The bundle file will be deleted in case of errors.
    """

    if bundletype == "HG20":
        bundle = bundle20(ui)
        bundle.setcompression(compression)
        part = bundle.newpart('changegroup', data=cg.getchunks())
        part.addparam('version', cg.version)
        if 'clcount' in cg.extras:
            part.addparam('nbchanges', str(cg.extras['clcount']),
                          mandatory=False)
        chunkiter = bundle.getchunks()
    else:
        # compression argument is only for the bundle2 case
        assert compression is None
        if cg.version != '01':
            raise error.Abort(_('old bundle types only support v1 '
                                'changegroups'))
        header, comp = bundletypes[bundletype]
        if comp not in util.compressors:
            raise error.Abort(_('unknown stream compression type: %s')
                              % comp)
        z = util.compressors[comp]()
        subchunkiter = cg.getchunks()
        def chunkiter():
            yield header
            for chunk in subchunkiter:
                yield z.compress(chunk)
            yield z.flush()
        chunkiter = chunkiter()

    # parse the changegroup data, otherwise we will block
    # in case of sshrepo because we don't know the end of the stream
    return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)

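# Example call (a sketch; 'cg' stands for an existing changegroup object and
# the filename and compression are assumptions, not the only valid values):
#
#   fname = writebundle(ui, cg, 'backup.hg', 'HG20', compression='BZ')
#
# For the old bundle types, `bundletypes` maps a name such as 'HG10BZ' to a
# (header, compression) pair, and `compression` must stay None.
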
@parthandler('changegroup', ('version', 'nbchanges', 'treemanifest'))
def handlechangegroup(op, inpart):
    """apply a changegroup part on the repo

    This is a very early implementation that will see massive rework before
    being inflicted on any end-user.
    """
    # Make sure we trigger a transaction creation
    #
    # The addchangegroup function will get a transaction object by itself, but
    # we need to make sure we trigger the creation of a transaction object used
    # for the whole processing scope.
    op.gettransaction()
    unpackerversion = inpart.params.get('version', '01')
    # We should raise an appropriate exception here
    cg = changegroup.getunbundler(unpackerversion, inpart, None)
    # the source and url passed here are overwritten by the ones contained in
    # the transaction.hookargs argument. So 'bundle2' is a placeholder
    nbchangesets = None
    if 'nbchanges' in inpart.params:
        nbchangesets = int(inpart.params.get('nbchanges'))
    if ('treemanifest' in inpart.params and
        'treemanifest' not in op.repo.requirements):
        if len(op.repo.changelog) != 0:
            raise error.Abort(_(
                "bundle contains tree manifests, but local repo is "
                "non-empty and does not use tree manifests"))
        op.repo.requirements.add('treemanifest')
        op.repo._applyopenerreqs()
        op.repo._writerequirements()
    ret = cg.apply(op.repo, 'bundle2', 'bundle2', expectedtotal=nbchangesets)
    op.records.add('changegroup', {'return': ret})
    if op.reply is not None:
        # This is definitely not the final form of this
        # return. But one needs to start somewhere.
        part = op.reply.newpart('reply:changegroup', mandatory=False)
        part.addparam('in-reply-to', str(inpart.id), mandatory=False)
        part.addparam('return', '%i' % ret, mandatory=False)
    assert not inpart.read()

_remotechangegroupparams = tuple(['url', 'size', 'digests'] +
    ['digest:%s' % k for k in util.DIGESTS.keys()])
@parthandler('remote-changegroup', _remotechangegroupparams)
def handleremotechangegroup(op, inpart):
    """apply a bundle10 on the repo, given a url and validation information

    All the information about the remote bundle to import is given as
    parameters. The parameters include:
      - url: the url to the bundle10.
      - size: the bundle10 file size. It is used to validate that what was
        retrieved by the client matches the server knowledge about the bundle.
      - digests: a space separated list of the digest types provided as
        parameters.
      - digest:<digest-type>: the hexadecimal representation of the digest with
        that name. Like the size, it is used to validate that what was
        retrieved by the client matches what the server knows about the bundle.

    When multiple digest types are given, all of them are checked.
    """
    try:
        raw_url = inpart.params['url']
    except KeyError:
        raise error.Abort(_('remote-changegroup: missing "%s" param') % 'url')
    parsed_url = util.url(raw_url)
    if parsed_url.scheme not in capabilities['remote-changegroup']:
        raise error.Abort(_('remote-changegroup does not support %s urls') %
                          parsed_url.scheme)

    try:
        size = int(inpart.params['size'])
    except ValueError:
        raise error.Abort(_('remote-changegroup: invalid value for param "%s"')
                          % 'size')
    except KeyError:
        raise error.Abort(_('remote-changegroup: missing "%s" param') % 'size')

    digests = {}
    for typ in inpart.params.get('digests', '').split():
        param = 'digest:%s' % typ
        try:
            value = inpart.params[param]
        except KeyError:
            raise error.Abort(_('remote-changegroup: missing "%s" param') %
                              param)
        digests[typ] = value

    real_part = util.digestchecker(url.open(op.ui, raw_url), size, digests)

    # Make sure we trigger a transaction creation
    #
    # The addchangegroup function will get a transaction object by itself, but
    # we need to make sure we trigger the creation of a transaction object used
    # for the whole processing scope.
    op.gettransaction()
    from . import exchange
    cg = exchange.readbundle(op.repo.ui, real_part, raw_url)
    if not isinstance(cg, changegroup.cg1unpacker):
        raise error.Abort(_('%s: not a bundle version 1.0') %
                          util.hidepassword(raw_url))
    ret = cg.apply(op.repo, 'bundle2', 'bundle2')
    op.records.add('changegroup', {'return': ret})
    if op.reply is not None:
        # This is definitely not the final form of this
        # return. But one needs to start somewhere.
        part = op.reply.newpart('reply:changegroup')
        part.addparam('in-reply-to', str(inpart.id), mandatory=False)
        part.addparam('return', '%i' % ret, mandatory=False)
    try:
        real_part.validate()
    except error.Abort as e:
        raise error.Abort(_('bundle at %s is corrupted:\n%s') %
                          (util.hidepassword(raw_url), str(e)))
    assert not inpart.read()

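# Parameter encoding sketch for the part above (the values are made up for
# illustration; digest names must match entries in util.DIGESTS):
#
#   url=https://hg.example.com/bundle.hg
#   size=12345
#   digests=sha1
#   digest:sha1=da39a3ee5e6b4b0d3255bfef95601890afd80709
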
@parthandler('reply:changegroup', ('return', 'in-reply-to'))
def handlereplychangegroup(op, inpart):
    ret = int(inpart.params['return'])
    replyto = int(inpart.params['in-reply-to'])
    op.records.add('changegroup', {'return': ret}, replyto)

@parthandler('check:heads')
def handlecheckheads(op, inpart):
    """check that the heads of the repo did not change

    This is used to detect a push race when using unbundle.
    This replaces the "heads" argument of unbundle."""
    h = inpart.read(20)
    heads = []
    while len(h) == 20:
        heads.append(h)
        h = inpart.read(20)
    assert not h
    # Trigger a transaction so that we are guaranteed to have the lock now.
    if op.ui.configbool('experimental', 'bundle2lazylocking'):
        op.gettransaction()
    if sorted(heads) != sorted(op.repo.heads()):
        raise error.PushRaced('repository changed while pushing - '
                              'please try again')

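# Client-side counterpart (a sketch mirroring the read loop above): the
# payload is simply the repo's 20-byte binary heads, concatenated.
def _encodecheckheads(heads):
    """pack an iterable of 20-byte head nodes into a check:heads payload"""
    assert all(len(h) == 20 for h in heads)
    return ''.join(heads)
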
@parthandler('output')
def handleoutput(op, inpart):
    """forward output captured on the server to the client"""
    for line in inpart.read().splitlines():
        op.ui.status(_('remote: %s\n') % line)

@parthandler('replycaps')
def handlereplycaps(op, inpart):
    """Notify that a reply bundle should be created

    The payload contains the capabilities information for the reply"""
    caps = decodecaps(inpart.read())
    if op.reply is None:
        op.reply = bundle20(op.ui, caps)

class AbortFromPart(error.Abort):
    """Sub-class of Abort that denotes an error from a bundle2 part."""

@parthandler('error:abort', ('message', 'hint'))
def handleerrorabort(op, inpart):
    """Used to transmit abort error over the wire"""
    raise AbortFromPart(inpart.params['message'],
                        hint=inpart.params.get('hint'))

@parthandler('error:pushkey', ('namespace', 'key', 'new', 'old', 'ret',
                               'in-reply-to'))
def handleerrorpushkey(op, inpart):
    """Used to transmit failure of a mandatory pushkey over the wire"""
    kwargs = {}
    for name in ('namespace', 'key', 'new', 'old', 'ret'):
        value = inpart.params.get(name)
        if value is not None:
            kwargs[name] = value
    raise error.PushkeyFailed(inpart.params['in-reply-to'], **kwargs)

@parthandler('error:unsupportedcontent', ('parttype', 'params'))
def handleerrorunsupportedcontent(op, inpart):
    """Used to transmit unknown content error over the wire"""
    kwargs = {}
    parttype = inpart.params.get('parttype')
    if parttype is not None:
        kwargs['parttype'] = parttype
    params = inpart.params.get('params')
    if params is not None:
        kwargs['params'] = params.split('\0')

    raise error.BundleUnknownFeatureError(**kwargs)

@parthandler('error:pushraced', ('message',))
def handleerrorpushraced(op, inpart):
    """Used to transmit push race error over the wire"""
    raise error.ResponseError(_('push failed:'), inpart.params['message'])

@parthandler('listkeys', ('namespace',))
def handlelistkeys(op, inpart):
    """retrieve pushkey namespace content stored in a bundle2"""
    namespace = inpart.params['namespace']
    r = pushkey.decodekeys(inpart.read())
    op.records.add('listkeys', (namespace, r))

@parthandler('pushkey', ('namespace', 'key', 'old', 'new'))
def handlepushkey(op, inpart):
    """process a pushkey request"""
    dec = pushkey.decode
    namespace = dec(inpart.params['namespace'])
    key = dec(inpart.params['key'])
    old = dec(inpart.params['old'])
    new = dec(inpart.params['new'])
    # Grab the transaction to ensure that we have the lock before performing
    # the pushkey.
    if op.ui.configbool('experimental', 'bundle2lazylocking'):
        op.gettransaction()
    ret = op.repo.pushkey(namespace, key, old, new)
    record = {'namespace': namespace,
              'key': key,
              'old': old,
              'new': new}
    op.records.add('pushkey', record)
    if op.reply is not None:
        rpart = op.reply.newpart('reply:pushkey')
        rpart.addparam('in-reply-to', str(inpart.id), mandatory=False)
        rpart.addparam('return', '%i' % ret, mandatory=False)
    if inpart.mandatory and not ret:
        kwargs = {}
        for key in ('namespace', 'key', 'new', 'old', 'ret'):
            if key in inpart.params:
                kwargs[key] = inpart.params[key]
        raise error.PushkeyFailed(partid=str(inpart.id), **kwargs)

@parthandler('reply:pushkey', ('return', 'in-reply-to'))
def handlepushkeyreply(op, inpart):
    """retrieve the result of a pushkey request"""
    ret = int(inpart.params['return'])
    partid = int(inpart.params['in-reply-to'])
    op.records.add('pushkey', {'return': ret}, partid)

@parthandler('obsmarkers')
def handleobsmarker(op, inpart):
    """add a stream of obsmarkers to the repo"""
    tr = op.gettransaction()
    markerdata = inpart.read()
    if op.ui.config('experimental', 'obsmarkers-exchange-debug', False):
        op.ui.write(('obsmarker-exchange: %i bytes received\n')
                    % len(markerdata))
    # The mergemarkers call will crash if marker creation is not enabled.
    # We want to avoid this if the part is advisory.
    if not inpart.mandatory and op.repo.obsstore.readonly:
        op.repo.ui.debug('ignoring obsolescence markers, feature not enabled\n')
        return
    new = op.repo.obsstore.mergemarkers(tr, markerdata)
    if new:
        op.repo.ui.status(_('%i new obsolescence markers\n') % new)
    op.records.add('obsmarkers', {'new': new})
    if op.reply is not None:
        rpart = op.reply.newpart('reply:obsmarkers')
        rpart.addparam('in-reply-to', str(inpart.id), mandatory=False)
        rpart.addparam('new', '%i' % new, mandatory=False)


@parthandler('reply:obsmarkers', ('new', 'in-reply-to'))
def handleobsmarkerreply(op, inpart):
    """retrieve the result of an obsmarkers request"""
    ret = int(inpart.params['new'])
    partid = int(inpart.params['in-reply-to'])
    op.records.add('obsmarkers', {'new': ret}, partid)

@parthandler('hgtagsfnodes')
def handlehgtagsfnodes(op, inpart):
    """Applies .hgtags fnodes cache entries to the local repo.

    Payload is pairs of 20 byte changeset nodes and filenodes.
    """
    # Grab the transaction so we ensure that we have the lock at this point.
    if op.ui.configbool('experimental', 'bundle2lazylocking'):
        op.gettransaction()
    cache = tags.hgtagsfnodescache(op.repo.unfiltered())

    count = 0
    while True:
        node = inpart.read(20)
        fnode = inpart.read(20)
        if len(node) < 20 or len(fnode) < 20:
            op.ui.debug('ignoring incomplete received .hgtags fnodes data\n')
            break
        cache.setfnode(node, fnode)
        count += 1

    cache.write()
    op.ui.debug('applied %i hgtags fnodes cache entries\n' % count)
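
# Producer-side counterpart (a sketch; the consumer loop above reads the
# payload back 40 bytes at a time):
def _encodehgtagsfnodes(pairs):
    """pack an iterable of (node, fnode) 20-byte pairs into a payload blob"""
    return ''.join(node + fnode for node, fnode in pairs)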
@@ -1,579 +1,579 b''
# encoding.py - character transcoding support for Mercurial
#
# Copyright 2005-2009 Matt Mackall <mpm@selenic.com> and others
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.

from __future__ import absolute_import

import array
import locale
import os
import unicodedata

from . import (
    error,
    pycompat,
)

if pycompat.ispy3:
    unichr = chr

# These unicode characters are ignored by HFS+ (Apple Technote 1150,
# "Unicode Subtleties"), so we need to ignore them in some places for
# sanity.
_ignore = [unichr(int(x, 16)).encode("utf-8") for x in
           "200c 200d 200e 200f 202a 202b 202c 202d 202e "
           "206a 206b 206c 206d 206e 206f feff".split()]
# verify the next function will work
if pycompat.ispy3:
    assert set(i[0] for i in _ignore) == set([ord(b'\xe2'), ord(b'\xef')])
else:
    assert set(i[0] for i in _ignore) == set(["\xe2", "\xef"])

def hfsignoreclean(s):
    """Remove codepoints ignored by HFS+ from s.

    >>> hfsignoreclean(u'.h\u200cg'.encode('utf-8'))
    '.hg'
    >>> hfsignoreclean(u'.h\ufeffg'.encode('utf-8'))
    '.hg'
    """
    if "\xe2" in s or "\xef" in s:
        for c in _ignore:
            s = s.replace(c, '')
    return s

48 def _getpreferredencoding():
48 def _getpreferredencoding():
49 '''
49 '''
50 On darwin, getpreferredencoding ignores the locale environment and
50 On darwin, getpreferredencoding ignores the locale environment and
51 always returns mac-roman. http://bugs.python.org/issue6202 fixes this
51 always returns mac-roman. http://bugs.python.org/issue6202 fixes this
52 for Python 2.7 and up. This is the same corrected code for earlier
52 for Python 2.7 and up. This is the same corrected code for earlier
53 Python versions.
53 Python versions.
54
54
55 However, we can't use a version check for this method, as some distributions
55 However, we can't use a version check for this method, as some distributions
56 patch Python to fix this. Instead, we use it as a 'fixer' for the mac-roman
56 patch Python to fix this. Instead, we use it as a 'fixer' for the mac-roman
57 encoding, as it is unlikely that this encoding is the actually expected.
57 encoding, as it is unlikely that this encoding is the actually expected.
58 '''
58 '''
59 try:
59 try:
60 locale.CODESET
60 locale.CODESET
61 except AttributeError:
61 except AttributeError:
62 # Fall back to parsing environment variables :-(
62 # Fall back to parsing environment variables :-(
63 return locale.getdefaultlocale()[1]
63 return locale.getdefaultlocale()[1]
64
64
65 oldloc = locale.setlocale(locale.LC_CTYPE)
65 oldloc = locale.setlocale(locale.LC_CTYPE)
66 locale.setlocale(locale.LC_CTYPE, "")
66 locale.setlocale(locale.LC_CTYPE, "")
67 result = locale.nl_langinfo(locale.CODESET)
67 result = locale.nl_langinfo(locale.CODESET)
68 locale.setlocale(locale.LC_CTYPE, oldloc)
68 locale.setlocale(locale.LC_CTYPE, oldloc)
69
69
70 return result
70 return result
71
71
72 _encodingfixers = {
72 _encodingfixers = {
73 '646': lambda: 'ascii',
73 '646': lambda: 'ascii',
74 'ANSI_X3.4-1968': lambda: 'ascii',
74 'ANSI_X3.4-1968': lambda: 'ascii',
75 'mac-roman': _getpreferredencoding
75 'mac-roman': _getpreferredencoding
76 }
76 }
77
77
78 try:
78 try:
79 encoding = os.environ.get("HGENCODING")
79 encoding = os.environ.get("HGENCODING")
80 if not encoding:
80 if not encoding:
81 encoding = locale.getpreferredencoding() or 'ascii'
81 encoding = locale.getpreferredencoding() or 'ascii'
82 encoding = _encodingfixers.get(encoding, lambda: encoding)()
82 encoding = _encodingfixers.get(encoding, lambda: encoding)()
83 except locale.Error:
83 except locale.Error:
84 encoding = 'ascii'
84 encoding = 'ascii'
85 encodingmode = os.environ.get("HGENCODINGMODE", "strict")
85 encodingmode = os.environ.get("HGENCODINGMODE", "strict")
86 fallbackencoding = 'ISO-8859-1'
86 fallbackencoding = 'ISO-8859-1'
87
87
88 class localstr(str):
88 class localstr(str):
89 '''This class allows strings that are unmodified to be
89 '''This class allows strings that are unmodified to be
90 round-tripped to the local encoding and back'''
90 round-tripped to the local encoding and back'''
91 def __new__(cls, u, l):
91 def __new__(cls, u, l):
92 s = str.__new__(cls, l)
92 s = str.__new__(cls, l)
93 s._utf8 = u
93 s._utf8 = u
94 return s
94 return s
95 def __hash__(self):
95 def __hash__(self):
96 return hash(self._utf8) # avoid collisions in local string space
96 return hash(self._utf8) # avoid collisions in local string space
97
97
98 def tolocal(s):
98 def tolocal(s):
99 """
99 """
100 Convert a string from internal UTF-8 to local encoding
100 Convert a string from internal UTF-8 to local encoding
101
101
102 All internal strings should be UTF-8 but some repos before the
102 All internal strings should be UTF-8 but some repos before the
103 implementation of locale support may contain latin1 or possibly
103 implementation of locale support may contain latin1 or possibly
104 other character sets. We attempt to decode everything strictly
104 other character sets. We attempt to decode everything strictly
105 using UTF-8, then Latin-1, and failing that, we use UTF-8 and
105 using UTF-8, then Latin-1, and failing that, we use UTF-8 and
106 replace unknown characters.
106 replace unknown characters.
107
107
108 The localstr class is used to cache the known UTF-8 encoding of
108 The localstr class is used to cache the known UTF-8 encoding of
109 strings next to their local representation to allow lossless
109 strings next to their local representation to allow lossless
110 round-trip conversion back to UTF-8.
110 round-trip conversion back to UTF-8.
111
111
112 >>> u = 'foo: \\xc3\\xa4' # utf-8
112 >>> u = 'foo: \\xc3\\xa4' # utf-8
113 >>> l = tolocal(u)
113 >>> l = tolocal(u)
114 >>> l
114 >>> l
115 'foo: ?'
115 'foo: ?'
116 >>> fromlocal(l)
116 >>> fromlocal(l)
117 'foo: \\xc3\\xa4'
117 'foo: \\xc3\\xa4'
118 >>> u2 = 'foo: \\xc3\\xa1'
118 >>> u2 = 'foo: \\xc3\\xa1'
119 >>> d = { l: 1, tolocal(u2): 2 }
119 >>> d = { l: 1, tolocal(u2): 2 }
120 >>> len(d) # no collision
120 >>> len(d) # no collision
121 2
121 2
122 >>> 'foo: ?' in d
122 >>> 'foo: ?' in d
123 False
123 False
124 >>> l1 = 'foo: \\xe4' # historical latin1 fallback
124 >>> l1 = 'foo: \\xe4' # historical latin1 fallback
125 >>> l = tolocal(l1)
125 >>> l = tolocal(l1)
126 >>> l
126 >>> l
127 'foo: ?'
127 'foo: ?'
128 >>> fromlocal(l) # magically in utf-8
128 >>> fromlocal(l) # magically in utf-8
129 'foo: \\xc3\\xa4'
129 'foo: \\xc3\\xa4'
130 """
130 """
131
131
132 try:
132 try:
133 try:
133 try:
134 # make sure string is actually stored in UTF-8
134 # make sure string is actually stored in UTF-8
135 u = s.decode('UTF-8')
135 u = s.decode('UTF-8')
136 if encoding == 'UTF-8':
136 if encoding == 'UTF-8':
137 # fast path
137 # fast path
138 return s
138 return s
139 r = u.encode(encoding, "replace")
139 r = u.encode(encoding, "replace")
140 if u == r.decode(encoding):
140 if u == r.decode(encoding):
141 # r is a safe, non-lossy encoding of s
141 # r is a safe, non-lossy encoding of s
142 return r
142 return r
143 return localstr(s, r)
143 return localstr(s, r)
144 except UnicodeDecodeError:
144 except UnicodeDecodeError:
145 # we should only get here if we're looking at an ancient changeset
145 # we should only get here if we're looking at an ancient changeset
146 try:
146 try:
147 u = s.decode(fallbackencoding)
147 u = s.decode(fallbackencoding)
148 r = u.encode(encoding, "replace")
148 r = u.encode(encoding, "replace")
149 if u == r.decode(encoding):
149 if u == r.decode(encoding):
150 # r is a safe, non-lossy encoding of s
150 # r is a safe, non-lossy encoding of s
151 return r
151 return r
152 return localstr(u.encode('UTF-8'), r)
152 return localstr(u.encode('UTF-8'), r)
153 except UnicodeDecodeError:
153 except UnicodeDecodeError:
154 u = s.decode("utf-8", "replace") # last ditch
154 u = s.decode("utf-8", "replace") # last ditch
155 return u.encode(encoding, "replace") # can't round-trip
155 return u.encode(encoding, "replace") # can't round-trip
156 except LookupError as k:
156 except LookupError as k:
157 raise error.Abort(k, hint="please check your locale settings")
157 raise error.Abort(k, hint="please check your locale settings")
158
158
159 def fromlocal(s):
159 def fromlocal(s):
160 """
160 """
161 Convert a string from the local character encoding to UTF-8
161 Convert a string from the local character encoding to UTF-8
162
162
163 We attempt to decode strings using the encoding mode set by
163 We attempt to decode strings using the encoding mode set by
164 HGENCODINGMODE, which defaults to 'strict'. In this mode, unknown
164 HGENCODINGMODE, which defaults to 'strict'. In this mode, unknown
165 characters will cause an error message. Other modes include
165 characters will cause an error message. Other modes include
166 'replace', which replaces unknown characters with a special
166 'replace', which replaces unknown characters with a special
167 Unicode character, and 'ignore', which drops the character.
167 Unicode character, and 'ignore', which drops the character.
168 """
168 """
169
169
170 # can we do a lossless round-trip?
170 # can we do a lossless round-trip?
171 if isinstance(s, localstr):
171 if isinstance(s, localstr):
172 return s._utf8
172 return s._utf8
173
173
174 try:
174 try:
175 return s.decode(encoding, encodingmode).encode("utf-8")
175 return s.decode(encoding, encodingmode).encode("utf-8")
176 except UnicodeDecodeError as inst:
176 except UnicodeDecodeError as inst:
177 sub = s[max(0, inst.start - 10):inst.start + 10]
177 sub = s[max(0, inst.start - 10):inst.start + 10]
178 raise error.Abort("decoding near '%s': %s!" % (sub, inst))
178 raise error.Abort("decoding near '%s': %s!" % (sub, inst))
179 except LookupError as k:
179 except LookupError as k:
180 raise error.Abort(k, hint="please check your locale settings")
180 raise error.Abort(k, hint="please check your locale settings")
181
181
182 # How to treat ambiguous-width characters. Set to 'wide' to treat as wide.
182 # How to treat ambiguous-width characters. Set to 'wide' to treat as wide.
183 wide = (os.environ.get("HGENCODINGAMBIGUOUS", "narrow") == "wide"
183 wide = (os.environ.get("HGENCODINGAMBIGUOUS", "narrow") == "wide"
184 and "WFA" or "WF")
184 and "WFA" or "WF")
185
185
186 def colwidth(s):
186 def colwidth(s):
187 "Find the column width of a string for display in the local encoding"
187 "Find the column width of a string for display in the local encoding"
188 return ucolwidth(s.decode(encoding, 'replace'))
188 return ucolwidth(s.decode(encoding, 'replace'))
189
189
190 def ucolwidth(d):
190 def ucolwidth(d):
191 "Find the column width of a Unicode string for display"
191 "Find the column width of a Unicode string for display"
192 eaw = getattr(unicodedata, 'east_asian_width', None)
192 eaw = getattr(unicodedata, 'east_asian_width', None)
193 if eaw is not None:
193 if eaw is not None:
194 return sum([eaw(c) in wide and 2 or 1 for c in d])
194 return sum([eaw(c) in wide and 2 or 1 for c in d])
195 return len(d)
195 return len(d)
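
ucolwidth counts display columns rather than characters: East Asian wide ('W') and fullwidth ('F') codepoints occupy two terminal cells, and HGENCODINGAMBIGUOUS=wide adds the ambiguous ('A') class via the `wide` string above. A hedged standalone illustration of that difference (not part of the module):

    # -*- coding: utf-8 -*-
    import unicodedata

    def displaywidth(u, ambiguouswide=False):
        """Count terminal columns the way ucolwidth does: 'W'/'F' codepoints
        are two cells wide, optionally treating 'A' (ambiguous) as wide too."""
        wide = "WFA" if ambiguouswide else "WF"
        return sum(2 if unicodedata.east_asian_width(c) in wide else 1
                   for c in u)

    assert displaywidth(u'hg') == 2            # ASCII: one column per character
    assert displaywidth(u'\u3042\u3044') == 4  # two hiragana fill four columns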

 def getcols(s, start, c):
     '''Use colwidth to find a c-column substring of s starting at byte
     index start'''
     for x in xrange(start + c, len(s)):
         t = s[start:x]
         if colwidth(t) == c:
             return t

 def trim(s, width, ellipsis='', leftside=False):
     """Trim string 's' to at most 'width' columns (including 'ellipsis').

     If 'leftside' is True, left side of string 's' is trimmed.
     'ellipsis' is always placed at trimmed side.

     >>> ellipsis = '+++'
     >>> from . import encoding
     >>> encoding.encoding = 'utf-8'
     >>> t= '1234567890'
     >>> print trim(t, 12, ellipsis=ellipsis)
     1234567890
     >>> print trim(t, 10, ellipsis=ellipsis)
     1234567890
     >>> print trim(t, 8, ellipsis=ellipsis)
     12345+++
     >>> print trim(t, 8, ellipsis=ellipsis, leftside=True)
     +++67890
     >>> print trim(t, 8)
     12345678
     >>> print trim(t, 8, leftside=True)
     34567890
     >>> print trim(t, 3, ellipsis=ellipsis)
     +++
     >>> print trim(t, 1, ellipsis=ellipsis)
     +
     >>> u = u'\u3042\u3044\u3046\u3048\u304a' # 2 x 5 = 10 columns
     >>> t = u.encode(encoding.encoding)
     >>> print trim(t, 12, ellipsis=ellipsis)
     \xe3\x81\x82\xe3\x81\x84\xe3\x81\x86\xe3\x81\x88\xe3\x81\x8a
     >>> print trim(t, 10, ellipsis=ellipsis)
     \xe3\x81\x82\xe3\x81\x84\xe3\x81\x86\xe3\x81\x88\xe3\x81\x8a
     >>> print trim(t, 8, ellipsis=ellipsis)
     \xe3\x81\x82\xe3\x81\x84+++
     >>> print trim(t, 8, ellipsis=ellipsis, leftside=True)
     +++\xe3\x81\x88\xe3\x81\x8a
     >>> print trim(t, 5)
     \xe3\x81\x82\xe3\x81\x84
     >>> print trim(t, 5, leftside=True)
     \xe3\x81\x88\xe3\x81\x8a
     >>> print trim(t, 4, ellipsis=ellipsis)
     +++
     >>> print trim(t, 4, ellipsis=ellipsis, leftside=True)
     +++
     >>> t = '\x11\x22\x33\x44\x55\x66\x77\x88\x99\xaa' # invalid byte sequence
     >>> print trim(t, 12, ellipsis=ellipsis)
     \x11\x22\x33\x44\x55\x66\x77\x88\x99\xaa
     >>> print trim(t, 10, ellipsis=ellipsis)
     \x11\x22\x33\x44\x55\x66\x77\x88\x99\xaa
     >>> print trim(t, 8, ellipsis=ellipsis)
     \x11\x22\x33\x44\x55+++
     >>> print trim(t, 8, ellipsis=ellipsis, leftside=True)
     +++\x66\x77\x88\x99\xaa
     >>> print trim(t, 8)
     \x11\x22\x33\x44\x55\x66\x77\x88
     >>> print trim(t, 8, leftside=True)
     \x33\x44\x55\x66\x77\x88\x99\xaa
     >>> print trim(t, 3, ellipsis=ellipsis)
     +++
     >>> print trim(t, 1, ellipsis=ellipsis)
     +
     """
     try:
         u = s.decode(encoding)
     except UnicodeDecodeError:
         if len(s) <= width: # trimming is not needed
             return s
         width -= len(ellipsis)
         if width <= 0: # no enough room even for ellipsis
             return ellipsis[:width + len(ellipsis)]
         if leftside:
             return ellipsis + s[-width:]
         return s[:width] + ellipsis

     if ucolwidth(u) <= width: # trimming is not needed
         return s

     width -= len(ellipsis)
     if width <= 0: # no enough room even for ellipsis
         return ellipsis[:width + len(ellipsis)]

     if leftside:
         uslice = lambda i: u[i:]
         concat = lambda s: ellipsis + s
     else:
         uslice = lambda i: u[:-i]
         concat = lambda s: s + ellipsis
     for i in xrange(1, len(u)):
         usub = uslice(i)
         if ucolwidth(usub) <= width:
             return concat(usub.encode(encoding))
     return ellipsis # no enough room for multi-column characters

 def _asciilower(s):
     '''convert a string to lowercase if ASCII

     Raises UnicodeDecodeError if non-ASCII characters are found.'''
     s.decode('ascii')
     return s.lower()

 def asciilower(s):
     # delay importing avoids cyclic dependency around "parsers" in
     # pure Python build (util => i18n => encoding => parsers => util)
     from . import parsers
     impl = getattr(parsers, 'asciilower', _asciilower)
     global asciilower
     asciilower = impl
     return impl(s)

 def _asciiupper(s):
     '''convert a string to uppercase if ASCII

     Raises UnicodeDecodeError if non-ASCII characters are found.'''
     s.decode('ascii')
     return s.upper()

 def asciiupper(s):
     # delay importing avoids cyclic dependency around "parsers" in
     # pure Python build (util => i18n => encoding => parsers => util)
     from . import parsers
     impl = getattr(parsers, 'asciiupper', _asciiupper)
     global asciiupper
     asciiupper = impl
     return impl(s)
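
asciilower and asciiupper above use a self-replacing function idiom: the first call imports parsers, picks the C implementation if one is available, and rebinds the module-level name to it, so every later call dispatches straight to the chosen implementation with no import or getattr cost. A hedged sketch of the bare pattern (probeimpl stands in for the real resolver):

    def probeimpl():
        # stand-in for "prefer the C extension, fall back to pure Python"
        return str.lower

    def fold(s):
        """First call resolves an implementation, then rebinds the
        module-level name so subsequent calls skip this function entirely."""
        impl = probeimpl()
        global fold
        fold = impl
        return impl(s)

    assert fold('MiXeD') == 'mixed'   # first call resolves and rebinds
    assert fold('ABC') == 'abc'       # now served by the chosen implementation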

 def lower(s):
     "best-effort encoding-aware case-folding of local string s"
     try:
         return asciilower(s)
     except UnicodeDecodeError:
         pass
     try:
         if isinstance(s, localstr):
             u = s._utf8.decode("utf-8")
         else:
             u = s.decode(encoding, encodingmode)

         lu = u.lower()
         if u == lu:
             return s # preserve localstring
         return lu.encode(encoding)
     except UnicodeError:
         return s.lower() # we don't know how to fold this except in ASCII
     except LookupError as k:
         raise error.Abort(k, hint="please check your locale settings")

 def upper(s):
     "best-effort encoding-aware case-folding of local string s"
     try:
         return asciiupper(s)
     except UnicodeDecodeError:
         return upperfallback(s)

 def upperfallback(s):
     try:
         if isinstance(s, localstr):
             u = s._utf8.decode("utf-8")
         else:
             u = s.decode(encoding, encodingmode)

         uu = u.upper()
         if u == uu:
             return s # preserve localstring
         return uu.encode(encoding)
     except UnicodeError:
         return s.upper() # we don't know how to fold this except in ASCII
     except LookupError as k:
         raise error.Abort(k, hint="please check your locale settings")

 class normcasespecs(object):
     '''what a platform's normcase does to ASCII strings

     This is specified per platform, and should be consistent with what normcase
     on that platform actually does.

     lower: normcase lowercases ASCII strings
     upper: normcase uppercases ASCII strings
     other: the fallback function should always be called

     This should be kept in sync with normcase_spec in util.h.'''
     lower = -1
     upper = 1
     other = 0

 _jsonmap = []
 _jsonmap.extend("\\u%04x" % x for x in range(32))
 _jsonmap.extend(chr(x) for x in range(32, 127))
 _jsonmap.append('\\u007f')
 _jsonmap[0x09] = '\\t'
 _jsonmap[0x0a] = '\\n'
 _jsonmap[0x22] = '\\"'
 _jsonmap[0x5c] = '\\\\'
 _jsonmap[0x08] = '\\b'
 _jsonmap[0x0c] = '\\f'
 _jsonmap[0x0d] = '\\r'
 _paranoidjsonmap = _jsonmap[:]
 _paranoidjsonmap[0x3c] = '\\u003c' # '<' (e.g. escape "</script>")
 _paranoidjsonmap[0x3e] = '\\u003e' # '>'
 _jsonmap.extend(chr(x) for x in range(128, 256))

 def jsonescape(s, paranoid=False):
     '''returns a string suitable for JSON

     JSON is problematic for us because it doesn't support non-Unicode
     bytes. To deal with this, we take the following approach:

     - localstr objects are converted back to UTF-8
     - valid UTF-8/ASCII strings are passed as-is
     - other strings are converted to UTF-8b surrogate encoding
     - apply JSON-specified string escaping

     (escapes are doubled in these tests)

     >>> jsonescape('this is a test')
     'this is a test'
     >>> jsonescape('escape characters: \\0 \\x0b \\x7f')
     'escape characters: \\\\u0000 \\\\u000b \\\\u007f'
     >>> jsonescape('escape characters: \\t \\n \\r \\" \\\\')
     'escape characters: \\\\t \\\\n \\\\r \\\\" \\\\\\\\'
     >>> jsonescape('a weird byte: \\xdd')
     'a weird byte: \\xed\\xb3\\x9d'
     >>> jsonescape('utf-8: caf\\xc3\\xa9')
     'utf-8: caf\\xc3\\xa9'
     >>> jsonescape('')
     ''

     If paranoid, non-ascii and common troublesome characters are also escaped.
     This is suitable for web output.

     >>> jsonescape('escape boundary: \\x7e \\x7f \\xc2\\x80', paranoid=True)
     'escape boundary: ~ \\\\u007f \\\\u0080'
     >>> jsonescape('a weird byte: \\xdd', paranoid=True)
     'a weird byte: \\\\udcdd'
     >>> jsonescape('utf-8: caf\\xc3\\xa9', paranoid=True)
     'utf-8: caf\\\\u00e9'
     >>> jsonescape('non-BMP: \\xf0\\x9d\\x84\\x9e', paranoid=True)
     'non-BMP: \\\\ud834\\\\udd1e'
     >>> jsonescape('<foo@example.org>', paranoid=True)
     '\\\\u003cfoo@example.org\\\\u003e'
     '''

     if paranoid:
         jm = _paranoidjsonmap
     else:
         jm = _jsonmap

     u8chars = toutf8b(s)
     try:
         return ''.join(jm[x] for x in bytearray(u8chars)) # fast path
     except IndexError:
         pass
     # non-BMP char is represented as UTF-16 surrogate pair
     u16codes = array.array('H', u8chars.decode('utf-8').encode('utf-16'))
     u16codes.pop(0) # drop BOM
     return ''.join(jm[x] if x < 128 else '\\u%04x' % x for x in u16codes)

 _utf8len = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 3, 4]

 def getutf8char(s, pos):
     '''get the next full utf-8 character in the given string, starting at pos

     Raises a UnicodeError if the given location does not start a valid
     utf-8 character.
     '''

     # find how many bytes to attempt decoding from first nibble
     l = _utf8len[ord(s[pos]) >> 4]
     if not l: # ascii
         return s[pos]

     c = s[pos:pos + l]
     # validate with attempted decode
     c.decode("utf-8")
     return c
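
The _utf8len table above is indexed by the high nibble of a sequence's first byte: leading bytes 0xC0-0xDF, 0xE0-0xEF and 0xF0-0xFF announce 2-, 3- and 4-byte sequences, zero flags single-byte ASCII, and continuation bytes (0x80-0xBF) land on 1 and then fail the validating decode. A hedged worked check (Python 2 byte-string semantics, as in this module):

    _utf8len = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 3, 4]

    assert _utf8len[ord('a') >> 4] == 0      # plain ASCII: single byte
    assert _utf8len[ord('\xc3') >> 4] == 2   # leading byte of a 2-byte sequence
    assert _utf8len[ord('\xe3') >> 4] == 3   # e.g. hiragana, 3-byte sequence
    assert _utf8len[ord('\xf0') >> 4] == 4   # non-BMP, 4-byte sequence
    # a lone continuation byte maps to length 1; the decode check rejects it
    try:
        '\x82'.decode('utf-8')
    except UnicodeDecodeError:
        pass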

 def toutf8b(s):
     '''convert a local, possibly-binary string into UTF-8b

     This is intended as a generic method to preserve data when working
     with schemes like JSON and XML that have no provision for
     arbitrary byte strings. As Mercurial often doesn't know
     what encoding data is in, we use so-called UTF-8b.

     If a string is already valid UTF-8 (or ASCII), it passes unmodified.
     Otherwise, unsupported bytes are mapped to UTF-16 surrogate range,
     uDC00-uDCFF.

     Principles of operation:

     - ASCII and UTF-8 data successfully round-trips and is understood
       by Unicode-oriented clients
     - filenames and file contents in arbitrary other encodings can have
       be round-tripped or recovered by clueful clients
     - local strings that have a cached known UTF-8 encoding (aka
       localstr) get sent as UTF-8 so Unicode-oriented clients get the
       Unicode data they want
     - because we must preserve UTF-8 bytestring in places such as
       filenames, metadata can't be roundtripped without help

     (Note: "UTF-8b" often refers to decoding a mix of valid UTF-8 and
     arbitrary bytes into an internal Unicode format that can be
     re-encoded back into the original. Here we are exposing the
     internal surrogate encoding as a UTF-8 string.)
     '''

     if "\xed" not in s:
         if isinstance(s, localstr):
             return s._utf8
         try:
             s.decode('utf-8')
             return s
         except UnicodeDecodeError:
             pass

     r = ""
     pos = 0
     l = len(s)
     while pos < l:
         try:
             c = getutf8char(s, pos)
             if "\xed\xb0\x80" <= c <= "\xed\xb3\xbf":
                 # have to re-escape existing U+DCxx characters
                 c = unichr(0xdc00 + ord(s[pos])).encode('utf-8')
                 pos += 1
             else:
                 pos += len(c)
         except UnicodeDecodeError:
             c = unichr(0xdc00 + ord(s[pos])).encode('utf-8')
             pos += 1
         r += c
     return r

 def fromutf8b(s):
     '''Given a UTF-8b string, return a local, possibly-binary string.

     return the original binary string. This
     is a round-trip process for strings like filenames, but metadata
     that's was passed through tolocal will remain in UTF-8.

     >>> roundtrip = lambda x: fromutf8b(toutf8b(x)) == x
     >>> m = "\\xc3\\xa9\\x99abcd"
     >>> toutf8b(m)
     '\\xc3\\xa9\\xed\\xb2\\x99abcd'
     >>> roundtrip(m)
     True
     >>> roundtrip("\\xc2\\xc2\\x80")
     True
     >>> roundtrip("\\xef\\xbf\\xbd")
     True
     >>> roundtrip("\\xef\\xef\\xbf\\xbd")
     True
     >>> roundtrip("\\xf1\\x80\\x80\\x80\\x80")
     True
     '''

     # fast path - look for uDxxx prefixes in s
     if "\xed" not in s:
         return s

     # We could do this with the unicode type but some Python builds
     # use UTF-16 internally (issue5031) which causes non-BMP code
     # points to be escaped. Instead, we use our handy getutf8char
     # helper again to walk the string without "decoding" it.

     r = ""
     pos = 0
     l = len(s)
     while pos < l:
         c = getutf8char(s, pos)
         pos += len(c)
         # unescape U+DCxx characters
         if "\xed\xb0\x80" <= c <= "\xed\xb3\xbf":
             c = chr(ord(c.decode("utf-8")) & 0xff)
         r += c
     return r
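
To make the surrogate mapping in toutf8b/fromutf8b concrete: a byte that cannot be decoded, say 0x99, is replaced by the codepoint U+DC99, whose UTF-8 encoding is the three bytes \xed\xb2\x99, and fromutf8b masks the low byte back out. A hedged standalone check under Python 2 semantics (where encoding a lone surrogate is permitted):

    raw = '\x99'                                 # not valid UTF-8 on its own
    escaped = unichr(0xdc00 + ord(raw)).encode('utf-8')
    assert escaped == '\xed\xb2\x99'             # matches the toutf8b doctest
    # and the reverse mapping used by fromutf8b:
    assert chr(ord(escaped.decode('utf-8')) & 0xff) == raw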
@@ -1,166 +1,168 @@
 # pycompat.py - portability shim for python 3
 #
 # This software may be used and distributed according to the terms of the
 # GNU General Public License version 2 or any later version.

 """Mercurial portability shim for python 3.

 This contains aliases to hide python version-specific details from the core.
 """

 from __future__ import absolute_import

 import sys

-if sys.version_info[0] < 3:
+ispy3 = (sys.version_info[0] >= 3)
+
+if not ispy3:
     import cPickle as pickle
     import cStringIO as io
     import httplib
     import Queue as _queue
     import SocketServer as socketserver
     import urlparse
     import xmlrpclib
 else:
     import http.client as httplib
     import io
     import pickle
     import queue as _queue
     import socketserver
     import urllib.parse as urlparse
     import xmlrpc.client as xmlrpclib

-if sys.version_info[0] >= 3:
+if ispy3:
     import builtins
     import functools

     def _wrapattrfunc(f):
         @functools.wraps(f)
         def w(object, name, *args):
             if isinstance(name, bytes):
                 name = name.decode(u'utf-8')
             return f(object, name, *args)
         return w

     # these wrappers are automagically imported by hgloader
     delattr = _wrapattrfunc(builtins.delattr)
     getattr = _wrapattrfunc(builtins.getattr)
     hasattr = _wrapattrfunc(builtins.hasattr)
     setattr = _wrapattrfunc(builtins.setattr)
     xrange = builtins.range
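
The _wrapattrfunc wrappers above exist because, as the comment notes, hgloader feeds this code to Python 3 where string literals have become bytes, and the builtin attribute functions reject bytes names there. A hedged illustration of the failure being papered over (standalone; wrappedgetattr is an illustrative name):

    def wrappedgetattr(obj, name, *args):
        # decode bytes attribute names so callers may pass either type
        if isinstance(name, bytes):
            name = name.decode(u'utf-8')
        return getattr(obj, name, *args)

    class C(object):
        rev = 42

    # plain getattr(C, b'rev') raises TypeError on Python 3
    # ("attribute name must be string"); the wrapper accepts both spellings:
    assert wrappedgetattr(C, b'rev') == 42
    assert wrappedgetattr(C, 'rev') == 42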

 stringio = io.StringIO
 empty = _queue.Empty
 queue = _queue.Queue

 class _pycompatstub(object):
     def __init__(self):
         self._aliases = {}

     def _registeraliases(self, origin, items):
         """Add items that will be populated at the first access"""
         self._aliases.update((item.replace('_', '').lower(), (origin, item))
                              for item in items)

     def __getattr__(self, name):
         try:
             origin, item = self._aliases[name]
         except KeyError:
             raise AttributeError(name)
         self.__dict__[name] = obj = getattr(origin, item)
         return obj
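
_pycompatstub is a lazy alias namespace: _registeraliases only records (module, attribute) pairs under normalized names, and __getattr__ resolves a name on first access and caches the result in the instance __dict__, after which ordinary attribute lookup bypasses __getattr__ entirely. A hedged standalone sketch of the same pattern:

    import errno

    class lazynamespace(object):
        def __init__(self):
            self._aliases = {}

        def register(self, origin, items):
            # record (module, attribute) pairs under lowercased names
            self._aliases.update((item.lower(), (origin, item))
                                 for item in items)

        def __getattr__(self, name):
            try:
                origin, item = self._aliases[name]
            except KeyError:
                raise AttributeError(name)
            value = getattr(origin, item)
            self.__dict__[name] = value  # cache: next lookup skips __getattr__
            return value

    ns = lazynamespace()
    ns.register(errno, ['ENOENT'])
    assert ns.enoent == errno.ENOENT   # resolved lazily, then cached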

 httpserver = _pycompatstub()
 urlreq = _pycompatstub()
 urlerr = _pycompatstub()
-if sys.version_info[0] < 3:
+if not ispy3:
     import BaseHTTPServer
     import CGIHTTPServer
     import SimpleHTTPServer
     import urllib2
     import urllib
     urlreq._registeraliases(urllib, (
         "addclosehook",
         "addinfourl",
         "ftpwrapper",
         "pathname2url",
         "quote",
         "splitattr",
         "splitpasswd",
         "splitport",
         "splituser",
         "unquote",
         "url2pathname",
         "urlencode",
     ))
     urlreq._registeraliases(urllib2, (
         "AbstractHTTPHandler",
         "BaseHandler",
         "build_opener",
         "FileHandler",
         "FTPHandler",
         "HTTPBasicAuthHandler",
         "HTTPDigestAuthHandler",
         "HTTPHandler",
         "HTTPPasswordMgrWithDefaultRealm",
         "HTTPSHandler",
         "install_opener",
         "ProxyHandler",
         "Request",
         "urlopen",
     ))
     urlerr._registeraliases(urllib2, (
         "HTTPError",
         "URLError",
     ))
     httpserver._registeraliases(BaseHTTPServer, (
         "HTTPServer",
         "BaseHTTPRequestHandler",
     ))
     httpserver._registeraliases(SimpleHTTPServer, (
         "SimpleHTTPRequestHandler",
     ))
     httpserver._registeraliases(CGIHTTPServer, (
         "CGIHTTPRequestHandler",
     ))

 else:
     import urllib.request
     urlreq._registeraliases(urllib.request, (
         "AbstractHTTPHandler",
         "addclosehook",
         "addinfourl",
         "BaseHandler",
         "build_opener",
         "FileHandler",
         "FTPHandler",
         "ftpwrapper",
         "HTTPHandler",
         "HTTPSHandler",
         "install_opener",
         "pathname2url",
         "HTTPBasicAuthHandler",
         "HTTPDigestAuthHandler",
         "HTTPPasswordMgrWithDefaultRealm",
         "ProxyHandler",
         "quote",
         "Request",
         "splitattr",
         "splitpasswd",
         "splitport",
         "splituser",
         "unquote",
         "url2pathname",
         "urlopen",
     ))
     import urllib.error
     urlerr._registeraliases(urllib.error, (
         "HTTPError",
         "URLError",
     ))
     import http.server
     httpserver._registeraliases(http.server, (
         "HTTPServer",
         "BaseHTTPRequestHandler",
         "SimpleHTTPRequestHandler",
         "CGIHTTPRequestHandler",
     ))
1 # util.py - Mercurial utility functions and platform specific implementations
1 # util.py - Mercurial utility functions and platform specific implementations
2 #
2 #
3 # Copyright 2005 K. Thananchayan <thananck@yahoo.com>
3 # Copyright 2005 K. Thananchayan <thananck@yahoo.com>
4 # Copyright 2005-2007 Matt Mackall <mpm@selenic.com>
4 # Copyright 2005-2007 Matt Mackall <mpm@selenic.com>
5 # Copyright 2006 Vadim Gelfer <vadim.gelfer@gmail.com>
5 # Copyright 2006 Vadim Gelfer <vadim.gelfer@gmail.com>
6 #
6 #
7 # This software may be used and distributed according to the terms of the
7 # This software may be used and distributed according to the terms of the
8 # GNU General Public License version 2 or any later version.
8 # GNU General Public License version 2 or any later version.
9
9
10 """Mercurial utility functions and platform specific implementations.
10 """Mercurial utility functions and platform specific implementations.
11
11
12 This contains helper routines that are independent of the SCM core and
12 This contains helper routines that are independent of the SCM core and
13 hide platform-specific details from the core.
13 hide platform-specific details from the core.
14 """
14 """
15
15
16 from __future__ import absolute_import
16 from __future__ import absolute_import
17
17
18 import bz2
18 import bz2
19 import calendar
19 import calendar
20 import collections
20 import collections
21 import datetime
21 import datetime
22 import errno
22 import errno
23 import gc
23 import gc
24 import hashlib
24 import hashlib
25 import imp
25 import imp
26 import os
26 import os
27 import re as remod
27 import re as remod
28 import shutil
28 import shutil
29 import signal
29 import signal
30 import socket
30 import socket
31 import subprocess
31 import subprocess
32 import sys
32 import sys
33 import tempfile
33 import tempfile
34 import textwrap
34 import textwrap
35 import time
35 import time
36 import traceback
36 import traceback
37 import zlib
37 import zlib
38
38
39 from . import (
39 from . import (
40 encoding,
40 encoding,
41 error,
41 error,
42 i18n,
42 i18n,
43 osutil,
43 osutil,
44 parsers,
44 parsers,
45 pycompat,
45 pycompat,
46 )
46 )
47
47
48 for attr in (
48 for attr in (
49 'empty',
49 'empty',
50 'httplib',
50 'httplib',
51 'httpserver',
51 'httpserver',
52 'pickle',
52 'pickle',
53 'queue',
53 'queue',
54 'urlerr',
54 'urlerr',
55 'urlparse',
55 'urlparse',
56 # we do import urlreq, but we do it outside the loop
56 # we do import urlreq, but we do it outside the loop
57 #'urlreq',
57 #'urlreq',
58 'stringio',
58 'stringio',
59 'socketserver',
59 'socketserver',
60 'xmlrpclib',
60 'xmlrpclib',
61 ):
61 ):
62 globals()[attr] = getattr(pycompat, attr)
62 globals()[attr] = getattr(pycompat, attr)
63
63
64 # This line is to make pyflakes happy:
64 # This line is to make pyflakes happy:
65 urlreq = pycompat.urlreq
65 urlreq = pycompat.urlreq
66
66
67 if os.name == 'nt':
67 if os.name == 'nt':
68 from . import windows as platform
68 from . import windows as platform
69 else:
69 else:
70 from . import posix as platform
70 from . import posix as platform
71
71
72 _ = i18n._
72 _ = i18n._
73
73
74 bindunixsocket = platform.bindunixsocket
74 bindunixsocket = platform.bindunixsocket
75 cachestat = platform.cachestat
75 cachestat = platform.cachestat
76 checkexec = platform.checkexec
76 checkexec = platform.checkexec
77 checklink = platform.checklink
77 checklink = platform.checklink
78 copymode = platform.copymode
78 copymode = platform.copymode
79 executablepath = platform.executablepath
79 executablepath = platform.executablepath
80 expandglobs = platform.expandglobs
80 expandglobs = platform.expandglobs
81 explainexit = platform.explainexit
81 explainexit = platform.explainexit
82 findexe = platform.findexe
82 findexe = platform.findexe
83 gethgcmd = platform.gethgcmd
83 gethgcmd = platform.gethgcmd
84 getuser = platform.getuser
84 getuser = platform.getuser
85 getpid = os.getpid
85 getpid = os.getpid
86 groupmembers = platform.groupmembers
86 groupmembers = platform.groupmembers
87 groupname = platform.groupname
87 groupname = platform.groupname
88 hidewindow = platform.hidewindow
88 hidewindow = platform.hidewindow
89 isexec = platform.isexec
89 isexec = platform.isexec
90 isowner = platform.isowner
90 isowner = platform.isowner
91 localpath = platform.localpath
91 localpath = platform.localpath
92 lookupreg = platform.lookupreg
92 lookupreg = platform.lookupreg
93 makedir = platform.makedir
93 makedir = platform.makedir
94 nlinks = platform.nlinks
94 nlinks = platform.nlinks
95 normpath = platform.normpath
95 normpath = platform.normpath
96 normcase = platform.normcase
96 normcase = platform.normcase
97 normcasespec = platform.normcasespec
97 normcasespec = platform.normcasespec
98 normcasefallback = platform.normcasefallback
98 normcasefallback = platform.normcasefallback
99 openhardlinks = platform.openhardlinks
99 openhardlinks = platform.openhardlinks
100 oslink = platform.oslink
100 oslink = platform.oslink
101 parsepatchoutput = platform.parsepatchoutput
101 parsepatchoutput = platform.parsepatchoutput
102 pconvert = platform.pconvert
102 pconvert = platform.pconvert
103 poll = platform.poll
103 poll = platform.poll
104 popen = platform.popen
104 popen = platform.popen
105 posixfile = platform.posixfile
105 posixfile = platform.posixfile
106 quotecommand = platform.quotecommand
106 quotecommand = platform.quotecommand
107 readpipe = platform.readpipe
107 readpipe = platform.readpipe
108 rename = platform.rename
108 rename = platform.rename
109 removedirs = platform.removedirs
109 removedirs = platform.removedirs
110 samedevice = platform.samedevice
110 samedevice = platform.samedevice
111 samefile = platform.samefile
111 samefile = platform.samefile
112 samestat = platform.samestat
112 samestat = platform.samestat
113 setbinary = platform.setbinary
113 setbinary = platform.setbinary
114 setflags = platform.setflags
114 setflags = platform.setflags
115 setsignalhandler = platform.setsignalhandler
115 setsignalhandler = platform.setsignalhandler
116 shellquote = platform.shellquote
116 shellquote = platform.shellquote
117 spawndetached = platform.spawndetached
117 spawndetached = platform.spawndetached
118 split = platform.split
118 split = platform.split
119 sshargs = platform.sshargs
119 sshargs = platform.sshargs
120 statfiles = getattr(osutil, 'statfiles', platform.statfiles)
120 statfiles = getattr(osutil, 'statfiles', platform.statfiles)
121 statisexec = platform.statisexec
121 statisexec = platform.statisexec
122 statislink = platform.statislink
122 statislink = platform.statislink
123 termwidth = platform.termwidth
123 termwidth = platform.termwidth
124 testpid = platform.testpid
124 testpid = platform.testpid
125 umask = platform.umask
125 umask = platform.umask
126 unlink = platform.unlink
126 unlink = platform.unlink
127 unlinkpath = platform.unlinkpath
127 unlinkpath = platform.unlinkpath
128 username = platform.username
128 username = platform.username
129
129
130 # Python compatibility
130 # Python compatibility
131
131
132 _notset = object()
132 _notset = object()
133
133
134 # disable Python's problematic floating point timestamps (issue4836)
134 # disable Python's problematic floating point timestamps (issue4836)
135 # (Python hypocritically says you shouldn't change this behavior in
135 # (Python hypocritically says you shouldn't change this behavior in
136 # libraries, and sure enough Mercurial is not a library.)
136 # libraries, and sure enough Mercurial is not a library.)
137 os.stat_float_times(False)
137 os.stat_float_times(False)
138
138
139 def safehasattr(thing, attr):
139 def safehasattr(thing, attr):
140 return getattr(thing, attr, _notset) is not _notset
140 return getattr(thing, attr, _notset) is not _notset
141
141
142 DIGESTS = {
142 DIGESTS = {
143 'md5': hashlib.md5,
143 'md5': hashlib.md5,
144 'sha1': hashlib.sha1,
144 'sha1': hashlib.sha1,
145 'sha512': hashlib.sha512,
145 'sha512': hashlib.sha512,
146 }
146 }
147 # List of digest types from strongest to weakest
147 # List of digest types from strongest to weakest
148 DIGESTS_BY_STRENGTH = ['sha512', 'sha1', 'md5']
148 DIGESTS_BY_STRENGTH = ['sha512', 'sha1', 'md5']
149
149
150 for k in DIGESTS_BY_STRENGTH:
150 for k in DIGESTS_BY_STRENGTH:
151 assert k in DIGESTS
151 assert k in DIGESTS
152
152
153 class digester(object):
153 class digester(object):
154 """helper to compute digests.
154 """helper to compute digests.
155
155
156 This helper can be used to compute one or more digests given their name.
156 This helper can be used to compute one or more digests given their name.
157
157
158 >>> d = digester(['md5', 'sha1'])
158 >>> d = digester(['md5', 'sha1'])
159 >>> d.update('foo')
159 >>> d.update('foo')
160 >>> [k for k in sorted(d)]
160 >>> [k for k in sorted(d)]
161 ['md5', 'sha1']
161 ['md5', 'sha1']
162 >>> d['md5']
162 >>> d['md5']
163 'acbd18db4cc2f85cedef654fccc4a4d8'
163 'acbd18db4cc2f85cedef654fccc4a4d8'
164 >>> d['sha1']
164 >>> d['sha1']
165 '0beec7b5ea3f0fdbc95d0dd47f3c5bc275da8a33'
165 '0beec7b5ea3f0fdbc95d0dd47f3c5bc275da8a33'
166 >>> digester.preferred(['md5', 'sha1'])
166 >>> digester.preferred(['md5', 'sha1'])
167 'sha1'
167 'sha1'
168 """
168 """
169
169
170 def __init__(self, digests, s=''):
170 def __init__(self, digests, s=''):
171 self._hashes = {}
171 self._hashes = {}
172 for k in digests:
172 for k in digests:
173 if k not in DIGESTS:
173 if k not in DIGESTS:
174 raise Abort(_('unknown digest type: %s') % k)
174 raise Abort(_('unknown digest type: %s') % k)
175 self._hashes[k] = DIGESTS[k]()
175 self._hashes[k] = DIGESTS[k]()
176 if s:
176 if s:
177 self.update(s)
177 self.update(s)
178
178
179 def update(self, data):
179 def update(self, data):
180 for h in self._hashes.values():
180 for h in self._hashes.values():
181 h.update(data)
181 h.update(data)
182
182
183 def __getitem__(self, key):
183 def __getitem__(self, key):
184 if key not in DIGESTS:
184 if key not in DIGESTS:
185 raise Abort(_('unknown digest type: %s') % k)
185 raise Abort(_('unknown digest type: %s') % k)
186 return self._hashes[key].hexdigest()
186 return self._hashes[key].hexdigest()
187
187
188 def __iter__(self):
188 def __iter__(self):
189 return iter(self._hashes)
189 return iter(self._hashes)
190
190
191 @staticmethod
191 @staticmethod
192 def preferred(supported):
192 def preferred(supported):
193 """returns the strongest digest type in both supported and DIGESTS."""
193 """returns the strongest digest type in both supported and DIGESTS."""
194
194
195 for k in DIGESTS_BY_STRENGTH:
195 for k in DIGESTS_BY_STRENGTH:
196 if k in supported:
196 if k in supported:
197 return k
197 return k
198 return None
198 return None
199
199
200 class digestchecker(object):
200 class digestchecker(object):
201 """file handle wrapper that additionally checks content against a given
201 """file handle wrapper that additionally checks content against a given
202 size and digests.
202 size and digests.
203
203
204 d = digestchecker(fh, size, {'md5': '...'})
204 d = digestchecker(fh, size, {'md5': '...'})
205
205
206 When multiple digests are given, all of them are validated.
206 When multiple digests are given, all of them are validated.
207 """
207 """
208
208
209 def __init__(self, fh, size, digests):
209 def __init__(self, fh, size, digests):
210 self._fh = fh
210 self._fh = fh
211 self._size = size
211 self._size = size
212 self._got = 0
212 self._got = 0
213 self._digests = dict(digests)
213 self._digests = dict(digests)
214 self._digester = digester(self._digests.keys())
214 self._digester = digester(self._digests.keys())
215
215
216 def read(self, length=-1):
216 def read(self, length=-1):
217 content = self._fh.read(length)
217 content = self._fh.read(length)
218 self._digester.update(content)
218 self._digester.update(content)
219 self._got += len(content)
219 self._got += len(content)
220 return content
220 return content
221
221
222 def validate(self):
222 def validate(self):
223 if self._size != self._got:
223 if self._size != self._got:
224 raise Abort(_('size mismatch: expected %d, got %d') %
224 raise Abort(_('size mismatch: expected %d, got %d') %
225 (self._size, self._got))
225 (self._size, self._got))
226 for k, v in self._digests.items():
226 for k, v in self._digests.items():
227 if v != self._digester[k]:
227 if v != self._digester[k]:
228 # i18n: first parameter is a digest name
228 # i18n: first parameter is a digest name
229 raise Abort(_('%s mismatch: expected %s, got %s') %
229 raise Abort(_('%s mismatch: expected %s, got %s') %
230 (k, v, self._digester[k]))
230 (k, v, self._digester[k]))
231
231
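A hedged usage sketch of digestchecker's contract: wrap a readable file object with the expected size and hex digests, drain it through read(), and validation only passes when both the byte count and every digest match. The standalone analogue below mirrors what the class does internally, using only hashlib and io:

    import hashlib
    import io

    payload = b'some bundle content'
    expected = {'sha1': hashlib.sha1(payload).hexdigest()}

    # what digestchecker does internally: hash while reading, then compare
    fh = io.BytesIO(payload)
    h, got = hashlib.sha1(), 0
    while True:
        chunk = fh.read(4096)
        if not chunk:
            break
        h.update(chunk)
        got += len(chunk)
    assert got == len(payload)                # the size check
    assert h.hexdigest() == expected['sha1']  # the digest check
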
 try:
     buffer = buffer
 except NameError:
-    if sys.version_info[0] < 3:
+    if not pycompat.ispy3:
         def buffer(sliceable, offset=0):
             return sliceable[offset:]
     else:
         def buffer(sliceable, offset=0):
             return memoryview(sliceable)[offset:]

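The Python 3 branch of the buffer shim above returns an offset memoryview rather than a sliced copy, so it stays zero-copy like the old builtin. A hedged standalone check of that behavior:

    data = bytearray(b'0123456789')
    view = memoryview(data)[4:]      # an offset view, not a copy
    assert view.tobytes() == b'456789'
    data[4] = ord(b'X')              # a write to the source shows through
    assert view.tobytes() == b'X56789'
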
242 closefds = os.name == 'posix'
242 closefds = os.name == 'posix'
243
243
244 _chunksize = 4096
244 _chunksize = 4096
245
245
246 class bufferedinputpipe(object):
246 class bufferedinputpipe(object):
247 """a manually buffered input pipe
247 """a manually buffered input pipe
248
248
249 Python will not let us use buffered IO and lazy reading with 'polling' at
249 Python will not let us use buffered IO and lazy reading with 'polling' at
250 the same time. We cannot probe the buffer state, and select will not
250 the same time. We cannot probe the buffer state, and select will not
251 report data as ready to read if it is already buffered.
251 report data as ready to read if it is already buffered.
252
252
253 This class lets us work around that by implementing its own buffering
253 This class lets us work around that by implementing its own buffering
254 (allowing efficient readline) while offering a way to know if the buffer
254 (allowing efficient readline) while offering a way to know if the buffer
255 is empty from the outside (so the buffer can cooperate with polling).
255 is empty from the outside (so the buffer can cooperate with polling).
256
256
257 This class lives in the 'util' module because it makes use of the 'os'
257 This class lives in the 'util' module because it makes use of the 'os'
258 module from the python stdlib.
258 module from the python stdlib.
259 """
259 """
260
260
261 def __init__(self, input):
261 def __init__(self, input):
262 self._input = input
262 self._input = input
263 self._buffer = []
263 self._buffer = []
264 self._eof = False
264 self._eof = False
265 self._lenbuf = 0
265 self._lenbuf = 0
266
266
267 @property
267 @property
268 def hasbuffer(self):
268 def hasbuffer(self):
269 """True is any data is currently buffered
269 """True is any data is currently buffered
270
270
271 This is used externally as a pre-step for polling IO. If there is
271 This is used externally as a pre-step for polling IO. If there is
272 already buffered data then no polling should be set up."""
272 already buffered data then no polling should be set up."""
273 return bool(self._buffer)
273 return bool(self._buffer)
274
274
275 @property
275 @property
276 def closed(self):
276 def closed(self):
277 return self._input.closed
277 return self._input.closed
278
278
279 def fileno(self):
279 def fileno(self):
280 return self._input.fileno()
280 return self._input.fileno()
281
281
282 def close(self):
282 def close(self):
283 return self._input.close()
283 return self._input.close()
284
284
285 def read(self, size):
285 def read(self, size):
286 while (not self._eof) and (self._lenbuf < size):
286 while (not self._eof) and (self._lenbuf < size):
287 self._fillbuffer()
287 self._fillbuffer()
288 return self._frombuffer(size)
288 return self._frombuffer(size)
289
289
290 def readline(self, *args, **kwargs):
290 def readline(self, *args, **kwargs):
291 if 1 < len(self._buffer):
291 if 1 < len(self._buffer):
292 # this should not happen because both read and readline end with a
292 # this should not happen because both read and readline end with a
293 # _frombuffer call that collapses it.
293 # _frombuffer call that collapses it.
294 self._buffer = [''.join(self._buffer)]
294 self._buffer = [''.join(self._buffer)]
295 self._lenbuf = len(self._buffer[0])
295 self._lenbuf = len(self._buffer[0])
296 lfi = -1
296 lfi = -1
297 if self._buffer:
297 if self._buffer:
298 lfi = self._buffer[-1].find('\n')
298 lfi = self._buffer[-1].find('\n')
299 while (not self._eof) and lfi < 0:
299 while (not self._eof) and lfi < 0:
300 self._fillbuffer()
300 self._fillbuffer()
301 if self._buffer:
301 if self._buffer:
302 lfi = self._buffer[-1].find('\n')
302 lfi = self._buffer[-1].find('\n')
303 size = lfi + 1
303 size = lfi + 1
304 if lfi < 0: # end of file
304 if lfi < 0: # end of file
305 size = self._lenbuf
305 size = self._lenbuf
306 elif 1 < len(self._buffer):
306 elif 1 < len(self._buffer):
307 # we need to take previous chunks into account
307 # we need to take previous chunks into account
308 size += self._lenbuf - len(self._buffer[-1])
308 size += self._lenbuf - len(self._buffer[-1])
309 return self._frombuffer(size)
309 return self._frombuffer(size)
310
310
311 def _frombuffer(self, size):
311 def _frombuffer(self, size):
312 """return at most 'size' data from the buffer
312 """return at most 'size' data from the buffer
313
313
314 The data are removed from the buffer."""
314 The data are removed from the buffer."""
315 if size == 0 or not self._buffer:
315 if size == 0 or not self._buffer:
316 return ''
316 return ''
317 buf = self._buffer[0]
317 buf = self._buffer[0]
318 if 1 < len(self._buffer):
318 if 1 < len(self._buffer):
319 buf = ''.join(self._buffer)
319 buf = ''.join(self._buffer)
320
320
321 data = buf[:size]
321 data = buf[:size]
322 buf = buf[len(data):]
322 buf = buf[len(data):]
323 if buf:
323 if buf:
324 self._buffer = [buf]
324 self._buffer = [buf]
325 self._lenbuf = len(buf)
325 self._lenbuf = len(buf)
326 else:
326 else:
327 self._buffer = []
327 self._buffer = []
328 self._lenbuf = 0
328 self._lenbuf = 0
329 return data
329 return data
330
330
331 def _fillbuffer(self):
331 def _fillbuffer(self):
332 """read data to the buffer"""
332 """read data to the buffer"""
333 data = os.read(self._input.fileno(), _chunksize)
333 data = os.read(self._input.fileno(), _chunksize)
334 if not data:
334 if not data:
335 self._eof = True
335 self._eof = True
336 else:
336 else:
337 self._lenbuf += len(data)
337 self._lenbuf += len(data)
338 self._buffer.append(data)
338 self._buffer.append(data)
339
339
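# Editor's aside -- a sketch of the polling pattern described above,
# assuming a POSIX pipe and a hypothetical process() consumer: consult
# hasbuffer before select(), since data already sitting in the manual
# buffer would never wake the selector.
def _demo_poll(rawpipe, process):
    import select
    pipe = bufferedinputpipe(rawpipe)
    while not pipe.closed:
        if not pipe.hasbuffer:
            ready = select.select([pipe], [], [], 1.0)[0]
            if not ready:
                break  # nothing arrived within the timeout
        line = pipe.readline()
        if not line:
            break  # EOF
        process(line)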
340 def popen2(cmd, env=None, newlines=False):
340 def popen2(cmd, env=None, newlines=False):
341 # Setting bufsize to -1 lets the system decide the buffer size.
341 # Setting bufsize to -1 lets the system decide the buffer size.
342 # The default for bufsize is 0, meaning unbuffered. This leads to
342 # The default for bufsize is 0, meaning unbuffered. This leads to
343 # poor performance on Mac OS X: http://bugs.python.org/issue4194
343 # poor performance on Mac OS X: http://bugs.python.org/issue4194
344 p = subprocess.Popen(cmd, shell=True, bufsize=-1,
344 p = subprocess.Popen(cmd, shell=True, bufsize=-1,
345 close_fds=closefds,
345 close_fds=closefds,
346 stdin=subprocess.PIPE, stdout=subprocess.PIPE,
346 stdin=subprocess.PIPE, stdout=subprocess.PIPE,
347 universal_newlines=newlines,
347 universal_newlines=newlines,
348 env=env)
348 env=env)
349 return p.stdin, p.stdout
349 return p.stdin, p.stdout
350
350
351 def popen3(cmd, env=None, newlines=False):
351 def popen3(cmd, env=None, newlines=False):
352 stdin, stdout, stderr, p = popen4(cmd, env, newlines)
352 stdin, stdout, stderr, p = popen4(cmd, env, newlines)
353 return stdin, stdout, stderr
353 return stdin, stdout, stderr
354
354
355 def popen4(cmd, env=None, newlines=False, bufsize=-1):
355 def popen4(cmd, env=None, newlines=False, bufsize=-1):
356 p = subprocess.Popen(cmd, shell=True, bufsize=bufsize,
356 p = subprocess.Popen(cmd, shell=True, bufsize=bufsize,
357 close_fds=closefds,
357 close_fds=closefds,
358 stdin=subprocess.PIPE, stdout=subprocess.PIPE,
358 stdin=subprocess.PIPE, stdout=subprocess.PIPE,
359 stderr=subprocess.PIPE,
359 stderr=subprocess.PIPE,
360 universal_newlines=newlines,
360 universal_newlines=newlines,
361 env=env)
361 env=env)
362 return p.stdin, p.stdout, p.stderr, p
362 return p.stdin, p.stdout, p.stderr, p
363
363
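# Editor's aside -- minimal usage sketch, POSIX `tr` assumed: popen4
# exposes all three streams plus the Popen object, so callers can
# collect the exit status themselves.
def _demo_popen4():
    stdin, stdout, stderr, p = popen4('tr a-z A-Z')
    stdin.write('hello\n')
    stdin.close()
    out = stdout.read()       # 'HELLO\n'
    err = stderr.read()       # ''
    return out, err, p.wait()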
364 def version():
364 def version():
365 """Return version information if available."""
365 """Return version information if available."""
366 try:
366 try:
367 from . import __version__
367 from . import __version__
368 return __version__.version
368 return __version__.version
369 except ImportError:
369 except ImportError:
370 return 'unknown'
370 return 'unknown'
371
371
372 def versiontuple(v=None, n=4):
372 def versiontuple(v=None, n=4):
373 """Parses a Mercurial version string into an N-tuple.
373 """Parses a Mercurial version string into an N-tuple.
374
374
375 The version string to be parsed is specified with the ``v`` argument.
375 The version string to be parsed is specified with the ``v`` argument.
376 If it isn't defined, the current Mercurial version string will be parsed.
376 If it isn't defined, the current Mercurial version string will be parsed.
377
377
378 ``n`` can be 2, 3, or 4. Here is how some version strings map to
378 ``n`` can be 2, 3, or 4. Here is how some version strings map to
379 returned values:
379 returned values:
380
380
381 >>> v = '3.6.1+190-df9b73d2d444'
381 >>> v = '3.6.1+190-df9b73d2d444'
382 >>> versiontuple(v, 2)
382 >>> versiontuple(v, 2)
383 (3, 6)
383 (3, 6)
384 >>> versiontuple(v, 3)
384 >>> versiontuple(v, 3)
385 (3, 6, 1)
385 (3, 6, 1)
386 >>> versiontuple(v, 4)
386 >>> versiontuple(v, 4)
387 (3, 6, 1, '190-df9b73d2d444')
387 (3, 6, 1, '190-df9b73d2d444')
388
388
389 >>> versiontuple('3.6.1+190-df9b73d2d444+20151118')
389 >>> versiontuple('3.6.1+190-df9b73d2d444+20151118')
390 (3, 6, 1, '190-df9b73d2d444+20151118')
390 (3, 6, 1, '190-df9b73d2d444+20151118')
391
391
392 >>> v = '3.6'
392 >>> v = '3.6'
393 >>> versiontuple(v, 2)
393 >>> versiontuple(v, 2)
394 (3, 6)
394 (3, 6)
395 >>> versiontuple(v, 3)
395 >>> versiontuple(v, 3)
396 (3, 6, None)
396 (3, 6, None)
397 >>> versiontuple(v, 4)
397 >>> versiontuple(v, 4)
398 (3, 6, None, None)
398 (3, 6, None, None)
399
399
400 >>> v = '3.9-rc'
400 >>> v = '3.9-rc'
401 >>> versiontuple(v, 2)
401 >>> versiontuple(v, 2)
402 (3, 9)
402 (3, 9)
403 >>> versiontuple(v, 3)
403 >>> versiontuple(v, 3)
404 (3, 9, None)
404 (3, 9, None)
405 >>> versiontuple(v, 4)
405 >>> versiontuple(v, 4)
406 (3, 9, None, 'rc')
406 (3, 9, None, 'rc')
407
407
408 >>> v = '3.9-rc+2-02a8fea4289b'
408 >>> v = '3.9-rc+2-02a8fea4289b'
409 >>> versiontuple(v, 2)
409 >>> versiontuple(v, 2)
410 (3, 9)
410 (3, 9)
411 >>> versiontuple(v, 3)
411 >>> versiontuple(v, 3)
412 (3, 9, None)
412 (3, 9, None)
413 >>> versiontuple(v, 4)
413 >>> versiontuple(v, 4)
414 (3, 9, None, 'rc+2-02a8fea4289b')
414 (3, 9, None, 'rc+2-02a8fea4289b')
415 """
415 """
416 if not v:
416 if not v:
417 v = version()
417 v = version()
418 parts = remod.split(r'[\+-]', v, 1)
418 parts = remod.split(r'[\+-]', v, 1)
419 if len(parts) == 1:
419 if len(parts) == 1:
420 vparts, extra = parts[0], None
420 vparts, extra = parts[0], None
421 else:
421 else:
422 vparts, extra = parts
422 vparts, extra = parts
423
423
424 vints = []
424 vints = []
425 for i in vparts.split('.'):
425 for i in vparts.split('.'):
426 try:
426 try:
427 vints.append(int(i))
427 vints.append(int(i))
428 except ValueError:
428 except ValueError:
429 break
429 break
430 # (3, 6) -> (3, 6, None)
430 # (3, 6) -> (3, 6, None)
431 while len(vints) < 3:
431 while len(vints) < 3:
432 vints.append(None)
432 vints.append(None)
433
433
434 if n == 2:
434 if n == 2:
435 return (vints[0], vints[1])
435 return (vints[0], vints[1])
436 if n == 3:
436 if n == 3:
437 return (vints[0], vints[1], vints[2])
437 return (vints[0], vints[1], vints[2])
438 if n == 4:
438 if n == 4:
439 return (vints[0], vints[1], vints[2], extra)
439 return (vints[0], vints[1], vints[2], extra)
440
440
441 # used by parsedate
441 # used by parsedate
442 defaultdateformats = (
442 defaultdateformats = (
443 '%Y-%m-%dT%H:%M:%S', # the 'real' ISO8601
443 '%Y-%m-%dT%H:%M:%S', # the 'real' ISO8601
444 '%Y-%m-%dT%H:%M', # without seconds
444 '%Y-%m-%dT%H:%M', # without seconds
445 '%Y-%m-%dT%H%M%S', # another awful but legal variant without :
445 '%Y-%m-%dT%H%M%S', # another awful but legal variant without :
446 '%Y-%m-%dT%H%M', # without seconds
446 '%Y-%m-%dT%H%M', # without seconds
447 '%Y-%m-%d %H:%M:%S', # our common legal variant
447 '%Y-%m-%d %H:%M:%S', # our common legal variant
448 '%Y-%m-%d %H:%M', # without seconds
448 '%Y-%m-%d %H:%M', # without seconds
449 '%Y-%m-%d %H%M%S', # without :
449 '%Y-%m-%d %H%M%S', # without :
450 '%Y-%m-%d %H%M', # without seconds
450 '%Y-%m-%d %H%M', # without seconds
451 '%Y-%m-%d %I:%M:%S%p',
451 '%Y-%m-%d %I:%M:%S%p',
452 '%Y-%m-%d %H:%M',
452 '%Y-%m-%d %H:%M',
453 '%Y-%m-%d %I:%M%p',
453 '%Y-%m-%d %I:%M%p',
454 '%Y-%m-%d',
454 '%Y-%m-%d',
455 '%m-%d',
455 '%m-%d',
456 '%m/%d',
456 '%m/%d',
457 '%m/%d/%y',
457 '%m/%d/%y',
458 '%m/%d/%Y',
458 '%m/%d/%Y',
459 '%a %b %d %H:%M:%S %Y',
459 '%a %b %d %H:%M:%S %Y',
460 '%a %b %d %I:%M:%S%p %Y',
460 '%a %b %d %I:%M:%S%p %Y',
461 '%a, %d %b %Y %H:%M:%S', # GNU coreutils "/bin/date --rfc-2822"
461 '%a, %d %b %Y %H:%M:%S', # GNU coreutils "/bin/date --rfc-2822"
462 '%b %d %H:%M:%S %Y',
462 '%b %d %H:%M:%S %Y',
463 '%b %d %I:%M:%S%p %Y',
463 '%b %d %I:%M:%S%p %Y',
464 '%b %d %H:%M:%S',
464 '%b %d %H:%M:%S',
465 '%b %d %I:%M:%S%p',
465 '%b %d %I:%M:%S%p',
466 '%b %d %H:%M',
466 '%b %d %H:%M',
467 '%b %d %I:%M%p',
467 '%b %d %I:%M%p',
468 '%b %d %Y',
468 '%b %d %Y',
469 '%b %d',
469 '%b %d',
470 '%H:%M:%S',
470 '%H:%M:%S',
471 '%I:%M:%S%p',
471 '%I:%M:%S%p',
472 '%H:%M',
472 '%H:%M',
473 '%I:%M%p',
473 '%I:%M%p',
474 )
474 )
475
475
476 extendeddateformats = defaultdateformats + (
476 extendeddateformats = defaultdateformats + (
477 "%Y",
477 "%Y",
478 "%Y-%m",
478 "%Y-%m",
479 "%b",
479 "%b",
480 "%b %Y",
480 "%b %Y",
481 )
481 )
482
482
483 def cachefunc(func):
483 def cachefunc(func):
484 '''cache the result of function calls'''
484 '''cache the result of function calls'''
485 # XXX doesn't handle keywords args
485 # XXX doesn't handle keywords args
486 if func.__code__.co_argcount == 0:
486 if func.__code__.co_argcount == 0:
487 cache = []
487 cache = []
488 def f():
488 def f():
489 if len(cache) == 0:
489 if len(cache) == 0:
490 cache.append(func())
490 cache.append(func())
491 return cache[0]
491 return cache[0]
492 return f
492 return f
493 cache = {}
493 cache = {}
494 if func.__code__.co_argcount == 1:
494 if func.__code__.co_argcount == 1:
495 # we gain a small amount of time because
495 # we gain a small amount of time because
496 # we don't need to pack/unpack the list
496 # we don't need to pack/unpack the list
497 def f(arg):
497 def f(arg):
498 if arg not in cache:
498 if arg not in cache:
499 cache[arg] = func(arg)
499 cache[arg] = func(arg)
500 return cache[arg]
500 return cache[arg]
501 else:
501 else:
502 def f(*args):
502 def f(*args):
503 if args not in cache:
503 if args not in cache:
504 cache[args] = func(*args)
504 cache[args] = func(*args)
505 return cache[args]
505 return cache[args]
506
506
507 return f
507 return f
508
508
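# Editor's aside -- usage sketch: cachefunc memoizes by positional
# arguments only (keyword arguments are unsupported, per the XXX above).
def _demo_cachefunc():
    calls = []
    @cachefunc
    def square(x):
        calls.append(x)
        return x * x
    square(3)
    square(3)                 # second call is served from the cache
    assert calls == [3]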
509 class sortdict(dict):
509 class sortdict(dict):
510 '''a simple insertion-ordered dictionary'''
510 '''a simple insertion-ordered dictionary'''
511 def __init__(self, data=None):
511 def __init__(self, data=None):
512 self._list = []
512 self._list = []
513 if data:
513 if data:
514 self.update(data)
514 self.update(data)
515 def copy(self):
515 def copy(self):
516 return sortdict(self)
516 return sortdict(self)
517 def __setitem__(self, key, val):
517 def __setitem__(self, key, val):
518 if key in self:
518 if key in self:
519 self._list.remove(key)
519 self._list.remove(key)
520 self._list.append(key)
520 self._list.append(key)
521 dict.__setitem__(self, key, val)
521 dict.__setitem__(self, key, val)
522 def __iter__(self):
522 def __iter__(self):
523 return self._list.__iter__()
523 return self._list.__iter__()
524 def update(self, src):
524 def update(self, src):
525 if isinstance(src, dict):
525 if isinstance(src, dict):
526 src = src.iteritems()
526 src = src.iteritems()
527 for k, v in src:
527 for k, v in src:
528 self[k] = v
528 self[k] = v
529 def clear(self):
529 def clear(self):
530 dict.clear(self)
530 dict.clear(self)
531 self._list = []
531 self._list = []
532 def items(self):
532 def items(self):
533 return [(k, self[k]) for k in self._list]
533 return [(k, self[k]) for k in self._list]
534 def __delitem__(self, key):
534 def __delitem__(self, key):
535 dict.__delitem__(self, key)
535 dict.__delitem__(self, key)
536 self._list.remove(key)
536 self._list.remove(key)
537 def pop(self, key, *args, **kwargs):
537 def pop(self, key, *args, **kwargs):
538 try:
538 try:
539 self._list.remove(key)
539 self._list.remove(key)
540 except ValueError:
540 except ValueError:
541 pass
541 pass
542 return dict.pop(self, key, *args, **kwargs)
542 return dict.pop(self, key, *args, **kwargs)
543 def keys(self):
543 def keys(self):
544 return self._list
544 return self._list
545 def iterkeys(self):
545 def iterkeys(self):
546 return self._list.__iter__()
546 return self._list.__iter__()
547 def iteritems(self):
547 def iteritems(self):
548 for k in self._list:
548 for k in self._list:
549 yield k, self[k]
549 yield k, self[k]
550 def insert(self, index, key, val):
550 def insert(self, index, key, val):
551 self._list.insert(index, key)
551 self._list.insert(index, key)
552 dict.__setitem__(self, key, val)
552 dict.__setitem__(self, key, val)
553 def __repr__(self):
553 def __repr__(self):
554 if not self:
554 if not self:
555 return '%s()' % self.__class__.__name__
555 return '%s()' % self.__class__.__name__
556 return '%s(%r)' % (self.__class__.__name__, self.items())
556 return '%s(%r)' % (self.__class__.__name__, self.items())
557
557
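# Editor's aside -- behavior sketch: sortdict keeps keys in insertion
# order, and re-assigning an existing key moves it to the end.
def _demo_sortdict():
    d = sortdict()
    d['b'] = 1
    d['a'] = 2
    d['b'] = 3                          # 'b' moves to the end
    assert d.keys() == ['a', 'b']
    assert d.items() == [('a', 2), ('b', 3)]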
558 class _lrucachenode(object):
558 class _lrucachenode(object):
559 """A node in a doubly linked list.
559 """A node in a doubly linked list.
560
560
561 Holds a reference to nodes on either side as well as a key-value
561 Holds a reference to nodes on either side as well as a key-value
562 pair for the dictionary entry.
562 pair for the dictionary entry.
563 """
563 """
564 __slots__ = ('next', 'prev', 'key', 'value')
564 __slots__ = ('next', 'prev', 'key', 'value')
565
565
566 def __init__(self):
566 def __init__(self):
567 self.next = None
567 self.next = None
568 self.prev = None
568 self.prev = None
569
569
570 self.key = _notset
570 self.key = _notset
571 self.value = None
571 self.value = None
572
572
573 def markempty(self):
573 def markempty(self):
574 """Mark the node as emptied."""
574 """Mark the node as emptied."""
575 self.key = _notset
575 self.key = _notset
576
576
577 class lrucachedict(object):
577 class lrucachedict(object):
578 """Dict that caches most recent accesses and sets.
578 """Dict that caches most recent accesses and sets.
579
579
580 The dict consists of an actual backing dict - indexed by original
580 The dict consists of an actual backing dict - indexed by original
581 key - and a doubly linked circular list defining the order of entries in
581 key - and a doubly linked circular list defining the order of entries in
582 the cache.
582 the cache.
583
583
584 The head node is the newest entry in the cache. If the cache is full,
584 The head node is the newest entry in the cache. If the cache is full,
585 we recycle head.prev and make it the new head. Cache accesses result in
585 we recycle head.prev and make it the new head. Cache accesses result in
586 the node being moved to before the existing head and being marked as the
586 the node being moved to before the existing head and being marked as the
587 new head node.
587 new head node.
588 """
588 """
589 def __init__(self, max):
589 def __init__(self, max):
590 self._cache = {}
590 self._cache = {}
591
591
592 self._head = head = _lrucachenode()
592 self._head = head = _lrucachenode()
593 head.prev = head
593 head.prev = head
594 head.next = head
594 head.next = head
595 self._size = 1
595 self._size = 1
596 self._capacity = max
596 self._capacity = max
597
597
598 def __len__(self):
598 def __len__(self):
599 return len(self._cache)
599 return len(self._cache)
600
600
601 def __contains__(self, k):
601 def __contains__(self, k):
602 return k in self._cache
602 return k in self._cache
603
603
604 def __iter__(self):
604 def __iter__(self):
605 # We don't have to iterate in cache order, but why not.
605 # We don't have to iterate in cache order, but why not.
606 n = self._head
606 n = self._head
607 for i in range(len(self._cache)):
607 for i in range(len(self._cache)):
608 yield n.key
608 yield n.key
609 n = n.next
609 n = n.next
610
610
611 def __getitem__(self, k):
611 def __getitem__(self, k):
612 node = self._cache[k]
612 node = self._cache[k]
613 self._movetohead(node)
613 self._movetohead(node)
614 return node.value
614 return node.value
615
615
616 def __setitem__(self, k, v):
616 def __setitem__(self, k, v):
617 node = self._cache.get(k)
617 node = self._cache.get(k)
618 # Replace existing value and mark as newest.
618 # Replace existing value and mark as newest.
619 if node is not None:
619 if node is not None:
620 node.value = v
620 node.value = v
621 self._movetohead(node)
621 self._movetohead(node)
622 return
622 return
623
623
624 if self._size < self._capacity:
624 if self._size < self._capacity:
625 node = self._addcapacity()
625 node = self._addcapacity()
626 else:
626 else:
627 # Grab the last/oldest item.
627 # Grab the last/oldest item.
628 node = self._head.prev
628 node = self._head.prev
629
629
630 # At capacity. Kill the old entry.
630 # At capacity. Kill the old entry.
631 if node.key is not _notset:
631 if node.key is not _notset:
632 del self._cache[node.key]
632 del self._cache[node.key]
633
633
634 node.key = k
634 node.key = k
635 node.value = v
635 node.value = v
636 self._cache[k] = node
636 self._cache[k] = node
637 # And mark it as newest entry. No need to adjust order since it
637 # And mark it as newest entry. No need to adjust order since it
638 # is already self._head.prev.
638 # is already self._head.prev.
639 self._head = node
639 self._head = node
640
640
641 def __delitem__(self, k):
641 def __delitem__(self, k):
642 node = self._cache.pop(k)
642 node = self._cache.pop(k)
643 node.markempty()
643 node.markempty()
644
644
645 # Temporarily mark as newest item before re-adjusting head to make
645 # Temporarily mark as newest item before re-adjusting head to make
646 # this node the oldest item.
646 # this node the oldest item.
647 self._movetohead(node)
647 self._movetohead(node)
648 self._head = node.next
648 self._head = node.next
649
649
650 # Additional dict methods.
650 # Additional dict methods.
651
651
652 def get(self, k, default=None):
652 def get(self, k, default=None):
653 try:
653 try:
654 return self._cache[k].value
654 return self._cache[k].value
655 except KeyError:
655 except KeyError:
656 return default
656 return default
657
657
658 def clear(self):
658 def clear(self):
659 n = self._head
659 n = self._head
660 while n.key is not _notset:
660 while n.key is not _notset:
661 n.markempty()
661 n.markempty()
662 n = n.next
662 n = n.next
663
663
664 self._cache.clear()
664 self._cache.clear()
665
665
666 def copy(self):
666 def copy(self):
667 result = lrucachedict(self._capacity)
667 result = lrucachedict(self._capacity)
668 n = self._head.prev
668 n = self._head.prev
669 # Iterate in oldest-to-newest order, so the copy has the right ordering
669 # Iterate in oldest-to-newest order, so the copy has the right ordering
670 for i in range(len(self._cache)):
670 for i in range(len(self._cache)):
671 result[n.key] = n.value
671 result[n.key] = n.value
672 n = n.prev
672 n = n.prev
673 return result
673 return result
674
674
675 def _movetohead(self, node):
675 def _movetohead(self, node):
676 """Mark a node as the newest, making it the new head.
676 """Mark a node as the newest, making it the new head.
677
677
678 When a node is accessed, it becomes the freshest entry in the LRU
678 When a node is accessed, it becomes the freshest entry in the LRU
679 list, which is denoted by self._head.
679 list, which is denoted by self._head.
680
680
681 Visually, let's make ``N`` the new head node (* denotes head):
681 Visually, let's make ``N`` the new head node (* denotes head):
682
682
683 previous/oldest <-> head <-> next/next newest
683 previous/oldest <-> head <-> next/next newest
684
684
685 ----<->--- A* ---<->-----
685 ----<->--- A* ---<->-----
686 | |
686 | |
687 E <-> D <-> N <-> C <-> B
687 E <-> D <-> N <-> C <-> B
688
688
689 To:
689 To:
690
690
691 ----<->--- N* ---<->-----
691 ----<->--- N* ---<->-----
692 | |
692 | |
693 E <-> D <-> C <-> B <-> A
693 E <-> D <-> C <-> B <-> A
694
694
695 This requires the following moves:
695 This requires the following moves:
696
696
697 C.next = D (node.prev.next = node.next)
697 C.next = D (node.prev.next = node.next)
698 D.prev = C (node.next.prev = node.prev)
698 D.prev = C (node.next.prev = node.prev)
699 E.next = N (head.prev.next = node)
699 E.next = N (head.prev.next = node)
700 N.prev = E (node.prev = head.prev)
700 N.prev = E (node.prev = head.prev)
701 N.next = A (node.next = head)
701 N.next = A (node.next = head)
702 A.prev = N (head.prev = node)
702 A.prev = N (head.prev = node)
703 """
703 """
704 head = self._head
704 head = self._head
705 # C.next = D
705 # C.next = D
706 node.prev.next = node.next
706 node.prev.next = node.next
707 # D.prev = C
707 # D.prev = C
708 node.next.prev = node.prev
708 node.next.prev = node.prev
709 # N.prev = E
709 # N.prev = E
710 node.prev = head.prev
710 node.prev = head.prev
711 # N.next = A
711 # N.next = A
712 # It is tempting to do just "head" here, however if node is
712 # It is tempting to do just "head" here, however if node is
713 # adjacent to head, this will do bad things.
713 # adjacent to head, this will do bad things.
714 node.next = head.prev.next
714 node.next = head.prev.next
715 # A.prev = N
715 # A.prev = N
716 node.next.prev = node
716 node.next.prev = node
717 # E.next = N
717 # E.next = N
718 node.prev.next = node
718 node.prev.next = node
719
719
720 self._head = node
720 self._head = node
721
721
722 def _addcapacity(self):
722 def _addcapacity(self):
723 """Add a node to the circular linked list.
723 """Add a node to the circular linked list.
724
724
725 The new node is inserted before the head node.
725 The new node is inserted before the head node.
726 """
726 """
727 head = self._head
727 head = self._head
728 node = _lrucachenode()
728 node = _lrucachenode()
729 head.prev.next = node
729 head.prev.next = node
730 node.prev = head.prev
730 node.prev = head.prev
731 node.next = head
731 node.next = head
732 head.prev = node
732 head.prev = node
733 self._size += 1
733 self._size += 1
734 return node
734 return node
735
735
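# Editor's aside -- eviction sketch: with capacity 2, touching 'a'
# makes 'b' the oldest entry, so inserting 'c' recycles b's node.
def _demo_lrucachedict():
    c = lrucachedict(2)
    c['a'] = 1
    c['b'] = 2
    c['a']                    # freshen 'a'
    c['c'] = 3                # evicts 'b'
    assert 'b' not in c
    assert c.get('a') == 1 and c.get('c') == 3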
736 def lrucachefunc(func):
736 def lrucachefunc(func):
737 '''cache most recent results of function calls'''
737 '''cache most recent results of function calls'''
738 cache = {}
738 cache = {}
739 order = collections.deque()
739 order = collections.deque()
740 if func.__code__.co_argcount == 1:
740 if func.__code__.co_argcount == 1:
741 def f(arg):
741 def f(arg):
742 if arg not in cache:
742 if arg not in cache:
743 if len(cache) > 20:
743 if len(cache) > 20:
744 del cache[order.popleft()]
744 del cache[order.popleft()]
745 cache[arg] = func(arg)
745 cache[arg] = func(arg)
746 else:
746 else:
747 order.remove(arg)
747 order.remove(arg)
748 order.append(arg)
748 order.append(arg)
749 return cache[arg]
749 return cache[arg]
750 else:
750 else:
751 def f(*args):
751 def f(*args):
752 if args not in cache:
752 if args not in cache:
753 if len(cache) > 20:
753 if len(cache) > 20:
754 del cache[order.popleft()]
754 del cache[order.popleft()]
755 cache[args] = func(*args)
755 cache[args] = func(*args)
756 else:
756 else:
757 order.remove(args)
757 order.remove(args)
758 order.append(args)
758 order.append(args)
759 return cache[args]
759 return cache[args]
760
760
761 return f
761 return f
762
762
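# Editor's aside -- like cachefunc, but a bounded cache: once roughly
# twenty results accumulate, the least recently used one is discarded.
def _demo_lrucachefunc():
    @lrucachefunc
    def double(x):
        return 2 * x
    for i in range(30):
        double(i)             # early results get evicted along the way
    return double(29)         # still cached; double(0) would recompute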
763 class propertycache(object):
763 class propertycache(object):
764 def __init__(self, func):
764 def __init__(self, func):
765 self.func = func
765 self.func = func
766 self.name = func.__name__
766 self.name = func.__name__
767 def __get__(self, obj, type=None):
767 def __get__(self, obj, type=None):
768 result = self.func(obj)
768 result = self.func(obj)
769 self.cachevalue(obj, result)
769 self.cachevalue(obj, result)
770 return result
770 return result
771
771
772 def cachevalue(self, obj, value):
772 def cachevalue(self, obj, value):
773 # __dict__ assignment required to bypass __setattr__ (eg: repoview)
773 # __dict__ assignment required to bypass __setattr__ (eg: repoview)
774 obj.__dict__[self.name] = value
774 obj.__dict__[self.name] = value
775
775
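# Editor's aside -- propertycache is a non-data descriptor: the first
# access computes the value and stores it in the instance __dict__, so
# later accesses bypass the descriptor entirely. _demoowner is ours.
class _demoowner(object):
    @propertycache
    def answer(self):
        return 6 * 7          # computed once per instance

# _demoowner().answer -> 42, thereafter a plain instance attribute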
776 def pipefilter(s, cmd):
776 def pipefilter(s, cmd):
777 '''filter string S through command CMD, returning its output'''
777 '''filter string S through command CMD, returning its output'''
778 p = subprocess.Popen(cmd, shell=True, close_fds=closefds,
778 p = subprocess.Popen(cmd, shell=True, close_fds=closefds,
779 stdin=subprocess.PIPE, stdout=subprocess.PIPE)
779 stdin=subprocess.PIPE, stdout=subprocess.PIPE)
780 pout, perr = p.communicate(s)
780 pout, perr = p.communicate(s)
781 return pout
781 return pout
782
782
783 def tempfilter(s, cmd):
783 def tempfilter(s, cmd):
784 '''filter string S through a pair of temporary files with CMD.
784 '''filter string S through a pair of temporary files with CMD.
785 CMD is used as a template to create the real command to be run,
785 CMD is used as a template to create the real command to be run,
786 with the strings INFILE and OUTFILE replaced by the real names of
786 with the strings INFILE and OUTFILE replaced by the real names of
787 the temporary files generated.'''
787 the temporary files generated.'''
788 inname, outname = None, None
788 inname, outname = None, None
789 try:
789 try:
790 infd, inname = tempfile.mkstemp(prefix='hg-filter-in-')
790 infd, inname = tempfile.mkstemp(prefix='hg-filter-in-')
791 fp = os.fdopen(infd, 'wb')
791 fp = os.fdopen(infd, 'wb')
792 fp.write(s)
792 fp.write(s)
793 fp.close()
793 fp.close()
794 outfd, outname = tempfile.mkstemp(prefix='hg-filter-out-')
794 outfd, outname = tempfile.mkstemp(prefix='hg-filter-out-')
795 os.close(outfd)
795 os.close(outfd)
796 cmd = cmd.replace('INFILE', inname)
796 cmd = cmd.replace('INFILE', inname)
797 cmd = cmd.replace('OUTFILE', outname)
797 cmd = cmd.replace('OUTFILE', outname)
798 code = os.system(cmd)
798 code = os.system(cmd)
799 if sys.platform == 'OpenVMS' and code & 1:
799 if sys.platform == 'OpenVMS' and code & 1:
800 code = 0
800 code = 0
801 if code:
801 if code:
802 raise Abort(_("command '%s' failed: %s") %
802 raise Abort(_("command '%s' failed: %s") %
803 (cmd, explainexit(code)[0]))
803 (cmd, explainexit(code)[0]))
804 return readfile(outname)
804 return readfile(outname)
805 finally:
805 finally:
806 try:
806 try:
807 if inname:
807 if inname:
808 os.unlink(inname)
808 os.unlink(inname)
809 except OSError:
809 except OSError:
810 pass
810 pass
811 try:
811 try:
812 if outname:
812 if outname:
813 os.unlink(outname)
813 os.unlink(outname)
814 except OSError:
814 except OSError:
815 pass
815 pass
816
816
817 filtertable = {
817 filtertable = {
818 'tempfile:': tempfilter,
818 'tempfile:': tempfilter,
819 'pipe:': pipefilter,
819 'pipe:': pipefilter,
820 }
820 }
821
821
822 def filter(s, cmd):
822 def filter(s, cmd):
823 "filter a string through a command that transforms its input to its output"
823 "filter a string through a command that transforms its input to its output"
824 for name, fn in filtertable.iteritems():
824 for name, fn in filtertable.iteritems():
825 if cmd.startswith(name):
825 if cmd.startswith(name):
826 return fn(s, cmd[len(name):].lstrip())
826 return fn(s, cmd[len(name):].lstrip())
827 return pipefilter(s, cmd)
827 return pipefilter(s, cmd)
828
828
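# Editor's aside -- dispatch sketch, POSIX `tr` assumed: a 'tempfile:'
# or 'pipe:' prefix picks the strategy explicitly; anything else falls
# through to pipefilter.
def _demo_filter():
    upper = filter('hello\n', 'pipe: tr a-z A-Z')   # -> 'HELLO\n'
    same = filter('hello\n', 'tr a-z A-Z')          # same, implicitly
    return upper, same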
829 def binary(s):
829 def binary(s):
830 """return true if a string is binary data"""
830 """return true if a string is binary data"""
831 return bool(s and '\0' in s)
831 return bool(s and '\0' in s)
832
832
833 def increasingchunks(source, min=1024, max=65536):
833 def increasingchunks(source, min=1024, max=65536):
834 '''return no less than min bytes per chunk while data remains,
834 '''return no less than min bytes per chunk while data remains,
835 doubling min after each chunk until it reaches max'''
835 doubling min after each chunk until it reaches max'''
836 def log2(x):
836 def log2(x):
837 if not x:
837 if not x:
838 return 0
838 return 0
839 i = 0
839 i = 0
840 while x:
840 while x:
841 x >>= 1
841 x >>= 1
842 i += 1
842 i += 1
843 return i - 1
843 return i - 1
844
844
845 buf = []
845 buf = []
846 blen = 0
846 blen = 0
847 for chunk in source:
847 for chunk in source:
848 buf.append(chunk)
848 buf.append(chunk)
849 blen += len(chunk)
849 blen += len(chunk)
850 if blen >= min:
850 if blen >= min:
851 if min < max:
851 if min < max:
852 min = min << 1
852 min = min << 1
853 nmin = 1 << log2(blen)
853 nmin = 1 << log2(blen)
854 if nmin > min:
854 if nmin > min:
855 min = nmin
855 min = nmin
856 if min > max:
856 if min > max:
857 min = max
857 min = max
858 yield ''.join(buf)
858 yield ''.join(buf)
859 blen = 0
859 blen = 0
860 buf = []
860 buf = []
861 if buf:
861 if buf:
862 yield ''.join(buf)
862 yield ''.join(buf)
863
863
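# Editor's aside -- behavior sketch: many small chunks are coalesced
# into progressively larger ones, keeping per-chunk overhead low when
# streaming lots of data.
def _demo_increasingchunks():
    pieces = ['x' * 100] * 100                 # 10,000 bytes total
    sizes = [len(c) for c in increasingchunks(pieces)]
    # e.g. [1100, 2100, 4100, 2700]: every chunk but the last honors
    # the doubling minimum
    assert all(s >= 1024 for s in sizes[:-1])
    return sizes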
864 Abort = error.Abort
864 Abort = error.Abort
865
865
866 def always(fn):
866 def always(fn):
867 return True
867 return True
868
868
869 def never(fn):
869 def never(fn):
870 return False
870 return False
871
871
872 def nogc(func):
872 def nogc(func):
873 """disable garbage collector
873 """disable garbage collector
874
874
875 Python's garbage collector triggers a GC each time a certain number of
875 Python's garbage collector triggers a GC each time a certain number of
876 container objects (the number being defined by gc.get_threshold()) are
876 container objects (the number being defined by gc.get_threshold()) are
877 allocated even when marked not to be tracked by the collector. Tracking has
877 allocated even when marked not to be tracked by the collector. Tracking has
878 no effect on when GCs are triggered, only on what objects the GC looks
878 no effect on when GCs are triggered, only on what objects the GC looks
879 into. As a workaround, disable GC while building complex (huge)
879 into. As a workaround, disable GC while building complex (huge)
880 containers.
880 containers.
881
881
882 This garbage collector issue has been fixed in Python 2.7.
882 This garbage collector issue has been fixed in Python 2.7.
883 """
883 """
884 if sys.version_info >= (2, 7):
884 if sys.version_info >= (2, 7):
885 return func
885 return func
886 def wrapper(*args, **kwargs):
886 def wrapper(*args, **kwargs):
887 gcenabled = gc.isenabled()
887 gcenabled = gc.isenabled()
888 gc.disable()
888 gc.disable()
889 try:
889 try:
890 return func(*args, **kwargs)
890 return func(*args, **kwargs)
891 finally:
891 finally:
892 if gcenabled:
892 if gcenabled:
893 gc.enable()
893 gc.enable()
894 return wrapper
894 return wrapper
895
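# Editor's aside -- usage sketch: decorate container-heavy constructors
# so pre-2.7 interpreters do not pause in the collector while building.
@nogc
def _demo_buildindex(items):
    return dict(enumerate(items))   # GC stays disabled for the call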
895
896 def pathto(root, n1, n2):
896 def pathto(root, n1, n2):
897 '''return the relative path from one place to another.
897 '''return the relative path from one place to another.
898 root should use os.sep to separate directories
898 root should use os.sep to separate directories
899 n1 should use os.sep to separate directories
899 n1 should use os.sep to separate directories
900 n2 should use "/" to separate directories
900 n2 should use "/" to separate directories
901 returns an os.sep-separated path.
901 returns an os.sep-separated path.
902
902
903 If n1 is a relative path, it's assumed it's
903 If n1 is a relative path, it's assumed it's
904 relative to root.
904 relative to root.
905 n2 should always be relative to root.
905 n2 should always be relative to root.
906 '''
906 '''
907 if not n1:
907 if not n1:
908 return localpath(n2)
908 return localpath(n2)
909 if os.path.isabs(n1):
909 if os.path.isabs(n1):
910 if os.path.splitdrive(root)[0] != os.path.splitdrive(n1)[0]:
910 if os.path.splitdrive(root)[0] != os.path.splitdrive(n1)[0]:
911 return os.path.join(root, localpath(n2))
911 return os.path.join(root, localpath(n2))
912 n2 = '/'.join((pconvert(root), n2))
912 n2 = '/'.join((pconvert(root), n2))
913 a, b = splitpath(n1), n2.split('/')
913 a, b = splitpath(n1), n2.split('/')
914 a.reverse()
914 a.reverse()
915 b.reverse()
915 b.reverse()
916 while a and b and a[-1] == b[-1]:
916 while a and b and a[-1] == b[-1]:
917 a.pop()
917 a.pop()
918 b.pop()
918 b.pop()
919 b.reverse()
919 b.reverse()
920 return os.sep.join((['..'] * len(a)) + b) or '.'
920 return os.sep.join((['..'] * len(a)) + b) or '.'
921
921
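# Editor's aside -- worked example, assuming POSIX separators
# (os.sep == '/'): going from 'a/b' to 'a/c/d' under '/repo' means up
# one level, then down into c/d.
def _demo_pathto():
    assert pathto('/repo', 'a/b', 'a/c/d') == '../c/d'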
922 def mainfrozen():
922 def mainfrozen():
923 """return True if we are a frozen executable.
923 """return True if we are a frozen executable.
924
924
925 The code supports py2exe (most common, Windows only) and tools/freeze
925 The code supports py2exe (most common, Windows only) and tools/freeze
926 (portable, not much used).
926 (portable, not much used).
927 """
927 """
928 return (safehasattr(sys, "frozen") or # new py2exe
928 return (safehasattr(sys, "frozen") or # new py2exe
929 safehasattr(sys, "importers") or # old py2exe
929 safehasattr(sys, "importers") or # old py2exe
930 imp.is_frozen("__main__")) # tools/freeze
930 imp.is_frozen("__main__")) # tools/freeze
931
931
932 # the location of data files matching the source code
932 # the location of data files matching the source code
933 if mainfrozen() and getattr(sys, 'frozen', None) != 'macosx_app':
933 if mainfrozen() and getattr(sys, 'frozen', None) != 'macosx_app':
934 # executable version (py2exe) doesn't support __file__
934 # executable version (py2exe) doesn't support __file__
935 datapath = os.path.dirname(sys.executable)
935 datapath = os.path.dirname(sys.executable)
936 else:
936 else:
937 datapath = os.path.dirname(__file__)
937 datapath = os.path.dirname(__file__)
938
938
939 i18n.setdatapath(datapath)
939 i18n.setdatapath(datapath)
940
940
941 _hgexecutable = None
941 _hgexecutable = None
942
942
943 def hgexecutable():
943 def hgexecutable():
944 """return location of the 'hg' executable.
944 """return location of the 'hg' executable.
945
945
946 Defaults to $HG or 'hg' in the search path.
946 Defaults to $HG or 'hg' in the search path.
947 """
947 """
948 if _hgexecutable is None:
948 if _hgexecutable is None:
949 hg = os.environ.get('HG')
949 hg = os.environ.get('HG')
950 mainmod = sys.modules['__main__']
950 mainmod = sys.modules['__main__']
951 if hg:
951 if hg:
952 _sethgexecutable(hg)
952 _sethgexecutable(hg)
953 elif mainfrozen():
953 elif mainfrozen():
954 if getattr(sys, 'frozen', None) == 'macosx_app':
954 if getattr(sys, 'frozen', None) == 'macosx_app':
955 # Env variable set by py2app
955 # Env variable set by py2app
956 _sethgexecutable(os.environ['EXECUTABLEPATH'])
956 _sethgexecutable(os.environ['EXECUTABLEPATH'])
957 else:
957 else:
958 _sethgexecutable(sys.executable)
958 _sethgexecutable(sys.executable)
959 elif os.path.basename(getattr(mainmod, '__file__', '')) == 'hg':
959 elif os.path.basename(getattr(mainmod, '__file__', '')) == 'hg':
960 _sethgexecutable(mainmod.__file__)
960 _sethgexecutable(mainmod.__file__)
961 else:
961 else:
962 exe = findexe('hg') or os.path.basename(sys.argv[0])
962 exe = findexe('hg') or os.path.basename(sys.argv[0])
963 _sethgexecutable(exe)
963 _sethgexecutable(exe)
964 return _hgexecutable
964 return _hgexecutable
965
965
966 def _sethgexecutable(path):
966 def _sethgexecutable(path):
967 """set location of the 'hg' executable"""
967 """set location of the 'hg' executable"""
968 global _hgexecutable
968 global _hgexecutable
969 _hgexecutable = path
969 _hgexecutable = path
970
970
971 def _isstdout(f):
971 def _isstdout(f):
972 fileno = getattr(f, 'fileno', None)
972 fileno = getattr(f, 'fileno', None)
973 return fileno and fileno() == sys.__stdout__.fileno()
973 return fileno and fileno() == sys.__stdout__.fileno()
974
974
975 def system(cmd, environ=None, cwd=None, onerr=None, errprefix=None, out=None):
975 def system(cmd, environ=None, cwd=None, onerr=None, errprefix=None, out=None):
976 '''enhanced shell command execution.
976 '''enhanced shell command execution.
977 run with environment maybe modified, maybe in different dir.
977 run with environment maybe modified, maybe in different dir.
978
978
979 if command fails and onerr is None, return status, else raise onerr
979 if command fails and onerr is None, return status, else raise onerr
980 object as exception.
980 object as exception.
981
981
982 if out is specified, it is assumed to be a file-like object that has a
982 if out is specified, it is assumed to be a file-like object that has a
983 write() method. stdout and stderr will be redirected to out.'''
983 write() method. stdout and stderr will be redirected to out.'''
984 if environ is None:
984 if environ is None:
985 environ = {}
985 environ = {}
986 try:
986 try:
987 sys.stdout.flush()
987 sys.stdout.flush()
988 except Exception:
988 except Exception:
989 pass
989 pass
990 def py2shell(val):
990 def py2shell(val):
991 'convert python object into string that is useful to shell'
991 'convert python object into string that is useful to shell'
992 if val is None or val is False:
992 if val is None or val is False:
993 return '0'
993 return '0'
994 if val is True:
994 if val is True:
995 return '1'
995 return '1'
996 return str(val)
996 return str(val)
997 origcmd = cmd
997 origcmd = cmd
998 cmd = quotecommand(cmd)
998 cmd = quotecommand(cmd)
999 if sys.platform == 'plan9' and (sys.version_info[0] == 2
999 if sys.platform == 'plan9' and (sys.version_info[0] == 2
1000 and sys.version_info[1] < 7):
1000 and sys.version_info[1] < 7):
1001 # subprocess kludge to work around issues in half-baked Python
1001 # subprocess kludge to work around issues in half-baked Python
1002 # ports, notably bichued/python:
1002 # ports, notably bichued/python:
1003 if cwd is not None:
1003 if cwd is not None:
1004 os.chdir(cwd)
1004 os.chdir(cwd)
1005 rc = os.system(cmd)
1005 rc = os.system(cmd)
1006 else:
1006 else:
1007 env = dict(os.environ)
1007 env = dict(os.environ)
1008 env.update((k, py2shell(v)) for k, v in environ.iteritems())
1008 env.update((k, py2shell(v)) for k, v in environ.iteritems())
1009 env['HG'] = hgexecutable()
1009 env['HG'] = hgexecutable()
1010 if out is None or _isstdout(out):
1010 if out is None or _isstdout(out):
1011 rc = subprocess.call(cmd, shell=True, close_fds=closefds,
1011 rc = subprocess.call(cmd, shell=True, close_fds=closefds,
1012 env=env, cwd=cwd)
1012 env=env, cwd=cwd)
1013 else:
1013 else:
1014 proc = subprocess.Popen(cmd, shell=True, close_fds=closefds,
1014 proc = subprocess.Popen(cmd, shell=True, close_fds=closefds,
1015 env=env, cwd=cwd, stdout=subprocess.PIPE,
1015 env=env, cwd=cwd, stdout=subprocess.PIPE,
1016 stderr=subprocess.STDOUT)
1016 stderr=subprocess.STDOUT)
1017 for line in iter(proc.stdout.readline, ''):
1017 for line in iter(proc.stdout.readline, ''):
1018 out.write(line)
1018 out.write(line)
1019 proc.wait()
1019 proc.wait()
1020 rc = proc.returncode
1020 rc = proc.returncode
1021 if sys.platform == 'OpenVMS' and rc & 1:
1021 if sys.platform == 'OpenVMS' and rc & 1:
1022 rc = 0
1022 rc = 0
1023 if rc and onerr:
1023 if rc and onerr:
1024 errmsg = '%s %s' % (os.path.basename(origcmd.split(None, 1)[0]),
1024 errmsg = '%s %s' % (os.path.basename(origcmd.split(None, 1)[0]),
1025 explainexit(rc)[0])
1025 explainexit(rc)[0])
1026 if errprefix:
1026 if errprefix:
1027 errmsg = '%s: %s' % (errprefix, errmsg)
1027 errmsg = '%s: %s' % (errprefix, errmsg)
1028 raise onerr(errmsg)
1028 raise onerr(errmsg)
1029 return rc
1029 return rc
1030
1030
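# Editor's aside -- usage sketch, POSIX shell assumed: run a command
# with extra environment entries and capture combined stdout/stderr in
# any object that has a write() method.
def _demo_system():
    import io
    buf = io.BytesIO()
    rc = system('echo "$HG_DEMO"', environ={'HG_DEMO': 'hello'}, out=buf)
    return rc, buf.getvalue()       # (0, 'hello\n') on success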
1031 def checksignature(func):
1031 def checksignature(func):
1032 '''wrap a function with code to check for calling errors'''
1032 '''wrap a function with code to check for calling errors'''
1033 def check(*args, **kwargs):
1033 def check(*args, **kwargs):
1034 try:
1034 try:
1035 return func(*args, **kwargs)
1035 return func(*args, **kwargs)
1036 except TypeError:
1036 except TypeError:
1037 if len(traceback.extract_tb(sys.exc_info()[2])) == 1:
1037 if len(traceback.extract_tb(sys.exc_info()[2])) == 1:
1038 raise error.SignatureError
1038 raise error.SignatureError
1039 raise
1039 raise
1040
1040
1041 return check
1041 return check
1042
1042
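# Editor's aside -- behavior sketch: a TypeError raised by the call
# itself (wrong arity) becomes SignatureError, while TypeErrors from
# deeper frames propagate unchanged.
def _demo_checksignature():
    @checksignature
    def twoargs(a, b):
        return a + b
    try:
        twoargs(1)                  # wrong arity
    except error.SignatureError:
        pass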
1043 def copyfile(src, dest, hardlink=False, copystat=False, checkambig=False):
1043 def copyfile(src, dest, hardlink=False, copystat=False, checkambig=False):
1044 '''copy a file, preserving mode and optionally other stat info like
1044 '''copy a file, preserving mode and optionally other stat info like
1045 atime/mtime
1045 atime/mtime
1046
1046
1047 checkambig argument is used with filestat, and is useful only if
1047 checkambig argument is used with filestat, and is useful only if
1048 destination file is guarded by any lock (e.g. repo.lock or
1048 destination file is guarded by any lock (e.g. repo.lock or
1049 repo.wlock).
1049 repo.wlock).
1050
1050
1051 copystat and checkambig should be exclusive.
1051 copystat and checkambig should be exclusive.
1052 '''
1052 '''
1053 assert not (copystat and checkambig)
1053 assert not (copystat and checkambig)
1054 oldstat = None
1054 oldstat = None
1055 if os.path.lexists(dest):
1055 if os.path.lexists(dest):
1056 if checkambig:
1056 if checkambig:
1057 oldstat = filestat(dest)
1057 oldstat = filestat(dest)
1058 unlink(dest)
1058 unlink(dest)
1059 # hardlinks are problematic on CIFS, quietly ignore this flag
1059 # hardlinks are problematic on CIFS, quietly ignore this flag
1060 # until we find a way to work around it cleanly (issue4546)
1060 # until we find a way to work around it cleanly (issue4546)
1061 if False and hardlink:
1061 if False and hardlink:
1062 try:
1062 try:
1063 oslink(src, dest)
1063 oslink(src, dest)
1064 return
1064 return
1065 except (IOError, OSError):
1065 except (IOError, OSError):
1066 pass # fall back to normal copy
1066 pass # fall back to normal copy
1067 if os.path.islink(src):
1067 if os.path.islink(src):
1068 os.symlink(os.readlink(src), dest)
1068 os.symlink(os.readlink(src), dest)
1069 # copytime is ignored for symlinks, but in general copytime isn't needed
1069 # copytime is ignored for symlinks, but in general copytime isn't needed
1070 # for them anyway
1070 # for them anyway
1071 else:
1071 else:
1072 try:
1072 try:
1073 shutil.copyfile(src, dest)
1073 shutil.copyfile(src, dest)
1074 if copystat:
1074 if copystat:
1075 # copystat also copies mode
1075 # copystat also copies mode
1076 shutil.copystat(src, dest)
1076 shutil.copystat(src, dest)
1077 else:
1077 else:
1078 shutil.copymode(src, dest)
1078 shutil.copymode(src, dest)
1079 if oldstat and oldstat.stat:
1079 if oldstat and oldstat.stat:
1080 newstat = filestat(dest)
1080 newstat = filestat(dest)
1081 if newstat.isambig(oldstat):
1081 if newstat.isambig(oldstat):
1082 # stat of copied file is ambiguous to original one
1082 # stat of copied file is ambiguous to original one
1083 advanced = (oldstat.stat.st_mtime + 1) & 0x7fffffff
1083 advanced = (oldstat.stat.st_mtime + 1) & 0x7fffffff
1084 os.utime(dest, (advanced, advanced))
1084 os.utime(dest, (advanced, advanced))
1085 except shutil.Error as inst:
1085 except shutil.Error as inst:
1086 raise Abort(str(inst))
1086 raise Abort(str(inst))
1087
1087
1088 def copyfiles(src, dst, hardlink=None, progress=lambda t, pos: None):
1088 def copyfiles(src, dst, hardlink=None, progress=lambda t, pos: None):
1089 """Copy a directory tree using hardlinks if possible."""
1089 """Copy a directory tree using hardlinks if possible."""
1090 num = 0
1090 num = 0
1091
1091
1092 if hardlink is None:
1092 if hardlink is None:
1093 hardlink = (os.stat(src).st_dev ==
1093 hardlink = (os.stat(src).st_dev ==
1094 os.stat(os.path.dirname(dst)).st_dev)
1094 os.stat(os.path.dirname(dst)).st_dev)
1095 if hardlink:
1095 if hardlink:
1096 topic = _('linking')
1096 topic = _('linking')
1097 else:
1097 else:
1098 topic = _('copying')
1098 topic = _('copying')
1099
1099
1100 if os.path.isdir(src):
1100 if os.path.isdir(src):
1101 os.mkdir(dst)
1101 os.mkdir(dst)
1102 for name, kind in osutil.listdir(src):
1102 for name, kind in osutil.listdir(src):
1103 srcname = os.path.join(src, name)
1103 srcname = os.path.join(src, name)
1104 dstname = os.path.join(dst, name)
1104 dstname = os.path.join(dst, name)
1105 def nprog(t, pos):
1105 def nprog(t, pos):
1106 if pos is not None:
1106 if pos is not None:
1107 return progress(t, pos + num)
1107 return progress(t, pos + num)
1108 hardlink, n = copyfiles(srcname, dstname, hardlink, progress=nprog)
1108 hardlink, n = copyfiles(srcname, dstname, hardlink, progress=nprog)
1109 num += n
1109 num += n
1110 else:
1110 else:
1111 if hardlink:
1111 if hardlink:
1112 try:
1112 try:
1113 oslink(src, dst)
1113 oslink(src, dst)
1114 except (IOError, OSError):
1114 except (IOError, OSError):
1115 hardlink = False
1115 hardlink = False
1116 shutil.copy(src, dst)
1116 shutil.copy(src, dst)
1117 else:
1117 else:
1118 shutil.copy(src, dst)
1118 shutil.copy(src, dst)
1119 num += 1
1119 num += 1
1120 progress(topic, num)
1120 progress(topic, num)
1121 progress(topic, None)
1121 progress(topic, None)
1122
1122
1123 return hardlink, num
1123 return hardlink, num
1124
1124
1125 _winreservednames = '''con prn aux nul
1125 _winreservednames = '''con prn aux nul
1126 com1 com2 com3 com4 com5 com6 com7 com8 com9
1126 com1 com2 com3 com4 com5 com6 com7 com8 com9
1127 lpt1 lpt2 lpt3 lpt4 lpt5 lpt6 lpt7 lpt8 lpt9'''.split()
1127 lpt1 lpt2 lpt3 lpt4 lpt5 lpt6 lpt7 lpt8 lpt9'''.split()
1128 _winreservedchars = ':*?"<>|'
1128 _winreservedchars = ':*?"<>|'
1129 def checkwinfilename(path):
1129 def checkwinfilename(path):
1130 r'''Check that the base-relative path is a valid filename on Windows.
1130 r'''Check that the base-relative path is a valid filename on Windows.
1131 Returns None if the path is ok, or a UI string describing the problem.
1131 Returns None if the path is ok, or a UI string describing the problem.
1132
1132
1133 >>> checkwinfilename("just/a/normal/path")
1133 >>> checkwinfilename("just/a/normal/path")
1134 >>> checkwinfilename("foo/bar/con.xml")
1134 >>> checkwinfilename("foo/bar/con.xml")
1135 "filename contains 'con', which is reserved on Windows"
1135 "filename contains 'con', which is reserved on Windows"
1136 >>> checkwinfilename("foo/con.xml/bar")
1136 >>> checkwinfilename("foo/con.xml/bar")
1137 "filename contains 'con', which is reserved on Windows"
1137 "filename contains 'con', which is reserved on Windows"
1138 >>> checkwinfilename("foo/bar/xml.con")
1138 >>> checkwinfilename("foo/bar/xml.con")
1139 >>> checkwinfilename("foo/bar/AUX/bla.txt")
1139 >>> checkwinfilename("foo/bar/AUX/bla.txt")
1140 "filename contains 'AUX', which is reserved on Windows"
1140 "filename contains 'AUX', which is reserved on Windows"
1141 >>> checkwinfilename("foo/bar/bla:.txt")
1141 >>> checkwinfilename("foo/bar/bla:.txt")
1142 "filename contains ':', which is reserved on Windows"
1142 "filename contains ':', which is reserved on Windows"
1143 >>> checkwinfilename("foo/bar/b\07la.txt")
1143 >>> checkwinfilename("foo/bar/b\07la.txt")
1144 "filename contains '\\x07', which is invalid on Windows"
1144 "filename contains '\\x07', which is invalid on Windows"
1145 >>> checkwinfilename("foo/bar/bla ")
1145 >>> checkwinfilename("foo/bar/bla ")
1146 "filename ends with ' ', which is not allowed on Windows"
1146 "filename ends with ' ', which is not allowed on Windows"
1147 >>> checkwinfilename("../bar")
1147 >>> checkwinfilename("../bar")
1148 >>> checkwinfilename("foo\\")
1148 >>> checkwinfilename("foo\\")
1149 "filename ends with '\\', which is invalid on Windows"
1149 "filename ends with '\\', which is invalid on Windows"
1150 >>> checkwinfilename("foo\\/bar")
1150 >>> checkwinfilename("foo\\/bar")
1151 "directory name ends with '\\', which is invalid on Windows"
1151 "directory name ends with '\\', which is invalid on Windows"
1152 '''
1152 '''
1153 if path.endswith('\\'):
1153 if path.endswith('\\'):
1154 return _("filename ends with '\\', which is invalid on Windows")
1154 return _("filename ends with '\\', which is invalid on Windows")
1155 if '\\/' in path:
1155 if '\\/' in path:
1156 return _("directory name ends with '\\', which is invalid on Windows")
1156 return _("directory name ends with '\\', which is invalid on Windows")
1157 for n in path.replace('\\', '/').split('/'):
1157 for n in path.replace('\\', '/').split('/'):
1158 if not n:
1158 if not n:
1159 continue
1159 continue
1160 for c in n:
1160 for c in n:
1161 if c in _winreservedchars:
1161 if c in _winreservedchars:
1162 return _("filename contains '%s', which is reserved "
1162 return _("filename contains '%s', which is reserved "
1163 "on Windows") % c
1163 "on Windows") % c
1164 if ord(c) <= 31:
1164 if ord(c) <= 31:
1165 return _("filename contains %r, which is invalid "
1165 return _("filename contains %r, which is invalid "
1166 "on Windows") % c
1166 "on Windows") % c
1167 base = n.split('.')[0]
1167 base = n.split('.')[0]
1168 if base and base.lower() in _winreservednames:
1168 if base and base.lower() in _winreservednames:
1169 return _("filename contains '%s', which is reserved "
1169 return _("filename contains '%s', which is reserved "
1170 "on Windows") % base
1170 "on Windows") % base
1171 t = n[-1]
1171 t = n[-1]
1172 if t in '. ' and n not in '..':
1172 if t in '. ' and n not in '..':
1173 return _("filename ends with '%s', which is not allowed "
1173 return _("filename ends with '%s', which is not allowed "
1174 "on Windows") % t
1174 "on Windows") % t
1175
1175
1176 if os.name == 'nt':
1176 if os.name == 'nt':
1177 checkosfilename = checkwinfilename
1177 checkosfilename = checkwinfilename
1178 else:
1178 else:
1179 checkosfilename = platform.checkosfilename
1179 checkosfilename = platform.checkosfilename
1180
1180
1181 def makelock(info, pathname):
1181 def makelock(info, pathname):
1182 try:
1182 try:
1183 return os.symlink(info, pathname)
1183 return os.symlink(info, pathname)
1184 except OSError as why:
1184 except OSError as why:
1185 if why.errno == errno.EEXIST:
1185 if why.errno == errno.EEXIST:
1186 raise
1186 raise
1187 except AttributeError: # no symlink in os
1187 except AttributeError: # no symlink in os
1188 pass
1188 pass
1189
1189
1190 ld = os.open(pathname, os.O_CREAT | os.O_WRONLY | os.O_EXCL)
1190 ld = os.open(pathname, os.O_CREAT | os.O_WRONLY | os.O_EXCL)
1191 os.write(ld, info)
1191 os.write(ld, info)
1192 os.close(ld)
1192 os.close(ld)
1193
1193
1194 def readlock(pathname):
1194 def readlock(pathname):
1195 try:
1195 try:
1196 return os.readlink(pathname)
1196 return os.readlink(pathname)
1197 except OSError as why:
1197 except OSError as why:
1198 if why.errno not in (errno.EINVAL, errno.ENOSYS):
1198 if why.errno not in (errno.EINVAL, errno.ENOSYS):
1199 raise
1199 raise
1200 except AttributeError: # no symlink in os
1200 except AttributeError: # no symlink in os
1201 pass
1201 pass
1202 fp = posixfile(pathname)
1202 fp = posixfile(pathname)
1203 r = fp.read()
1203 r = fp.read()
1204 fp.close()
1204 fp.close()
1205 return r
1205 return r
1206
1206
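# Editor's aside -- round-trip sketch with a hypothetical path:
# makelock prefers storing `info` as a symlink target and falls back to
# an O_EXCL regular file; readlock reads back either representation.
def _demo_locks(tmpdir):
    lockpath = os.path.join(tmpdir, 'demo.lock')
    makelock('pid:1234', lockpath)
    assert readlock(lockpath) == 'pid:1234'
    os.unlink(lockpath)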
1207 def fstat(fp):
1207 def fstat(fp):
1208 '''stat file object that may not have fileno method.'''
1208 '''stat file object that may not have fileno method.'''
1209 try:
1209 try:
1210 return os.fstat(fp.fileno())
1210 return os.fstat(fp.fileno())
1211 except AttributeError:
1211 except AttributeError:
1212 return os.stat(fp.name)
1212 return os.stat(fp.name)
1213
1213
1214 # File system features
1214 # File system features
1215
1215
1216 def fscasesensitive(path):
1216 def fscasesensitive(path):
1217 """
1217 """
1218 Return true if the given path is on a case-sensitive filesystem
1218 Return true if the given path is on a case-sensitive filesystem
1219
1219
1220 Requires a path (like /foo/.hg) ending with a foldable final
1220 Requires a path (like /foo/.hg) ending with a foldable final
1221 directory component.
1221 directory component.
1222 """
1222 """
1223 s1 = os.lstat(path)
1223 s1 = os.lstat(path)
1224 d, b = os.path.split(path)
1224 d, b = os.path.split(path)
1225 b2 = b.upper()
1225 b2 = b.upper()
1226 if b == b2:
1226 if b == b2:
1227 b2 = b.lower()
1227 b2 = b.lower()
1228 if b == b2:
1228 if b == b2:
1229 return True # no evidence against case sensitivity
1229 return True # no evidence against case sensitivity
1230 p2 = os.path.join(d, b2)
1230 p2 = os.path.join(d, b2)
1231 try:
1231 try:
1232 s2 = os.lstat(p2)
1232 s2 = os.lstat(p2)
1233 if s2 == s1:
1233 if s2 == s1:
1234 return False
1234 return False
1235 return True
1235 return True
1236 except OSError:
1236 except OSError:
1237 return True
1237 return True

try:
    import re2
    _re2 = None
except ImportError:
    _re2 = False

class _re(object):
    def _checkre2(self):
        global _re2
        try:
            # check if match works, see issue3964
            _re2 = bool(re2.match(r'\[([^\[]+)\]', '[ui]'))
        except ImportError:
            _re2 = False

    def compile(self, pat, flags=0):
        '''Compile a regular expression, using re2 if possible

        For best performance, use only re2-compatible regexp features. The
        only flags from the re module that are re2-compatible are
        IGNORECASE and MULTILINE.'''
        if _re2 is None:
            self._checkre2()
        if _re2 and (flags & ~(remod.IGNORECASE | remod.MULTILINE)) == 0:
            if flags & remod.IGNORECASE:
                pat = '(?i)' + pat
            if flags & remod.MULTILINE:
                pat = '(?m)' + pat
            try:
                return re2.compile(pat)
            except re2.error:
                pass
        return remod.compile(pat, flags)

    @propertycache
    def escape(self):
        '''Return the version of escape corresponding to self.compile.

        This is imperfect because whether re2 or re is used for a particular
        function depends on the flags, etc, but it's the best we can do.
        '''
        global _re2
        if _re2 is None:
            self._checkre2()
        if _re2:
            return re2.escape
        else:
            return remod.escape

re = _re()
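
# Usage sketch: callers go through this module-level instance so the re2
# probe runs lazily on first use; flags other than IGNORECASE/MULTILINE
# silently fall back to the stdlib engine (illustrative pattern):
#
#   pat = re.compile(r'[0-9a-f]{6,40}', remod.IGNORECASE)
#   if pat.match(node):
#       ...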

_fspathcache = {}
def fspath(name, root):
    '''Get name in the case stored in the filesystem

    The name should be relative to root, and be normcase-ed for efficiency.

    Note that this function is unnecessary, and should not be
    called, for case-sensitive filesystems (simply because it's expensive).

    The root should be normcase-ed, too.
    '''
    def _makefspathcacheentry(dir):
        return dict((normcase(n), n) for n in os.listdir(dir))

    seps = os.sep
    if os.altsep:
        seps = seps + os.altsep
    # Protect backslashes. This gets silly very quickly.
    # (str.replace returns a new string, so the result must be assigned.)
    seps = seps.replace('\\', '\\\\')
    pattern = remod.compile(r'([^%s]+)|([%s]+)' % (seps, seps))
    dir = os.path.normpath(root)
    result = []
    for part, sep in pattern.findall(name):
        if sep:
            result.append(sep)
            continue

        if dir not in _fspathcache:
            _fspathcache[dir] = _makefspathcacheentry(dir)
        contents = _fspathcache[dir]

        found = contents.get(part)
        if not found:
            # retry "once per directory" per "dirstate.walk" which
            # may take place for each patch of "hg qpush", for example
            _fspathcache[dir] = contents = _makefspathcacheentry(dir)
            found = contents.get(part)

        result.append(found or part)
        dir = os.path.join(dir, part)

    return ''.join(result)
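
# Illustrative round trip on a case-insensitive filesystem, assuming the
# file is stored on disk as 'Foo/README.txt' under '/repo':
#
#   fspath('foo/readme.txt', '/repo')   # -> 'Foo/README.txt'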

def checknlink(testfile):
    '''check whether hardlink count reporting works properly'''

    # testfile may be open, so we need a separate file for checking to
    # work around issue2543 (or testfile may get lost on Samba shares)
    f1 = testfile + ".hgtmp1"
    if os.path.lexists(f1):
        return False
    try:
        posixfile(f1, 'w').close()
    except IOError:
        try:
            os.unlink(f1)
        except OSError:
            pass
        return False

    f2 = testfile + ".hgtmp2"
    fd = None
    try:
        oslink(f1, f2)
        # nlinks() may behave differently for files on Windows shares if
        # the file is open.
        fd = posixfile(f2)
        return nlinks(f2) > 1
    except OSError:
        return False
    finally:
        if fd is not None:
            fd.close()
        for f in (f1, f2):
            try:
                os.unlink(f)
            except OSError:
                pass
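
# Callers typically probe once per store and cache the answer, along the
# lines of (hypothetical path):
#
#   hardlinkswork = checknlink(os.path.join(storedir, "probe"))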

def endswithsep(path):
    '''Check path ends with os.sep or os.altsep.'''
    return path.endswith(os.sep) or os.altsep and path.endswith(os.altsep)

def splitpath(path):
    '''Split path by os.sep.
    Note that this function does not use os.altsep because it is
    intended as a simple alternative to "xxx.split(os.sep)".
    It is recommended to apply os.path.normpath() to the path before
    using this function, if needed.'''
    return path.split(os.sep)
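
# For example, on POSIX where os.sep == '/':
#
#   splitpath('foo/bar/baz')   # -> ['foo', 'bar', 'baz']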

def gui():
    '''Are we running in a GUI?'''
    if sys.platform == 'darwin':
        if 'SSH_CONNECTION' in os.environ:
            # handle SSH access to a box where the user is logged in
            return False
        elif getattr(osutil, 'isgui', None):
            # check if a CoreGraphics session is available
            return osutil.isgui()
        else:
            # pure build; use a safe default
            return True
    else:
        return os.name == "nt" or os.environ.get("DISPLAY")

def mktempcopy(name, emptyok=False, createmode=None):
    """Create a temporary file with the same contents as name

    The permission bits are copied from the original file.

    If the temporary file is going to be truncated immediately, you
    can use emptyok=True as an optimization.

    Returns the name of the temporary file.
    """
    d, fn = os.path.split(name)
    fd, temp = tempfile.mkstemp(prefix='.%s-' % fn, dir=d)
    os.close(fd)
    # Temporary files are created with mode 0600, which is usually not
    # what we want. If the original file already exists, just copy
    # its mode. Otherwise, manually obey umask.
    copymode(name, temp, createmode)
    if emptyok:
        return temp
    try:
        try:
            ifp = posixfile(name, "rb")
        except IOError as inst:
            if inst.errno == errno.ENOENT:
                return temp
            if not getattr(inst, 'filename', None):
                inst.filename = name
            raise
        ofp = posixfile(temp, "wb")
        for chunk in filechunkiter(ifp):
            ofp.write(chunk)
        ifp.close()
        ofp.close()
    except: # re-raises
        try: os.unlink(temp)
        except OSError: pass
        raise
    return temp

class filestat(object):
    """help to exactly detect change of a file

    'stat' attribute is the result of 'os.stat()' if the specified 'path'
    exists. Otherwise, it is None. This avoids a preparatory 'exists()'
    examination on the caller's side.
    """
    def __init__(self, path):
        try:
            self.stat = os.stat(path)
        except OSError as err:
            if err.errno != errno.ENOENT:
                raise
            self.stat = None

    __hash__ = object.__hash__

    def __eq__(self, old):
        try:
            # if ambiguity between stat of new and old file is
            # avoided, comparison of size, ctime and mtime is enough
            # to exactly detect change of a file regardless of platform
            return (self.stat.st_size == old.stat.st_size and
                    self.stat.st_ctime == old.stat.st_ctime and
                    self.stat.st_mtime == old.stat.st_mtime)
        except AttributeError:
            return False

    def isambig(self, old):
        """Examine whether new (= self) stat is ambiguous against old one

        "S[N]" below means stat of a file at N-th change:

        - S[n-1].ctime < S[n].ctime: can detect change of a file
        - S[n-1].ctime == S[n].ctime
          - S[n-1].ctime < S[n].mtime: means natural advancing (*1)
          - S[n-1].ctime == S[n].mtime: is ambiguous (*2)
          - S[n-1].ctime > S[n].mtime: never occurs naturally (don't care)
        - S[n-1].ctime > S[n].ctime: never occurs naturally (don't care)

        Case (*2) above means that a file was changed twice or more at
        the same time in seconds (= S[n-1].ctime), and comparison of
        timestamps is ambiguous.

        The base idea to avoid such ambiguity is "advance mtime 1 sec,
        if the timestamp is ambiguous".

        But advancing mtime only in case (*2) doesn't work as
        expected, because naturally advanced S[n].mtime in case (*1)
        might be equal to manually advanced S[n-1 or earlier].mtime.

        Therefore, all "S[n-1].ctime == S[n].ctime" cases should be
        treated as ambiguous regardless of mtime, to avoid overlooking
        a collision between such mtimes.

        Advancing mtime "if isambig(oldstat)" ensures "S[n-1].mtime !=
        S[n].mtime", even if the size of a file isn't changed.
        """
        try:
            return (self.stat.st_ctime == old.stat.st_ctime)
        except AttributeError:
            return False

    def __ne__(self, other):
        return not self == other
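
# Sketch of the intended call pattern (hypothetical path), as used by
# atomictempfile.close() below:
#
#   old = filestat(path)            # snapshot before replacing the file
#   ...replace the file...
#   if filestat(path).isambig(old):
#       # ctime did not advance, so mtime alone cannot distinguish the
#       # versions; advance mtime by one second
#       ...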

class atomictempfile(object):
    '''writable file object that atomically updates a file

    All writes will go to a temporary copy of the original file. Call
    close() when you are done writing, and atomictempfile will rename
    the temporary copy to the original name, making the changes
    visible. If the object is destroyed without being closed, all your
    writes are discarded.

    checkambig argument of constructor is used with filestat, and is
    useful only if target file is guarded by any lock (e.g. repo.lock
    or repo.wlock).
    '''
    def __init__(self, name, mode='w+b', createmode=None, checkambig=False):
        self.__name = name # permanent name
        self._tempname = mktempcopy(name, emptyok=('w' in mode),
                                    createmode=createmode)
        self._fp = posixfile(self._tempname, mode)
        self._checkambig = checkambig

        # delegated methods
        self.read = self._fp.read
        self.write = self._fp.write
        self.seek = self._fp.seek
        self.tell = self._fp.tell
        self.fileno = self._fp.fileno

    def close(self):
        if not self._fp.closed:
            self._fp.close()
            filename = localpath(self.__name)
            oldstat = self._checkambig and filestat(filename)
            if oldstat and oldstat.stat:
                rename(self._tempname, filename)
                newstat = filestat(filename)
                if newstat.isambig(oldstat):
                    # stat of changed file is ambiguous to original one
                    advanced = (oldstat.stat.st_mtime + 1) & 0x7fffffff
                    os.utime(filename, (advanced, advanced))
            else:
                rename(self._tempname, filename)

    def discard(self):
        if not self._fp.closed:
            try:
                os.unlink(self._tempname)
            except OSError:
                pass
            self._fp.close()

    def __del__(self):
        if safehasattr(self, '_fp'): # constructor actually did something
            self.discard()

    def __enter__(self):
        return self

    def __exit__(self, exctype, excvalue, traceback):
        if exctype is not None:
            self.discard()
        else:
            self.close()
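
# Typical usage relies on the context-manager protocol above
# (hypothetical filename):
#
#   with atomictempfile('somefile', 'wb') as fp:
#       fp.write(data)
#   # a normal exit renames the temporary copy over 'somefile';
#   # an exception discards it, leaving 'somefile' untouched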

def makedirs(name, mode=None, notindexed=False):
    """recursive directory creation with parent mode inheritance

    Newly created directories are marked as "not to be indexed by
    the content indexing service", if ``notindexed`` is specified
    for "write" mode access.
    """
    try:
        makedir(name, notindexed)
    except OSError as err:
        if err.errno == errno.EEXIST:
            return
        if err.errno != errno.ENOENT or not name:
            raise
        parent = os.path.dirname(os.path.abspath(name))
        if parent == name:
            raise
        makedirs(parent, mode, notindexed)
        try:
            makedir(name, notindexed)
        except OSError as err:
            # Catch EEXIST to handle races
            if err.errno == errno.EEXIST:
                return
            raise
    if mode is not None:
        os.chmod(name, mode)

def readfile(path):
    with open(path, 'rb') as fp:
        return fp.read()

def writefile(path, text):
    with open(path, 'wb') as fp:
        fp.write(text)

def appendfile(path, text):
    with open(path, 'ab') as fp:
        fp.write(text)

class chunkbuffer(object):
    """Allow arbitrary sized chunks of data to be efficiently read from an
    iterator over chunks of arbitrary size."""

    def __init__(self, in_iter):
        """in_iter is the iterator that's iterating over the input chunks."""
        def splitbig(chunks):
            for chunk in chunks:
                if len(chunk) > 2**20:
                    pos = 0
                    while pos < len(chunk):
                        end = pos + 2 ** 18
                        yield chunk[pos:end]
                        pos = end
                else:
                    yield chunk
        self.iter = splitbig(in_iter)
        self._queue = collections.deque()
        self._chunkoffset = 0

    def read(self, l=None):
        """Read up to l bytes of data from the iterator of chunks of data.
        Returns less than l bytes if the iterator runs dry.

        If l is omitted, read everything"""
        if l is None:
            return ''.join(self.iter)

        left = l
        buf = []
        queue = self._queue
        while left > 0:
            # refill the queue
            if not queue:
                target = 2**18
                for chunk in self.iter:
                    queue.append(chunk)
                    target -= len(chunk)
                    if target <= 0:
                        break
                if not queue:
                    break

            # The easy way to do this would be to queue.popleft(), modify the
            # chunk (if necessary), then queue.appendleft(). However, for cases
            # where we read partial chunk content, this incurs 2 dequeue
            # mutations and creates a new str for the remaining chunk in the
            # queue. Our code below avoids this overhead.

            chunk = queue[0]
            chunkl = len(chunk)
            offset = self._chunkoffset

            # Use full chunk.
            if offset == 0 and left >= chunkl:
                left -= chunkl
                queue.popleft()
                buf.append(chunk)
                # self._chunkoffset remains at 0.
                continue

            chunkremaining = chunkl - offset

            # Use all of unconsumed part of chunk.
            if left >= chunkremaining:
                left -= chunkremaining
                queue.popleft()
                # offset == 0 is enabled by block above, so this won't merely
                # copy via ``chunk[0:]``.
                buf.append(chunk[offset:])
                self._chunkoffset = 0

            # Partial chunk needed.
            else:
                buf.append(chunk[offset:offset + left])
                self._chunkoffset += left
                left -= chunkremaining

        return ''.join(buf)
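
# Usage sketch (illustrative): sized reads may span chunk boundaries and
# return fewer bytes only once the underlying iterator runs dry:
#
#   buf = chunkbuffer(iter(['abc', 'defghi', 'j']))
#   buf.read(4)     # -> 'abcd'
#   buf.read(100)   # -> 'efghij'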

def filechunkiter(f, size=65536, limit=None):
    """Create a generator that produces the data in the file, size
    (default 65536) bytes at a time, up to optional limit (default is
    to read all data). Chunks may be less than size bytes if the
    chunk is the last chunk in the file, or the file is a socket or
    some other type of file that sometimes reads less data than is
    requested."""
    assert size >= 0
    assert limit is None or limit >= 0
    while True:
        if limit is None:
            nbytes = size
        else:
            nbytes = min(limit, size)
        s = nbytes and f.read(nbytes)
        if not s:
            break
        if limit:
            limit -= len(s)
        yield s
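
# Illustrative copy loop built on filechunkiter (hypothetical paths):
#
#   src = posixfile('src', 'rb')
#   dst = posixfile('dst', 'wb')
#   for chunk in filechunkiter(src, size=131072):
#       dst.write(chunk)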

def makedate(timestamp=None):
    '''Return a unix timestamp (or the current time) as a (unixtime,
    offset) tuple based off the local timezone.'''
    if timestamp is None:
        timestamp = time.time()
    if timestamp < 0:
        hint = _("check your clock")
        raise Abort(_("negative timestamp: %d") % timestamp, hint=hint)
    delta = (datetime.datetime.utcfromtimestamp(timestamp) -
             datetime.datetime.fromtimestamp(timestamp))
    tz = delta.days * 86400 + delta.seconds
    return timestamp, tz

def datestr(date=None, format='%a %b %d %H:%M:%S %Y %1%2'):
    """represent a (unixtime, offset) tuple as a localized time.
    unixtime is seconds since the epoch, and offset is the time zone's
    number of seconds away from UTC.

    >>> datestr((0, 0))
    'Thu Jan 01 00:00:00 1970 +0000'
    >>> datestr((42, 0))
    'Thu Jan 01 00:00:42 1970 +0000'
    >>> datestr((-42, 0))
    'Wed Dec 31 23:59:18 1969 +0000'
    >>> datestr((0x7fffffff, 0))
    'Tue Jan 19 03:14:07 2038 +0000'
    >>> datestr((-0x80000000, 0))
    'Fri Dec 13 20:45:52 1901 +0000'
    """
    t, tz = date or makedate()
    if "%1" in format or "%2" in format or "%z" in format:
        sign = (tz > 0) and "-" or "+"
        minutes = abs(tz) // 60
        q, r = divmod(minutes, 60)
        format = format.replace("%z", "%1%2")
        format = format.replace("%1", "%c%02d" % (sign, q))
        format = format.replace("%2", "%02d" % r)
    d = t - tz
    if d > 0x7fffffff:
        d = 0x7fffffff
    elif d < -0x80000000:
        d = -0x80000000
    # Never use time.gmtime() and datetime.datetime.fromtimestamp()
    # because they use the gmtime() system call which is buggy on Windows
    # for negative values.
    t = datetime.datetime(1970, 1, 1) + datetime.timedelta(seconds=d)
    s = t.strftime(format)
    return s

def shortdate(date=None):
    """turn (timestamp, tzoff) tuple into an ISO 8601 date."""
    return datestr(date, format='%Y-%m-%d')

def parsetimezone(s):
    """find a trailing timezone, if any, in string, and return a
    (offset, remainder) pair"""

    if s.endswith("GMT") or s.endswith("UTC"):
        return 0, s[:-3].rstrip()

    # Unix-style timezones [+-]hhmm
    if len(s) >= 5 and s[-5] in "+-" and s[-4:].isdigit():
        sign = (s[-5] == "+") and 1 or -1
        hours = int(s[-4:-2])
        minutes = int(s[-2:])
        return -sign * (hours * 60 + minutes) * 60, s[:-5].rstrip()

    # ISO8601 trailing Z
    if s.endswith("Z") and s[-2:-1].isdigit():
        return 0, s[:-1]

    # ISO8601-style [+-]hh:mm
    if (len(s) >= 6 and s[-6] in "+-" and s[-3] == ":" and
        s[-5:-3].isdigit() and s[-2:].isdigit()):
        sign = (s[-6] == "+") and 1 or -1
        hours = int(s[-5:-3])
        minutes = int(s[-2:])
        return -sign * (hours * 60 + minutes) * 60, s[:-6]

    return None, s
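
# Doctest-style illustrations (not part of the original test suite); note
# that under this module's sign convention a '+0200' zone yields -7200:
#
#   >>> parsetimezone('2006-02-07 10:00 +0200')
#   (-7200, '2006-02-07 10:00')
#   >>> parsetimezone('2006-02-07 10:00 UTC')
#   (0, '2006-02-07 10:00')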

def strdate(string, format, defaults=[]):
    """parse a localized time string and return a (unixtime, offset) tuple.
    if the string cannot be parsed, ValueError is raised."""
    # NOTE: unixtime = localunixtime + offset
    offset, date = parsetimezone(string)

    # add missing elements from defaults
    usenow = False # default to using biased defaults
    for part in ("S", "M", "HI", "d", "mb", "yY"): # decreasing specificity
        found = [True for p in part if ("%"+p) in format]
        if not found:
            date += "@" + defaults[part][usenow]
            format += "@%" + part[0]
        else:
            # We've found a specific time element, less specific time
            # elements are relative to today
            usenow = True

    timetuple = time.strptime(date, format)
    localunixtime = int(calendar.timegm(timetuple))
    if offset is None:
        # local timezone
        unixtime = int(time.mktime(timetuple))
        offset = unixtime - localunixtime
    else:
        unixtime = localunixtime + offset
    return unixtime, offset

def parsedate(date, formats=None, bias=None):
    """parse a localized date/time and return a (unixtime, offset) tuple.

    The date may be a "unixtime offset" string or in one of the specified
    formats. If the date already is a (unixtime, offset) tuple, it is returned.

    >>> parsedate(' today ') == parsedate(\
                                  datetime.date.today().strftime('%b %d'))
    True
    >>> parsedate('yesterday ') == parsedate((datetime.date.today() -\
                                              datetime.timedelta(days=1)\
                                             ).strftime('%b %d'))
    True
    >>> now, tz = makedate()
    >>> strnow, strtz = parsedate('now')
    >>> (strnow - now) < 1
    True
    >>> tz == strtz
    True
    """
    if bias is None:
        bias = {}
    if not date:
        return 0, 0
    if isinstance(date, tuple) and len(date) == 2:
        return date
    if not formats:
        formats = defaultdateformats
    date = date.strip()

    if date == 'now' or date == _('now'):
        return makedate()
    if date == 'today' or date == _('today'):
        date = datetime.date.today().strftime('%b %d')
    elif date == 'yesterday' or date == _('yesterday'):
        date = (datetime.date.today() -
                datetime.timedelta(days=1)).strftime('%b %d')

    try:
        when, offset = map(int, date.split(' '))
    except ValueError:
        # fill out defaults
        now = makedate()
        defaults = {}
        for part in ("d", "mb", "yY", "HI", "M", "S"):
            # this piece is for rounding the specific end of unknowns
            b = bias.get(part)
            if b is None:
                if part[0] in "HMS":
                    b = "00"
                else:
                    b = "0"

            # this piece is for matching the generic end to today's date
            n = datestr(now, "%" + part[0])

            defaults[part] = (b, n)

        for format in formats:
            try:
                when, offset = strdate(date, format, defaults)
            except (ValueError, OverflowError):
                pass
            else:
                break
        else:
            raise Abort(_('invalid date: %r') % date)
    # validate explicit (probably user-specified) date and
    # time zone offset. values must fit in signed 32 bits for
    # current 32-bit linux runtimes. timezones go from UTC-12
    # to UTC+14
    if when < -0x80000000 or when > 0x7fffffff:
        raise Abort(_('date exceeds 32 bits: %d') % when)
    if offset < -50400 or offset > 43200:
        raise Abort(_('impossible time zone offset: %d') % offset)
    return when, offset

def matchdate(date):
    """Return a function that matches a given date match specifier

    Formats include:

    '{date}' match a given date to the accuracy provided

    '<{date}' on or before a given date

    '>{date}' on or after a given date

    >>> p1 = parsedate("10:29:59")
    >>> p2 = parsedate("10:30:00")
    >>> p3 = parsedate("10:30:59")
    >>> p4 = parsedate("10:31:00")
    >>> p5 = parsedate("Sep 15 10:30:00 1999")
    >>> f = matchdate("10:30")
    >>> f(p1[0])
    False
    >>> f(p2[0])
    True
    >>> f(p3[0])
    True
    >>> f(p4[0])
    False
    >>> f(p5[0])
    False
    """

    def lower(date):
        d = {'mb': "1", 'd': "1"}
        return parsedate(date, extendeddateformats, d)[0]

    def upper(date):
        d = {'mb': "12", 'HI': "23", 'M': "59", 'S': "59"}
        for days in ("31", "30", "29"):
            try:
                d["d"] = days
                return parsedate(date, extendeddateformats, d)[0]
            except Abort:
                pass
        d["d"] = "28"
        return parsedate(date, extendeddateformats, d)[0]

    date = date.strip()

    if not date:
        raise Abort(_("dates cannot consist entirely of whitespace"))
    elif date[0] == "<":
        if not date[1:]:
            raise Abort(_("invalid day spec, use '<DATE'"))
        when = upper(date[1:])
        return lambda x: x <= when
    elif date[0] == ">":
        if not date[1:]:
            raise Abort(_("invalid day spec, use '>DATE'"))
        when = lower(date[1:])
        return lambda x: x >= when
    elif date[0] == "-":
        try:
            days = int(date[1:])
        except ValueError:
            raise Abort(_("invalid day spec: %s") % date[1:])
        if days < 0:
            raise Abort(_("%s must be nonnegative (see 'hg help dates')")
                        % date[1:])
        when = makedate()[0] - days * 3600 * 24
        return lambda x: x >= when
    elif " to " in date:
        a, b = date.split(" to ")
        start, stop = lower(a), upper(b)
        return lambda x: x >= start and x <= stop
    else:
        start, stop = lower(date), upper(date)
        return lambda x: x >= start and x <= stop

def stringmatcher(pattern):
    """
    accepts a string, possibly starting with 're:' or 'literal:' prefix.
    returns the matcher name, pattern, and matcher function.
    missing or unknown prefixes are treated as literal matches.

    helper for tests:
    >>> def test(pattern, *tests):
    ...     kind, pattern, matcher = stringmatcher(pattern)
    ...     return (kind, pattern, [bool(matcher(t)) for t in tests])

    exact matching (no prefix):
    >>> test('abcdefg', 'abc', 'def', 'abcdefg')
    ('literal', 'abcdefg', [False, False, True])

    regex matching ('re:' prefix)
    >>> test('re:a.+b', 'nomatch', 'fooadef', 'fooadefbar')
    ('re', 'a.+b', [False, False, True])

    force exact matches ('literal:' prefix)
    >>> test('literal:re:foobar', 'foobar', 're:foobar')
    ('literal', 're:foobar', [False, True])

    unknown prefixes are ignored and treated as literals
    >>> test('foo:bar', 'foo', 'bar', 'foo:bar')
    ('literal', 'foo:bar', [False, False, True])
    """
    if pattern.startswith('re:'):
        pattern = pattern[3:]
        try:
            regex = remod.compile(pattern)
        except remod.error as e:
            raise error.ParseError(_('invalid regular expression: %s')
                                   % e)
        return 're', pattern, regex.search
    elif pattern.startswith('literal:'):
        pattern = pattern[8:]
    return 'literal', pattern, pattern.__eq__

def shortuser(user):
    """Return a short representation of a user name or email address."""
    f = user.find('@')
    if f >= 0:
        user = user[:f]
    f = user.find('<')
    if f >= 0:
        user = user[f + 1:]
    f = user.find(' ')
    if f >= 0:
        user = user[:f]
    f = user.find('.')
    if f >= 0:
        user = user[:f]
    return user

def emailuser(user):
    """Return the user portion of an email address."""
    f = user.find('@')
    if f >= 0:
        user = user[:f]
    f = user.find('<')
    if f >= 0:
        user = user[f + 1:]
    return user

def email(author):
    '''get email of author.'''
    r = author.find('>')
    if r == -1:
        r = None
    return author[author.find('<') + 1:r]
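
# Doctest-style illustrations (not part of the original test suite):
#
#   >>> shortuser('John Doe <john.doe@example.com>')
#   'john'
#   >>> emailuser('John Doe <jdoe@example.com>')
#   'jdoe'
#   >>> email('John Doe <jdoe@example.com>')
#   'jdoe@example.com'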

def ellipsis(text, maxlength=400):
    """Trim string to at most maxlength (default: 400) columns in display."""
    return encoding.trim(text, maxlength, ellipsis='...')

def unitcountfn(*unittable):
    '''return a function that renders a readable count of some quantity'''

    def go(count):
        for multiplier, divisor, format in unittable:
            if count >= divisor * multiplier:
                return format % (count / float(divisor))
        return unittable[-1][2] % count

    return go

bytecount = unitcountfn(
    (100, 1 << 30, _('%.0f GB')),
    (10, 1 << 30, _('%.1f GB')),
    (1, 1 << 30, _('%.2f GB')),
    (100, 1 << 20, _('%.0f MB')),
    (10, 1 << 20, _('%.1f MB')),
    (1, 1 << 20, _('%.2f MB')),
    (100, 1 << 10, _('%.0f KB')),
    (10, 1 << 10, _('%.1f KB')),
    (1, 1 << 10, _('%.2f KB')),
    (1, 1, _('%.0f bytes')),
    )
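
# Illustrative values (assuming the default untranslated format strings):
#
#   bytecount(1536)            # -> '1.50 KB'
#   bytecount(10 * (1 << 20))  # -> '10.0 MB'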

def uirepr(s):
    # Avoid double backslash in Windows path repr()
    return repr(s).replace('\\\\', '\\')

# delay import of textwrap
def MBTextWrapper(**kwargs):
    class tw(textwrap.TextWrapper):
        """
        Extend TextWrapper for width-awareness.

        Neither the number of 'bytes' in any encoding nor the number of
        'characters' is appropriate to calculate terminal columns for a
        specified string.

        The original TextWrapper implementation uses the built-in 'len()'
        directly, so overriding is needed to use the width information of
        each character.

        In addition, characters classified as 'ambiguous' width are
        treated as wide in East Asian areas, but as narrow elsewhere.

        This requires a decision by the user to determine the width of
        such characters.
        """
        def _cutdown(self, ucstr, space_left):
            l = 0
            colwidth = encoding.ucolwidth
            for i in xrange(len(ucstr)):
                l += colwidth(ucstr[i])
                if space_left < l:
                    return (ucstr[:i], ucstr[i:])
            return ucstr, ''

        # overriding of base class
        def _handle_long_word(self, reversed_chunks, cur_line, cur_len, width):
            space_left = max(width - cur_len, 1)

            if self.break_long_words:
                cut, res = self._cutdown(reversed_chunks[-1], space_left)
                cur_line.append(cut)
                reversed_chunks[-1] = res
            elif not cur_line:
                cur_line.append(reversed_chunks.pop())

        # this overriding code is imported from TextWrapper of Python 2.6
        # to calculate columns of string by 'encoding.ucolwidth()'
        def _wrap_chunks(self, chunks):
            colwidth = encoding.ucolwidth

            lines = []
            if self.width <= 0:
                raise ValueError("invalid width %r (must be > 0)" % self.width)

            # Arrange in reverse order so items can be efficiently popped
            # from a stack of chunks.
            chunks.reverse()

            while chunks:

                # Start the list of chunks that will make up the current line.
                # cur_len is just the length of all the chunks in cur_line.
                cur_line = []
                cur_len = 0

                # Figure out which static string will prefix this line.
                if lines:
                    indent = self.subsequent_indent
                else:
                    indent = self.initial_indent

                # Maximum width for this line.
                width = self.width - len(indent)

                # First chunk on line is whitespace -- drop it, unless this
                # is the very beginning of the text (i.e. no lines started yet).
                if self.drop_whitespace and chunks[-1].strip() == '' and lines:
                    del chunks[-1]

                while chunks:
                    l = colwidth(chunks[-1])

                    # Can at least squeeze this chunk onto the current line.
                    if cur_len + l <= width:
                        cur_line.append(chunks.pop())
                        cur_len += l

                    # Nope, this line is full.
                    else:
                        break

                # The current line is full, and the next chunk is too big to
                # fit on *any* line (not just this one).
                if chunks and colwidth(chunks[-1]) > width:
                    self._handle_long_word(chunks, cur_line, cur_len, width)

                # If the last chunk on this line is all whitespace, drop it.
                if (self.drop_whitespace and
                    cur_line and cur_line[-1].strip() == ''):
                    del cur_line[-1]

                # Convert current line back to a string and store it in list
                # of all lines (return value).
                if cur_line:
                    lines.append(indent + ''.join(cur_line))

            return lines

    global MBTextWrapper
    MBTextWrapper = tw
    return tw(**kwargs)

def wrap(line, width, initindent='', hangindent=''):
    maxindent = max(len(hangindent), len(initindent))
    if width <= maxindent:
        # adjust for weird terminal size
        width = max(78, maxindent + 1)
    line = line.decode(encoding.encoding, encoding.encodingmode)
    initindent = initindent.decode(encoding.encoding, encoding.encodingmode)
    hangindent = hangindent.decode(encoding.encoding, encoding.encodingmode)
    wrapper = MBTextWrapper(width=width,
                            initial_indent=initindent,
                            subsequent_indent=hangindent)
    return wrapper.fill(line).encode(encoding.encoding)
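
# Illustrative usage (a sketch, not from the original source; assumes an
# ASCII-compatible local encoding):
#
#   wrap('the quick brown fox jumps over the lazy dog', 30, hangindent='  ')
#       -> 'the quick brown fox jumps over\n  the lazy dog'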

def iterlines(iterator):
    for chunk in iterator:
        for line in chunk.splitlines():
            yield line

def expandpath(path):
    return os.path.expanduser(os.path.expandvars(path))

def hgcmd():
    """Return the command used to execute the current hg

    This is different from hgexecutable() because on Windows we want
    to avoid things opening new shell windows like batch files, so we
    get either the python call or current executable.
    """
    if mainfrozen():
        if getattr(sys, 'frozen', None) == 'macosx_app':
            # Env variable set by py2app
            return [os.environ['EXECUTABLEPATH']]
        else:
            return [sys.executable]
    return gethgcmd()

def rundetached(args, condfn):
    """Execute the argument list in a detached process.

    condfn is a callable which is called repeatedly and should return
    True once the child process is known to have started successfully.
    At this point, the child process PID is returned. If the child
    process fails to start or finishes before condfn() evaluates to
    True, return -1.
    """
    # Windows case is easier because the child process is either
    # successfully starting and validating the condition or exiting
    # on failure. We just poll on its PID. On Unix, if the child
    # process fails to start, it will be left in a zombie state until
    # the parent waits on it, which we cannot do since we expect a
    # long-running process on success. Instead we listen for SIGCHLD
    # telling us our child process terminated.
    terminated = set()
    def handler(signum, frame):
        terminated.add(os.wait())
    prevhandler = None
    SIGCHLD = getattr(signal, 'SIGCHLD', None)
    if SIGCHLD is not None:
        prevhandler = signal.signal(SIGCHLD, handler)
    try:
        pid = spawndetached(args)
        while not condfn():
            if ((pid in terminated or not testpid(pid))
                and not condfn()):
                return -1
            time.sleep(0.1)
        return pid
    finally:
        if prevhandler is not None:
            signal.signal(signal.SIGCHLD, prevhandler)
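
# Hypothetical usage (a sketch; 'pidfile' and the readiness check are
# assumptions, not from the original source): start a daemon and wait
# until it has written its pid file.
#
#   pid = rundetached(['hg', 'serve', '-d'], lambda: os.path.exists(pidfile))
#   if pid == -1:
#       raise Abort(_('child process failed to start'))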

def interpolate(prefix, mapping, s, fn=None, escape_prefix=False):
    """Return the result of interpolating items in the mapping into string s.

    prefix is a single character string, or a two character string with
    a backslash as the first character if the prefix needs to be escaped in
    a regular expression.

    fn is an optional function that will be applied to the replacement text
    just before replacement.

    escape_prefix is an optional flag that allows a doubled prefix to act
    as an escape for a literal prefix character.
    """
    fn = fn or (lambda s: s)
    patterns = '|'.join(mapping.keys())
    if escape_prefix:
        patterns += '|' + prefix
        if len(prefix) > 1:
            prefix_char = prefix[1:]
        else:
            prefix_char = prefix
        mapping[prefix_char] = prefix_char
    r = remod.compile(r'%s(%s)' % (prefix, patterns))
    return r.sub(lambda x: fn(mapping[x.group()[1:]]), s)
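
# Illustrative usage (a sketch, not from the original source):
#
#   interpolate('%', {'foo': 'bar'}, 'got %foo')
#       -> 'got bar'
#   interpolate(r'\$', {'foo': 'bar'}, 'costs $foo, literal $$',
#               escape_prefix=True)
#       -> 'costs bar, literal $'
#
# Note that with escape_prefix the mapping is mutated to map the prefix
# character to itself.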

def getport(port):
    """Return the port for a given network service.

    If port is an integer, it's returned as is. If it's a string, it's
    looked up using socket.getservbyname(). If there's no matching
    service, error.Abort is raised.
    """
    try:
        return int(port)
    except ValueError:
        pass

    try:
        return socket.getservbyname(port)
    except socket.error:
        raise Abort(_("no port number associated with service '%s'") % port)
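
# Illustrative usage (a sketch, not from the original source; the 'https'
# lookup assumes a standard services database):
#
#   getport(8080)    -> 8080
#   getport('8080')  -> 8080
#   getport('https') -> 443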

_booleans = {'1': True, 'yes': True, 'true': True, 'on': True, 'always': True,
             '0': False, 'no': False, 'false': False, 'off': False,
             'never': False}

def parsebool(s):
    """Parse s into a boolean.

    If s is not a valid boolean, returns None.
    """
    return _booleans.get(s.lower(), None)
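
# Illustrative usage (a sketch, not from the original source):
#
#   parsebool('on')    -> True
#   parsebool('Never') -> False   (the lookup is case-insensitive)
#   parsebool('maybe') -> None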

_hexdig = '0123456789ABCDEFabcdef'
_hextochr = dict((a + b, chr(int(a + b, 16)))
                 for a in _hexdig for b in _hexdig)

def _urlunquote(s):
    """Decode HTTP/HTML % encoding.

    >>> _urlunquote('abc%20def')
    'abc def'
    """
    res = s.split('%')
    # fastpath
    if len(res) == 1:
        return s
    s = res[0]
    for item in res[1:]:
        try:
            s += _hextochr[item[:2]] + item[2:]
        except KeyError:
            s += '%' + item
        except UnicodeDecodeError:
            s += unichr(int(item[:2], 16)) + item[2:]
    return s
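
# Malformed escapes fall through unchanged (a sketch, not from the
# original source):
#
#   _urlunquote('100%zz') -> '100%zz'   (the KeyError branch keeps the
#                                        literal '%')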

class url(object):
    r"""Reliable URL parser.

    This parses URLs and provides attributes for the following
    components:

    <scheme>://<user>:<passwd>@<host>:<port>/<path>?<query>#<fragment>

    Missing components are set to None. The only exception is
    fragment, which is set to '' if present but empty.

    If parsefragment is False, fragment is included in query. If
    parsequery is False, query is included in path. If both are
    False, both fragment and query are included in path.

    See http://www.ietf.org/rfc/rfc2396.txt for more information.

    Note that for backward compatibility reasons, bundle URLs do not
    take host names. That means 'bundle://../' has a path of '../'.

    Examples:

    >>> url('http://www.ietf.org/rfc/rfc2396.txt')
    <url scheme: 'http', host: 'www.ietf.org', path: 'rfc/rfc2396.txt'>
    >>> url('ssh://[::1]:2200//home/joe/repo')
    <url scheme: 'ssh', host: '[::1]', port: '2200', path: '/home/joe/repo'>
    >>> url('file:///home/joe/repo')
    <url scheme: 'file', path: '/home/joe/repo'>
    >>> url('file:///c:/temp/foo/')
    <url scheme: 'file', path: 'c:/temp/foo/'>
    >>> url('bundle:foo')
    <url scheme: 'bundle', path: 'foo'>
    >>> url('bundle://../foo')
    <url scheme: 'bundle', path: '../foo'>
    >>> url(r'c:\foo\bar')
    <url path: 'c:\\foo\\bar'>
    >>> url(r'\\blah\blah\blah')
    <url path: '\\\\blah\\blah\\blah'>
    >>> url(r'\\blah\blah\blah#baz')
    <url path: '\\\\blah\\blah\\blah', fragment: 'baz'>
    >>> url(r'file:///C:\users\me')
    <url scheme: 'file', path: 'C:\\users\\me'>

    Authentication credentials:

    >>> url('ssh://joe:xyz@x/repo')
    <url scheme: 'ssh', user: 'joe', passwd: 'xyz', host: 'x', path: 'repo'>
    >>> url('ssh://joe@x/repo')
    <url scheme: 'ssh', user: 'joe', host: 'x', path: 'repo'>

    Query strings and fragments:

    >>> url('http://host/a?b#c')
    <url scheme: 'http', host: 'host', path: 'a', query: 'b', fragment: 'c'>
    >>> url('http://host/a?b#c', parsequery=False, parsefragment=False)
    <url scheme: 'http', host: 'host', path: 'a?b#c'>
    """

    _safechars = "!~*'()+"
    _safepchars = "/!~*'()+:\\"
    _matchscheme = remod.compile(r'^[a-zA-Z0-9+.\-]+:').match

    def __init__(self, path, parsequery=True, parsefragment=True):
        # We slowly chomp away at path until we have only the path left
        self.scheme = self.user = self.passwd = self.host = None
        self.port = self.path = self.query = self.fragment = None
        self._localpath = True
        self._hostport = ''
        self._origpath = path

        if parsefragment and '#' in path:
            path, self.fragment = path.split('#', 1)
            if not path:
                path = None

        # special case for Windows drive letters and UNC paths
        if hasdriveletter(path) or path.startswith(r'\\'):
            self.path = path
            return

        # For compatibility reasons, we can't handle bundle paths as
        # normal URLs
        if path.startswith('bundle:'):
            self.scheme = 'bundle'
            path = path[7:]
            if path.startswith('//'):
                path = path[2:]
            self.path = path
            return

        if self._matchscheme(path):
            parts = path.split(':', 1)
            if parts[0]:
                self.scheme, path = parts
                self._localpath = False

        if not path:
            path = None
            if self._localpath:
                self.path = ''
                return
        else:
            if self._localpath:
                self.path = path
                return

            if parsequery and '?' in path:
                path, self.query = path.split('?', 1)
                if not path:
                    path = None
                if not self.query:
                    self.query = None

            # // is required to specify a host/authority
            if path and path.startswith('//'):
                parts = path[2:].split('/', 1)
                if len(parts) > 1:
                    self.host, path = parts
                else:
                    self.host = parts[0]
                    path = None
                if not self.host:
                    self.host = None
                    # path of file:///d is /d
                    # path of file:///d:/ is d:/, not /d:/
                    if path and not hasdriveletter(path):
                        path = '/' + path

            if self.host and '@' in self.host:
                self.user, self.host = self.host.rsplit('@', 1)
                if ':' in self.user:
                    self.user, self.passwd = self.user.split(':', 1)
                if not self.host:
                    self.host = None

            # Don't split on colons in IPv6 addresses without ports
            if (self.host and ':' in self.host and
                not (self.host.startswith('[') and self.host.endswith(']'))):
                self._hostport = self.host
                self.host, self.port = self.host.rsplit(':', 1)
                if not self.host:
                    self.host = None

            if (self.host and self.scheme == 'file' and
                self.host not in ('localhost', '127.0.0.1', '[::1]')):
                raise Abort(_('file:// URLs can only refer to localhost'))

        self.path = path

        # leave the query string escaped
        for a in ('user', 'passwd', 'host', 'port',
                  'path', 'fragment'):
            v = getattr(self, a)
            if v is not None:
                setattr(self, a, _urlunquote(v))

    def __repr__(self):
        attrs = []
        for a in ('scheme', 'user', 'passwd', 'host', 'port', 'path',
                  'query', 'fragment'):
            v = getattr(self, a)
            if v is not None:
                attrs.append('%s: %r' % (a, v))
        return '<url %s>' % ', '.join(attrs)

    def __str__(self):
        r"""Join the URL's components back into a URL string.

        Examples:

        >>> str(url('http://user:pw@host:80/c:/bob?fo:oo#ba:ar'))
        'http://user:pw@host:80/c:/bob?fo:oo#ba:ar'
        >>> str(url('http://user:pw@host:80/?foo=bar&baz=42'))
        'http://user:pw@host:80/?foo=bar&baz=42'
        >>> str(url('http://user:pw@host:80/?foo=bar%3dbaz'))
        'http://user:pw@host:80/?foo=bar%3dbaz'
        >>> str(url('ssh://user:pw@[::1]:2200//home/joe#'))
        'ssh://user:pw@[::1]:2200//home/joe#'
        >>> str(url('http://localhost:80//'))
        'http://localhost:80//'
        >>> str(url('http://localhost:80/'))
        'http://localhost:80/'
        >>> str(url('http://localhost:80'))
        'http://localhost:80/'
        >>> str(url('bundle:foo'))
        'bundle:foo'
        >>> str(url('bundle://../foo'))
        'bundle:../foo'
        >>> str(url('path'))
        'path'
        >>> str(url('file:///tmp/foo/bar'))
        'file:///tmp/foo/bar'
        >>> str(url('file:///c:/tmp/foo/bar'))
        'file:///c:/tmp/foo/bar'
        >>> print url(r'bundle:foo\bar')
        bundle:foo\bar
        >>> print url(r'file:///D:\data\hg')
        file:///D:\data\hg
        """
        if self._localpath:
            s = self.path
            if self.scheme == 'bundle':
                s = 'bundle:' + s
            if self.fragment:
                s += '#' + self.fragment
            return s

        s = self.scheme + ':'
        if self.user or self.passwd or self.host:
            s += '//'
        elif self.scheme and (not self.path or self.path.startswith('/')
                              or hasdriveletter(self.path)):
            s += '//'
            if hasdriveletter(self.path):
                s += '/'
        if self.user:
            s += urlreq.quote(self.user, safe=self._safechars)
        if self.passwd:
            s += ':' + urlreq.quote(self.passwd, safe=self._safechars)
        if self.user or self.passwd:
            s += '@'
        if self.host:
            if not (self.host.startswith('[') and self.host.endswith(']')):
                s += urlreq.quote(self.host)
            else:
                s += self.host
            if self.port:
                s += ':' + urlreq.quote(self.port)
            s += '/'
        if self.path:
            # TODO: similar to the query string, we should not unescape the
            # path when we store it, the path might contain '%2f' = '/',
            # which we should *not* escape.
            s += urlreq.quote(self.path, safe=self._safepchars)
        if self.query:
            # we store the query in escaped form.
            s += '?' + self.query
        if self.fragment is not None:
            s += '#' + urlreq.quote(self.fragment, safe=self._safepchars)
        return s

    def authinfo(self):
        user, passwd = self.user, self.passwd
        try:
            self.user, self.passwd = None, None
            s = str(self)
        finally:
            self.user, self.passwd = user, passwd
        if not self.user:
            return (s, None)
        # authinfo[1] is passed to urllib2 password manager, and its
        # URIs must not contain credentials. The host is passed in the
        # URIs list because Python < 2.4.3 uses only that to search for
        # a password.
        return (s, (None, (s, self.host),
                    self.user, self.passwd or ''))

    def isabs(self):
        if self.scheme and self.scheme != 'file':
            return True # remote URL
        if hasdriveletter(self.path):
            return True # absolute for our purposes - can't be joined()
        if self.path.startswith(r'\\'):
            return True # Windows UNC path
        if self.path.startswith('/'):
            return True # POSIX-style
        return False

    def localpath(self):
        if self.scheme == 'file' or self.scheme == 'bundle':
            path = self.path or '/'
            # For Windows, we need to promote hosts containing drive
            # letters to paths with drive letters.
            if hasdriveletter(self._hostport):
                path = self._hostport + '/' + self.path
            elif (self.host is not None and self.path
                  and not hasdriveletter(path)):
                path = '/' + path
            return path
        return self._origpath

    def islocal(self):
        '''whether localpath will return something that posixfile can open'''
        return (not self.scheme or self.scheme == 'file'
                or self.scheme == 'bundle')

def hasscheme(path):
    return bool(url(path).scheme)

def hasdriveletter(path):
    return path and path[1:2] == ':' and path[0:1].isalpha()

def urllocalpath(path):
    return url(path, parsequery=False, parsefragment=False).localpath()

def hidepassword(u):
    '''hide user credential in a url string'''
    u = url(u)
    if u.passwd:
        u.passwd = '***'
    return str(u)

def removeauth(u):
    '''remove all authentication information from a url string'''
    u = url(u)
    u.user = u.passwd = None
    return str(u)
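
# Illustrative usage (a sketch, not from the original source):
#
#   hidepassword('http://user:secret@example.com/repo')
#       -> 'http://user:***@example.com/repo'
#   removeauth('http://user:secret@example.com/repo')
#       -> 'http://example.com/repo'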

def isatty(fp):
    try:
        return fp.isatty()
    except AttributeError:
        return False

timecount = unitcountfn(
    (1, 1e3, _('%.0f s')),
    (100, 1, _('%.1f s')),
    (10, 1, _('%.2f s')),
    (1, 1, _('%.3f s')),
    (100, 0.001, _('%.1f ms')),
    (10, 0.001, _('%.2f ms')),
    (1, 0.001, _('%.3f ms')),
    (100, 0.000001, _('%.1f us')),
    (10, 0.000001, _('%.2f us')),
    (1, 0.000001, _('%.3f us')),
    (100, 0.000000001, _('%.1f ns')),
    (10, 0.000000001, _('%.2f ns')),
    (1, 0.000000001, _('%.3f ns')),
    )

_timenesting = [0]

def timed(func):
    '''Report the execution time of a function call to stderr.

    During development, use as a decorator when you need to measure
    the cost of a function, e.g. as follows:

    @util.timed
    def foo(a, b, c):
        pass
    '''

    def wrapper(*args, **kwargs):
        start = time.time()
        indent = 2
        _timenesting[0] += indent
        try:
            return func(*args, **kwargs)
        finally:
            elapsed = time.time() - start
            _timenesting[0] -= indent
            sys.stderr.write('%s%s: %s\n' %
                             (' ' * _timenesting[0], func.__name__,
                              timecount(elapsed)))
    return wrapper

_sizeunits = (('m', 2**20), ('k', 2**10), ('g', 2**30),
              ('kb', 2**10), ('mb', 2**20), ('gb', 2**30), ('b', 1))

def sizetoint(s):
    '''Convert a space specifier to a byte count.

    >>> sizetoint('30')
    30
    >>> sizetoint('2.2kb')
    2252
    >>> sizetoint('6M')
    6291456
    '''
    t = s.strip().lower()
    try:
        for k, u in _sizeunits:
            if t.endswith(k):
                return int(float(t[:-len(k)]) * u)
        return int(t)
    except ValueError:
        raise error.ParseError(_("couldn't parse size: %s") % s)

class hooks(object):
    '''A collection of hook functions that can be used to extend a
    function's behavior. Hooks are called in lexicographic order,
    based on the names of their sources.'''

    def __init__(self):
        self._hooks = []

    def add(self, source, hook):
        self._hooks.append((source, hook))

    def __call__(self, *args):
        self._hooks.sort(key=lambda x: x[0])
        results = []
        for source, hook in self._hooks:
            results.append(hook(*args))
        return results
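
# Illustrative usage (a sketch, not from the original source):
#
#   h = hooks()
#   h.add('zsource', lambda x: x * 2)
#   h.add('asource', lambda x: x + 1)
#   h(3)    -> [4, 6]   ('asource' sorts before 'zsource')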

def getstackframes(skip=0, line=' %-*s in %s\n', fileline='%s:%s'):
    '''Yields lines for a nicely formatted stacktrace.
    Skips the 'skip' last entries.
    Each file+linenumber is formatted according to fileline.
    Each line is formatted according to line.
    If line is None, it yields:
      length of longest filepath+line number,
      filepath+linenumber,
      function

    Not to be used in production code, but very convenient while developing.
    '''
    entries = [(fileline % (fn, ln), func)
               for fn, ln, func, _text in traceback.extract_stack()[:-skip - 1]]
    if entries:
        fnmax = max(len(entry[0]) for entry in entries)
        for fnln, func in entries:
            if line is None:
                yield (fnmax, fnln, func)
            else:
                yield line % (fnmax, fnln, func)

def debugstacktrace(msg='stacktrace', skip=0, f=sys.stderr, otherf=sys.stdout):
    '''Writes a message to f (stderr) with a nicely formatted stacktrace.
    Skips the 'skip' last entries. By default it will flush stdout first.
    It can be used everywhere and intentionally does not require an ui object.
    Not to be used in production code, but very convenient while developing.
    '''
    if otherf:
        otherf.flush()
    f.write('%s at:\n' % msg)
    for line in getstackframes(skip + 1):
        f.write(line)
    f.flush()
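
# Illustrative usage (a sketch, not from the original source): drop this
# into a suspect function temporarily to see who calls it.
#
#   debugstacktrace('entering foo', skip=1)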

class dirs(object):
    '''a multiset of directory names from a dirstate or manifest'''

    def __init__(self, map, skip=None):
        self._dirs = {}
        addpath = self.addpath
        if safehasattr(map, 'iteritems') and skip is not None:
            for f, s in map.iteritems():
                if s[0] != skip:
                    addpath(f)
        else:
            for f in map:
                addpath(f)

    def addpath(self, path):
        dirs = self._dirs
        for base in finddirs(path):
            if base in dirs:
                dirs[base] += 1
                return
            dirs[base] = 1

    def delpath(self, path):
        dirs = self._dirs
        for base in finddirs(path):
            if dirs[base] > 1:
                dirs[base] -= 1
                return
            del dirs[base]

    def __iter__(self):
        return self._dirs.iterkeys()

    def __contains__(self, d):
        return d in self._dirs

if safehasattr(parsers, 'dirs'):
    dirs = parsers.dirs
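
# Illustrative usage (a sketch, not from the original source):
#
#   d = dirs(['a/b/c', 'a/d'])
#   'a' in d      -> True
#   'a/b' in d    -> True
#   'a/b/c' in d  -> False   (leaf entries are files, not directories)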

def finddirs(path):
    pos = path.rfind('/')
    while pos != -1:
        yield path[:pos]
        pos = path.rfind('/', 0, pos)
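
# Illustrative usage (a sketch, not from the original source); the root
# ('') is never yielded:
#
#   list(finddirs('a/b/c')) -> ['a/b', 'a']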

# compression utility

class nocompress(object):
    def compress(self, x):
        return x
    def flush(self):
        return ""

compressors = {
    None: nocompress,
    # lambda to prevent early import
    'BZ': lambda: bz2.BZ2Compressor(),
    'GZ': lambda: zlib.compressobj(),
    }
# also support the old form as a courtesy
compressors['UN'] = compressors[None]
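
# Illustrative usage (a sketch, not from the original source):
#
#   comp = compressors['BZ']()
#   data = comp.compress('chunk one') + comp.compress('chunk two')
#   data += comp.flush()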

def _makedecompressor(decompcls):
    def generator(f):
        d = decompcls()
        for chunk in filechunkiter(f):
            yield d.decompress(chunk)
    def func(fh):
        return chunkbuffer(generator(fh))
    return func

class ctxmanager(object):
    '''A context manager for use in 'with' blocks to allow multiple
    contexts to be entered at once. This is both safer and more
    flexible than contextlib.nested.

    Once Mercurial supports Python 2.7+, this will become mostly
    unnecessary.
    '''

    def __init__(self, *args):
        '''Accepts a list of no-argument functions that return context
        managers. These will be invoked when enter() is called.'''
        self._pending = args
        self._atexit = []

    def __enter__(self):
        return self

    def enter(self):
        '''Create and enter context managers in the order in which they were
        passed to the constructor.'''
        values = []
        for func in self._pending:
            obj = func()
            values.append(obj.__enter__())
            self._atexit.append(obj.__exit__)
        del self._pending
        return values

    def atexit(self, func, *args, **kwargs):
        '''Add a function to call when this context manager exits. The
        ordering of multiple atexit calls is unspecified, save that
        they will happen before any __exit__ functions.'''
        def wrapper(exc_type, exc_val, exc_tb):
            func(*args, **kwargs)
        self._atexit.append(wrapper)
        return func

    def __exit__(self, exc_type, exc_val, exc_tb):
        '''Context managers are exited in the reverse order from which
        they were created.'''
        received = exc_type is not None
        suppressed = False
        pending = None
        self._atexit.reverse()
        for exitfunc in self._atexit:
            try:
                if exitfunc(exc_type, exc_val, exc_tb):
                    suppressed = True
                    exc_type = None
                    exc_val = None
                    exc_tb = None
            except BaseException:
                exc_type, exc_val, exc_tb = pending = sys.exc_info()
        del self._atexit
        if pending:
            raise exc_val
        return received and suppressed
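
# Illustrative usage (a sketch; 'makelock1', 'makelock2' and 'cleanup' are
# hypothetical, not from the original source):
#
#   with ctxmanager(makelock1, makelock2) as c:
#       lock1, lock2 = c.enter()
#       c.atexit(cleanup)   # runs before lock2 and lock1 are exited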

def _bz2():
    d = bz2.BZ2Decompressor()
    # Bzip2 streams start with BZ, but we stripped it;
    # put it back for good measure.
    d.decompress('BZ')
    return d

decompressors = {None: lambda fh: fh,
                 '_truncatedBZ': _makedecompressor(_bz2),
                 'BZ': _makedecompressor(lambda: bz2.BZ2Decompressor()),
                 'GZ': _makedecompressor(lambda: zlib.decompressobj()),
                 }
# also support the old form as a courtesy
decompressors['UN'] = decompressors[None]
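
# Illustrative usage (a sketch; 'rawfh' is a hypothetical file object, not
# from the original source):
#
#   fh = decompressors['GZ'](rawfh)   # a chunkbuffer yielding inflated data
#   data = fh.read(4096)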

# convenient shortcut
dst = debugstacktrace