upstream/mercurial-mirror Files · mercurial/diffhelper.py

manifest: avoid corruption by dropping removed files with pure (issue5801)...

manifest: avoid corruption by dropping removed files with pure (issue5801) Previously, removed files would simply be marked by overwriting the first byte with NUL and dropping their entry in `self.position`. But no effort was made to ignore them when compacting the dictionary into text form. This allowed them to slip into the manifest revision, since the code seems to be trying to minimize the string operations by copying as large a chunk as possible. As part of this, compact() walks the existing text based on entries in the `positions` list, and consumed everything up to the next position entry. This typically resulted in a ValueError complaining about unsorted manifest entries. Sometimes it seems that files do get dropped in large repos- it seems to correspond to there being a new entry that would take the same slot. A much more trivial problem is that if the only changes were removals, `_compact()` didn't even run because `__delitem__` doesn't add anything to `self.extradata`. Now there's an explicit variable to flag this, both to allow `_compact()` to run, and to avoid searching the manifest in cases where there are no removals. In practice, this behavior was mostly obscured by the check in fastdelta() which takes a different path that explicitly drops removed files if there are fewer than 1000 changes. However, timeless has a repo where after rebasing tens of commits, a totally different path[1] is taken that bypasses the change count check and hits this problem. [1] https://www.mercurial-scm.org/repo/hg/file/2338bdea4474/mercurial/manifest.py#l1511

Gregory Szorc - - Load All Authors

File last commit:

r38806:e7aa113b default


                r42569:0546ead3

stable

Download file

             diffhelper.py
        
                    78 lines
            
             | 2.2 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / mercurial / diffhelper.py
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      # diffhelper.py - helper routines for patch

      #

      # Copyright 2009 Matt Mackall <mpm@selenic.com> and others

      #

      # This software may be used and distributed according to the terms of the

      # GNU General Public License version 2 or any later version.

      from __future__ import absolute_import

      from .i18n import _

      from . import (

          error,

          pycompat,

      )

      def addlines(fp, hunk, lena, lenb, a, b):

          """Read lines from fp into the hunk

          The hunk is parsed into two arrays, a and b. a gets the old state of

          the text, b gets the new state. The control char from the hunk is saved

          when inserting into a, but not b (for performance while deleting files.)

          """

          while True:

              todoa = lena - len(a)

              todob = lenb - len(b)

              num = max(todoa, todob)

              if num == 0:

                  break

              for i in pycompat.xrange(num):

                  s = fp.readline()

                  if not s:

                      raise error.ParseError(_('incomplete hunk'))

                  if s == "\\ No newline at end of file\n":

                      fixnewline(hunk, a, b)

                      continue

                  if s == '\n' or s == '\r\n':

                      # Some patches may be missing the control char

                      # on empty lines. Supply a leading space.

                      s = ' ' + s

                  hunk.append(s)

                  if s.startswith('+'):

                      b.append(s[1:])

                  elif s.startswith('-'):

                      a.append(s)

                  else:

                      b.append(s[1:])

                      a.append(s)

      def fixnewline(hunk, a, b):

          """Fix up the last lines of a and b when the patch has no newline at EOF"""

          l = hunk[-1]

          # tolerate CRLF in last line

          if l.endswith('\r\n'):

              hline = l[:-2]

          else:

              hline = l[:-1]

          if hline.startswith((' ', '+')):

              b[-1] = hline[1:]

          if hline.startswith((' ', '-')):

              a[-1] = hline

          hunk[-1] = hline

      def testhunk(a, b, bstart):

          """Compare the lines in a with the lines in b

          a is assumed to have a control char at the start of each line, this char

          is ignored in the compare.

          """

          alen = len(a)

          blen = len(b)

          if alen > blen - bstart or bstart < 0:

              return False

          for i in pycompat.xrange(alen):

              if a[i][1:] != b[i + bstart]:

                  return False

          return True

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				# diffhelper.py - helper routines for patch
				#
				# Copyright 2009 Matt Mackall <mpm@selenic.com> and others
				#
				# This software may be used and distributed according to the terms of the
				# GNU General Public License version 2 or any later version.

				from __future__ import absolute_import

				from .i18n import _

				from . import (
				error,
				pycompat,
				)

				def addlines(fp, hunk, lena, lenb, a, b):
				"""Read lines from fp into the hunk

				The hunk is parsed into two arrays, a and b. a gets the old state of
				the text, b gets the new state. The control char from the hunk is saved
				when inserting into a, but not b (for performance while deleting files.)
				"""
				while True:
				todoa = lena - len(a)
				todob = lenb - len(b)
				num = max(todoa, todob)
				if num == 0:
				break
				for i in pycompat.xrange(num):
				s = fp.readline()
				if not s:
				raise error.ParseError(_('incomplete hunk'))
				if s == "\\ No newline at end of file\n":
				fixnewline(hunk, a, b)
				continue
				if s == '\n' or s == '\r\n':
				# Some patches may be missing the control char
				# on empty lines. Supply a leading space.
				s = ' ' + s
				hunk.append(s)
				if s.startswith('+'):
				b.append(s[1:])
				elif s.startswith('-'):
				a.append(s)
				else:
				b.append(s[1:])
				a.append(s)

				def fixnewline(hunk, a, b):
				"""Fix up the last lines of a and b when the patch has no newline at EOF"""
				l = hunk[-1]
				# tolerate CRLF in last line
				if l.endswith('\r\n'):
				hline = l[:-2]
				else:
				hline = l[:-1]

				if hline.startswith((' ', '+')):
				b[-1] = hline[1:]
				if hline.startswith((' ', '-')):
				a[-1] = hline
				hunk[-1] = hline

				def testhunk(a, b, bstart):
				"""Compare the lines in a with the lines in b

				a is assumed to have a control char at the start of each line, this char
				is ignored in the compare.
				"""
				alen = len(a)
				blen = len(b)
				if alen > blen - bstart or bstart < 0:
				return False
				for i in pycompat.xrange(alen):
				if a[i][1:] != b[i + bstart]:
				return False
				return True