upstream/mercurial-mirror Files · mercurial/narrowspec.py

registrar: replace "cmdtype" with an intent-based mechanism (API)...

registrar: replace "cmdtype" with an intent-based mechanism (API) Commands perform varied actions and repositories vary in their capabilities. Historically, the .hg/requires file has been used to lock out clients lacking a requirement. But this is a very heavy-handed approach and is typically reserved for cases where the on-disk storage format changes and we want to prevent incompatible clients from operating on a repo. Outside of the .hg/requires file, we tend to deal with things like optional, extension-provided features via checking at call sites. We'll either have checks in core or extensions will monkeypatch functions in core disabling incompatible features, enabling new features, etc. Things are somewhat tolerable today. But once we introduce alternate storage backends with varying support for repository features and vastly different modes of behavior, the current model will quickly grow unwieldy. For example, the implementation of the "simple store" required a lot of hacks to deal with stripping and verify because various parts of core assume things are implemented a certain way. Partial clone will require new ways of modeling file data retrieval, because we can no longer assume that all file data is already local. In this new world, some commands might not make any sense for certain types of repositories. What we need is a mechanism to affect the construction of repository (and eventually peer) instances so the requirements/capabilities needed for the current operation can be taken into account. "Current operation" can almost certainly be defined by a command. So it makes sense for commands to declare their intended actions. This commit introduces the "intents" concept on the command registrar. "intents" captures a set of strings that declare actions that are anticipated to be taken, requirements the repository must possess, etc. These intents will be passed into hg.repo(), which will pass them into localrepository, where they can be used to influence the object being created. Some use cases for this include: * For read-only intents, constructing a repository object that doesn't expose methods that can mutate the repository. Its VFS instances don't even allow opening a file with write access. * For read-only intents, constructing a repository object without cache invalidation logic. If the repo never changes during its lifetime, nothing ever needs to be invalidated and we don't need to do expensive things like verify the changelog's hidden revisions state is accurate every time we access repo.changelog. * We can automatically hide commands from `hg help` when the current repository doesn't provide that command. For example, an alternate storage backend may not support `hg commit`, so we can hide that command or anything else that would perform local commits. We already kind of had an "intents" mechanism on the registrar in the form of "cmdtype." However, it was never used. And it was limited to a single value. We really need something that supports multiple intents. And because intents may be defined by extensions and at this point are advisory, I think it is best to define them in a set rather than as separate arguments/attributes on the command. Differential Revision: https://phab.mercurial-scm.org/D3376

Martin von Zweigbergk - - Load All Authors

File last commit:

r36488:d851951b default


                r37734:dfc51a48

default

Download file

             narrowspec.py
        
                    199 lines
            
             | 6.8 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / mercurial / narrowspec.py
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      # narrowspec.py - methods for working with a narrow view of a repository

      #

      # Copyright 2017 Google, Inc.

      #

      # This software may be used and distributed according to the terms of the

      # GNU General Public License version 2 or any later version.

      from __future__ import absolute_import

      import errno

      from .i18n import _

      from . import (

          error,

          match as matchmod,

          util,

      )

      FILENAME = 'narrowspec'

      def _parsestoredpatterns(text):

          """Parses the narrowspec format that's stored on disk."""

          patlist = None

          includepats = []

          excludepats = []

          for l in text.splitlines():

              if l == '[includes]':

                  if patlist is None:

                      patlist = includepats

                  else:

                      raise error.Abort(_('narrowspec includes section must appear '

                                          'at most once, before excludes'))

              elif l == '[excludes]':

                  if patlist is not excludepats:

                      patlist = excludepats

                  else:

                      raise error.Abort(_('narrowspec excludes section must appear '

                                          'at most once'))

              else:

                  patlist.append(l)

          return set(includepats), set(excludepats)

      def parseserverpatterns(text):

          """Parses the narrowspec format that's returned by the server."""

          includepats = set()

          excludepats = set()

          # We get one entry per line, in the format "<key> <value>".

          # It's OK for value to contain other spaces.

          for kp in (l.split(' ', 1) for l in text.splitlines()):

              if len(kp) != 2:

                  raise error.Abort(_('Invalid narrowspec pattern line: "%s"') % kp)

              key = kp[0]

              pat = kp[1]

              if key == 'include':

                  includepats.add(pat)

              elif key == 'exclude':

                  excludepats.add(pat)

              else:

                  raise error.Abort(_('Invalid key "%s" in server response') % key)

          return includepats, excludepats

      def normalizesplitpattern(kind, pat):

          """Returns the normalized version of a pattern and kind.

          Returns a tuple with the normalized kind and normalized pattern.

          """

          pat = pat.rstrip('/')

          _validatepattern(pat)

          return kind, pat

      def _numlines(s):

          """Returns the number of lines in s, including ending empty lines."""

          # We use splitlines because it is Unicode-friendly and thus Python 3

          # compatible. However, it does not count empty lines at the end, so trick

          # it by adding a character at the end.

          return len((s + 'x').splitlines())

      def _validatepattern(pat):

          """Validates the pattern and aborts if it is invalid.

          Patterns are stored in the narrowspec as newline-separated

          POSIX-style bytestring paths. There's no escaping.

          """

          # We use newlines as separators in the narrowspec file, so don't allow them

          # in patterns.

          if _numlines(pat) > 1:

              raise error.Abort(_('newlines are not allowed in narrowspec paths'))

          components = pat.split('/')

          if '.' in components or '..' in components:

              raise error.Abort(_('"." and ".." are not allowed in narrowspec paths'))

      def normalizepattern(pattern, defaultkind='path'):

          """Returns the normalized version of a text-format pattern.

          If the pattern has no kind, the default will be added.

          """

          kind, pat = matchmod._patsplit(pattern, defaultkind)

          return '%s:%s' % normalizesplitpattern(kind, pat)

      def parsepatterns(pats):

          """Parses a list of patterns into a typed pattern set."""

          return set(normalizepattern(p) for p in pats)

      def format(includes, excludes):

          output = '[includes]\n'

          for i in sorted(includes - excludes):

              output += i + '\n'

          output += '[excludes]\n'

          for e in sorted(excludes):

              output += e + '\n'

          return output

      def match(root, include=None, exclude=None):

          if not include:

              # Passing empty include and empty exclude to matchmod.match()

              # gives a matcher that matches everything, so explicitly use

              # the nevermatcher.

              return matchmod.never(root, '')

          return matchmod.match(root, '', [], include=include or [],

                                exclude=exclude or [])

      def needsexpansion(includes):

          return [i for i in includes if i.startswith('include:')]

      def load(repo):

          try:

              spec = repo.vfs.read(FILENAME)

          except IOError as e:

              # Treat "narrowspec does not exist" the same as "narrowspec file exists

              # and is empty".

              if e.errno == errno.ENOENT:

                  # Without this the next call to load will use the cached

                  # non-existence of the file, which can cause some odd issues.

                  repo.invalidate(clearfilecache=True)

                  return set(), set()

              raise

          return _parsestoredpatterns(spec)

      def save(repo, includepats, excludepats):

          spec = format(includepats, excludepats)

          repo.vfs.write(FILENAME, spec)

      def restrictpatterns(req_includes, req_excludes, repo_includes, repo_excludes):

          r""" Restricts the patterns according to repo settings,

          results in a logical AND operation

          :param req_includes: requested includes

          :param req_excludes: requested excludes

          :param repo_includes: repo includes

          :param repo_excludes: repo excludes

          :return: include patterns, exclude patterns, and invalid include patterns.

          >>> restrictpatterns({'f1','f2'}, {}, ['f1'], [])

          (set(['f1']), {}, [])

          >>> restrictpatterns({'f1'}, {}, ['f1','f2'], [])

          (set(['f1']), {}, [])

          >>> restrictpatterns({'f1/fc1', 'f3/fc3'}, {}, ['f1','f2'], [])

          (set(['f1/fc1']), {}, [])

          >>> restrictpatterns({'f1_fc1'}, {}, ['f1','f2'], [])

          ([], set(['path:.']), [])

          >>> restrictpatterns({'f1/../f2/fc2'}, {}, ['f1','f2'], [])

          (set(['f2/fc2']), {}, [])

          >>> restrictpatterns({'f1/../f3/fc3'}, {}, ['f1','f2'], [])

          ([], set(['path:.']), [])

          >>> restrictpatterns({'f1/$non_exitent_var'}, {}, ['f1','f2'], [])

          (set(['f1/$non_exitent_var']), {}, [])

          """

          res_excludes = set(req_excludes)

          res_excludes.update(repo_excludes)

          invalid_includes = []

          if not req_includes:

              res_includes = set(repo_includes)

          elif 'path:.' not in repo_includes:

              res_includes = []

              for req_include in req_includes:

                  req_include = util.expandpath(util.normpath(req_include))

                  if req_include in repo_includes:

                      res_includes.append(req_include)

                      continue

                  valid = False

                  for repo_include in repo_includes:

                      if req_include.startswith(repo_include + '/'):

                          valid = True

                          res_includes.append(req_include)

                          break

                  if not valid:

                      invalid_includes.append(req_include)

              if len(res_includes) == 0:

                  res_excludes = {'path:.'}

              else:

                  res_includes = set(res_includes)

          else:

              res_includes = set(req_includes)

          return res_includes, res_excludes, invalid_includes

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				# narrowspec.py - methods for working with a narrow view of a repository
				#
				# Copyright 2017 Google, Inc.
				#
				# This software may be used and distributed according to the terms of the
				# GNU General Public License version 2 or any later version.

				from __future__ import absolute_import

				import errno

				from .i18n import _
				from . import (
				error,
				match as matchmod,
				util,
				)

				FILENAME = 'narrowspec'

				def _parsestoredpatterns(text):
				"""Parses the narrowspec format that's stored on disk."""
				patlist = None
				includepats = []
				excludepats = []
				for l in text.splitlines():
				if l == '[includes]':
				if patlist is None:
				patlist = includepats
				else:
				raise error.Abort(_('narrowspec includes section must appear '
				'at most once, before excludes'))
				elif l == '[excludes]':
				if patlist is not excludepats:
				patlist = excludepats
				else:
				raise error.Abort(_('narrowspec excludes section must appear '
				'at most once'))
				else:
				patlist.append(l)

				return set(includepats), set(excludepats)

				def parseserverpatterns(text):
				"""Parses the narrowspec format that's returned by the server."""
				includepats = set()
				excludepats = set()

				# We get one entry per line, in the format "<key> <value>".
				# It's OK for value to contain other spaces.
				for kp in (l.split(' ', 1) for l in text.splitlines()):
				if len(kp) != 2:
				raise error.Abort(_('Invalid narrowspec pattern line: "%s"') % kp)
				key = kp[0]
				pat = kp[1]
				if key == 'include':
				includepats.add(pat)
				elif key == 'exclude':
				excludepats.add(pat)
				else:
				raise error.Abort(_('Invalid key "%s" in server response') % key)

				return includepats, excludepats

				def normalizesplitpattern(kind, pat):
				"""Returns the normalized version of a pattern and kind.

				Returns a tuple with the normalized kind and normalized pattern.
				"""
				pat = pat.rstrip('/')
				_validatepattern(pat)
				return kind, pat

				def _numlines(s):
				"""Returns the number of lines in s, including ending empty lines."""
				# We use splitlines because it is Unicode-friendly and thus Python 3
				# compatible. However, it does not count empty lines at the end, so trick
				# it by adding a character at the end.
				return len((s + 'x').splitlines())

				def _validatepattern(pat):
				"""Validates the pattern and aborts if it is invalid.

				Patterns are stored in the narrowspec as newline-separated
				POSIX-style bytestring paths. There's no escaping.
				"""

				# We use newlines as separators in the narrowspec file, so don't allow them
				# in patterns.
				if _numlines(pat) > 1:
				raise error.Abort(_('newlines are not allowed in narrowspec paths'))

				components = pat.split('/')
				if '.' in components or '..' in components:
				raise error.Abort(_('"." and ".." are not allowed in narrowspec paths'))

				def normalizepattern(pattern, defaultkind='path'):
				"""Returns the normalized version of a text-format pattern.

				If the pattern has no kind, the default will be added.
				"""
				kind, pat = matchmod._patsplit(pattern, defaultkind)
				return '%s:%s' % normalizesplitpattern(kind, pat)

				def parsepatterns(pats):
				"""Parses a list of patterns into a typed pattern set."""
				return set(normalizepattern(p) for p in pats)

				def format(includes, excludes):
				output = '[includes]\n'
				for i in sorted(includes - excludes):
				output += i + '\n'
				output += '[excludes]\n'
				for e in sorted(excludes):
				output += e + '\n'
				return output

				def match(root, include=None, exclude=None):
				if not include:
				# Passing empty include and empty exclude to matchmod.match()
				# gives a matcher that matches everything, so explicitly use
				# the nevermatcher.
				return matchmod.never(root, '')
				return matchmod.match(root, '', [], include=include or [],
				exclude=exclude or [])

				def needsexpansion(includes):
				return [i for i in includes if i.startswith('include:')]

				def load(repo):
				try:
				spec = repo.vfs.read(FILENAME)
				except IOError as e:
				# Treat "narrowspec does not exist" the same as "narrowspec file exists
				# and is empty".
				if e.errno == errno.ENOENT:
				# Without this the next call to load will use the cached
				# non-existence of the file, which can cause some odd issues.
				repo.invalidate(clearfilecache=True)
				return set(), set()
				raise
				return _parsestoredpatterns(spec)

				def save(repo, includepats, excludepats):
				spec = format(includepats, excludepats)
				repo.vfs.write(FILENAME, spec)

				def restrictpatterns(req_includes, req_excludes, repo_includes, repo_excludes):
				r""" Restricts the patterns according to repo settings,
				results in a logical AND operation

				:param req_includes: requested includes
				:param req_excludes: requested excludes
				:param repo_includes: repo includes
				:param repo_excludes: repo excludes
				:return: include patterns, exclude patterns, and invalid include patterns.

				>>> restrictpatterns({'f1','f2'}, {}, ['f1'], [])
				(set(['f1']), {}, [])
				>>> restrictpatterns({'f1'}, {}, ['f1','f2'], [])
				(set(['f1']), {}, [])
				>>> restrictpatterns({'f1/fc1', 'f3/fc3'}, {}, ['f1','f2'], [])
				(set(['f1/fc1']), {}, [])
				>>> restrictpatterns({'f1_fc1'}, {}, ['f1','f2'], [])
				([], set(['path:.']), [])
				>>> restrictpatterns({'f1/../f2/fc2'}, {}, ['f1','f2'], [])
				(set(['f2/fc2']), {}, [])
				>>> restrictpatterns({'f1/../f3/fc3'}, {}, ['f1','f2'], [])
				([], set(['path:.']), [])
				>>> restrictpatterns({'f1/$non_exitent_var'}, {}, ['f1','f2'], [])
				(set(['f1/$non_exitent_var']), {}, [])
				"""
				res_excludes = set(req_excludes)
				res_excludes.update(repo_excludes)
				invalid_includes = []
				if not req_includes:
				res_includes = set(repo_includes)
				elif 'path:.' not in repo_includes:
				res_includes = []
				for req_include in req_includes:
				req_include = util.expandpath(util.normpath(req_include))
				if req_include in repo_includes:
				res_includes.append(req_include)
				continue
				valid = False
				for repo_include in repo_includes:
				if req_include.startswith(repo_include + '/'):
				valid = True
				res_includes.append(req_include)
				break
				if not valid:
				invalid_includes.append(req_include)
				if len(res_includes) == 0:
				res_excludes = {'path:.'}
				else:
				res_includes = set(res_includes)
				else:
				res_includes = set(req_includes)
				return res_includes, res_excludes, invalid_includes