upstream/mercurial-mirror Files · mercurial/i18n.py


parsers: inline fields of dirstate values in C version

Previously, while unpacking the dirstate we'd create 3-4 new CPython objects for most dirstate values:

- the state is a single character string, which is pooled by CPython
- the mode is a new object if it isn't 0 due to being in the lookup set
- the size is a new object if it is greater than 255
- the mtime is a new object if it isn't -1 due to being in the lookup set
- the tuple to contain them all

In some cases such as regular hg status, we actually look at all the objects. In other cases like hg add, hg status for a subdirectory, or hg status with the third-party hgwatchman enabled, we look at almost none of the objects.

This patch eliminates most object creation in these cases by defining a custom C struct that is exposed to Python with an interface similar to a tuple. Only when tuple elements are actually requested are the respective objects created.

The gains, where they're expected, are significant. The following tests are run against a working copy with over 270,000 files.

parse_dirstate becomes significantly faster:

$ hg perfdirstate
before: wall 0.186437 comb 0.180000 user 0.160000 sys 0.020000 (best of 35)
after:  wall 0.093158 comb 0.100000 user 0.090000 sys 0.010000 (best of 95)

and as a result, several commands benefit:

$ time hg status  # with hgwatchman enabled
before: 0.42s user 0.14s system 99% cpu 0.563 total
after:  0.34s user 0.12s system 99% cpu 0.471 total

$ time hg add new-file
before: 0.85s user 0.18s system 99% cpu 1.033 total
after:  0.76s user 0.17s system 99% cpu 0.931 total

There is a slight regression in regular status performance, but this is fixed in an upcoming patch.
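
The commit message describes a tuple-like object whose elements are only materialized when they are requested. The patch itself does this with a C struct; the following is only a rough Python 3 sketch of the same idea, not the code from the patch. The class name `LazyEntry`, the field order, and the 13-byte record layout are assumptions made up for illustration and are not claimed to match the real dirstate format.

```python
import struct

# Hypothetical record layout, for illustration only:
# 1-byte state, then mode, size and mtime as big-endian signed 32-bit ints.
_FIELDS = (
    (struct.Struct(">c"), 0),  # state
    (struct.Struct(">i"), 1),  # mode
    (struct.Struct(">i"), 5),  # size
    (struct.Struct(">i"), 9),  # mtime
)


class LazyEntry:
    """Tuple-like view over one packed record; fields decode on access."""

    __slots__ = ("_buf", "_offset")

    def __init__(self, buf, offset=0):
        # Construction only stores a reference and an offset;
        # no field objects are created here.
        self._buf = buf
        self._offset = offset

    def __len__(self):
        return len(_FIELDS)

    def __getitem__(self, index):
        # Only the requested field is decoded. An out-of-range index
        # raises IndexError from the tuple lookup, like a real tuple.
        fmt, rel = _FIELDS[index]
        return fmt.unpack_from(self._buf, self._offset + rel)[0]


raw = struct.pack(">ciii", b"n", 0o100644, 1234, -1)
entry = LazyEntry(raw)
size = entry[2]  # decodes just the size field
```

A caller that never indexes `entry` pays only for the small wrapper object, which is the saving the commit message attributes to commands such as hg add. This sketch supports integer indexing and iteration but not slicing.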

Matt Mackall

File last commit: r21746:2d47d81c default

r21809:e250b830 default

i18n.py | 67 lines | 2.2 KiB | text/x-python

Martin Geisler: put license and copyright info into comment blocks (r8226)
    # i18n.py - internationalization support for mercurial
    #
    # Copyright 2005, 2006 Matt Mackall <mpm@selenic.com>
    #
    # This software may be used and distributed according to the terms of the
Matt Mackall: Update license to GPLv2+ (r10263)
    # GNU General Public License version 2 or any later version.
Benoit Boissinot: i18n first part: make '_' available for files who need it (r1400)

Simon Heimberg: separate import lines from mercurial and general python modules (r8312)
    import encoding
    import gettext, sys, os

Martin Geisler: i18n: lookup .mo files in private locale/ directory... (r7650)
    # modelled after templater.templatepath:
Augie Fackler: i18n: use getattr instead of hasattr... (r14975)
    if getattr(sys, 'frozen', None) is not None:
Martin Geisler: i18n: lookup .mo files in private locale/ directory... (r7650)
        module = sys.executable
    else:
        module = __file__

    base = os.path.dirname(module)
    for dir in ('.', '..'):
Martin Geisler: i18n: remove unnecessary os.path.normpath call (r9538)
        localedir = os.path.join(base, dir, 'locale')
Martin Geisler: i18n: lookup .mo files in private locale/ directory... (r7650)
        if os.path.isdir(localedir):
            break

    t = gettext.translation('hg', localedir, fallback=True)

Martin Geisler: i18n: encode output in user's local encoding... (r7651)
    def gettext(message):
        """Translate message.

        The message is looked up in the catalog to get a Unicode string,
        which is encoded in the local encoding before being returned.

        Important: message is restricted to characters in the encoding
        given by sys.getdefaultencoding() which is most likely 'ascii'.
        """
        # If message is None, t.ugettext will return u'None' as the
        # translation whereas our callers expect us to return None.
        if message is None:
            return message

Matt Mackall: i18n: explicitly decode paragraphs... (r21746)
        if type(message) is unicode:
            # goofy unicode docstrings in test
            paragraphs = message.split(u'\n\n')
        else:
            paragraphs = [p.decode("ascii") for p in message.split('\n\n')]
Martin Geisler: i18n: fix translation of empty paragraphs (r11403)
        # Be careful not to translate the empty string -- it holds the
        # meta data of the .po file.
        u = u'\n\n'.join([p and t.ugettext(p) or '' for p in paragraphs])
Martin Geisler: i18n: encode output in user's local encoding... (r7651)
        try:
Martin Geisler: i18n: updated outdated comment (r9319)
            # encoding.tolocal cannot be used since it will first try to
            # decode the Unicode string. Calling u.decode(enc) really
            # means u.encode(sys.getdefaultencoding()).decode(enc). Since
            # the Python encoding defaults to 'ascii', this fails if the
            # translated string use non-ASCII characters.
Matt Mackall: move encoding bits from util to encoding... (r7948)
            return u.encode(encoding.encoding, "replace")
Martin Geisler: i18n: encode output in user's local encoding... (r7651)
        except LookupError:
Martin Geisler: i18n: move unrelated line out of try-except block (r9320)
            # An unknown encoding results in a LookupError.
Martin Geisler: i18n: encode output in user's local encoding... (r7651)
            return message

Brodie Rao: HGPLAIN: allow exceptions to plain mode, like i18n, via HGPLAINEXCEPT... (r13849)
    def _plain():
        if 'HGPLAIN' not in os.environ and 'HGPLAINEXCEPT' not in os.environ:
            return False
        exceptions = os.environ.get('HGPLAINEXCEPT', '').strip().split(',')
        return 'i18n' not in exceptions

    if _plain():
Brodie Rao: ui: add HGPLAIN environment variable for easier scripting... (r10455)
        _ = lambda message: message
    else:
        _ = gettext

