upstream/mercurial-mirror Files · mercurial/hgweb/wsgicgi.py

demandimport: replace more references to _demandmod instances...

demandimport: replace more references to _demandmod instances _demandmod instances may be referenced by multiple importing modules. Before this patch, the _demandmod instance only maintained a reference to its first consumer when using the "from X import Y" syntax. This is because we only created a single _demandmod instance (attached to the parent X module). If multiple modules A and B performed "from X import Y", we'd produce a single _demandmod instance "demandmod" with the following references: X.Y = <demandmod> A.Y = <demandmod> B.Y = <demandmod> The locals from the first consumer (A) would be stored in <demandmod1>. When <demandmod1> was loaded, we'd look at the locals for the first consumer and replace the symbol, if necessary. This resulted in state: X.Y = <module> A.Y = <module> B.Y = <demandmod> B's reference to Y wasn't updated and was still using the proxy object because we just didn't record that B had a reference to <demandmod> that needed updating! With this patch, we add support for tracking which modules in addition to the initial importer have a reference to the _demandmod instance and we replace those references at module load time. In the case of posix.py, this fixes an issue where the "encoding" module was being proxied, resulting in hundreds of thousands of __getattribute__ lookups on the _demandmod instance during dirstate operations on mozilla-central, speeding up execution by many milliseconds. There are likely several other operation that benefit from this change as well. The new mechanism isn't perfect: references in locals (not globals) may likely linger. So, if there is an import inside a function and a symbol from that module is used in a hot loop, we could have unwanted overhead from proxying through _demandmod. Non-global imports are discouraged anyway. So hopefully this isn't a big deal in practice. We could potentially deploy a code checker that bans use of attribute lookups of function-level-imported modules inside loops. This deficiency in theory could be avoided by storing the set of globals and locals dicts to update in the _demandmod instance. However, I tried this and it didn't work. One reason is that some globals are _demandmod instances. We could work around this, but it's a bit more work. There also might be other module import foo at play. The solution as implemented is better than what we had and IMO is good enough for the time being. It's worth noting that this sub-optimal behavior was made worse by the introduction of absolute_import and its recommended "from . import X" syntax for importing modules from the "mercurial" package. If we ever wrote performance tests, measuring the amount of module imports and __getattribute__ proxy calls through _demandmod instances would be something I'd have it check.

Mads Kiilerich - - Load All Authors

File last commit:

r18552:e8efcc8f stable


                r26457:7e813050

default

Download file

             wsgicgi.py
        
                    83 lines
            
             | 2.7 KiB
            
                | text/x-python
            
             |
                PythonLexer
            
             / mercurial / hgweb / wsgicgi.py
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      # hgweb/wsgicgi.py - CGI->WSGI translator

      #

      # Copyright 2006 Eric Hopper <hopper@omnifarious.org>

      #

      # This software may be used and distributed according to the terms of the

      # GNU General Public License version 2 or any later version.

      #

      # This was originally copied from the public domain code at

      # http://www.python.org/dev/peps/pep-0333/#the-server-gateway-side

      import os, sys

      from mercurial import util

      from mercurial.hgweb import common

      def launch(application):

          util.setbinary(sys.stdin)

          util.setbinary(sys.stdout)

          environ = dict(os.environ.iteritems())

          environ.setdefault('PATH_INFO', '')

          if environ.get('SERVER_SOFTWARE', '').startswith('Microsoft-IIS'):

              # IIS includes script_name in PATH_INFO

              scriptname = environ['SCRIPT_NAME']

              if environ['PATH_INFO'].startswith(scriptname):

                  environ['PATH_INFO'] = environ['PATH_INFO'][len(scriptname):]

          stdin = sys.stdin

          if environ.get('HTTP_EXPECT', '').lower() == '100-continue':

              stdin = common.continuereader(stdin, sys.stdout.write)

          environ['wsgi.input'] = stdin

          environ['wsgi.errors'] = sys.stderr

          environ['wsgi.version'] = (1, 0)

          environ['wsgi.multithread'] = False

          environ['wsgi.multiprocess'] = True

          environ['wsgi.run_once'] = True

          if environ.get('HTTPS', 'off').lower() in ('on', '1', 'yes'):

              environ['wsgi.url_scheme'] = 'https'

          else:

              environ['wsgi.url_scheme'] = 'http'

          headers_set = []

          headers_sent = []

          out = sys.stdout

          def write(data):

              if not headers_set:

                  raise AssertionError("write() before start_response()")

              elif not headers_sent:

                  # Before the first output, send the stored headers

                  status, response_headers = headers_sent[:] = headers_set

                  out.write('Status: %s\r\n' % status)

                  for header in response_headers:

                      out.write('%s: %s\r\n' % header)

                  out.write('\r\n')

              out.write(data)

              out.flush()

          def start_response(status, response_headers, exc_info=None):

              if exc_info:

                  try:

                      if headers_sent:

                          # Re-raise original exception if headers sent

                          raise exc_info[0](exc_info[1], exc_info[2])

                  finally:

                      exc_info = None     # avoid dangling circular ref

              elif headers_set:

                  raise AssertionError("Headers already set!")

              headers_set[:] = [status, response_headers]

              return write

          content = application(environ, start_response)

          try:

              for chunk in content:

                  write(chunk)

              if not headers_sent:

                  write('')   # send headers now if body was empty

          finally:

              getattr(content, 'close', lambda : None)()

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				# hgweb/wsgicgi.py - CGI->WSGI translator
				#
				# Copyright 2006 Eric Hopper <hopper@omnifarious.org>
				#
				# This software may be used and distributed according to the terms of the
				# GNU General Public License version 2 or any later version.
				#
				# This was originally copied from the public domain code at
				# http://www.python.org/dev/peps/pep-0333/#the-server-gateway-side

				import os, sys
				from mercurial import util
				from mercurial.hgweb import common

				def launch(application):
				util.setbinary(sys.stdin)
				util.setbinary(sys.stdout)

				environ = dict(os.environ.iteritems())
				environ.setdefault('PATH_INFO', '')
				if environ.get('SERVER_SOFTWARE', '').startswith('Microsoft-IIS'):
				# IIS includes script_name in PATH_INFO
				scriptname = environ['SCRIPT_NAME']
				if environ['PATH_INFO'].startswith(scriptname):
				environ['PATH_INFO'] = environ['PATH_INFO'][len(scriptname):]

				stdin = sys.stdin
				if environ.get('HTTP_EXPECT', '').lower() == '100-continue':
				stdin = common.continuereader(stdin, sys.stdout.write)

				environ['wsgi.input'] = stdin
				environ['wsgi.errors'] = sys.stderr
				environ['wsgi.version'] = (1, 0)
				environ['wsgi.multithread'] = False
				environ['wsgi.multiprocess'] = True
				environ['wsgi.run_once'] = True

				if environ.get('HTTPS', 'off').lower() in ('on', '1', 'yes'):
				environ['wsgi.url_scheme'] = 'https'
				else:
				environ['wsgi.url_scheme'] = 'http'

				headers_set = []
				headers_sent = []
				out = sys.stdout

				def write(data):
				if not headers_set:
				raise AssertionError("write() before start_response()")

				elif not headers_sent:
				# Before the first output, send the stored headers
				status, response_headers = headers_sent[:] = headers_set
				out.write('Status: %s\r\n' % status)
				for header in response_headers:
				out.write('%s: %s\r\n' % header)
				out.write('\r\n')

				out.write(data)
				out.flush()

				def start_response(status, response_headers, exc_info=None):
				if exc_info:
				try:
				if headers_sent:
				# Re-raise original exception if headers sent
				raise exc_info[0](exc_info[1], exc_info[2])
				finally:
				exc_info = None # avoid dangling circular ref
				elif headers_set:
				raise AssertionError("Headers already set!")

				headers_set[:] = [status, response_headers]
				return write

				content = application(environ, start_response)
				try:
				for chunk in content:
				write(chunk)
				if not headers_sent:
				write('') # send headers now if body was empty
				finally:
				getattr(content, 'close', lambda : None)()