upstream/mercurial-mirror Files · contrib/perf-utils/discovery-helper.sh

copies: move from a copy on branchpoint to a copy on write approach...

copies: move from a copy on branchpoint to a copy on write approach Before this changes, any branch points results in a copy of the dictionary containing the copy information. This can be very costly for branchy history with few rename information. Instead, we take a "copy on write" approach. Copying the input data only when we are about to update them. In practice we where already doing the copying in half of these case (because `_chain` makes a copy), so we don't add a significant cost here even in the linear case. However the speed up in branchy case is very significant. Here are some timing on the pypy repository. revision: large amount; added files: large amount; rename small amount; 9ba6ab77fd29 before: ! wall 1.399863 comb 1.400000 user 1.370000 sys 0.030000 (median of 10) after: ! wall 0.766453 comb 0.770000 user 0.750000 sys 0.020000 (median of 11) revision: large amount; added files: small amount; rename small amount; f650a9b140d2 before: ! wall 1.876748 comb 1.890000 user 1.870000 sys 0.020000 (median of 10) after: ! wall 1.167223 comb 1.170000 user 1.150000 sys 0.020000 (median of 10) revision: large amount; added files: large amount; rename large amount; d9fa043f30c0 before: ! wall 0.242457 comb 0.240000 user 0.240000 sys 0.000000 (median of 39) after: ! wall 0.211476 comb 0.210000 user 0.210000 sys 0.000000 (median of 45) revision: small amount; added files: large amount; rename large amount; a83dc6a2d56f before: ! wall 0.013193 comb 0.020000 user 0.020000 sys 0.000000 (median of 224) after: ! wall 0.013290 comb 0.010000 user 0.010000 sys 0.000000 (median of 222) revision: small amount; added files: large amount; rename small amount; 169138063d63 before: ! wall 0.001673 comb 0.000000 user 0.000000 sys 0.000000 (median of 1000) after: ! wall 0.001677 comb 0.000000 user 0.000000 sys 0.000000 (median of 1000) revision: small amount; added files: small amount; rename small amount; 964879152e2e before: ! wall 0.000119 comb 0.000000 user 0.000000 sys 0.000000 (median of 8023) after: ! wall 0.000119 comb 0.000000 user 0.000000 sys 0.000000 (median of 7997) revision: medium amount; added files: large amount; rename medium amount; 2c68e87c3efe before: ! wall 0.201898 comb 0.210000 user 0.200000 sys 0.010000 (median of 48) after: ! wall 0.167415 comb 0.170000 user 0.160000 sys 0.010000 (median of 58) revision: medium amount; added files: medium amount; rename small amount; d7746d32bf9d before: ! wall 0.036820 comb 0.040000 user 0.040000 sys 0.000000 (median of 100) after: ! wall 0.035797 comb 0.040000 user 0.040000 sys 0.000000 (median of 100) The extra cost in the linear case can be reclaimed later with some extra logic. Differential Revision: https://phab.mercurial-scm.org/D7124

marmoute - - Load All Authors

File last commit:

r42094:cae3f7e3 default


                r43594:ffd04bc9

default

Download file

             discovery-helper.sh
        
                    107 lines
            
             | 2.6 KiB
            
                | application/x-sh
            
             |
                BashLexer
            
             / contrib / perf-utils / discovery-helper.sh
          
                    History
                
                 |
                  Annotation
                 | Raw
                 |Copy content
                 |Copy permalink

      #!/bin/bash

      #

      # produces two repositories with different common and missing subsets

      #

      #   $ discovery-helper.sh REPO NBHEADS DEPT

      #

      # The Goal is to produce two repositories with some common part and some

      # exclusive part on each side. Provide a source repository REPO, it will

      # produce two repositories REPO-left and REPO-right.

      #

      # Each repository will be missing some revisions exclusive to NBHEADS of the

      # repo topological heads. These heads and revisions exclusive to them (up to

      # DEPTH depth) are stripped.

      #

      # The "left" repository will use the NBHEADS first heads (sorted by

      # description). The "right" use the last NBHEADS one.

      #

      # To find out how many topological heads a repo has, use:

      #

      #   $ hg heads -t -T '{rev}\n' | wc -l

      #

      # Example:

      #

      #  The `pypy-2018-09-01` repository has 192 heads. To produce two repositories

      #  with 92 common heads and ~50 exclusive heads on each side.

      #

      #    $ ./discovery-helper.sh pypy-2018-08-01 50 10

      set -euo pipefail

      printusage () {

           echo "usage: `basename $0` REPO NBHEADS DEPTH [left|right]" >&2

      }

      if [ $# -lt 3 ]; then

          printusage

          exit 64

      fi

      repo="$1"

      shift

      nbheads="$1"

      shift

      depth="$1"

      shift

      doleft=1

      doright=1

      if [ $# -gt 1 ]; then

          printusage

          exit 64

      elif [ $# -eq 1 ]; then

          if [ "$1" == "left" ]; then

              doleft=1

              doright=0

          elif [ "$1" == "right" ]; then

              doleft=0

              doright=1

          else

              printusage

              exit 64

          fi

      fi

      leftrepo="${repo}-${nbheads}h-${depth}d-left"

      rightrepo="${repo}-${nbheads}h-${depth}d-right"

      left="first(sort(heads(all()), 'desc'), $nbheads)"

      right="last(sort(heads(all()), 'desc'), $nbheads)"

      leftsubset="ancestors($left, $depth) and only($left, heads(all() - $left))"

      rightsubset="ancestors($right, $depth) and only($right, heads(all() - $right))"

      echo '### creating left/right repositories with missing changesets:'

      if [ $doleft -eq 1 ]; then

          echo '# left  revset:' '"'${leftsubset}'"'

      fi

      if [ $doright -eq 1 ]; then

          echo '# right revset:' '"'${rightsubset}'"'

      fi

      buildone() {

          side="$1"

          dest="$2"

          revset="$3"

          echo "### building $side repository: $dest"

          if [ -e "$dest" ]; then

              echo "destination repo already exists: $dest" >&2

              exit 1

          fi

          echo '# cloning'

          if ! cp --recursive --reflink=always ${repo} ${dest}; then

              hg clone --noupdate "${repo}" "${dest}"

          fi

          echo '# stripping' '"'${revset}'"'

          hg -R "${dest}" --config extensions.strip= strip --rev "$revset" --no-backup

      }

      if [ $doleft -eq 1 ]; then

          buildone left "$leftrepo" "$leftsubset"

      fi

      if [ $doright -eq 1 ]; then

          buildone right "$rightrepo" "$rightsubset"

      fi

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages

				#!/bin/bash
				#
				# produces two repositories with different common and missing subsets
				#
				# $ discovery-helper.sh REPO NBHEADS DEPT
				#
				# The Goal is to produce two repositories with some common part and some
				# exclusive part on each side. Provide a source repository REPO, it will
				# produce two repositories REPO-left and REPO-right.
				#
				# Each repository will be missing some revisions exclusive to NBHEADS of the
				# repo topological heads. These heads and revisions exclusive to them (up to
				# DEPTH depth) are stripped.
				#
				# The "left" repository will use the NBHEADS first heads (sorted by
				# description). The "right" use the last NBHEADS one.
				#
				# To find out how many topological heads a repo has, use:
				#
				# $ hg heads -t -T '{rev}\n' \| wc -l
				#
				# Example:
				#
				# The `pypy-2018-09-01` repository has 192 heads. To produce two repositories
				# with 92 common heads and ~50 exclusive heads on each side.
				#
				# $ ./discovery-helper.sh pypy-2018-08-01 50 10

				set -euo pipefail

				printusage () {
				echo "usage: `basename $0` REPO NBHEADS DEPTH [left\|right]" >&2
				}

				if [ $# -lt 3 ]; then
				printusage
				exit 64
				fi

				repo="$1"
				shift

				nbheads="$1"
				shift

				depth="$1"
				shift

				doleft=1
				doright=1
				if [ $# -gt 1 ]; then
				printusage
				exit 64
				elif [ $# -eq 1 ]; then
				if [ "$1" == "left" ]; then
				doleft=1
				doright=0
				elif [ "$1" == "right" ]; then
				doleft=0
				doright=1
				else
				printusage
				exit 64
				fi
				fi

				leftrepo="${repo}-${nbheads}h-${depth}d-left"
				rightrepo="${repo}-${nbheads}h-${depth}d-right"

				left="first(sort(heads(all()), 'desc'), $nbheads)"
				right="last(sort(heads(all()), 'desc'), $nbheads)"

				leftsubset="ancestors($left, $depth) and only($left, heads(all() - $left))"
				rightsubset="ancestors($right, $depth) and only($right, heads(all() - $right))"

				echo '### creating left/right repositories with missing changesets:'
				if [ $doleft -eq 1 ]; then
				echo '# left revset:' '"'${leftsubset}'"'
				fi
				if [ $doright -eq 1 ]; then
				echo '# right revset:' '"'${rightsubset}'"'
				fi

				buildone() {
				side="$1"
				dest="$2"
				revset="$3"
				echo "### building $side repository: $dest"
				if [ -e "$dest" ]; then
				echo "destination repo already exists: $dest" >&2
				exit 1
				fi
				echo '# cloning'
				if ! cp --recursive --reflink=always ${repo} ${dest}; then
				hg clone --noupdate "${repo}" "${dest}"
				fi
				echo '# stripping' '"'${revset}'"'
				hg -R "${dest}" --config extensions.strip= strip --rev "$revset" --no-backup
				}

				if [ $doleft -eq 1 ]; then
				buildone left "$leftrepo" "$leftsubset"
				fi

				if [ $doright -eq 1 ]; then
				buildone right "$rightrepo" "$rightsubset"
				fi