@@ -0,0 +1,127 @@
====================
Mercurial Automation
====================

This directory contains code and utilities for building and testing Mercurial
on remote machines.

The ``automation.py`` Script
============================

``automation.py`` is an executable Python script (requires Python 3.5+)
that serves as a driver for common automation tasks.

When executed, the script will *bootstrap* a virtualenv in
``<source-root>/build/venv-automation`` and then re-execute itself using
that virtualenv, so the caller does not need to have a virtualenv
explicitly activated. This virtualenv is populated with various
dependencies (as defined by the ``requirements.txt`` file).

To see what you can do with this script, simply run it::

   $ ./automation.py

Local State
===========

By default, local state required to interact with remote servers is stored
in the ``~/.hgautomation`` directory.

We attempt to limit persistent state to this directory. Even when
performing tasks that may have side-effects, we try to limit those
side-effects so they don't impact the local system. For example, when we
SSH into a remote machine, we create a temporary directory for the SSH
config so the user's known hosts file isn't updated.

AWS Integration
===============

Various automation tasks integrate with AWS to provide access to
resources such as EC2 instances for generic compute.

This obviously requires an AWS account and credentials to work.

We use the ``boto3`` library for interacting with AWS APIs. We do not employ
any special functionality for telling ``boto3`` where to find AWS credentials.
See https://boto3.amazonaws.com/v1/documentation/api/latest/guide/configuration.html
for how ``boto3`` works. Once you have configured your environment such
that ``boto3`` can find credentials, interaction with AWS should *just work*.

.. hint::

   Typically you have a ``~/.aws/credentials`` file containing AWS
   credentials. If you manage multiple credentials, you can override which
   *profile* to use at run-time by setting the ``AWS_PROFILE`` environment
   variable.

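For example, assuming your ``~/.aws/credentials`` defines a profile named
``hg`` (an illustrative name; nothing in this code requires it), you could
run::

   $ AWS_PROFILE=hg ./automation.py terminate-ec2-instances
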

Resource Management
-------------------

Depending on the task being performed, various AWS services will be accessed.
This of course requires AWS credentials with permissions to access these
services.

The following AWS services can be accessed by automation tasks:

* EC2
* IAM
* Simple Systems Manager (SSM)

Various resources will also be created as part of performing various tasks.
This also requires various permissions.

The following AWS resources can be created by automation tasks:

* EC2 key pairs
* EC2 security groups
* EC2 instances
* IAM roles and instance profiles
* SSM command invocations

When possible, we prefix resource names with ``hg-`` so they can easily
be identified as belonging to Mercurial.

.. important::

   We currently assume that AWS accounts utilized by *us* are single
   tenancy. Attempts to have discrete users of ``automation.py`` (including
   sharing credentials across machines) using the same AWS account can result
   in them interfering with each other and things breaking.

Cost of Operation
-----------------

``automation.py`` tries to be frugal with regard to utilization of remote
resources. Persistent remote resources are minimized in order to keep costs
in check. For example, EC2 instances are often ephemeral and only live as long
as the operation being performed.

Under normal operation, recurring costs are limited to:

* Storage costs for AMI / EBS snapshots. This should be just a few pennies
  per month.

When running EC2 instances, you'll be billed accordingly. By default, we
use *small* instances, like ``t3.medium``. This instance type costs ~$0.07 per
hour.

.. note::

   When running Windows EC2 instances, AWS bills at the full hourly cost, even
   if the instance doesn't run for a full hour (per-second billing doesn't
   apply to Windows AMIs).

Managing Remote Resources
-------------------------

Occasionally, there may be an error purging a temporary resource, or you
may wish to forcefully purge remote state. Commands can be invoked to manually
purge remote resources.

To terminate all EC2 instances that we manage::

   $ automation.py terminate-ec2-instances

To purge all EC2 resources that we manage::

   $ automation.py purge-ec2-resources
@@ -0,0 +1,70 @@
#!/usr/bin/env python3
#
# automation.py - Perform tasks on remote machines
#
# Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com>
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.

import os
import pathlib
import subprocess
import sys
import venv


HERE = pathlib.Path(os.path.abspath(__file__)).parent
REQUIREMENTS_TXT = HERE / 'requirements.txt'
SOURCE_DIR = HERE.parent.parent
VENV = SOURCE_DIR / 'build' / 'venv-automation'


def bootstrap():
    venv_created = not VENV.exists()

    VENV.parent.mkdir(exist_ok=True)

    venv.create(VENV, with_pip=True)

    if os.name == 'nt':
        venv_bin = VENV / 'Scripts'
        pip = venv_bin / 'pip.exe'
        python = venv_bin / 'python.exe'
    else:
        venv_bin = VENV / 'bin'
        pip = venv_bin / 'pip'
        python = venv_bin / 'python'

    args = [str(pip), 'install', '-r', str(REQUIREMENTS_TXT),
            '--disable-pip-version-check']

    if not venv_created:
        args.append('-q')

    subprocess.run(args, check=True)

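    # Tell the re-executed process (see __main__ below) that bootstrapping
    # already happened, so it runs the CLI instead of bootstrapping again.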
    os.environ['HGAUTOMATION_BOOTSTRAPPED'] = '1'
    os.environ['PATH'] = '%s%s%s' % (
        venv_bin, os.pathsep, os.environ['PATH'])

    subprocess.run([str(python), __file__] + sys.argv[1:], check=True)


def run():
    import hgautomation.cli as cli

    cli.main()


if __name__ == '__main__':
    try:
        if 'HGAUTOMATION_BOOTSTRAPPED' not in os.environ:
            bootstrap()
        else:
            run()
    except subprocess.CalledProcessError as e:
        sys.exit(e.returncode)
    except KeyboardInterrupt:
        sys.exit(1)
@@ -0,0 +1,59 @@
# __init__.py - High-level automation interfaces
#
# Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com>
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.

# no-check-code because Python 3 native.

import pathlib
import secrets

from .aws import (
    AWSConnection,
)


class HGAutomation:
    """High-level interface for Mercurial automation.

    Holds global state, provides access to other primitives, etc.
    """

    def __init__(self, state_path: pathlib.Path):
        self.state_path = state_path

        state_path.mkdir(exist_ok=True)

    def default_password(self):
        """Obtain the default password to use for remote machines.

        A new password will be generated if one is not stored.
        """
        p = self.state_path / 'default-password'

        try:
            with p.open('r', encoding='ascii') as fh:
                data = fh.read().strip()

                if data:
                    return data

        except FileNotFoundError:
            pass

        password = secrets.token_urlsafe(24)

        with p.open('w', encoding='ascii') as fh:
            fh.write(password)
            fh.write('\n')

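        # The password is a secret, so make the file readable only by the
        # owner.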
        p.chmod(0o0600)

        return password

    def aws_connection(self, region: str):
        """Obtain an AWSConnection instance bound to a specific region."""

        return AWSConnection(self, region)
@@ -0,0 +1,879 @@
# aws.py - Automation code for Amazon Web Services
#
# Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com>
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.

# no-check-code because Python 3 native.

import contextlib
import copy
import hashlib
import json
import os
import pathlib
import subprocess
import time

import boto3
import botocore.exceptions

from .winrm import (
    run_powershell,
    wait_for_winrm,
)


SOURCE_ROOT = pathlib.Path(os.path.abspath(__file__)).parent.parent.parent.parent

INSTALL_WINDOWS_DEPENDENCIES = (SOURCE_ROOT / 'contrib' /
                                'install-windows-dependencies.ps1')


KEY_PAIRS = {
    'automation',
}

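# Security groups (and their ingress rules) that automation ensures exist.
# Ports opened to the Internet: 22 (SSH), 3389 (RDP), and 5985-5986 (WinRM).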
SECURITY_GROUPS = {
    'windows-dev-1': {
        'description': 'Mercurial Windows instances that perform build automation',
        'ingress': [
            {
                'FromPort': 22,
                'ToPort': 22,
                'IpProtocol': 'tcp',
                'IpRanges': [
                    {
                        'CidrIp': '0.0.0.0/0',
                        'Description': 'SSH from entire Internet',
                    },
                ],
            },
            {
                'FromPort': 3389,
                'ToPort': 3389,
                'IpProtocol': 'tcp',
                'IpRanges': [
                    {
                        'CidrIp': '0.0.0.0/0',
                        'Description': 'RDP from entire Internet',
                    },
                ],
            },
            {
                'FromPort': 5985,
                'ToPort': 5986,
                'IpProtocol': 'tcp',
                'IpRanges': [
                    {
                        'CidrIp': '0.0.0.0/0',
                        'Description': 'PowerShell Remoting (Windows Remote Management)',
                    },
                ],
            }
        ],
    },
}

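# IAM roles that automation ensures exist. The attached SSM policy is what
# allows the EC2 instances to be driven via Simple Systems Manager.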
IAM_ROLES = {
    'ephemeral-ec2-role-1': {
        'description': 'Mercurial temporary EC2 instances',
        'policy_arns': [
            'arn:aws:iam::aws:policy/service-role/AmazonEC2RoleforSSM',
        ],
    },
}


ASSUME_ROLE_POLICY_DOCUMENT = '''
{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {
                "Service": "ec2.amazonaws.com"
            },
            "Action": "sts:AssumeRole"
        }
    ]
}
'''.strip()


IAM_INSTANCE_PROFILES = {
    'ephemeral-ec2-1': {
        'roles': [
            'ephemeral-ec2-role-1',
        ],
    }
}


# User Data for Windows EC2 instance. Mainly used to set the password
# and configure WinRM.
# Inspired by the User Data script used by Packer
# (from https://www.packer.io/intro/getting-started/build-image.html).
WINDOWS_USER_DATA = r'''
<powershell>

# TODO enable this once we figure out what is failing.
#$ErrorActionPreference = "stop"

# Set administrator password
net user Administrator "%s"
wmic useraccount where "name='Administrator'" set PasswordExpires=FALSE

# First, make sure WinRM can't be connected to
netsh advfirewall firewall set rule name="Windows Remote Management (HTTP-In)" new enable=yes action=block

# Delete any existing WinRM listeners
winrm delete winrm/config/listener?Address=*+Transport=HTTP 2>$Null
winrm delete winrm/config/listener?Address=*+Transport=HTTPS 2>$Null

# Create a new WinRM listener and configure
winrm create winrm/config/listener?Address=*+Transport=HTTP
winrm set winrm/config/winrs '@{MaxMemoryPerShellMB="0"}'
winrm set winrm/config '@{MaxTimeoutms="7200000"}'
winrm set winrm/config/service '@{AllowUnencrypted="true"}'
winrm set winrm/config/service '@{MaxConcurrentOperationsPerUser="12000"}'
winrm set winrm/config/service/auth '@{Basic="true"}'
winrm set winrm/config/client/auth '@{Basic="true"}'

# Configure UAC to allow privilege elevation in remote shells
$Key = 'HKLM:\SOFTWARE\Microsoft\Windows\CurrentVersion\Policies\System'
$Setting = 'LocalAccountTokenFilterPolicy'
Set-ItemProperty -Path $Key -Name $Setting -Value 1 -Force

# Configure and restart the WinRM Service; Enable the required firewall exception
Stop-Service -Name WinRM
Set-Service -Name WinRM -StartupType Automatic
netsh advfirewall firewall set rule name="Windows Remote Management (HTTP-In)" new action=allow localip=any remoteip=any
Start-Service -Name WinRM

# Disable firewall on private network interfaces so prompts don't appear.
Set-NetFirewallProfile -Name private -Enabled false
</powershell>
'''.lstrip()


WINDOWS_BOOTSTRAP_POWERSHELL = '''
Write-Output "installing PowerShell dependencies"
Install-PackageProvider -Name NuGet -MinimumVersion 2.8.5.201 -Force
Set-PSRepository -Name PSGallery -InstallationPolicy Trusted
Install-Module -Name OpenSSHUtils -RequiredVersion 0.0.2.0

Write-Output "installing OpenSSH server"
Add-WindowsCapability -Online -Name OpenSSH.Server~~~~0.0.1.0
# Various tools will attempt to use older versions of .NET. So we enable
# the feature that provides them so it doesn't have to be auto-enabled
# later.
Write-Output "enabling .NET Framework feature"
Install-WindowsFeature -Name Net-Framework-Core
'''


class AWSConnection:
    """Manages the state of a connection with AWS."""

    def __init__(self, automation, region: str):
        self.automation = automation
        self.local_state_path = automation.state_path

        self.prefix = 'hg-'

        self.session = boto3.session.Session(region_name=region)
        self.ec2client = self.session.client('ec2')
        self.ec2resource = self.session.resource('ec2')
        self.iamclient = self.session.client('iam')
        self.iamresource = self.session.resource('iam')

        ensure_key_pairs(automation.state_path, self.ec2resource)

        self.security_groups = ensure_security_groups(self.ec2resource)
        ensure_iam_state(self.iamresource)

    def key_pair_path_private(self, name):
        """Path to a key pair private key file."""
        return self.local_state_path / 'keys' / ('keypair-%s' % name)

    def key_pair_path_public(self, name):
        return self.local_state_path / 'keys' / ('keypair-%s.pub' % name)


def rsa_key_fingerprint(p: pathlib.Path):
    """Compute the fingerprint of an RSA private key."""

    # TODO use rsa package.
    res = subprocess.run(
        ['openssl', 'pkcs8', '-in', str(p), '-nocrypt', '-topk8',
         '-outform', 'DER'],
        capture_output=True,
        check=True)

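    # For key pairs created by EC2, the reported fingerprint is the SHA-1
    # digest of the DER-encoded (PKCS#8) private key, rendered as
    # colon-delimited hex pairs. Compute the same form locally so the
    # values compare equal.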
    sha1 = hashlib.sha1(res.stdout).hexdigest()
    return ':'.join(a + b for a, b in zip(sha1[::2], sha1[1::2]))


def ensure_key_pairs(state_path: pathlib.Path, ec2resource, prefix='hg-'):
    remote_existing = {}

    for kpi in ec2resource.key_pairs.all():
        if kpi.name.startswith(prefix):
            remote_existing[kpi.name[len(prefix):]] = kpi.key_fingerprint

    # Validate that we have these keys locally.
    key_path = state_path / 'keys'
    key_path.mkdir(exist_ok=True, mode=0o700)

    def remove_remote(name):
        print('deleting key pair %s' % name)
        key = ec2resource.KeyPair(name)
        key.delete()

    def remove_local(name):
        pub_full = key_path / ('keypair-%s.pub' % name)
        priv_full = key_path / ('keypair-%s' % name)

        print('removing %s' % pub_full)
        pub_full.unlink()
        print('removing %s' % priv_full)
        priv_full.unlink()

    local_existing = {}

    for f in sorted(os.listdir(key_path)):
        if not f.startswith('keypair-') or not f.endswith('.pub'):
            continue

        name = f[len('keypair-'):-len('.pub')]

        pub_full = key_path / f
        priv_full = key_path / ('keypair-%s' % name)

        with open(pub_full, 'r', encoding='ascii') as fh:
            data = fh.read()

        if not data.startswith('ssh-rsa '):
            print('unexpected format for key pair file: %s; removing' %
                  pub_full)
            pub_full.unlink()
            priv_full.unlink()
            continue

        local_existing[name] = rsa_key_fingerprint(priv_full)

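    # Reconcile the two sides: a key that is missing on either side or
    # whose fingerprints disagree is purged from both so it can be
    # recreated from scratch below.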
    for name in sorted(set(remote_existing) | set(local_existing)):
        if name not in local_existing:
            actual = '%s%s' % (prefix, name)
            print('remote key %s does not exist locally' % name)
            remove_remote(actual)
            del remote_existing[name]

        elif name not in remote_existing:
            print('local key %s does not exist remotely' % name)
            remove_local(name)
            del local_existing[name]

        elif remote_existing[name] != local_existing[name]:
            print('key fingerprint mismatch for %s; '
                  'removing from local and remote' % name)
            remove_local(name)
            remove_remote('%s%s' % (prefix, name))
            del local_existing[name]
            del remote_existing[name]

    missing = KEY_PAIRS - set(remote_existing)

    for name in sorted(missing):
        actual = '%s%s' % (prefix, name)
        print('creating key pair %s' % actual)

        priv_full = key_path / ('keypair-%s' % name)
        pub_full = key_path / ('keypair-%s.pub' % name)

        kp = ec2resource.create_key_pair(KeyName=actual)

        with priv_full.open('w', encoding='ascii') as fh:
            fh.write(kp.key_material)
            fh.write('\n')

        priv_full.chmod(0o0600)

        # SSH public key can be extracted via `ssh-keygen`.
        with pub_full.open('w', encoding='ascii') as fh:
            subprocess.run(
                ['ssh-keygen', '-y', '-f', str(priv_full)],
                stdout=fh,
                check=True)

        pub_full.chmod(0o0600)


def delete_instance_profile(profile):
    for role in profile.roles:
        print('removing role %s from instance profile %s' % (role.name,
                                                             profile.name))
        profile.remove_role(RoleName=role.name)

    print('deleting instance profile %s' % profile.name)
    profile.delete()


def ensure_iam_state(iamresource, prefix='hg-'):
    """Ensure IAM state is in sync with our canonical definition."""

    remote_profiles = {}

    for profile in iamresource.instance_profiles.all():
        if profile.name.startswith(prefix):
            remote_profiles[profile.name[len(prefix):]] = profile

    for name in sorted(set(remote_profiles) - set(IAM_INSTANCE_PROFILES)):
        delete_instance_profile(remote_profiles[name])
        del remote_profiles[name]

    remote_roles = {}

    for role in iamresource.roles.all():
        if role.name.startswith(prefix):
            remote_roles[role.name[len(prefix):]] = role

    for name in sorted(set(remote_roles) - set(IAM_ROLES)):
        role = remote_roles[name]

        print('removing role %s' % role.name)
        role.delete()
        del remote_roles[name]

    # We've purged remote state that doesn't belong. Create missing
    # instance profiles and roles.
    for name in sorted(set(IAM_INSTANCE_PROFILES) - set(remote_profiles)):
        actual = '%s%s' % (prefix, name)
        print('creating IAM instance profile %s' % actual)

        profile = iamresource.create_instance_profile(
            InstanceProfileName=actual)
        remote_profiles[name] = profile

    for name in sorted(set(IAM_ROLES) - set(remote_roles)):
        entry = IAM_ROLES[name]

        actual = '%s%s' % (prefix, name)
        print('creating IAM role %s' % actual)

        role = iamresource.create_role(
            RoleName=actual,
            Description=entry['description'],
            AssumeRolePolicyDocument=ASSUME_ROLE_POLICY_DOCUMENT,
        )

        remote_roles[name] = role

        for arn in entry['policy_arns']:
            print('attaching policy %s to %s' % (arn, role.name))
            role.attach_policy(PolicyArn=arn)

    # Now reconcile state of profiles.
    for name, meta in sorted(IAM_INSTANCE_PROFILES.items()):
        profile = remote_profiles[name]
        wanted = {'%s%s' % (prefix, role) for role in meta['roles']}
        have = {role.name for role in profile.roles}

        for role in sorted(have - wanted):
            print('removing role %s from %s' % (role, profile.name))
            profile.remove_role(RoleName=role)

        for role in sorted(wanted - have):
            print('adding role %s to %s' % (role, profile.name))
            profile.add_role(RoleName=role)


def find_windows_server_2019_image(ec2resource):
    """Find the Amazon published Windows Server 2019 base image."""

    images = ec2resource.images.filter(
        Filters=[
            {
                'Name': 'owner-alias',
                'Values': ['amazon'],
            },
            {
                'Name': 'state',
                'Values': ['available'],
            },
            {
                'Name': 'image-type',
                'Values': ['machine'],
            },
            {
                'Name': 'name',
                'Values': ['Windows_Server-2019-English-Full-Base-2019.02.13'],
            },
        ])

    for image in images:
        return image

    raise Exception('unable to find Windows Server 2019 image')


def ensure_security_groups(ec2resource, prefix='hg-'):
    """Ensure all necessary Mercurial security groups are present.

    All security groups are prefixed with ``hg-`` by default. Any security
    groups having this prefix but that aren't in our list are deleted.
    """
    existing = {}

    for group in ec2resource.security_groups.all():
        if group.group_name.startswith(prefix):
            existing[group.group_name[len(prefix):]] = group

    purge = set(existing) - set(SECURITY_GROUPS)

    for name in sorted(purge):
        group = existing[name]
        print('removing legacy security group: %s' % group.group_name)
        group.delete()

    security_groups = {}

    for name, group in sorted(SECURITY_GROUPS.items()):
        if name in existing:
            security_groups[name] = existing[name]
            continue

        actual = '%s%s' % (prefix, name)
        print('adding security group %s' % actual)

        group_res = ec2resource.create_security_group(
            Description=group['description'],
            GroupName=actual,
        )

        group_res.authorize_ingress(
            IpPermissions=group['ingress'],
        )

        security_groups[name] = group_res

    return security_groups


def terminate_ec2_instances(ec2resource, prefix='hg-'):
    """Terminate all EC2 instances managed by us."""
    waiting = []

    for instance in ec2resource.instances.all():
        if instance.state['Name'] == 'terminated':
            continue

        for tag in instance.tags or []:
            if tag['Key'] == 'Name' and tag['Value'].startswith(prefix):
                print('terminating %s' % instance.id)
                instance.terminate()
                waiting.append(instance)

    for instance in waiting:
        instance.wait_until_terminated()


def remove_resources(c, prefix='hg-'):
    """Purge all of our resources in this EC2 region."""
    ec2resource = c.ec2resource
    iamresource = c.iamresource

    terminate_ec2_instances(ec2resource, prefix=prefix)

    for image in ec2resource.images.all():
        if image.name.startswith(prefix):
            remove_ami(ec2resource, image)

    for group in ec2resource.security_groups.all():
        if group.group_name.startswith(prefix):
            print('removing security group %s' % group.group_name)
            group.delete()

    for profile in iamresource.instance_profiles.all():
        if profile.name.startswith(prefix):
            delete_instance_profile(profile)

    for role in iamresource.roles.all():
        if role.name.startswith(prefix):
            print('removing role %s' % role.name)
            role.delete()


def wait_for_ip_addresses(instances):
    """Wait for the public IP addresses of an iterable of instances."""
    for instance in instances:
        while True:
            if not instance.public_ip_address:
                time.sleep(2)
                instance.reload()
                continue

            print('public IP address for %s: %s' % (
                instance.id, instance.public_ip_address))
            break


def remove_ami(ec2resource, image):
    """Remove an AMI and its underlying snapshots."""
    snapshots = []

    for device in image.block_device_mappings:
        if 'Ebs' in device:
            snapshots.append(ec2resource.Snapshot(device['Ebs']['SnapshotId']))

    print('deregistering %s' % image.id)
    image.deregister()

    for snapshot in snapshots:
        print('deleting snapshot %s' % snapshot.id)
        snapshot.delete()


def wait_for_ssm(ssmclient, instances):
    """Wait for SSM to come online for an iterable of instances."""
    while True:
        res = ssmclient.describe_instance_information(
            Filters=[
                {
                    'Key': 'InstanceIds',
                    'Values': [i.id for i in instances],
                },
            ],
        )

        available = len(res['InstanceInformationList'])
        wanted = len(instances)

        print('%d/%d instances available in SSM' % (available, wanted))

        if available == wanted:
            return

        time.sleep(2)


def run_ssm_command(ssmclient, instances, document_name, parameters):
    """Run an SSM command (e.g. a PowerShell script) on EC2 instances and
    wait for it to complete on all of them."""

    res = ssmclient.send_command(
        InstanceIds=[i.id for i in instances],
        DocumentName=document_name,
        Parameters=parameters,
        CloudWatchOutputConfig={
            'CloudWatchOutputEnabled': True,
        },
    )

    command_id = res['Command']['CommandId']

    for instance in instances:
        while True:
            try:
                res = ssmclient.get_command_invocation(
                    CommandId=command_id,
                    InstanceId=instance.id,
                )
            except botocore.exceptions.ClientError as e:
                if e.response['Error']['Code'] == 'InvocationDoesNotExist':
                    print('could not find SSM command invocation; waiting')
                    time.sleep(1)
                    continue
                else:
                    raise

            if res['Status'] == 'Success':
                break
            elif res['Status'] in ('Pending', 'InProgress', 'Delayed'):
                time.sleep(2)
            else:
                raise Exception('command failed on %s: %s' % (
                    instance.id, res['Status']))


@contextlib.contextmanager
def temporary_ec2_instances(ec2resource, config):
    """Create temporary EC2 instances.

    This is a proxy to ``ec2resource.create_instances(**config)`` that takes
    care of managing the lifecycle of the instances.

    When the context manager exits, the instances are terminated.

    The context manager evaluates to the list of data structures
    describing each created instance. The instances may not be available
    for work immediately: it is up to the caller to wait for the instance
    to start responding.
    """

    ids = None

    try:
        res = ec2resource.create_instances(**config)

        ids = [i.id for i in res]
        print('started instances: %s' % ' '.join(ids))

        yield res
    finally:
        if ids:
            print('terminating instances: %s' % ' '.join(ids))
            for instance in res:
                instance.terminate()
            print('terminated %d instances' % len(ids))


@contextlib.contextmanager
def create_temp_windows_ec2_instances(c: AWSConnection, config):
    """Create temporary Windows EC2 instances.

    This is a higher-level wrapper around ``temporary_ec2_instances()`` that
    configures the Windows instances for Windows Remote Management. The
    emitted instances will have a ``winrm_client`` attribute containing a
    ``pypsrp.client.Client`` instance bound to the instance.
    """
    if 'IamInstanceProfile' in config:
        raise ValueError('IamInstanceProfile cannot be provided in config')
    if 'UserData' in config:
        raise ValueError('UserData cannot be provided in config')

    password = c.automation.default_password()

    config = copy.deepcopy(config)
    config['IamInstanceProfile'] = {
        'Name': 'hg-ephemeral-ec2-1',
    }
    config.setdefault('TagSpecifications', []).append({
        'ResourceType': 'instance',
        'Tags': [{'Key': 'Name', 'Value': 'hg-temp-windows'}],
    })
    config['UserData'] = WINDOWS_USER_DATA % password

    with temporary_ec2_instances(c.ec2resource, config) as instances:
        wait_for_ip_addresses(instances)

        print('waiting for Windows Remote Management service...')

        for instance in instances:
            client = wait_for_winrm(instance.public_ip_address,
                                    'Administrator', password)
            print('established WinRM connection to %s' % instance.id)
            instance.winrm_client = client

        yield instances


def ensure_windows_dev_ami(c: AWSConnection, prefix='hg-'):
    """Ensure Windows Development AMI is available and up-to-date.

    If necessary, a modern AMI will be built by starting a temporary EC2
    instance and bootstrapping it.

    Obsolete AMIs will be deleted so there is only a single AMI having the
    desired name.

    Returns an ``ec2.Image`` of either an existing AMI or a newly-built
    one.
    """
    ec2client = c.ec2client
    ec2resource = c.ec2resource
    ssmclient = c.session.client('ssm')

    name = '%s%s' % (prefix, 'windows-dev')

    config = {
        'BlockDeviceMappings': [
            {
                'DeviceName': '/dev/sda1',
                'Ebs': {
                    'DeleteOnTermination': True,
                    'VolumeSize': 32,
                    'VolumeType': 'gp2',
                },
            }
        ],
        'ImageId': find_windows_server_2019_image(ec2resource).id,
        'InstanceInitiatedShutdownBehavior': 'stop',
        'InstanceType': 't3.medium',
        'KeyName': '%sautomation' % prefix,
        'MaxCount': 1,
        'MinCount': 1,
        'SecurityGroupIds': [c.security_groups['windows-dev-1'].id],
    }

    commands = [
        # Need to start the service so sshd_config is generated.
        'Start-Service sshd',
        'Write-Output "modifying sshd_config"',
        r'$content = Get-Content C:\ProgramData\ssh\sshd_config',
        '$content = $content -replace "Match Group administrators","" -replace "AuthorizedKeysFile __PROGRAMDATA__/ssh/administrators_authorized_keys",""',
        r'$content | Set-Content C:\ProgramData\ssh\sshd_config',
        'Import-Module OpenSSHUtils',
        r'Repair-SshdConfigPermission C:\ProgramData\ssh\sshd_config -Confirm:$false',
        'Restart-Service sshd',
        'Write-Output "installing OpenSSH client"',
        'Add-WindowsCapability -Online -Name OpenSSH.Client~~~~0.0.1.0',
        'Set-Service -Name sshd -StartupType "Automatic"',
        'Write-Output "OpenSSH server running"',
    ]

    with INSTALL_WINDOWS_DEPENDENCIES.open('r', encoding='utf-8') as fh:
        commands.extend(l.rstrip() for l in fh)

    # Disable Windows Defender when bootstrapping because it just slows
    # things down.
    commands.insert(0, 'Set-MpPreference -DisableRealtimeMonitoring $true')
    commands.append('Set-MpPreference -DisableRealtimeMonitoring $false')

    # Compute a deterministic fingerprint to determine whether image needs
    # to be regenerated.
    fingerprint = {
        'instance_config': config,
        'user_data': WINDOWS_USER_DATA,
        'initial_bootstrap': WINDOWS_BOOTSTRAP_POWERSHELL,
        'bootstrap_commands': commands,
    }

    fingerprint = json.dumps(fingerprint, sort_keys=True)
    fingerprint = hashlib.sha256(fingerprint.encode('utf-8')).hexdigest()

    # Find existing AMIs with this name and delete the ones that are invalid.
    # Store a reference to a good image so it can be returned once the
    # image state is reconciled.
    images = ec2resource.images.filter(
        Filters=[{'Name': 'name', 'Values': [name]}])

    existing_image = None

    for image in images:
        if image.tags is None:
            print('image %s for %s lacks required tags; removing' % (
                image.id, image.name))
            remove_ami(ec2resource, image)
        else:
            tags = {t['Key']: t['Value'] for t in image.tags}

            if tags.get('HGIMAGEFINGERPRINT') == fingerprint:
                existing_image = image
            else:
                print('image %s for %s has wrong fingerprint; removing' % (
                    image.id, image.name))
                remove_ami(ec2resource, image)

    if existing_image:
        return existing_image

    print('no suitable Windows development image found; creating one...')

    with create_temp_windows_ec2_instances(c, config) as instances:
        assert len(instances) == 1
        instance = instances[0]

        wait_for_ssm(ssmclient, [instance])

        # On first boot, install various Windows updates.
        # We would ideally use PowerShell Remoting for this. However, there
        # are trust issues that make it difficult to invoke Windows Update
        # remotely. So we use SSM, which has a mechanism for running Windows
        # Update.
        print('installing Windows features...')
        run_ssm_command(
            ssmclient,
            [instance],
            'AWS-RunPowerShellScript',
            {
                'commands': WINDOWS_BOOTSTRAP_POWERSHELL.split('\n'),
            },
        )

        # Reboot so all updates are fully applied.
        print('rebooting instance %s' % instance.id)
        ec2client.reboot_instances(InstanceIds=[instance.id])

        time.sleep(15)

        print('waiting for Windows Remote Management to come back...')
        client = wait_for_winrm(instance.public_ip_address, 'Administrator',
                                c.automation.default_password())
        print('established WinRM connection to %s' % instance.id)
        instance.winrm_client = client

        print('bootstrapping instance...')
        run_powershell(instance.winrm_client, '\n'.join(commands))

        print('bootstrap completed; stopping %s to create image' % instance.id)
        instance.stop()

        ec2client.get_waiter('instance_stopped').wait(
            InstanceIds=[instance.id],
            WaiterConfig={
                'Delay': 5,
            })
        print('%s is stopped' % instance.id)

        image = instance.create_image(
            Name=name,
            Description='Mercurial Windows development environment',
        )

        image.create_tags(Tags=[
            {
                'Key': 'HGIMAGEFINGERPRINT',
                'Value': fingerprint,
            },
        ])

        print('waiting for image %s' % image.id)

        ec2client.get_waiter('image_available').wait(
            ImageIds=[image.id],
        )

        print('image %s available as %s' % (image.id, image.name))

        return image


@contextlib.contextmanager
def temporary_windows_dev_instances(c: AWSConnection, image, instance_type,
                                    prefix='hg-', disable_antivirus=False):
    """Create temporary Windows development EC2 instances.

    The context manager resolves to the list of ``EC2.Instance`` that were
    created.
    """
    config = {
        'BlockDeviceMappings': [
            {
                'DeviceName': '/dev/sda1',
                'Ebs': {
                    'DeleteOnTermination': True,
                    'VolumeSize': 32,
                    'VolumeType': 'gp2',
                },
            }
        ],
        'ImageId': image.id,
        'InstanceInitiatedShutdownBehavior': 'stop',
        'InstanceType': instance_type,
        'KeyName': '%sautomation' % prefix,
        'MaxCount': 1,
        'MinCount': 1,
        'SecurityGroupIds': [c.security_groups['windows-dev-1'].id],
    }

    with create_temp_windows_ec2_instances(c, config) as instances:
        if disable_antivirus:
            for instance in instances:
                run_powershell(
                    instance.winrm_client,
                    'Set-MpPreference -DisableRealtimeMonitoring $true')

        yield instances
@@ -0,0 +1,273 @@
# cli.py - Command line interface for automation
#
# Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com>
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.

# no-check-code because Python 3 native.

import argparse
import os
import pathlib

from . import (
    aws,
    HGAutomation,
    windows,
)


SOURCE_ROOT = pathlib.Path(os.path.abspath(__file__)).parent.parent.parent.parent
DIST_PATH = SOURCE_ROOT / 'dist'


def bootstrap_windows_dev(hga: HGAutomation, aws_region):
    c = hga.aws_connection(aws_region)
    image = aws.ensure_windows_dev_ami(c)
    print('Windows development AMI available as %s' % image.id)


def build_inno(hga: HGAutomation, aws_region, arch, revision, version):
    c = hga.aws_connection(aws_region)
    image = aws.ensure_windows_dev_ami(c)
    DIST_PATH.mkdir(exist_ok=True)

    with aws.temporary_windows_dev_instances(c, image, 't3.medium') as insts:
        instance = insts[0]

        windows.synchronize_hg(SOURCE_ROOT, revision, instance)

        for a in arch:
            windows.build_inno_installer(instance.winrm_client, a,
                                         DIST_PATH,
                                         version=version)


def build_wix(hga: HGAutomation, aws_region, arch, revision, version):
    c = hga.aws_connection(aws_region)
    image = aws.ensure_windows_dev_ami(c)
    DIST_PATH.mkdir(exist_ok=True)

    with aws.temporary_windows_dev_instances(c, image, 't3.medium') as insts:
        instance = insts[0]

        windows.synchronize_hg(SOURCE_ROOT, revision, instance)

        for a in arch:
            windows.build_wix_installer(instance.winrm_client, a,
                                        DIST_PATH, version=version)


def build_windows_wheel(hga: HGAutomation, aws_region, arch, revision):
    c = hga.aws_connection(aws_region)
    image = aws.ensure_windows_dev_ami(c)
    DIST_PATH.mkdir(exist_ok=True)

    with aws.temporary_windows_dev_instances(c, image, 't3.medium') as insts:
        instance = insts[0]

        windows.synchronize_hg(SOURCE_ROOT, revision, instance)

        for a in arch:
            windows.build_wheel(instance.winrm_client, a, DIST_PATH)


def build_all_windows_packages(hga: HGAutomation, aws_region, revision):
    c = hga.aws_connection(aws_region)
    image = aws.ensure_windows_dev_ami(c)
    DIST_PATH.mkdir(exist_ok=True)

    with aws.temporary_windows_dev_instances(c, image, 't3.medium') as insts:
        instance = insts[0]

        winrm_client = instance.winrm_client

        windows.synchronize_hg(SOURCE_ROOT, revision, instance)

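        # Purge the remote checkout between builds so artifacts from one
        # configuration don't leak into the next.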
        for arch in ('x86', 'x64'):
            windows.purge_hg(winrm_client)
            windows.build_wheel(winrm_client, arch, DIST_PATH)
            windows.purge_hg(winrm_client)
            windows.build_inno_installer(winrm_client, arch, DIST_PATH)
            windows.purge_hg(winrm_client)
            windows.build_wix_installer(winrm_client, arch, DIST_PATH)


def terminate_ec2_instances(hga: HGAutomation, aws_region):
    c = hga.aws_connection(aws_region)
    aws.terminate_ec2_instances(c.ec2resource)


def purge_ec2_resources(hga: HGAutomation, aws_region):
    c = hga.aws_connection(aws_region)
    aws.remove_resources(c)


def run_tests_windows(hga: HGAutomation, aws_region, instance_type,
                      python_version, arch, test_flags):
    c = hga.aws_connection(aws_region)
    image = aws.ensure_windows_dev_ami(c)

    with aws.temporary_windows_dev_instances(c, image, instance_type,
                                             disable_antivirus=True) as insts:
        instance = insts[0]

        windows.synchronize_hg(SOURCE_ROOT, '.', instance)
        windows.run_tests(instance.winrm_client, python_version, arch,
                          test_flags)


def get_parser():
    parser = argparse.ArgumentParser()

    parser.add_argument(
        '--state-path',
        default='~/.hgautomation',
        help='Path for local state files',
    )
    parser.add_argument(
        '--aws-region',
        help='AWS region to use',
        default='us-west-1',
    )

    subparsers = parser.add_subparsers()

    sp = subparsers.add_parser(
        'bootstrap-windows-dev',
        help='Bootstrap the Windows development environment',
    )
    sp.set_defaults(func=bootstrap_windows_dev)

    sp = subparsers.add_parser(
        'build-all-windows-packages',
        help='Build all Windows packages',
    )
    sp.add_argument(
        '--revision',
        help='Mercurial revision to build',
        default='.',
    )
    sp.set_defaults(func=build_all_windows_packages)

    sp = subparsers.add_parser(
        'build-inno',
        help='Build Inno Setup installer(s)',
    )
    sp.add_argument(
        '--arch',
        help='Architecture to build for',
        choices={'x86', 'x64'},
        nargs='*',
        default=['x64'],
    )
    sp.add_argument(
        '--revision',
        help='Mercurial revision to build',
        default='.',
    )
    sp.add_argument(
        '--version',
        help='Mercurial version string to use in installer',
    )
    sp.set_defaults(func=build_inno)

    sp = subparsers.add_parser(
        'build-windows-wheel',
        help='Build Windows wheel(s)',
    )
    sp.add_argument(
        '--arch',
        help='Architecture to build for',
        choices={'x86', 'x64'},
        nargs='*',
        default=['x64'],
    )
    sp.add_argument(
        '--revision',
        help='Mercurial revision to build',
        default='.',
    )
    sp.set_defaults(func=build_windows_wheel)

    sp = subparsers.add_parser(
        'build-wix',
        help='Build WiX installer(s)',
    )
    sp.add_argument(
        '--arch',
        help='Architecture to build for',
        choices={'x86', 'x64'},
        nargs='*',
        default=['x64'],
    )
    sp.add_argument(
        '--revision',
        help='Mercurial revision to build',
        default='.',
    )
    sp.add_argument(
        '--version',
        help='Mercurial version string to use in installer',
    )
    sp.set_defaults(func=build_wix)

    sp = subparsers.add_parser(
        'terminate-ec2-instances',
        help='Terminate all active EC2 instances managed by us',
    )
    sp.set_defaults(func=terminate_ec2_instances)

    sp = subparsers.add_parser(
        'purge-ec2-resources',
        help='Purge all EC2 resources managed by us',
    )
    sp.set_defaults(func=purge_ec2_resources)

    sp = subparsers.add_parser(
        'run-tests-windows',
        help='Run tests on Windows',
    )
    sp.add_argument(
        '--instance-type',
        help='EC2 instance type to use',
        default='t3.medium',
    )
    sp.add_argument(
        '--python-version',
        help='Python version to use',
        choices={'2.7', '3.5', '3.6', '3.7', '3.8'},
        default='2.7',
    )
    sp.add_argument(
        '--arch',
        help='Architecture to test',
        choices={'x86', 'x64'},
        default='x64',
    )
    sp.add_argument(
        '--test-flags',
        help='Extra command line flags to pass to run-tests.py',
    )
    sp.set_defaults(func=run_tests_windows)

    return parser


def main():
    parser = get_parser()
    args = parser.parse_args()

    local_state_path = pathlib.Path(os.path.expanduser(args.state_path))
    automation = HGAutomation(local_state_path)

    if not hasattr(args, 'func'):
        parser.print_help()
        return

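    # Forward the remaining parsed arguments as keyword arguments to the
    # selected subcommand function.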
    kwargs = dict(vars(args))
    del kwargs['func']
    del kwargs['state_path']

    args.func(automation, **kwargs)
@@ -0,0 +1,287 @@
# windows.py - Automation specific to Windows
#
# Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com>
#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.

# no-check-code because Python 3 native.

import os
import pathlib
import re
import subprocess
import tempfile

from .winrm import (
    run_powershell,
)


# PowerShell commands to activate a Visual Studio 2008 environment.
# This is essentially a port of vcvarsall.bat to PowerShell.
ACTIVATE_VC9_AMD64 = r'''
Write-Output "activating Visual Studio 2008 environment for AMD64"
$root = "$env:LOCALAPPDATA\Programs\Common\Microsoft\Visual C++ for Python\9.0"
$Env:VCINSTALLDIR = "${root}\VC\"
$Env:WindowsSdkDir = "${root}\WinSDK\"
$Env:PATH = "${root}\VC\Bin\amd64;${root}\WinSDK\Bin\x64;${root}\WinSDK\Bin;$Env:PATH"
$Env:INCLUDE = "${root}\VC\Include;${root}\WinSDK\Include;$Env:INCLUDE"
$Env:LIB = "${root}\VC\Lib\amd64;${root}\WinSDK\Lib\x64;$Env:LIB"
$Env:LIBPATH = "${root}\VC\Lib\amd64;${root}\WinSDK\Lib\x64;$Env:LIBPATH"
'''.lstrip()

ACTIVATE_VC9_X86 = r'''
Write-Output "activating Visual Studio 2008 environment for x86"
$root = "$env:LOCALAPPDATA\Programs\Common\Microsoft\Visual C++ for Python\9.0"
$Env:VCINSTALLDIR = "${root}\VC\"
$Env:WindowsSdkDir = "${root}\WinSDK\"
$Env:PATH = "${root}\VC\Bin;${root}\WinSDK\Bin;$Env:PATH"
$Env:INCLUDE = "${root}\VC\Include;${root}\WinSDK\Include;$Env:INCLUDE"
$Env:LIB = "${root}\VC\Lib;${root}\WinSDK\Lib;$Env:LIB"
$Env:LIBPATH = "${root}\VC\lib;${root}\WinSDK\Lib;$Env:LIBPATH"
'''.lstrip()

HG_PURGE = r'''
$Env:PATH = "C:\hgdev\venv-bootstrap\Scripts;$Env:PATH"
Set-Location C:\hgdev\src
hg.exe --config extensions.purge= purge --all
if ($LASTEXITCODE -ne 0) {
    throw "process exited non-0: $LASTEXITCODE"
}
Write-Output "purged Mercurial repo"
'''

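# The doubled braces in the scripts below are literal braces: these strings
# are expanded with str.format(), which treats {{ and }} as escaped { and }.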
|
55 | HG_UPDATE_CLEAN = r''' | |
|
56 | $Env:PATH = "C:\hgdev\venv-bootstrap\Scripts;$Env:PATH" | |
|
57 | Set-Location C:\hgdev\src | |
|
58 | hg.exe --config extensions.purge= purge --all | |
|
59 | if ($LASTEXITCODE -ne 0) {{ | |
|
60 | throw "process exited non-0: $LASTEXITCODE" | |
|
61 | }} | |
|
62 | hg.exe update -C {revision} | |
|
63 | if ($LASTEXITCODE -ne 0) {{ | |
|
64 | throw "process exited non-0: $LASTEXITCODE" | |
|
65 | }} | |
|
66 | hg.exe log -r . | |
|
67 | Write-Output "updated Mercurial working directory to {revision}" | |
|
68 | '''.lstrip() | |
|
69 | ||
|
70 | BUILD_INNO = r''' | |
|
71 | Set-Location C:\hgdev\src | |
|
72 | $python = "C:\hgdev\python27-{arch}\python.exe" | |
|
73 | C:\hgdev\python37-x64\python.exe contrib\packaging\inno\build.py --python $python | |
|
74 | if ($LASTEXITCODE -ne 0) {{ | |
|
75 | throw "process exited non-0: $LASTEXITCODE" | |
|
76 | }} | |
|
77 | '''.lstrip() | |
|
78 | ||
|
79 | BUILD_WHEEL = r''' | |
|
80 | Set-Location C:\hgdev\src | |
|
81 | C:\hgdev\python27-{arch}\Scripts\pip.exe wheel --wheel-dir dist . | |
|
82 | if ($LASTEXITCODE -ne 0) {{ | |
|
83 | throw "process exited non-0: $LASTEXITCODE" | |
|
84 | }} | |
|
85 | ''' | |
|
86 | ||
|
87 | BUILD_WIX = r''' | |
|
88 | Set-Location C:\hgdev\src | |
|
89 | $python = "C:\hgdev\python27-{arch}\python.exe" | |
|
90 | C:\hgdev\python37-x64\python.exe contrib\packaging\wix\build.py --python $python {extra_args} | |
|
91 | if ($LASTEXITCODE -ne 0) {{ | |
|
92 | throw "process exited non-0: $LASTEXITCODE" | |
|
93 | }} | |
|
94 | ''' | |
|
95 | ||
|
96 | RUN_TESTS = r''' | |
|
97 | C:\hgdev\MinGW\msys\1.0\bin\sh.exe --login -c "cd /c/hgdev/src/tests && /c/hgdev/{python_path}/python.exe run-tests.py {test_flags}" | |
|
98 | if ($LASTEXITCODE -ne 0) {{ | |
|
99 | throw "process exited non-0: $LASTEXITCODE" | |
|
100 | }} | |
|
101 | ''' | |
|
102 | ||
|
103 | ||
|
104 | def get_vc_prefix(arch): | |
|
105 | if arch == 'x86': | |
|
106 | return ACTIVATE_VC9_X86 | |
|
107 | elif arch == 'x64': | |
|
108 | return ACTIVATE_VC9_AMD64 | |
|
109 | else: | |
|
110 | raise ValueError('illegal arch: %s; must be x86 or x64' % arch) | |
|
111 | ||
|
112 | ||
|
113 | def fix_authorized_keys_permissions(winrm_client, path): | |
|
114 | commands = [ | |
|
115 | '$ErrorActionPreference = "Stop"', | |
|
116 | 'Repair-AuthorizedKeyPermission -FilePath %s -Confirm:$false' % path, | |
|
117 | r'icacls %s /remove:g "NT Service\sshd"' % path, | |
|
118 | ] | |
|
119 | ||
|
120 | run_powershell(winrm_client, '\n'.join(commands)) | |
|
121 | ||
|
122 | ||
|
123 | def synchronize_hg(hg_repo: pathlib.Path, revision: str, ec2_instance): | |
|
124 | """Synchronize local Mercurial repo to remote EC2 instance.""" | |
|
125 | ||
|
126 | winrm_client = ec2_instance.winrm_client | |
|
127 | ||
|
128 | with tempfile.TemporaryDirectory() as temp_dir: | |
|
129 | temp_dir = pathlib.Path(temp_dir) | |
|
130 | ||
|
131 | ssh_dir = temp_dir / '.ssh' | |
|
132 | ssh_dir.mkdir() | |
|
133 | ssh_dir.chmod(0o0700) | |
|
134 | ||
|
135 | # Generate SSH key to use for communication. | |
|
136 | subprocess.run([ | |
|
137 | 'ssh-keygen', '-t', 'rsa', '-b', '4096', '-N', '', | |
|
138 | '-f', str(ssh_dir / 'id_rsa')], | |
|
139 | check=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE) | |
|
140 | ||
|
141 | # Add it to ~/.ssh/authorized_keys on remote. | |
|
142 | # This assumes the file doesn't already exist. | |
|
143 | authorized_keys = r'c:\Users\Administrator\.ssh\authorized_keys' | |
|
144 | winrm_client.execute_cmd(r'mkdir c:\Users\Administrator\.ssh') | |
|
145 | winrm_client.copy(str(ssh_dir / 'id_rsa.pub'), authorized_keys) | |
|
146 | fix_authorized_keys_permissions(winrm_client, authorized_keys) | |
|
147 | ||
|
148 | public_ip = ec2_instance.public_ip_address | |
|
149 | ||
|
150 | ssh_config = temp_dir / '.ssh' / 'config' | |
|
151 | ||
|
152 | with open(ssh_config, 'w', encoding='utf-8') as fh: | |
|
153 | fh.write('Host %s\n' % public_ip) | |
|
154 | fh.write(' User Administrator\n') | |
|
155 | fh.write(' StrictHostKeyChecking no\n') | |
|
156 | fh.write(' UserKnownHostsFile %s\n' % (ssh_dir / 'known_hosts')) | |
|
157 | fh.write(' IdentityFile %s\n' % (ssh_dir / 'id_rsa')) | |
|
158 | ||
|
159 | env = dict(os.environ) | |
|
160 | env['HGPLAIN'] = '1' | |
|
161 | env['HGENCODING'] = 'utf-8' | |
|
162 | ||
|
163 | hg_bin = hg_repo / 'hg' | |
|
164 | ||
|
165 | res = subprocess.run( | |
|
166 | ['python2.7', str(hg_bin), 'log', '-r', revision, '-T', '{node}'], | |
|
167 | cwd=str(hg_repo), env=env, check=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE) | |
|
168 | ||
|
169 | full_revision = res.stdout.decode('ascii') | |
|
170 | ||
|
171 | args = [ | |
|
172 | 'python2.7', str(hg_bin), | |
|
173 | '--config', 'ui.ssh=ssh -F %s' % ssh_config, | |
|
174 | '--config', 'ui.remotecmd=c:/hgdev/venv-bootstrap/Scripts/hg.exe', | |
|
175 | 'push', '-r', full_revision, 'ssh://%s/c:/hgdev/src' % public_ip, | |
|
176 | ] | |
|
177 | ||
|
178 | subprocess.run(args, cwd=str(hg_repo), env=env, check=True) | |
|
179 | ||
|
180 | run_powershell(winrm_client, | |
|
181 | HG_UPDATE_CLEAN.format(revision=full_revision)) | |
|
182 | ||
|
183 | # TODO detect dirty local working directory and synchronize accordingly. | |
|
184 | ||
|
185 | ||
|
186 | def purge_hg(winrm_client): | |
|
187 | """Purge the Mercurial source repository on an EC2 instance.""" | |
|
188 | run_powershell(winrm_client, HG_PURGE) | |
|
189 | ||
|
190 | ||
|
191 | def find_latest_dist(winrm_client, pattern): | |
|
192 | """Find path to newest file in dist/ directory matching a pattern.""" | |
|
193 | ||
|
194 | res = winrm_client.execute_ps( | |
|
195 | r'$v = Get-ChildItem -Path C:\hgdev\src\dist -Filter "%s" ' | |
|
196 | '| Sort-Object LastWriteTime -Descending ' | |
|
197 | '| Select-Object -First 1\n' | |
|
198 | '$v.name' % pattern | |
|
199 | ) | |
|
200 | return res[0] | |
|
201 | ||
|
202 | ||
|
203 | def copy_latest_dist(winrm_client, pattern, dest_path): | |
|
204 | """Copy latest file matching pattern in dist/ directory. | |
|
205 | ||
|
206 | Given a WinRM client and a file pattern, find the latest file on the remote | |
|
207 | matching that pattern and copy it to the ``dest_path`` directory on the | |
|
208 | local machine. | |
|
209 | """ | |
|
210 | latest = find_latest_dist(winrm_client, pattern) | |
|
211 | source = r'C:\hgdev\src\dist\%s' % latest | |
|
212 | dest = dest_path / latest | |
|
213 | print('copying %s to %s' % (source, dest)) | |
|
214 | winrm_client.fetch(source, str(dest)) | |
|
215 | ||
|
216 | ||
|
217 | def build_inno_installer(winrm_client, arch: str, dest_path: pathlib.Path, | |
|
218 | version=None): | |
|
219 | """Build the Inno Setup installer on a remote machine. | |
|
220 | ||
|
221 | Using a WinRM client, remote commands are executed to build | |
|
222 | a Mercurial Inno Setup installer. | |
|
223 | """ | |
|
224 | print('building Inno Setup installer for %s' % arch) | |
|
225 | ||
|
226 | extra_args = [] | |
|
227 | if version: | |
|
228 | extra_args.extend(['--version', version]) | |
|
229 | ||
|
230 | ps = get_vc_prefix(arch) + BUILD_INNO.format(arch=arch, | |
|
231 | extra_args=' '.join(extra_args)) | |
|
232 | run_powershell(winrm_client, ps) | |
|
233 | copy_latest_dist(winrm_client, '*.exe', dest_path) | |
|
234 | ||
|
235 | ||
|
236 | def build_wheel(winrm_client, arch: str, dest_path: pathlib.Path): | |
|
237 | """Build Python wheels on a remote machine. | |
|
238 | ||
|
239 | Using a WinRM client, remote commands are executed to build a Python wheel | |
|
240 | for Mercurial. | |
|
241 | """ | |
|
242 | print('Building Windows wheel for %s' % arch) | |
|
243 | ps = get_vc_prefix(arch) + BUILD_WHEEL.format(arch=arch) | |
|
244 | run_powershell(winrm_client, ps) | |
|
245 | copy_latest_dist(winrm_client, '*.whl', dest_path) | |
|
246 | ||
|
247 | ||
|
248 | def build_wix_installer(winrm_client, arch: str, dest_path: pathlib.Path, | |
|
249 | version=None): | |
|
250 | """Build the WiX installer on a remote machine. | |
|
251 | ||
|
252 | Using a WinRM client, remote commands are executed to build a WiX installer. | |
|
253 | """ | |
|
254 | print('Building WiX installer for %s' % arch) | |
|
255 | extra_args = [] | |
|
256 | if version: | |
|
257 | extra_args.extend(['--version', version]) | |
|
258 | ||
|
259 | ps = get_vc_prefix(arch) + BUILD_WIX.format(arch=arch, | |
|
260 | extra_args=' '.join(extra_args)) | |
|
261 | run_powershell(winrm_client, ps) | |
|
262 | copy_latest_dist(winrm_client, '*.msi', dest_path) | |
|
263 | ||
|
264 | ||
|
265 | def run_tests(winrm_client, python_version, arch, test_flags=''): | |
|
266 | """Run tests on a remote Windows machine. | |
|
267 | ||
|
268 | ``python_version`` is a ``X.Y`` string like ``2.7`` or ``3.7``. | |
|
269 | ``arch`` is ``x86`` or ``x64``. | |
|
270 | ``test_flags`` is a str representing extra arguments to pass to | |
|
271 | ``run-tests.py``. | |
|
272 | """ | |
|
273 | if not re.match(r'\d\.\d', python_version): | |
|
274 | raise ValueError(r'python_version must be \d.\d; got %s' % | |
|
275 | python_version) | |
|
276 | ||
|
277 | if arch not in ('x86', 'x64'): | |
|
278 | raise ValueError('arch must be x86 or x64; got %s' % arch) | |
|
279 | ||
|
280 | python_path = 'python%s-%s' % (python_version.replace('.', ''), arch) | |
|
281 | ||
|
282 | ps = RUN_TESTS.format( | |
|
283 | python_path=python_path, | |
|
284 | test_flags=test_flags or '', | |
|
285 | ) | |
|
286 | ||
|
287 | run_powershell(winrm_client, ps) |
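
Taken together, these helpers form the remote Windows build pipeline. Below is
a minimal sketch of how a driver might chain them; the ``hgautomation.windows``
import path and the shape of the ``ec2_instance`` object are assumptions, and
the real wiring lives in ``automation.py``::

    import pathlib

    from hgautomation import windows

    def build_and_test(hg_repo: pathlib.Path, revision: str, ec2_instance,
                       dest_path: pathlib.Path):
        client = ec2_instance.winrm_client

        # Push the local checkout to the instance and update to the revision.
        windows.synchronize_hg(hg_repo, revision, ec2_instance)

        # Build a wheel for 64-bit Python 2.7, then run the test harness.
        windows.build_wheel(client, 'x64', dest_path)
        windows.run_tests(client, '2.7', 'x64', test_flags='-j4')

        # Leave the remote checkout pristine for the next task.
        windows.purge_hg(client)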
@@ -0,0 +1,82 b'' | |||
|
1 | # winrm.py - Interact with Windows Remote Management (WinRM) | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # no-check-code because Python 3 native. | |
|
9 | ||
|
10 | import logging | |
|
11 | import pprint | |
|
12 | import time | |
|
13 | ||
|
14 | from pypsrp.client import ( | |
|
15 | Client, | |
|
16 | ) | |
|
17 | from pypsrp.powershell import ( | |
|
18 | PowerShell, | |
|
19 | PSInvocationState, | |
|
20 | RunspacePool, | |
|
21 | ) | |
|
22 | import requests.exceptions | |
|
23 | ||
|
24 | ||
|
25 | logger = logging.getLogger(__name__) | |
|
26 | ||
|
27 | ||
|
28 | def wait_for_winrm(host, username, password, timeout=120, ssl=False): | |
|
29 | """Wait for the Windows Remoting (WinRM) service to become available. | |
|
30 | ||
|
31 | Returns a ``pypsrp.client.Client`` instance. | |
|
32 | """ | |
|
33 | ||
|
34 | end_time = time.time() + timeout | |
|
35 | ||
|
36 | while True: | |
|
37 | try: | |
|
38 | client = Client(host, username=username, password=password, | |
|
39 | ssl=ssl, connection_timeout=5) | |
|
40 | client.execute_cmd('echo "hello world"') | |
|
41 | return client | |
|
42 | except requests.exceptions.ConnectionError: | |
|
43 | if time.time() >= end_time: | |
|
44 | raise | |
|
45 | ||
|
46 | time.sleep(1) | |
|
47 | ||
|
48 | ||
|
49 | def format_object(o): | |
|
50 | if isinstance(o, str): | |
|
51 | return o | |
|
52 | ||
|
53 | try: | |
|
54 | o = str(o) | |
|
55 | except TypeError: | |
|
56 | o = pprint.pformat(o.extended_properties) | |
|
57 | ||
|
58 | return o | |
|
59 | ||
|
60 | ||
|
61 | def run_powershell(client, script): | |
|
62 | with RunspacePool(client.wsman) as pool: | |
|
63 | ps = PowerShell(pool) | |
|
64 | ps.add_script(script) | |
|
65 | ||
|
66 | ps.begin_invoke() | |
|
67 | ||
|
68 | while ps.state == PSInvocationState.RUNNING: | |
|
69 | ps.poll_invoke() | |
|
70 | for o in ps.output: | |
|
71 | print(format_object(o)) | |
|
72 | ||
|
73 | ps.output[:] = [] | |
|
74 | ||
|
75 | ps.end_invoke() | |
|
76 | ||
|
77 | for o in ps.output: | |
|
78 | print(format_object(o)) | |
|
79 | ||
|
80 | if ps.state == PSInvocationState.FAILED: | |
|
81 | raise Exception('PowerShell execution failed: %s' % | |
|
82 | ' '.join(map(format_object, ps.streams.error))) |
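
A short usage sketch for this module, assuming a reachable Windows host; the
address and credentials below are placeholders::

    from winrm import run_powershell, wait_for_winrm

    # Block until the WinRM service answers, then run a script remotely.
    client = wait_for_winrm('203.0.113.10', 'Administrator',
                            'example-password', timeout=300)
    run_powershell(client, r'Get-ChildItem C:\hgdev | Select-Object Name')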
@@ -0,0 +1,119 b'' | |||
|
1 | # | |
|
2 | # This file is autogenerated by pip-compile | |
|
3 | # To update, run: | |
|
4 | # | |
|
5 | # pip-compile -U --generate-hashes --output-file contrib/automation/requirements.txt contrib/automation/requirements.txt.in | |
|
6 | # | |
|
7 | asn1crypto==0.24.0 \ | |
|
8 | --hash=sha256:2f1adbb7546ed199e3c90ef23ec95c5cf3585bac7d11fb7eb562a3fe89c64e87 \ | |
|
9 | --hash=sha256:9d5c20441baf0cb60a4ac34cc447c6c189024b6b4c6cd7877034f4965c464e49 \ | |
|
10 | # via cryptography | |
|
11 | boto3==1.9.111 \ | |
|
12 | --hash=sha256:06414c75d1f62af7d04fd652b38d1e4fd3cfd6b35bad978466af88e2aaecd00d \ | |
|
13 | --hash=sha256:f3b77dff382374773d02411fa47ee408f4f503aeebd837fd9dc9ed8635bc5e8e | |
|
14 | botocore==1.12.111 \ | |
|
15 | --hash=sha256:6af473c52d5e3e7ff82de5334e9fee96b2d5ec2df5d78bc00cd9937e2573a7a8 \ | |
|
16 | --hash=sha256:9f5123c7be704b17aeacae99b5842ab17bda1f799dd29134de8c70e0a50a45d7 \ | |
|
17 | # via boto3, s3transfer | |
|
18 | certifi==2019.3.9 \ | |
|
19 | --hash=sha256:59b7658e26ca9c7339e00f8f4636cdfe59d34fa37b9b04f6f9e9926b3cece1a5 \ | |
|
20 | --hash=sha256:b26104d6835d1f5e49452a26eb2ff87fe7090b89dfcaee5ea2212697e1e1d7ae \ | |
|
21 | # via requests | |
|
22 | cffi==1.12.2 \ | |
|
23 | --hash=sha256:00b97afa72c233495560a0793cdc86c2571721b4271c0667addc83c417f3d90f \ | |
|
24 | --hash=sha256:0ba1b0c90f2124459f6966a10c03794082a2f3985cd699d7d63c4a8dae113e11 \ | |
|
25 | --hash=sha256:0bffb69da295a4fc3349f2ec7cbe16b8ba057b0a593a92cbe8396e535244ee9d \ | |
|
26 | --hash=sha256:21469a2b1082088d11ccd79dd84157ba42d940064abbfa59cf5f024c19cf4891 \ | |
|
27 | --hash=sha256:2e4812f7fa984bf1ab253a40f1f4391b604f7fc424a3e21f7de542a7f8f7aedf \ | |
|
28 | --hash=sha256:2eac2cdd07b9049dd4e68449b90d3ef1adc7c759463af5beb53a84f1db62e36c \ | |
|
29 | --hash=sha256:2f9089979d7456c74d21303c7851f158833d48fb265876923edcb2d0194104ed \ | |
|
30 | --hash=sha256:3dd13feff00bddb0bd2d650cdb7338f815c1789a91a6f68fdc00e5c5ed40329b \ | |
|
31 | --hash=sha256:4065c32b52f4b142f417af6f33a5024edc1336aa845b9d5a8d86071f6fcaac5a \ | |
|
32 | --hash=sha256:51a4ba1256e9003a3acf508e3b4f4661bebd015b8180cc31849da222426ef585 \ | |
|
33 | --hash=sha256:59888faac06403767c0cf8cfb3f4a777b2939b1fbd9f729299b5384f097f05ea \ | |
|
34 | --hash=sha256:59c87886640574d8b14910840327f5cd15954e26ed0bbd4e7cef95fa5aef218f \ | |
|
35 | --hash=sha256:610fc7d6db6c56a244c2701575f6851461753c60f73f2de89c79bbf1cc807f33 \ | |
|
36 | --hash=sha256:70aeadeecb281ea901bf4230c6222af0248c41044d6f57401a614ea59d96d145 \ | |
|
37 | --hash=sha256:71e1296d5e66c59cd2c0f2d72dc476d42afe02aeddc833d8e05630a0551dad7a \ | |
|
38 | --hash=sha256:8fc7a49b440ea752cfdf1d51a586fd08d395ff7a5d555dc69e84b1939f7ddee3 \ | |
|
39 | --hash=sha256:9b5c2afd2d6e3771d516045a6cfa11a8da9a60e3d128746a7fe9ab36dfe7221f \ | |
|
40 | --hash=sha256:9c759051ebcb244d9d55ee791259ddd158188d15adee3c152502d3b69005e6bd \ | |
|
41 | --hash=sha256:b4d1011fec5ec12aa7cc10c05a2f2f12dfa0adfe958e56ae38dc140614035804 \ | |
|
42 | --hash=sha256:b4f1d6332339ecc61275bebd1f7b674098a66fea11a00c84d1c58851e618dc0d \ | |
|
43 | --hash=sha256:c030cda3dc8e62b814831faa4eb93dd9a46498af8cd1d5c178c2de856972fd92 \ | |
|
44 | --hash=sha256:c2e1f2012e56d61390c0e668c20c4fb0ae667c44d6f6a2eeea5d7148dcd3df9f \ | |
|
45 | --hash=sha256:c37c77d6562074452120fc6c02ad86ec928f5710fbc435a181d69334b4de1d84 \ | |
|
46 | --hash=sha256:c8149780c60f8fd02752d0429246088c6c04e234b895c4a42e1ea9b4de8d27fb \ | |
|
47 | --hash=sha256:cbeeef1dc3c4299bd746b774f019de9e4672f7cc666c777cd5b409f0b746dac7 \ | |
|
48 | --hash=sha256:e113878a446c6228669144ae8a56e268c91b7f1fafae927adc4879d9849e0ea7 \ | |
|
49 | --hash=sha256:e21162bf941b85c0cda08224dade5def9360f53b09f9f259adb85fc7dd0e7b35 \ | |
|
50 | --hash=sha256:fb6934ef4744becbda3143d30c6604718871495a5e36c408431bf33d9c146889 \ | |
|
51 | # via cryptography | |
|
52 | chardet==3.0.4 \ | |
|
53 | --hash=sha256:84ab92ed1c4d4f16916e05906b6b75a6c0fb5db821cc65e70cbd64a3e2a5eaae \ | |
|
54 | --hash=sha256:fc323ffcaeaed0e0a02bf4d117757b98aed530d9ed4531e3e15460124c106691 \ | |
|
55 | # via requests | |
|
56 | cryptography==2.6.1 \ | |
|
57 | --hash=sha256:066f815f1fe46020877c5983a7e747ae140f517f1b09030ec098503575265ce1 \ | |
|
58 | --hash=sha256:210210d9df0afba9e000636e97810117dc55b7157c903a55716bb73e3ae07705 \ | |
|
59 | --hash=sha256:26c821cbeb683facb966045e2064303029d572a87ee69ca5a1bf54bf55f93ca6 \ | |
|
60 | --hash=sha256:2afb83308dc5c5255149ff7d3fb9964f7c9ee3d59b603ec18ccf5b0a8852e2b1 \ | |
|
61 | --hash=sha256:2db34e5c45988f36f7a08a7ab2b69638994a8923853dec2d4af121f689c66dc8 \ | |
|
62 | --hash=sha256:409c4653e0f719fa78febcb71ac417076ae5e20160aec7270c91d009837b9151 \ | |
|
63 | --hash=sha256:45a4f4cf4f4e6a55c8128f8b76b4c057027b27d4c67e3fe157fa02f27e37830d \ | |
|
64 | --hash=sha256:48eab46ef38faf1031e58dfcc9c3e71756a1108f4c9c966150b605d4a1a7f659 \ | |
|
65 | --hash=sha256:6b9e0ae298ab20d371fc26e2129fd683cfc0cfde4d157c6341722de645146537 \ | |
|
66 | --hash=sha256:6c4778afe50f413707f604828c1ad1ff81fadf6c110cb669579dea7e2e98a75e \ | |
|
67 | --hash=sha256:8c33fb99025d353c9520141f8bc989c2134a1f76bac6369cea060812f5b5c2bb \ | |
|
68 | --hash=sha256:9873a1760a274b620a135054b756f9f218fa61ca030e42df31b409f0fb738b6c \ | |
|
69 | --hash=sha256:9b069768c627f3f5623b1cbd3248c5e7e92aec62f4c98827059eed7053138cc9 \ | |
|
70 | --hash=sha256:9e4ce27a507e4886efbd3c32d120db5089b906979a4debf1d5939ec01b9dd6c5 \ | |
|
71 | --hash=sha256:acb424eaca214cb08735f1a744eceb97d014de6530c1ea23beb86d9c6f13c2ad \ | |
|
72 | --hash=sha256:c8181c7d77388fe26ab8418bb088b1a1ef5fde058c6926790c8a0a3d94075a4a \ | |
|
73 | --hash=sha256:d4afbb0840f489b60f5a580a41a1b9c3622e08ecb5eec8614d4fb4cd914c4460 \ | |
|
74 | --hash=sha256:d9ed28030797c00f4bc43c86bf819266c76a5ea61d006cd4078a93ebf7da6bfd \ | |
|
75 | --hash=sha256:e603aa7bb52e4e8ed4119a58a03b60323918467ef209e6ff9db3ac382e5cf2c6 \ | |
|
76 | # via pypsrp | |
|
77 | docutils==0.14 \ | |
|
78 | --hash=sha256:02aec4bd92ab067f6ff27a38a38a41173bf01bed8f89157768c1573f53e474a6 \ | |
|
79 | --hash=sha256:51e64ef2ebfb29cae1faa133b3710143496eca21c530f3f71424d77687764274 \ | |
|
80 | --hash=sha256:7a4bd47eaf6596e1295ecb11361139febe29b084a87bf005bf899f9a42edc3c6 \ | |
|
81 | # via botocore | |
|
82 | idna==2.8 \ | |
|
83 | --hash=sha256:c357b3f628cf53ae2c4c05627ecc484553142ca23264e593d327bcde5e9c3407 \ | |
|
84 | --hash=sha256:ea8b7f6188e6fa117537c3df7da9fc686d485087abf6ac197f9c46432f7e4a3c \ | |
|
85 | # via requests | |
|
86 | jmespath==0.9.4 \ | |
|
87 | --hash=sha256:3720a4b1bd659dd2eecad0666459b9788813e032b83e7ba58578e48254e0a0e6 \ | |
|
88 | --hash=sha256:bde2aef6f44302dfb30320115b17d030798de8c4110e28d5cf6cf91a7a31074c \ | |
|
89 | # via boto3, botocore | |
|
90 | ntlm-auth==1.2.0 \ | |
|
91 | --hash=sha256:7bc02a3fbdfee7275d3dc20fce8028ed8eb6d32364637f28be9e9ae9160c6d5c \ | |
|
92 | --hash=sha256:9b13eaf88f16a831637d75236a93d60c0049536715aafbf8190ba58a590b023e \ | |
|
93 | # via pypsrp | |
|
94 | pycparser==2.19 \ | |
|
95 | --hash=sha256:a988718abfad80b6b157acce7bf130a30876d27603738ac39f140993246b25b3 \ | |
|
96 | # via cffi | |
|
97 | pypsrp==0.3.1 \ | |
|
98 | --hash=sha256:309853380fe086090a03cc6662a778ee69b1cae355ae4a932859034fd76e9d0b \ | |
|
99 | --hash=sha256:90f946254f547dc3493cea8493c819ab87e152a755797c93aa2668678ba8ae85 | |
|
100 | python-dateutil==2.8.0 \ | |
|
101 | --hash=sha256:7e6584c74aeed623791615e26efd690f29817a27c73085b78e4bad02493df2fb \ | |
|
102 | --hash=sha256:c89805f6f4d64db21ed966fda138f8a5ed7a4fdbc1a8ee329ce1b74e3c74da9e \ | |
|
103 | # via botocore | |
|
104 | requests==2.21.0 \ | |
|
105 | --hash=sha256:502a824f31acdacb3a35b6690b5fbf0bc41d63a24a45c4004352b0242707598e \ | |
|
106 | --hash=sha256:7bf2a778576d825600030a110f3c0e3e8edc51dfaafe1c146e39a2027784957b \ | |
|
107 | # via pypsrp | |
|
108 | s3transfer==0.2.0 \ | |
|
109 | --hash=sha256:7b9ad3213bff7d357f888e0fab5101b56fa1a0548ee77d121c3a3dbfbef4cb2e \ | |
|
110 | --hash=sha256:f23d5cb7d862b104401d9021fc82e5fa0e0cf57b7660a1331425aab0c691d021 \ | |
|
111 | # via boto3 | |
|
112 | six==1.12.0 \ | |
|
113 | --hash=sha256:3350809f0555b11f552448330d0b52d5f24c91a322ea4a15ef22629740f3761c \ | |
|
114 | --hash=sha256:d16a0141ec1a18405cd4ce8b4613101da75da0e9a7aec5bdd4fa804d0e0eba73 \ | |
|
115 | # via cryptography, pypsrp, python-dateutil | |
|
116 | urllib3==1.24.1 \ | |
|
117 | --hash=sha256:61bf29cada3fc2fbefad4fdf059ea4bd1b4a86d2b6d15e1c7c0b582b9752fe39 \ | |
|
118 | --hash=sha256:de9529817c93f27c8ccbfead6985011db27bd0ddfcdb2d86f3f663385c6a9c22 \ | |
|
119 | # via botocore, requests |
@@ -0,0 +1,200 b'' | |||
|
1 | # install-dependencies.ps1 - Install Windows dependencies for building Mercurial | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # This script can be used to bootstrap a Mercurial build environment on | |
|
9 | # Windows. | |
|
10 | # | |
|
11 | # The script makes a lot of assumptions about how things should work. | |
|
12 | # For example, the install location of Python is hardcoded to c:\hgdev\*. | |
|
13 | # | |
|
14 | # The script should be executed from a PowerShell with elevated privileges | |
|
15 | # if you don't want to see a UAC prompt for various installers. | |
|
16 | # | |
|
17 | # The script is tested on Windows 10 and Windows Server 2019 (in EC2). | |
|
18 | ||
|
19 | $VS_BUILD_TOOLS_URL = "https://download.visualstudio.microsoft.com/download/pr/a1603c02-8a66-4b83-b821-811e3610a7c4/aa2db8bb39e0cbd23e9940d8951e0bc3/vs_buildtools.exe" | |
|
20 | $VS_BUILD_TOOLS_SHA256 = "911E292B8E6E5F46CBC17003BDCD2D27A70E616E8D5E6E69D5D489A605CAA139" | |
|
21 | ||
|
22 | $VC9_PYTHON_URL = "https://download.microsoft.com/download/7/9/6/796EF2E4-801B-4FC4-AB28-B59FBF6D907B/VCForPython27.msi" | |
|
23 | $VC9_PYTHON_SHA256 = "070474db76a2e625513a5835df4595df9324d820f9cc97eab2a596dcbc2f5cbf" | |
|
24 | ||
|
25 | $PYTHON27_x64_URL = "https://www.python.org/ftp/python/2.7.16/python-2.7.16.amd64.msi" | |
|
26 | $PYTHON27_x64_SHA256 = "7c0f45993019152d46041a7db4b947b919558fdb7a8f67bcd0535bc98d42b603" | |
|
27 | $PYTHON27_X86_URL = "https://www.python.org/ftp/python/2.7.16/python-2.7.16.msi" | |
|
28 | $PYTHON27_X86_SHA256 = "d57dc3e1ba490aee856c28b4915d09e3f49442461e46e481bc6b2d18207831d7" | |
|
29 | ||
|
30 | $PYTHON35_x86_URL = "https://www.python.org/ftp/python/3.5.4/python-3.5.4.exe" | |
|
31 | $PYTHON35_x86_SHA256 = "F27C2D67FD9688E4970F3BFF799BB9D722A0D6C2C13B04848E1F7D620B524B0E" | |
|
32 | $PYTHON35_x64_URL = "https://www.python.org/ftp/python/3.5.4/python-3.5.4-amd64.exe" | |
|
33 | $PYTHON35_x64_SHA256 = "9B7741CC32357573A77D2EE64987717E527628C38FD7EAF3E2AACA853D45A1EE" | |
|
34 | ||
|
35 | $PYTHON36_x86_URL = "https://www.python.org/ftp/python/3.6.8/python-3.6.8.exe" | |
|
36 | $PYTHON36_x86_SHA256 = "89871D432BC06E4630D7B64CB1A8451E53C80E68DE29029976B12AAD7DBFA5A0" | |
|
37 | $PYTHON36_x64_URL = "https://www.python.org/ftp/python/3.6.8/python-3.6.8-amd64.exe" | |
|
38 | $PYTHON36_x64_SHA256 = "96088A58B7C43BC83B84E6B67F15E8706C614023DD64F9A5A14E81FF824ADADC" | |
|
39 | ||
|
40 | $PYTHON37_x86_URL = "https://www.python.org/ftp/python/3.7.2/python-3.7.2.exe" | |
|
41 | $PYTHON37_x86_SHA256 = "8BACE330FB409E428B04EEEE083DD9CA7F6C754366D07E23B3853891D8F8C3D0" | |
|
42 | $PYTHON37_x64_URL = "https://www.python.org/ftp/python/3.7.2/python-3.7.2-amd64.exe" | |
|
43 | $PYTHON37_x64_SHA256 = "0FE2A696F5A3E481FED795EF6896ED99157BCEF273EF3C4A96F2905CBDB3AA13" | |
|
44 | ||
|
45 | $PYTHON38_x86_URL = "https://www.python.org/ftp/python/3.8.0/python-3.8.0a2.exe" | |
|
46 | $PYTHON38_x86_SHA256 = "013A7DDD317679FE51223DE627688CFCB2F0F1128FD25A987F846AEB476D3FEF" | |
|
47 | $PYTHON38_x64_URL = "https://www.python.org/ftp/python/3.8.0/python-3.8.0a2-amd64.exe" | |
|
48 | $PYTHON38_X64_SHA256 = "560BC6D1A76BCD6D544AC650709F3892956890753CDCF9CE67E3D7302D76FB41" | |
|
49 | ||
|
50 | # PIP 19.0.3. | |
|
51 | $PIP_URL = "https://github.com/pypa/get-pip/raw/fee32c376da1ff6496a798986d7939cd51e1644f/get-pip.py" | |
|
52 | $PIP_SHA256 = "efe99298f3fbb1f56201ce6b81d2658067d2f7d7dfc2d412e0d3cacc9a397c61" | |
|
53 | ||
|
54 | $VIRTUALENV_URL = "https://files.pythonhosted.org/packages/37/db/89d6b043b22052109da35416abc3c397655e4bd3cff031446ba02b9654fa/virtualenv-16.4.3.tar.gz" | |
|
55 | $VIRTUALENV_SHA256 = "984d7e607b0a5d1329425dd8845bd971b957424b5ba664729fab51ab8c11bc39" | |
|
56 | ||
|
57 | $INNO_SETUP_URL = "http://files.jrsoftware.org/is/5/innosetup-5.6.1-unicode.exe" | |
|
58 | $INNO_SETUP_SHA256 = "27D49E9BC769E9D1B214C153011978DB90DC01C2ACD1DDCD9ED7B3FE3B96B538" | |
|
59 | ||
|
60 | $MINGW_BIN_URL = "https://osdn.net/frs/redir.php?m=constant&f=mingw%2F68260%2Fmingw-get-0.6.3-mingw32-pre-20170905-1-bin.zip" | |
|
61 | $MINGW_BIN_SHA256 = "2AB8EFD7C7D1FC8EAF8B2FA4DA4EEF8F3E47768284C021599BC7435839A046DF" | |
|
62 | ||
|
63 | $MERCURIAL_WHEEL_FILENAME = "mercurial-4.9-cp27-cp27m-win_amd64.whl" | |
|
64 | $MERCURIAL_WHEEL_URL = "https://files.pythonhosted.org/packages/fe/e8/b872d53dfbbf986bdc46af0b30f580b227fb59bddd2587152a55e205b0cc/$MERCURIAL_WHEEL_FILENAME" | |
|
65 | $MERCURIAL_WHEEL_SHA256 = "218cc2e7c3f1d535007febbb03351663897edf27df0e57d6842e3b686492b429" | |
|
66 | ||
|
67 | # Writing progress slows down downloads substantially. So disable it. | |
|
68 | $progressPreference = 'silentlyContinue' | |
|
69 | ||
|
70 | function Secure-Download($url, $path, $sha256) { | |
|
71 | if (Test-Path -Path $path) { | |
|
72 | Get-FileHash -Path $path -Algorithm SHA256 -OutVariable hash | |
|
73 | ||
|
74 | if ($hash.Hash -eq $sha256) { | |
|
75 | Write-Output "SHA256 of $path verified as $sha256" | |
|
76 | return | |
|
77 | } | |
|
78 | ||
|
79 | Write-Output "hash mismatch on $path; downloading again" | |
|
80 | } | |
|
81 | ||
|
82 | Write-Output "downloading $url to $path" | |
|
83 | Invoke-WebRequest -Uri $url -OutFile $path | |
|
84 | Get-FileHash -Path $path -Algorithm SHA256 -OutVariable hash | |
|
85 | ||
|
86 | if ($hash.Hash -ne $sha256) { | |
|
87 | Remove-Item -Path $path | |
|
88 | throw "hash mismatch when downloading $url; got $($hash.Hash), expected $sha256" | |
|
89 | } | |
|
90 | } | |
|
91 | ||
|
92 | function Invoke-Process($path, $arguments) { | |
|
93 | $p = Start-Process -FilePath $path -ArgumentList $arguments -Wait -PassThru -WindowStyle Hidden | |
|
94 | ||
|
95 | if ($p.ExitCode -ne 0) { | |
|
96 | throw "process exited non-0: $($p.ExitCode)" | |
|
97 | } | |
|
98 | } | |
|
99 | ||
|
100 | function Install-Python3($name, $installer, $dest, $pip) { | |
|
101 | Write-Output "installing $name" | |
|
102 | ||
|
103 | # We hit this when running the script as part of Simple Systems Manager in | |
|
104 | # EC2. The Python 3 installer doesn't seem to like per-user installs | |
|
105 | # when running as the SYSTEM user. So enable global installs if executed in | |
|
106 | # this mode. | |
|
107 | if ($env:USERPROFILE -eq "C:\Windows\system32\config\systemprofile") { | |
|
108 | Write-Output "running with SYSTEM account; installing for all users" | |
|
109 | $allusers = "1" | |
|
110 | } | |
|
111 | else { | |
|
112 | $allusers = "0" | |
|
113 | } | |
|
114 | ||
|
115 | Invoke-Process $installer "/quiet TargetDir=${dest} InstallAllUsers=${allusers} AssociateFiles=0 CompileAll=0 PrependPath=0 Include_doc=0 Include_launcher=0 InstallLauncherAllUsers=0 Include_pip=0 Include_test=0" | |
|
116 | Invoke-Process ${dest}\python.exe $pip | |
|
117 | } | |
|
118 | ||
|
119 | function Install-Dependencies($prefix) { | |
|
120 | if (!(Test-Path -Path $prefix\assets)) { | |
|
121 | New-Item -Path $prefix\assets -ItemType Directory | |
|
122 | } | |
|
123 | ||
|
124 | $pip = "${prefix}\assets\get-pip.py" | |
|
125 | ||
|
126 | Secure-Download $VC9_PYTHON_URL ${prefix}\assets\VCForPython27.msi $VC9_PYTHON_SHA256 | |
|
127 | Secure-Download $PYTHON27_x86_URL ${prefix}\assets\python27-x86.msi $PYTHON27_x86_SHA256 | |
|
128 | Secure-Download $PYTHON27_x64_URL ${prefix}\assets\python27-x64.msi $PYTHON27_x64_SHA256 | |
|
129 | Secure-Download $PYTHON35_x86_URL ${prefix}\assets\python35-x86.exe $PYTHON35_x86_SHA256 | |
|
130 | Secure-Download $PYTHON35_x64_URL ${prefix}\assets\python35-x64.exe $PYTHON35_x64_SHA256 | |
|
131 | Secure-Download $PYTHON36_x86_URL ${prefix}\assets\python36-x86.exe $PYTHON36_x86_SHA256 | |
|
132 | Secure-Download $PYTHON36_x64_URL ${prefix}\assets\python36-x64.exe $PYTHON36_x64_SHA256 | |
|
133 | Secure-Download $PYTHON37_x86_URL ${prefix}\assets\python37-x86.exe $PYTHON37_x86_SHA256 | |
|
134 | Secure-Download $PYTHON37_x64_URL ${prefix}\assets\python37-x64.exe $PYTHON37_x64_SHA256 | |
|
135 | Secure-Download $PYTHON38_x86_URL ${prefix}\assets\python38-x86.exe $PYTHON38_x86_SHA256 | |
|
136 | Secure-Download $PYTHON38_x64_URL ${prefix}\assets\python38-x64.exe $PYTHON38_x64_SHA256 | |
|
137 | Secure-Download $PIP_URL ${pip} $PIP_SHA256 | |
|
138 | Secure-Download $VIRTUALENV_URL ${prefix}\assets\virtualenv.tar.gz $VIRTUALENV_SHA256 | |
|
139 | Secure-Download $VS_BUILD_TOOLS_URL ${prefix}\assets\vs_buildtools.exe $VS_BUILD_TOOLS_SHA256 | |
|
140 | Secure-Download $INNO_SETUP_URL ${prefix}\assets\InnoSetup.exe $INNO_SETUP_SHA256 | |
|
141 | Secure-Download $MINGW_BIN_URL ${prefix}\assets\mingw-get-bin.zip $MINGW_BIN_SHA256 | |
|
142 | Secure-Download $MERCURIAL_WHEEL_URL ${prefix}\assets\${MERCURIAL_WHEEL_FILENAME} $MERCURIAL_WHEEL_SHA256 | |
|
143 | ||
|
144 | Write-Output "installing Python 2.7 32-bit" | |
|
145 | Invoke-Process msiexec.exe "/i ${prefix}\assets\python27-x86.msi /l* ${prefix}\assets\python27-x86.log /q TARGETDIR=${prefix}\python27-x86 ALLUSERS=" | |
|
146 | Invoke-Process ${prefix}\python27-x86\python.exe ${prefix}\assets\get-pip.py | |
|
147 | Invoke-Process ${prefix}\python27-x86\Scripts\pip.exe "install ${prefix}\assets\virtualenv.tar.gz" | |
|
148 | ||
|
149 | Write-Output "installing Python 2.7 64-bit" | |
|
150 | Invoke-Process msiexec.exe "/i ${prefix}\assets\python27-x64.msi /l* ${prefix}\assets\python27-x64.log /q TARGETDIR=${prefix}\python27-x64 ALLUSERS=" | |
|
151 | Invoke-Process ${prefix}\python27-x64\python.exe ${prefix}\assets\get-pip.py | |
|
152 | Invoke-Process ${prefix}\python27-x64\Scripts\pip.exe "install ${prefix}\assets\virtualenv.tar.gz" | |
|
153 | ||
|
154 | Install-Python3 "Python 3.5 32-bit" ${prefix}\assets\python35-x86.exe ${prefix}\python35-x86 ${pip} | |
|
155 | Install-Python3 "Python 3.5 64-bit" ${prefix}\assets\python35-x64.exe ${prefix}\python35-x64 ${pip} | |
|
156 | Install-Python3 "Python 3.6 32-bit" ${prefix}\assets\python36-x86.exe ${prefix}\python36-x86 ${pip} | |
|
157 | Install-Python3 "Python 3.6 64-bit" ${prefix}\assets\python36-x64.exe ${prefix}\python36-x64 ${pip} | |
|
158 | Install-Python3 "Python 3.7 32-bit" ${prefix}\assets\python37-x86.exe ${prefix}\python37-x86 ${pip} | |
|
159 | Install-Python3 "Python 3.7 64-bit" ${prefix}\assets\python37-x64.exe ${prefix}\python37-x64 ${pip} | |
|
160 | Install-Python3 "Python 3.8 32-bit" ${prefix}\assets\python38-x86.exe ${prefix}\python38-x86 ${pip} | |
|
161 | Install-Python3 "Python 3.8 64-bit" ${prefix}\assets\python38-x64.exe ${prefix}\python38-x64 ${pip} | |
|
162 | ||
|
163 | Write-Output "installing Visual Studio 2017 Build Tools and SDKs" | |
|
164 | Invoke-Process ${prefix}\assets\vs_buildtools.exe "--quiet --wait --norestart --nocache --channelUri https://aka.ms/vs/15/release/channel --add Microsoft.VisualStudio.Workload.MSBuildTools --add Microsoft.VisualStudio.Component.Windows10SDK.17763 --add Microsoft.VisualStudio.Workload.VCTools --add Microsoft.VisualStudio.Component.Windows10SDK --add Microsoft.VisualStudio.Component.VC.140" | |
|
165 | ||
|
166 | Write-Output "installing Visual C++ 9.0 for Python 2.7" | |
|
167 | Invoke-Process msiexec.exe "/i ${prefix}\assets\VCForPython27.msi /l* ${prefix}\assets\VCForPython27.log /q" | |
|
168 | ||
|
169 | Write-Output "installing Inno Setup" | |
|
170 | Invoke-Process ${prefix}\assets\InnoSetup.exe "/SP- /VERYSILENT /SUPPRESSMSGBOXES" | |
|
171 | ||
|
172 | Write-Output "extracting MinGW base archive" | |
|
173 | Expand-Archive -Path ${prefix}\assets\mingw-get-bin.zip -DestinationPath "${prefix}\MinGW" -Force | |
|
174 | ||
|
175 | Write-Output "updating MinGW package catalogs" | |
|
176 | Invoke-Process ${prefix}\MinGW\bin\mingw-get.exe "update" | |
|
177 | ||
|
178 | Write-Output "installing MinGW packages" | |
|
179 | Invoke-Process ${prefix}\MinGW\bin\mingw-get.exe "install msys-base msys-coreutils msys-diffutils msys-unzip" | |
|
180 | ||
|
181 | # Construct a virtualenv useful for bootstrapping. It conveniently contains a | |
|
182 | # Mercurial install. | |
|
183 | Write-Output "creating bootstrap virtualenv with Mercurial" | |
|
184 | Invoke-Process "$prefix\python27-x64\Scripts\virtualenv.exe" "${prefix}\venv-bootstrap" | |
|
185 | Invoke-Process "${prefix}\venv-bootstrap\Scripts\pip.exe" "install ${prefix}\assets\${MERCURIAL_WHEEL_FILENAME}" | |
|
186 | } | |
|
187 | ||
|
188 | function Clone-Mercurial-Repo($prefix, $repo_url, $dest) { | |
|
189 | Write-Output "cloning $repo_url to $dest" | |
|
190 | # TODO Figure out why CA verification isn't working in EC2 and remove | |
|
191 | # --insecure. | |
|
192 | Invoke-Process "${prefix}\venv-bootstrap\Scripts\hg.exe" "clone --insecure $repo_url $dest" | |
|
193 | ||
|
194 | # Mark repo as non-publishing by default for convenience. | |
|
195 | Add-Content -Path "$dest\.hg\hgrc" -Value "`n[phases]`npublish = false" | |
|
196 | } | |
|
197 | ||
|
198 | $prefix = "c:\hgdev" | |
|
199 | Install-Dependencies $prefix | |
|
200 | Clone-Mercurial-Repo $prefix "https://www.mercurial-scm.org/repo/hg" $prefix\src |
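
A sketch of driving this script from the automation host instead of via
instance user data; the local script path is a placeholder, and
``wait_for_winrm``/``run_powershell`` come from the WinRM module shown above::

    client = wait_for_winrm(public_ip, 'Administrator', password, timeout=300)

    # pypsrp's Client.copy() uploads a local file to the remote host.
    client.copy('install-dependencies.ps1',
                r'C:\Windows\Temp\install-dependencies.ps1')
    run_powershell(client, r'& C:\Windows\Temp\install-dependencies.ps1')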
|
1 | NO CONTENT: new file 100644 |
@@ -0,0 +1,175 b'' | |||
|
1 | # downloads.py - Code for downloading dependencies. | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # no-check-code because Python 3 native. | |
|
9 | ||
|
10 | import gzip | |
|
11 | import hashlib | |
|
12 | import pathlib | |
|
13 | import urllib.request | |
|
14 | ||
|
15 | ||
|
16 | DOWNLOADS = { | |
|
17 | 'gettext': { | |
|
18 | 'url': 'https://versaweb.dl.sourceforge.net/project/gnuwin32/gettext/0.14.4/gettext-0.14.4-bin.zip', | |
|
19 | 'size': 1606131, | |
|
20 | 'sha256': '60b9ef26bc5cceef036f0424e542106cf158352b2677f43a01affd6d82a1d641', | |
|
21 | 'version': '0.14.4', | |
|
22 | }, | |
|
23 | 'gettext-dep': { | |
|
24 | 'url': 'https://versaweb.dl.sourceforge.net/project/gnuwin32/gettext/0.14.4/gettext-0.14.4-dep.zip', | |
|
25 | 'size': 715086, | |
|
26 | 'sha256': '411f94974492fd2ecf52590cb05b1023530aec67e64154a88b1e4ebcd9c28588', | |
|
27 | }, | |
|
28 | 'py2exe': { | |
|
29 | 'url': 'https://versaweb.dl.sourceforge.net/project/py2exe/py2exe/0.6.9/py2exe-0.6.9.zip', | |
|
30 | 'size': 149687, | |
|
31 | 'sha256': '6bd383312e7d33eef2e43a5f236f9445e4f3e0f6b16333c6f183ed445c44ddbd', | |
|
32 | 'version': '0.6.9', | |
|
33 | }, | |
|
34 | # The VC9 CRT merge modules aren't readily available on most systems because | |
|
35 | # they are only installed as part of a full Visual Studio 2008 install. | |
|
36 | # While we could potentially extract them from a Visual Studio 2008 | |
|
37 | # installer, it is easier to just fetch them from a known URL. | |
|
38 | 'vc9-crt-x86-msm': { | |
|
39 | 'url': 'https://github.com/indygreg/vc90-merge-modules/raw/9232f8f0b2135df619bf7946eaa176b4ac35ccff/Microsoft_VC90_CRT_x86.msm', | |
|
40 | 'size': 615424, | |
|
41 | 'sha256': '837e887ef31b332feb58156f429389de345cb94504228bb9a523c25a9dd3d75e', | |
|
42 | }, | |
|
43 | 'vc9-crt-x86-msm-policy': { | |
|
44 | 'url': 'https://github.com/indygreg/vc90-merge-modules/raw/9232f8f0b2135df619bf7946eaa176b4ac35ccff/policy_9_0_Microsoft_VC90_CRT_x86.msm', | |
|
45 | 'size': 71168, | |
|
46 | 'sha256': '3fbcf92e3801a0757f36c5e8d304e134a68d5cafd197a6df7734ae3e8825c940', | |
|
47 | }, | |
|
48 | 'vc9-crt-x64-msm': { | |
|
49 | 'url': 'https://github.com/indygreg/vc90-merge-modules/raw/9232f8f0b2135df619bf7946eaa176b4ac35ccff/Microsoft_VC90_CRT_x86_x64.msm', | |
|
50 | 'size': 662528, | |
|
51 | 'sha256': '50d9639b5ad4844a2285269c7551bf5157ec636e32396ddcc6f7ec5bce487a7c', | |
|
52 | }, | |
|
53 | 'vc9-crt-x64-msm-policy': { | |
|
54 | 'url': 'https://github.com/indygreg/vc90-merge-modules/raw/9232f8f0b2135df619bf7946eaa176b4ac35ccff/policy_9_0_Microsoft_VC90_CRT_x86_x64.msm', | |
|
55 | 'size': 71168, | |
|
56 | 'sha256': '0550ea1929b21239134ad3a678c944ba0f05f11087117b6cf0833e7110686486', | |
|
57 | }, | |
|
58 | 'virtualenv': { | |
|
59 | 'url': 'https://files.pythonhosted.org/packages/37/db/89d6b043b22052109da35416abc3c397655e4bd3cff031446ba02b9654fa/virtualenv-16.4.3.tar.gz', | |
|
60 | 'size': 3713208, | |
|
61 | 'sha256': '984d7e607b0a5d1329425dd8845bd971b957424b5ba664729fab51ab8c11bc39', | |
|
62 | 'version': '16.4.3', | |
|
63 | }, | |
|
64 | 'wix': { | |
|
65 | 'url': 'https://github.com/wixtoolset/wix3/releases/download/wix3111rtm/wix311-binaries.zip', | |
|
66 | 'size': 34358269, | |
|
67 | 'sha256': '37f0a533b0978a454efb5dc3bd3598becf9660aaf4287e55bf68ca6b527d051d', | |
|
68 | 'version': '3.11.1', | |
|
69 | }, | |
|
70 | } | |
|
71 | ||
|
72 | ||
|
73 | def hash_path(p: pathlib.Path): | |
|
74 | h = hashlib.sha256() | |
|
75 | ||
|
76 | with p.open('rb') as fh: | |
|
77 | while True: | |
|
78 | chunk = fh.read(65536) | |
|
79 | if not chunk: | |
|
80 | break | |
|
81 | ||
|
82 | h.update(chunk) | |
|
83 | ||
|
84 | return h.hexdigest() | |
|
85 | ||
|
86 | ||
|
87 | class IntegrityError(Exception): | |
|
88 | """Represents an integrity error when downloading a URL.""" | |
|
89 | ||
|
90 | ||
|
91 | def secure_download_stream(url, size, sha256): | |
|
92 | """Securely download a URL to a stream of chunks. | |
|
93 | ||
|
94 | If the integrity of the download fails, an IntegrityError is | |
|
95 | raised. | |
|
96 | """ | |
|
97 | h = hashlib.sha256() | |
|
98 | length = 0 | |
|
99 | ||
|
100 | with urllib.request.urlopen(url) as fh: | |
|
101 | if not url.endswith('.gz') and fh.info().get('Content-Encoding') == 'gzip': | |
|
102 | fh = gzip.GzipFile(fileobj=fh) | |
|
103 | ||
|
104 | while True: | |
|
105 | chunk = fh.read(65536) | |
|
106 | if not chunk: | |
|
107 | break | |
|
108 | ||
|
109 | h.update(chunk) | |
|
110 | length += len(chunk) | |
|
111 | ||
|
112 | yield chunk | |
|
113 | ||
|
114 | digest = h.hexdigest() | |
|
115 | ||
|
116 | if length != size: | |
|
117 | raise IntegrityError('size mismatch on %s: wanted %d; got %d' % ( | |
|
118 | url, size, length)) | |
|
119 | ||
|
120 | if digest != sha256: | |
|
121 | raise IntegrityError('sha256 mismatch on %s: wanted %s; got %s' % ( | |
|
122 | url, sha256, digest)) | |
|
123 | ||
|
124 | ||
|
125 | def download_to_path(url: str, path: pathlib.Path, size: int, sha256: str): | |
|
126 | """Download a URL to a filesystem path, possibly with verification.""" | |
|
127 | ||
|
128 | # We download to a temporary file and rename at the end so there's | |
|
129 | # no chance of the final file being partially written or containing | |
|
130 | # bad data. | |
|
131 | print('downloading %s to %s' % (url, path)) | |
|
132 | ||
|
133 | if path.exists(): | |
|
134 | good = True | |
|
135 | ||
|
136 | if path.stat().st_size != size: | |
|
137 | print('existing file size is wrong; removing') | |
|
138 | good = False | |
|
139 | ||
|
140 | if good: | |
|
141 | if hash_path(path) != sha256: | |
|
142 | print('existing file hash is wrong; removing') | |
|
143 | good = False | |
|
144 | ||
|
145 | if good: | |
|
146 | print('%s exists and passes integrity checks' % path) | |
|
147 | return | |
|
148 | ||
|
149 | path.unlink() | |
|
150 | ||
|
151 | tmp = path.with_name('%s.tmp' % path.name) | |
|
152 | ||
|
153 | try: | |
|
154 | with tmp.open('wb') as fh: | |
|
155 | for chunk in secure_download_stream(url, size, sha256): | |
|
156 | fh.write(chunk) | |
|
157 | except IntegrityError: | |
|
158 | tmp.unlink() | |
|
159 | raise | |
|
160 | ||
|
161 | tmp.rename(path) | |
|
162 | print('successfully downloaded %s' % url) | |
|
163 | ||
|
164 | ||
|
165 | def download_entry(name: str, dest_path: pathlib.Path, local_name=None): | |
|
166 | entry = DOWNLOADS[name] | |
|
167 | ||
|
168 | url = entry['url'] | |
|
169 | ||
|
170 | local_name = local_name or url[url.rindex('/') + 1:] | |
|
171 | ||
|
172 | local_path = dest_path / local_name | |
|
173 | download_to_path(url, local_path, entry['size'], entry['sha256']) | |
|
174 | ||
|
175 | return local_path, entry |
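
For example, fetching the WiX toolset with integrity checking (the build
directory is arbitrary, entry names come from the ``DOWNLOADS`` dict above,
and the import path assumes the module is importable as ``downloads``)::

    import pathlib

    from downloads import download_entry

    build_dir = pathlib.Path('build')
    build_dir.mkdir(exist_ok=True)

    wix_zip, wix_entry = download_entry('wix', build_dir)
    print('have WiX %s at %s' % (wix_entry['version'], wix_zip))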
@@ -0,0 +1,78 b'' | |||
|
1 | # inno.py - Inno Setup functionality. | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # no-check-code because Python 3 native. | |
|
9 | ||
|
10 | import os | |
|
11 | import pathlib | |
|
12 | import shutil | |
|
13 | import subprocess | |
|
14 | ||
|
15 | from .py2exe import ( | |
|
16 | build_py2exe, | |
|
17 | ) | |
|
18 | from .util import ( | |
|
19 | find_vc_runtime_files, | |
|
20 | ) | |
|
21 | ||
|
22 | ||
|
23 | EXTRA_PACKAGES = { | |
|
24 | 'dulwich', | |
|
25 | 'keyring', | |
|
26 | 'pygments', | |
|
27 | 'win32ctypes', | |
|
28 | } | |
|
29 | ||
|
30 | ||
|
31 | def build(source_dir: pathlib.Path, build_dir: pathlib.Path, | |
|
32 | python_exe: pathlib.Path, iscc_exe: pathlib.Path, | |
|
33 | version=None): | |
|
34 | """Build the Inno installer. | |
|
35 | ||
|
36 | Build files will be placed in ``build_dir``. | |
|
37 | ||
|
38 | py2exe's setup.py doesn't use setuptools. It doesn't have modern logic | |
|
39 | for finding the Python 2.7 toolchain. So, we require the environment | |
|
40 | to already be configured with an active toolchain. | |
|
41 | """ | |
|
42 | if not iscc_exe.exists(): | |
|
43 | raise Exception('%s does not exist' % iscc_exe) | |
|
44 | ||
|
45 | vc_x64 = r'\x64' in os.environ.get('LIB', '') | |
|
46 | ||
|
47 | requirements_txt = (source_dir / 'contrib' / 'packaging' / | |
|
48 | 'inno' / 'requirements.txt') | |
|
49 | ||
|
50 | build_py2exe(source_dir, build_dir, python_exe, 'inno', | |
|
51 | requirements_txt, extra_packages=EXTRA_PACKAGES) | |
|
52 | ||
|
53 | # hg.exe depends on VC9 runtime DLLs. Copy those into place. | |
|
54 | for f in find_vc_runtime_files(vc_x64): | |
|
55 | if f.name.endswith('.manifest'): | |
|
56 | basename = 'Microsoft.VC90.CRT.manifest' | |
|
57 | else: | |
|
58 | basename = f.name | |
|
59 | ||
|
60 | dest_path = source_dir / 'dist' / basename | |
|
61 | ||
|
62 | print('copying %s to %s' % (f, dest_path)) | |
|
63 | shutil.copyfile(f, dest_path) | |
|
64 | ||
|
65 | print('creating installer') | |
|
66 | ||
|
67 | args = [str(iscc_exe)] | |
|
68 | ||
|
69 | if vc_x64: | |
|
70 | args.append('/dARCH=x64') | |
|
71 | ||
|
72 | if version: | |
|
73 | args.append('/dVERSION=%s' % version) | |
|
74 | ||
|
75 | args.append('/Odist') | |
|
76 | args.append('contrib/packaging/inno/mercurial.iss') | |
|
77 | ||
|
78 | subprocess.run(args, cwd=str(source_dir), check=True) |
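
A sketch of invoking this from a VC9-configured shell; all paths below are
placeholders for a typical ``c:\hgdev`` layout::

    import pathlib

    from inno import build

    build(source_dir=pathlib.Path(r'C:\hgdev\src'),
          build_dir=pathlib.Path(r'C:\hgdev\src\build'),
          python_exe=pathlib.Path(r'C:\hgdev\python27-x64\python.exe'),
          iscc_exe=pathlib.Path(
              r'C:\Program Files (x86)\Inno Setup 5\ISCC.exe'),
          version='4.9.1')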
@@ -0,0 +1,150 b'' | |||
|
1 | # py2exe.py - Functionality for performing py2exe builds. | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # no-check-code because Python 3 native. | |
|
9 | ||
|
10 | import os | |
|
11 | import pathlib | |
|
12 | import subprocess | |
|
13 | ||
|
14 | from .downloads import ( | |
|
15 | download_entry, | |
|
16 | ) | |
|
17 | from .util import ( | |
|
18 | extract_tar_to_directory, | |
|
19 | extract_zip_to_directory, | |
|
20 | python_exe_info, | |
|
21 | ) | |
|
22 | ||
|
23 | ||
|
24 | def build_py2exe(source_dir: pathlib.Path, build_dir: pathlib.Path, | |
|
25 | python_exe: pathlib.Path, build_name: str, | |
|
26 | venv_requirements_txt: pathlib.Path, | |
|
27 | extra_packages=None, extra_excludes=None, | |
|
28 | extra_dll_excludes=None, | |
|
29 | extra_packages_script=None): | |
|
30 | """Build Mercurial with py2exe. | |
|
31 | ||
|
32 | Build files will be placed in ``build_dir``. | |
|
33 | ||
|
34 | py2exe's setup.py doesn't use setuptools. It doesn't have modern logic | |
|
35 | for finding the Python 2.7 toolchain. So, we require the environment | |
|
36 | to already be configured with an active toolchain. | |
|
37 | """ | |
|
38 | if 'VCINSTALLDIR' not in os.environ: | |
|
39 | raise Exception('not running from a Visual C++ build environment; ' | |
|
40 | 'execute the "Visual C++ <version> Command Prompt" ' | |
|
41 | 'application shortcut or a vcvarsall.bat file') | |
|
42 | ||
|
43 | # Identify x86/x64 and validate the environment matches the Python | |
|
44 | # architecture. | |
|
45 | vc_x64 = r'\x64' in os.environ['LIB'] | |
|
46 | ||
|
47 | py_info = python_exe_info(python_exe) | |
|
48 | ||
|
49 | if vc_x64: | |
|
50 | if py_info['arch'] != '64bit': | |
|
51 | raise Exception('architecture mismatch: Visual C++ environment ' | |
|
52 | 'is configured for 64-bit but Python is 32-bit') | |
|
53 | else: | |
|
54 | if py_info['arch'] != '32bit': | |
|
55 | raise Exception('architecture mismatch: Visual C++ environment ' | |
|
56 | 'is configured for 32-bit but Python is 64-bit') | |
|
57 | ||
|
58 | if py_info['py3']: | |
|
59 | raise Exception('Only Python 2 is currently supported') | |
|
60 | ||
|
61 | build_dir.mkdir(exist_ok=True) | |
|
62 | ||
|
63 | gettext_pkg, gettext_entry = download_entry('gettext', build_dir) | |
|
64 | gettext_dep_pkg = download_entry('gettext-dep', build_dir)[0] | |
|
65 | virtualenv_pkg, virtualenv_entry = download_entry('virtualenv', build_dir) | |
|
66 | py2exe_pkg, py2exe_entry = download_entry('py2exe', build_dir) | |
|
67 | ||
|
68 | venv_path = build_dir / ('venv-%s-%s' % (build_name, | |
|
69 | 'x64' if vc_x64 else 'x86')) | |
|
70 | ||
|
71 | gettext_root = build_dir / ( | |
|
72 | 'gettext-win-%s' % gettext_entry['version']) | |
|
73 | ||
|
74 | if not gettext_root.exists(): | |
|
75 | extract_zip_to_directory(gettext_pkg, gettext_root) | |
|
76 | extract_zip_to_directory(gettext_dep_pkg, gettext_root) | |
|
77 | ||
|
78 | # This assumes Python 2. We don't need virtualenv on Python 3. | |
|
79 | virtualenv_src_path = build_dir / ( | |
|
80 | 'virtualenv-%s' % virtualenv_entry['version']) | |
|
81 | virtualenv_py = virtualenv_src_path / 'virtualenv.py' | |
|
82 | ||
|
83 | if not virtualenv_src_path.exists(): | |
|
84 | extract_tar_to_directory(virtualenv_pkg, build_dir) | |
|
85 | ||
|
86 | py2exe_source_path = build_dir / ('py2exe-%s' % py2exe_entry['version']) | |
|
87 | ||
|
88 | if not py2exe_source_path.exists(): | |
|
89 | extract_zip_to_directory(py2exe_pkg, build_dir) | |
|
90 | ||
|
91 | if not venv_path.exists(): | |
|
92 | print('creating virtualenv with dependencies') | |
|
93 | subprocess.run( | |
|
94 | [str(python_exe), str(virtualenv_py), str(venv_path)], | |
|
95 | check=True) | |
|
96 | ||
|
97 | venv_python = venv_path / 'Scripts' / 'python.exe' | |
|
98 | venv_pip = venv_path / 'Scripts' / 'pip.exe' | |
|
99 | ||
|
100 | subprocess.run([str(venv_pip), 'install', '-r', str(venv_requirements_txt)], | |
|
101 | check=True) | |
|
102 | ||
|
103 | # Force distutils to use VC++ settings from environment, which was | |
|
104 | # validated above. | |
|
105 | env = dict(os.environ) | |
|
106 | env['DISTUTILS_USE_SDK'] = '1' | |
|
107 | env['MSSdk'] = '1' | |
|
108 | ||
|
109 | if extra_packages_script: | |
|
110 | more_packages = set(subprocess.check_output( | |
|
111 | extra_packages_script, | |
|
112 | cwd=build_dir).split(b'\0')[-1].strip().decode('utf-8').splitlines()) | |
|
113 | if more_packages: | |
|
114 | if not extra_packages: | |
|
115 | extra_packages = more_packages | |
|
116 | else: | |
|
117 | extra_packages |= more_packages | |
|
118 | ||
|
119 | if extra_packages: | |
|
120 | env['HG_PY2EXE_EXTRA_PACKAGES'] = ' '.join(sorted(extra_packages)) | |
|
121 | hgext3rd_extras = sorted( | |
|
122 | e for e in extra_packages if e.startswith('hgext3rd.')) | |
|
123 | if hgext3rd_extras: | |
|
124 | env['HG_PY2EXE_EXTRA_INSTALL_PACKAGES'] = ' '.join(hgext3rd_extras) | |
|
125 | if extra_excludes: | |
|
126 | env['HG_PY2EXE_EXTRA_EXCLUDES'] = ' '.join(sorted(extra_excludes)) | |
|
127 | if extra_dll_excludes: | |
|
128 | env['HG_PY2EXE_EXTRA_DLL_EXCLUDES'] = ' '.join( | |
|
129 | sorted(extra_dll_excludes)) | |
|
130 | ||
|
131 | py2exe_py_path = venv_path / 'Lib' / 'site-packages' / 'py2exe' | |
|
132 | if not py2exe_py_path.exists(): | |
|
133 | print('building py2exe') | |
|
134 | subprocess.run([str(venv_python), 'setup.py', 'install'], | |
|
135 | cwd=py2exe_source_path, | |
|
136 | env=env, | |
|
137 | check=True) | |
|
138 | ||
|
139 | # Register location of msgfmt and other binaries. | |
|
140 | env['PATH'] = '%s%s%s' % ( | |
|
141 | env['PATH'], os.pathsep, str(gettext_root / 'bin')) | |
|
142 | ||
|
143 | print('building Mercurial') | |
|
144 | subprocess.run( | |
|
145 | [str(venv_python), 'setup.py', | |
|
146 | 'py2exe', | |
|
147 | 'build_doc', '--html'], | |
|
148 | cwd=str(source_dir), | |
|
149 | env=env, | |
|
150 | check=True) |
@@ -0,0 +1,155 b'' | |||
|
1 | # util.py - Common packaging utility code. | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # no-check-code because Python 3 native. | |
|
9 | ||
|
10 | import distutils.version | |
|
11 | import getpass | |
|
12 | import os | |
|
13 | import pathlib | |
|
14 | import subprocess | |
|
15 | import tarfile | |
|
16 | import zipfile | |
|
17 | ||
|
18 | ||
|
19 | def extract_tar_to_directory(source: pathlib.Path, dest: pathlib.Path): | |
|
20 | with tarfile.open(source, 'r') as tf: | |
|
21 | tf.extractall(dest) | |
|
22 | ||
|
23 | ||
|
24 | def extract_zip_to_directory(source: pathlib.Path, dest: pathlib.Path): | |
|
25 | with zipfile.ZipFile(source, 'r') as zf: | |
|
26 | zf.extractall(dest) | |
|
27 | ||
|
28 | ||
|
29 | def find_vc_runtime_files(x64=False): | |
|
30 | """Finds Visual C++ Runtime DLLs to include in distribution.""" | |
|
31 | winsxs = pathlib.Path(os.environ['SYSTEMROOT']) / 'WinSxS' | |
|
32 | ||
|
33 | prefix = 'amd64' if x64 else 'x86' | |
|
34 | ||
|
35 | candidates = sorted(p for p in os.listdir(winsxs) | |
|
36 | if p.lower().startswith('%s_microsoft.vc90.crt_' % prefix)) | |
|
37 | ||
|
38 | for p in candidates: | |
|
39 | print('found candidate VC runtime: %s' % p) | |
|
40 | ||
|
41 | # Take the newest version. | |
|
42 | version = candidates[-1] | |
|
43 | ||
|
44 | d = winsxs / version | |
|
45 | ||
|
46 | return [ | |
|
47 | d / 'msvcm90.dll', | |
|
48 | d / 'msvcp90.dll', | |
|
49 | d / 'msvcr90.dll', | |
|
50 | winsxs / 'Manifests' / ('%s.manifest' % version), | |
|
51 | ] | |
|
52 | ||
|
53 | ||
|
54 | def windows_10_sdk_info(): | |
|
55 | """Resolves information about the Windows 10 SDK.""" | |
|
56 | ||
|
57 | base = pathlib.Path(os.environ['ProgramFiles(x86)']) / 'Windows Kits' / '10' | |
|
58 | ||
|
59 | if not base.is_dir(): | |
|
60 | raise Exception('unable to find Windows 10 SDK at %s' % base) | |
|
61 | ||
|
62 | # Find the latest version. | |
|
63 | bin_base = base / 'bin' | |
|
64 | ||
|
65 | versions = [v for v in os.listdir(bin_base) if v.startswith('10.')] | |
|
66 | version = sorted(versions, reverse=True)[0] | |
|
67 | ||
|
68 | bin_version = bin_base / version | |
|
69 | ||
|
70 | return { | |
|
71 | 'root': base, | |
|
72 | 'version': version, | |
|
73 | 'bin_root': bin_version, | |
|
74 | 'bin_x86': bin_version / 'x86', | |
|
75 | 'bin_x64': bin_version / 'x64' | |
|
76 | } | |
|
77 | ||
|
78 | ||
|
79 | def find_signtool(): | |
|
80 | """Find signtool.exe from the Windows SDK.""" | |
|
81 | sdk = windows_10_sdk_info() | |
|
82 | ||
|
83 | for key in ('bin_x64', 'bin_x86'): | |
|
84 | p = sdk[key] / 'signtool.exe' | |
|
85 | ||
|
86 | if p.exists(): | |
|
87 | return p | |
|
88 | ||
|
89 | raise Exception('could not find signtool.exe in Windows 10 SDK') | |
|
90 | ||
|
91 | ||
|
92 | def sign_with_signtool(file_path, description, subject_name=None, | |
|
93 | cert_path=None, cert_password=None, | |
|
94 | timestamp_url=None): | |
|
95 | """Digitally sign a file with signtool.exe. | |
|
96 | ||
|
97 | ``file_path`` is file to sign. | |
|
98 | ``description`` is text that goes in the signature. | |
|
99 | ||
|
100 | The signing certificate can be specified by ``cert_path`` or | |
|
101 | ``subject_name``. These correspond to the ``/f`` and ``/n`` arguments | |
|
102 | to signtool.exe, respectively. | |
|
103 | ||
|
104 | The certificate password can be specified via ``cert_password``. If | |
|
105 | not provided, you will be prompted for the password. | |
|
106 | ||
|
107 | ``timestamp_url`` is the URL of a RFC 3161 timestamp server (``/tr`` | |
|
108 | argument to signtool.exe). | |
|
109 | """ | |
|
110 | if cert_path and subject_name: | |
|
111 | raise ValueError('cannot specify both cert_path and subject_name') | |
|
112 | ||
|
113 | while cert_path and not cert_password: | |
|
114 | cert_password = getpass.getpass('password for %s: ' % cert_path) | |
|
115 | ||
|
116 | args = [ | |
|
117 | str(find_signtool()), 'sign', | |
|
118 | '/v', | |
|
119 | '/fd', 'sha256', | |
|
120 | '/d', description, | |
|
121 | ] | |
|
122 | ||
|
123 | if cert_path: | |
|
124 | args.extend(['/f', str(cert_path), '/p', cert_password]) | |
|
125 | elif subject_name: | |
|
126 | args.extend(['/n', subject_name]) | |
|
127 | ||
|
128 | if timestamp_url: | |
|
129 | args.extend(['/tr', timestamp_url, '/td', 'sha256']) | |
|
130 | ||
|
131 | args.append(str(file_path)) | |
|
132 | ||
|
133 | print('signing %s' % file_path) | |
|
134 | subprocess.run(args, check=True) | |
|
135 | ||
|
136 | ||
|
137 | PRINT_PYTHON_INFO = ''' | |
|
138 | import platform; print("%s:%s" % (platform.architecture()[0], platform.python_version())) | |
|
139 | '''.strip() | |
|
140 | ||
|
141 | ||
|
142 | def python_exe_info(python_exe: pathlib.Path): | |
|
143 | """Obtain information about a Python executable.""" | |
|
144 | ||
|
145 | res = subprocess.check_output([str(python_exe), '-c', PRINT_PYTHON_INFO]) | |
|
146 | ||
|
147 | arch, version = res.decode('utf-8').split(':') | |
|
148 | ||
|
149 | version = distutils.version.LooseVersion(version) | |
|
150 | ||
|
151 | return { | |
|
152 | 'arch': arch, | |
|
153 | 'version': version, | |
|
154 | 'py3': version >= distutils.version.LooseVersion('3'), | |
|
155 | } |
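
For instance, ``python_exe_info()`` is what lets ``build_py2exe()`` validate
the toolchain/interpreter pairing. A quick probe looks like this; the
interpreter path is a placeholder::

    import pathlib

    from util import python_exe_info

    info = python_exe_info(pathlib.Path(r'C:\hgdev\python27-x64\python.exe'))
    assert info['arch'] == '64bit'
    assert not info['py3']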
@@ -0,0 +1,327 b'' | |||
|
1 | # wix.py - WiX installer functionality | |
|
2 | # | |
|
3 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
4 | # | |
|
5 | # This software may be used and distributed according to the terms of the | |
|
6 | # GNU General Public License version 2 or any later version. | |
|
7 | ||
|
8 | # no-check-code because Python 3 native. | |
|
9 | ||
|
10 | import os | |
|
11 | import pathlib | |
|
12 | import re | |
|
13 | import subprocess | |
|
14 | import tempfile | |
|
15 | import typing | |
|
16 | import xml.dom.minidom | |
|
17 | ||
|
18 | from .downloads import ( | |
|
19 | download_entry, | |
|
20 | ) | |
|
21 | from .py2exe import ( | |
|
22 | build_py2exe, | |
|
23 | ) | |
|
24 | from .util import ( | |
|
25 | extract_zip_to_directory, | |
|
26 | sign_with_signtool, | |
|
27 | ) | |
|
28 | ||
|
29 | ||
|
30 | SUPPORT_WXS = [ | |
|
31 | ('contrib.wxs', r'contrib'), | |
|
32 | ('dist.wxs', r'dist'), | |
|
33 | ('doc.wxs', r'doc'), | |
|
34 | ('help.wxs', r'mercurial\help'), | |
|
35 | ('i18n.wxs', r'i18n'), | |
|
36 | ('locale.wxs', r'mercurial\locale'), | |
|
37 | ('templates.wxs', r'mercurial\templates'), | |
|
38 | ] | |
|
39 | ||
|
40 | ||
|
41 | EXTRA_PACKAGES = { | |
|
42 | 'distutils', | |
|
43 | 'pygments', | |
|
44 | } | |
|
45 | ||
|
46 | ||
|
47 | def find_version(source_dir: pathlib.Path): | |
|
48 | version_py = source_dir / 'mercurial' / '__version__.py' | |
|
49 | ||
|
50 | with version_py.open('r', encoding='utf-8') as fh: | |
|
51 | source = fh.read().strip() | |
|
52 | ||
|
53 | m = re.search('version = b"(.*)"', source) | |
|
54 | return m.group(1) | |
|
55 | ||
|
56 | ||
|
57 | def normalize_version(version): | |
|
58 | """Normalize Mercurial version string so WiX accepts it. | |
|
59 | ||
|
60 | Version strings have to be numeric X.Y.Z. | |
|
61 | """ | |
|
62 | ||
|
63 | if '+' in version: | |
|
64 | version, extra = version.split('+', 1) | |
|
65 | else: | |
|
66 | extra = None | |
|
67 | ||
|
68 | # 4.9rc0 | |
|
69 | if version[:-1].endswith('rc'): | |
|
70 | version = version[:-3] | |
|
71 | ||
|
72 | versions = [int(v) for v in version.split('.')] | |
|
73 | while len(versions) < 3: | |
|
74 | versions.append(0) | |
|
75 | ||
|
76 | major, minor, build = versions[:3] | |
|
77 | ||
|
78 | if extra: | |
|
79 | # <commit count>-<hash>+<date> | |
|
80 | build = int(extra.split('-')[0]) | |
|
81 | ||
|
82 | return '.'.join('%d' % x for x in (major, minor, build)) | |
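
Worked examples, derived from the logic above::

    normalize_version('4.9')             # -> '4.9.0' (padded to X.Y.Z)
    normalize_version('4.9rc0')          # -> '4.9.0' (rc suffix stripped)
    normalize_version('4.9.1+2-abc123')  # -> '4.9.2' (commit count becomes build)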
|
83 | ||
|
84 | ||
|
85 | def ensure_vc90_merge_modules(build_dir): | |
|
86 | x86 = ( | |
|
87 | download_entry('vc9-crt-x86-msm', build_dir, | |
|
88 | local_name='microsoft.vcxx.crt.x86_msm.msm')[0], | |
|
89 | download_entry('vc9-crt-x86-msm-policy', build_dir, | |
|
90 | local_name='policy.x.xx.microsoft.vcxx.crt.x86_msm.msm')[0] | |
|
91 | ) | |
|
92 | ||
|
93 | x64 = ( | |
|
94 | download_entry('vc9-crt-x64-msm', build_dir, | |
|
95 | local_name='microsoft.vcxx.crt.x64_msm.msm')[0], | |
|
96 | download_entry('vc9-crt-x64-msm-policy', build_dir, | |
|
97 | local_name='policy.x.xx.microsoft.vcxx.crt.x64_msm.msm')[0] | |
|
98 | ) | |
|
99 | return { | |
|
100 | 'x86': x86, | |
|
101 | 'x64': x64, | |
|
102 | } | |
|
103 | ||
|
104 | ||
|
105 | def run_candle(wix, cwd, wxs, source_dir, defines=None): | |
|
106 | args = [ | |
|
107 | str(wix / 'candle.exe'), | |
|
108 | '-nologo', | |
|
109 | str(wxs), | |
|
110 | '-dSourceDir=%s' % source_dir, | |
|
111 | ] | |
|
112 | ||
|
113 | if defines: | |
|
114 | args.extend('-d%s=%s' % define for define in sorted(defines.items())) | |
|
115 | ||
|
116 | subprocess.run(args, cwd=str(cwd), check=True) | |
|
117 | ||
|
118 | ||
|
119 | def make_post_build_signing_fn(name, subject_name=None, cert_path=None, | |
|
120 | cert_password=None, timestamp_url=None): | |
|
121 | """Create a callable that will use signtool to sign hg.exe.""" | |
|
122 | ||
|
123 | def post_build_sign(source_dir, build_dir, dist_dir, version): | |
|
124 | description = '%s %s' % (name, version) | |
|
125 | ||
|
126 | sign_with_signtool(dist_dir / 'hg.exe', description, | |
|
127 | subject_name=subject_name, cert_path=cert_path, | |
|
128 | cert_password=cert_password, | |
|
129 | timestamp_url=timestamp_url) | |
|
130 | ||
|
131 | return post_build_sign | |
|
132 | ||
|
133 | ||
|
134 | LIBRARIES_XML = ''' | |
|
135 | <?xml version="1.0" encoding="utf-8"?> | |
|
136 | <Wix xmlns="http://schemas.microsoft.com/wix/2006/wi"> | |
|
137 | ||
|
138 | <?include {wix_dir}/guids.wxi ?> | |
|
139 | <?include {wix_dir}/defines.wxi ?> | |
|
140 | ||
|
141 | <Fragment> | |
|
142 | <DirectoryRef Id="INSTALLDIR" FileSource="$(var.SourceDir)"> | |
|
143 | <Directory Id="libdir" Name="lib" FileSource="$(var.SourceDir)/lib"> | |
|
144 | <Component Id="libOutput" Guid="$(var.lib.guid)" Win64='$(var.IsX64)'> | |
|
145 | </Component> | |
|
146 | </Directory> | |
|
147 | </DirectoryRef> | |
|
148 | </Fragment> | |
|
149 | </Wix> | |
|
150 | '''.lstrip() | |
|
151 | ||
|
152 | ||
|
153 | def make_libraries_xml(wix_dir: pathlib.Path, dist_dir: pathlib.Path): | |
|
154 | """Make XML data for library components WXS.""" | |
|
155 | # We can't use ElementTree because it doesn't handle the | |
|
156 | # <?include ?> directives. | |
|
157 | doc = xml.dom.minidom.parseString( | |
|
158 | LIBRARIES_XML.format(wix_dir=str(wix_dir))) | |
|
159 | ||
|
160 | component = doc.getElementsByTagName('Component')[0] | |
|
161 | ||
|
162 | f = doc.createElement('File') | |
|
163 | f.setAttribute('Name', 'library.zip') | |
|
164 | f.setAttribute('KeyPath', 'yes') | |
|
165 | component.appendChild(f) | |
|
166 | ||
|
167 | lib_dir = dist_dir / 'lib' | |
|
168 | ||
|
169 | for p in sorted(lib_dir.iterdir()): | |
|
170 | if not p.name.endswith(('.dll', '.pyd')): | |
|
171 | continue | |
|
172 | ||
|
173 | f = doc.createElement('File') | |
|
174 | f.setAttribute('Name', p.name) | |
|
175 | component.appendChild(f) | |
|
176 | ||
|
177 | return doc.toprettyxml() | |
|
178 | ||
|
179 | ||
|
180 | def build_installer(source_dir: pathlib.Path, python_exe: pathlib.Path, | |
|
181 | msi_name='mercurial', version=None, post_build_fn=None, | |
|
182 | extra_packages_script=None, | |
|
183 | extra_wxs:typing.Optional[typing.Dict[str,str]]=None, | |
|
184 | extra_features:typing.Optional[typing.List[str]]=None): | |
|
185 | """Build a WiX MSI installer. | |
|
186 | ||
|
187 | ``source_dir`` is the path to the Mercurial source tree to use. | |
|
188 | ``arch``, either ``x86`` or ``x64``, is detected from the build environment rather than passed in. | |
|
189 | ``python_exe`` is the path to the Python executable to use/bundle. | |
|
190 | ``version`` is the Mercurial version string. If not defined, | |
|
191 | ``mercurial/__version__.py`` will be consulted. | |
|
192 | ``post_build_fn`` is a callable that will be called after building | |
|
193 | Mercurial but before invoking WiX. It can be used to e.g. facilitate | |
|
194 | signing. It is passed the paths to the Mercurial source, build, and | |
|
195 | dist directories and the resolved Mercurial version. | |
|
196 | ``extra_packages_script`` is a command to be run to inject extra packages | |
|
197 | into the py2exe binary. It should stage packages into the virtualenv and | |
|
198 | print a null byte followed by a newline-separated list of packages that | |
|
199 | should be included in the exe. | |
|
200 | ``extra_wxs`` is a dict of {wxs_name: working_dir_for_wxs_build}. | |
|
201 | ``extra_features`` is a list of additional named Features to include in | |
|
202 | the build. These must match Feature names in one of the wxs scripts. | |
|
203 | """ | |
|
204 | arch = 'x64' if r'\x64' in os.environ.get('LIB', '') else 'x86' | |
|
205 | ||
|
206 | hg_build_dir = source_dir / 'build' | |
|
207 | dist_dir = source_dir / 'dist' | |
|
208 | wix_dir = source_dir / 'contrib' / 'packaging' / 'wix' | |
|
209 | ||
|
210 | requirements_txt = wix_dir / 'requirements.txt' | |
|
211 | ||
|
212 | build_py2exe(source_dir, hg_build_dir, | |
|
213 | python_exe, 'wix', requirements_txt, | |
|
214 | extra_packages=EXTRA_PACKAGES, | |
|
215 | extra_packages_script=extra_packages_script) | |
|
216 | ||
|
217 | version = version or normalize_version(find_version(source_dir)) | |
|
218 | print('using version string: %s' % version) | |
|
219 | ||
|
220 | if post_build_fn: | |
|
221 | post_build_fn(source_dir, hg_build_dir, dist_dir, version) | |
|
222 | ||
|
223 | build_dir = hg_build_dir / ('wix-%s' % arch) | |
|
224 | ||
|
225 | build_dir.mkdir(exist_ok=True) | |
|
226 | ||
|
227 | wix_pkg, wix_entry = download_entry('wix', hg_build_dir) | |
|
228 | wix_path = hg_build_dir / ('wix-%s' % wix_entry['version']) | |
|
229 | ||
|
230 | if not wix_path.exists(): | |
|
231 | extract_zip_to_directory(wix_pkg, wix_path) | |
|
232 | ||
|
233 | ensure_vc90_merge_modules(hg_build_dir) | |
|
234 | ||
|
235 | source_build_rel = pathlib.Path(os.path.relpath(source_dir, build_dir)) | |
|
236 | ||
|
237 | defines = {'Platform': arch} | |
|
238 | ||
|
239 | for wxs, rel_path in SUPPORT_WXS: | |
|
240 | wxs = wix_dir / wxs | |
|
241 | wxs_source_dir = source_dir / rel_path | |
|
242 | run_candle(wix_path, build_dir, wxs, wxs_source_dir, defines=defines) | |
|
243 | ||
|
244 | for source, rel_path in sorted((extra_wxs or {}).items()): | |
|
245 | run_candle(wix_path, build_dir, source, rel_path, defines=defines) | |
|
246 | ||
|
247 | # candle.exe doesn't like when we have an open handle on the file. | |
|
248 | # So use TemporaryDirectory() instead of NamedTemporaryFile(). | |
|
249 | with tempfile.TemporaryDirectory() as td: | |
|
250 | td = pathlib.Path(td) | |
|
251 | ||
|
252 | tf = td / 'library.wxs' | |
|
253 | with tf.open('w') as fh: | |
|
254 | fh.write(make_libraries_xml(wix_dir, dist_dir)) | |
|
255 | ||
|
256 | run_candle(wix_path, build_dir, tf, dist_dir, defines=defines) | |
|
257 | ||
|
258 | source = wix_dir / 'mercurial.wxs' | |
|
259 | defines['Version'] = version | |
|
260 | defines['Comments'] = 'Installs Mercurial version %s' % version | |
|
261 | defines['VCRedistSrcDir'] = str(hg_build_dir) | |
|
262 | if extra_features: | |
|
263 | assert all(';' not in f for f in extra_features) | |
|
264 | defines['MercurialExtraFeatures'] = ';'.join(extra_features) | |
|
265 | ||
|
266 | run_candle(wix_path, build_dir, source, source_build_rel, defines=defines) | |
|
267 | ||
|
268 | msi_path = source_dir / 'dist' / ( | |
|
269 | '%s-%s-%s.msi' % (msi_name, version, arch)) | |
|
270 | ||
|
271 | args = [ | |
|
272 | str(wix_path / 'light.exe'), | |
|
273 | '-nologo', | |
|
274 | '-ext', 'WixUIExtension', | |
|
275 | '-sw1076', | |
|
276 | '-spdb', | |
|
277 | '-o', str(msi_path), | |
|
278 | ] | |
|
279 | ||
|
280 | for source, rel_path in SUPPORT_WXS: | |
|
281 | assert source.endswith('.wxs') | |
|
282 | args.append(str(build_dir / ('%s.wixobj' % source[:-4]))) | |
|
283 | ||
|
284 | for source, rel_path in sorted((extra_wxs or {}).items()): | |
|
285 | assert source.endswith('.wxs') | |
|
286 | source = os.path.basename(source) | |
|
287 | args.append(str(build_dir / ('%s.wixobj' % source[:-4]))) | |
|
288 | ||
|
289 | args.extend([ | |
|
290 | str(build_dir / 'library.wixobj'), | |
|
291 | str(build_dir / 'mercurial.wixobj'), | |
|
292 | ]) | |
|
293 | ||
|
294 | subprocess.run(args, cwd=str(source_dir), check=True) | |
|
295 | ||
|
296 | print('%s created' % msi_path) | |
|
297 | ||
|
298 | return { | |
|
299 | 'msi_path': msi_path, | |
|
300 | } | |
|
301 | ||
|
302 | ||
|
303 | def build_signed_installer(source_dir: pathlib.Path, python_exe: pathlib.Path, | |
|
304 | name: str, version=None, subject_name=None, | |
|
305 | cert_path=None, cert_password=None, | |
|
306 | timestamp_url=None, extra_packages_script=None, | |
|
307 | extra_wxs=None, extra_features=None): | |
|
308 | """Build an installer with signed executables.""" | |
|
309 | ||
|
310 | post_build_fn = make_post_build_signing_fn( | |
|
311 | name, | |
|
312 | subject_name=subject_name, | |
|
313 | cert_path=cert_path, | |
|
314 | cert_password=cert_password, | |
|
315 | timestamp_url=timestamp_url) | |
|
316 | ||
|
317 | info = build_installer(source_dir, python_exe=python_exe, | |
|
318 | msi_name=name.lower(), version=version, | |
|
319 | post_build_fn=post_build_fn, | |
|
320 | extra_packages_script=extra_packages_script, | |
|
321 | extra_wxs=extra_wxs, extra_features=extra_features) | |
|
322 | ||
|
323 | description = '%s %s' % (name, version) | |
|
324 | ||
|
325 | sign_with_signtool(info['msi_path'], description, | |
|
326 | subject_name=subject_name, cert_path=cert_path, | |
|
327 | cert_password=cert_password, timestamp_url=timestamp_url) |
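For reference, a minimal script satisfying the ``extra_packages_script`` contract described in ``build_installer``'s docstring could look like this sketch (the package name is hypothetical; the only requirements are staging the packages and emitting a NUL byte followed by the package list)::

    #!/usr/bin/env python3
    # Hypothetical extra-packages script: stage extra packages into the
    # active virtualenv, then print a NUL byte followed by a
    # newline-separated list of package names to bundle into the exe.
    import subprocess
    import sys

    PACKAGES = ['myextension']  # hypothetical package name

    subprocess.run([sys.executable, '-m', 'pip', 'install'] + PACKAGES,
                   check=True)

    sys.stdout.write('\0')
    sys.stdout.write('\n'.join(PACKAGES))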
@@ -0,0 +1,51 b'' | |||
|
1 | #!/usr/bin/env python3 | |
|
2 | # build.py - Inno installer build script. | |
|
3 | # | |
|
4 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
5 | # | |
|
6 | # This software may be used and distributed according to the terms of the | |
|
7 | # GNU General Public License version 2 or any later version. | |
|
8 | ||
|
9 | # This script automates the building of the Inno installer for Mercurial. | |
|
10 | ||
|
11 | # no-check-code because Python 3 native. | |
|
12 | ||
|
13 | import argparse | |
|
14 | import os | |
|
15 | import pathlib | |
|
16 | import sys | |
|
17 | ||
|
18 | ||
|
19 | if __name__ == '__main__': | |
|
20 | parser = argparse.ArgumentParser() | |
|
21 | ||
|
22 | parser.add_argument('--python', | |
|
23 | required=True, | |
|
24 | help='path to python.exe to use') | |
|
25 | parser.add_argument('--iscc', | |
|
26 | help='path to iscc.exe to use') | |
|
27 | parser.add_argument('--version', | |
|
28 | help='Mercurial version string to use ' | |
|
29 | '(detected from __version__.py if not defined)') | |
|
30 | ||
|
31 | args = parser.parse_args() | |
|
32 | ||
|
33 | if not os.path.isabs(args.python): | |
|
34 | raise Exception('--python arg must be an absolute path') | |
|
35 | ||
|
36 | if args.iscc: | |
|
37 | iscc = pathlib.Path(args.iscc) | |
|
38 | else: | |
|
39 | iscc = (pathlib.Path(os.environ['ProgramFiles(x86)']) / 'Inno Setup 5' / | |
|
40 | 'ISCC.exe') | |
|
41 | ||
|
42 | here = pathlib.Path(os.path.abspath(os.path.dirname(__file__))) | |
|
43 | source_dir = here.parent.parent.parent | |
|
44 | build_dir = source_dir / 'build' | |
|
45 | ||
|
46 | sys.path.insert(0, str(source_dir / 'contrib' / 'packaging')) | |
|
47 | ||
|
48 | from hgpackaging.inno import build | |
|
49 | ||
|
50 | build(source_dir, build_dir, pathlib.Path(args.python), iscc, | |
|
51 | version=args.version) |
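Assuming Inno Setup 5 is installed in its default location, a typical invocation looks like (interpreter and path depend on the build machine)::

    $ python3 contrib/packaging/inno/build.py --python c:\python27\python.exe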
@@ -0,0 +1,219 b'' | |||
|
1 | // ---------------------------------------------------------------------------- | |
|
2 | // | |
|
3 | // Inno Setup Ver: 5.4.2 | |
|
4 | // Script Version: 1.4.2 | |
|
5 | // Author: Jared Breland <jbreland@legroom.net> | |
|
6 | // Homepage: http://www.legroom.net/software | |
|
7 | // License: GNU Lesser General Public License (LGPL), version 3 | |
|
8 | // http://www.gnu.org/licenses/lgpl.html | |
|
9 | // | |
|
10 | // Script Function: | |
|
11 | // Allow modification of environmental path directly from Inno Setup installers | |
|
12 | // | |
|
13 | // Instructions: | |
|
14 | // Copy modpath.iss to the same directory as your setup script | |
|
15 | // | |
|
16 | // Add this statement to your [Setup] section | |
|
17 | // ChangesEnvironment=true | |
|
18 | // | |
|
19 | // Add this statement to your [Tasks] section | |
|
20 | // You can change the Description or Flags | |
|
21 | // You can change the Name, but it must match the ModPathName setting below | |
|
22 | // Name: modifypath; Description: &Add application directory to your environmental path; Flags: unchecked | |
|
23 | // | |
|
24 | // Add the following to the end of your [Code] section | |
|
25 | // ModPathName defines the name of the task defined above | |
|
26 | // ModPathType defines whether the 'user' or 'system' path will be modified; | |
|
27 | // this will default to user if anything other than system is set | |
|
28 | // setArrayLength must specify the total number of dirs to be added | |
|
29 | // Result[0] contains first directory, Result[1] contains second, etc. | |
|
30 | // const | |
|
31 | // ModPathName = 'modifypath'; | |
|
32 | // ModPathType = 'user'; | |
|
33 | // | |
|
34 | // function ModPathDir(): TArrayOfString; | |
|
35 | // begin | |
|
36 | // setArrayLength(Result, 1); | |
|
37 | // Result[0] := ExpandConstant('{app}'); | |
|
38 | // end; | |
|
39 | // #include "modpath.iss" | |
|
40 | // ---------------------------------------------------------------------------- | |
|
41 | ||
|
42 | procedure ModPath(); | |
|
43 | var | |
|
44 | oldpath: String; | |
|
45 | newpath: String; | |
|
46 | updatepath: Boolean; | |
|
47 | pathArr: TArrayOfString; | |
|
48 | aExecFile: String; | |
|
49 | aExecArr: TArrayOfString; | |
|
50 | i, d: Integer; | |
|
51 | pathdir: TArrayOfString; | |
|
52 | regroot: Integer; | |
|
53 | regpath: String; | |
|
54 | ||
|
55 | begin | |
|
56 | // Get constants from main script and adjust behavior accordingly | |
|
57 | // ModPathType MUST be 'system' or 'user'; force 'user' if invalid | |
|
58 | if ModPathType = 'system' then begin | |
|
59 | regroot := HKEY_LOCAL_MACHINE; | |
|
60 | regpath := 'SYSTEM\CurrentControlSet\Control\Session Manager\Environment'; | |
|
61 | end else begin | |
|
62 | regroot := HKEY_CURRENT_USER; | |
|
63 | regpath := 'Environment'; | |
|
64 | end; | |
|
65 | ||
|
66 | // Get array of new directories and act on each individually | |
|
67 | pathdir := ModPathDir(); | |
|
68 | for d := 0 to GetArrayLength(pathdir)-1 do begin | |
|
69 | updatepath := true; | |
|
70 | ||
|
71 | // Modify WinNT path | |
|
72 | if UsingWinNT() = true then begin | |
|
73 | ||
|
74 | // Get current path, split into an array | |
|
75 | RegQueryStringValue(regroot, regpath, 'Path', oldpath); | |
|
76 | oldpath := oldpath + ';'; | |
|
77 | i := 0; | |
|
78 | ||
|
79 | while (Pos(';', oldpath) > 0) do begin | |
|
80 | SetArrayLength(pathArr, i+1); | |
|
81 | pathArr[i] := Copy(oldpath, 0, Pos(';', oldpath)-1); | |
|
82 | oldpath := Copy(oldpath, Pos(';', oldpath)+1, Length(oldpath)); | |
|
83 | i := i + 1; | |
|
84 | ||
|
85 | // Check if current directory matches app dir | |
|
86 | if pathdir[d] = pathArr[i-1] then begin | |
|
87 | // if uninstalling, remove dir from path | |
|
88 | if IsUninstaller() = true then begin | |
|
89 | continue; | |
|
90 | // if installing, flag that dir already exists in path | |
|
91 | end else begin | |
|
92 | updatepath := false; | |
|
93 | end; | |
|
94 | end; | |
|
95 | ||
|
96 | // Add current directory to new path | |
|
97 | if i = 1 then begin | |
|
98 | newpath := pathArr[i-1]; | |
|
99 | end else begin | |
|
100 | newpath := newpath + ';' + pathArr[i-1]; | |
|
101 | end; | |
|
102 | end; | |
|
103 | ||
|
104 | // Append app dir to path if not already included | |
|
105 | if (IsUninstaller() = false) AND (updatepath = true) then | |
|
106 | newpath := newpath + ';' + pathdir[d]; | |
|
107 | ||
|
108 | // Write new path | |
|
109 | RegWriteStringValue(regroot, regpath, 'Path', newpath); | |
|
110 | ||
|
111 | // Modify Win9x path | |
|
112 | end else begin | |
|
113 | ||
|
114 | // Convert to shortened dirname | |
|
115 | pathdir[d] := GetShortName(pathdir[d]); | |
|
116 | ||
|
117 | // If autoexec.bat exists, check if app dir already exists in path | |
|
118 | aExecFile := 'C:\AUTOEXEC.BAT'; | |
|
119 | if FileExists(aExecFile) then begin | |
|
120 | LoadStringsFromFile(aExecFile, aExecArr); | |
|
121 | for i := 0 to GetArrayLength(aExecArr)-1 do begin | |
|
122 | if IsUninstaller() = false then begin | |
|
123 | // If app dir already exists while installing, skip add | |
|
124 | if (Pos(pathdir[d], aExecArr[i]) > 0) then | |
|
125 | updatepath := false; | |
|
126 | break; | |
|
127 | end else begin | |
|
128 | // If app dir exists and = what we originally set, then delete at uninstall | |
|
129 | if aExecArr[i] = 'SET PATH=%PATH%;' + pathdir[d] then | |
|
130 | aExecArr[i] := ''; | |
|
131 | end; | |
|
132 | end; | |
|
133 | end; | |
|
134 | ||
|
135 | // If app dir not found, or autoexec.bat didn't exist, then (create and) append to current path | |
|
136 | if (IsUninstaller() = false) AND (updatepath = true) then begin | |
|
137 | SaveStringToFile(aExecFile, #13#10 + 'SET PATH=%PATH%;' + pathdir[d], True); | |
|
138 | ||
|
139 | // If uninstalling, write the full autoexec out | |
|
140 | end else begin | |
|
141 | SaveStringsToFile(aExecFile, aExecArr, False); | |
|
142 | end; | |
|
143 | end; | |
|
144 | end; | |
|
145 | end; | |
|
146 | ||
|
147 | // Split a string into an array using passed delimiter | |
|
148 | procedure MPExplode(var Dest: TArrayOfString; Text: String; Separator: String); | |
|
149 | var | |
|
150 | i: Integer; | |
|
151 | begin | |
|
152 | i := 0; | |
|
153 | repeat | |
|
154 | SetArrayLength(Dest, i+1); | |
|
155 | if Pos(Separator,Text) > 0 then begin | |
|
156 | Dest[i] := Copy(Text, 1, Pos(Separator, Text)-1); | |
|
157 | Text := Copy(Text, Pos(Separator,Text) + Length(Separator), Length(Text)); | |
|
158 | i := i + 1; | |
|
159 | end else begin | |
|
160 | Dest[i] := Text; | |
|
161 | Text := ''; | |
|
162 | end; | |
|
163 | until Length(Text)=0; | |
|
164 | end; | |
|
165 | ||
|
166 | ||
|
167 | procedure CurStepChanged(CurStep: TSetupStep); | |
|
168 | var | |
|
169 | taskname: String; | |
|
170 | begin | |
|
171 | taskname := ModPathName; | |
|
172 | if CurStep = ssPostInstall then | |
|
173 | if IsTaskSelected(taskname) then | |
|
174 | ModPath(); | |
|
175 | end; | |
|
176 | ||
|
177 | procedure CurUninstallStepChanged(CurUninstallStep: TUninstallStep); | |
|
178 | var | |
|
179 | aSelectedTasks: TArrayOfString; | |
|
180 | i: Integer; | |
|
181 | taskname: String; | |
|
182 | regpath: String; | |
|
183 | regstring: String; | |
|
184 | appid: String; | |
|
185 | begin | |
|
186 | // only run during actual uninstall | |
|
187 | if CurUninstallStep = usUninstall then begin | |
|
188 | // get list of selected tasks saved in registry at install time | |
|
189 | appid := '{#emit SetupSetting("AppId")}'; | |
|
190 | if appid = '' then appid := '{#emit SetupSetting("AppName")}'; | |
|
191 | regpath := ExpandConstant('Software\Microsoft\Windows\CurrentVersion\Uninstall\'+appid+'_is1'); | |
|
192 | RegQueryStringValue(HKLM, regpath, 'Inno Setup: Selected Tasks', regstring); | |
|
193 | if regstring = '' then RegQueryStringValue(HKCU, regpath, 'Inno Setup: Selected Tasks', regstring); | |
|
194 | ||
|
195 | // check each task; if matches modpath taskname, trigger patch removal | |
|
196 | if regstring <> '' then begin | |
|
197 | taskname := ModPathName; | |
|
198 | MPExplode(aSelectedTasks, regstring, ','); | |
|
199 | if GetArrayLength(aSelectedTasks) > 0 then begin | |
|
200 | for i := 0 to GetArrayLength(aSelectedTasks)-1 do begin | |
|
201 | if comparetext(aSelectedTasks[i], taskname) = 0 then | |
|
202 | ModPath(); | |
|
203 | end; | |
|
204 | end; | |
|
205 | end; | |
|
206 | end; | |
|
207 | end; | |
|
208 | ||
|
209 | function NeedRestart(): Boolean; | |
|
210 | var | |
|
211 | taskname: String; | |
|
212 | begin | |
|
213 | taskname := ModPathName; | |
|
214 | if IsTaskSelected(taskname) and not UsingWinNT() then begin | |
|
215 | Result := True; | |
|
216 | end else begin | |
|
217 | Result := False; | |
|
218 | end; | |
|
219 | end; |
@@ -0,0 +1,38 b'' | |||
|
1 | # | |
|
2 | # This file is autogenerated by pip-compile | |
|
3 | # To update, run: | |
|
4 | # | |
|
5 | # pip-compile --generate-hashes contrib/packaging/inno/requirements.txt.in -o contrib/packaging/inno/requirements.txt | |
|
6 | # | |
|
7 | certifi==2018.11.29 \ | |
|
8 | --hash=sha256:47f9c83ef4c0c621eaef743f133f09fa8a74a9b75f037e8624f83bd1b6626cb7 \ | |
|
9 | --hash=sha256:993f830721089fef441cdfeb4b2c8c9df86f0c63239f06bd025a76a7daddb033 \ | |
|
10 | # via dulwich | |
|
11 | configparser==3.7.3 \ | |
|
12 | --hash=sha256:27594cf4fc279f321974061ac69164aaebd2749af962ac8686b20503ac0bcf2d \ | |
|
13 | --hash=sha256:9d51fe0a382f05b6b117c5e601fc219fede4a8c71703324af3f7d883aef476a3 \ | |
|
14 | # via entrypoints | |
|
15 | docutils==0.14 \ | |
|
16 | --hash=sha256:02aec4bd92ab067f6ff27a38a38a41173bf01bed8f89157768c1573f53e474a6 \ | |
|
17 | --hash=sha256:51e64ef2ebfb29cae1faa133b3710143496eca21c530f3f71424d77687764274 \ | |
|
18 | --hash=sha256:7a4bd47eaf6596e1295ecb11361139febe29b084a87bf005bf899f9a42edc3c6 | |
|
19 | dulwich==0.19.11 \ | |
|
20 | --hash=sha256:afbe070f6899357e33f63f3f3696e601731fef66c64a489dea1bc9f539f4a725 | |
|
21 | entrypoints==0.3 \ | |
|
22 | --hash=sha256:589f874b313739ad35be6e0cd7efde2a4e9b6fea91edcc34e58ecbb8dbe56d19 \ | |
|
23 | --hash=sha256:c70dd71abe5a8c85e55e12c19bd91ccfeec11a6e99044204511f9ed547d48451 \ | |
|
24 | # via keyring | |
|
25 | keyring==18.0.0 \ | |
|
26 | --hash=sha256:12833d2b05d2055e0e25931184af9cd6a738f320a2264853cabbd8a3a0f0b65d \ | |
|
27 | --hash=sha256:ca33f5ccc542b9ffaa196ee9a33488069e5e7eac77d5b81969f8a3ce74d0230c | |
|
28 | pygments==2.3.1 \ | |
|
29 | --hash=sha256:5ffada19f6203563680669ee7f53b64dabbeb100eb51b61996085e99c03b284a \ | |
|
30 | --hash=sha256:e8218dd399a61674745138520d0d4cf2621d7e032439341bc3f647bff125818d | |
|
31 | pywin32-ctypes==0.2.0 \ | |
|
32 | --hash=sha256:24ffc3b341d457d48e8922352130cf2644024a4ff09762a2261fd34c36ee5942 \ | |
|
33 | --hash=sha256:9dc2d991b3479cc2df15930958b674a48a227d5361d413827a4cfd0b5876fc98 \ | |
|
34 | # via keyring | |
|
35 | urllib3==1.24.1 \ | |
|
36 | --hash=sha256:61bf29cada3fc2fbefad4fdf059ea4bd1b4a86d2b6d15e1c7c0b582b9752fe39 \ | |
|
37 | --hash=sha256:de9529817c93f27c8ccbfead6985011db27bd0ddfcdb2d86f3f663385c6a9c22 \ | |
|
38 | # via dulwich |
@@ -0,0 +1,84 b'' | |||
|
1 | #!/usr/bin/env python3 | |
|
2 | # Copyright 2019 Gregory Szorc <gregory.szorc@gmail.com> | |
|
3 | # | |
|
4 | # This software may be used and distributed according to the terms of the | |
|
5 | # GNU General Public License version 2 or any later version. | |
|
6 | ||
|
7 | # no-check-code because Python 3 native. | |
|
8 | ||
|
9 | """Code to build Mercurial WiX installer.""" | |
|
10 | ||
|
11 | import argparse | |
|
12 | import os | |
|
13 | import pathlib | |
|
14 | import sys | |
|
15 | ||
|
16 | ||
|
17 | if __name__ == '__main__': | |
|
18 | parser = argparse.ArgumentParser() | |
|
19 | ||
|
20 | parser.add_argument('--name', | |
|
21 | help='Application name', | |
|
22 | default='Mercurial') | |
|
23 | parser.add_argument('--python', | |
|
24 | help='Path to Python executable to use', | |
|
25 | required=True) | |
|
26 | parser.add_argument('--sign-sn', | |
|
27 | help='Subject name (or fragment thereof) of certificate ' | |
|
28 | 'to use for signing') | |
|
29 | parser.add_argument('--sign-cert', | |
|
30 | help='Path to certificate to use for signing') | |
|
31 | parser.add_argument('--sign-password', | |
|
32 | help='Password for signing certificate') | |
|
33 | parser.add_argument('--sign-timestamp-url', | |
|
34 | help='URL of timestamp server to use for signing') | |
|
35 | parser.add_argument('--version', | |
|
36 | help='Version string to use') | |
|
37 | parser.add_argument('--extra-packages-script', | |
|
38 | help=('Script to execute to include extra packages in ' | |
|
39 | 'py2exe binary.')) | |
|
40 | parser.add_argument('--extra-wxs', | |
|
41 | help='CSV of path_to_wxs_file=working_dir_for_wxs_file') | |
|
42 | parser.add_argument('--extra-features', | |
|
43 | help=('CSV of extra feature names to include ' | |
|
44 | 'in the installer from the extra wxs files')) | |
|
45 | ||
|
46 | args = parser.parse_args() | |
|
47 | ||
|
48 | here = pathlib.Path(os.path.abspath(os.path.dirname(__file__))) | |
|
49 | source_dir = here.parent.parent.parent | |
|
50 | ||
|
51 | sys.path.insert(0, str(source_dir / 'contrib' / 'packaging')) | |
|
52 | ||
|
53 | from hgpackaging.wix import ( | |
|
54 | build_installer, | |
|
55 | build_signed_installer, | |
|
56 | ) | |
|
57 | ||
|
58 | fn = build_installer | |
|
59 | kwargs = { | |
|
60 | 'source_dir': source_dir, | |
|
61 | 'python_exe': pathlib.Path(args.python), | |
|
62 | 'version': args.version, | |
|
63 | } | |
|
64 | ||
|
65 | if not os.path.isabs(args.python): | |
|
66 | raise Exception('--python arg must be an absolute path') | |
|
67 | ||
|
68 | if args.extra_packages_script: | |
|
69 | kwargs['extra_packages_script'] = args.extra_packages_script | |
|
70 | if args.extra_wxs: | |
|
71 | kwargs['extra_wxs'] = dict( | |
|
72 | thing.split("=") for thing in args.extra_wxs.split(',')) | |
|
73 | if args.extra_features: | |
|
74 | kwargs['extra_features'] = args.extra_features.split(',') | |
|
75 | ||
|
76 | if args.sign_sn or args.sign_cert: | |
|
77 | fn = build_signed_installer | |
|
78 | kwargs['name'] = args.name | |
|
79 | kwargs['subject_name'] = args.sign_sn | |
|
80 | kwargs['cert_path'] = args.sign_cert | |
|
81 | kwargs['cert_password'] = args.sign_password | |
|
82 | kwargs['timestamp_url'] = args.sign_timestamp_url | |
|
83 | ||
|
84 | fn(**kwargs) |
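The CSV handling above is easiest to see with concrete, hypothetical values: ``--extra-wxs evolve.wxs=evolve,topic.wxs=topic`` is parsed into a dict mapping each wxs file to its working directory::

    # hypothetical argument value, shown only to illustrate the parsing above
    extra_wxs_arg = 'evolve.wxs=evolve,topic.wxs=topic'
    extra_wxs = dict(thing.split('=') for thing in extra_wxs_arg.split(','))
    assert extra_wxs == {'evolve.wxs': 'evolve', 'topic.wxs': 'topic'}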
@@ -0,0 +1,13 b'' | |||
|
1 | # | |
|
2 | # This file is autogenerated by pip-compile | |
|
3 | # To update, run: | |
|
4 | # | |
|
5 | # pip-compile --generate-hashes contrib/packaging/wix/requirements.txt.in -o contrib/packaging/wix/requirements.txt | |
|
6 | # | |
|
7 | docutils==0.14 \ | |
|
8 | --hash=sha256:02aec4bd92ab067f6ff27a38a38a41173bf01bed8f89157768c1573f53e474a6 \ | |
|
9 | --hash=sha256:51e64ef2ebfb29cae1faa133b3710143496eca21c530f3f71424d77687764274 \ | |
|
10 | --hash=sha256:7a4bd47eaf6596e1295ecb11361139febe29b084a87bf005bf899f9a42edc3c6 | |
|
11 | pygments==2.3.1 \ | |
|
12 | --hash=sha256:5ffada19f6203563680669ee7f53b64dabbeb100eb51b61996085e99c03b284a \ | |
|
13 | --hash=sha256:e8218dd399a61674745138520d0d4cf2621d7e032439341bc3f647bff125818d |
|
@@ -5,7 +5,7 b'' | |||
|
5 | 5 | # % make PREFIX=/opt/ install |
|
6 | 6 | |
|
7 | 7 | export PREFIX=/usr/local |
|
8 | PYTHON=python | |
|
8 | PYTHON?=python | |
|
9 | 9 | $(eval HGROOT := $(shell pwd)) |
|
10 | 10 | HGPYTHONS ?= $(HGROOT)/build/pythons |
|
11 | 11 | PURE= |
@@ -47,3 +47,6 b' parents(20000)' | |||
|
47 | 47 | # The one below is used by rebase |
|
48 | 48 | (children(ancestor(tip~5, tip)) and ::(tip~5)):: |
|
49 | 49 | heads(commonancestors(last(head(), 2))) |
|
50 | heads(-10000:-1) | |
|
51 | roots(-10000:-1) | |
|
52 | only(max(head()), min(head())) |
@@ -25,7 +25,7 b' def reducetest(a, b):' | |||
|
25 | 25 | |
|
26 | 26 | try: |
|
27 | 27 | test1(a, b) |
|
28 | except Exception | |
|
|
28 | except Exception: | |
|
29 | 29 | reductions += 1 |
|
30 | 30 | tries = 0 |
|
31 | 31 | a = a2 |
@@ -40,6 +40,8 b' try:' | |||
|
40 | 40 | except ImportError: |
|
41 | 41 | re2 = None |
|
42 | 42 | |
|
43 | import testparseutil | |
|
44 | ||
|
43 | 45 | def compilere(pat, multiline=False): |
|
44 | 46 | if multiline: |
|
45 | 47 | pat = '(?m)' + pat |
@@ -231,8 +233,10 b' utestfilters = [' | |||
|
231 | 233 | (r"( +)(#([^!][^\n]*\S)?)", repcomment), |
|
232 | 234 | ] |
|
233 | 235 | |
|
234 | pypats = [ | |
|
236 | # common patterns to check *.py | |
|
237 | commonpypats = [ | |
|
235 | 238 | [ |
|
239 | (r'\\$', 'Use () to wrap long lines in Python, not \\'), | |
|
236 | 240 | (r'^\s*def\s*\w+\s*\(.*,\s*\(', |
|
237 | 241 | "tuple parameter unpacking not available in Python 3+"), |
|
238 | 242 | (r'lambda\s*\(.*,.*\)', |
@@ -261,7 +265,6 b' pypats = [' | |||
|
261 | 265 | # a pass at the same indent level, which is bogus |
|
262 | 266 | r'(?P=indent)pass[ \t\n#]' |
|
263 | 267 | ), 'omit superfluous pass'), |
|
264 | (r'.{81}', "line too long"), | |
|
265 | 268 | (r'[^\n]\Z', "no trailing newline"), |
|
266 | 269 | (r'(\S[ \t]+|^[ \t]+)\n', "trailing whitespace"), |
|
267 | 270 | # (r'^\s+[^_ \n][^_. \n]+_[^_\n]+\s*=', |
@@ -299,7 +302,6 b' pypats = [' | |||
|
299 | 302 | "wrong whitespace around ="), |
|
300 | 303 | (r'\([^()]*( =[^=]|[^<>!=]= )', |
|
301 | 304 | "no whitespace around = for named parameters"), |
|
302 | (r'raise Exception', "don't raise generic exceptions"), | |
|
303 | 305 | (r'raise [^,(]+, (\([^\)]+\)|[^,\(\)]+)$', |
|
304 | 306 | "don't use old-style two-argument raise, use Exception(message)"), |
|
305 | 307 | (r' is\s+(not\s+)?["\'0-9-]', "object comparison with literal"), |
@@ -315,21 +317,12 b' pypats = [' | |||
|
315 | 317 | "use opener.read() instead"), |
|
316 | 318 | (r'opener\([^)]*\).write\(', |
|
317 | 319 | "use opener.write() instead"), |
|
318 | (r'[\s\(](open|file)\([^)]*\)\.read\(', | |
|
319 | "use util.readfile() instead"), | |
|
320 | (r'[\s\(](open|file)\([^)]*\)\.write\(', | |
|
321 | "use util.writefile() instead"), | |
|
322 | (r'^[\s\(]*(open(er)?|file)\([^)]*\)(?!\.close\(\))', | |
|
323 | "always assign an opened file to a variable, and close it afterwards"), | |
|
324 | (r'[\s\(](open|file)\([^)]*\)\.(?!close\(\))', | |
|
325 | "always assign an opened file to a variable, and close it afterwards"), | |
|
326 | 320 | (r'(?i)descend[e]nt', "the proper spelling is descendAnt"), |
|
327 | 321 | (r'\.debug\(\_', "don't mark debug messages for translation"), |
|
328 | 322 | (r'\.strip\(\)\.split\(\)', "no need to strip before splitting"), |
|
329 | 323 | (r'^\s*except\s*:', "naked except clause", r'#.*re-raises'), |
|
330 | 324 | (r'^\s*except\s([^\(,]+|\([^\)]+\))\s*,', |
|
331 | 325 | 'legacy exception syntax; use "as" instead of ","'), |
|
332 | (r':\n( )*( ){1,3}[^ ]', "must indent 4 spaces"), | |
|
333 | 326 | (r'release\(.*wlock, .*lock\)', "wrong lock release order"), |
|
334 | 327 | (r'\bdef\s+__bool__\b', "__bool__ should be __nonzero__ in Python 2"), |
|
335 | 328 | (r'os\.path\.join\(.*, *(""|\'\')\)', |
@@ -339,7 +332,6 b' pypats = [' | |||
|
339 | 332 | (r'def.*[( ]\w+=\{\}', "don't use mutable default arguments"), |
|
340 | 333 | (r'\butil\.Abort\b', "directly use error.Abort"), |
|
341 | 334 | (r'^@(\w*\.)?cachefunc', "module-level @cachefunc is risky, please avoid"), |
|
342 | (r'^import atexit', "don't use atexit, use ui.atexit"), | |
|
343 | 335 | (r'^import Queue', "don't use Queue, use pycompat.queue.Queue + " |
|
344 | 336 | "pycompat.queue.Empty"), |
|
345 | 337 | (r'^import cStringIO', "don't use cStringIO.StringIO, use util.stringio"), |
@@ -358,6 +350,34 b' pypats = [' | |||
|
358 | 350 | "don't convert rev to node before passing to revision(nodeorrev)"), |
|
359 | 351 | (r'platform\.system\(\)', "don't use platform.system(), use pycompat"), |
|
360 | 352 | |
|
353 | ], | |
|
354 | # warnings | |
|
355 | [ | |
|
356 | ] | |
|
357 | ] | |
|
358 | ||
|
359 | # patterns to check normal *.py files | |
|
360 | pypats = [ | |
|
361 | [ | |
|
362 | # Ideally, these would live in "commonpypats" so that coding | |

363 | # rules stay consistent across the Mercurial source tree. | |

364 | # On the other hand, they are not strictly required for Python | |

365 | # code fragments embedded in test scripts, and fixing the test | |

366 | # scripts to satisfy these patterns would take many changes for | |

367 | # little benefit. | |
|
368 | (r'.{81}', "line too long"), | |
|
369 | (r'raise Exception', "don't raise generic exceptions"), | |
|
370 | (r'[\s\(](open|file)\([^)]*\)\.read\(', | |
|
371 | "use util.readfile() instead"), | |
|
372 | (r'[\s\(](open|file)\([^)]*\)\.write\(', | |
|
373 | "use util.writefile() instead"), | |
|
374 | (r'^[\s\(]*(open(er)?|file)\([^)]*\)(?!\.close\(\))', | |
|
375 | "always assign an opened file to a variable, and close it afterwards"), | |
|
376 | (r'[\s\(](open|file)\([^)]*\)\.(?!close\(\))', | |
|
377 | "always assign an opened file to a variable, and close it afterwards"), | |
|
378 | (r':\n( )*( ){1,3}[^ ]', "must indent 4 spaces"), | |
|
379 | (r'^import atexit', "don't use atexit, use ui.atexit"), | |
|
380 | ||
|
361 | 381 | # rules depending on implementation of repquote() |
|
362 | 382 | (r' x+[xpqo%APM][\'"]\n\s+[\'"]x', |
|
363 | 383 | 'string join across lines with no space'), |
@@ -376,21 +396,35 b' pypats = [' | |||
|
376 | 396 | # because _preparepats forcibly adds "\n" into [^...], |
|
377 | 397 | # even though this regexp wants match it against "\n")''', |
|
378 | 398 | "missing _() in ui message (use () to hide false-positives)"), |
|
379 | ], | |
|
399 | ] + commonpypats[0], | |
|
380 | 400 | # warnings |
|
381 | 401 | [ |
|
382 | 402 | # rules depending on implementation of repquote() |
|
383 | 403 | (r'(^| )pp +xxxxqq[ \n][^\n]', "add two newlines after '.. note::'"), |
|
384 | ] | |
|
404 | ] + commonpypats[1] | |
|
385 | 405 | ] |
|
386 | 406 | |
|
387 | pyfilters = [ | |
|
407 | # patterns to check *.py for embedded ones in test script | |
|
408 | embeddedpypats = [ | |
|
409 | [ | |
|
410 | ] + commonpypats[0], | |
|
411 | # warnings | |
|
412 | [ | |
|
413 | ] + commonpypats[1] | |
|
414 | ] | |
|
415 | ||
|
416 | # common filters to convert *.py | |
|
417 | commonpyfilters = [ | |
|
388 | 418 | (r"""(?msx)(?P<comment>\#.*?$)| |
|
389 | 419 | ((?P<quote>('''|\"\"\"|(?<!')'(?!')|(?<!")"(?!"))) |
|
390 | 420 | (?P<text>(([^\\]|\\.)*?)) |
|
391 | 421 | (?P=quote))""", reppython), |
|
392 | 422 | ] |
|
393 | 423 | |
|
424 | # filters to convert normal *.py files | |
|
425 | pyfilters = [ | |
|
426 | ] + commonpyfilters | |
|
427 | ||
|
394 | 428 | # non-filter patterns |
|
395 | 429 | pynfpats = [ |
|
396 | 430 | [ |
@@ -403,6 +437,10 b' pynfpats = [' | |||
|
403 | 437 | [], |
|
404 | 438 | ] |
|
405 | 439 | |
|
440 | # filters to convert *.py for embedded ones in test script | |
|
441 | embeddedpyfilters = [ | |
|
442 | ] + commonpyfilters | |
|
443 | ||
|
406 | 444 | # extension non-filter patterns |
|
407 | 445 | pyextnfpats = [ |
|
408 | 446 | [(r'^"""\n?[A-Z]', "don't capitalize docstring title")], |
@@ -414,7 +452,7 b' txtfilters = []' | |||
|
414 | 452 | |
|
415 | 453 | txtpats = [ |
|
416 | 454 | [ |
|
417 | ('\s$', 'trailing whitespace'), | |
|
455 | (r'\s$', 'trailing whitespace'), | |
|
418 | 456 | ('.. note::[ \n][^\n]', 'add two newlines after note::') |
|
419 | 457 | ], |
|
420 | 458 | [] |
@@ -537,9 +575,17 b' checks = [' | |||
|
537 | 575 | allfilesfilters, allfilespats), |
|
538 | 576 | ] |
|
539 | 577 | |
|
578 | # (desc, | |
|
579 | # func to pick up embedded code fragments, | |
|
580 | # list of patterns to convert target files | |
|
581 | # list of patterns to detect errors/warnings) | |
|
582 | embeddedchecks = [ | |
|
583 | ('embedded python', | |
|
584 | testparseutil.pyembedded, embeddedpyfilters, embeddedpypats) | |
|
585 | ] | |
|
586 | ||
|
540 | 587 | def _preparepats(): |
|
541 | for c in checks: | |
|
542 | failandwarn = c[-1] | |
|
588 | def preparefailandwarn(failandwarn): | |
|
543 | 589 | for pats in failandwarn: |
|
544 | 590 | for i, pseq in enumerate(pats): |
|
545 | 591 | # fix-up regexes for multi-line searches |
@@ -553,10 +599,19 b' def _preparepats():' | |||
|
553 | 599 | p = re.sub(r'(?<!\\)\[\^', r'[^\\n', p) |
|
554 | 600 | |
|
555 | 601 | pats[i] = (re.compile(p, re.MULTILINE),) + pseq[1:] |
|
556 | filters = c[3] | |
|
602 | ||
|
603 | def preparefilters(filters): | |
|
557 | 604 | for i, flt in enumerate(filters): |
|
558 | 605 | filters[i] = re.compile(flt[0]), flt[1] |
|
559 | 606 | |
|
607 | for cs in (checks, embeddedchecks): | |
|
608 | for c in cs: | |
|
609 | failandwarn = c[-1] | |
|
610 | preparefailandwarn(failandwarn) | |
|
611 | ||
|
612 | filters = c[-2] | |
|
613 | preparefilters(filters) | |
|
614 | ||
|
560 | 615 | class norepeatlogger(object): |
|
561 | 616 | def __init__(self): |
|
562 | 617 | self._lastseen = None |
@@ -604,13 +659,12 b' def checkfile(f, logfunc=_defaultlogger.' | |||
|
604 | 659 | |
|
605 | 660 | return True if no error is found, False otherwise. |
|
606 | 661 | """ |
|
607 | blamecache = None | |
|
608 | 662 | result = True |
|
609 | 663 | |
|
610 | 664 | try: |
|
611 | 665 | with opentext(f) as fp: |
|
612 | 666 | try: |
|
613 | pre = post = fp.read() | |
|
|
667 | pre = fp.read() | |
|
614 | 668 | except UnicodeDecodeError as e: |
|
615 | 669 | print("%s while reading %s" % (e, f)) |
|
616 | 670 | return result |
@@ -618,11 +672,12 b' def checkfile(f, logfunc=_defaultlogger.' | |||
|
618 | 672 | print("Skipping %s, %s" % (f, str(e).split(':', 1)[0])) |
|
619 | 673 | return result |
|
620 | 674 | |
|
675 | # context information shared within a single checkfile() invocation | |
|
676 | context = {'blamecache': None} | |
|
677 | ||
|
621 | 678 | for name, match, magic, filters, pats in checks: |
|
622 | post = pre # discard filtering result of previous check | |
|
623 | 679 | if debug: |
|
624 | 680 | print(name, f) |
|
625 | fc = 0 | |
|
626 | 681 | if not (re.match(match, f) or (magic and re.search(magic, pre))): |
|
627 | 682 | if debug: |
|
628 | 683 | print("Skipping %s for %s it doesn't match %s" % ( |
@@ -637,6 +692,74 b' def checkfile(f, logfunc=_defaultlogger.' | |||
|
637 | 692 | # tests/test-check-code.t |
|
638 | 693 | print("Skipping %s it has no-che?k-code (glob)" % f) |
|
639 | 694 | return "Skip" # skip checking this file |
|
695 | ||
|
696 | fc = _checkfiledata(name, f, pre, filters, pats, context, | |
|
697 | logfunc, maxerr, warnings, blame, debug, lineno) | |
|
698 | if fc: | |
|
699 | result = False | |
|
700 | ||
|
701 | if f.endswith('.t') and "no-" "check-code" not in pre: | |
|
702 | if debug: | |
|
703 | print("Checking embedded code in %s" % (f)) | |
|
704 | ||
|
705 | prelines = pre.splitlines() | |
|
706 | embeddederrors = [] | |
|
707 | for name, embedded, filters, pats in embeddedchecks: | |
|
708 | # "reset curmax at each repetition" treats maxerr as "max | |
|
709 | # number of errors in an actual file per entry of | |
|
710 | # (embedded)checks" | |
|
711 | curmaxerr = maxerr | |
|
712 | ||
|
713 | for found in embedded(f, prelines, embeddederrors): | |
|
714 | filename, starts, ends, code = found | |
|
715 | fc = _checkfiledata(name, f, code, filters, pats, context, | |
|
716 | logfunc, curmaxerr, warnings, blame, debug, | |
|
717 | lineno, offset=starts - 1) | |
|
718 | if fc: | |
|
719 | result = False | |
|
720 | if curmaxerr: | |
|
721 | if fc >= curmaxerr: | |
|
722 | break | |
|
723 | curmaxerr -= fc | |
|
724 | ||
|
725 | return result | |
|
726 | ||
|
727 | def _checkfiledata(name, f, filedata, filters, pats, context, | |
|
728 | logfunc, maxerr, warnings, blame, debug, lineno, | |
|
729 | offset=None): | |
|
730 | """Execute actual error check for file data | |
|
731 | ||
|
732 | :name: of the checking category | |
|
733 | :f: filepath | |
|
734 | :filedata: content of a file | |
|
735 | :filters: to be applied before checking | |
|
736 | :pats: to detect errors | |
|
737 | :context: a dict of information shared within a single checkfile() invocation | |
|
738 | Valid keys: 'blamecache'. | |
|
739 | :logfunc: function used to report error | |
|
740 | logfunc(filename, linenumber, linecontent, errormessage) | |
|
741 | :maxerr: number of errors to display before aborting, or False to | |
|
742 | report all errors | |
|
743 | :warnings: whether warning level checks should be applied | |
|
744 | :blame: whether blame information should be displayed at error reporting | |
|
745 | :debug: whether debug information should be displayed | |
|
746 | :lineno: whether lineno should be displayed at error reporting | |
|
747 | :offset: line number offset of 'filedata' in 'f' for checking | |
|
748 | an embedded code fragment, or None (offset=0 is different | |
|
749 | from offset=None) | |
|
750 | ||
|
751 | returns number of detected errors. | |
|
752 | """ | |
|
753 | blamecache = context['blamecache'] | |
|
754 | if offset is None: | |
|
755 | lineoffset = 0 | |
|
756 | else: | |
|
757 | lineoffset = offset | |
|
758 | ||
|
759 | fc = 0 | |
|
760 | pre = post = filedata | |
|
761 | ||
|
762 | if True: # TODO: get rid of this redundant 'if' block | |
|
640 | 763 | for p, r in filters: |
|
641 | 764 | post = re.sub(p, r, post) |
|
642 | 765 | nerrs = len(pats[0]) # nerr elements are errors |
@@ -679,20 +802,30 b' def checkfile(f, logfunc=_defaultlogger.' | |||
|
679 | 802 | if ignore and re.search(ignore, l, re.MULTILINE): |
|
680 | 803 | if debug: |
|
681 | 804 | print("Skipping %s for %s:%s (ignore pattern)" % ( |
|
682 | name, f, n)) | |
|
805 | name, f, (n + lineoffset))) | |
|
683 | 806 | continue |
|
684 | 807 | bd = "" |
|
685 | 808 | if blame: |
|
686 | 809 | bd = 'working directory' |
|
687 | if blamecache is None: | |
|
|
810 | if blamecache is None: | |
|
688 | 811 | blamecache = getblame(f) |
|
689 | if n < len(blamecache): | |

690 | bl, bu, br = blamecache[n] | |

691 | if bl == l: | |
|
|
|
812 | context['blamecache'] = blamecache | |
|
813 | if (n + lineoffset) < len(blamecache): | |
|
814 | bl, bu, br = blamecache[(n + lineoffset)] | |
|
815 | if offset is None and bl == l: | |
|
692 | 816 | bd = '%s@%s' % (bu, br) |
|
817 | elif offset is not None and bl.endswith(l): | |
|
818 | # "offset is not None" means "checking | |
|
819 | # embedded code fragment". In this case, | |
|
820 | # "l" does not have information about the | |
|
821 | # beginning of an *original* line in the | |
|
822 | # file (e.g. ' > '). | |
|
823 | # Therefore, use "str.endswith()", and | |
|
824 | # show "maybe" for a little loose | |
|
825 | # examination. | |
|
826 | bd = '%s@%s, maybe' % (bu, br) | |
|
693 | 827 | |
|
694 | errors.append((f, lineno and n + 1, l, msg, bd)) | |
|
695 | result = False | |
|
828 | errors.append((f, lineno and (n + lineoffset + 1), l, msg, bd)) | |
|
696 | 829 | |
|
697 | 830 | errors.sort() |
|
698 | 831 | for e in errors: |
@@ -702,7 +835,7 b' def checkfile(f, logfunc=_defaultlogger.' | |||
|
702 | 835 | print(" (too many errors, giving up)") |
|
703 | 836 | break |
|
704 | 837 | |
|
705 | return result | |
|
|
838 | return fc | |
|
706 | 839 | |
|
707 | 840 | def main(): |
|
708 | 841 | parser = optparse.OptionParser("%prog [options] [files | -]") |
@@ -47,7 +47,7 b' errors = [' | |||
|
47 | 47 | "adds a function with foo_bar naming"), |
|
48 | 48 | ] |
|
49 | 49 | |
|
50 | word = re.compile('\S') | |
|
50 | word = re.compile(r'\S') | |
|
51 | 51 | def nonempty(first, second): |
|
52 | 52 | if word.search(first): |
|
53 | 53 | return first |
@@ -25,7 +25,7 b" configre = re.compile(br'''" | |||
|
25 | 25 | (?:default=)?(?P<default>\S+?))? |
|
26 | 26 | \)''', re.VERBOSE | re.MULTILINE) |
|
27 | 27 | |
|
28 | configwithre = re.compile(b''' | |
|
28 | configwithre = re.compile(br''' | |
|
29 | 29 | ui\.config(?P<ctype>with)\( |
|
30 | 30 | # First argument is callback function. This doesn't parse robustly |
|
31 | 31 | # if it is e.g. a function call. |
@@ -61,10 +61,10 b' def main(args):' | |||
|
61 | 61 | linenum += 1 |
|
62 | 62 | |
|
63 | 63 | # check topic-like bits |
|
64 | m = re.match(b'\s*``(\S+)``', l) | |
|
64 | m = re.match(br'\s*``(\S+)``', l) | |
|
65 | 65 | if m: |
|
66 | 66 | prevname = m.group(1) |
|
67 | if re.match(b'^\s*-+$', l): | |
|
67 | if re.match(br'^\s*-+$', l): | |
|
68 | 68 | sect = prevname |
|
69 | 69 | prevname = b'' |
|
70 | 70 |
@@ -14,6 +14,7 b' import importlib' | |||
|
14 | 14 | import os |
|
15 | 15 | import sys |
|
16 | 16 | import traceback |
|
17 | import warnings | |
|
17 | 18 | |
|
18 | 19 | def check_compat_py2(f): |
|
19 | 20 | """Check Python 3 compatibility for a file with Python 2""" |
@@ -45,7 +46,7 b' def check_compat_py3(f):' | |||
|
45 | 46 | content = fh.read() |
|
46 | 47 | |
|
47 | 48 | try: |
|
48 | ast.parse(content) | |
|
49 | ast.parse(content, filename=f) | |
|
49 | 50 | except SyntaxError as e: |
|
50 | 51 | print('%s: invalid syntax: %s' % (f, e)) |
|
51 | 52 | return |
@@ -91,6 +92,11 b" if __name__ == '__main__':" | |||
|
91 | 92 | fn = check_compat_py3 |
|
92 | 93 | |
|
93 | 94 | for f in sys.argv[1:]: |
|
94 | fn(f) | |
|
95 | with warnings.catch_warnings(record=True) as warns: | |
|
96 | fn(f) | |
|
97 | ||
|
98 | for w in warns: | |
|
99 | print(warnings.formatwarning(w.message, w.category, | |
|
100 | w.filename, w.lineno).rstrip()) | |
|
95 | 101 | |
|
96 | 102 | sys.exit(0) |
@@ -84,8 +84,9 b' static void initcontext(context_t *ctx)' | |||
|
84 | 84 | |
|
85 | 85 | static void enlargecontext(context_t *ctx, size_t newsize) |
|
86 | 86 | { |
|
87 | if (newsize <= ctx->maxdatasize) | |
|
87 | if (newsize <= ctx->maxdatasize) { | |
|
88 | 88 | return; |
|
89 | } | |
|
89 | 90 | |
|
90 | 91 | newsize = defaultdatasize * |
|
91 | 92 | ((newsize + defaultdatasize - 1) / defaultdatasize); |
@@ -117,22 +118,25 b' static void readchannel(hgclient_t *hgc)' | |||
|
117 | 118 | |
|
118 | 119 | uint32_t datasize_n; |
|
119 | 120 | rsize = recv(hgc->sockfd, &datasize_n, sizeof(datasize_n), 0); |
|
120 | if (rsize != sizeof(datasize_n)) | |
|
121 | if (rsize != sizeof(datasize_n)) { | |
|
121 | 122 | abortmsg("failed to read data size"); |
|
123 | } | |
|
122 | 124 | |
|
123 | 125 | /* datasize denotes the maximum size to write if input request */ |
|
124 | 126 | hgc->ctx.datasize = ntohl(datasize_n); |
|
125 | 127 | enlargecontext(&hgc->ctx, hgc->ctx.datasize); |
|
126 | 128 | |
|
127 | if (isupper(hgc->ctx.ch) && hgc->ctx.ch != 'S') | |
|
129 | if (isupper(hgc->ctx.ch) && hgc->ctx.ch != 'S') { | |
|
128 | 130 | return; /* assumes input request */ |
|
131 | } | |
|
129 | 132 | |
|
130 | 133 | size_t cursize = 0; |
|
131 | 134 | while (cursize < hgc->ctx.datasize) { |
|
132 | 135 | rsize = recv(hgc->sockfd, hgc->ctx.data + cursize, |
|
133 | 136 | hgc->ctx.datasize - cursize, 0); |
|
134 | if (rsize < 1) | |
|
137 | if (rsize < 1) { | |
|
135 | 138 | abortmsg("failed to read data block"); |
|
139 | } | |
|
136 | 140 | cursize += rsize; |
|
137 | 141 | } |
|
138 | 142 | } |
@@ -143,8 +147,9 b' static void sendall(int sockfd, const vo' | |||
|
143 | 147 | const char *const endp = p + datasize; |
|
144 | 148 | while (p < endp) { |
|
145 | 149 | ssize_t r = send(sockfd, p, endp - p, 0); |
|
146 | if (r < 0) | |
|
150 | if (r < 0) { | |
|
147 | 151 | abortmsgerrno("cannot communicate"); |
|
152 | } | |
|
148 | 153 | p += r; |
|
149 | 154 | } |
|
150 | 155 | } |
@@ -186,8 +191,9 b' static void packcmdargs(context_t *ctx, ' | |||
|
186 | 191 | ctx->datasize += n; |
|
187 | 192 | } |
|
188 | 193 | |
|
189 | if (ctx->datasize > 0) | |
|
194 | if (ctx->datasize > 0) { | |
|
190 | 195 | --ctx->datasize; /* strip last '\0' */ |
|
196 | } | |
|
191 | 197 | } |
|
192 | 198 | |
|
193 | 199 | /* Extract '\0'-separated list of args to new buffer, terminated by NULL */ |
@@ -205,8 +211,9 b' static const char **unpackcmdargsnul(con' | |||
|
205 | 211 | args[nargs] = s; |
|
206 | 212 | nargs++; |
|
207 | 213 | s = memchr(s, '\0', e - s); |
|
208 | if (!s) | |
|
214 | if (!s) { | |
|
209 | 215 | break; |
|
216 | } | |
|
210 | 217 | s++; |
|
211 | 218 | } |
|
212 | 219 | args[nargs] = NULL; |
@@ -225,8 +232,9 b' static void handlereadrequest(hgclient_t' | |||
|
225 | 232 | static void handlereadlinerequest(hgclient_t *hgc) |
|
226 | 233 | { |
|
227 | 234 | context_t *ctx = &hgc->ctx; |
|
228 | if (!fgets(ctx->data, ctx->datasize, stdin)) | |
|
235 | if (!fgets(ctx->data, ctx->datasize, stdin)) { | |
|
229 | 236 | ctx->data[0] = '\0'; |
|
237 | } | |
|
230 | 238 | ctx->datasize = strlen(ctx->data); |
|
231 | 239 | writeblock(hgc); |
|
232 | 240 | } |
@@ -239,8 +247,9 b' static void handlesystemrequest(hgclient' | |||
|
239 | 247 | ctx->data[ctx->datasize] = '\0'; /* terminate last string */ |
|
240 | 248 | |
|
241 | 249 | const char **args = unpackcmdargsnul(ctx); |
|
242 | if (!args[0] || !args[1] || !args[2]) | |
|
250 | if (!args[0] || !args[1] || !args[2]) { | |
|
243 | 251 | abortmsg("missing type or command or cwd in system request"); |
|
252 | } | |
|
244 | 253 | if (strcmp(args[0], "system") == 0) { |
|
245 | 254 | debugmsg("run '%s' at '%s'", args[1], args[2]); |
|
246 | 255 | int32_t r = runshellcmd(args[1], args + 3, args[2]); |
@@ -252,8 +261,9 b' static void handlesystemrequest(hgclient' | |||
|
252 | 261 | writeblock(hgc); |
|
253 | 262 | } else if (strcmp(args[0], "pager") == 0) { |
|
254 | 263 | setuppager(args[1], args + 3); |
|
255 | if (hgc->capflags & CAP_ATTACHIO) | |
|
264 | if (hgc->capflags & CAP_ATTACHIO) { | |
|
256 | 265 | attachio(hgc); |
|
266 | } | |
|
257 | 267 | /* unblock the server */ |
|
258 | 268 | static const char emptycmd[] = "\n"; |
|
259 | 269 | sendall(hgc->sockfd, emptycmd, sizeof(emptycmd) - 1); |
@@ -296,9 +306,10 b' static void handleresponse(hgclient_t *h' | |||
|
296 | 306 | handlesystemrequest(hgc); |
|
297 | 307 | break; |
|
298 | 308 | default: |
|
299 | if (isupper(ctx->ch)) | |
|
309 | if (isupper(ctx->ch)) { | |
|
300 | 310 | abortmsg("cannot handle response (ch = %c)", |
|
301 | 311 | ctx->ch); |
|
312 | } | |
|
302 | 313 | } |
|
303 | 314 | } |
|
304 | 315 | } |
@@ -308,8 +319,9 b' static unsigned int parsecapabilities(co' | |||
|
308 | 319 | unsigned int flags = 0; |
|
309 | 320 | while (s < e) { |
|
310 | 321 | const char *t = strchr(s, ' '); |
|
311 | if (!t || t > e) | |
|
322 | if (!t || t > e) { | |
|
312 | 323 | t = e; |
|
324 | } | |
|
313 | 325 | const cappair_t *cap; |
|
314 | 326 | for (cap = captable; cap->flag; ++cap) { |
|
315 | 327 | size_t n = t - s; |
@@ -346,11 +358,13 b' static void readhello(hgclient_t *hgc)' | |||
|
346 | 358 | const char *const dataend = ctx->data + ctx->datasize; |
|
347 | 359 | while (s < dataend) { |
|
348 | 360 | const char *t = strchr(s, ':'); |
|
349 | if (!t || t[1] != ' ') | |
|
361 | if (!t || t[1] != ' ') { | |
|
350 | 362 | break; |
|
363 | } | |
|
351 | 364 | const char *u = strchr(t + 2, '\n'); |
|
352 | if (!u) | |
|
365 | if (!u) { | |
|
353 | 366 | u = dataend; |
|
367 | } | |
|
354 | 368 | if (strncmp(s, "capabilities:", t - s + 1) == 0) { |
|
355 | 369 | hgc->capflags = parsecapabilities(t + 2, u); |
|
356 | 370 | } else if (strncmp(s, "pgid:", t - s + 1) == 0) { |
@@ -367,8 +381,9 b' static void updateprocname(hgclient_t *h' | |||
|
367 | 381 | { |
|
368 | 382 | int r = snprintf(hgc->ctx.data, hgc->ctx.maxdatasize, "chg[worker/%d]", |
|
369 | 383 | (int)getpid()); |
|
370 | if (r < 0 || (size_t)r >= hgc->ctx.maxdatasize) | |
|
384 | if (r < 0 || (size_t)r >= hgc->ctx.maxdatasize) { | |
|
371 | 385 | abortmsg("insufficient buffer to write procname (r = %d)", r); |
|
386 | } | |
|
372 | 387 | hgc->ctx.datasize = (size_t)r; |
|
373 | 388 | writeblockrequest(hgc, "setprocname"); |
|
374 | 389 | } |
@@ -380,8 +395,9 b' static void attachio(hgclient_t *hgc)' | |||
|
380 | 395 | sendall(hgc->sockfd, chcmd, sizeof(chcmd) - 1); |
|
381 | 396 | readchannel(hgc); |
|
382 | 397 | context_t *ctx = &hgc->ctx; |
|
383 | if (ctx->ch != 'I') | |
|
398 | if (ctx->ch != 'I') { | |
|
384 | 399 | abortmsg("unexpected response for attachio (ch = %c)", ctx->ch); |
|
400 | } | |
|
385 | 401 | |
|
386 | 402 | static const int fds[3] = {STDIN_FILENO, STDOUT_FILENO, STDERR_FILENO}; |
|
387 | 403 | struct msghdr msgh; |
@@ -399,23 +415,27 b' static void attachio(hgclient_t *hgc)' | |||
|
399 | 415 | memcpy(CMSG_DATA(cmsg), fds, sizeof(fds)); |
|
400 | 416 | msgh.msg_controllen = cmsg->cmsg_len; |
|
401 | 417 | ssize_t r = sendmsg(hgc->sockfd, &msgh, 0); |
|
402 | if (r < 0) | |
|
418 | if (r < 0) { | |
|
403 | 419 | abortmsgerrno("sendmsg failed"); |
|
420 | } | |
|
404 | 421 | |
|
405 | 422 | handleresponse(hgc); |
|
406 | 423 | int32_t n; |
|
407 | if (ctx->datasize != sizeof(n)) | |
|
424 | if (ctx->datasize != sizeof(n)) { | |
|
408 | 425 | abortmsg("unexpected size of attachio result"); |
|
426 | } | |
|
409 | 427 | memcpy(&n, ctx->data, sizeof(n)); |
|
410 | 428 | n = ntohl(n); |
|
411 | if (n != sizeof(fds) / sizeof(fds[0])) | |
|
429 | if (n != sizeof(fds) / sizeof(fds[0])) { | |
|
412 | 430 | abortmsg("failed to send fds (n = %d)", n); |
|
431 | } | |
|
413 | 432 | } |
|
414 | 433 | |
|
415 | 434 | static void chdirtocwd(hgclient_t *hgc) |
|
416 | 435 | { |
|
417 | if (!getcwd(hgc->ctx.data, hgc->ctx.maxdatasize)) | |
|
436 | if (!getcwd(hgc->ctx.data, hgc->ctx.maxdatasize)) { | |
|
418 | 437 | abortmsgerrno("failed to getcwd"); |
|
438 | } | |
|
419 | 439 | hgc->ctx.datasize = strlen(hgc->ctx.data); |
|
420 | 440 | writeblockrequest(hgc, "chdir"); |
|
421 | 441 | } |
@@ -440,8 +460,9 b' static void forwardumask(hgclient_t *hgc' | |||
|
440 | 460 | hgclient_t *hgc_open(const char *sockname) |
|
441 | 461 | { |
|
442 | 462 | int fd = socket(AF_UNIX, SOCK_STREAM, 0); |
|
443 | if (fd < 0) | |
|
463 | if (fd < 0) { | |
|
444 | 464 | abortmsgerrno("cannot create socket"); |
|
465 | } | |
|
445 | 466 | |
|
446 | 467 | /* don't keep fd on fork(), so that it can be closed when the parent |
|
447 | 468 | * process get terminated. */ |
@@ -456,34 +477,39 b' hgclient_t *hgc_open(const char *socknam' | |||
|
456 | 477 | { |
|
457 | 478 | const char *split = strrchr(sockname, '/'); |
|
458 | 479 | if (split && split != sockname) { |
|
459 | if (split[1] == '\0') | |
|
480 | if (split[1] == '\0') { | |
|
460 | 481 | abortmsg("sockname cannot end with a slash"); |
|
482 | } | |
|
461 | 483 | size_t len = split - sockname; |
|
462 | 484 | char sockdir[len + 1]; |
|
463 | 485 | memcpy(sockdir, sockname, len); |
|
464 | 486 | sockdir[len] = '\0'; |
|
465 | 487 | |
|
466 | 488 | bakfd = open(".", O_DIRECTORY); |
|
467 | if (bakfd == -1) | |
|
489 | if (bakfd == -1) { | |
|
468 | 490 | abortmsgerrno("cannot open cwd"); |
|
491 | } | |
|
469 | 492 | |
|
470 | 493 | int r = chdir(sockdir); |
|
471 | if (r != 0) | |
|
494 | if (r != 0) { | |
|
472 | 495 | abortmsgerrno("cannot chdir %s", sockdir); |
|
496 | } | |
|
473 | 497 | |
|
474 | 498 | basename = split + 1; |
|
475 | 499 | } |
|
476 | 500 | } |
|
477 | if (strlen(basename) >= sizeof(addr.sun_path)) | |
|
501 | if (strlen(basename) >= sizeof(addr.sun_path)) { | |
|
478 | 502 | abortmsg("sockname is too long: %s", basename); |
|
503 | } | |
|
479 | 504 | strncpy(addr.sun_path, basename, sizeof(addr.sun_path)); |
|
480 | 505 | addr.sun_path[sizeof(addr.sun_path) - 1] = '\0'; |
|
481 | 506 | |
|
482 | 507 | /* real connect */ |
|
483 | 508 | int r = connect(fd, (struct sockaddr *)&addr, sizeof(addr)); |
|
484 | 509 | if (r < 0) { |
|
485 | if (errno != ENOENT && errno != ECONNREFUSED) | |
|
510 | if (errno != ENOENT && errno != ECONNREFUSED) { | |
|
486 | 511 | abortmsgerrno("cannot connect to %s", sockname); |
|
512 | } | |
|
487 | 513 | } |
|
488 | 514 | if (bakfd != -1) { |
|
489 | 515 | fchdirx(bakfd); |
@@ -501,16 +527,21 b' hgclient_t *hgc_open(const char *socknam' | |||
|
501 | 527 | initcontext(&hgc->ctx); |
|
502 | 528 | |
|
503 | 529 | readhello(hgc); |
|
504 | if (!(hgc->capflags & CAP_RUNCOMMAND)) | |
|
530 | if (!(hgc->capflags & CAP_RUNCOMMAND)) { | |
|
505 | 531 | abortmsg("insufficient capability: runcommand"); |
|
506 | if (hgc->capflags & CAP_SETPROCNAME) | |
|
532 | } | |
|
533 | if (hgc->capflags & CAP_SETPROCNAME) { | |
|
507 | 534 | updateprocname(hgc); |
|
508 | if (hgc->capflags & CAP_ATTACHIO) | |
|
535 | } | |
|
536 | if (hgc->capflags & CAP_ATTACHIO) { | |
|
509 | 537 | attachio(hgc); |
|
510 | if (hgc->capflags & CAP_CHDIR) | |
|
538 | } | |
|
539 | if (hgc->capflags & CAP_CHDIR) { | |
|
511 | 540 | chdirtocwd(hgc); |
|
512 | if (hgc->capflags & CAP_SETUMASK2) | |
|
541 | } | |
|
542 | if (hgc->capflags & CAP_SETUMASK2) { | |
|
513 | 543 | forwardumask(hgc); |
|
544 | } | |
|
514 | 545 | |
|
515 | 546 | return hgc; |
|
516 | 547 | } |
@@ -555,16 +586,18 b' const char **hgc_validate(hgclient_t *hg' | |||
|
555 | 586 | size_t argsize) |
|
556 | 587 | { |
|
557 | 588 | assert(hgc); |
|
558 | if (!(hgc->capflags & CAP_VALIDATE)) | |
|
589 | if (!(hgc->capflags & CAP_VALIDATE)) { | |
|
559 | 590 | return NULL; |
|
591 | } | |
|
560 | 592 | |
|
561 | 593 | packcmdargs(&hgc->ctx, args, argsize); |
|
562 | 594 | writeblockrequest(hgc, "validate"); |
|
563 | 595 | handleresponse(hgc); |
|
564 | 596 | |
|
565 | 597 | /* the server returns '\0' if it can handle our request */ |
|
566 | if (hgc->ctx.datasize <= 1) | |
|
598 | if (hgc->ctx.datasize <= 1) { | |
|
567 | 599 | return NULL; |
|
600 | } | |
|
568 | 601 | |
|
569 | 602 | /* make sure the buffer is '\0' terminated */ |
|
570 | 603 | enlargecontext(&hgc->ctx, hgc->ctx.datasize + 1); |
@@ -599,8 +632,9 b' int hgc_runcommand(hgclient_t *hgc, cons' | |||
|
599 | 632 | void hgc_attachio(hgclient_t *hgc) |
|
600 | 633 | { |
|
601 | 634 | assert(hgc); |
|
602 | if (!(hgc->capflags & CAP_ATTACHIO)) | |
|
635 | if (!(hgc->capflags & CAP_ATTACHIO)) { | |
|
603 | 636 | return; |
|
637 | } | |
|
604 | 638 | attachio(hgc); |
|
605 | 639 | } |
|
606 | 640 | |
@@ -613,8 +647,9 b' void hgc_attachio(hgclient_t *hgc)' | |||
|
613 | 647 | void hgc_setenv(hgclient_t *hgc, const char *const envp[]) |
|
614 | 648 | { |
|
615 | 649 | assert(hgc && envp); |
|
616 | if (!(hgc->capflags & CAP_SETENV)) | |
|
650 | if (!(hgc->capflags & CAP_SETENV)) { | |
|
617 | 651 | return; |
|
652 | } | |
|
618 | 653 | packcmdargs(&hgc->ctx, envp, /*argsize*/ -1); |
|
619 | 654 | writeblockrequest(hgc, "setenv"); |
|
620 | 655 | } |
@@ -25,8 +25,9 b' static pid_t peerpid = 0;' | |||
|
25 | 25 | static void forwardsignal(int sig) |
|
26 | 26 | { |
|
27 | 27 | assert(peerpid > 0); |
|
28 | if (kill(peerpid, sig) < 0) | |
|
28 | if (kill(peerpid, sig) < 0) { | |
|
29 | 29 | abortmsgerrno("cannot kill %d", peerpid); |
|
30 | } | |
|
30 | 31 | debugmsg("forward signal %d", sig); |
|
31 | 32 | } |
|
32 | 33 | |
@@ -34,8 +35,9 b' static void forwardsignaltogroup(int sig' | |||
|
34 | 35 | { |
|
35 | 36 | /* prefer kill(-pgid, sig), fallback to pid if pgid is invalid */ |
|
36 | 37 | pid_t killpid = peerpgid > 1 ? -peerpgid : peerpid; |
|
37 | if (kill(killpid, sig) < 0) | |
|
38 | if (kill(killpid, sig) < 0) { | |
|
38 | 39 | abortmsgerrno("cannot kill %d", killpid); |
|
40 | } | |
|
39 | 41 | debugmsg("forward signal %d to %d", sig, killpid); |
|
40 | 42 | } |
|
41 | 43 | |
@@ -43,28 +45,36 b' static void handlestopsignal(int sig)' | |||
|
43 | 45 | { |
|
44 | 46 | sigset_t unblockset, oldset; |
|
45 | 47 | struct sigaction sa, oldsa; |
|
46 | if (sigemptyset(&unblockset) < 0) | |
|
48 | if (sigemptyset(&unblockset) < 0) { | |
|
47 | 49 | goto error; |
|
48 | if (sigaddset(&unblockset, sig) < 0) | |
|
50 | } | |
|
51 | if (sigaddset(&unblockset, sig) < 0) { | |
|
49 | 52 | goto error; |
|
53 | } | |
|
50 | 54 | memset(&sa, 0, sizeof(sa)); |
|
51 | 55 | sa.sa_handler = SIG_DFL; |
|
52 | 56 | sa.sa_flags = SA_RESTART; |
|
53 | if (sigemptyset(&sa.sa_mask) < 0) | |
|
57 | if (sigemptyset(&sa.sa_mask) < 0) { | |
|
54 | 58 | goto error; |
|
59 | } | |
|
55 | 60 | |
|
56 | 61 | forwardsignal(sig); |
|
57 | if (raise(sig) < 0) /* resend to self */ | |
|
62 | if (raise(sig) < 0) { /* resend to self */ | |
|
58 | 63 | goto error; |
|
59 | if (sigaction(sig, &sa, &oldsa) < 0) | |
|
64 | } | |
|
65 | if (sigaction(sig, &sa, &oldsa) < 0) { | |
|
60 | 66 | goto error; |
|
61 | if (sigprocmask(SIG_UNBLOCK, &unblockset, &oldset) < 0) | |
|
67 | } | |
|
68 | if (sigprocmask(SIG_UNBLOCK, &unblockset, &oldset) < 0) { | |
|
62 | 69 | goto error; |
|
70 | } | |
|
63 | 71 | /* resent signal will be handled before sigprocmask() returns */ |
|
64 | if (sigprocmask(SIG_SETMASK, &oldset, NULL) < 0) | |
|
72 | if (sigprocmask(SIG_SETMASK, &oldset, NULL) < 0) { | |
|
65 | 73 | goto error; |
|
66 | if (sigaction(sig, &oldsa, NULL) < 0) | |
|
74 | } | |
|
75 | if (sigaction(sig, &oldsa, NULL) < 0) { | |
|
67 | 76 | goto error; |
|
77 | } | |
|
68 | 78 | return; |
|
69 | 79 | |
|
70 | 80 | error: |
@@ -73,19 +83,22 b' error:' | |||
|
73 | 83 | |
|
74 | 84 | static void handlechildsignal(int sig UNUSED_) |
|
75 | 85 | { |
|
76 | if (peerpid == 0 || pagerpid == 0) | |
|
86 | if (peerpid == 0 || pagerpid == 0) { | |
|
77 | 87 | return; |
|
88 | } | |
|
78 | 89 | /* if pager exits, notify the server with SIGPIPE immediately. |
|
79 | 90 | * otherwise the server won't get SIGPIPE if it does not write |
|
80 | 91 | * anything. (issue5278) */ |
|
81 | if (waitpid(pagerpid, NULL, WNOHANG) == pagerpid) | |
|
92 | if (waitpid(pagerpid, NULL, WNOHANG) == pagerpid) { | |
|
82 | 93 | kill(peerpid, SIGPIPE); |
|
94 | } | |
|
83 | 95 | } |
|
84 | 96 | |
|
85 | 97 | void setupsignalhandler(pid_t pid, pid_t pgid) |
|
86 | 98 | { |
|
87 | if (pid <= 0) | |
|
99 | if (pid <= 0) { | |
|
88 | 100 | return; |
|
101 | } | |
|
89 | 102 | peerpid = pid; |
|
90 | 103 | peerpgid = (pgid <= 1 ? 0 : pgid); |
|
91 | 104 | |
@@ -98,42 +111,52 b' void setupsignalhandler(pid_t pid, pid_t' | |||
|
98 | 111 | * - SIGINT: usually generated by the terminal */ |
|
99 | 112 | sa.sa_handler = forwardsignaltogroup; |
|
100 | 113 | sa.sa_flags = SA_RESTART; |
|
101 | if (sigemptyset(&sa.sa_mask) < 0) | |
|
114 | if (sigemptyset(&sa.sa_mask) < 0) { | |
|
115 | goto error; | |
|
116 | } | |
|
117 | if (sigaction(SIGHUP, &sa, NULL) < 0) { | |
|
102 | 118 | goto error; |
|
103 | if (sigaction(SIGHUP, &sa, NULL) < 0) | |
|
119 | } | |
|
120 | if (sigaction(SIGINT, &sa, NULL) < 0) { | |
|
104 | 121 | goto error; |
|
105 | if (sigaction(SIGINT, &sa, NULL) < 0) | |
|
106 | goto error; | |
|
122 | } | |
|
107 | 123 | |
|
108 | 124 | /* terminate frontend by double SIGTERM in case of server freeze */ |
|
109 | 125 | sa.sa_handler = forwardsignal; |
|
110 | 126 | sa.sa_flags |= SA_RESETHAND; |
|
111 | if (sigaction(SIGTERM, &sa, NULL) < 0) | |
|
127 | if (sigaction(SIGTERM, &sa, NULL) < 0) { | |
|
112 | 128 | goto error; |
|
129 | } | |
|
113 | 130 | |
|
114 | 131 | /* notify the worker about window resize events */ |
|
115 | 132 | sa.sa_flags = SA_RESTART; |
|
116 | if (sigaction(SIGWINCH, &sa, NULL) < 0) | |
|
133 | if (sigaction(SIGWINCH, &sa, NULL) < 0) { | |
|
117 | 134 | goto error; |
|
135 | } | |
|
118 | 136 | /* forward user-defined signals */ |
|
119 | if (sigaction(SIGUSR1, &sa, NULL) < 0) | |
|
137 | if (sigaction(SIGUSR1, &sa, NULL) < 0) { | |
|
120 | 138 | goto error; |
|
121 | if (sigaction(SIGUSR2, &sa, NULL) < 0) | |
|
139 | } | |
|
140 | if (sigaction(SIGUSR2, &sa, NULL) < 0) { | |
|
122 | 141 | goto error; |
|
142 | } | |
|
123 | 143 | /* propagate job control requests to worker */ |
|
124 | 144 | sa.sa_handler = forwardsignal; |
|
125 | 145 | sa.sa_flags = SA_RESTART; |
|
126 | if (sigaction(SIGCONT, &sa, NULL) < 0) | |
|
146 | if (sigaction(SIGCONT, &sa, NULL) < 0) { | |
|
127 | 147 | goto error; |
|
148 | } | |
|
128 | 149 | sa.sa_handler = handlestopsignal; |
|
129 | 150 | sa.sa_flags = SA_RESTART; |
|
130 | if (sigaction(SIGTSTP, &sa, NULL) < 0) | |
|
151 | if (sigaction(SIGTSTP, &sa, NULL) < 0) { | |
|
131 | 152 | goto error; |
|
153 | } | |
|
132 | 154 | /* get notified when pager exits */ |
|
133 | 155 | sa.sa_handler = handlechildsignal; |
|
134 | 156 | sa.sa_flags = SA_RESTART; |
|
135 | if (sigaction(SIGCHLD, &sa, NULL) < 0) | |
|
157 | if (sigaction(SIGCHLD, &sa, NULL) < 0) { | |
|
136 | 158 | goto error; |
|
159 | } | |
|
137 | 160 | |
|
138 | 161 | return; |
|
139 | 162 | |
@@ -147,26 +170,34 b' void restoresignalhandler(void)' | |||
|
147 | 170 | memset(&sa, 0, sizeof(sa)); |
|
148 | 171 | sa.sa_handler = SIG_DFL; |
|
149 | 172 | sa.sa_flags = SA_RESTART; |
|
150 | if (sigemptyset(&sa.sa_mask) < 0) | |
|
173 | if (sigemptyset(&sa.sa_mask) < 0) { | |
|
151 | 174 | goto error; |
|
175 | } | |
|
152 | 176 | |
|
153 | if (sigaction(SIGHUP, &sa, NULL) < 0) | |
|
177 | if (sigaction(SIGHUP, &sa, NULL) < 0) { | |
|
154 | 178 | goto error; |
|
155 | if (sigaction(SIGTERM, &sa, NULL) < 0) | |
|
179 | } | |
|
180 | if (sigaction(SIGTERM, &sa, NULL) < 0) { | |
|
156 | 181 | goto error; |
|
157 | if (sigaction(SIGWINCH, &sa, NULL) < 0) | |
|
182 | } | |
|
183 | if (sigaction(SIGWINCH, &sa, NULL) < 0) { | |
|
158 | 184 | goto error; |
|
159 | if (sigaction(SIGCONT, &sa, NULL) < 0) | |
|
185 | } | |
|
186 | if (sigaction(SIGCONT, &sa, NULL) < 0) { | |
|
160 | 187 | goto error; |
|
161 | if (sigaction(SIGTSTP, &sa, NULL) < 0) | |
|
188 | } | |
|
189 | if (sigaction(SIGTSTP, &sa, NULL) < 0) { | |
|
162 | 190 | goto error; |
|
163 | if (sigaction(SIGCHLD, &sa, NULL) < 0) | |
|
191 | } | |
|
192 | if (sigaction(SIGCHLD, &sa, NULL) < 0) { | |
|
164 | 193 | goto error; |
|
194 | } | |
|
165 | 195 | |
|
166 | 196 | /* ignore Ctrl+C while shutting down to make pager exits cleanly */ |
|
167 | 197 | sa.sa_handler = SIG_IGN; |
|
168 | if (sigaction(SIGINT, &sa, NULL) < 0) | |
|
198 | if (sigaction(SIGINT, &sa, NULL) < 0) { | |
|
169 | 199 | goto error; |
|
200 | } | |
|
170 | 201 | |
|
171 | 202 | peerpid = 0; |
|
172 | 203 | return; |
@@ -180,22 +211,27 b' error:' | |||
|
180 | 211 | pid_t setuppager(const char *pagercmd, const char *envp[]) |
|
181 | 212 | { |
|
182 | 213 | assert(pagerpid == 0); |
|
183 | if (!pagercmd) | |
|
214 | if (!pagercmd) { | |
|
184 | 215 | return 0; |
|
216 | } | |
|
185 | 217 | |
|
186 | 218 | int pipefds[2]; |
|
187 | if (pipe(pipefds) < 0) | |
|
219 | if (pipe(pipefds) < 0) { | |
|
188 | 220 | return 0; |
|
221 | } | |
|
189 | 222 | pid_t pid = fork(); |
|
190 | if (pid < 0) | |
|
223 | if (pid < 0) { | |
|
191 | 224 | goto error; |
|
225 | } | |
|
192 | 226 | if (pid > 0) { |
|
193 | 227 | close(pipefds[0]); |
|
194 | if (dup2(pipefds[1], fileno(stdout)) < 0) | |
|
228 | if (dup2(pipefds[1], fileno(stdout)) < 0) { | |
|
195 | 229 | goto error; |
|
230 | } | |
|
196 | 231 | if (isatty(fileno(stderr))) { |
|
197 | if (dup2(pipefds[1], fileno(stderr)) < 0) | |
|
232 | if (dup2(pipefds[1], fileno(stderr)) < 0) { | |
|
198 | 233 | goto error; |
|
234 | } | |
|
199 | 235 | } |
|
200 | 236 | close(pipefds[1]); |
|
201 | 237 | pagerpid = pid; |
@@ -222,16 +258,18 b' error:' | |||
|
222 | 258 | |
|
223 | 259 | void waitpager(void) |
|
224 | 260 | { |
|
225 | if (pagerpid == 0) | |
|
261 | if (pagerpid == 0) { | |
|
226 | 262 | return; |
|
263 | } | |
|
227 | 264 | |
|
228 | 265 | /* close output streams to notify the pager its input ends */ |
|
229 | 266 | fclose(stdout); |
|
230 | 267 | fclose(stderr); |
|
231 | 268 | while (1) { |
|
232 | 269 | pid_t ret = waitpid(pagerpid, NULL, 0); |
|
233 | if (ret == -1 && errno == EINTR) | |
|
270 | if (ret == -1 && errno == EINTR) { | |
|
234 | 271 | continue; |
|
272 | } | |
|
235 | 273 | break; |
|
236 | 274 | } |
|
237 | 275 | } |
@@ -25,8 +25,9 b' static int colorenabled = 0;' | |||
|
25 | 25 | |
|
26 | 26 | static inline void fsetcolor(FILE *fp, const char *code) |
|
27 | 27 | { |
|
28 | if (!colorenabled) | |
|
28 | if (!colorenabled) { | |
|
29 | 29 | return; |
|
30 | } | |
|
30 | 31 | fprintf(fp, "\033[%sm", code); |
|
31 | 32 | } |
|
32 | 33 | |
@@ -35,8 +36,9 b' static void vabortmsgerrno(int no, const' | |||
|
35 | 36 | fsetcolor(stderr, "1;31"); |
|
36 | 37 | fputs("chg: abort: ", stderr); |
|
37 | 38 | vfprintf(stderr, fmt, args); |
|
38 | if (no != 0) | |
|
39 | if (no != 0) { | |
|
39 | 40 | fprintf(stderr, " (errno = %d, %s)", no, strerror(no)); |
|
41 | } | |
|
40 | 42 | fsetcolor(stderr, ""); |
|
41 | 43 | fputc('\n', stderr); |
|
42 | 44 | exit(255); |
@@ -82,8 +84,9 b' void enabledebugmsg(void)' | |||
|
82 | 84 | |
|
83 | 85 | void debugmsg(const char *fmt, ...) |
|
84 | 86 | { |
|
85 | if (!debugmsgenabled) | |
|
87 | if (!debugmsgenabled) { | |
|
86 | 88 | return; |
|
89 | } | |
|
87 | 90 | |
|
88 | 91 | va_list args; |
|
89 | 92 | va_start(args, fmt); |
@@ -98,32 +101,37 b' void debugmsg(const char *fmt, ...)' | |||
|
98 | 101 | void fchdirx(int dirfd) |
|
99 | 102 | { |
|
100 | 103 | int r = fchdir(dirfd); |
|
101 | if (r == -1) | |
|
104 | if (r == -1) { | |
|
102 | 105 | abortmsgerrno("failed to fchdir"); |
|
106 | } | |
|
103 | 107 | } |
|
104 | 108 | |
|
105 | 109 | void fsetcloexec(int fd) |
|
106 | 110 | { |
|
107 | 111 | int flags = fcntl(fd, F_GETFD); |
|
108 | if (flags < 0) | |
|
112 | if (flags < 0) { | |
|
109 | 113 | abortmsgerrno("cannot get flags of fd %d", fd); |
|
110 | if (fcntl(fd, F_SETFD, flags | FD_CLOEXEC) < 0) | |
|
114 | } | |
|
115 | if (fcntl(fd, F_SETFD, flags | FD_CLOEXEC) < 0) { | |
|
111 | 116 | abortmsgerrno("cannot set flags of fd %d", fd); |
|
117 | } | |
|
112 | 118 | } |
|
113 | 119 | |
|
114 | 120 | void *mallocx(size_t size) |
|
115 | 121 | { |
|
116 | 122 | void *result = malloc(size); |
|
117 | if (!result) | |
|
123 | if (!result) { | |
|
118 | 124 | abortmsg("failed to malloc"); |
|
125 | } | |
|
119 | 126 | return result; |
|
120 | 127 | } |
|
121 | 128 | |
|
122 | 129 | void *reallocx(void *ptr, size_t size) |
|
123 | 130 | { |
|
124 | 131 | void *result = realloc(ptr, size); |
|
125 | if (!result) | |
|
132 | if (!result) { | |
|
126 | 133 | abortmsg("failed to realloc"); |
|
134 | } | |
|
127 | 135 | return result; |
|
128 | 136 | } |
|
129 | 137 | |
@@ -144,30 +152,37 b' int runshellcmd(const char *cmd, const c' | |||
|
144 | 152 | memset(&newsa, 0, sizeof(newsa)); |
|
145 | 153 | newsa.sa_handler = SIG_IGN; |
|
146 | 154 | newsa.sa_flags = 0; |
|
147 | if (sigemptyset(&newsa.sa_mask) < 0) | |
|
155 | if (sigemptyset(&newsa.sa_mask) < 0) { | |
|
148 | 156 | goto done; |
|
149 | if (sigaction(SIGINT, &newsa, &oldsaint) < 0) | |
|
157 | } | |
|
158 | if (sigaction(SIGINT, &newsa, &oldsaint) < 0) { | |
|
150 | 159 | goto done; |
|
160 | } | |
|
151 | 161 | doneflags |= F_SIGINT; |
|
152 | if (sigaction(SIGQUIT, &newsa, &oldsaquit) < 0) | |
|
162 | if (sigaction(SIGQUIT, &newsa, &oldsaquit) < 0) { | |
|
153 | 163 | goto done; |
|
164 | } | |
|
154 | 165 | doneflags |= F_SIGQUIT; |
|
155 | 166 | |
|
156 | if (sigaddset(&newsa.sa_mask, SIGCHLD) < 0) | |
|
167 | if (sigaddset(&newsa.sa_mask, SIGCHLD) < 0) { | |
|
157 | 168 | goto done; |
|
158 | if (sigprocmask(SIG_BLOCK, &newsa.sa_mask, &oldmask) < 0) | |
|
169 | } | |
|
170 | if (sigprocmask(SIG_BLOCK, &newsa.sa_mask, &oldmask) < 0) { | |
|
159 | 171 | goto done; |
|
172 | } | |
|
160 | 173 | doneflags |= F_SIGMASK; |
|
161 | 174 | |
|
162 | 175 | pid_t pid = fork(); |
|
163 | if (pid < 0) | |
|
176 | if (pid < 0) { | |
|
164 | 177 | goto done; |
|
178 | } | |
|
165 | 179 | if (pid == 0) { |
|
166 | 180 | sigaction(SIGINT, &oldsaint, NULL); |
|
167 | 181 | sigaction(SIGQUIT, &oldsaquit, NULL); |
|
168 | 182 | sigprocmask(SIG_SETMASK, &oldmask, NULL); |
|
169 | if (cwd && chdir(cwd) < 0) | |
|
183 | if (cwd && chdir(cwd) < 0) { | |
|
170 | 184 | _exit(127); |
|
185 | } | |
|
171 | 186 | const char *argv[] = {"sh", "-c", cmd, NULL}; |
|
172 | 187 | if (envp) { |
|
173 | 188 | execve("/bin/sh", (char **)argv, (char **)envp); |
@@ -176,25 +191,32 b' int runshellcmd(const char *cmd, const c' | |||
|
176 | 191 | } |
|
177 | 192 | _exit(127); |
|
178 | 193 | } else { |
|
179 | if (waitpid(pid, &status, 0) < 0) | |
|
194 | if (waitpid(pid, &status, 0) < 0) { | |
|
180 | 195 | goto done; |
|
196 | } | |
|
181 | 197 | doneflags |= F_WAITPID; |
|
182 | 198 | } |
|
183 | 199 | |
|
184 | 200 | done: |
|
185 | if (doneflags & F_SIGINT) | |
|
201 | if (doneflags & F_SIGINT) { | |
|
186 | 202 | sigaction(SIGINT, &oldsaint, NULL); |
|
187 | if (doneflags & F_SIGQUIT) | |
|
203 | } | |
|
204 | if (doneflags & F_SIGQUIT) { | |
|
188 | 205 | sigaction(SIGQUIT, &oldsaquit, NULL); |
|
189 | if (doneflags & F_SIGMASK) | |
|
206 | } | |
|
207 | if (doneflags & F_SIGMASK) { | |
|
190 | 208 | sigprocmask(SIG_SETMASK, &oldmask, NULL); |
|
209 | } | |
|
191 | 210 | |
|
192 | 211 | /* no way to report other errors, use 127 (= shell termination) */ |
|
193 | if (!(doneflags & F_WAITPID)) | |
|
212 | if (!(doneflags & F_WAITPID)) { | |
|
194 | 213 | return 127; |
|
195 | if (WIFEXITED(status)) | |
|
214 | } | |
|
215 | if (WIFEXITED(status)) { | |
|
196 | 216 | return WEXITSTATUS(status); |
|
197 | if (WIFSIGNALED(status)) | |
|
217 | } | |
|
218 | if (WIFSIGNALED(status)) { | |
|
198 | 219 | return -WTERMSIG(status); |
|
220 | } | |
|
199 | 221 | return 127; |
|
200 | 222 | } |
@@ -62,6 +62,11 b' contrib/python-zstandard/zstd/compress/z' | |||
|
62 | 62 | contrib/python-zstandard/zstd/compress/zstd_opt.c |
|
63 | 63 | contrib/python-zstandard/zstd/compress/zstd_opt.h |
|
64 | 64 | contrib/python-zstandard/zstd/decompress/huf_decompress.c |
|
65 | contrib/python-zstandard/zstd/decompress/zstd_ddict.c | |
|
66 | contrib/python-zstandard/zstd/decompress/zstd_ddict.h | |
|
67 | contrib/python-zstandard/zstd/decompress/zstd_decompress_block.c | |
|
68 | contrib/python-zstandard/zstd/decompress/zstd_decompress_block.h | |
|
69 | contrib/python-zstandard/zstd/decompress/zstd_decompress_internal.h | |
|
65 | 70 | contrib/python-zstandard/zstd/decompress/zstd_decompress.c |
|
66 | 71 | contrib/python-zstandard/zstd/deprecated/zbuff_common.c |
|
67 | 72 | contrib/python-zstandard/zstd/deprecated/zbuff_compress.c |
@@ -7,6 +7,7 b' import mercurial' | |||
|
7 | 7 | import sys |
|
8 | 8 | from mercurial import ( |
|
9 | 9 | demandimport, |
|
10 | pycompat, | |
|
10 | 11 | registrar, |
|
11 | 12 | ) |
|
12 | 13 | |
@@ -32,28 +33,30 b' def ipdb(ui, repo, msg, **opts):' | |||
|
32 | 33 | |
|
33 | 34 | IPython.embed() |
|
34 | 35 | |
|
35 | @command('debugshell|dbsh', []) | |
|
36 | @command(b'debugshell|dbsh', []) | |
|
36 | 37 | def debugshell(ui, repo, **opts): |
|
37 | bannermsg = "loaded repo : %s\n" \ | |
|
38 | "using source: %s" % (repo.root, | |
|
39 | mercurial.__path__[0]) | |
|
38 | bannermsg = ("loaded repo : %s\n" | |
|
39 | "using source: %s" % (pycompat.sysstr(repo.root), | |
|
40 | mercurial.__path__[0])) | |
|
40 | 41 | |
|
41 | 42 | pdbmap = { |
|
42 | 43 | 'pdb' : 'code', |
|
43 | 44 | 'ipdb' : 'IPython' |
|
44 | 45 | } |
|
45 | 46 | |
|
46 | debugger = ui.config("ui", "debugger") | |
|
47 | debugger = ui.config(b"ui", b"debugger") | |
|
47 | 48 | if not debugger: |
|
48 | 49 | debugger = 'pdb' |
|
50 | else: | |
|
51 | debugger = pycompat.sysstr(debugger) | |
|
49 | 52 | |
|
50 | 53 | # if IPython doesn't exist, fallback to code.interact |
|
51 | 54 | try: |
|
52 | 55 | with demandimport.deactivated(): |
|
53 | 56 | __import__(pdbmap[debugger]) |
|
54 | 57 | except ImportError: |
|
55 | ui.warn(("%s debugger specified but %s module was not found\n") | |
|
58 | ui.warn((b"%s debugger specified but %s module was not found\n") | |
|
56 | 59 | % (debugger, pdbmap[debugger])) |
|
57 | debugger = 'pdb' | |
|
60 | debugger = b'pdb' | |
|
58 | 61 | |
|
59 | 62 | getattr(sys.modules[__name__], debugger)(ui, repo, bannermsg, **opts) |
@@ -20,11 +20,19 b' try:' | |||
|
20 | 20 | lm = lazymanifest(mdata) |
|
21 | 21 | # iterate the whole thing, which causes the code to fully parse |
|
22 | 22 | # every line in the manifest |
|
23 | lm.iterentries() | |
|
23 | for e, _, _ in lm.iterentries(): | |
|
24 | # also exercise __getitem__ et al | |
|
25 | lm[e] | |
|
26 | e in lm | |
|
27 | (e + 'nope') in lm | |
|
24 | 28 | lm[b'xyzzy'] = (b'\0' * 20, 'x') |
|
25 | 29 | # do an insert, text should change |
|
26 | 30 | assert lm.text() != mdata, "insert should change text and didn't: %r %r" % (lm.text(), mdata) |
|
31 | cloned = lm.filtercopy(lambda x: x != 'xyzzy') | |
|
32 | assert cloned.text() == mdata, 'cloned text should equal mdata' | |
|
33 | cloned.diff(lm) | |
|
27 | 34 | del lm[b'xyzzy'] |
|
35 | cloned.diff(lm) | |
|
28 | 36 | # should be back to the same |
|
29 | 37 | assert lm.text() == mdata, "delete should have restored text but didn't: %r %r" % (lm.text(), mdata) |
|
30 | 38 | except Exception as e: |
@@ -39,6 +47,11 b' except Exception as e:' | |||
|
39 | 47 | |
|
40 | 48 | int LLVMFuzzerTestOneInput(const uint8_t *Data, size_t Size) |
|
41 | 49 | { |
|
50 | // Don't allow fuzzer inputs larger than 100k, since we'll just bog | |
|
51 | // down and not accomplish much. | |
|
52 | if (Size > 100000) { | |
|
53 | return 0; | |
|
54 | } | |
|
42 | 55 | PyObject *mtext = |
|
43 | 56 | PyBytes_FromStringAndSize((const char *)Data, (Py_ssize_t)Size); |
|
44 | 57 | PyObject *locals = PyDict_New(); |
@@ -19,6 +19,11 b' from parsers import parse_index2' | |||
|
19 | 19 | for inline in (True, False): |
|
20 | 20 | try: |
|
21 | 21 | index, cache = parse_index2(data, inline) |
|
22 | index.slicechunktodensity(list(range(len(index))), 0.5, 262144) | |
|
23 | for rev in range(len(index)): | |
|
24 | node = index[rev][7] | |
|
25 | partial = index.shortest(node) | |
|
26 | index.partialmatch(node[:partial]) | |
|
22 | 27 | except Exception as e: |
|
23 | 28 | pass |
|
24 | 29 | # uncomment this print if you're editing this Python code |
@@ -31,6 +36,11 b' for inline in (True, False):' | |||
|
31 | 36 | |
|
32 | 37 | int LLVMFuzzerTestOneInput(const uint8_t *Data, size_t Size) |
|
33 | 38 | { |
|
39 | // Don't allow fuzzer inputs larger than 60k, since we'll just bog | |
|
40 | // down and not accomplish much. | |
|
41 | if (Size > 60000) { | |
|
42 | return 0; | |
|
43 | } | |
|
34 | 44 | PyObject *text = |
|
35 | 45 | PyBytes_FromStringAndSize((const char *)Data, (Py_ssize_t)Size); |
|
36 | 46 | PyObject *locals = PyDict_New(); |
@@ -53,4 +53,45 b'' | |||
|
53 | 53 | (setq mode-name "hg-test") |
|
54 | 54 | (run-hooks 'hg-test-mode-hook)) |
|
55 | 55 | |
|
56 | (with-eval-after-load "compile" | |
|
57 | ;; Link to Python sources in tracebacks in .t failures. | |
|
58 | (add-to-list 'compilation-error-regexp-alist-alist | |
|
59 | '(hg-test-output-python-tb | |
|
60 | "^\\+ +File ['\"]\\([^'\"]+\\)['\"], line \\([0-9]+\\)," 1 2)) | |
|
61 | (add-to-list 'compilation-error-regexp-alist 'hg-test-output-python-tb) | |
|
62 | ;; Link to source files in test-check-code.t violations. | |
|
63 | (add-to-list 'compilation-error-regexp-alist-alist | |
|
64 | '(hg-test-check-code-output | |
|
65 | "\\+ \\([^:\n]+\\):\\([0-9]+\\):$" 1 2)) | |
|
66 | (add-to-list 'compilation-error-regexp-alist 'hg-test-check-code-output)) | |
|
67 | ||
|
68 | (defun hg-test-mode--test-one-error-line-regexp (test) | |
|
69 | (erase-buffer) | |
|
70 | (setq compilation-locs (make-hash-table)) | |
|
71 | (insert (car test)) | |
|
72 | (compilation-parse-errors (point-min) (point-max)) | |
|
73 | (let ((msg (get-text-property 1 'compilation-message))) | |
|
74 | (should msg) | |
|
75 | (let ((loc (compilation--message->loc msg)) | |
|
76 | (line (nth 1 test)) | |
|
77 | (file (nth 2 test))) | |
|
78 | (should (equal (compilation--loc->line loc) line)) | |
|
79 | (should (equal (caar (compilation--loc->file-struct loc)) file))) | |
|
80 | msg)) | |
|
81 | ||
|
82 | (require 'ert) | |
|
83 | (ert-deftest hg-test-mode--compilation-mode-support () | |
|
84 | "Test hg-specific compilation-mode regular expressions" | |
|
85 | (require 'compile) | |
|
86 | (with-temp-buffer | |
|
87 | (font-lock-mode -1) | |
|
88 | (mapc 'hg-test-mode--test-one-error-line-regexp | |
|
89 | '( | |
|
90 | ("+ contrib/debugshell.py:37:" 37 "contrib/debugshell.py") | |
|
91 | ("+ File \"/tmp/hg/mercurial/commands.py\", line 3115, in help_" | |
|
92 | 3115 "/tmp/hg/mercurial/commands.py") | |
|
93 | ("+ File \"mercurial/dispatch.py\", line 225, in dispatch" | |
|
94 | 225 "mercurial/dispatch.py"))))) | |
|
95 | ||
|
96 | ||
|
56 | 97 | (provide 'hg-test-mode) |
@@ -76,7 +76,7 b' def build_docker_image(dockerfile: pathl' | |||
|
76 | 76 | p.communicate(input=dockerfile) |
|
77 | 77 | if p.returncode: |
|
78 | 78 | raise subprocess.CalledProcessException( |
|
79 | p.returncode, 'failed to build docker image: %s %s' | |
|
79 | p.returncode, 'failed to build docker image: %s %s' | |
|
80 | 80 | % (p.stdout, p.stderr)) |
|
81 | 81 | |
|
82 | 82 | def command_build(args): |
@@ -5,7 +5,7 b'' | |||
|
5 | 5 | #define FileHandle |
|
6 | 6 | #define FileLine |
|
7 | 7 | #define VERSION = "unknown" |
|
8 | #if FileHandle = FileOpen(SourcePath + "\..\..\mercurial\__version__.py") | |
|
8 | #if FileHandle = FileOpen(SourcePath + "\..\..\..\mercurial\__version__.py") | |
|
9 | 9 | #expr FileLine = FileRead(FileHandle) |
|
10 | 10 | #expr FileLine = FileRead(FileHandle) |
|
11 | 11 | #define VERSION = Copy(FileLine, Pos('"', FileLine)+1, Len(FileLine)-Pos('"', FileLine)-1) |
@@ -43,7 +43,7 b' AppUpdatesURL=https://mercurial-scm.org/' | |||
|
43 | 43 | AppID={{4B95A5F1-EF59-4B08-BED8-C891C46121B3} |
|
44 | 44 | AppContact=mercurial@mercurial-scm.org |
|
45 | 45 | DefaultDirName={pf}\Mercurial |
|
46 | SourceDir=..\.. | |
|
46 | SourceDir=..\..\.. | |
|
47 | 47 | VersionInfoDescription=Mercurial distributed SCM (version {#VERSION}) |
|
48 | 48 | VersionInfoCopyright=Copyright 2005-2019 Matt Mackall and others |
|
49 | 49 | VersionInfoCompany=Matt Mackall and others |
@@ -53,6 +53,7 b' SetupIconFile=contrib\\win32\\mercurial.ic' | |||
|
53 | 53 | AllowNoIcons=true |
|
54 | 54 | DefaultGroupName=Mercurial |
|
55 | 55 | PrivilegesRequired=none |
|
56 | ChangesEnvironment=true | |
|
56 | 57 | |
|
57 | 58 | [Files] |
|
58 | 59 | Source: contrib\mercurial.el; DestDir: {app}/Contrib |
@@ -70,17 +71,12 b' Source: contrib\\hgweb.wsgi; DestDir: {ap' | |||
|
70 | 71 | Source: contrib\win32\ReadMe.html; DestDir: {app}; Flags: isreadme |
|
71 | 72 | Source: contrib\win32\postinstall.txt; DestDir: {app}; DestName: ReleaseNotes.txt |
|
72 | 73 | Source: dist\hg.exe; DestDir: {app}; AfterInstall: Touch('{app}\hg.exe.local') |
|
73 | #if ARCH == "x64" | |
|
74 | 74 | Source: dist\lib\*.dll; Destdir: {app}\lib |
|
75 | 75 | Source: dist\lib\*.pyd; Destdir: {app}\lib |
|
76 | #else | |
|
77 | Source: dist\w9xpopen.exe; DestDir: {app} | |
|
78 | #endif | |
|
79 | 76 | Source: dist\python*.dll; Destdir: {app}; Flags: skipifsourcedoesntexist |
|
80 | 77 | Source: dist\msvc*.dll; DestDir: {app}; Flags: skipifsourcedoesntexist |
|
81 | 78 | Source: dist\Microsoft.VC*.CRT.manifest; DestDir: {app}; Flags: skipifsourcedoesntexist |
|
82 | 79 | Source: dist\lib\library.zip; DestDir: {app}\lib |
|
83 | Source: dist\add_path.exe; DestDir: {app} | |
|
84 | 80 | Source: doc\*.html; DestDir: {app}\Docs |
|
85 | 81 | Source: doc\style.css; DestDir: {app}\Docs |
|
86 | 82 | Source: mercurial\help\*.txt; DestDir: {app}\help |
@@ -107,14 +103,22 b' Name: {group}\\Mercurial Configuration Fi' | |||
|
107 | 103 | Name: {group}\Mercurial Ignore Files; Filename: {app}\Docs\hgignore.5.html |
|
108 | 104 | Name: {group}\Mercurial Web Site; Filename: {app}\Mercurial.url |
|
109 | 105 | |
|
110 | [Run] | |
|
111 | Filename: "{app}\add_path.exe"; Parameters: "{app}" | |
|
112 | ||
|
113 | [UninstallRun] | |
|
114 | Filename: "{app}\add_path.exe"; Parameters: "/del {app}" | |
|
106 | [Tasks] | |
|
107 | Name: modifypath; Description: Add the installation path to the search path; Flags: unchecked | |
|
115 | 108 | |
|
116 | 109 | [Code] |
|
117 | 110 | procedure Touch(fn: String); |
|
118 | 111 | begin |
|
119 | 112 | SaveStringToFile(ExpandConstant(fn), '', False); |
|
120 | 113 | end; |
|
114 | ||
|
115 | const | |
|
116 | ModPathName = 'modifypath'; | |
|
117 | ModPathType = 'user'; | |
|
118 | ||
|
119 | function ModPathDir(): TArrayOfString; | |
|
120 | begin | |
|
121 | setArrayLength(Result, 1) | |
|
122 | Result[0] := ExpandConstant('{app}'); | |
|
123 | end; | |
|
124 | #include "modpath.iss" |
@@ -1,130 +1,61 b'' | |||
|
1 | The standalone Windows installer for Mercurial is built in a somewhat | |
|
2 | jury-rigged fashion. | |
|
1 | Requirements | |
|
2 | ============ | |
|
3 | 3 | |
|
4 | It has the following prerequisites. Ensure to take the packages | |
|
5 | matching the mercurial version you want to build (32-bit or 64-bit). | |
|
4 | Building the Inno installer requires a Windows machine. | |
|
6 | 5 | |
|
7 | Python 2.6 for Windows | |
|
8 | http://www.python.org/download/releases/ | |
|
6 | The following system dependencies must be installed: | |
|
9 | 7 | |
|
10 | A compiler: | |
|
11 | either MinGW | |
|
12 | http://www.mingw.org/ | |
|
13 | or Microsoft Visual C++ 2008 SP1 Express Edition | |
|
14 | http://www.microsoft.com/express/Downloads/Download-2008.aspx | |
|
15 | ||
|
16 | Python for Windows Extensions | |
|
17 | http://sourceforge.net/projects/pywin32/ | |
|
18 | ||
|
19 | mfc71.dll (just download, don't install; not needed for Python 2.6) | |
|
20 | http://starship.python.net/crew/mhammond/win32/ | |
|
21 | ||
|
22 | Visual C++ 2008 redistributable package (needed for >= Python 2.6 or if you compile with MSVC) | |
|
23 | for 32-bit: | |
|
24 | http://www.microsoft.com/downloads/details.aspx?FamilyID=9b2da534-3e03-4391-8a4d-074b9f2bc1bf | |
|
25 | for 64-bit: | |
|
26 | http://www.microsoft.com/downloads/details.aspx?familyid=bd2a6171-e2d6-4230-b809-9a8d7548c1b6 | |
|
8 | * Python 2.7 (download from https://www.python.org/downloads/) | |
|
9 | * Microsoft Visual C++ Compiler for Python 2.7 | |
|
10 | (https://www.microsoft.com/en-us/download/details.aspx?id=44266) | |
|
11 | * Inno Setup (http://jrsoftware.org/isdl.php) version 5.4 or newer. | |
|
12 | Be sure to install the optional Inno Setup Preprocessor feature, | |
|
13 | which is required. | |
|
14 | * Python 3.5+ (to run the ``build.py`` script) | |
|
27 | 15 | |
|
28 | The py2exe distutils extension | |
|
29 | http://sourceforge.net/projects/py2exe/ | |
|
30 | ||
|
31 | GnuWin32 gettext utility (if you want to build translations) | |
|
32 | http://gnuwin32.sourceforge.net/packages/gettext.htm | |
|
33 | ||
|
34 | Inno Setup | |
|
35 | http://www.jrsoftware.org/isdl.php#qsp | |
|
36 | ||
|
37 | Get and install ispack-5.3.10.exe or later (includes Inno Setup Processor), | |
|
38 | which is necessary to package Mercurial. | |
|
39 | ||
|
40 | ISTool - optional | |
|
41 | http://www.istool.org/default.aspx/ | |
|
16 | Building | |
|
17 | ======== | |
|
42 | 18 | |
|
43 | add_path (you need only add_path.exe in the zip file) | |
|
44 | http://www.barisione.org/apps.html#add_path | |
|
45 | ||
|
46 | Docutils | |
|
47 | http://docutils.sourceforge.net/ | |
|
19 | The ``build.py`` script automates the process of producing an | |
|
20 | Inno installer. It manages fetching and configuring the | |
|
21 | non-system dependencies (such as py2exe, gettext, and various | |
|
22 | Python packages). | |
|
48 | 23 | |
|
49 | CA Certs file | |
|
50 | http://curl.haxx.se/ca/cacert.pem | |
|
51 | ||
|
52 | And, of course, Mercurial itself. | |
|
53 | ||
|
54 | Once you have all this installed and built, clone a copy of the | |
|
55 | Mercurial repository you want to package, and name the repo | |
|
56 | C:\hg\hg-release. | |
|
57 | ||
|
58 | In a shell, build a standalone copy of the hg.exe program. | |
|
24 | The script requires an activated ``Visual C++ 2008`` command prompt. | |
|
25 | A shortcut to such a prompt was installed with ``Microsoft Visual C++ | |
|
26 | Compiler for Python 2.7``. From your Start Menu, look for | |
|
27 | ``Microsoft Visual C++ Compiler Package for Python 2.7`` then launch | |
|
28 | either ``Visual C++ 2008 32-bit Command Prompt`` or | |
|
29 | ``Visual C++ 2008 64-bit Command Prompt``. | |
|
59 | 30 | |
|
60 | Building instructions for MinGW: | |
|
61 | python setup.py build -c mingw32 | |
|
62 | python setup.py py2exe -b 2 | |
|
63 | Note: the previously suggested combined command of "python setup.py build -c | |
|
64 | mingw32 py2exe -b 2" doesn't work correctly anymore as it doesn't include the | |
|
65 | extensions in the mercurial subdirectory. | |
|
66 | If you want to create a file named setup.cfg with the contents: | |
|
67 | [build] | |
|
68 | compiler=mingw32 | |
|
69 | you can skip the first build step. | |
|
31 | From the prompt, change to the Mercurial source directory. e.g. | |
|
32 | ``cd c:\src\hg``. | |
|
33 | ||
|
34 | Next, invoke ``build.py`` to produce an Inno installer. You will | |
|
35 | need to supply the path to the Python interpreter to use.: | |
|
70 | 36 | |
|
71 | Building instructions with MSVC 2008 Express Edition: | |
|
72 | for 32-bit: | |
|
73 | "C:\Program Files\Microsoft Visual Studio 9.0\VC\vcvarsall.bat" x86 | |
|
74 | python setup.py py2exe -b 2 | |
|
75 | for 64-bit: | |
|
76 | "C:\Program Files\Microsoft Visual Studio 9.0\VC\vcvarsall.bat" x86_amd64 | |
|
77 | python setup.py py2exe -b 3 | |
|
37 | $ python3.exe contrib\packaging\inno\build.py \ | |
|
38 | --python c:\python27\python.exe | |
|
78 | 39 | |
|
79 | Copy add_path.exe and cacert.pem files into the dist directory that just got created. | |
|
40 | .. note:: | |
|
80 | 41 | |
|
81 | If you are using Python 2.6 or later, or if you are using MSVC 2008 to compile | |
|
82 | mercurial, you must include the C runtime libraries in the installer. To do so, | |
|
83 | install the Visual C++ 2008 redistributable package. Then in your windows\winsxs | |
|
84 | folder, locate the folder containing the dlls version 9.0.21022.8. | |
|
85 | For x86, it should be named like x86_Microsoft.VC90.CRT_(...)_9.0.21022.8(...). | |
|
86 | For x64, it should be named like amd64_Microsoft.VC90.CRT_(...)_9.0.21022.8(...). | |
|
87 | Copy the files named msvcm90.dll, msvcp90.dll and msvcr90.dll into the dist | |
|
88 | directory. | |
|
89 | Then in the windows\winsxs\manifests folder, locate the corresponding manifest | |
|
90 | file (x86_Microsoft.VC90.CRT_(...)_9.0.21022.8(...).manifest for x86, | |
|
91 | amd64_Microsoft.VC90.CRT_(...)_9.0.21022.8(...).manifest for x64), copy it in the | |
|
92 | dist directory and rename it to Microsoft.VC90.CRT.manifest. | |
|
42 | The script validates that the Visual C++ environment is | |
|
43 | active and that the architecture of the specified Python | |
|
44 | interpreter matches the Visual C++ environment and errors | |
|
45 | if not. | |
|
93 | 46 | |
|
94 | Before building the installer, you have to build Mercurial HTML documentation | |
|
95 | (or fix mercurial.iss to not reference the doc directory): | |
|
96 | ||
|
97 | cd doc | |
|
98 | mingw32-make html | |
|
99 | cd .. | |
|
47 | If everything runs as intended, dependencies will be fetched and | |
|
48 | configured into the ``build`` sub-directory, Mercurial will be built, | |
|
49 | and an installer placed in the ``dist`` sub-directory. The final | |
|
50 | line of output should print the name of the generated installer. | |
|
100 | 51 | |
|
101 | If you use ISTool, you open the C:\hg\hg-release\contrib\win32\mercurial.iss | |
|
102 | file and type Ctrl-F9 to compile the installer file. | |
|
103 | ||
|
104 | Otherwise you run the Inno Setup compiler. Assuming it's in the path | |
|
105 | you should execute: | |
|
106 | ||
|
107 | iscc contrib\win32\mercurial.iss /dVERSION=foo | |
|
52 | Additional options may be configured. Run ``build.py --help`` to | |
|
53 | see a list of program flags. | |
|
108 | 54 | |
|
109 | Where 'foo' is the version number you would like to see in the | |
|
110 | 'Add/Remove Applications' tool. The installer will be placed into | |
|
111 | a directory named Output/ at the root of your repository. | |
|
112 | If the /dVERSION=foo parameter is not given in the command line, the | |
|
113 | installer will retrieve the version information from the __version__.py file. | |
|
114 | ||
|
115 | If you want to build an installer for a 64-bit mercurial, add /dARCH=x64 to | |
|
116 | your command line: | |
|
117 | iscc contrib\win32\mercurial.iss /dARCH=x64 | |
|
55 | MinGW | |
|
56 | ===== | |
|
118 | 57 | |
|
119 | To automate the steps above you may want to create a batchfile based on the | |
|
120 | following (MinGW build chain): | |
|
121 | ||
|
122 | echo [build] > setup.cfg | |
|
123 | echo compiler=mingw32 >> setup.cfg | |
|
124 | python setup.py py2exe -b 2 | |
|
125 | cd doc | |
|
126 | mingw32-make html | |
|
127 | cd .. | |
|
128 | iscc contrib\win32\mercurial.iss /dVERSION=snapshot | |
|
129 | ||
|
130 | and run it from the root of the hg repository (c:\hg\hg-release). | |
|
58 | It is theoretically possible to generate an installer that uses | |
|
59 | MinGW. This isn't well tested and ``build.py`` may not properly | |
|
60 | support it. See old versions of this file in version control for | |
|
61 | potentially useful hints as to how to achieve this. |
|
1 | NO CONTENT: file renamed from contrib/wix/COPYING.rtf to contrib/packaging/wix/COPYING.rtf, binary diff hidden |
|
1 | NO CONTENT: file renamed from contrib/wix/contrib.wxs to contrib/packaging/wix/contrib.wxs |
|
1 | NO CONTENT: file renamed from contrib/wix/defines.wxi to contrib/packaging/wix/defines.wxi |
@@ -9,28 +9,6 b'' | |||
|
9 | 9 | <Component Id="distOutput" Guid="$(var.dist.guid)" Win64='$(var.IsX64)'> |
|
10 | 10 | <File Name="python27.dll" KeyPath="yes" /> |
|
11 | 11 | </Component> |
|
12 | <Directory Id="libdir" Name="lib" FileSource="$(var.SourceDir)/lib"> | |
|
13 | <Component Id="libOutput" Guid="$(var.lib.guid)" Win64='$(var.IsX64)'> | |
|
14 | <File Name="library.zip" KeyPath="yes" /> | |
|
15 | <File Name="mercurial.cext.base85.pyd" /> | |
|
16 | <File Name="mercurial.cext.bdiff.pyd" /> | |
|
17 | <File Name="mercurial.cext.mpatch.pyd" /> | |
|
18 | <File Name="mercurial.cext.osutil.pyd" /> | |
|
19 | <File Name="mercurial.cext.parsers.pyd" /> | |
|
20 | <File Name="mercurial.zstd.pyd" /> | |
|
21 | <File Name="hgext.fsmonitor.pywatchman.bser.pyd" /> | |
|
22 | <File Name="pyexpat.pyd" /> | |
|
23 | <File Name="bz2.pyd" /> | |
|
24 | <File Name="select.pyd" /> | |
|
25 | <File Name="unicodedata.pyd" /> | |
|
26 | <File Name="_ctypes.pyd" /> | |
|
27 | <File Name="_elementtree.pyd" /> | |
|
28 | <File Name="_testcapi.pyd" /> | |
|
29 | <File Name="_hashlib.pyd" /> | |
|
30 | <File Name="_socket.pyd" /> | |
|
31 | <File Name="_ssl.pyd" /> | |
|
32 | </Component> | |
|
33 | </Directory> | |
|
34 | 12 | </DirectoryRef> |
|
35 | 13 | </Fragment> |
|
36 | 14 |
|
1 | NO CONTENT: file renamed from contrib/wix/doc.wxs to contrib/packaging/wix/doc.wxs |
|
1 | NO CONTENT: file renamed from contrib/wix/guids.wxi to contrib/packaging/wix/guids.wxi |
|
1 | NO CONTENT: file renamed from contrib/wix/help.wxs to contrib/packaging/wix/help.wxs |
|
1 | NO CONTENT: file renamed from contrib/wix/i18n.wxs to contrib/packaging/wix/i18n.wxs |
|
1 | NO CONTENT: file renamed from contrib/wix/locale.wxs to contrib/packaging/wix/locale.wxs |
@@ -69,7 +69,7 b'' | |||
|
69 | 69 | KeyPath='yes'/> |
|
70 | 70 | </Component> |
|
71 | 71 | <Component Id='COPYING' Guid='$(var.COPYING.guid)' Win64='$(var.IsX64)'> |
|
72 | <File Id='COPYING' Name='COPYING.rtf' Source='contrib\wix\COPYING.rtf' | |
|
72 | <File Id='COPYING' Name='COPYING.rtf' Source='contrib\packaging\wix\COPYING.rtf' | |
|
73 | 73 | KeyPath='yes'/> |
|
74 | 74 | </Component> |
|
75 | 75 | |
@@ -129,6 +129,11 b'' | |||
|
129 | 129 | <MergeRef Id='VCRuntime' /> |
|
130 | 130 | <MergeRef Id='VCRuntimePolicy' /> |
|
131 | 131 | </Feature> |
|
132 | <?ifdef MercurialExtraFeatures?> | |
|
133 | <?foreach EXTRAFEAT in $(var.MercurialExtraFeatures)?> | |
|
134 | <FeatureRef Id="$(var.EXTRAFEAT)" /> | |
|
135 | <?endforeach?> | |
|
136 | <?endif?> | |
|
132 | 137 | <Feature Id='Locales' Title='Translations' Description='Translations' Level='1'> |
|
133 | 138 | <ComponentGroupRef Id='localeFolder' /> |
|
134 | 139 | <ComponentRef Id='i18nFolder' /> |
@@ -144,7 +149,7 b'' | |||
|
144 | 149 | <UIRef Id="WixUI_FeatureTree" /> |
|
145 | 150 | <UIRef Id="WixUI_ErrorProgressText" /> |
|
146 | 151 | |
|
147 | <WixVariable Id="WixUILicenseRtf" Value="contrib\wix\COPYING.rtf" /> | |
|
152 | <WixVariable Id="WixUILicenseRtf" Value="contrib\packaging\wix\COPYING.rtf" /> | |
|
148 | 153 | |
|
149 | 154 | <Icon Id="hgIcon.ico" SourceFile="contrib/win32/mercurial.ico" /> |
|
150 | 155 |
@@ -1,31 +1,71 b'' | |||
|
1 | WiX | |
|
1 | WiX Installer | |
|
2 | ============= | |
|
3 | ||
|
4 | The files in this directory are used to produce an MSI installer using | |
|
5 | the WiX Toolset (http://wixtoolset.org/). | |
|
6 | ||
|
7 | The MSI installers require elevated (admin) privileges due to the | |
|
8 | installation of MSVC CRT libraries into the Windows system store. See | |
|
9 | the Inno Setup installers in the ``inno`` sibling directory for installers | |
|
10 | that do not have this requirement. | |
|
11 | ||
|
12 | Requirements | |
|
13 | ============ | |
|
14 | ||
|
15 | Building the WiX installers requires a Windows machine. The following | |
|
16 | dependencies must be installed: | |
|
17 | ||
|
18 | * Python 2.7 (download from https://www.python.org/downloads/) | |
|
19 | * Microsoft Visual C++ Compiler for Python 2.7 | |
|
20 | (https://www.microsoft.com/en-us/download/details.aspx?id=44266) | |
|
21 | * Python 3.5+ (to run the ``build.py`` script) | |
|
22 | ||
|
23 | Building | |
|
24 | ======== | |
|
25 | ||
|
26 | The ``build.py`` script automates the process of producing an MSI | |
|
27 | installer. It manages fetching and configuring non-system dependencies | |
|
28 | (such as py2exe, gettext, and various Python packages). | |
|
29 | ||
|
30 | The script requires an activated ``Visual C++ 2008`` command prompt. | |
|
31 | A shortcut to such a prompt was installed with ``Microsoft Visual | |
|
32 | C++ Compiler for Python 2.7``. From your Start Menu, look for | |
|
33 | ``Microsoft Visual C++ Compiler Package for Python 2.7`` then | |
|
34 | launch either ``Visual C++ 2008 32-bit Command Prompt`` or | |
|
35 | ``Visual C++ 2008 64-bit Command Prompt``. | |
|
36 | ||
|
37 | From the prompt, change to the Mercurial source directory. e.g. | |
|
38 | ``cd c:\src\hg``. | |
|
39 | ||
|
40 | Next, invoke ``build.py`` to produce an MSI installer. You will need | |
|
41 | to supply the path to the Python interpreter to use.:: | |
|
42 | ||
|
43 | $ python3 contrib\packaging\wix\build.py \ | |
|
44 | --python c:\python27\python.exe | |
|
45 | ||
|
46 | .. note:: | |
|
47 | ||
|
48 | The script validates that the Visual C++ environment is active and | |
|
49 | that the architecture of the specified Python interpreter matches the | |
|
50 | Visual C++ environment. An error is raised otherwise. | |
|
51 | ||
|
52 | If everything runs as intended, dependencies will be fetched and | |
|
53 | configured into the ``build`` sub-directory, Mercurial will be built, | |
|
54 | and an installer placed in the ``dist`` sub-directory. The final line | |
|
55 | of output should print the name of the generated installer. | |
|
56 | ||
|
57 | Additional options may be configured. Run ``build.py --help`` to see | |
|
58 | a list of program flags. | |
|
59 | ||
|
60 | Relationship to TortoiseHG | |
|
2 | 61 | ========================== |
|
3 | 62 | |
|
4 | The files in this folder are used by the thg-winbuild [1] package | |
|
5 | building architecture to create a Mercurial MSI installer. These files | |
|
6 | are versioned within the Mercurial source tree because the WXS files | |
|
7 | must kept up to date with distribution changes within their branch. In | |
|
8 | other words, the default branch WXS files are expected to diverge from | |
|
9 | the stable branch WXS files. Storing them within the same repository is | |
|
10 | the only sane way to keep the source tree and the installer in sync. | |
|
11 | ||
|
12 | The MSI installer builder uses only the mercurial.ini file from the | |
|
13 | contrib/win32 folder, the contents of which have been historically used | |
|
14 | to create an InnoSetup based installer. The rest of the files there are | |
|
15 | ignored. | |
|
63 | TortoiseHG uses the WiX files in this directory. | |
|
16 | 64 | |
|
17 | The MSI packages built by thg-winbuild require elevated (admin) | |
|
18 | privileges to be installed due to the installation of MSVC CRT libraries | |
|
19 | under the C:\WINDOWS\WinSxS folder. Thus the InnoSetup installers may | |
|
20 | still be useful to some users. | |
|
65 | The code for building TortoiseHG installers lives at | |
|
66 | https://bitbucket.org/tortoisehg/thg-winbuild and is maintained by | |
|
67 | Steve Borho (steve@borho.org). | |
|
21 | 68 | |
|
22 | To build your own MSI packages, clone the thg-winbuild [1] repository | |
|
23 | and follow the README.txt [2] instructions closely. There are fewer | |
|
24 | prerequisites for a WiX [3] installer than an InnoSetup installer, but | |
|
25 | they are more specific. | |
|
26 | ||
|
27 | Direct questions or comments to Steve Borho <steve@borho.org> | |
|
28 | ||
|
29 | [1] http://bitbucket.org/tortoisehg/thg-winbuild | |
|
30 | [2] http://bitbucket.org/tortoisehg/thg-winbuild/src/tip/README.txt | |
|
31 | [3] http://wix.sourceforge.net/ | |
|
69 | When changing behavior of the WiX installer, be sure to notify | |
|
70 | the TortoiseHG Project of the changes so they have ample time to | |
|
71 | provide feedback and react to those changes. |
|
1 | NO CONTENT: file renamed from contrib/wix/templates.wxs to contrib/packaging/wix/templates.wxs |
@@ -28,9 +28,13 b'' | |||
|
28 | 28 | |
|
29 | 29 | set -euo pipefail |
|
30 | 30 | |
|
31 | printusage () { | |
|
32 | echo "usage: `basename $0` REPO NBHEADS DEPTH [left|right]" >&2 | |
|
33 | } | |
|
34 | ||
|
31 | 35 | if [ $# -lt 3 ]; then |
|
32 | echo "usage: `basename $0` REPO NBHEADS DEPTH" | |
|
33 | exit 64 | |
|
36 | printusage | |
|
37 | exit 64 | |
|
34 | 38 | fi |
|
35 | 39 | |
|
36 | 40 | repo="$1" |
@@ -42,8 +46,26 b' shift' | |||
|
42 | 46 | depth="$1" |
|
43 | 47 | shift |
|
44 | 48 | |
|
45 | leftrepo="${repo}-left" | |
|
46 | rightrepo="${repo}-right" | |
|
49 | doleft=1 | |
|
50 | doright=1 | |
|
51 | if [ $# -gt 1 ]; then | |
|
52 | printusage | |
|
53 | exit 64 | |
|
54 | elif [ $# -eq 1 ]; then | |
|
55 | if [ "$1" == "left" ]; then | |
|
56 | doleft=1 | |
|
57 | doright=0 | |
|
58 | elif [ "$1" == "right" ]; then | |
|
59 | doleft=0 | |
|
60 | doright=1 | |
|
61 | else | |
|
62 | printusage | |
|
63 | exit 64 | |
|
64 | fi | |
|
65 | fi | |
|
66 | ||
|
67 | leftrepo="${repo}-${nbheads}h-${depth}d-left" | |
|
68 | rightrepo="${repo}-${nbheads}h-${depth}d-right" | |
|
47 | 69 | |
|
48 | 70 | left="first(sort(heads(all()), 'desc'), $nbheads)" |
|
49 | 71 | right="last(sort(heads(all()), 'desc'), $nbheads)" |
@@ -51,14 +73,35 b' right="last(sort(heads(all()), \'desc\'), ' | |||
|
51 | 73 | leftsubset="ancestors($left, $depth) and only($left, heads(all() - $left))" |
|
52 | 74 | rightsubset="ancestors($right, $depth) and only($right, heads(all() - $right))" |
|
53 | 75 | |
|
54 | echo '### building left repository:' $left-repo | |
|
55 | echo '# cloning' | |
|
56 | hg clone --noupdate "${repo}" "${leftrepo}" | |
|
57 | echo '# stripping' '"'${leftsubset}'"' | |
|
58 | hg -R "${leftrepo}" --config extensions.strip= strip --rev "$leftsubset" --no-backup | |
|
76 | echo '### creating left/right repositories with missing changesets:' | |
|
77 | if [ $doleft -eq 1 ]; then | |
|
78 | echo '# left revset:' '"'${leftsubset}'"' | |
|
79 | fi | |
|
80 | if [ $doright -eq 1 ]; then | |
|
81 | echo '# right revset:' '"'${rightsubset}'"' | |
|
82 | fi | |
|
59 | 83 | |
|
60 | echo '### building right repository:' $right-repo | |
|
61 | echo '# cloning' | |
|
62 | hg clone --noupdate "${repo}" "${rightrepo}" | |
|
63 | echo '# stripping:' '"'${rightsubset}'"' | |
|
64 | hg -R "${rightrepo}" --config extensions.strip= strip --rev "$rightsubset" --no-backup | |
|
84 | buildone() { | |
|
85 | side="$1" | |
|
86 | dest="$2" | |
|
87 | revset="$3" | |
|
88 | echo "### building $side repository: $dest" | |
|
89 | if [ -e "$dest" ]; then | |
|
90 | echo "destination repo already exists: $dest" >&2 | |
|
91 | exit 1 | |
|
92 | fi | |
|
93 | echo '# cloning' | |
|
94 | if ! cp --recursive --reflink=always ${repo} ${dest}; then | |
|
95 | hg clone --noupdate "${repo}" "${dest}" | |
|
96 | fi | |
|
97 | echo '# stripping' '"'${revset}'"' | |
|
98 | hg -R "${dest}" --config extensions.strip= strip --rev "$revset" --no-backup | |
|
99 | } | |
|
100 | ||
|
101 | if [ $doleft -eq 1 ]; then | |
|
102 | buildone left "$leftrepo" "$leftsubset" | |
|
103 | fi | |
|
104 | ||
|
105 | if [ $doright -eq 1 ]; then | |
|
106 | buildone right "$rightrepo" "$rightsubset" | |
|
107 | fi |
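Given the usage string introduced above (``REPO NBHEADS DEPTH [left|right]``), a left-only invocation might look like the following; the script's own filename is not visible in this diff, so ``make-partial-repos.sh`` is only a placeholder::

    $ ./make-partial-repos.sh my-repo 2 100 left

Following the naming scheme added above, this copies ``my-repo`` (reflink when the filesystem supports it, falling back to ``hg clone``) into ``my-repo-2h-100d-left`` and strips the left-hand revset from it.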
@@ -1,5 +1,34 b'' | |||
|
1 | 1 | # perf.py - performance test routines |
|
2 | '''helper extension to measure performance''' | |
|
2 | '''helper extension to measure performance | |
|
3 | ||
|
4 | Configurations | |
|
5 | ============== | |
|
6 | ||
|
7 | ``perf`` | |
|
8 | -------- | |
|
9 | ||
|
10 | ``all-timing`` | |
|
11 | When set, additional statistics will be reported for each benchmark: best, | |
|
12 | worst, median, average. If not set, only the best timing is reported | |
|
13 | (default: off). | |
|
14 | ||
|
15 | ``presleep`` | |
|
16 | number of seconds to wait before any group of runs (default: 1) | |
|
17 | ||
|
18 | ``run-limits`` | |
|
19 | Control the number of runs each benchmark will perform. The option value | |
|
20 | should be a list of `<time>-<numberofrun>` pairs. After each run the | |
|
21 | conditions are considered in order with the following logic: | |
|
22 | ||
|
23 | If the benchmark has been running for <time> seconds, and we have performed | |
|
24 | <numberofrun> iterations, stop the benchmark. | |
|
25 | ||
|
26 | The default value is: `3.0-100, 10.0-3` | |
|
27 | ||
|
28 | ``stub`` | |
|
29 | When set, benchmarks will only be run once, useful for testing | |
|
30 | (default: off) | |
|
31 | ''' | |
|
3 | 32 | |
|
4 | 33 | # "historical portability" policy of perf.py: |
|
5 | 34 | # |
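The knobs documented in this new docstring all live in a ``[perf]`` configuration section. A hypothetical hgrc sketch (the option names and the ``<time>-<numberofrun>`` pair syntax come from the docstring above; the values themselves are illustrative only)::

    [perf]
    # report best/worst/median/average instead of only the best timing
    all-timing = yes
    # seconds to wait before each group of runs
    presleep = 1
    # stop after 0.5s once 10 runs are done, or after 5s once 3 are done
    run-limits = 0.5-10, 5.0-3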
@@ -65,6 +94,10 b' try:' | |||
|
65 | 94 | except ImportError: |
|
66 | 95 | pass |
|
67 | 96 | try: |
|
97 | from mercurial.utils import repoviewutil # since 5.0 | |
|
98 | except ImportError: | |
|
99 | repoviewutil = None | |
|
100 | try: | |
|
68 | 101 | from mercurial import scmutil # since 1.9 (or 8b252e826c68) |
|
69 | 102 | except ImportError: |
|
70 | 103 | pass |
@@ -207,6 +240,9 b' try:' | |||
|
207 | 240 | configitem(b'perf', b'all-timing', |
|
208 | 241 | default=mercurial.configitems.dynamicdefault, |
|
209 | 242 | ) |
|
243 | configitem(b'perf', b'run-limits', | |
|
244 | default=mercurial.configitems.dynamicdefault, | |
|
245 | ) | |
|
210 | 246 | except (ImportError, AttributeError): |
|
211 | 247 | pass |
|
212 | 248 | |
@@ -279,7 +315,34 b' def gettimer(ui, opts=None):' | |||
|
279 | 315 | |
|
280 | 316 | # experimental config: perf.all-timing |
|
281 | 317 | displayall = ui.configbool(b"perf", b"all-timing", False) |
|
282 | return functools.partial(_timer, fm, displayall=displayall), fm | |
|
318 | ||
|
319 | # experimental config: perf.run-limits | |
|
320 | limitspec = ui.configlist(b"perf", b"run-limits", []) | |
|
321 | limits = [] | |
|
322 | for item in limitspec: | |
|
323 | parts = item.split(b'-', 1) | |
|
324 | if len(parts) < 2: | |
|
325 | ui.warn((b'malformatted run limit entry, missing "-": %s\n' | |
|
326 | % item)) | |
|
327 | continue | |
|
328 | try: | |
|
329 | time_limit = float(pycompat.sysstr(parts[0])) | |
|
330 | except ValueError as e: | |
|
331 | ui.warn((b'malformatted run limit entry, %s: %s\n' | |
|
332 | % (pycompat.bytestr(e), item))) | |
|
333 | continue | |
|
334 | try: | |
|
335 | run_limit = int(pycompat.sysstr(parts[1])) | |
|
336 | except ValueError as e: | |
|
337 | ui.warn((b'malformatted run limit entry, %s: %s\n' | |
|
338 | % (pycompat.bytestr(e), item))) | |
|
339 | continue | |
|
340 | limits.append((time_limit, run_limit)) | |
|
341 | if not limits: | |
|
342 | limits = DEFAULTLIMITS | |
|
343 | ||
|
344 | t = functools.partial(_timer, fm, displayall=displayall, limits=limits) | |
|
345 | return t, fm | |
|
283 | 346 | |
|
284 | 347 | def stub_timer(fm, func, setup=None, title=None): |
|
285 | 348 | if setup is not None: |
@@ -297,12 +360,21 b' def timeone():' | |||
|
297 | 360 | a, b = ostart, ostop |
|
298 | 361 | r.append((cstop - cstart, b[0] - a[0], b[1]-a[1])) |
|
299 | 362 | |
|
300 | def _timer(fm, func, setup=None, title=None, displayall=False): | |
|
363 | ||
|
364 | # list of stop condition (elapsed time, minimal run count) | |
|
365 | DEFAULTLIMITS = ( | |
|
366 | (3.0, 100), | |
|
367 | (10.0, 3), | |
|
368 | ) | |
|
369 | ||
|
370 | def _timer(fm, func, setup=None, title=None, displayall=False, | |
|
371 | limits=DEFAULTLIMITS): | |
|
301 | 372 | gc.collect() |
|
302 | 373 | results = [] |
|
303 | 374 | begin = util.timer() |
|
304 | 375 | count = 0 |
|
305 | while True: | |
|
376 | keepgoing = True | |
|
377 | while keepgoing: | |
|
306 | 378 | if setup is not None: |
|
307 | 379 | setup() |
|
308 | 380 | with timeone() as item: |
@@ -310,10 +382,12 b' def _timer(fm, func, setup=None, title=N' | |||
|
310 | 382 | count += 1 |
|
311 | 383 | results.append(item[0]) |
|
312 | 384 | cstop = util.timer() |
|
313 | if cstop - begin > 3 and count >= 100: | |
|
314 | break | |
|
315 | if cstop - begin > 10 and count >= 3: | |
|
316 | break | |
|
385 | # Look for a stop condition. | |
|
386 | elapsed = cstop - begin | |
|
387 | for t, mincount in limits: | |
|
388 | if elapsed >= t and count >= mincount: | |
|
389 | keepgoing = False | |
|
390 | break | |
|
317 | 391 | |
|
318 | 392 | formatone(fm, results, title=title, result=r, |
|
319 | 393 | displayall=displayall) |
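The rewritten loop above generalizes two hard-coded exit tests into a scan over ``(time, mincount)`` pairs. A minimal standalone sketch of that stop condition, reusing the ``DEFAULTLIMITS`` value from this patch (the ``should_stop`` helper is illustrative and not part of perf.py)::

    DEFAULTLIMITS = (
        (3.0, 100),
        (10.0, 3),
    )

    def should_stop(elapsed, count, limits=DEFAULTLIMITS):
        # mirrors the "elapsed >= t and count >= mincount" test in _timer()
        return any(elapsed >= t and count >= mincount
                   for t, mincount in limits)

    # a benchmark taking ~0.5s per run is not stopped at the 3s mark
    # (only 7 runs < 100), but is stopped once 10s and >= 3 runs are reached
    assert not should_stop(3.5, 7)
    assert should_stop(10.0, 20)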
@@ -401,7 +475,8 b' def getbranchmapsubsettable():' | |||
|
401 | 475 | # subsettable is defined in: |
|
402 | 476 | # - branchmap since 2.9 (or 175c6fd8cacc) |
|
403 | 477 | # - repoview since 2.5 (or 59a9f18d4587) |
|
404 | for mod in (branchmap, repoview): | |
|
478 | # - repoviewutil since 5.0 | |
|
479 | for mod in (branchmap, repoview, repoviewutil): | |
|
405 | 480 | subsettable = getattr(mod, 'subsettable', None) |
|
406 | 481 | if subsettable: |
|
407 | 482 | return subsettable |
@@ -519,7 +594,11 b' def perfaddremove(ui, repo, **opts):' | |||
|
519 | 594 | repo.ui.quiet = True |
|
520 | 595 | matcher = scmutil.match(repo[None]) |
|
521 | 596 | opts[b'dry_run'] = True |
|
522 | timer(lambda: scmutil.addremove(repo, matcher, b"", opts)) | |
|
597 | if b'uipathfn' in getargspec(scmutil.addremove).args: | |
|
598 | uipathfn = scmutil.getuipathfn(repo) | |
|
599 | timer(lambda: scmutil.addremove(repo, matcher, b"", uipathfn, opts)) | |
|
600 | else: | |
|
601 | timer(lambda: scmutil.addremove(repo, matcher, b"", opts)) | |
|
523 | 602 | finally: |
|
524 | 603 | repo.ui.quiet = oldquiet |
|
525 | 604 | fm.end() |
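The ``uipathfn`` probe above is one instance of perf.py's signature-sniffing approach to supporting several Mercurial releases at once: inspect the callee's argument list, then build the call accordingly. A minimal sketch of the same idea using only the standard library (``uipathfn`` is the probed parameter name as above; the ``dispatch`` helper is hypothetical)::

    import inspect

    def dispatch(func, *args):
        # call func with or without the extra argument, depending on
        # whether its signature declares one
        if 'uipathfn' in inspect.signature(func).parameters:
            return func(*args, uipathfn=lambda p: p)
        return func(*args)

perf.py reaches the same information through the ``getargspec`` call visible above, which also works on older Python versions.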
@@ -535,13 +614,15 b' def clearcaches(cl):' | |||
|
535 | 614 | |
|
536 | 615 | @command(b'perfheads', formatteropts) |
|
537 | 616 | def perfheads(ui, repo, **opts): |
|
617 | """benchmark the computation of changelog heads""" | |
|
538 | 618 | opts = _byteskwargs(opts) |
|
539 | 619 | timer, fm = gettimer(ui, opts) |
|
540 | 620 | cl = repo.changelog |
|
621 | def s(): | |
|
622 | clearcaches(cl) | |
|
541 | 623 | def d(): |
|
542 | 624 | len(cl.headrevs()) |
|
543 | clearcaches(cl) | |
|
544 | timer(d) | |
|
625 | timer(d, setup=s) | |
|
545 | 626 | fm.end() |
|
546 | 627 | |
|
547 | 628 | @command(b'perftags', formatteropts+ |
@@ -911,9 +992,7 b' def perfphasesremote(ui, repo, dest=None' | |||
|
911 | 992 | raise error.Abort((b'default repository not configured!'), |
|
912 | 993 | hint=(b"see 'hg help config.paths'")) |
|
913 | 994 | dest = path.pushloc or path.loc |
|
914 | branches = (path.branch, opts.get(b'branch') or []) | |
|
915 | 995 | ui.status((b'analysing phase of %s\n') % util.hidepassword(dest)) |
|
916 | revs, checkout = hg.addbranchrevs(repo, repo, branches, opts.get(b'rev')) | |
|
917 | 996 | other = hg.peer(repo, opts, dest) |
|
918 | 997 | |
|
919 | 998 | # easier to perform discovery through the operation |
@@ -1014,18 +1093,44 b' def perfignore(ui, repo, **opts):' | |||
|
1014 | 1093 | fm.end() |
|
1015 | 1094 | |
|
1016 | 1095 | @command(b'perfindex', [ |
|
1017 | (b'', b'rev', | |
|
1096 | (b'', b'rev', [], b'revision to be looked up (default tip)'), | |
|
1097 | (b'', b'no-lookup', None, b'do not revision lookup post creation'), | |
|
1018 | 1098 | ] + formatteropts) |
|
1019 | 1099 | def perfindex(ui, repo, **opts): |
|
1100 | """benchmark index creation time followed by a lookup | |
|
1101 | ||
|
1102 | The default is to look `tip` up. Depending on the index implementation, | |
|
1103 | the revision looked up can matter. For example, an implementation | |
|
1104 | scanning the index will have a faster lookup time for `--rev tip` than for | |
|
1105 | `--rev 0`. The number of looked up revisions and their order can also | |
|
1106 | matter. | |
|
1107 | ||
|
1108 | Examples of useful sets to test: | |
|
1109 | * tip | |
|
1110 | * 0 | |
|
1111 | * -10: | |
|
1112 | * :10 | |
|
1113 | * -10: + :10 | |
|
1114 | * :10: + -10: | |
|
1115 | * -10000: | |
|
1116 | * -10000: + 0 | |
|
1117 | ||
|
1118 | It is not currently possible to check for lookup of a missing node. For | |
|
1119 | deeper lookup benchmarking, check out the `perfnodemap` command.""" | |
|
1020 | 1120 | import mercurial.revlog |
|
1021 | 1121 | opts = _byteskwargs(opts) |
|
1022 | 1122 | timer, fm = gettimer(ui, opts) |
|
1023 | 1123 | mercurial.revlog._prereadsize = 2**24 # disable lazy parser in old hg |
|
1024 | if opts[b'rev'] is None: | 

1025 | n = repo[b"tip"].node() | |
|
1124 | if opts[b'no_lookup']: | |
|
1125 | if opts[b'rev']: | 
|
1126 | raise error.Abort(b'--no-lookup and --rev are mutually exclusive') | 
|
1127 | nodes = [] | |
|
1128 | elif not opts[b'rev']: | |
|
1129 | nodes = [repo[b"tip"].node()] | |
|
1026 | 1130 | else: |
|
1027 | rev = scmutil.revsingle(repo, opts[b'rev']) | 

1028 | n = repo[rev].node() | 

1131 | revs = scmutil.revrange(repo, opts[b'rev']) | |
|
1132 | cl = repo.changelog | |
|
1133 | nodes = [cl.node(r) for r in revs] | |
|
1029 | 1134 | |
|
1030 | 1135 | unfi = repo.unfiltered() |
|
1031 | 1136 | # find the filecache func directly |
@@ -1036,7 +1141,67 b' def perfindex(ui, repo, **opts):' | |||
|
1036 | 1141 | clearchangelog(unfi) |
|
1037 | 1142 | def d(): |
|
1038 | 1143 | cl = makecl(unfi) |
|
1039 | cl.rev(n) | |
|
1144 | for n in nodes: | |
|
1145 | cl.rev(n) | |
|
1146 | timer(d, setup=setup) | |
|
1147 | fm.end() | |
|
1148 | ||
|
1149 | @command(b'perfnodemap', [ | |
|
1150 | (b'', b'rev', [], b'revision to be looked up (default tip)'), | |
|
1151 | (b'', b'clear-caches', True, b'clear revlog cache between calls'), | |
|
1152 | ] + formatteropts) | |
|
1153 | def perfnodemap(ui, repo, **opts): | |
|
1154 | """benchmark the time necessary to look up revision from a cold nodemap | |
|
1155 | ||
|
1156 | Depending on the implementation, the number and order of revisions we look | 

1157 | up can vary. Examples of useful sets to test: | 
|
1158 | * tip | |
|
1159 | * 0 | |
|
1160 | * -10: | |
|
1161 | * :10 | |
|
1162 | * -10: + :10 | |
|
1163 | * :10: + -10: | |
|
1164 | * -10000: | |
|
1165 | * -10000: + 0 | |
|
1166 | ||
|
1167 | The command currently focuses on valid binary lookup. Benchmarking | 

1168 | hex lookup, prefix lookup, and missing-node lookup would also be valuable. | 
|
1169 | """ | |
|
1170 | import mercurial.revlog | |
|
1171 | opts = _byteskwargs(opts) | |
|
1172 | timer, fm = gettimer(ui, opts) | |
|
1173 | mercurial.revlog._prereadsize = 2**24 # disable lazy parser in old hg | |
|
1174 | ||
|
1175 | unfi = repo.unfiltered() | |
|
1176 | clearcaches = opts[b'clear_caches'] | 
|
1177 | # find the filecache func directly | |
|
1178 | # This avoids polluting the benchmark with the filecache logic | 
|
1179 | makecl = unfi.__class__.changelog.func | |
|
1180 | if not opts[b'rev']: | |
|
1181 | raise error.Abort(b'use --rev to specify revisions to look up') | 
|
1182 | revs = scmutil.revrange(repo, opts[b'rev']) | |
|
1183 | cl = repo.changelog | |
|
1184 | nodes = [cl.node(r) for r in revs] | |
|
1185 | ||
|
1186 | # use a list to pass reference to a nodemap from one closure to the next | |
|
1187 | nodeget = [None] | |
|
1188 | def setnodeget(): | |
|
1189 | # probably not necessary, but for good measure | |
|
1190 | clearchangelog(unfi) | |
|
1191 | nodeget[0] = makecl(unfi).nodemap.get | |
|
1192 | ||
|
1193 | def d(): | |
|
1194 | get = nodeget[0] | |
|
1195 | for n in nodes: | |
|
1196 | get(n) | |
|
1197 | ||
|
1198 | setup = None | |
|
1199 | if clearcaches: | |
|
1200 | def setup(): | |
|
1201 | setnodeget() | |
|
1202 | else: | |
|
1203 | setnodeget() | |
|
1204 | d() # prewarm the data structure | |
|
1040 | 1205 | timer(d, setup=setup) |
|
1041 | 1206 | fm.end() |
|
1042 | 1207 | |
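
The one-element ``nodeget`` list above shares mutable state between the setup
and benchmark closures without ``nonlocal``, which is unavailable on the old
Python versions perf.py still targets. A minimal sketch of the idiom, with
hypothetical names::

    holder = [None]

    def setup():
        # Stands in for rebuilding an expensive mapping from cold caches.
        holder[0] = {b'node-%d' % i: i for i in range(1000)}

    def run():
        get = holder[0].get
        get(b'node-42')
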
@@ -1056,6 +1221,13 b' def perfstartup(ui, repo, **opts):' | |||
|
1056 | 1221 | |
|
1057 | 1222 | @command(b'perfparents', formatteropts) |
|
1058 | 1223 | def perfparents(ui, repo, **opts): |
|
1224 | """benchmark the time necessary to fetch one changeset's parents. | |
|
1225 | ||
|
1226 | The fetch is done using the `node identifier`, traversing all object layers | |
|
1227 | from the repository object. The first N revisions will be used for this | |
|
1228 | benchmark. N is controlled by the ``perf.parentscount`` config option | |
|
1229 | (default: 1000). | |
|
1230 | """ | |
|
1059 | 1231 | opts = _byteskwargs(opts) |
|
1060 | 1232 | timer, fm = gettimer(ui, opts) |
|
1061 | 1233 | # control the number of commits perfparents iterates over |
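 
``perf.parentscount`` is read from the normal configuration sources, so the
number of revisions exercised can be changed in an ``hgrc`` (the value below
is illustrative only)::

    [perf]
    parentscount = 5000
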
@@ -2290,13 +2462,18 b' def perfbranchmap(ui, repo, *filternames' | |||
|
2290 | 2462 | view = repo |
|
2291 | 2463 | else: |
|
2292 | 2464 | view = repo.filtered(filtername) |
|
2465 | if util.safehasattr(view._branchcaches, '_per_filter'): | |
|
2466 | filtered = view._branchcaches._per_filter | |
|
2467 | else: | |
|
2468 | # older versions | |
|
2469 | filtered = view._branchcaches | |
|
2293 | 2470 | def d(): |
|
2294 | 2471 | if clear_revbranch: |
|
2295 | 2472 | repo.revbranchcache()._clear() |
|
2296 | 2473 | if full: |
|
2297 | 2474 | view._branchcaches.clear() |
|
2298 | 2475 | else: |
|
2299 | view._branchcaches.pop(filtername, None) | 

2476 | filtered.pop(filtername, None) | |
|
2300 | 2477 | view.branchmap() |
|
2301 | 2478 | return d |
|
2302 | 2479 | # add filter in smaller subset to bigger subset |
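
The ``safehasattr()`` probe above is the usual shape of perf.py's
version-compatibility shims. A minimal sketch of the idea (hypothetical
function name)::

    def branch_cache_mapping(view):
        # Mercurial 5.0 keeps per-filter branch caches behind
        # ``_branchcaches._per_filter``; older versions expose the mapping
        # directly on ``_branchcaches``.
        caches = view._branchcaches
        if hasattr(caches, '_per_filter'):
            return caches._per_filter
        return caches
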
@@ -2323,10 +2500,15 b' def perfbranchmap(ui, repo, *filternames' | |||
|
2323 | 2500 | # add unfiltered |
|
2324 | 2501 | allfilters.append(None) |
|
2325 | 2502 | |
|
2326 | branchcacheread = safeattrsetter(branchmap, b'read') | |
|
2503 | if util.safehasattr(branchmap.branchcache, 'fromfile'): | |
|
2504 | branchcacheread = safeattrsetter(branchmap.branchcache, b'fromfile') | |
|
2505 | branchcacheread.set(classmethod(lambda *args: None)) | |
|
2506 | else: | |
|
2507 | # older versions | |
|
2508 | branchcacheread = safeattrsetter(branchmap, b'read') | |
|
2509 | branchcacheread.set(lambda *args: None) | |
|
2327 | 2510 | branchcachewrite = safeattrsetter(branchmap.branchcache, b'write') |
|
2328 | branchcacheread.set(lambda repo: None) | 

2329 | branchcachewrite.set(lambda bc, repo: None) | |
|
2511 | branchcachewrite.set(lambda *args: None) | |
|
2330 | 2512 | try: |
|
2331 | 2513 | for name in allfilters: |
|
2332 | 2514 | printname = name |
@@ -2470,9 +2652,15 b' def perfbranchmapload(ui, repo, filter=b' | |||
|
2470 | 2652 | |
|
2471 | 2653 | repo.branchmap() # make sure we have a relevant, up to date branchmap |
|
2472 | 2654 | |
|
2655 | try: | |
|
2656 | fromfile = branchmap.branchcache.fromfile | |
|
2657 | except AttributeError: | |
|
2658 | # older versions | |
|
2659 | fromfile = branchmap.read | |
|
2660 | ||
|
2473 | 2661 | currentfilter = filter |
|
2474 | 2662 | # try once without timer, the filter may not be cached |
|
2475 | while branchmap.read(repo) is None: | 

2663 | while fromfile(repo) is None: | |
|
2476 | 2664 | currentfilter = subsettable.get(currentfilter) |
|
2477 | 2665 | if currentfilter is None: |
|
2478 | 2666 | raise error.Abort(b'No branchmap cached for %s repo' |
@@ -2483,7 +2671,7 b' def perfbranchmapload(ui, repo, filter=b' | |||
|
2483 | 2671 | if clearrevlogs: |
|
2484 | 2672 | clearchangelog(repo) |
|
2485 | 2673 | def bench(): |
|
2486 | branchmap.read(repo) | 

2674 | fromfile(repo) | |
|
2487 | 2675 | timer(bench, setup=setup) |
|
2488 | 2676 | fm.end() |
|
2489 | 2677 |
@@ -5,6 +5,5 b' graft tests' | |||
|
5 | 5 | include make_cffi.py |
|
6 | 6 | include setup_zstd.py |
|
7 | 7 | include zstd.c |
|
8 | include zstd_cffi.py | |
|
9 | 8 | include LICENSE |
|
10 | 9 | include NEWS.rst |
@@ -8,8 +8,18 b' 1.0.0 (not yet released)' | |||
|
8 | 8 | Actions Blocking Release |
|
9 | 9 | ------------------------ |
|
10 | 10 | |
|
11 | * compression and decompression APIs that support ``io.rawIOBase`` interface | 

11 | * compression and decompression APIs that support ``io.RawIOBase`` interface | |
|
12 | 12 | (#13). |
|
13 | * ``stream_writer()`` APIs should support ``io.RawIOBase`` interface. | |
|
14 | * Properly handle non-blocking I/O and partial writes for objects implementing | |
|
15 | ``io.RawIOBase``. | |
|
16 | * Make ``write_return_read=True`` the default for objects implementing | |
|
17 | ``io.RawIOBase``. | |
|
18 | * Audit for consistent and proper behavior of ``flush()`` and ``close()`` for | |
|
19 | all objects implementing ``io.RawIOBase``. Is calling ``close()`` on | |
|
20 | wrapped stream acceptable, should ``__exit__`` always call ``close()``, | |
|
21 | should ``close()`` imply ``flush()``, etc. | |
|
22 | * Consider making reads across frames configurable behavior. | |
|
13 | 23 | * Refactor module names so C and CFFI extensions live under ``zstandard`` |
|
14 | 24 | package. |
|
15 | 25 | * Overall API design review. |
@@ -43,6 +53,11 b' Actions Blocking Release' | |||
|
43 | 53 | * Consider a ``chunker()`` API for decompression. |
|
44 | 54 | * Consider stats for ``chunker()`` API, including finding the last consumed |
|
45 | 55 | offset of input data. |
|
56 | * Consider exposing ``ZSTD_cParam_getBounds()`` and | |
|
57 | ``ZSTD_dParam_getBounds()`` APIs. | |
|
58 | * Consider controls over resetting compression contexts (session only, parameters, | |
|
59 | or session and parameters). | |
|
60 | * Actually use the CFFI backend in fuzzing tests. | |
|
46 | 61 | |
|
47 | 62 | Other Actions Not Blocking Release |
|
48 | 63 | --------------------------------------- |
@@ -51,6 +66,207 b' Other Actions Not Blocking Release' | |||
|
51 | 66 | * API for ensuring max memory ceiling isn't exceeded. |
|
52 | 67 | * Move off nose for testing. |
|
53 | 68 | |
|
69 | 0.11.0 (released 2019-02-24) | |
|
70 | ============================ | |
|
71 | ||
|
72 | Backwards Compatibility Notes | 
|
73 | ----------------------------- | |
|
74 | ||
|
75 | * ``ZstdDecompressor.read()`` now allows reading sizes of ``-1`` or ``0`` | |
|
76 | and defaults to ``-1``, per the documented behavior of | |
|
77 | ``io.RawIOBase.read()``. Previously, we required an argument that was | |
|
78 | a positive value. | |
|
79 | * The ``readline()``, ``readlines()``, ``__iter__``, and ``__next__`` methods | |
|
80 | of ``ZstdDecompressionReader()`` now raise ``io.UnsupportedOperation`` | |
|
81 | instead of ``NotImplementedError``. | |
|
82 | * ``ZstdDecompressor.stream_reader()`` now accepts a ``read_across_frames`` | |
|
83 | argument. The default value will likely be changed in a future release | |
|
84 | and consumers are advised to pass the argument to avoid unwanted change | |
|
85 | of behavior in the future. | |
|
86 | * ``setup.py`` now always disables the CFFI backend if the installed | |
|
87 | CFFI package does not meet the minimum version requirements. Before, it was | |
|
88 | possible for the CFFI backend to be generated and a run-time error to | |
|
89 | occur. | |
|
90 | * In the CFFI backend, ``CompressionReader`` and ``DecompressionReader`` | |
|
91 | were renamed to ``ZstdCompressionReader`` and ``ZstdDecompressionReader``, | |
|
92 | respectively so naming is identical to the C extension. This should have | |
|
93 | no meaningful end-user impact, as instances aren't meant to be | |
|
94 | constructed directly. | |
|
95 | * ``ZstdDecompressor.stream_writer()`` now accepts a ``write_return_read`` | |
|
96 | argument to control whether ``write()`` returns the number of bytes | |
|
97 | read from the source / written to the decompressor. It defaults to off, | |
|
98 | which preserves the existing behavior of returning the number of bytes | |
|
99 | emitted from the decompressor. The default will change in a future release | |
|
100 | so behavior aligns with the specified behavior of ``io.RawIOBase``. | |
|
101 | * ``ZstdDecompressionWriter.__exit__`` now calls ``self.close()``. This | |
|
102 | will result in that stream plus the underlying stream being closed as | |
|
103 | well. If this behavior is not desirable, do not use instances as | |
|
104 | context managers. | |
|
105 | * ``ZstdCompressor.stream_writer()`` now accepts a ``write_return_read`` | |
|
106 | argument to control whether ``write()`` returns the number of bytes read | |
|
107 | from the source / written to the compressor. It defaults to off, which | |
|
108 | preserves the existing behavior of returning the number of bytes emitted | |
|
109 | from the compressor. The default will change in a future release so | |
|
110 | behavior aligns with the specified behavior of ``io.RawIOBase``. | |
|
111 | * ``ZstdCompressionWriter.__exit__`` now calls ``self.close()``. This will | |
|
112 | result in that stream plus any underlying stream being closed as well. If | |
|
113 | this behavior is not desirable, do not use instances as context managers. | |
|
114 | * ``ZstdDecompressionWriter`` no longer requires being used as a context | |
|
115 | manager (#57). | |
|
116 | * ``ZstdCompressionWriter`` no longer requires being used as a context | |
|
117 | manager (#57). | |
|
118 | * The ``overlap_size_log`` attribute on ``CompressionParameters`` instances | |
|
119 | has been deprecated and will be removed in a future release. The | |
|
120 | ``overlap_log`` attribute should be used instead. | |
|
121 | * The ``overlap_size_log`` argument to ``CompressionParameters`` has been | |
|
122 | deprecated and will be removed in a future release. The ``overlap_log`` | |
|
123 | argument should be used instead. | |
|
124 | * The ``ldm_hash_every_log`` attribute on ``CompressionParameters`` instances | |
|
125 | has been deprecated and will be removed in a future release. The | |
|
126 | ``ldm_hash_rate_log`` attribute should be used instead. | |
|
127 | * The ``ldm_hash_every_log`` argument to ``CompressionParameters`` has been | |
|
128 | deprecated and will be removed in a future release. The ``ldm_hash_rate_log`` | |
|
129 | argument should be used instead. | |
|
130 | * The ``compression_strategy`` argument to ``CompressionParameters`` has been | |
|
131 | deprecated and will be removed in a future release. The ``strategy`` | |
|
132 | argument should be used instead. | |
|
133 | * The ``SEARCHLENGTH_MIN`` and ``SEARCHLENGTH_MAX`` constants are deprecated | |
|
134 | and will be removed in a future release. Use ``MINMATCH_MIN`` and | |
|
135 | ``MINMATCH_MAX`` instead. | |
|
136 | * The ``zstd_cffi`` module has been renamed to ``zstandard.cffi``. As had | |
|
137 | been documented in the ``README`` file since the ``0.9.0`` release, the | |
|
138 | module should not be imported directly at its new location. Instead, | |
|
139 | ``import zstandard`` to cause an appropriate backend module to be loaded | |
|
140 | automatically. | |
|
141 | ||
|
142 | Bug Fixes | |
|
143 | --------- | |
|
144 | ||
|
145 | * CFFI backend could encounter a failure when sending an empty chunk into | |
|
146 | ``ZstdDecompressionObj.decompress()``. The issue has been fixed. | |
|
147 | * CFFI backend could encounter an error when calling | |
|
148 | ``ZstdDecompressionReader.read()`` if there was data remaining in an | |
|
149 | internal buffer. The issue has been fixed. (#71) | |
|
150 | ||
|
151 | Changes | |
|
152 | ------- | |
|
153 | ||
|
154 | * ``ZstdDecompressionObj.decompress()`` now properly handles empty inputs in | 
|
155 | the CFFI backend. | |
|
156 | * ``ZstdCompressionReader`` now implements ``read1()`` and ``readinto1()``. | |
|
157 | These are part of the ``io.BufferedIOBase`` interface. | |
|
158 | * ``ZstdCompressionReader`` has gained a ``readinto(b)`` method for reading | |
|
159 | compressed output into an existing buffer. | |
|
160 | * ``ZstdCompressionReader.read()`` now defaults to ``size=-1`` and accepts | |
|
161 | read sizes of ``-1`` and ``0``. The new behavior aligns with the documented | |
|
162 | behavior of ``io.RawIOBase``. | |
|
163 | * ``ZstdCompressionReader`` now implements ``readall()``. Previously, this | |
|
164 | method raised ``NotImplementedError``. | |
|
165 | * ``ZstdDecompressionReader`` now implements ``read1()`` and ``readinto1()``. | |
|
166 | These are part of the ``io.BufferedIOBase`` interface. | |
|
167 | * ``ZstdDecompressionReader.read()`` now defaults to ``size=-1`` and accepts | |
|
168 | read sizes of ``-1`` and ``0``. The new behavior aligns with the documented | |
|
169 | behavior of ``io.RawIOBase``. | |
|
170 | * ``ZstdDecompressionReader()`` now implements ``readall()``. Previously, this | |
|
171 | method raised ``NotImplementedError``. | |
|
172 | * The ``readline()``, ``readlines()``, ``__iter__``, and ``__next__`` methods | |
|
173 | of ``ZstdDecompressionReader()`` now raise ``io.UnsupportedOperation`` | |
|
174 | instead of ``NotImplementedError``. This reflects a decision to never | |
|
175 | implement text-based I/O on (de)compressors and keep the low-level API | |
|
176 | operating in the binary domain. (#13) | |
|
177 | * ``README.rst`` now documents how to achieve linewise iteration using | 
|
178 | an ``io.TextIOWrapper`` with a ``ZstdDecompressionReader``. | |
|
179 | * ``ZstdDecompressionReader`` has gained a ``readinto(b)`` method for | |
|
180 | reading decompressed output into an existing buffer. This allows chaining | |
|
181 | to an ``io.TextIOWrapper`` on Python 3 without using an ``io.BufferedReader``. | |
|
182 | * ``ZstdDecompressor.stream_reader()`` now accepts a ``read_across_frames`` | |
|
183 | argument to control behavior when the input data has multiple zstd | |
|
184 | *frames*. When ``False`` (the default for backwards compatibility), a | |
|
185 | ``read()`` will stop when the end of a zstd *frame* is encountered. When | |
|
186 | ``True``, ``read()`` can potentially return data spanning multiple zstd | |
|
187 | *frames*. The default will likely be changed to ``True`` in a future | |
|
188 | release. | |
|
189 | * ``setup.py`` now performs CFFI version sniffing and disables the CFFI | |
|
190 | backend if CFFI is too old. Previously, we only used ``install_requires`` | |
|
191 | to enforce the CFFI version and not all build modes would properly enforce | |
|
192 | the minimum CFFI version. (#69) | |
|
193 | * CFFI's ``ZstdDecompressionReader.read()`` now properly handles data | |
|
194 | remaining in any internal buffer. Before, repeated ``read()`` could | |
|
195 | result in *random* errors. (#71) | |
|
196 | * Upgraded various Python packages in CI environment. | |
|
197 | * Upgrade to hypothesis 4.5.11. | |
|
198 | * In the CFFI backend, ``CompressionReader`` and ``DecompressionReader`` | |
|
199 | were renamed to ``ZstdCompressionReader`` and ``ZstdDecompressionReader``, | |
|
200 | respectively. | |
|
201 | * ``ZstdDecompressor.stream_writer()`` now accepts a ``write_return_read`` | |
|
202 | argument to control whether ``write()`` returns the number of bytes read | |
|
203 | from the source. It defaults to ``False`` to preserve backwards | |
|
204 | compatibility. | |
|
205 | * ``ZstdDecompressor.stream_writer()`` now implements the ``io.RawIOBase`` | |
|
206 | interface and behaves as a proper stream object. | |
|
207 | * ``ZstdCompressor.stream_writer()`` now accepts a ``write_return_read`` | |
|
208 | argument to control whether ``write()`` returns the number of bytes read | |
|
209 | from the source. It defaults to ``False`` to preserve backwards | |
|
210 | compatibility. | |
|
211 | * ``ZstdCompressionWriter`` now implements the ``io.RawIOBase`` interface and | |
|
212 | behaves as a proper stream object. ``close()`` will now close the stream | |
|
213 | and the underlying stream (if possible). ``__exit__`` will now call | |
|
214 | ``close()``. Methods like ``writable()`` and ``fileno()`` are implemented. | |
|
215 | * ``ZstdDecompressionWriter`` no longer must be used as a context manager. | |
|
216 | * ``ZstdCompressionWriter`` no longer must be used as a context manager. | |
|
217 | When not using as a context manager, it is important to call | |
|
218 | ``flush(FLUSH_FRAME)`` or the compression stream won't be properly | 
|
219 | terminated and decoders may complain about malformed input. | |
|
220 | * ``ZstdCompressionWriter.flush()`` (what is returned from | |
|
221 | ``ZstdCompressor.stream_writer()``) now accepts an argument controlling the | |
|
222 | flush behavior. Its value can be one of the new constants | |
|
223 | ``FLUSH_BLOCK`` or ``FLUSH_FRAME``. | |
|
224 | * ``ZstdDecompressionObj`` instances now have a ``flush([length=None])`` method. | |
|
225 | This provides parity with standard library equivalent types. (#65) | |
|
226 | * ``CompressionParameters`` no longer redundantly store individual compression | |
|
227 | parameters on each instance. Instead, compression parameters are stored inside | |
|
228 | the underlying ``ZSTD_CCtx_params`` instance. Attributes for obtaining | |
|
229 | parameters are now properties rather than instance variables. | |
|
230 | * Exposed the ``STRATEGY_BTULTRA2`` constant. | |
|
231 | * ``CompressionParameters`` instances now expose an ``overlap_log`` attribute. | |
|
232 | This behaves identically to the ``overlap_size_log`` attribute. | |
|
233 | * ``CompressionParameters()`` now accepts an ``overlap_log`` argument that | |
|
234 | behaves identically to the ``overlap_size_log`` argument. An error will be | |
|
235 | raised if both arguments are specified. | |
|
236 | * ``CompressionParameters`` instances now expose an ``ldm_hash_rate_log`` | |
|
237 | attribute. This behaves identically to the ``ldm_hash_every_log`` attribute. | |
|
238 | * ``CompressionParameters()`` now accepts a ``ldm_hash_rate_log`` argument that | |
|
239 | behaves identically to the ``ldm_hash_every_log`` argument. An error will be | |
|
240 | raised if both arguments are specified. | |
|
241 | * ``CompressionParameters()`` now accepts a ``strategy`` argument that behaves | |
|
242 | identically to the ``compression_strategy`` argument. An error will be raised | |
|
243 | if both arguments are specified. | |
|
244 | * The ``MINMATCH_MIN`` and ``MINMATCH_MAX`` constants were added. They are | |
|
245 | semantically equivalent to the old ``SEARCHLENGTH_MIN`` and | |
|
246 | ``SEARCHLENGTH_MAX`` constants. | |
|
247 | * Bundled zstandard library upgraded from 1.3.7 to 1.3.8. | |
|
248 | * ``setup.py`` denotes support for Python 3.7 (Python 3.7 was supported and | |
|
249 | tested in the 0.10 release). | |
|
250 | * ``zstd_cffi`` module has been renamed to ``zstandard.cffi``. | |
|
251 | * ``ZstdCompressor.stream_writer()`` now reuses a buffer in order to avoid | |
|
252 | allocating a new buffer for every operation. This should result in faster | |
|
253 | performance in cases where ``write()`` or ``flush()`` are being called | |
|
254 | frequently. (#62) | |
|
255 | * Bundled zstandard library upgraded from 1.3.6 to 1.3.7. | |
|
256 | ||
|
257 | 0.10.2 (released 2018-11-03) | |
|
258 | ============================ | |
|
259 | ||
|
260 | Bug Fixes | |
|
261 | --------- | |
|
262 | ||
|
263 | * ``zstd_cffi.py`` added to ``setup.py`` (#60). | |
|
264 | ||
|
265 | Changes | |
|
266 | ------- | |
|
267 | ||
|
268 | * Change some integer casts to avoid ``ssize_t`` (#61). | |
|
269 | ||
|
54 | 270 | 0.10.1 (released 2018-10-08) |
|
55 | 271 | ============================ |
|
56 | 272 |
@@ -20,9 +20,9 b' https://github.com/indygreg/python-zstan' | |||
|
20 | 20 | Requirements |
|
21 | 21 | ============ |
|
22 | 22 | |
|
23 | This extension is designed to run with Python 2.7, 3.4, 3.5, and 3.6 | 

24 | on common platforms (Linux, Windows, and OS X). x86 and x86_64 are well-tested | 

25 | on Windows. Only x86_64 is well-tested on Linux and macOS. | |
|
23 | This extension is designed to run with Python 2.7, 3.4, 3.5, 3.6, and 3.7 | |
|
24 | on common platforms (Linux, Windows, and OS X). On PyPy (both PyPy2 and PyPy3) we support version 6.0.0 and above. | |
|
25 | x86 and x86_64 are well-tested on Windows. Only x86_64 is well-tested on Linux and macOS. | |
|
26 | 26 | |
|
27 | 27 | Installing |
|
28 | 28 | ========== |
@@ -215,7 +215,7 b' Instances can also be used as context ma' | |||
|
215 | 215 | |
|
216 | 216 | # Do something with compressed chunk. |
|
217 | 217 | |
|
218 | When the context manager exits, the stream is closed, | 

218 | When the context manager exits or ``close()`` is called, the stream is closed, | |
|
219 | 219 | underlying resources are released, and future operations against the compression |
|
220 | 220 | stream will fail. |
|
221 | 221 | |
@@ -251,8 +251,54 b' emitted so far.' | |||
|
251 | 251 | Streaming Input API |
|
252 | 252 | ^^^^^^^^^^^^^^^^^^^ |
|
253 | 253 | |
|
254 | ``stream_writer(fh)`` (which behaves as a context manager) allows you to *stream* | |
|
255 | data into a compressor.:: | |
|
254 | ``stream_writer(fh)`` allows you to *stream* data into a compressor. | |
|
255 | ||
|
256 | Returned instances implement the ``io.RawIOBase`` interface. Only methods | |
|
257 | that involve writing will do useful things. | |
|
258 | ||
|
259 | The argument to ``stream_writer()`` must have a ``write(data)`` method. As | |
|
260 | compressed data is available, ``write()`` will be called with the compressed | |
|
261 | data as its argument. Many common Python types implement ``write()``, including | |
|
262 | open file handles and ``io.BytesIO``. | |
|
263 | ||
|
264 | The ``write(data)`` method is used to feed data into the compressor. | |
|
265 | ||
|
266 | The ``flush([flush_mode=FLUSH_BLOCK])`` method can be called to evict whatever | |
|
267 | data remains within the compressor's internal state into the output object. This | |
|
268 | may result in 0 or more ``write()`` calls to the output object. This method | |
|
269 | accepts an optional ``flush_mode`` argument to control the flushing behavior. | |
|
270 | Its value can be any of the ``FLUSH_*`` constants. | |
|
271 | ||
|
272 | Both ``write()`` and ``flush()`` return the number of bytes written to the | |
|
273 | object's ``write()``. In many cases, small inputs do not accumulate enough | |
|
274 | data to cause a write and ``write()`` will return ``0``. | |
|
275 | ||
|
276 | Calling ``close()`` will mark the stream as closed and subsequent I/O | |
|
277 | operations will raise ``ValueError`` (per the documented behavior of | |
|
278 | ``io.RawIOBase``). ``close()`` will also call ``close()`` on the underlying | |
|
279 | stream if such a method exists. | |
|
280 | ||
|
281 | Typical usage is as follows:: | 
|
282 | ||
|
283 | cctx = zstd.ZstdCompressor(level=10) | |
|
284 | compressor = cctx.stream_writer(fh) | |
|
285 | ||
|
286 | compressor.write(b'chunk 0\n') | |
|
287 | compressor.write(b'chunk 1\n') | |
|
288 | compressor.flush() | |
|
289 | # Receiver will be able to decode ``chunk 0\nchunk 1\n`` at this point. | |
|
290 | # Receiver is also expecting more data in the zstd *frame*. | |
|
291 | ||
|
292 | compressor.write(b'chunk 2\n') | |
|
293 | compressor.flush(zstd.FLUSH_FRAME) | |
|
294 | # Receiver will be able to decode ``chunk 0\nchunk 1\nchunk 2\n``. | 
|
295 | # Receiver is expecting no more data, as the zstd frame is closed. | |
|
296 | # Any future calls to ``write()`` at this point will construct a new | |
|
297 | # zstd frame. | |
|
298 | ||
|
299 | Instances can be used as context managers. Exiting the context manager is | |
|
300 | the equivalent of calling ``close()``, which is equivalent to calling | |
|
301 | ``flush(zstd.FLUSH_FRAME)``:: | |
|
256 | 302 | |
|
257 | 303 | cctx = zstd.ZstdCompressor(level=10) |
|
258 | 304 | with cctx.stream_writer(fh) as compressor: |
@@ -260,22 +306,12 b' data into a compressor.::' | |||
|
260 | 306 | compressor.write(b'chunk 1') |
|
261 | 307 | ... |
|
262 | 308 | |
|
263 | The argument to ``stream_writer()`` must have a ``write(data)`` method. As | |
|
264 | compressed data is available, ``write()`` will be called with the compressed | |
|
265 | data as its argument. Many common Python types implement ``write()``, including | |
|
266 | open file handles and ``io.BytesIO``. | |
|
309 | .. important:: | |
|
267 | 310 | |
|
268 | ``stream_writer()`` returns an object representing a streaming compressor | |
|
269 | instance. It **must** be used as a context manager. That object's | |
|
270 | ``write(data)`` method is used to feed data into the compressor. | |
|
271 | ||
|
272 | A ``flush()`` method can be called to evict whatever data remains within the | |
|
273 | compressor's internal state into the output object. This may result in 0 or | |
|
274 | more ``write()`` calls to the output object. | |
|
275 | ||
|
276 | Both ``write()`` and ``flush()`` return the number of bytes written to the | |
|
277 | object's ``write()``. In many cases, small inputs do not accumulate enough | |
|
278 | data to cause a write and ``write()`` will return ``0``. | |
|
311 | If ``flush(FLUSH_FRAME)`` is not called, emitted data doesn't constitute | |
|
312 | a full zstd *frame* and consumers of this data may complain about malformed | |
|
313 | input. It is recommended to use instances as context managers to ensure | 
|
314 | *frames* are properly finished. | |
|
279 | 315 | |
|
280 | 316 | If the size of the data being fed to this streaming compressor is known, |
|
281 | 317 | you can declare it before compression begins:: |
@@ -310,6 +346,14 b' Thte total number of bytes written so fa' | |||
|
310 | 346 | ... |
|
311 | 347 | total_written = compressor.tell() |
|
312 | 348 | |
|
349 | ``stream_writer()`` accepts a ``write_return_read`` boolean argument to control | |
|
350 | the return value of ``write()``. When ``False`` (the default), ``write()`` returns | |
|
351 | the number of bytes that were ``write()``en to the underlying object. When | |
|
352 | ``True``, ``write()`` returns the number of bytes read from the input that | |
|
353 | were subsequently written to the compressor. ``True`` is the *proper* behavior | |
|
354 | for ``write()`` as specified by the ``io.RawIOBase`` interface and will become | |
|
355 | the default value in a future release. | |
|
356 | ||
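
A short sketch of the difference (the return values shown are typical rather
than guaranteed, since small inputs may be buffered)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()

    # Default: write() reports bytes emitted to the destination. A small
    # input is usually buffered, so 0 is a common return value.
    writer = cctx.stream_writer(io.BytesIO())
    emitted = writer.write(b'data' * 16)
    writer.flush(zstd.FLUSH_FRAME)

    # write_return_read=True: write() reports bytes consumed from the
    # input, matching io.RawIOBase semantics.
    writer = cctx.stream_writer(io.BytesIO(), write_return_read=True)
    consumed = writer.write(b'data' * 16)  # 64
    writer.flush(zstd.FLUSH_FRAME)
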
|
313 | 357 | Streaming Output API |
|
314 | 358 | ^^^^^^^^^^^^^^^^^^^^ |
|
315 | 359 | |
@@ -654,27 +698,63 b' will raise ``ValueError`` if attempted.' | |||
|
654 | 698 | ``tell()`` returns the number of decompressed bytes read so far. |
|
655 | 699 | |
|
656 | 700 | Not all I/O methods are implemented. Notably missing is support for |
|
657 | ``readline()``, ``readlines()``, and linewise iteration support. Support for | 

658 | these is planned for a future release. | |
|
701 | ``readline()``, ``readlines()``, and linewise iteration support. This is | |
|
702 | because streams operate on binary data - not text data. If you want to | |
|
703 | convert decompressed output to text, you can chain an ``io.TextIOWrapper`` | |
|
704 | to the stream:: | |
|
705 | ||
|
706 | with open(path, 'rb') as fh: | |
|
707 | dctx = zstd.ZstdDecompressor() | |
|
708 | stream_reader = dctx.stream_reader(fh) | |
|
709 | text_stream = io.TextIOWrapper(stream_reader, encoding='utf-8') | |
|
710 | ||
|
711 | for line in text_stream: | |
|
712 | ... | |
|
713 | ||
|
714 | The ``read_across_frames`` argument to ``stream_reader()`` controls the | |
|
715 | behavior of read operations when the end of a zstd *frame* is encountered. | |
|
716 | When ``False`` (the default), a read will complete when the end of a | |
|
717 | zstd *frame* is encountered. When ``True``, a read can potentially | |
|
718 | return data spanning multiple zstd *frames*. | |
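
For example (a sketch; the input below concatenates two frames)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    two_frames = cctx.compress(b'foo') + cctx.compress(b'bar')

    dctx = zstd.ZstdDecompressor()

    # Default behavior: reading stops at the first frame boundary.
    reader = dctx.stream_reader(io.BytesIO(two_frames))
    first = reader.read(8192)  # b'foo'

    # read_across_frames=True allows a single read to span both frames.
    reader = dctx.stream_reader(io.BytesIO(two_frames),
                                read_across_frames=True)
    both = reader.read(8192)  # can return b'foobar'
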
|
659 | 719 | |
|
660 | 720 | Streaming Input API |
|
661 | 721 | ^^^^^^^^^^^^^^^^^^^ |
|
662 | 722 | |
|
663 | ``stream_writer(fh)`` can be used to incrementally send compressed data to a | |
|
664 | decompressor.:: | |
|
723 | ``stream_writer(fh)`` allows you to *stream* data into a decompressor. | |
|
724 | ||
|
725 | Returned instances implement the ``io.RawIOBase`` interface. Only methods | |
|
726 | that involve writing will do useful things. | |
|
727 | ||
|
728 | The argument to ``stream_writer()`` is typically an object that also implements | |
|
729 | ``io.RawIOBase``. But any object with a ``write(data)`` method will work. Many | |
|
730 | common Python types conform to this interface, including open file handles | |
|
731 | and ``io.BytesIO``. | |
|
732 | ||
|
733 | Behavior is similar to ``ZstdCompressor.stream_writer()``: compressed data | |
|
734 | is sent to the decompressor by calling ``write(data)`` and decompressed | |
|
735 | output is written to the underlying stream by calling its ``write(data)`` | |
|
736 | method.:: | |
|
665 | 737 | |
|
666 | 738 | dctx = zstd.ZstdDecompressor() |
|
667 | with dctx.stream_writer(fh) as decompressor: | 

668 | decompressor.write(compressed_data) | |
|
739 | decompressor = dctx.stream_writer(fh) | |
|
669 | 740 | |
|
670 | This behaves similarly to ``zstd.ZstdCompressor``: compressed data is written to | |
|
671 | the decompressor by calling ``write(data)`` and decompressed output is written | |
|
672 | to the output object by calling its ``write(data)`` method. | |
|
741 | decompressor.write(compressed_data) | |
|
742 | ... | |
|
743 | ||
|
673 | 744 |
|
|
674 | 745 | Calls to ``write()`` will return the number of bytes written to the output |
|
675 | 746 | object. Not all inputs will result in bytes being written, so return values |
|
676 | 747 | of ``0`` are possible. |
|
677 | 748 | |
|
749 | Like the ``stream_writer()`` compressor, instances can be used as context | |
|
750 | managers. However, context managers add no special behavior and offer | 

751 | little to no benefit. | 
|
752 | ||
|
753 | Calling ``close()`` will mark the stream as closed and subsequent I/O operations | |
|
754 | will raise ``ValueError`` (per the documented behavior of ``io.RawIOBase``). | |
|
755 | ``close()`` will also call ``close()`` on the underlying stream if such a | |
|
756 | method exists. | |
|
757 | ||
|
678 | 758 | The size of chunks being ``write()`` to the destination can be specified:: |
|
679 | 759 | |
|
680 | 760 | dctx = zstd.ZstdDecompressor() |
@@ -687,6 +767,13 b' You can see how much memory is being use' | |||
|
687 | 767 | with dctx.stream_writer(fh) as decompressor: |
|
688 | 768 | byte_size = decompressor.memory_size() |
|
689 | 769 | |
|
770 | ``stream_writer()`` accepts a ``write_return_read`` boolean argument to control | |
|
771 | the return value of ``write()``. When ``False`` (the default), ``write()`` | 
|
772 | returns the number of bytes that were ``write()``en to the underlying stream. | |
|
773 | When ``True``, ``write()`` returns the number of bytes read from the input. | |
|
774 | ``True`` is the *proper* behavior for ``write()`` as specified by the | |
|
775 | ``io.RawIOBase`` interface and will become the default in a future release. | |
|
776 | ||
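
This mirrors the compressor's ``stream_writer()``. A short sketch::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data')

    dctx = zstd.ZstdDecompressor()
    writer = dctx.stream_writer(io.BytesIO(), write_return_read=True)
    consumed = writer.write(frame)  # bytes read from the input,
                                    # typically len(frame)
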
|
690 | 777 | Streaming Output API |
|
691 | 778 | ^^^^^^^^^^^^^^^^^^^^ |
|
692 | 779 | |
@@ -791,6 +878,10 b' these temporary chunks by passing ``writ' | |||
|
791 | 878 | memory (re)allocations, this streaming decompression API isn't as |
|
792 | 879 | efficient as other APIs. |
|
793 | 880 | |
|
881 | For compatibility with the standard library APIs, instances expose a | |
|
882 | ``flush([length=None])`` method. This method no-ops and has no meaningful | |
|
883 | side-effects, making it safe to call any time. | |
|
884 | ||
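
For example, a minimal sketch using the API described in this section::

    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'hello')

    dobj = zstd.ZstdDecompressor().decompressobj()
    data = dobj.decompress(frame)  # b'hello'
    dobj.flush()                   # no-op; present for stdlib API parity
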
|
794 | 885 | Batch Decompression API |
|
795 | 886 | ^^^^^^^^^^^^^^^^^^^^^^^ |
|
796 | 887 | |
@@ -1147,18 +1238,21 b' follows:' | |||
|
1147 | 1238 | * search_log |
|
1148 | 1239 | * min_match |
|
1149 | 1240 | * target_length |
|
1150 | * compression_strategy | 

1241 | * strategy | |
|
1242 | * compression_strategy (deprecated: same as ``strategy``) | |
|
1151 | 1243 | * write_content_size |
|
1152 | 1244 | * write_checksum |
|
1153 | 1245 | * write_dict_id |
|
1154 | 1246 | * job_size |
|
1155 | * overlap_size_log | 

1247 | * overlap_log | |
|
1248 | * overlap_size_log (deprecated: same as ``overlap_log``) | |
|
1156 | 1249 | * force_max_window |
|
1157 | 1250 | * enable_ldm |
|
1158 | 1251 | * ldm_hash_log |
|
1159 | 1252 | * ldm_min_match |
|
1160 | 1253 | * ldm_bucket_size_log |
|
1161 | * ldm_hash_every_log | 

1254 | * ldm_hash_rate_log | |
|
1255 | * ldm_hash_every_log (deprecated: same as ``ldm_hash_rate_log``) | |
|
1162 | 1256 | * threads |
|
1163 | 1257 | |
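
A sketch constructing parameters with the newer argument names (the values
are illustrative only)::

    import zstandard as zstd

    params = zstd.CompressionParameters(
        window_log=22,
        strategy=zstd.STRATEGY_BTOPT,  # rather than compression_strategy
        overlap_log=6,                 # rather than overlap_size_log
        enable_ldm=1,
        ldm_hash_rate_log=7,           # rather than ldm_hash_every_log
    )
    cctx = zstd.ZstdCompressor(compression_params=params)
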
|
1164 | 1258 | Some of these are very low-level settings. It may help to consult the official |
@@ -1240,6 +1334,13 b' FRAME_HEADER' | |||
|
1240 | 1334 | MAGIC_NUMBER |
|
1241 | 1335 | Frame header as an integer |
|
1242 | 1336 | |
|
1337 | FLUSH_BLOCK | |
|
1338 | Flushing behavior that denotes to flush a zstd block. A decompressor will | |
|
1339 | be able to decode all data fed into the compressor so far. | |
|
1340 | FLUSH_FRAME | |
|
1341 | Flushing behavior that denotes to end a zstd frame. Any new data fed | |
|
1342 | to the compressor will start a new frame. | |
|
1343 | ||
|
1243 | 1344 | CONTENTSIZE_UNKNOWN |
|
1244 | 1345 | Value for content size when the content size is unknown. |
|
1245 | 1346 | CONTENTSIZE_ERROR |
@@ -1261,10 +1362,18 b' SEARCHLOG_MIN' | |||
|
1261 | 1362 | Minimum value for compression parameter |
|
1262 | 1363 | SEARCHLOG_MAX |
|
1263 | 1364 | Maximum value for compression parameter |
|
1365 | MINMATCH_MIN | |
|
1366 | Minimum value for compression parameter | |
|
1367 | MINMATCH_MAX | |
|
1368 | Maximum value for compression parameter | |
|
1264 | 1369 | SEARCHLENGTH_MIN |
|
1265 | 1370 | Minimum value for compression parameter |
|
1371 | ||
|
1372 | Deprecated: use ``MINMATCH_MIN`` | |
|
1266 | 1373 | SEARCHLENGTH_MAX |
|
1267 | 1374 | Maximum value for compression parameter |
|
1375 | ||
|
1376 | Deprecated: use ``MINMATCH_MAX`` | |
|
1268 | 1377 | TARGETLENGTH_MIN |
|
1269 | 1378 | Minimum value for compression parameter |
|
1270 | 1379 | STRATEGY_FAST |
@@ -1283,6 +1392,8 b' STRATEGY_BTOPT' | |||
|
1283 | 1392 | Compression strategy |
|
1284 | 1393 | STRATEGY_BTULTRA |
|
1285 | 1394 | Compression strategy |
|
1395 | STRATEGY_BTULTRA2 | |
|
1396 | Compression strategy | |
|
1286 | 1397 | |
|
1287 | 1398 | FORMAT_ZSTD1 |
|
1288 | 1399 | Zstandard frame format |
@@ -43,7 +43,7 b' static PyObject* ZstdCompressionChunkerI' | |||
|
43 | 43 | /* If we have data left in the input, consume it. */ |
|
44 | 44 | while (chunker->input.pos < chunker->input.size) { |
|
45 | 45 | Py_BEGIN_ALLOW_THREADS |
|
46 | zresult = ZSTD_compress_generic(chunker->compressor->cctx, &chunker->output, | 

46 | zresult = ZSTD_compressStream2(chunker->compressor->cctx, &chunker->output, | |
|
47 | 47 | &chunker->input, ZSTD_e_continue); |
|
48 | 48 | Py_END_ALLOW_THREADS |
|
49 | 49 | |
@@ -104,7 +104,7 b' static PyObject* ZstdCompressionChunkerI' | |||
|
104 | 104 | } |
|
105 | 105 | |
|
106 | 106 | Py_BEGIN_ALLOW_THREADS |
|
107 | zresult = ZSTD_compress_generic(chunker->compressor->cctx, &chunker->output, | 

107 | zresult = ZSTD_compressStream2(chunker->compressor->cctx, &chunker->output, | |
|
108 | 108 | &chunker->input, zFlushMode); |
|
109 | 109 | Py_END_ALLOW_THREADS |
|
110 | 110 |
@@ -298,13 +298,9 b' static PyObject* ZstdCompressionDict_pre' | |||
|
298 | 298 | cParams = ZSTD_getCParams(level, 0, self->dictSize); |
|
299 | 299 | } |
|
300 | 300 | else { |
|
301 | cParams.chainLog = compressionParams->chainLog; | |
|
302 | cParams.hashLog = compressionParams->hashLog; | |
|
303 | cParams.searchLength = compressionParams->minMatch; | |
|
304 | cParams.searchLog = compressionParams->searchLog; | |
|
305 | cParams.strategy = compressionParams->compressionStrategy; | |
|
306 | cParams.targetLength = compressionParams->targetLength; | |
|
307 | cParams.windowLog = compressionParams->windowLog; | |
|
301 | if (to_cparams(compressionParams, &cParams)) { | |
|
302 | return NULL; | |
|
303 | } | |
|
308 | 304 | } |
|
309 | 305 | |
|
310 | 306 | assert(!self->cdict); |
@@ -10,7 +10,7 b'' | |||
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | int set_parameter(ZSTD_CCtx_params* params, ZSTD_cParameter param, unsigned value) { | 

13 | int set_parameter(ZSTD_CCtx_params* params, ZSTD_cParameter param, int value) { | |
|
14 | 14 | size_t zresult = ZSTD_CCtxParam_setParameter(params, param, value); |
|
15 | 15 | if (ZSTD_isError(zresult)) { |
|
16 | 16 | PyErr_Format(ZstdError, "unable to set compression context parameter: %s", |
@@ -23,28 +23,41 b' int set_parameter(ZSTD_CCtx_params* para' | |||
|
23 | 23 | |
|
24 | 24 | #define TRY_SET_PARAMETER(params, param, value) if (set_parameter(params, param, value)) return -1; |
|
25 | 25 | |
|
26 | #define TRY_COPY_PARAMETER(source, dest, param) { \ | |
|
27 | int result; \ | |
|
28 | size_t zresult = ZSTD_CCtxParam_getParameter(source, param, &result); \ | |
|
29 | if (ZSTD_isError(zresult)) { \ | |
|
30 | return 1; \ | |
|
31 | } \ | |
|
32 | zresult = ZSTD_CCtxParam_setParameter(dest, param, result); \ | |
|
33 | if (ZSTD_isError(zresult)) { \ | |
|
34 | return 1; \ | |
|
35 | } \ | |
|
36 | } | |
|
37 | ||
|
26 | 38 | int set_parameters(ZSTD_CCtx_params* params, ZstdCompressionParametersObject* obj) { |
|
27 | TRY_SET_PARAMETER(params, ZSTD_p_format, obj->format); | 

28 | TRY_SET_PARAMETER(params, ZSTD_p_compressionLevel, (unsigned)obj->compressionLevel); | |
|
29 | TRY_SET_PARAMETER(params, ZSTD_p_windowLog, obj->windowLog); | |
|
30 | TRY_SET_PARAMETER(params, ZSTD_p_hashLog, obj->hashLog); | |
|
31 | TRY_SET_PARAMETER(params, ZSTD_p_chainLog, obj->chainLog); | 

32 | TRY_SET_PARAMETER(params, ZSTD_p_searchLog, obj->searchLog); | 

33 | TRY_SET_PARAMETER(params, ZSTD_p_minMatch, obj->minMatch); | |
|
34 | TRY_SET_PARAMETER(params, ZSTD_p_targetLength, obj->targetLength); | |
|
35 | TRY_SET_PARAMETER(params, ZSTD_p_compressionStrategy, obj->compressionStrategy); | |
|
36 | TRY_SET_PARAMETER(params, ZSTD_p_contentSizeFlag, obj->contentSizeFlag); | |
|
37 | TRY_SET_PARAMETER(params, ZSTD_p_checksumFlag, obj->checksumFlag); | |
|
38 | TRY_SET_PARAMETER(params, ZSTD_p_dictIDFlag, obj->dictIDFlag); | |
|
39 | TRY_SET_PARAMETER(params, ZSTD_p_nbWorkers, obj->threads); | |
|
40 | TRY_SET_PARAMETER(params, ZSTD_p_jobSize, obj->jobSize); | |
|
41 | TRY_SET_PARAMETER(params, ZSTD_p_overlapSizeLog, obj->overlapSizeLog); | |
|
42 | TRY_SET_PARAMETER(params, ZSTD_p_forceMaxWindow, obj->forceMaxWindow); | |
|
43 | TRY_SET_PARAMETER(params, ZSTD_p_enableLongDistanceMatching, obj->enableLongDistanceMatching); | |
|
44 | TRY_SET_PARAMETER(params, ZSTD_p_ldmHashLog, obj->ldmHashLog); | |
|
45 | TRY_SET_PARAMETER(params, ZSTD_p_ldmMinMatch, obj->ldmMinMatch); | 

46 | TRY_SET_PARAMETER(params, ZSTD_p_ldmBucketSizeLog, obj->ldmBucketSizeLog); | |
|
47 | TRY_SET_PARAMETER(params, ZSTD_p_ldmHashEveryLog, obj->ldmHashEveryLog); | |
|
39 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_nbWorkers); | |
|
40 | ||
|
41 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_format); | |
|
42 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_compressionLevel); | |
|
43 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_windowLog); | |
|
44 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_hashLog); | |
|
45 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_chainLog); | |
|
46 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_searchLog); | |
|
47 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_minMatch); | |
|
48 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_targetLength); | |
|
49 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_strategy); | |
|
50 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_contentSizeFlag); | |
|
51 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_checksumFlag); | |
|
52 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_dictIDFlag); | |
|
53 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_jobSize); | |
|
54 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_overlapLog); | |
|
55 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_forceMaxWindow); | |
|
56 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_enableLongDistanceMatching); | |
|
57 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_ldmHashLog); | |
|
58 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_ldmMinMatch); | |
|
59 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_ldmBucketSizeLog); | |
|
60 | TRY_COPY_PARAMETER(obj->params, params, ZSTD_c_ldmHashRateLog); | |
|
48 | 61 | |
|
49 | 62 | return 0; |
|
50 | 63 | } |
@@ -64,6 +77,41 b' int reset_params(ZstdCompressionParamete' | |||
|
64 | 77 | return set_parameters(params->params, params); |
|
65 | 78 | } |
|
66 | 79 | |
|
80 | #define TRY_GET_PARAMETER(params, param, value) { \ | |
|
81 | size_t zresult = ZSTD_CCtxParam_getParameter(params, param, value); \ | |
|
82 | if (ZSTD_isError(zresult)) { \ | |
|
83 | PyErr_Format(ZstdError, "unable to retrieve parameter: %s", ZSTD_getErrorName(zresult)); \ | |
|
84 | return 1; \ | |
|
85 | } \ | |
|
86 | } | |
|
87 | ||
|
88 | int to_cparams(ZstdCompressionParametersObject* params, ZSTD_compressionParameters* cparams) { | |
|
89 | int value; | |
|
90 | ||
|
91 | TRY_GET_PARAMETER(params->params, ZSTD_c_windowLog, &value); | |
|
92 | cparams->windowLog = value; | |
|
93 | ||
|
94 | TRY_GET_PARAMETER(params->params, ZSTD_c_chainLog, &value); | |
|
95 | cparams->chainLog = value; | |
|
96 | ||
|
97 | TRY_GET_PARAMETER(params->params, ZSTD_c_hashLog, &value); | |
|
98 | cparams->hashLog = value; | |
|
99 | ||
|
100 | TRY_GET_PARAMETER(params->params, ZSTD_c_searchLog, &value); | |
|
101 | cparams->searchLog = value; | |
|
102 | ||
|
103 | TRY_GET_PARAMETER(params->params, ZSTD_c_minMatch, &value); | |
|
104 | cparams->minMatch = value; | |
|
105 | ||
|
106 | TRY_GET_PARAMETER(params->params, ZSTD_c_targetLength, &value); | |
|
107 | cparams->targetLength = value; | |
|
108 | ||
|
109 | TRY_GET_PARAMETER(params->params, ZSTD_c_strategy, &value); | |
|
110 | cparams->strategy = value; | |
|
111 | ||
|
112 | return 0; | |
|
113 | } | |
|
114 | ||
|
67 | 115 | static int ZstdCompressionParameters_init(ZstdCompressionParametersObject* self, PyObject* args, PyObject* kwargs) { |
|
68 | 116 | static char* kwlist[] = { |
|
69 | 117 | "format", |
@@ -75,50 +123,60 b' static int ZstdCompressionParameters_ini' | |||
|
75 | 123 | "min_match", |
|
76 | 124 | "target_length", |
|
77 | 125 | "compression_strategy", |
|
126 | "strategy", | |
|
78 | 127 | "write_content_size", |
|
79 | 128 | "write_checksum", |
|
80 | 129 | "write_dict_id", |
|
81 | 130 | "job_size", |
|
131 | "overlap_log", | |
|
82 | 132 | "overlap_size_log", |
|
83 | 133 | "force_max_window", |
|
84 | 134 | "enable_ldm", |
|
85 | 135 | "ldm_hash_log", |
|
86 | 136 | "ldm_min_match", |
|
87 | 137 | "ldm_bucket_size_log", |
|
138 | "ldm_hash_rate_log", | |
|
88 | 139 | "ldm_hash_every_log", |
|
89 | 140 | "threads", |
|
90 | 141 | NULL |
|
91 | 142 | }; |
|
92 | 143 | |
|
93 | unsigned format = 0; | 

144 | int format = 0; | |
|
94 | 145 | int compressionLevel = 0; |
|
95 | unsigned windowLog = 0; | 

96 | unsigned hashLog = 0; | 

97 | unsigned chainLog = 0; | 

98 | unsigned searchLog = 0; | 

99 | unsigned minMatch = 0; | 

100 | unsigned targetLength = 0; | 

101 | unsigned compressionStrategy = 0; | 

102 | unsigned contentSizeFlag = 1; | |
|
103 | unsigned checksumFlag = 0; | |
|
104 | unsigned dictIDFlag = 0; | |
|
105 | unsigned jobSize = 0; | |
|
106 | unsigned overlapSizeLog = 0; | |
|
107 | unsigned forceMaxWindow = 0; | |
|
108 | unsigned enableLDM = 0; | |
|
109 | unsigned ldmHashLog = 0; | |
|
110 | unsigned ldmMinMatch = 0; | |
|
111 | unsigned ldmBucketSizeLog = 0; | |
|
112 | unsigned ldmHashEveryLog = 0; | |
|
146 | int windowLog = 0; | |
|
147 | int hashLog = 0; | |
|
148 | int chainLog = 0; | |
|
149 | int searchLog = 0; | |
|
150 | int minMatch = 0; | |
|
151 | int targetLength = 0; | |
|
152 | int compressionStrategy = -1; | |
|
153 | int strategy = -1; | |
|
154 | int contentSizeFlag = 1; | |
|
155 | int checksumFlag = 0; | |
|
156 | int dictIDFlag = 0; | |
|
157 | int jobSize = 0; | |
|
158 | int overlapLog = -1; | |
|
159 | int overlapSizeLog = -1; | |
|
160 | int forceMaxWindow = 0; | |
|
161 | int enableLDM = 0; | |
|
162 | int ldmHashLog = 0; | |
|
163 | int ldmMinMatch = 0; | |
|
164 | int ldmBucketSizeLog = 0; | |
|
165 | int ldmHashRateLog = -1; | |
|
166 | int ldmHashEveryLog = -1; | |
|
113 | 167 | int threads = 0; |
|
114 | 168 | |
|
115 | 169 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, |
|
116 | "|IiIIIIIIIIIIIIIIIIIIi:CompressionParameters", | |
|
170 | "|iiiiiiiiiiiiiiiiiiiiiiii:CompressionParameters", | |
|
117 | 171 | kwlist, &format, &compressionLevel, &windowLog, &hashLog, &chainLog, |
|
118 | &searchLog, &minMatch, &targetLength, &compressionStrategy, | |
|
119 | &contentSizeFlag, &checksumFlag, &dictIDFlag, &jobSize, &overlapSizeLog, | 

120 | &forceMaxWindow, &enableLDM, &ldmHashLog, &ldmMinMatch, &ldmBucketSizeLog, | 

121 | &ldmHashEveryLog, &threads)) { | |
|
172 | &searchLog, &minMatch, &targetLength, &compressionStrategy, &strategy, | |
|
173 | &contentSizeFlag, &checksumFlag, &dictIDFlag, &jobSize, &overlapLog, | |
|
174 | &overlapSizeLog, &forceMaxWindow, &enableLDM, &ldmHashLog, &ldmMinMatch, | |
|
175 | &ldmBucketSizeLog, &ldmHashRateLog, &ldmHashEveryLog, &threads)) { | |
|
176 | return -1; | |
|
177 | } | |
|
178 | ||
|
179 | if (reset_params(self)) { | |
|
122 | 180 | return -1; |
|
123 | 181 | } |
|
124 | 182 | |
@@ -126,32 +184,70 b' static int ZstdCompressionParameters_ini' | |||
|
126 | 184 | threads = cpu_count(); |
|
127 | 185 | } |
|
128 | 186 | |
|
129 | self->format = format; | |
|
130 | self->compressionLevel = compressionLevel; | |
|
131 | self->windowLog = windowLog; | |
|
132 | self->hashLog = hashLog; | |
|
133 | self->chainLog = chainLog; | |
|
134 | self->searchLog = searchLog; | |
|
135 | self->minMatch = minMatch; | |
|
136 | self->targetLength = targetLength; | |
|
137 | self->compressionStrategy = compressionStrategy; | |
|
138 | self->contentSizeFlag = contentSizeFlag; | |
|
139 | self->checksumFlag = checksumFlag; | |
|
140 | self->dictIDFlag = dictIDFlag; | |
|
141 | self->threads = threads; | |
|
142 | self->jobSize = jobSize; | |
|
143 | self->overlapSizeLog = overlapSizeLog; | |
|
144 | self->forceMaxWindow = forceMaxWindow; | |
|
145 | self->enableLongDistanceMatching = enableLDM; | |
|
146 | self->ldmHashLog = ldmHashLog; | |
|
147 | self->ldmMinMatch = ldmMinMatch; | |
|
148 | self->ldmBucketSizeLog = ldmBucketSizeLog; | |
|
149 | self->ldmHashEveryLog = ldmHashEveryLog; | |
|
187 | /* We need to set ZSTD_c_nbWorkers before ZSTD_c_jobSize and ZSTD_c_overlapLog | |
|
188 | * because setting ZSTD_c_nbWorkers resets the other parameters. */ | |
|
189 | TRY_SET_PARAMETER(self->params, ZSTD_c_nbWorkers, threads); | |
|
190 | ||
|
191 | TRY_SET_PARAMETER(self->params, ZSTD_c_format, format); | |
|
192 | TRY_SET_PARAMETER(self->params, ZSTD_c_compressionLevel, compressionLevel); | |
|
193 | TRY_SET_PARAMETER(self->params, ZSTD_c_windowLog, windowLog); | |
|
194 | TRY_SET_PARAMETER(self->params, ZSTD_c_hashLog, hashLog); | |
|
195 | TRY_SET_PARAMETER(self->params, ZSTD_c_chainLog, chainLog); | |
|
196 | TRY_SET_PARAMETER(self->params, ZSTD_c_searchLog, searchLog); | |
|
197 | TRY_SET_PARAMETER(self->params, ZSTD_c_minMatch, minMatch); | |
|
198 | TRY_SET_PARAMETER(self->params, ZSTD_c_targetLength, targetLength); | |
|
150 | 199 | |
|
151 | if (reset_params(self)) { | |
|
200 | if (compressionStrategy != -1 && strategy != -1) { | |
|
201 | PyErr_SetString(PyExc_ValueError, "cannot specify both compression_strategy and strategy"); | |
|
202 | return -1; | |
|
203 | } | |
|
204 | ||
|
205 | if (compressionStrategy != -1) { | |
|
206 | strategy = compressionStrategy; | |
|
207 | } | |
|
208 | else if (strategy == -1) { | |
|
209 | strategy = 0; | |
|
210 | } | |
|
211 | ||
|
212 | TRY_SET_PARAMETER(self->params, ZSTD_c_strategy, strategy); | |
|
213 | TRY_SET_PARAMETER(self->params, ZSTD_c_contentSizeFlag, contentSizeFlag); | |
|
214 | TRY_SET_PARAMETER(self->params, ZSTD_c_checksumFlag, checksumFlag); | |
|
215 | TRY_SET_PARAMETER(self->params, ZSTD_c_dictIDFlag, dictIDFlag); | |
|
216 | TRY_SET_PARAMETER(self->params, ZSTD_c_jobSize, jobSize); | |
|
217 | ||
|
218 | if (overlapLog != -1 && overlapSizeLog != -1) { | |
|
219 | PyErr_SetString(PyExc_ValueError, "cannot specify both overlap_log and overlap_size_log"); | |
|
152 | 220 | return -1; |
|
153 | 221 | } |
|
154 | 222 | |
|
223 | if (overlapSizeLog != -1) { | |
|
224 | overlapLog = overlapSizeLog; | |
|
225 | } | |
|
226 | else if (overlapLog == -1) { | |
|
227 | overlapLog = 0; | |
|
228 | } | |
|
229 | ||
|
230 | TRY_SET_PARAMETER(self->params, ZSTD_c_overlapLog, overlapLog); | |
|
231 | TRY_SET_PARAMETER(self->params, ZSTD_c_forceMaxWindow, forceMaxWindow); | |
|
232 | TRY_SET_PARAMETER(self->params, ZSTD_c_enableLongDistanceMatching, enableLDM); | |
|
233 | TRY_SET_PARAMETER(self->params, ZSTD_c_ldmHashLog, ldmHashLog); | |
|
234 | TRY_SET_PARAMETER(self->params, ZSTD_c_ldmMinMatch, ldmMinMatch); | |
|
235 | TRY_SET_PARAMETER(self->params, ZSTD_c_ldmBucketSizeLog, ldmBucketSizeLog); | |
|
236 | ||
|
237 | if (ldmHashRateLog != -1 && ldmHashEveryLog != -1) { | |
|
238 | PyErr_SetString(PyExc_ValueError, "cannot specify both ldm_hash_rate_log and ldm_hash_every_log"); | 
|
239 | return -1; | |
|
240 | } | |
|
241 | ||
|
242 | if (ldmHashEveryLog != -1) { | |
|
243 | ldmHashRateLog = ldmHashEveryLog; | |
|
244 | } | |
|
245 | else if (ldmHashRateLog == -1) { | |
|
246 | ldmHashRateLog = 0; | |
|
247 | } | |
|
248 | ||
|
249 | TRY_SET_PARAMETER(self->params, ZSTD_c_ldmHashRateLog, ldmHashRateLog); | |
|
250 | ||
|
155 | 251 | return 0; |
|
156 | 252 | } |
|
157 | 253 | |
@@ -259,7 +355,7 b' ZstdCompressionParametersObject* Compres' | |||
|
259 | 355 | |
|
260 | 356 | val = PyDict_GetItemString(kwargs, "min_match"); |
|
261 | 357 | if (!val) { |
|
262 | val = PyLong_FromUnsignedLong(params.searchLength); | 

358 | val = PyLong_FromUnsignedLong(params.minMatch); | |
|
263 | 359 | if (!val) { |
|
264 | 360 | goto cleanup; |
|
265 | 361 | } |
@@ -336,6 +432,41 b' static void ZstdCompressionParameters_de' | |||
|
336 | 432 | PyObject_Del(self); |
|
337 | 433 | } |
|
338 | 434 | |
|
435 | #define PARAM_GETTER(name, param) PyObject* ZstdCompressionParameters_get_##name(PyObject* self, void* unused) { \ | |
|
436 | int result; \ | |
|
437 | size_t zresult; \ | |
|
438 | ZstdCompressionParametersObject* p = (ZstdCompressionParametersObject*)(self); \ | |
|
439 | zresult = ZSTD_CCtxParam_getParameter(p->params, param, &result); \ | |
|
440 | if (ZSTD_isError(zresult)) { \ | |
|
441 | PyErr_Format(ZstdError, "unable to get compression parameter: %s", \ | |
|
442 | ZSTD_getErrorName(zresult)); \ | |
|
443 | return NULL; \ | |
|
444 | } \ | |
|
445 | return PyLong_FromLong(result); \ | |
|
446 | } | |
|
447 | ||
|
448 | PARAM_GETTER(format, ZSTD_c_format) | |
|
449 | PARAM_GETTER(compression_level, ZSTD_c_compressionLevel) | |
|
450 | PARAM_GETTER(window_log, ZSTD_c_windowLog) | |
|
451 | PARAM_GETTER(hash_log, ZSTD_c_hashLog) | |
|
452 | PARAM_GETTER(chain_log, ZSTD_c_chainLog) | |
|
453 | PARAM_GETTER(search_log, ZSTD_c_searchLog) | |
|
454 | PARAM_GETTER(min_match, ZSTD_c_minMatch) | |
|
455 | PARAM_GETTER(target_length, ZSTD_c_targetLength) | |
|
456 | PARAM_GETTER(compression_strategy, ZSTD_c_strategy) | |
|
457 | PARAM_GETTER(write_content_size, ZSTD_c_contentSizeFlag) | |
|
458 | PARAM_GETTER(write_checksum, ZSTD_c_checksumFlag) | |
|
459 | PARAM_GETTER(write_dict_id, ZSTD_c_dictIDFlag) | |
|
460 | PARAM_GETTER(job_size, ZSTD_c_jobSize) | |
|
461 | PARAM_GETTER(overlap_log, ZSTD_c_overlapLog) | |
|
462 | PARAM_GETTER(force_max_window, ZSTD_c_forceMaxWindow) | |
|
463 | PARAM_GETTER(enable_ldm, ZSTD_c_enableLongDistanceMatching) | |
|
464 | PARAM_GETTER(ldm_hash_log, ZSTD_c_ldmHashLog) | |
|
465 | PARAM_GETTER(ldm_min_match, ZSTD_c_ldmMinMatch) | |
|
466 | PARAM_GETTER(ldm_bucket_size_log, ZSTD_c_ldmBucketSizeLog) | |
|
467 | PARAM_GETTER(ldm_hash_rate_log, ZSTD_c_ldmHashRateLog) | |
|
468 | PARAM_GETTER(threads, ZSTD_c_nbWorkers) | |
|
469 | ||
|
339 | 470 | static PyMethodDef ZstdCompressionParameters_methods[] = { |
|
340 | 471 | { |
|
341 | 472 | "from_level", |
@@ -352,70 +483,34 b' static PyMethodDef ZstdCompressionParame' | |||
|
352 | 483 | { NULL, NULL } |
|
353 | 484 | }; |
|
354 | 485 | |
|
355 | static PyMemberDef ZstdCompressionParameters_members[] = { | |
|
356 | { "format", T_UINT, | |
|
357 | offsetof(ZstdCompressionParametersObject, format), READONLY, | |
|
358 | "compression format" }, | |
|
359 | { "compression_level", T_INT, | |
|
360 | offsetof(ZstdCompressionParametersObject, compressionLevel), READONLY, | |
|
361 | "compression level" }, | |
|
362 | { "window_log", T_UINT, | |
|
363 | offsetof(ZstdCompressionParametersObject, windowLog), READONLY, | |
|
364 | "window log" }, | |
|
365 | { "hash_log", T_UINT, | |
|
366 | offsetof(ZstdCompressionParametersObject, hashLog), READONLY, | |
|
367 | "hash log" }, | |
|
368 | { "chain_log", T_UINT, | |
|
369 | offsetof(ZstdCompressionParametersObject, chainLog), READONLY, | |
|
370 | "chain log" }, | |
|
371 | { "search_log", T_UINT, | |
|
372 | offsetof(ZstdCompressionParametersObject, searchLog), READONLY, | |
|
373 | "search log" }, | |
|
374 | { "min_match", T_UINT, | |
|
375 | offsetof(ZstdCompressionParametersObject, minMatch), READONLY, | |
|
376 | "search length" }, | |
|
377 | { "target_length", T_UINT, | |
|
378 | offsetof(ZstdCompressionParametersObject, targetLength), READONLY, | |
|
379 | "target length" }, | |
|
380 | { "compression_strategy", T_UINT, | |
|
381 | offsetof(ZstdCompressionParametersObject, compressionStrategy), READONLY, | |
|
382 | "compression strategy" }, | |
|
383 | { "write_content_size", T_UINT, | |
|
384 | offsetof(ZstdCompressionParametersObject, contentSizeFlag), READONLY, | |
|
385 | "whether to write content size in frames" }, | |
|
386 | { "write_checksum", T_UINT, | |
|
387 | offsetof(ZstdCompressionParametersObject, checksumFlag), READONLY, | |
|
388 | "whether to write checksum in frames" }, | |
|
389 | { "write_dict_id", T_UINT, | |
|
390 | offsetof(ZstdCompressionParametersObject, dictIDFlag), READONLY, | |
|
391 | "whether to write dictionary ID in frames" }, | |
|
392 | { "threads", T_UINT, | |
|
393 | offsetof(ZstdCompressionParametersObject, threads), READONLY, | |
|
394 | "number of threads to use" }, | |
|
395 | { "job_size", T_UINT, | |
|
396 | offsetof(ZstdCompressionParametersObject, jobSize), READONLY, | |
|
397 | "size of compression job when using multiple threads" }, | |
|
398 | { "overlap_size_log", T_UINT, | |
|
399 | offsetof(ZstdCompressionParametersObject, overlapSizeLog), READONLY, | |
|
400 | "Size of previous input reloaded at the beginning of each job" }, | |
|
401 | { "force_max_window", T_UINT, | |
|
402 | offsetof(ZstdCompressionParametersObject, forceMaxWindow), READONLY, | |
|
403 | "force back references to remain smaller than window size" }, | |
|
404 | { "enable_ldm", T_UINT, | |
|
405 | offsetof(ZstdCompressionParametersObject, enableLongDistanceMatching), READONLY, | |
|
406 | "whether to enable long distance matching" }, | |
|
407 | { "ldm_hash_log", T_UINT, | |
|
408 | offsetof(ZstdCompressionParametersObject, ldmHashLog), READONLY, | |
|
409 | "Size of the table for long distance matching, as a power of 2" }, | |
|
410 | { "ldm_min_match", T_UINT, | |
|
411 | offsetof(ZstdCompressionParametersObject, ldmMinMatch), READONLY, | |
|
412 | "minimum size of searched matches for long distance matcher" }, | |
|
413 | { "ldm_bucket_size_log", T_UINT, | |
|
414 | offsetof(ZstdCompressionParametersObject, ldmBucketSizeLog), READONLY, | |
|
415 | "log size of each bucket in the LDM hash table for collision resolution" }, | |
|
416 | { "ldm_hash_every_log", T_UINT, | |
|
417 | offsetof(ZstdCompressionParametersObject, ldmHashEveryLog), READONLY, | |
|
418 | "frequency of inserting/looking up entries in the LDM hash table" }, | |
|
486 | #define GET_SET_ENTRY(name) { #name, ZstdCompressionParameters_get_##name, NULL, NULL, NULL } | |
|
487 | ||
|
488 | static PyGetSetDef ZstdCompressionParameters_getset[] = { | |
|
489 | GET_SET_ENTRY(format), | |
|
490 | GET_SET_ENTRY(compression_level), | |
|
491 | GET_SET_ENTRY(window_log), | |
|
492 | GET_SET_ENTRY(hash_log), | |
|
493 | GET_SET_ENTRY(chain_log), | |
|
494 | GET_SET_ENTRY(search_log), | |
|
495 | GET_SET_ENTRY(min_match), | |
|
496 | GET_SET_ENTRY(target_length), | |
|
497 | GET_SET_ENTRY(compression_strategy), | |
|
498 | GET_SET_ENTRY(write_content_size), | |
|
499 | GET_SET_ENTRY(write_checksum), | |
|
500 | GET_SET_ENTRY(write_dict_id), | |
|
501 | GET_SET_ENTRY(threads), | |
|
502 | GET_SET_ENTRY(job_size), | |
|
503 | GET_SET_ENTRY(overlap_log), | |
|
504 | /* TODO remove this deprecated attribute */ | |
|
505 | { "overlap_size_log", ZstdCompressionParameters_get_overlap_log, NULL, NULL, NULL }, | |
|
506 | GET_SET_ENTRY(force_max_window), | |
|
507 | GET_SET_ENTRY(enable_ldm), | |
|
508 | GET_SET_ENTRY(ldm_hash_log), | |
|
509 | GET_SET_ENTRY(ldm_min_match), | |
|
510 | GET_SET_ENTRY(ldm_bucket_size_log), | |
|
511 | GET_SET_ENTRY(ldm_hash_rate_log), | |
|
512 | /* TODO remove this deprecated attribute */ | |
|
513 | { "ldm_hash_every_log", ZstdCompressionParameters_get_ldm_hash_rate_log, NULL, NULL, NULL }, | |
|
419 | 514 | { NULL } |
|
420 | 515 | }; |
|
421 | 516 | |
@@ -448,8 +543,8 b' PyTypeObject ZstdCompressionParametersTy' | |||
|
448 | 543 | 0, /* tp_iter */ |
|
449 | 544 | 0, /* tp_iternext */ |
|
450 | 545 | ZstdCompressionParameters_methods, /* tp_methods */ |
|
451 | ZstdCompressionParameters_members, /* tp_members */ | |
|
452 | 0, /* tp_getset */ | |
|
546 | 0, /* tp_members */ | |
|
547 | ZstdCompressionParameters_getset, /* tp_getset */ | |
|
453 | 548 | 0, /* tp_base */ |
|
454 | 549 | 0, /* tp_dict */ |
|
455 | 550 | 0, /* tp_descr_get */ |
@@ -128,6 +128,96 b' static PyObject* reader_tell(ZstdCompres' | |||
|
128 | 128 | return PyLong_FromUnsignedLongLong(self->bytesCompressed); |
|
129 | 129 | } |
|
130 | 130 | |
|
131 | int read_compressor_input(ZstdCompressionReader* self) { | |
|
132 | if (self->finishedInput) { | |
|
133 | return 0; | |
|
134 | } | |
|
135 | ||
|
136 | if (self->input.pos != self->input.size) { | |
|
137 | return 0; | |
|
138 | } | |
|
139 | ||
|
140 | if (self->reader) { | |
|
141 | Py_buffer buffer; | |
|
142 | ||
|
143 | assert(self->readResult == NULL); | |
|
144 | ||
|
145 | self->readResult = PyObject_CallMethod(self->reader, "read", | |
|
146 | "k", self->readSize); | |
|
147 | ||
|
148 | if (NULL == self->readResult) { | |
|
149 | return -1; | |
|
150 | } | |
|
151 | ||
|
152 | memset(&buffer, 0, sizeof(buffer)); | |
|
153 | ||
|
154 | if (0 != PyObject_GetBuffer(self->readResult, &buffer, PyBUF_CONTIG_RO)) { | |
|
155 | return -1; | |
|
156 | } | |
|
157 | ||
|
158 | /* EOF */ | |
|
159 | if (0 == buffer.len) { | |
|
160 | self->finishedInput = 1; | |
|
161 | Py_CLEAR(self->readResult); | |
|
162 | } | |
|
163 | else { | |
|
164 | self->input.src = buffer.buf; | |
|
165 | self->input.size = buffer.len; | |
|
166 | self->input.pos = 0; | |
|
167 | } | |
|
168 | ||
|
169 | PyBuffer_Release(&buffer); | |
|
170 | } | |
|
171 | else { | |
|
172 | assert(self->buffer.buf); | |
|
173 | ||
|
174 | self->input.src = self->buffer.buf; | |
|
175 | self->input.size = self->buffer.len; | |
|
176 | self->input.pos = 0; | |
|
177 | } | |
|
178 | ||
|
179 | return 1; | |
|
180 | } | |
|
181 | ||
|
182 | int compress_input(ZstdCompressionReader* self, ZSTD_outBuffer* output) { | |
|
183 | size_t oldPos; | |
|
184 | size_t zresult; | |
|
185 | ||
|
186 | /* If we have data left over, consume it. */ | |
|
187 | if (self->input.pos < self->input.size) { | |
|
188 | oldPos = output->pos; | |
|
189 | ||
|
190 | Py_BEGIN_ALLOW_THREADS | |
|
191 | zresult = ZSTD_compressStream2(self->compressor->cctx, | |
|
192 | output, &self->input, ZSTD_e_continue); | |
|
193 | Py_END_ALLOW_THREADS | |
|
194 | ||
|
195 | self->bytesCompressed += output->pos - oldPos; | |
|
196 | ||
|
197 | /* Input exhausted. Clear out state tracking. */ | |
|
198 | if (self->input.pos == self->input.size) { | |
|
199 | memset(&self->input, 0, sizeof(self->input)); | |
|
200 | Py_CLEAR(self->readResult); | |
|
201 | ||
|
202 | if (self->buffer.buf) { | |
|
203 | self->finishedInput = 1; | |
|
204 | } | |
|
205 | } | |
|
206 | ||
|
207 | if (ZSTD_isError(zresult)) { | |
|
208 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); | |
|
209 | return -1; | |
|
210 | } | |
|
211 | } | |
|
212 | ||
|
213 | if (output->pos && output->pos == output->size) { | |
|
214 | return 1; | |
|
215 | } | |
|
216 | else { | |
|
217 | return 0; | |
|
218 | } | |
|
219 | } | |
|
220 | ||
|
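``read_compressor_input()`` and ``compress_input()`` factor the old inline loop in ``read()`` into two steps: pull a chunk from the source, then drain it through the compressor until the output buffer fills. A runnable Python analog of that pump loop, with illustrative names that are not part of the C API::

    import io

    def pump(source, process, out_size=8):
        out = bytearray()
        while len(out) < out_size:
            chunk = source.read(4)    # analog of read_compressor_input()
            if not chunk:             # EOF; the C code then ends the frame
                break
            out += process(chunk)     # analog of compress_input()
        return bytes(out)

    print(pump(io.BytesIO(b"abcdefghij"), lambda c: c.upper()))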
131 | 221 | static PyObject* reader_read(ZstdCompressionReader* self, PyObject* args, PyObject* kwargs) { |
|
132 | 222 | static char* kwlist[] = { |
|
133 | 223 | "size", |
@@ -140,25 +230,30 b' static PyObject* reader_read(ZstdCompres' | |||
|
140 | 230 | Py_ssize_t resultSize; |
|
141 | 231 | size_t zresult; |
|
142 | 232 | size_t oldPos; |
|
233 | int readResult, compressResult; | |
|
143 | 234 | |
|
144 | 235 | if (self->closed) { |
|
145 | 236 | PyErr_SetString(PyExc_ValueError, "stream is closed"); |
|
146 | 237 | return NULL; |
|
147 | 238 | } |
|
148 | 239 | |
|
149 | if (self->finishedOutput) { | |
|
150 | return PyBytes_FromStringAndSize("", 0); | |
|
151 | } | |
|
152 | ||
|
153 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "n", kwlist, &size)) { | |
|
240 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|n", kwlist, &size)) { | |
|
154 | 241 | return NULL; |
|
155 | 242 | } |
|
156 | 243 | |
|
157 | if (size < 1) { | |
|
158 | PyErr_SetString(PyExc_ValueError, "cannot read negative or size 0 amounts"); | |
|
244 | if (size < -1) { | |
|
245 | PyErr_SetString(PyExc_ValueError, "cannot read negative amounts less than -1"); | |
|
159 | 246 | return NULL; |
|
160 | 247 | } |
|
161 | 248 | |
|
249 | if (size == -1) { | |
|
250 | return PyObject_CallMethod((PyObject*)self, "readall", NULL); | |
|
251 | } | |
|
252 | ||
|
253 | if (self->finishedOutput || size == 0) { | |
|
254 | return PyBytes_FromStringAndSize("", 0); | |
|
255 | } | |
|
256 | ||
|
162 | 257 | result = PyBytes_FromStringAndSize(NULL, size); |
|
163 | 258 | if (NULL == result) { |
|
164 | 259 | return NULL; |
@@ -172,86 +267,34 b' static PyObject* reader_read(ZstdCompres' | |||
|
172 | 267 | |
|
173 | 268 | readinput: |
|
174 | 269 | |
|
175 | /* If we have data left over, consume it. */ | |
|
176 | if (self->input.pos < self->input.size) { | |
|
177 | oldPos = self->output.pos; | |
|
178 | ||
|
179 | Py_BEGIN_ALLOW_THREADS | |
|
180 | zresult = ZSTD_compress_generic(self->compressor->cctx, | |
|
181 | &self->output, &self->input, ZSTD_e_continue); | |
|
182 | ||
|
183 | Py_END_ALLOW_THREADS | |
|
184 | ||
|
185 | self->bytesCompressed += self->output.pos - oldPos; | |
|
186 | ||
|
187 | /* Input exhausted. Clear out state tracking. */ | |
|
188 | if (self->input.pos == self->input.size) { | |
|
189 | memset(&self->input, 0, sizeof(self->input)); | |
|
190 | Py_CLEAR(self->readResult); | |
|
270 | compressResult = compress_input(self, &self->output); | |
|
191 | 271 | |
|
192 | if (self->buffer.buf) { | |
|
193 | self->finishedInput = 1; | |
|
194 | } | |
|
195 | } | |
|
196 | ||
|
197 | if (ZSTD_isError(zresult)) { | |
|
198 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); | |
|
199 | return NULL; | |
|
200 | } | |
|
201 | ||
|
202 | if (self->output.pos) { | |
|
203 | /* If no more room in output, emit it. */ | |
|
204 | if (self->output.pos == self->output.size) { | |
|
205 | memset(&self->output, 0, sizeof(self->output)); | |
|
206 | return result; | |
|
207 | } | |
|
208 | ||
|
209 | /* | |
|
210 | * There is room in the output. We fall through to below, which will either | |
|
211 | * get more input for us or will attempt to end the stream. | |
|
212 | */ | |
|
213 | } | |
|
214 | ||
|
215 | /* Fall through to gather more input. */ | |
|
272 | if (-1 == compressResult) { | |
|
273 | Py_XDECREF(result); | |
|
274 | return NULL; | |
|
275 | } | |
|
276 | else if (0 == compressResult) { | |
|
277 | /* There is room in the output. We fall through to below, which will | |
|
278 | * either get more input for us or will attempt to end the stream. | |
|
279 | */ | |
|
280 | } | |
|
281 | else if (1 == compressResult) { | |
|
282 | memset(&self->output, 0, sizeof(self->output)); | |
|
283 | return result; | |
|
284 | } | |
|
285 | else { | |
|
286 | assert(0); | |
|
216 | 287 | } |
|
217 | 288 | |
|
218 | if (!self->finishedInput) { | |
|
219 | if (self->reader) { | |
|
220 | Py_buffer buffer; | |
|
221 | ||
|
222 | assert(self->readResult == NULL); | |
|
223 | self->readResult = PyObject_CallMethod(self->reader, "read", | |
|
224 | "k", self->readSize); | |
|
225 | if (self->readResult == NULL) { | |
|
226 | return NULL; | |
|
227 | } | |
|
228 | ||
|
229 | memset(&buffer, 0, sizeof(buffer)); | |
|
230 | ||
|
231 | if (0 != PyObject_GetBuffer(self->readResult, &buffer, PyBUF_CONTIG_RO)) { | |
|
232 | return NULL; | |
|
233 | } | |
|
289 | readResult = read_compressor_input(self); | |
|
234 | 290 | |
|
235 | /* EOF */ | |
|
236 | if (0 == buffer.len) { | |
|
237 | self->finishedInput = 1; | |
|
238 | Py_CLEAR(self->readResult); | |
|
239 | } | |
|
240 | else { | |
|
241 | self->input.src = buffer.buf; | |
|
242 | self->input.size = buffer.len; | |
|
243 | self->input.pos = 0; | |
|
244 | } | |
|
245 | ||
|
246 | PyBuffer_Release(&buffer); | |
|
247 | } | |
|
248 | else { | |
|
249 | assert(self->buffer.buf); | |
|
250 | ||
|
251 | self->input.src = self->buffer.buf; | |
|
252 | self->input.size = self->buffer.len; | |
|
253 | self->input.pos = 0; | |
|
254 | } | |
|
291 | if (-1 == readResult) { | |
|
292 | return NULL; | |
|
293 | } | |
|
294 | else if (0 == readResult) { } | |
|
295 | else if (1 == readResult) { } | |
|
296 | else { | |
|
297 | assert(0); | |
|
255 | 298 | } |
|
256 | 299 | |
|
257 | 300 | if (self->input.size) { |
@@ -261,7 +304,7 b' readinput:' | |||
|
261 | 304 | /* Else EOF */ |
|
262 | 305 | oldPos = self->output.pos; |
|
263 | 306 | |
|
264 | zresult = ZSTD_compress_generic(self->compressor->cctx, &self->output, | |
|
307 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, | |
|
265 | 308 | &self->input, ZSTD_e_end); |
|
266 | 309 | |
|
267 | 310 | self->bytesCompressed += self->output.pos - oldPos; |
@@ -269,6 +312,7 b' readinput:' | |||
|
269 | 312 | if (ZSTD_isError(zresult)) { |
|
270 | 313 | PyErr_Format(ZstdError, "error ending compression stream: %s", |
|
271 | 314 | ZSTD_getErrorName(zresult)); |
|
315 | Py_XDECREF(result); | |
|
272 | 316 | return NULL; |
|
273 | 317 | } |
|
274 | 318 | |
@@ -288,9 +332,394 b' readinput:' | |||
|
288 | 332 | return result; |
|
289 | 333 | } |
|
290 | 334 | |
|
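With the argument-parsing changes above, ``read()`` now accepts an optional size: ``0`` returns an empty bytes object immediately and ``-1`` delegates to ``readall()``. A short sketch, assuming the ``zstandard`` module name::

    import io
    import zstandard

    cctx = zstandard.ZstdCompressor()
    with cctx.stream_reader(io.BytesIO(b"data" * 1000)) as reader:
        assert reader.read(0) == b""   # size 0: empty result, no work done
        everything = reader.read(-1)   # size -1: same as reader.readall()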
335 | static PyObject* reader_read1(ZstdCompressionReader* self, PyObject* args, PyObject* kwargs) { | |
|
336 | static char* kwlist[] = { | |
|
337 | "size", | |
|
338 | NULL | |
|
339 | }; | |
|
340 | ||
|
341 | Py_ssize_t size = -1; | |
|
342 | PyObject* result = NULL; | |
|
343 | char* resultBuffer; | |
|
344 | Py_ssize_t resultSize; | |
|
345 | ZSTD_outBuffer output; | |
|
346 | int compressResult; | |
|
347 | size_t oldPos; | |
|
348 | size_t zresult; | |
|
349 | ||
|
350 | if (self->closed) { | |
|
351 | PyErr_SetString(PyExc_ValueError, "stream is closed"); | |
|
352 | return NULL; | |
|
353 | } | |
|
354 | ||
|
355 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|n:read1", kwlist, &size)) { | |
|
356 | return NULL; | |
|
357 | } | |
|
358 | ||
|
359 | if (size < -1) { | |
|
360 | PyErr_SetString(PyExc_ValueError, "cannot read negative amounts less than -1"); | |
|
361 | return NULL; | |
|
362 | } | |
|
363 | ||
|
364 | if (self->finishedOutput || size == 0) { | |
|
365 | return PyBytes_FromStringAndSize("", 0); | |
|
366 | } | |
|
367 | ||
|
368 | if (size == -1) { | |
|
369 | size = ZSTD_CStreamOutSize(); | |
|
370 | } | |
|
371 | ||
|
372 | result = PyBytes_FromStringAndSize(NULL, size); | |
|
373 | if (NULL == result) { | |
|
374 | return NULL; | |
|
375 | } | |
|
376 | ||
|
377 | PyBytes_AsStringAndSize(result, &resultBuffer, &resultSize); | |
|
378 | ||
|
379 | output.dst = resultBuffer; | |
|
380 | output.size = resultSize; | |
|
381 | output.pos = 0; | |
|
382 | ||
|
383 | /* read1() is supposed to use at most 1 read() from the underlying stream. | |
|
384 | However, we can't satisfy this requirement with compression because | |
|
385 | not every input will generate output. We /could/ flush the compressor, | |
|
386 | but this may not be desirable. We allow multiple read() from the | |
|
387 | underlying stream. But unlike read(), we return as soon as output data | |
|
388 | is available. | |
|
389 | */ | |
|
390 | ||
|
391 | compressResult = compress_input(self, &output); | |
|
392 | ||
|
393 | if (-1 == compressResult) { | |
|
394 | Py_XDECREF(result); | |
|
395 | return NULL; | |
|
396 | } | |
|
397 | else if (0 == compressResult || 1 == compressResult) { } | |
|
398 | else { | |
|
399 | assert(0); | |
|
400 | } | |
|
401 | ||
|
402 | if (output.pos) { | |
|
403 | goto finally; | |
|
404 | } | |
|
405 | ||
|
406 | while (!self->finishedInput) { | |
|
407 | int readResult = read_compressor_input(self); | |
|
408 | ||
|
409 | if (-1 == readResult) { | |
|
410 | Py_XDECREF(result); | |
|
411 | return NULL; | |
|
412 | } | |
|
413 | else if (0 == readResult || 1 == readResult) { } | |
|
414 | else { | |
|
415 | assert(0); | |
|
416 | } | |
|
417 | ||
|
418 | compressResult = compress_input(self, &output); | |
|
419 | ||
|
420 | if (-1 == compressResult) { | |
|
421 | Py_XDECREF(result); | |
|
422 | return NULL; | |
|
423 | } | |
|
424 | else if (0 == compressResult || 1 == compressResult) { } | |
|
425 | else { | |
|
426 | assert(0); | |
|
427 | } | |
|
428 | ||
|
429 | if (output.pos) { | |
|
430 | goto finally; | |
|
431 | } | |
|
432 | } | |
|
433 | ||
|
434 | /* EOF */ | |
|
435 | oldPos = output.pos; | |
|
436 | ||
|
437 | zresult = ZSTD_compressStream2(self->compressor->cctx, &output, &self->input, | |
|
438 | ZSTD_e_end); | |
|
439 | ||
|
440 | self->bytesCompressed += output.pos - oldPos; | |
|
441 | ||
|
442 | if (ZSTD_isError(zresult)) { | |
|
443 | PyErr_Format(ZstdError, "error ending compression stream: %s", | |
|
444 | ZSTD_getErrorName(zresult)); | |
|
445 | Py_XDECREF(result); | |
|
446 | return NULL; | |
|
447 | } | |
|
448 | ||
|
449 | if (zresult == 0) { | |
|
450 | self->finishedOutput = 1; | |
|
451 | } | |
|
452 | ||
|
453 | finally: | |
|
454 | if (result) { | |
|
455 | if (safe_pybytes_resize(&result, output.pos)) { | |
|
456 | Py_XDECREF(result); | |
|
457 | return NULL; | |
|
458 | } | |
|
459 | } | |
|
460 | ||
|
461 | return result; | |
|
462 | } | |
|
463 | ||
|
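As the comment in ``reader_read1()`` explains, a compressor cannot promise output for a single upstream ``read()``, so ``read1()`` loosens the usual io contract: it may read from the source several times, but it returns as soon as any compressed bytes are available rather than filling the whole buffer. A usage sketch; the default size comes from ``ZSTD_CStreamOutSize()``::

    import io
    import zstandard

    cctx = zstandard.ZstdCompressor()
    reader = cctx.stream_reader(io.BytesIO(b"x" * 1_000_000))
    chunk = reader.read1()   # first available output chunk, not a full fill
    assert 0 < len(chunk) <= zstandard.COMPRESSION_RECOMMENDED_OUTPUT_SIZE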
291 | 464 | static PyObject* reader_readall(PyObject* self) { |
|
292 | PyErr_SetNone(PyExc_NotImplementedError); | |
|
293 | return NULL; | |
|
465 | PyObject* chunks = NULL; | |
|
466 | PyObject* empty = NULL; | |
|
467 | PyObject* result = NULL; | |
|
468 | ||
|
469 | /* Our strategy is to collect chunks into a list then join all the | |
|
470 | * chunks at the end. We could potentially use e.g. an io.BytesIO. But | |
|
471 | * this feels simple enough to implement and avoids potentially expensive | |
|
472 | * reallocations of large buffers. | |
|
473 | */ | |
|
474 | chunks = PyList_New(0); | |
|
475 | if (NULL == chunks) { | |
|
476 | return NULL; | |
|
477 | } | |
|
478 | ||
|
479 | while (1) { | |
|
480 | PyObject* chunk = PyObject_CallMethod(self, "read", "i", 1048576); | |
|
481 | if (NULL == chunk) { | |
|
482 | Py_DECREF(chunks); | |
|
483 | return NULL; | |
|
484 | } | |
|
485 | ||
|
486 | if (!PyBytes_Size(chunk)) { | |
|
487 | Py_DECREF(chunk); | |
|
488 | break; | |
|
489 | } | |
|
490 | ||
|
491 | if (PyList_Append(chunks, chunk)) { | |
|
492 | Py_DECREF(chunk); | |
|
493 | Py_DECREF(chunks); | |
|
494 | return NULL; | |
|
495 | } | |
|
496 | ||
|
497 | Py_DECREF(chunk); | |
|
498 | } | |
|
499 | ||
|
500 | empty = PyBytes_FromStringAndSize("", 0); | |
|
501 | if (NULL == empty) { | |
|
502 | Py_DECREF(chunks); | |
|
503 | return NULL; | |
|
504 | } | |
|
505 | ||
|
506 | result = PyObject_CallMethod(empty, "join", "O", chunks); | |
|
507 | ||
|
508 | Py_DECREF(empty); | |
|
509 | Py_DECREF(chunks); | |
|
510 | ||
|
511 | return result; | |
|
512 | } | |
|
513 | ||
|
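``readall()`` gathers 1 MiB chunks in a list and joins them once at the end, avoiding repeated reallocations of a single large buffer. The same strategy, written out in Python::

    def readall(reader):
        chunks = []
        while True:
            chunk = reader.read(1048576)
            if not chunk:
                break
            chunks.append(chunk)
        return b"".join(chunks)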
514 | static PyObject* reader_readinto(ZstdCompressionReader* self, PyObject* args) { | |
|
515 | Py_buffer dest; | |
|
516 | ZSTD_outBuffer output; | |
|
517 | int readResult, compressResult; | |
|
518 | PyObject* result = NULL; | |
|
519 | size_t zresult; | |
|
520 | size_t oldPos; | |
|
521 | ||
|
522 | if (self->closed) { | |
|
523 | PyErr_SetString(PyExc_ValueError, "stream is closed"); | |
|
524 | return NULL; | |
|
525 | } | |
|
526 | ||
|
527 | if (self->finishedOutput) { | |
|
528 | return PyLong_FromLong(0); | |
|
529 | } | |
|
530 | ||
|
531 | if (!PyArg_ParseTuple(args, "w*:readinto", &dest)) { | |
|
532 | return NULL; | |
|
533 | } | |
|
534 | ||
|
535 | if (!PyBuffer_IsContiguous(&dest, 'C') || dest.ndim > 1) { | |
|
536 | PyErr_SetString(PyExc_ValueError, | |
|
537 | "destination buffer should be contiguous and have at most one dimension"); | |
|
538 | goto finally; | |
|
539 | } | |
|
540 | ||
|
541 | output.dst = dest.buf; | |
|
542 | output.size = dest.len; | |
|
543 | output.pos = 0; | |
|
544 | ||
|
545 | compressResult = compress_input(self, &output); | |
|
546 | ||
|
547 | if (-1 == compressResult) { | |
|
548 | goto finally; | |
|
549 | } | |
|
550 | else if (0 == compressResult) { } | |
|
551 | else if (1 == compressResult) { | |
|
552 | result = PyLong_FromSize_t(output.pos); | |
|
553 | goto finally; | |
|
554 | } | |
|
555 | else { | |
|
556 | assert(0); | |
|
557 | } | |
|
558 | ||
|
559 | while (!self->finishedInput) { | |
|
560 | readResult = read_compressor_input(self); | |
|
561 | ||
|
562 | if (-1 == readResult) { | |
|
563 | goto finally; | |
|
564 | } | |
|
565 | else if (0 == readResult || 1 == readResult) {} | |
|
566 | else { | |
|
567 | assert(0); | |
|
568 | } | |
|
569 | ||
|
570 | compressResult = compress_input(self, &output); | |
|
571 | ||
|
572 | if (-1 == compressResult) { | |
|
573 | goto finally; | |
|
574 | } | |
|
575 | else if (0 == compressResult) { } | |
|
576 | else if (1 == compressResult) { | |
|
577 | result = PyLong_FromSize_t(output.pos); | |
|
578 | goto finally; | |
|
579 | } | |
|
580 | else { | |
|
581 | assert(0); | |
|
582 | } | |
|
583 | } | |
|
584 | ||
|
585 | /* EOF */ | |
|
586 | oldPos = output.pos; | |
|
587 | ||
|
588 | zresult = ZSTD_compressStream2(self->compressor->cctx, &output, &self->input, | |
|
589 | ZSTD_e_end); | |
|
590 | ||
|
591 | self->bytesCompressed += output.pos - oldPos; | |
|
592 | ||
|
593 | if (ZSTD_isError(zresult)) { | |
|
594 | PyErr_Format(ZstdError, "error ending compression stream: %s", | |
|
595 | ZSTD_getErrorName(zresult)); | |
|
596 | goto finally; | |
|
597 | } | |
|
598 | ||
|
599 | assert(output.pos); | |
|
600 | ||
|
601 | if (0 == zresult) { | |
|
602 | self->finishedOutput = 1; | |
|
603 | } | |
|
604 | ||
|
605 | result = PyLong_FromSize_t(output.pos); | |
|
606 | ||
|
607 | finally: | |
|
608 | PyBuffer_Release(&dest); | |
|
609 | ||
|
610 | return result; | |
|
611 | } | |
|
612 | ||
|
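``readinto()`` fills a caller-provided writable buffer and returns the number of bytes written, avoiding an intermediate ``bytes`` allocation per call. A usage sketch, again assuming the ``zstandard`` module name::

    import io
    import zstandard

    cctx = zstandard.ZstdCompressor()
    reader = cctx.stream_reader(io.BytesIO(b"y" * 100000))
    buf = bytearray(65536)
    n = reader.readinto(buf)        # 0 <= n <= len(buf)
    compressed = bytes(buf[:n])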
613 | static PyObject* reader_readinto1(ZstdCompressionReader* self, PyObject* args) { | |
|
614 | Py_buffer dest; | |
|
615 | PyObject* result = NULL; | |
|
616 | ZSTD_outBuffer output; | |
|
617 | int compressResult; | |
|
618 | size_t oldPos; | |
|
619 | size_t zresult; | |
|
620 | ||
|
621 | if (self->closed) { | |
|
622 | PyErr_SetString(PyExc_ValueError, "stream is closed"); | |
|
623 | return NULL; | |
|
624 | } | |
|
625 | ||
|
626 | if (self->finishedOutput) { | |
|
627 | return PyLong_FromLong(0); | |
|
628 | } | |
|
629 | ||
|
630 | if (!PyArg_ParseTuple(args, "w*:readinto1", &dest)) { | |
|
631 | return NULL; | |
|
632 | } | |
|
633 | ||
|
634 | if (!PyBuffer_IsContiguous(&dest, 'C') || dest.ndim > 1) { | |
|
635 | PyErr_SetString(PyExc_ValueError, | |
|
636 | "destination buffer should be contiguous and have at most one dimension"); | |
|
637 | goto finally; | |
|
638 | } | |
|
639 | ||
|
640 | output.dst = dest.buf; | |
|
641 | output.size = dest.len; | |
|
642 | output.pos = 0; | |
|
643 | ||
|
644 | compressResult = compress_input(self, &output); | |
|
645 | ||
|
646 | if (-1 == compressResult) { | |
|
647 | goto finally; | |
|
648 | } | |
|
649 | else if (0 == compressResult || 1 == compressResult) { } | |
|
650 | else { | |
|
651 | assert(0); | |
|
652 | } | |
|
653 | ||
|
654 | if (output.pos) { | |
|
655 | result = PyLong_FromSize_t(output.pos); | |
|
656 | goto finally; | |
|
657 | } | |
|
658 | ||
|
659 | while (!self->finishedInput) { | |
|
660 | int readResult = read_compressor_input(self); | |
|
661 | ||
|
662 | if (-1 == readResult) { | |
|
663 | goto finally; | |
|
664 | } | |
|
665 | else if (0 == readResult || 1 == readResult) { } | |
|
666 | else { | |
|
667 | assert(0); | |
|
668 | } | |
|
669 | ||
|
670 | compressResult = compress_input(self, &output); | |
|
671 | ||
|
672 | if (-1 == compressResult) { | |
|
673 | goto finally; | |
|
674 | } | |
|
675 | else if (0 == compressResult) { } | |
|
676 | else if (1 == compressResult) { | |
|
677 | result = PyLong_FromSize_t(output.pos); | |
|
678 | goto finally; | |
|
679 | } | |
|
680 | else { | |
|
681 | assert(0); | |
|
682 | } | |
|
683 | ||
|
684 | /* If we produced output and we're not done with input, emit | |
|
685 | * that output now, as we've hit restrictions of read1(). | |
|
686 | */ | |
|
687 | if (output.pos && !self->finishedInput) { | |
|
688 | result = PyLong_FromSize_t(output.pos); | |
|
689 | goto finally; | |
|
690 | } | |
|
691 | ||
|
692 | /* Otherwise we either have no output or we've exhausted the | |
|
693 | * input. Either we try to get more input or we fall through | |
|
694 | * to EOF below */ | |
|
695 | } | |
|
696 | ||
|
697 | /* EOF */ | |
|
698 | oldPos = output.pos; | |
|
699 | ||
|
700 | zresult = ZSTD_compressStream2(self->compressor->cctx, &output, &self->input, | |
|
701 | ZSTD_e_end); | |
|
702 | ||
|
703 | self->bytesCompressed += output.pos - oldPos; | |
|
704 | ||
|
705 | if (ZSTD_isError(zresult)) { | |
|
706 | PyErr_Format(ZstdError, "error ending compression stream: %s", | |
|
707 | ZSTD_getErrorName(zresult)); | |
|
708 | goto finally; | |
|
709 | } | |
|
710 | ||
|
711 | assert(output.pos); | |
|
712 | ||
|
713 | if (0 == zresult) { | |
|
714 | self->finishedOutput = 1; | |
|
715 | } | |
|
716 | ||
|
717 | result = PyLong_FromSize_t(output.pos); | |
|
718 | ||
|
719 | finally: | |
|
720 | PyBuffer_Release(&dest); | |
|
721 | ||
|
722 | return result; | |
|
294 | 723 | } |
|
295 | 724 | |
|
296 | 725 | static PyObject* reader_iter(PyObject* self) { |
@@ -315,7 +744,10 b' static PyMethodDef reader_methods[] = {' | |||
|
315 | 744 | { "readable", (PyCFunction)reader_readable, METH_NOARGS, |
|
316 | 745 | PyDoc_STR("Returns True") }, |
|
317 | 746 | { "read", (PyCFunction)reader_read, METH_VARARGS | METH_KEYWORDS, PyDoc_STR("read compressed data") }, |
|
747 | { "read1", (PyCFunction)reader_read1, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
318 | 748 | { "readall", (PyCFunction)reader_readall, METH_NOARGS, PyDoc_STR("Not implemented") }, |
|
749 | { "readinto", (PyCFunction)reader_readinto, METH_VARARGS, NULL }, | |
|
750 | { "readinto1", (PyCFunction)reader_readinto1, METH_VARARGS, NULL }, | |
|
319 | 751 | { "readline", (PyCFunction)reader_readline, METH_VARARGS, PyDoc_STR("Not implemented") }, |
|
320 | 752 | { "readlines", (PyCFunction)reader_readlines, METH_VARARGS, PyDoc_STR("Not implemented") }, |
|
321 | 753 | { "seekable", (PyCFunction)reader_seekable, METH_NOARGS, |
@@ -18,24 +18,23 b' static void ZstdCompressionWriter_deallo' | |||
|
18 | 18 | Py_XDECREF(self->compressor); |
|
19 | 19 | Py_XDECREF(self->writer); |
|
20 | 20 | |
|
21 | PyMem_Free(self->output.dst); | |
|
22 | self->output.dst = NULL; | |
|
23 | ||
|
21 | 24 | PyObject_Del(self); |
|
22 | 25 | } |
|
23 | 26 | |
|
24 | 27 | static PyObject* ZstdCompressionWriter_enter(ZstdCompressionWriter* self) { |
|
25 | size_t zresult; | |
|
28 | if (self->closed) { | |
|
29 | PyErr_SetString(PyExc_ValueError, "stream is closed"); | |
|
30 | return NULL; | |
|
31 | } | |
|
26 | 32 | |
|
27 | 33 | if (self->entered) { |
|
28 | 34 | PyErr_SetString(ZstdError, "cannot __enter__ multiple times"); |
|
29 | 35 | return NULL; |
|
30 | 36 | } |
|
31 | 37 | |
|
32 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->compressor->cctx, self->sourceSize); | |
|
33 | if (ZSTD_isError(zresult)) { | |
|
34 | PyErr_Format(ZstdError, "error setting source size: %s", | |
|
35 | ZSTD_getErrorName(zresult)); | |
|
36 | return NULL; | |
|
37 | } | |
|
38 | ||
|
39 | 38 | self->entered = 1; |
|
40 | 39 | |
|
41 | 40 | Py_INCREF(self); |
@@ -46,10 +45,6 b' static PyObject* ZstdCompressionWriter_e' | |||
|
46 | 45 | PyObject* exc_type; |
|
47 | 46 | PyObject* exc_value; |
|
48 | 47 | PyObject* exc_tb; |
|
49 | size_t zresult; | |
|
50 | ||
|
51 | ZSTD_outBuffer output; | |
|
52 | PyObject* res; | |
|
53 | 48 | |
|
54 | 49 | if (!PyArg_ParseTuple(args, "OOO:__exit__", &exc_type, &exc_value, &exc_tb)) { |
|
55 | 50 | return NULL; |
@@ -58,46 +53,11 b' static PyObject* ZstdCompressionWriter_e' | |||
|
58 | 53 | self->entered = 0; |
|
59 | 54 | |
|
60 | 55 | if (exc_type == Py_None && exc_value == Py_None && exc_tb == Py_None) { |
|
61 | ZSTD_inBuffer inBuffer; | |
|
62 | ||
|
63 | inBuffer.src = NULL; | |
|
64 | inBuffer.size = 0; | |
|
65 | inBuffer.pos = 0; | |
|
66 | ||
|
67 | output.dst = PyMem_Malloc(self->outSize); | |
|
68 | if (!output.dst) { | |
|
69 | return PyErr_NoMemory(); | |
|
70 | } | |
|
71 | output.size = self->outSize; | |
|
72 | output.pos = 0; | |
|
56 | PyObject* result = PyObject_CallMethod((PyObject*)self, "close", NULL); | |
|
73 | 57 | |
|
74 | while (1) { | |
|
75 | zresult = ZSTD_compress_generic(self->compressor->cctx, &output, &inBuffer, ZSTD_e_end); | |
|
76 | if (ZSTD_isError(zresult)) { | |
|
77 | PyErr_Format(ZstdError, "error ending compression stream: %s", | |
|
78 | ZSTD_getErrorName(zresult)); | |
|
79 | PyMem_Free(output.dst); | |
|
80 | return NULL; | |
|
81 | } | |
|
82 | ||
|
83 | if (output.pos) { | |
|
84 | #if PY_MAJOR_VERSION >= 3 | |
|
85 | res = PyObject_CallMethod(self->writer, "write", "y#", | |
|
86 | #else | |
|
87 | res = PyObject_CallMethod(self->writer, "write", "s#", | |
|
88 | #endif | |
|
89 | output.dst, output.pos); | |
|
90 | Py_XDECREF(res); | |
|
91 | } | |
|
92 | ||
|
93 | if (!zresult) { | |
|
94 | break; | |
|
95 | } | |
|
96 | ||
|
97 | output.pos = 0; | |
|
58 | if (NULL == result) { | |
|
59 | return NULL; | |
|
98 | 60 | } |
|
99 | ||
|
100 | PyMem_Free(output.dst); | |
|
101 | 61 | } |
|
102 | 62 | |
|
103 | 63 | Py_RETURN_FALSE; |
@@ -117,7 +77,6 b' static PyObject* ZstdCompressionWriter_w' | |||
|
117 | 77 | Py_buffer source; |
|
118 | 78 | size_t zresult; |
|
119 | 79 | ZSTD_inBuffer input; |
|
120 | ZSTD_outBuffer output; | |
|
121 | 80 | PyObject* res; |
|
122 | 81 | Py_ssize_t totalWrite = 0; |
|
123 | 82 | |
@@ -130,143 +89,240 b' static PyObject* ZstdCompressionWriter_w' | |||
|
130 | 89 | return NULL; |
|
131 | 90 | } |
|
132 | 91 | |
|
133 | if (!self->entered) { | |
|
134 | PyErr_SetString(ZstdError, "compress must be called from an active context manager"); | |
|
135 | goto finally; | |
|
136 | } | |
|
137 | ||
|
138 | 92 | if (!PyBuffer_IsContiguous(&source, 'C') || source.ndim > 1) { |
|
139 | 93 | PyErr_SetString(PyExc_ValueError, |
|
140 | 94 | "data buffer should be contiguous and have at most one dimension"); |
|
141 | 95 | goto finally; |
|
142 | 96 | } |
|
143 | 97 | |
|
144 | output.dst = PyMem_Malloc(self->outSize); | |
|
145 | if (!output.dst) { | |
|
146 | PyErr_NoMemory(); | |
|
147 | goto finally; | |
|
98 | if (self->closed) { | |
|
99 | PyErr_SetString(PyExc_ValueError, "stream is closed"); | |
|
100 | return NULL; | |
|
148 | 101 | } |
|
149 | output.size = self->outSize; | |
|
150 | output.pos = 0; | |
|
102 | ||
|
103 | self->output.pos = 0; | |
|
151 | 104 | |
|
152 | 105 | input.src = source.buf; |
|
153 | 106 | input.size = source.len; |
|
154 | 107 | input.pos = 0; |
|
155 | 108 | |
|
156 | while ((ssize_t)input.pos < source.len) { | |
|
109 | while (input.pos < (size_t)source.len) { | |
|
157 | 110 | Py_BEGIN_ALLOW_THREADS |
|
158 | zresult = ZSTD_compress_generic(self->compressor->cctx, &output, &input, ZSTD_e_continue); | |
|
111 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, &input, ZSTD_e_continue); | |
|
159 | 112 | Py_END_ALLOW_THREADS |
|
160 | 113 | |
|
161 | 114 | if (ZSTD_isError(zresult)) { |
|
162 | PyMem_Free(output.dst); | |
|
163 | 115 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); |
|
164 | 116 | goto finally; |
|
165 | 117 | } |
|
166 | 118 | |
|
167 | 119 | /* Copy data from output buffer to writer. */ |
|
168 | if (output.pos) { | |
|
120 | if (self->output.pos) { | |
|
169 | 121 | #if PY_MAJOR_VERSION >= 3 |
|
170 | 122 | res = PyObject_CallMethod(self->writer, "write", "y#", |
|
171 | 123 | #else |
|
172 | 124 | res = PyObject_CallMethod(self->writer, "write", "s#", |
|
173 | 125 | #endif |
|
174 | output.dst, output.pos); | |
|
126 | self->output.dst, self->output.pos); | |
|
175 | 127 | Py_XDECREF(res); |
|
176 | totalWrite += output.pos; | |
|
177 | self->bytesCompressed += output.pos; | |
|
128 | totalWrite += self->output.pos; | |
|
129 | self->bytesCompressed += self->output.pos; | |
|
178 | 130 | } |
|
179 | output.pos = 0; | |
|
131 | self->output.pos = 0; | |
|
180 | 132 | } |
|
181 | 133 | |
|
182 | PyMem_Free(output.dst); | |
|
183 | ||
|
184 | result = PyLong_FromSsize_t(totalWrite); | |
|
134 | if (self->writeReturnRead) { | |
|
135 | result = PyLong_FromSize_t(input.pos); | |
|
136 | } | |
|
137 | else { | |
|
138 | result = PyLong_FromSsize_t(totalWrite); | |
|
139 | } | |
|
185 | 140 | |
|
186 | 141 | finally: |
|
187 | 142 | PyBuffer_Release(&source); |
|
188 | 143 | return result; |
|
189 | 144 | } |
|
190 | 145 | |
|
191 | static PyObject* ZstdCompressionWriter_flush(ZstdCompressionWriter* self, PyObject* args) { | |
|
146 | static PyObject* ZstdCompressionWriter_flush(ZstdCompressionWriter* self, PyObject* args, PyObject* kwargs) { | |
|
147 | static char* kwlist[] = { | |
|
148 | "flush_mode", | |
|
149 | NULL | |
|
150 | }; | |
|
151 | ||
|
192 | 152 | size_t zresult; |
|
193 | ZSTD_outBuffer output; | |
|
194 | 153 | ZSTD_inBuffer input; |
|
195 | 154 | PyObject* res; |
|
196 | 155 | Py_ssize_t totalWrite = 0; |
|
156 | unsigned flush_mode = 0; | |
|
157 | ZSTD_EndDirective flush; | |
|
197 | 158 | |
|
198 | if (!self->entered) { | |
|
199 | PyErr_SetString(ZstdError, "flush must be called from an active context manager"); | |
|
159 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|I:flush", | |
|
160 | kwlist, &flush_mode)) { | |
|
200 | 161 | return NULL; |
|
201 | 162 | } |
|
202 | 163 | |
|
164 | switch (flush_mode) { | |
|
165 | case 0: | |
|
166 | flush = ZSTD_e_flush; | |
|
167 | break; | |
|
168 | case 1: | |
|
169 | flush = ZSTD_e_end; | |
|
170 | break; | |
|
171 | default: | |
|
172 | PyErr_Format(PyExc_ValueError, "unknown flush_mode: %d", flush_mode); | |
|
173 | return NULL; | |
|
174 | } | |
|
175 | ||
|
176 | if (self->closed) { | |
|
177 | PyErr_SetString(PyExc_ValueError, "stream is closed"); | |
|
178 | return NULL; | |
|
179 | } | |
|
180 | ||
|
181 | self->output.pos = 0; | |
|
182 | ||
|
203 | 183 | input.src = NULL; |
|
204 | 184 | input.size = 0; |
|
205 | 185 | input.pos = 0; |
|
206 | 186 | |
|
207 | output.dst = PyMem_Malloc(self->outSize); | |
|
208 | if (!output.dst) { | |
|
209 | return PyErr_NoMemory(); | |
|
210 | } | |
|
211 | output.size = self->outSize; | |
|
212 | output.pos = 0; | |
|
213 | ||
|
214 | 187 | while (1) { |
|
215 | 188 | Py_BEGIN_ALLOW_THREADS |
|
216 | zresult = ZSTD_compress_generic(self->compressor->cctx, &output, &input, ZSTD_e_flush); | |
|
189 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, &input, flush); | |
|
217 | 190 | Py_END_ALLOW_THREADS |
|
218 | 191 | |
|
219 | 192 | if (ZSTD_isError(zresult)) { |
|
220 | PyMem_Free(output.dst); | |
|
221 | 193 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); |
|
222 | 194 | return NULL; |
|
223 | 195 | } |
|
224 | 196 | |
|
225 | 197 | /* Copy data from output buffer to writer. */ |
|
226 | if (output.pos) { | |
|
198 | if (self->output.pos) { | |
|
227 | 199 | #if PY_MAJOR_VERSION >= 3 |
|
228 | 200 | res = PyObject_CallMethod(self->writer, "write", "y#", |
|
229 | 201 | #else |
|
230 | 202 | res = PyObject_CallMethod(self->writer, "write", "s#", |
|
231 | 203 | #endif |
|
232 | output.dst, output.pos); | |
|
204 | self->output.dst, self->output.pos); | |
|
233 | 205 | Py_XDECREF(res); |
|
234 | totalWrite += output.pos; | |
|
235 | self->bytesCompressed += output.pos; | |
|
206 | totalWrite += self->output.pos; | |
|
207 | self->bytesCompressed += self->output.pos; | |
|
236 | 208 | } |
|
237 | 209 | |
|
238 | output.pos = 0; | |
|
210 | self->output.pos = 0; | |
|
239 | 211 | |
|
240 | 212 | if (!zresult) { |
|
241 | 213 | break; |
|
242 | 214 | } |
|
243 | 215 | } |
|
244 | 216 | |
|
245 | PyMem_Free(output.dst); | |
|
217 | return PyLong_FromSsize_t(totalWrite); | |
|
218 | } | |
|
219 | ||
|
220 | static PyObject* ZstdCompressionWriter_close(ZstdCompressionWriter* self) { | |
|
221 | PyObject* result; | |
|
222 | ||
|
223 | if (self->closed) { | |
|
224 | Py_RETURN_NONE; | |
|
225 | } | |
|
226 | ||
|
227 | result = PyObject_CallMethod((PyObject*)self, "flush", "I", 1); | |
|
228 | self->closed = 1; | |
|
229 | ||
|
230 | if (NULL == result) { | |
|
231 | return NULL; | |
|
232 | } | |
|
246 | 233 | |
|
247 | return PyLong_FromSsize_t(totalWrite); | |
|
234 | /* Call close on underlying stream as well. */ | |
|
235 | if (PyObject_HasAttrString(self->writer, "close")) { | |
|
236 | return PyObject_CallMethod(self->writer, "close", NULL); | |
|
237 | } | |
|
238 | ||
|
239 | Py_RETURN_NONE; | |
|
240 | } | |
|
241 | ||
|
242 | static PyObject* ZstdCompressionWriter_fileno(ZstdCompressionWriter* self) { | |
|
243 | if (PyObject_HasAttrString(self->writer, "fileno")) { | |
|
244 | return PyObject_CallMethod(self->writer, "fileno", NULL); | |
|
245 | } | |
|
246 | else { | |
|
247 | PyErr_SetString(PyExc_OSError, "fileno not available on underlying writer"); | |
|
248 | return NULL; | |
|
249 | } | |
|
248 | 250 | } |
|
249 | 251 | |
|
250 | 252 | static PyObject* ZstdCompressionWriter_tell(ZstdCompressionWriter* self) { |
|
251 | 253 | return PyLong_FromUnsignedLongLong(self->bytesCompressed); |
|
252 | 254 | } |
|
253 | 255 | |
|
256 | static PyObject* ZstdCompressionWriter_writelines(PyObject* self, PyObject* args) { | |
|
257 | PyErr_SetNone(PyExc_NotImplementedError); | |
|
258 | return NULL; | |
|
259 | } | |
|
260 | ||
|
261 | static PyObject* ZstdCompressionWriter_false(PyObject* self, PyObject* args) { | |
|
262 | Py_RETURN_FALSE; | |
|
263 | } | |
|
264 | ||
|
265 | static PyObject* ZstdCompressionWriter_true(PyObject* self, PyObject* args) { | |
|
266 | Py_RETURN_TRUE; | |
|
267 | } | |
|
268 | ||
|
269 | static PyObject* ZstdCompressionWriter_unsupported(PyObject* self, PyObject* args, PyObject* kwargs) { | |
|
270 | PyObject* iomod; | |
|
271 | PyObject* exc; | |
|
272 | ||
|
273 | iomod = PyImport_ImportModule("io"); | |
|
274 | if (NULL == iomod) { | |
|
275 | return NULL; | |
|
276 | } | |
|
277 | ||
|
278 | exc = PyObject_GetAttrString(iomod, "UnsupportedOperation"); | |
|
279 | if (NULL == exc) { | |
|
280 | Py_DECREF(iomod); | |
|
281 | return NULL; | |
|
282 | } | |
|
283 | ||
|
284 | PyErr_SetNone(exc); | |
|
285 | Py_DECREF(exc); | |
|
286 | Py_DECREF(iomod); | |
|
287 | ||
|
288 | return NULL; | |
|
289 | } | |
|
290 | ||
|
254 | 291 | static PyMethodDef ZstdCompressionWriter_methods[] = { |
|
255 | 292 | { "__enter__", (PyCFunction)ZstdCompressionWriter_enter, METH_NOARGS, |
|
256 | 293 | PyDoc_STR("Enter a compression context.") }, |
|
257 | 294 | { "__exit__", (PyCFunction)ZstdCompressionWriter_exit, METH_VARARGS, |
|
258 | 295 | PyDoc_STR("Exit a compression context.") }, |
|
296 | { "close", (PyCFunction)ZstdCompressionWriter_close, METH_NOARGS, NULL }, | |
|
297 | { "fileno", (PyCFunction)ZstdCompressionWriter_fileno, METH_NOARGS, NULL }, | |
|
298 | { "isatty", (PyCFunction)ZstdCompressionWriter_false, METH_NOARGS, NULL }, | |
|
299 | { "readable", (PyCFunction)ZstdCompressionWriter_false, METH_NOARGS, NULL }, | |
|
300 | { "readline", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
301 | { "readlines", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
302 | { "seek", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
303 | { "seekable", ZstdCompressionWriter_false, METH_NOARGS, NULL }, | |
|
304 | { "truncate", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
305 | { "writable", ZstdCompressionWriter_true, METH_NOARGS, NULL }, | |
|
306 | { "writelines", ZstdCompressionWriter_writelines, METH_VARARGS, NULL }, | |
|
307 | { "read", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
308 | { "readall", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
309 | { "readinto", (PyCFunction)ZstdCompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL }, | |
|
259 | 310 | { "memory_size", (PyCFunction)ZstdCompressionWriter_memory_size, METH_NOARGS, |
|
260 | 311 | PyDoc_STR("Obtain the memory size of the underlying compressor") }, |
|
261 | 312 | { "write", (PyCFunction)ZstdCompressionWriter_write, METH_VARARGS | METH_KEYWORDS, |
|
262 | 313 | PyDoc_STR("Compress data") }, |
|
263 | { "flush", (PyCFunction)ZstdCompressionWriter_flush, METH_NOARGS, | |
|
314 | { "flush", (PyCFunction)ZstdCompressionWriter_flush, METH_VARARGS | METH_KEYWORDS, | |
|
264 | 315 | PyDoc_STR("Flush data and finish a zstd frame") }, |
|
265 | 316 | { "tell", (PyCFunction)ZstdCompressionWriter_tell, METH_NOARGS, |
|
266 | 317 | PyDoc_STR("Returns current number of bytes compressed") }, |
|
267 | 318 | { NULL, NULL } |
|
268 | 319 | }; |
|
269 | 320 | |
|
321 | static PyMemberDef ZstdCompressionWriter_members[] = { | |
|
322 | { "closed", T_BOOL, offsetof(ZstdCompressionWriter, closed), READONLY, NULL }, | |
|
323 | { NULL } | |
|
324 | }; | |
|
325 | ||
|
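Taken together, the writer now implements the full ``io.RawIOBase`` surface: ``flush()`` accepts a ``flush_mode`` selecting between flushing the current block (``0``) and ending the frame (``1``), and ``close()`` performs a frame-ending flush before closing the inner stream. A sketch using the ``FLUSH_BLOCK``/``FLUSH_FRAME`` constants registered in ``constants.c`` later in this diff::

    import io
    import zstandard

    dest = io.BytesIO()
    cctx = zstandard.ZstdCompressor()
    writer = cctx.stream_writer(dest)
    writer.write(b"hello")
    writer.flush(zstandard.FLUSH_BLOCK)  # emit buffered data, keep frame open
    writer.write(b" world")
    writer.close()                       # flush(FLUSH_FRAME), then dest.close()
    assert writer.closed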
270 | 326 | PyTypeObject ZstdCompressionWriterType = { |
|
271 | 327 | PyVarObject_HEAD_INIT(NULL, 0) |
|
272 | 328 | "zstd.ZstdCompressionWriter", /* tp_name */ |
@@ -296,7 +352,7 b' PyTypeObject ZstdCompressionWriterType =' | |||
|
296 | 352 | 0, /* tp_iter */ |
|
297 | 353 | 0, /* tp_iternext */ |
|
298 | 354 | ZstdCompressionWriter_methods, /* tp_methods */ |
|
299 | 0, /* tp_members */ | |
|
355 | ZstdCompressionWriter_members, /* tp_members */ | |
|
300 | 356 | 0, /* tp_getset */ |
|
301 | 357 | 0, /* tp_base */ |
|
302 | 358 | 0, /* tp_dict */ |
@@ -59,9 +59,9 b' static PyObject* ZstdCompressionObj_comp' | |||
|
59 | 59 | input.size = source.len; |
|
60 | 60 | input.pos = 0; |
|
61 | 61 | |
|
62 | while ((ssize_t)input.pos < source.len) { | |
|
62 | while (input.pos < (size_t)source.len) { | |
|
63 | 63 | Py_BEGIN_ALLOW_THREADS |
|
64 | zresult = ZSTD_compress_generic(self->compressor->cctx, &self->output, | |
|
64 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, | |
|
65 | 65 | &input, ZSTD_e_continue); |
|
66 | 66 | Py_END_ALLOW_THREADS |
|
67 | 67 | |
@@ -154,7 +154,7 b' static PyObject* ZstdCompressionObj_flus' | |||
|
154 | 154 | |
|
155 | 155 | while (1) { |
|
156 | 156 | Py_BEGIN_ALLOW_THREADS |
|
157 | zresult = ZSTD_compress_generic(self->compressor->cctx, &self->output, | |
|
157 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, | |
|
158 | 158 | &input, zFlushMode); |
|
159 | 159 | Py_END_ALLOW_THREADS |
|
160 | 160 |
@@ -204,27 +204,27 b' static int ZstdCompressor_init(ZstdCompr' | |||
|
204 | 204 | } |
|
205 | 205 | } |
|
206 | 206 | else { |
|
207 | if (set_parameter(self->params, ZSTD_p_compressionLevel, level)) { | |
|
207 | if (set_parameter(self->params, ZSTD_c_compressionLevel, level)) { | |
|
208 | 208 | return -1; |
|
209 | 209 | } |
|
210 | 210 | |
|
211 | if (set_parameter(self->params, ZSTD_p_contentSizeFlag, | |
|
211 | if (set_parameter(self->params, ZSTD_c_contentSizeFlag, | |
|
212 | 212 | writeContentSize ? PyObject_IsTrue(writeContentSize) : 1)) { |
|
213 | 213 | return -1; |
|
214 | 214 | } |
|
215 | 215 | |
|
216 | if (set_parameter(self->params, ZSTD_p_checksumFlag, | |
|
216 | if (set_parameter(self->params, ZSTD_c_checksumFlag, | |
|
217 | 217 | writeChecksum ? PyObject_IsTrue(writeChecksum) : 0)) { |
|
218 | 218 | return -1; |
|
219 | 219 | } |
|
220 | 220 | |
|
221 | if (set_parameter(self->params, ZSTD_p_dictIDFlag, | |
|
221 | if (set_parameter(self->params, ZSTD_c_dictIDFlag, | |
|
222 | 222 | writeDictID ? PyObject_IsTrue(writeDictID) : 1)) { |
|
223 | 223 | return -1; |
|
224 | 224 | } |
|
225 | 225 | |
|
226 | 226 | if (threads) { |
|
227 | if (set_parameter(self->params, ZSTD_p_nbWorkers, threads)) { | |
|
227 | if (set_parameter(self->params, ZSTD_c_nbWorkers, threads)) { | |
|
228 | 228 | return -1; |
|
229 | 229 | } |
|
230 | 230 | } |
@@ -344,7 +344,7 b' static PyObject* ZstdCompressor_copy_str' | |||
|
344 | 344 | return NULL; |
|
345 | 345 | } |
|
346 | 346 | |
|
347 | ZSTD_CCtx_reset(self->cctx); | |
|
347 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
348 | 348 | |
|
349 | 349 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->cctx, sourceSize); |
|
350 | 350 | if (ZSTD_isError(zresult)) { |
@@ -391,7 +391,7 b' static PyObject* ZstdCompressor_copy_str' | |||
|
391 | 391 | |
|
392 | 392 | while (input.pos < input.size) { |
|
393 | 393 | Py_BEGIN_ALLOW_THREADS |
|
394 | zresult = ZSTD_compress_generic(self->cctx, &output, &input, ZSTD_e_continue); | |
|
394 | zresult = ZSTD_compressStream2(self->cctx, &output, &input, ZSTD_e_continue); | |
|
395 | 395 | Py_END_ALLOW_THREADS |
|
396 | 396 | |
|
397 | 397 | if (ZSTD_isError(zresult)) { |
@@ -421,7 +421,7 b' static PyObject* ZstdCompressor_copy_str' | |||
|
421 | 421 | |
|
422 | 422 | while (1) { |
|
423 | 423 | Py_BEGIN_ALLOW_THREADS |
|
424 | zresult = ZSTD_compress_generic(self->cctx, &output, &input, ZSTD_e_end); | |
|
424 | zresult = ZSTD_compressStream2(self->cctx, &output, &input, ZSTD_e_end); | |
|
425 | 425 | Py_END_ALLOW_THREADS |
|
426 | 426 | |
|
427 | 427 | if (ZSTD_isError(zresult)) { |
@@ -517,7 +517,7 b' static ZstdCompressionReader* ZstdCompre' | |||
|
517 | 517 | goto except; |
|
518 | 518 | } |
|
519 | 519 | |
|
520 | ZSTD_CCtx_reset(self->cctx); | |
|
520 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
521 | 521 | |
|
522 | 522 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->cctx, sourceSize); |
|
523 | 523 | if (ZSTD_isError(zresult)) { |
@@ -577,7 +577,7 b' static PyObject* ZstdCompressor_compress' | |||
|
577 | 577 | goto finally; |
|
578 | 578 | } |
|
579 | 579 | |
|
580 | ZSTD_CCtx_reset(self->cctx); | |
|
580 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
581 | 581 | |
|
582 | 582 | destSize = ZSTD_compressBound(source.len); |
|
583 | 583 | output = PyBytes_FromStringAndSize(NULL, destSize); |
@@ -605,7 +605,7 b' static PyObject* ZstdCompressor_compress' | |||
|
605 | 605 | /* By avoiding ZSTD_compress(), we don't necessarily write out content |
|
606 | 606 | size. This means the argument to ZstdCompressor to control frame |
|
607 | 607 | parameters is honored. */ |
|
608 | zresult = ZSTD_compress_generic(self->cctx, &outBuffer, &inBuffer, ZSTD_e_end); | |
|
608 | zresult = ZSTD_compressStream2(self->cctx, &outBuffer, &inBuffer, ZSTD_e_end); | |
|
609 | 609 | Py_END_ALLOW_THREADS |
|
610 | 610 | |
|
611 | 611 | if (ZSTD_isError(zresult)) { |
@@ -651,7 +651,7 b' static ZstdCompressionObj* ZstdCompresso' | |||
|
651 | 651 | return NULL; |
|
652 | 652 | } |
|
653 | 653 | |
|
654 | ZSTD_CCtx_reset(self->cctx); | |
|
654 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
655 | 655 | |
|
656 | 656 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->cctx, inSize); |
|
657 | 657 | if (ZSTD_isError(zresult)) { |
@@ -740,7 +740,7 b' static ZstdCompressorIterator* ZstdCompr' | |||
|
740 | 740 | goto except; |
|
741 | 741 | } |
|
742 | 742 | |
|
743 | ZSTD_CCtx_reset(self->cctx); | |
|
743 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
744 | 744 | |
|
745 | 745 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->cctx, sourceSize); |
|
746 | 746 | if (ZSTD_isError(zresult)) { |
@@ -794,16 +794,19 b' static ZstdCompressionWriter* ZstdCompre' | |||
|
794 | 794 | "writer", |
|
795 | 795 | "size", |
|
796 | 796 | "write_size", |
|
797 | "write_return_read", | |
|
797 | 798 | NULL |
|
798 | 799 | }; |
|
799 | 800 | |
|
800 | 801 | PyObject* writer; |
|
801 | 802 | ZstdCompressionWriter* result; |
|
803 | size_t zresult; | |
|
802 | 804 | unsigned long long sourceSize = ZSTD_CONTENTSIZE_UNKNOWN; |
|
803 | 805 | size_t outSize = ZSTD_CStreamOutSize(); |
|
806 | PyObject* writeReturnRead = NULL; | |
|
804 | 807 | |
|
805 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|Kk:stream_writer", kwlist, | |
|
806 | &writer, &sourceSize, &outSize)) { | |
|
808 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|KkO:stream_writer", kwlist, | |
|
809 | &writer, &sourceSize, &outSize, &writeReturnRead)) { | |
|
807 | 810 | return NULL; |
|
808 | 811 | } |
|
809 | 812 | |
@@ -812,22 +815,38 b' static ZstdCompressionWriter* ZstdCompre' | |||
|
812 | 815 | return NULL; |
|
813 | 816 | } |
|
814 | 817 | |
|
815 | ZSTD_CCtx_reset(self->cctx); | |
|
818 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
819 | ||
|
820 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->cctx, sourceSize); | |
|
821 | if (ZSTD_isError(zresult)) { | |
|
822 | PyErr_Format(ZstdError, "error setting source size: %s", | |
|
823 | ZSTD_getErrorName(zresult)); | |
|
824 | return NULL; | |
|
825 | } | |
|
816 | 826 | |
|
817 | 827 | result = (ZstdCompressionWriter*)PyObject_CallObject((PyObject*)&ZstdCompressionWriterType, NULL); |
|
818 | 828 | if (!result) { |
|
819 | 829 | return NULL; |
|
820 | 830 | } |
|
821 | 831 | |
|
832 | result->output.dst = PyMem_Malloc(outSize); | |
|
833 | if (!result->output.dst) { | |
|
834 | Py_DECREF(result); | |
|
835 | return (ZstdCompressionWriter*)PyErr_NoMemory(); | |
|
836 | } | |
|
837 | ||
|
838 | result->output.pos = 0; | |
|
839 | result->output.size = outSize; | |
|
840 | ||
|
822 | 841 | result->compressor = self; |
|
823 | 842 | Py_INCREF(result->compressor); |
|
824 | 843 | |
|
825 | 844 | result->writer = writer; |
|
826 | 845 | Py_INCREF(result->writer); |
|
827 | 846 | |
|
828 | result->sourceSize = sourceSize; | |
|
829 | 847 | result->outSize = outSize; |
|
830 | 848 | result->bytesCompressed = 0; |
|
849 | result->writeReturnRead = writeReturnRead ? PyObject_IsTrue(writeReturnRead) : 0; | |
|
831 | 850 | |
|
832 | 851 | return result; |
|
833 | 852 | } |
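``stream_writer()`` also grows a ``write_return_read`` flag: when true, ``write()`` returns the number of input bytes consumed (the ``io.RawIOBase`` convention) instead of the number of compressed bytes handed to the inner stream. For example::

    import io
    import zstandard

    cctx = zstandard.ZstdCompressor()
    writer = cctx.stream_writer(io.BytesIO(), write_return_read=True)
    assert writer.write(b"payload") == len(b"payload")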
@@ -853,7 +872,7 b' static ZstdCompressionChunker* ZstdCompr' | |||
|
853 | 872 | return NULL; |
|
854 | 873 | } |
|
855 | 874 | |
|
856 | ZSTD_CCtx_reset(self->cctx); | |
|
875 | ZSTD_CCtx_reset(self->cctx, ZSTD_reset_session_only); | |
|
857 | 876 | |
|
858 | 877 | zresult = ZSTD_CCtx_setPledgedSrcSize(self->cctx, sourceSize); |
|
859 | 878 | if (ZSTD_isError(zresult)) { |
@@ -1115,7 +1134,7 b' static void compress_worker(WorkerState*' | |||
|
1115 | 1134 | break; |
|
1116 | 1135 | } |
|
1117 | 1136 | |
|
1118 | zresult = ZSTD_compress_generic(state->cctx, &opOutBuffer, &opInBuffer, ZSTD_e_end); | |
|
1137 | zresult = ZSTD_compressStream2(state->cctx, &opOutBuffer, &opInBuffer, ZSTD_e_end); | |
|
1119 | 1138 | if (ZSTD_isError(zresult)) { |
|
1120 | 1139 | state->error = WorkerError_zstd; |
|
1121 | 1140 | state->zresult = zresult; |
@@ -57,7 +57,7 b' feedcompressor:' | |||
|
57 | 57 | /* If we have data left in the input, consume it. */ |
|
58 | 58 | if (self->input.pos < self->input.size) { |
|
59 | 59 | Py_BEGIN_ALLOW_THREADS |
|
60 | zresult = ZSTD_compress_generic(self->compressor->cctx, &self->output, | |
|
60 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, | |
|
61 | 61 | &self->input, ZSTD_e_continue); |
|
62 | 62 | Py_END_ALLOW_THREADS |
|
63 | 63 | |
@@ -127,7 +127,7 b' feedcompressor:' | |||
|
127 | 127 | self->input.size = 0; |
|
128 | 128 | self->input.pos = 0; |
|
129 | 129 | |
|
130 | zresult = ZSTD_compress_generic(self->compressor->cctx, &self->output, | |
|
130 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, | |
|
131 | 131 | &self->input, ZSTD_e_end); |
|
132 | 132 | if (ZSTD_isError(zresult)) { |
|
133 | 133 | PyErr_Format(ZstdError, "error ending compression stream: %s", |
@@ -152,7 +152,7 b' feedcompressor:' | |||
|
152 | 152 | self->input.pos = 0; |
|
153 | 153 | |
|
154 | 154 | Py_BEGIN_ALLOW_THREADS |
|
155 | zresult = ZSTD_compress_generic(self->compressor->cctx, &self->output, | |
|
155 | zresult = ZSTD_compressStream2(self->compressor->cctx, &self->output, | |
|
156 | 156 | &self->input, ZSTD_e_continue); |
|
157 | 157 | Py_END_ALLOW_THREADS |
|
158 | 158 |
@@ -32,6 +32,9 b' void constants_module_init(PyObject* mod' | |||
|
32 | 32 | ZstdError = PyErr_NewException("zstd.ZstdError", NULL, NULL); |
|
33 | 33 | PyModule_AddObject(mod, "ZstdError", ZstdError); |
|
34 | 34 | |
|
35 | PyModule_AddIntConstant(mod, "FLUSH_BLOCK", 0); | |
|
36 | PyModule_AddIntConstant(mod, "FLUSH_FRAME", 1); | |
|
37 | ||
|
35 | 38 | PyModule_AddIntConstant(mod, "COMPRESSOBJ_FLUSH_FINISH", compressorobj_flush_finish); |
|
36 | 39 | PyModule_AddIntConstant(mod, "COMPRESSOBJ_FLUSH_BLOCK", compressorobj_flush_block); |
|
37 | 40 | |
@@ -77,8 +80,11 b' void constants_module_init(PyObject* mod' | |||
|
77 | 80 | PyModule_AddIntConstant(mod, "HASHLOG3_MAX", ZSTD_HASHLOG3_MAX); |
|
78 | 81 | PyModule_AddIntConstant(mod, "SEARCHLOG_MIN", ZSTD_SEARCHLOG_MIN); |
|
79 | 82 | PyModule_AddIntConstant(mod, "SEARCHLOG_MAX", ZSTD_SEARCHLOG_MAX); |
|
80 | PyModule_AddIntConstant(mod, "SEARCHLENGTH_MIN", ZSTD_SEARCHLENGTH_MIN); | |
|
81 | PyModule_AddIntConstant(mod, "SEARCHLENGTH_MAX", ZSTD_SEARCHLENGTH_MAX); | |
|
83 | PyModule_AddIntConstant(mod, "MINMATCH_MIN", ZSTD_MINMATCH_MIN); | |
|
84 | PyModule_AddIntConstant(mod, "MINMATCH_MAX", ZSTD_MINMATCH_MAX); | |
|
85 | /* TODO SEARCHLENGTH_* is deprecated. */ | |
|
86 | PyModule_AddIntConstant(mod, "SEARCHLENGTH_MIN", ZSTD_MINMATCH_MIN); | |
|
87 | PyModule_AddIntConstant(mod, "SEARCHLENGTH_MAX", ZSTD_MINMATCH_MAX); | |
|
82 | 88 | PyModule_AddIntConstant(mod, "TARGETLENGTH_MIN", ZSTD_TARGETLENGTH_MIN); |
|
83 | 89 | PyModule_AddIntConstant(mod, "TARGETLENGTH_MAX", ZSTD_TARGETLENGTH_MAX); |
|
84 | 90 | PyModule_AddIntConstant(mod, "LDM_MINMATCH_MIN", ZSTD_LDM_MINMATCH_MIN); |
@@ -93,6 +99,7 b' void constants_module_init(PyObject* mod' | |||
|
93 | 99 | PyModule_AddIntConstant(mod, "STRATEGY_BTLAZY2", ZSTD_btlazy2); |
|
94 | 100 | PyModule_AddIntConstant(mod, "STRATEGY_BTOPT", ZSTD_btopt); |
|
95 | 101 | PyModule_AddIntConstant(mod, "STRATEGY_BTULTRA", ZSTD_btultra); |
|
102 | PyModule_AddIntConstant(mod, "STRATEGY_BTULTRA2", ZSTD_btultra2); | |
|
96 | 103 | |
|
97 | 104 | PyModule_AddIntConstant(mod, "DICT_TYPE_AUTO", ZSTD_dct_auto); |
|
98 | 105 | PyModule_AddIntConstant(mod, "DICT_TYPE_RAWCONTENT", ZSTD_dct_rawContent); |
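The renamed constants stay backwards compatible: ``SEARCHLENGTH_*`` are kept as deprecated aliases of the new ``MINMATCH_*`` values, so both spellings compare equal::

    import zstandard

    assert zstandard.SEARCHLENGTH_MIN == zstandard.MINMATCH_MIN
    assert zstandard.SEARCHLENGTH_MAX == zstandard.MINMATCH_MAX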
@@ -102,6 +102,114 b' static PyObject* reader_isatty(PyObject*' | |||
|
102 | 102 | Py_RETURN_FALSE; |
|
103 | 103 | } |
|
104 | 104 | |
|
105 | /** | |
|
106 | * Read available input. | |
|
107 | * | |
|
108 | * Returns 0 if no data was added to input. | |
|
109 | * Returns 1 if new input data is available. | |
|
110 | * Returns -1 on error and sets a Python exception as a side-effect. | |
|
111 | */ | |
|
112 | int read_decompressor_input(ZstdDecompressionReader* self) { | |
|
113 | if (self->finishedInput) { | |
|
114 | return 0; | |
|
115 | } | |
|
116 | ||
|
117 | if (self->input.pos != self->input.size) { | |
|
118 | return 0; | |
|
119 | } | |
|
120 | ||
|
121 | if (self->reader) { | |
|
122 | Py_buffer buffer; | |
|
123 | ||
|
124 | assert(self->readResult == NULL); | |
|
125 | self->readResult = PyObject_CallMethod(self->reader, "read", | |
|
126 | "k", self->readSize); | |
|
127 | if (NULL == self->readResult) { | |
|
128 | return -1; | |
|
129 | } | |
|
130 | ||
|
131 | memset(&buffer, 0, sizeof(buffer)); | |
|
132 | ||
|
133 | if (0 != PyObject_GetBuffer(self->readResult, &buffer, PyBUF_CONTIG_RO)) { | |
|
134 | return -1; | |
|
135 | } | |
|
136 | ||
|
137 | /* EOF */ | |
|
138 | if (0 == buffer.len) { | |
|
139 | self->finishedInput = 1; | |
|
140 | Py_CLEAR(self->readResult); | |
|
141 | } | |
|
142 | else { | |
|
143 | self->input.src = buffer.buf; | |
|
144 | self->input.size = buffer.len; | |
|
145 | self->input.pos = 0; | |
|
146 | } | |
|
147 | ||
|
148 | PyBuffer_Release(&buffer); | |
|
149 | } | |
|
150 | else { | |
|
151 | assert(self->buffer.buf); | |
|
152 | /* | |
|
153 | * We should only get here once since expectation is we always | |
|
154 | * exhaust input buffer before reading again. | |
|
155 | */ | |
|
156 | assert(self->input.src == NULL); | |
|
157 | ||
|
158 | self->input.src = self->buffer.buf; | |
|
159 | self->input.size = self->buffer.len; | |
|
160 | self->input.pos = 0; | |
|
161 | } | |
|
162 | ||
|
163 | return 1; | |
|
164 | } | |
|
165 | ||
|
166 | /** | |
|
167 | * Decompresses available input into an output buffer. | |
|
168 | * | |
|
169 | * Returns 0 if we need more input. | |
|
170 | * Returns 1 if output buffer should be emitted. | |
|
171 | * Returns -1 on error and sets a Python exception. | |
|
172 | */ | |
|
173 | int decompress_input(ZstdDecompressionReader* self, ZSTD_outBuffer* output) { | |
|
174 | size_t zresult; | |
|
175 | ||
|
176 | if (self->input.pos >= self->input.size) { | |
|
177 | return 0; | |
|
178 | } | |
|
179 | ||
|
180 | Py_BEGIN_ALLOW_THREADS | |
|
181 | zresult = ZSTD_decompressStream(self->decompressor->dctx, output, &self->input); | |
|
182 | Py_END_ALLOW_THREADS | |
|
183 | ||
|
184 | /* Input exhausted. Clear our state tracking. */ | |
|
185 | if (self->input.pos == self->input.size) { | |
|
186 | memset(&self->input, 0, sizeof(self->input)); | |
|
187 | Py_CLEAR(self->readResult); | |
|
188 | ||
|
189 | if (self->buffer.buf) { | |
|
190 | self->finishedInput = 1; | |
|
191 | } | |
|
192 | } | |
|
193 | ||
|
194 | if (ZSTD_isError(zresult)) { | |
|
195 | PyErr_Format(ZstdError, "zstd decompress error: %s", ZSTD_getErrorName(zresult)); | |
|
196 | return -1; | |
|
197 | } | |
|
198 | ||
|
199 | /* We fulfilled the full read request. Signal to emit. */ | |
|
200 | if (output->pos && output->pos == output->size) { | |
|
201 | return 1; | |
|
202 | } | |
|
203 | /* We're at the end of a frame and we aren't allowed to return data | |
|
204 | spanning frames. */ | |
|
205 | else if (output->pos && zresult == 0 && !self->readAcrossFrames) { | |
|
206 | return 1; | |
|
207 | } | |
|
208 | ||
|
209 | /* There is more room in the output. Signal to collect more data. */ | |
|
210 | return 0; | |
|
211 | } | |
|
212 | ||
|
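``decompress_input()`` also signals an early emit at a frame boundary unless ``readAcrossFrames`` is set, so by default a reader over concatenated frames stops at the end of the first one. A sketch, assuming ``stream_reader()`` exposes this as a ``read_across_frames`` keyword::

    import io
    import zstandard

    cctx = zstandard.ZstdCompressor()
    frames = cctx.compress(b"one") + cctx.compress(b"two")

    dctx = zstandard.ZstdDecompressor()
    reader = dctx.stream_reader(io.BytesIO(frames))
    first = reader.read(16)   # returns data from the first frame only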
 static PyObject* reader_read(ZstdDecompressionReader* self, PyObject* args, PyObject* kwargs) {
     static char* kwlist[] = {
         "size",
@@ -113,26 +221,30 b' static PyObject* reader_read(ZstdDecompr'
     char* resultBuffer;
     Py_ssize_t resultSize;
     ZSTD_outBuffer output;
-    size_t zresult;
+    int decompressResult, readResult;
 
     if (self->closed) {
         PyErr_SetString(PyExc_ValueError, "stream is closed");
         return NULL;
     }
 
-    if (self->finishedOutput) {
-        return PyBytes_FromStringAndSize("", 0);
-    }
-
-    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "n", kwlist, &size)) {
+    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|n", kwlist, &size)) {
         return NULL;
     }
 
-    if (size < 1) {
-        PyErr_SetString(PyExc_ValueError, "cannot read negative or size 0 amounts");
+    if (size < -1) {
+        PyErr_SetString(PyExc_ValueError, "cannot read negative amounts less than -1");
         return NULL;
     }
 
+    if (size == -1) {
+        return PyObject_CallMethod((PyObject*)self, "readall", NULL);
+    }
+
+    if (self->finishedOutput || size == 0) {
+        return PyBytes_FromStringAndSize("", 0);
+    }
+
     result = PyBytes_FromStringAndSize(NULL, size);
     if (NULL == result) {
         return NULL;
@@ -146,85 +258,38 b' static PyObject* reader_read(ZstdDecompr'
 
 readinput:
 
-    /* Consume input data left over from last time. */
-    if (self->input.pos < self->input.size) {
-        Py_BEGIN_ALLOW_THREADS
-        zresult = ZSTD_decompress_generic(self->decompressor->dctx,
-            &output, &self->input);
-        Py_END_ALLOW_THREADS
+    decompressResult = decompress_input(self, &output);
 
-        /* Input exhausted. Clear our state tracking. */
-        if (self->input.pos == self->input.size) {
-            memset(&self->input, 0, sizeof(self->input));
-            Py_CLEAR(self->readResult);
+    if (-1 == decompressResult) {
+        Py_XDECREF(result);
+        return NULL;
+    }
+    else if (0 == decompressResult) { }
+    else if (1 == decompressResult) {
+        self->bytesDecompressed += output.pos;
 
-            if (self->buffer.buf) {
-                self->finishedInput = 1;
+        if (output.pos != output.size) {
+            if (safe_pybytes_resize(&result, output.pos)) {
+                Py_XDECREF(result);
+                return NULL;
             }
         }
-
-        if (ZSTD_isError(zresult)) {
-            PyErr_Format(ZstdError, "zstd decompress error: %s", ZSTD_getErrorName(zresult));
-            return NULL;
-        }
-        else if (0 == zresult) {
-            self->finishedOutput = 1;
-        }
-
-        /* We fulfilled the full read request. Emit it. */
-        if (output.pos && output.pos == output.size) {
-            self->bytesDecompressed += output.size;
-            return result;
-        }
-
-        /*
-         * There is more room in the output. Fall through to try to collect
-         * more data so we can try to fill the output.
-         */
+        return result;
+    }
+    else {
+        assert(0);
     }
 
-    if (!self->finishedInput) {
-        if (self->reader) {
-            Py_buffer buffer;
-
-            assert(self->readResult == NULL);
-            self->readResult = PyObject_CallMethod(self->reader, "read",
-                "k", self->readSize);
-            if (NULL == self->readResult) {
-                return NULL;
-            }
-
-            memset(&buffer, 0, sizeof(buffer));
-
-            if (0 != PyObject_GetBuffer(self->readResult, &buffer, PyBUF_CONTIG_RO)) {
-                return NULL;
-            }
+    readResult = read_decompressor_input(self);
 
-            /* EOF */
-            if (0 == buffer.len) {
-                self->finishedInput = 1;
-                Py_CLEAR(self->readResult);
-            }
-            else {
-                self->input.src = buffer.buf;
-                self->input.size = buffer.len;
-                self->input.pos = 0;
-            }
-
-            PyBuffer_Release(&buffer);
-        }
-        else {
-            assert(self->buffer.buf);
-            /*
-             * We should only get here once since above block will exhaust
-             * source buffer until finishedInput is set.
-             */
-            assert(self->input.src == NULL);
-
-            self->input.src = self->buffer.buf;
-            self->input.size = self->buffer.len;
-            self->input.pos = 0;
-        }
+    if (-1 == readResult) {
+        Py_XDECREF(result);
+        return NULL;
+    }
+    else if (0 == readResult) {}
+    else if (1 == readResult) {}
+    else {
+        assert(0);
     }
 
     if (self->input.size) {
@@ -242,18 +307,288 b' readinput:'
     return result;
 }
 
+static PyObject* reader_read1(ZstdDecompressionReader* self, PyObject* args, PyObject* kwargs) {
+    static char* kwlist[] = {
+        "size",
+        NULL
+    };
+
+    Py_ssize_t size = -1;
+    PyObject* result = NULL;
+    char* resultBuffer;
+    Py_ssize_t resultSize;
+    ZSTD_outBuffer output;
+
+    if (self->closed) {
+        PyErr_SetString(PyExc_ValueError, "stream is closed");
+        return NULL;
+    }
+
+    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|n", kwlist, &size)) {
+        return NULL;
+    }
+
+    if (size < -1) {
+        PyErr_SetString(PyExc_ValueError, "cannot read negative amounts less than -1");
+        return NULL;
+    }
+
+    if (self->finishedOutput || size == 0) {
+        return PyBytes_FromStringAndSize("", 0);
+    }
+
+    if (size == -1) {
+        size = ZSTD_DStreamOutSize();
+    }
+
+    result = PyBytes_FromStringAndSize(NULL, size);
+    if (NULL == result) {
+        return NULL;
+    }
+
+    PyBytes_AsStringAndSize(result, &resultBuffer, &resultSize);
+
+    output.dst = resultBuffer;
+    output.size = resultSize;
+    output.pos = 0;
+
+    /* read1() is supposed to use at most 1 read() from the underlying stream.
+     * However, we can't satisfy this requirement with decompression due to the
+     * nature of how decompression works. Our strategy is to read + decompress
+     * until we get any output, at which point we return. This satisfies the
+     * intent of the read1() API to limit read operations.
+     */
+    while (!self->finishedInput) {
+        int readResult, decompressResult;
+
+        readResult = read_decompressor_input(self);
+        if (-1 == readResult) {
+            Py_XDECREF(result);
+            return NULL;
+        }
+        else if (0 == readResult || 1 == readResult) { }
+        else {
+            assert(0);
+        }
+
+        decompressResult = decompress_input(self, &output);
+
+        if (-1 == decompressResult) {
+            Py_XDECREF(result);
+            return NULL;
+        }
+        else if (0 == decompressResult || 1 == decompressResult) { }
+        else {
+            assert(0);
+        }
+
+        if (output.pos) {
+            break;
+        }
+    }
+
+    self->bytesDecompressed += output.pos;
+    if (safe_pybytes_resize(&result, output.pos)) {
+        Py_XDECREF(result);
+        return NULL;
+    }
+
+    return result;
+}
+
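``read1()`` follows the ``io.BufferedIOBase`` convention of limiting trips to the underlying stream: it reads and decompresses only until *some* output exists, then returns. A minimal usage sketch against the public API added by this change::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    reader = dctx.stream_reader(io.BytesIO(frame))
    chunk = reader.read1()  # returns as soon as any decompressed bytes exist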
|
+static PyObject* reader_readinto(ZstdDecompressionReader* self, PyObject* args) {
+    Py_buffer dest;
+    ZSTD_outBuffer output;
+    int decompressResult, readResult;
+    PyObject* result = NULL;
+
+    if (self->closed) {
+        PyErr_SetString(PyExc_ValueError, "stream is closed");
+        return NULL;
+    }
+
+    if (self->finishedOutput) {
+        return PyLong_FromLong(0);
+    }
+
+    if (!PyArg_ParseTuple(args, "w*:readinto", &dest)) {
+        return NULL;
+    }
+
+    if (!PyBuffer_IsContiguous(&dest, 'C') || dest.ndim > 1) {
+        PyErr_SetString(PyExc_ValueError,
+            "destination buffer should be contiguous and have at most one dimension");
+        goto finally;
+    }
+
+    output.dst = dest.buf;
+    output.size = dest.len;
+    output.pos = 0;
+
+readinput:
+
+    decompressResult = decompress_input(self, &output);
+
+    if (-1 == decompressResult) {
+        goto finally;
+    }
+    else if (0 == decompressResult) { }
+    else if (1 == decompressResult) {
+        self->bytesDecompressed += output.pos;
+        result = PyLong_FromSize_t(output.pos);
+        goto finally;
+    }
+    else {
+        assert(0);
+    }
+
+    readResult = read_decompressor_input(self);
+
+    if (-1 == readResult) {
+        goto finally;
+    }
+    else if (0 == readResult) {}
+    else if (1 == readResult) {}
+    else {
+        assert(0);
+    }
+
+    if (self->input.size) {
+        goto readinput;
+    }
+
+    /* EOF */
+    self->bytesDecompressed += output.pos;
+    result = PyLong_FromSize_t(output.pos);
+
+finally:
+    PyBuffer_Release(&dest);
+
+    return result;
+}
+
+static PyObject* reader_readinto1(ZstdDecompressionReader* self, PyObject* args) {
+    Py_buffer dest;
+    ZSTD_outBuffer output;
+    PyObject* result = NULL;
+
+    if (self->closed) {
+        PyErr_SetString(PyExc_ValueError, "stream is closed");
+        return NULL;
+    }
+
+    if (self->finishedOutput) {
+        return PyLong_FromLong(0);
+    }
+
+    if (!PyArg_ParseTuple(args, "w*:readinto1", &dest)) {
+        return NULL;
+    }
+
+    if (!PyBuffer_IsContiguous(&dest, 'C') || dest.ndim > 1) {
+        PyErr_SetString(PyExc_ValueError,
+            "destination buffer should be contiguous and have at most one dimension");
+        goto finally;
+    }
+
+    output.dst = dest.buf;
+    output.size = dest.len;
+    output.pos = 0;
+
+    while (!self->finishedInput && !self->finishedOutput) {
+        int decompressResult, readResult;
+
+        readResult = read_decompressor_input(self);
+
+        if (-1 == readResult) {
+            goto finally;
+        }
+        else if (0 == readResult || 1 == readResult) {}
+        else {
+            assert(0);
+        }
+
+        decompressResult = decompress_input(self, &output);
+
+        if (-1 == decompressResult) {
+            goto finally;
+        }
+        else if (0 == decompressResult || 1 == decompressResult) {}
+        else {
+            assert(0);
+        }
+
+        if (output.pos) {
+            break;
+        }
+    }
+
+    self->bytesDecompressed += output.pos;
+    result = PyLong_FromSize_t(output.pos);
+
+finally:
+    PyBuffer_Release(&dest);
+
+    return result;
+}
+
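``readinto()`` and ``readinto1()`` decompress straight into a caller-supplied buffer and return a byte count, skipping the intermediate ``bytes`` object that ``read()`` allocates. A minimal sketch::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    reader = dctx.stream_reader(io.BytesIO(frame))
    buf = bytearray(16384)
    n = reader.readinto(buf)  # decompressed bytes now live in buf[:n]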
|
 static PyObject* reader_readall(PyObject* self) {
-    PyErr_SetNone(PyExc_NotImplementedError);
-    return NULL;
+    PyObject* chunks = NULL;
+    PyObject* empty = NULL;
+    PyObject* result = NULL;
+
+    /* Our strategy is to collect chunks into a list then join all the
+     * chunks at the end. We could potentially use e.g. an io.BytesIO. But
+     * this feels simple enough to implement and avoids potentially expensive
+     * reallocations of large buffers.
+     */
+    chunks = PyList_New(0);
+    if (NULL == chunks) {
+        return NULL;
+    }
+
+    while (1) {
+        PyObject* chunk = PyObject_CallMethod(self, "read", "i", 1048576);
+        if (NULL == chunk) {
+            Py_DECREF(chunks);
+            return NULL;
+        }
+
+        if (!PyBytes_Size(chunk)) {
+            Py_DECREF(chunk);
+            break;
+        }
+
+        if (PyList_Append(chunks, chunk)) {
+            Py_DECREF(chunk);
+            Py_DECREF(chunks);
+            return NULL;
+        }
+
+        Py_DECREF(chunk);
+    }
+
+    empty = PyBytes_FromStringAndSize("", 0);
+    if (NULL == empty) {
+        Py_DECREF(chunks);
+        return NULL;
+    }
+
+    result = PyObject_CallMethod(empty, "join", "O", chunks);
+
+    Py_DECREF(empty);
+    Py_DECREF(chunks);
+
+    return result;
 }
 
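Since ``read(-1)`` now delegates to ``readall()`` (see the ``reader_read`` change above), both spellings drain the stream by pulling 1 MiB chunks and joining them::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    reader = dctx.stream_reader(io.BytesIO(frame))
    everything = reader.readall()  # equivalent to reader.read(-1)
    assert everything == b'data' * 1024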
|
 static PyObject* reader_readline(PyObject* self) {
-    PyErr_SetNone(PyExc_NotImplementedError);
+    set_unsupported_operation();
     return NULL;
 }
 
 static PyObject* reader_readlines(PyObject* self) {
-    PyErr_SetNone(PyExc_NotImplementedError);
+    set_unsupported_operation();
     return NULL;
 }
 
@@ -345,12 +680,12 b' static PyObject* reader_writelines(PyObj'
 }
 
 static PyObject* reader_iter(PyObject* self) {
-    PyErr_SetNone(PyExc_NotImplementedError);
+    set_unsupported_operation();
     return NULL;
 }
 
 static PyObject* reader_iternext(PyObject* self) {
-    PyErr_SetNone(PyExc_NotImplementedError);
+    set_unsupported_operation();
     return NULL;
 }
 
@@ -367,6 +702,10 b' static PyMethodDef reader_methods[] = {'
         PyDoc_STR("Returns True") },
     { "read", (PyCFunction)reader_read, METH_VARARGS | METH_KEYWORDS,
         PyDoc_STR("read compressed data") },
+    { "read1", (PyCFunction)reader_read1, METH_VARARGS | METH_KEYWORDS,
+        PyDoc_STR("read compressed data") },
+    { "readinto", (PyCFunction)reader_readinto, METH_VARARGS, NULL },
+    { "readinto1", (PyCFunction)reader_readinto1, METH_VARARGS, NULL },
     { "readall", (PyCFunction)reader_readall, METH_NOARGS, PyDoc_STR("Not implemented") },
     { "readline", (PyCFunction)reader_readline, METH_NOARGS, PyDoc_STR("Not implemented") },
     { "readlines", (PyCFunction)reader_readlines, METH_NOARGS, PyDoc_STR("Not implemented") },
@@ -22,12 +22,13 b' static void ZstdDecompressionWriter_deal'
 }
 
 static PyObject* ZstdDecompressionWriter_enter(ZstdDecompressionWriter* self) {
-    if (self->entered) {
-        PyErr_SetString(ZstdError, "cannot __enter__ multiple times");
+    if (self->closed) {
+        PyErr_SetString(PyExc_ValueError, "stream is closed");
         return NULL;
     }
 
-    if (ensure_dctx(self->decompressor, 1)) {
+    if (self->entered) {
+        PyErr_SetString(ZstdError, "cannot __enter__ multiple times");
         return NULL;
     }
 
@@ -40,6 +41,10 b' static PyObject* ZstdDecompressionWriter'
 static PyObject* ZstdDecompressionWriter_exit(ZstdDecompressionWriter* self, PyObject* args) {
     self->entered = 0;
 
+    if (NULL == PyObject_CallMethod((PyObject*)self, "close", NULL)) {
+        return NULL;
+    }
+
     Py_RETURN_FALSE;
 }
 
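Because ``__exit__`` now routes through ``close()`` (defined later in this change), leaving the ``with`` block flushes the writer and closes the wrapped stream as well. A sketch of the visible behavior::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    dest = io.BytesIO()
    with dctx.stream_writer(dest) as writer:
        writer.write(frame)   # feed compressed bytes in
    # exiting the block close()d the writer, and dest along with it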
@@ -76,9 +81,9 b' static PyObject* ZstdDecompressionWriter'
         goto finally;
     }
 
-    if (!self->entered) {
-        PyErr_SetString(ZstdError, "write must be called from an active context manager");
-        goto finally;
+    if (self->closed) {
+        PyErr_SetString(PyExc_ValueError, "stream is closed");
+        return NULL;
     }
 
     output.dst = PyMem_Malloc(self->outSize);
@@ -93,9 +98,9 b' static PyObject* ZstdDecompressionWriter'
     input.size = source.len;
     input.pos = 0;
 
-    while ((ssize_t)input.pos < source.len) {
+    while (input.pos < (size_t)source.len) {
         Py_BEGIN_ALLOW_THREADS
-        zresult = ZSTD_decompress_generic(self->decompressor->dctx, &output, &input);
+        zresult = ZSTD_decompressStream(self->decompressor->dctx, &output, &input);
         Py_END_ALLOW_THREADS
 
         if (ZSTD_isError(zresult)) {
@@ -120,13 +125,94 b' static PyObject* ZstdDecompressionWriter'
 
     PyMem_Free(output.dst);
 
-    result = PyLong_FromSsize_t(totalWrite);
+    if (self->writeReturnRead) {
+        result = PyLong_FromSize_t(input.pos);
+    }
+    else {
+        result = PyLong_FromSsize_t(totalWrite);
+    }
 
 finally:
     PyBuffer_Release(&source);
     return result;
 }
 
+static PyObject* ZstdDecompressionWriter_close(ZstdDecompressionWriter* self) {
+    PyObject* result;
+
+    if (self->closed) {
+        Py_RETURN_NONE;
+    }
+
+    result = PyObject_CallMethod((PyObject*)self, "flush", NULL);
+    self->closed = 1;
+
+    if (NULL == result) {
+        return NULL;
+    }
+
+    /* Call close on underlying stream as well. */
+    if (PyObject_HasAttrString(self->writer, "close")) {
+        return PyObject_CallMethod(self->writer, "close", NULL);
+    }
+
+    Py_RETURN_NONE;
+}
+
+static PyObject* ZstdDecompressionWriter_fileno(ZstdDecompressionWriter* self) {
+    if (PyObject_HasAttrString(self->writer, "fileno")) {
+        return PyObject_CallMethod(self->writer, "fileno", NULL);
+    }
+    else {
+        PyErr_SetString(PyExc_OSError, "fileno not available on underlying writer");
+        return NULL;
+    }
+}
+
+static PyObject* ZstdDecompressionWriter_flush(ZstdDecompressionWriter* self) {
+    if (self->closed) {
+        PyErr_SetString(PyExc_ValueError, "stream is closed");
+        return NULL;
+    }
+
+    if (PyObject_HasAttrString(self->writer, "flush")) {
+        return PyObject_CallMethod(self->writer, "flush", NULL);
+    }
+    else {
+        Py_RETURN_NONE;
+    }
+}
+
+static PyObject* ZstdDecompressionWriter_false(PyObject* self, PyObject* args) {
+    Py_RETURN_FALSE;
+}
+
+static PyObject* ZstdDecompressionWriter_true(PyObject* self, PyObject* args) {
+    Py_RETURN_TRUE;
+}
+
+static PyObject* ZstdDecompressionWriter_unsupported(PyObject* self, PyObject* args, PyObject* kwargs) {
+    PyObject* iomod;
+    PyObject* exc;
+
+    iomod = PyImport_ImportModule("io");
+    if (NULL == iomod) {
+        return NULL;
+    }
+
+    exc = PyObject_GetAttrString(iomod, "UnsupportedOperation");
+    if (NULL == exc) {
+        Py_DECREF(iomod);
+        return NULL;
+    }
+
+    PyErr_SetNone(exc);
+    Py_DECREF(exc);
+    Py_DECREF(iomod);
+
+    return NULL;
+}
+
 static PyMethodDef ZstdDecompressionWriter_methods[] = {
     { "__enter__", (PyCFunction)ZstdDecompressionWriter_enter, METH_NOARGS,
         PyDoc_STR("Enter a decompression context.") },
@@ -134,11 +220,32 b' static PyMethodDef ZstdDecompressionWrit'
         PyDoc_STR("Exit a decompression context.") },
     { "memory_size", (PyCFunction)ZstdDecompressionWriter_memory_size, METH_NOARGS,
         PyDoc_STR("Obtain the memory size in bytes of the underlying decompressor.") },
+    { "close", (PyCFunction)ZstdDecompressionWriter_close, METH_NOARGS, NULL },
+    { "fileno", (PyCFunction)ZstdDecompressionWriter_fileno, METH_NOARGS, NULL },
+    { "flush", (PyCFunction)ZstdDecompressionWriter_flush, METH_NOARGS, NULL },
+    { "isatty", ZstdDecompressionWriter_false, METH_NOARGS, NULL },
+    { "readable", ZstdDecompressionWriter_false, METH_NOARGS, NULL },
+    { "readline", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "readlines", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "seek", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "seekable", ZstdDecompressionWriter_false, METH_NOARGS, NULL },
+    { "tell", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "truncate", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "writable", ZstdDecompressionWriter_true, METH_NOARGS, NULL },
+    { "writelines", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "read", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "readall", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
+    { "readinto", (PyCFunction)ZstdDecompressionWriter_unsupported, METH_VARARGS | METH_KEYWORDS, NULL },
     { "write", (PyCFunction)ZstdDecompressionWriter_write, METH_VARARGS | METH_KEYWORDS,
         PyDoc_STR("Compress data") },
     { NULL, NULL }
 };
 
+static PyMemberDef ZstdDecompressionWriter_members[] = {
+    { "closed", T_BOOL, offsetof(ZstdDecompressionWriter, closed), READONLY, NULL },
+    { NULL }
+};
+
 PyTypeObject ZstdDecompressionWriterType = {
     PyVarObject_HEAD_INIT(NULL, 0)
     "zstd.ZstdDecompressionWriter", /* tp_name */
@@ -168,7 +275,7 b' PyTypeObject ZstdDecompressionWriterType'
     0, /* tp_iter */
     0, /* tp_iternext */
     ZstdDecompressionWriter_methods,/* tp_methods */
-    0, /* tp_members */
+    ZstdDecompressionWriter_members,/* tp_members */
     0, /* tp_getset */
     0, /* tp_base */
     0, /* tp_dict */
@@ -75,7 +75,7 b' static PyObject* DecompressionObj_decomp'
 
     while (1) {
         Py_BEGIN_ALLOW_THREADS
-        zresult = ZSTD_decompress_generic(self->decompressor->dctx, &output, &input);
+        zresult = ZSTD_decompressStream(self->decompressor->dctx, &output, &input);
         Py_END_ALLOW_THREADS
 
         if (ZSTD_isError(zresult)) {
@@ -130,9 +130,26 b' finally:'
     return result;
 }
 
+static PyObject* DecompressionObj_flush(ZstdDecompressionObj* self, PyObject* args, PyObject* kwargs) {
+    static char* kwlist[] = {
+        "length",
+        NULL
+    };
+
+    PyObject* length = NULL;
+
+    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|O:flush", kwlist, &length)) {
+        return NULL;
+    }
+
+    Py_RETURN_NONE;
+}
+
 static PyMethodDef DecompressionObj_methods[] = {
     { "decompress", (PyCFunction)DecompressionObj_decompress,
     METH_VARARGS | METH_KEYWORDS, PyDoc_STR("decompress data") },
+    { "flush", (PyCFunction)DecompressionObj_flush,
+    METH_VARARGS | METH_KEYWORDS, PyDoc_STR("no-op") },
     { NULL, NULL }
 };
 
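The ``flush()`` above is deliberately a no-op: it exists so ``ZstdDecompressionObj`` can slot into code written for ``zlib``/``bz2`` decompressor objects, which such code conventionally flushes. For example::

    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    dobj = dctx.decompressobj()
    data = dobj.decompress(frame)
    dobj.flush()  # accepted for API compatibility; does nothing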
@@ -17,7 +17,7 b' extern PyObject* ZstdError;'
 int ensure_dctx(ZstdDecompressor* decompressor, int loadDict) {
     size_t zresult;
 
-    ZSTD_DCtx_reset(decompressor->dctx);
+    ZSTD_DCtx_reset(decompressor->dctx, ZSTD_reset_session_only);
 
     if (decompressor->maxWindowSize) {
         zresult = ZSTD_DCtx_setMaxWindowSize(decompressor->dctx, decompressor->maxWindowSize);
@@ -229,7 +229,7 b' static PyObject* Decompressor_copy_strea'
 
     while (input.pos < input.size) {
         Py_BEGIN_ALLOW_THREADS
-        zresult = ZSTD_decompress_generic(self->dctx, &output, &input);
+        zresult = ZSTD_decompressStream(self->dctx, &output, &input);
         Py_END_ALLOW_THREADS
 
         if (ZSTD_isError(zresult)) {
@@ -379,7 +379,7 b' PyObject* Decompressor_decompress(ZstdDe'
     inBuffer.pos = 0;
 
     Py_BEGIN_ALLOW_THREADS
-    zresult = ZSTD_decompress_generic(self->dctx, &outBuffer, &inBuffer);
+    zresult = ZSTD_decompressStream(self->dctx, &outBuffer, &inBuffer);
     Py_END_ALLOW_THREADS
 
     if (ZSTD_isError(zresult)) {
@@ -550,28 +550,35 b' finally:'
 }
 
 PyDoc_STRVAR(Decompressor_stream_reader__doc__,
-"stream_reader(source, [read_size=default])\n"
+"stream_reader(source, [read_size=default, [read_across_frames=False]])\n"
 "\n"
 "Obtain an object that behaves like an I/O stream that can be used for\n"
 "reading decompressed output from an object.\n"
 "\n"
 "The source object can be any object with a ``read(size)`` method or that\n"
 "conforms to the buffer protocol.\n"
+"\n"
+"``read_across_frames`` controls the behavior of ``read()`` when the end\n"
+"of a zstd frame is reached. When ``True``, ``read()`` can potentially\n"
+"return data belonging to multiple zstd frames. When ``False``, ``read()``\n"
+"will return when the end of a frame is reached.\n"
 );
 
 static ZstdDecompressionReader* Decompressor_stream_reader(ZstdDecompressor* self, PyObject* args, PyObject* kwargs) {
     static char* kwlist[] = {
         "source",
         "read_size",
+        "read_across_frames",
         NULL
     };
 
     PyObject* source;
     size_t readSize = ZSTD_DStreamInSize();
+    PyObject* readAcrossFrames = NULL;
     ZstdDecompressionReader* result;
 
-    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|k:stream_reader", kwlist,
-        &source, &readSize)) {
+    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|kO:stream_reader", kwlist,
+        &source, &readSize, &readAcrossFrames)) {
         return NULL;
     }
 
@@ -604,6 +611,7 b' static ZstdDecompressionReader* Decompre'
 
     result->decompressor = self;
     Py_INCREF(self);
+    result->readAcrossFrames = readAcrossFrames ? PyObject_IsTrue(readAcrossFrames) : 0;
 
     return result;
 }
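With the flag plumbed through, callers can opt into multi-frame reads. A sketch using two concatenated frames::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    two_frames = cctx.compress(b'first') + cctx.compress(b'second')

    dctx = zstd.ZstdDecompressor()

    # Default: read() stops at the first frame boundary.
    reader = dctx.stream_reader(io.BytesIO(two_frames))
    assert reader.read(1024) == b'first'

    # Opt in to returning data spanning frames.
    reader = dctx.stream_reader(io.BytesIO(two_frames), read_across_frames=True)
    data = reader.read(1024)  # may now contain b'firstsecond'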
@@ -625,15 +633,17 b' static ZstdDecompressionWriter* Decompre'
     static char* kwlist[] = {
         "writer",
         "write_size",
+        "write_return_read",
         NULL
     };
 
     PyObject* writer;
     size_t outSize = ZSTD_DStreamOutSize();
+    PyObject* writeReturnRead = NULL;
     ZstdDecompressionWriter* result;
 
-    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|k:stream_writer", kwlist,
-        &writer, &outSize)) {
+    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|kO:stream_writer", kwlist,
+        &writer, &outSize, &writeReturnRead)) {
         return NULL;
     }
 
@@ -642,6 +652,10 b' static ZstdDecompressionWriter* Decompre'
         return NULL;
     }
 
+    if (ensure_dctx(self, 1)) {
+        return NULL;
+    }
+
     result = (ZstdDecompressionWriter*)PyObject_CallObject((PyObject*)&ZstdDecompressionWriterType, NULL);
     if (!result) {
         return NULL;
@@ -654,6 +668,7 b' static ZstdDecompressionWriter* Decompre'
     Py_INCREF(result->writer);
 
     result->outSize = outSize;
+    result->writeReturnRead = writeReturnRead ? PyObject_IsTrue(writeReturnRead) : 0;
 
     return result;
 }
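``write_return_read=True`` switches ``write()`` to returning the number of *input* bytes consumed, the ``io.RawIOBase`` convention, rather than the number of bytes written to the inner stream::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    dest = io.BytesIO()
    writer = dctx.stream_writer(dest, write_return_read=True)
    assert writer.write(frame) == len(frame)  # input consumed, not output size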
@@ -756,7 +771,7 b' static PyObject* Decompressor_decompress'
     inBuffer.pos = 0;
 
     Py_BEGIN_ALLOW_THREADS
-    zresult = ZSTD_decompress_generic(self->dctx, &outBuffer, &inBuffer);
+    zresult = ZSTD_decompressStream(self->dctx, &outBuffer, &inBuffer);
     Py_END_ALLOW_THREADS
     if (ZSTD_isError(zresult)) {
         PyErr_Format(ZstdError, "could not decompress chunk 0: %s", ZSTD_getErrorName(zresult));
@@ -852,7 +867,7 b' static PyObject* Decompressor_decompress'
     outBuffer.pos = 0;
 
     Py_BEGIN_ALLOW_THREADS
-    zresult = ZSTD_decompress_generic(self->dctx, &outBuffer, &inBuffer);
+    zresult = ZSTD_decompressStream(self->dctx, &outBuffer, &inBuffer);
     Py_END_ALLOW_THREADS
     if (ZSTD_isError(zresult)) {
         PyErr_Format(ZstdError, "could not decompress chunk %zd: %s",
@@ -892,7 +907,7 b' static PyObject* Decompressor_decompress'
     outBuffer.pos = 0;
 
     Py_BEGIN_ALLOW_THREADS
-    zresult = ZSTD_decompress_generic(self->dctx, &outBuffer, &inBuffer);
+    zresult = ZSTD_decompressStream(self->dctx, &outBuffer, &inBuffer);
     Py_END_ALLOW_THREADS
     if (ZSTD_isError(zresult)) {
         PyErr_Format(ZstdError, "could not decompress chunk %zd: %s",
@@ -1176,7 +1191,7 b' static void decompress_worker(WorkerStat'
     inBuffer.size = sourceSize;
     inBuffer.pos = 0;
 
-    zresult = ZSTD_decompress_generic(state->dctx, &outBuffer, &inBuffer);
+    zresult = ZSTD_decompressStream(state->dctx, &outBuffer, &inBuffer);
     if (ZSTD_isError(zresult)) {
         state->error = WorkerError_zstd;
         state->zresult = zresult;
@@ -57,7 +57,7 b' static DecompressorIteratorResult read_d'
     self->output.pos = 0;
 
     Py_BEGIN_ALLOW_THREADS
-    zresult = ZSTD_decompress_generic(self->decompressor->dctx, &self->output, &self->input);
+    zresult = ZSTD_decompressStream(self->decompressor->dctx, &self->output, &self->input);
     Py_END_ALLOW_THREADS
 
     /* We're done with the pointer. Nullify to prevent anyone from getting a
@@ -16,7 +16,7 b''
 #include <zdict.h>
 
 /* Remember to change the string in zstandard/__init__ as well */
-#define PYTHON_ZSTANDARD_VERSION "0.10.1"
+#define PYTHON_ZSTANDARD_VERSION "0.11.0"
 
 typedef enum {
     compressorobj_flush_finish,
@@ -31,27 +31,6 b' typedef enum {'
 typedef struct {
     PyObject_HEAD
     ZSTD_CCtx_params* params;
-    unsigned format;
-    int compressionLevel;
-    unsigned windowLog;
-    unsigned hashLog;
-    unsigned chainLog;
-    unsigned searchLog;
-    unsigned minMatch;
-    unsigned targetLength;
-    unsigned compressionStrategy;
-    unsigned contentSizeFlag;
-    unsigned checksumFlag;
-    unsigned dictIDFlag;
-    unsigned threads;
-    unsigned jobSize;
-    unsigned overlapSizeLog;
-    unsigned forceMaxWindow;
-    unsigned enableLongDistanceMatching;
-    unsigned ldmHashLog;
-    unsigned ldmMinMatch;
-    unsigned ldmBucketSizeLog;
-    unsigned ldmHashEveryLog;
 } ZstdCompressionParametersObject;
 
 extern PyTypeObject ZstdCompressionParametersType;
@@ -129,9 +108,11 b' typedef struct {'
 
     ZstdCompressor* compressor;
     PyObject* writer;
-    unsigned long long sourceSize;
+    ZSTD_outBuffer output;
     size_t outSize;
     int entered;
+    int closed;
+    int writeReturnRead;
     unsigned long long bytesCompressed;
 } ZstdCompressionWriter;
 
@@ -235,6 +216,8 b' typedef struct {'
     PyObject* reader;
     /* Size for read() operations on reader. */
     size_t readSize;
+    /* Whether a read() can return data spanning multiple zstd frames. */
+    int readAcrossFrames;
     /* Buffer to read from (if reading from a buffer). */
     Py_buffer buffer;
 
@@ -267,6 +250,8 b' typedef struct {'
     PyObject* writer;
     size_t outSize;
     int entered;
+    int closed;
+    int writeReturnRead;
 } ZstdDecompressionWriter;
 
 extern PyTypeObject ZstdDecompressionWriterType;
@@ -360,8 +345,9 b' typedef struct {'
 
 extern PyTypeObject ZstdBufferWithSegmentsCollectionType;
 
-int set_parameter(ZSTD_CCtx_params* params, ZSTD_cParameter param, unsigned value);
+int set_parameter(ZSTD_CCtx_params* params, ZSTD_cParameter param, int value);
 int set_parameters(ZSTD_CCtx_params* params, ZstdCompressionParametersObject* obj);
+int to_cparams(ZstdCompressionParametersObject* params, ZSTD_compressionParameters* cparams);
 FrameParametersObject* get_frame_parameters(PyObject* self, PyObject* args, PyObject* kwargs);
 int ensure_ddict(ZstdCompressionDict* dict);
 int ensure_dctx(ZstdDecompressor* decompressor, int loadDict);
@@ -36,7 +36,9 b" SOURCES = ['zstd/%s' % p for p in ("
     'compress/zstd_opt.c',
     'compress/zstdmt_compress.c',
     'decompress/huf_decompress.c',
+    'decompress/zstd_ddict.c',
     'decompress/zstd_decompress.c',
+    'decompress/zstd_decompress_block.c',
     'dictBuilder/cover.c',
     'dictBuilder/fastcover.c',
     'dictBuilder/divsufsort.c',
@@ -5,12 +5,32 b''
 # This software may be modified and distributed under the terms
 # of the BSD license. See the LICENSE file for details.
 
+from __future__ import print_function
+
+from distutils.version import LooseVersion
 import os
 import sys
 from setuptools import setup
 
+# Need change in 1.10 for ffi.from_buffer() to handle all buffer types
+# (like memoryview).
+# Need feature in 1.11 for ffi.gc() to declare size of objects so we avoid
+# garbage collection pitfalls.
+MINIMUM_CFFI_VERSION = '1.11'
+
 try:
     import cffi
+
+    # PyPy (and possibly other distros) have CFFI distributed as part of
+    # them. The install_requires for CFFI below won't work. We need to sniff
+    # out the CFFI version here and reject CFFI if it is too old.
+    cffi_version = LooseVersion(cffi.__version__)
+    if cffi_version < LooseVersion(MINIMUM_CFFI_VERSION):
+        print('CFFI 1.11 or newer required (%s found); '
+              'not building CFFI backend' % cffi_version,
+              file=sys.stderr)
+        cffi = None
+
 except ImportError:
     cffi = None
 
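PyPy ships CFFI as part of the interpreter, so the pip-level pin below cannot reject a too-old copy; the import-time sniff above is the backstop, and its comparison is plain ``LooseVersion`` ordering::

    from distutils.version import LooseVersion

    assert LooseVersion('1.9.1') < LooseVersion('1.11')        # too old: backend skipped
    assert not LooseVersion('1.11.5') < LooseVersion('1.11')   # new enough: backend built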
@@ -49,12 +69,7 b' install_requires = []'
 if cffi:
     import make_cffi
     extensions.append(make_cffi.ffi.distutils_extension())
-
-    # Need change in 1.10 for ffi.from_buffer() to handle all buffer types
-    # (like memoryview).
-    # Need feature in 1.11 for ffi.gc() to declare size of objects so we avoid
-    # garbage collection pitfalls.
-    install_requires.append('cffi>=1.11')
+    install_requires.append('cffi>=%s' % MINIMUM_CFFI_VERSION)
 
 version = None
 
@@ -88,6 +103,7 b' setup('
         'Programming Language :: Python :: 3.4',
         'Programming Language :: Python :: 3.5',
         'Programming Language :: Python :: 3.6',
+        'Programming Language :: Python :: 3.7',
     ],
     keywords='zstandard zstd compression',
     packages=['zstandard'],
@@ -30,7 +30,9 b" zstd_sources = ['zstd/%s' % p for p in ("
     'compress/zstd_opt.c',
     'compress/zstdmt_compress.c',
     'decompress/huf_decompress.c',
+    'decompress/zstd_ddict.c',
     'decompress/zstd_decompress.c',
+    'decompress/zstd_decompress_block.c',
     'dictBuilder/cover.c',
     'dictBuilder/divsufsort.c',
     'dictBuilder/fastcover.c',
@@ -79,12 +79,37 b' def make_cffi(cls):'
     return cls
 
 
-class OpCountingBytesIO(io.BytesIO):
+class NonClosingBytesIO(io.BytesIO):
+    """BytesIO that saves the underlying buffer on close().
+
+    This allows us to access written data after close().
+    """
     def __init__(self, *args, **kwargs):
+        super(NonClosingBytesIO, self).__init__(*args, **kwargs)
+        self._saved_buffer = None
+
+    def close(self):
+        self._saved_buffer = self.getvalue()
+        return super(NonClosingBytesIO, self).close()
+
+    def getvalue(self):
+        if self.closed:
+            return self._saved_buffer
+        else:
+            return super(NonClosingBytesIO, self).getvalue()
+
+
+class OpCountingBytesIO(NonClosingBytesIO):
+    def __init__(self, *args, **kwargs):
+        self._flush_count = 0
         self._read_count = 0
         self._write_count = 0
         return super(OpCountingBytesIO, self).__init__(*args, **kwargs)
 
+    def flush(self):
+        self._flush_count += 1
+        return super(OpCountingBytesIO, self).flush()
+
     def read(self, *args):
         self._read_count += 1
         return super(OpCountingBytesIO, self).read(*args)
@@ -117,6 +142,13 b' def random_input_data():'
         except OSError:
             pass
 
+    # Also add some actual random data.
+    _source_files.append(os.urandom(100))
+    _source_files.append(os.urandom(1000))
+    _source_files.append(os.urandom(10000))
+    _source_files.append(os.urandom(100000))
+    _source_files.append(os.urandom(1000000))
+
     return _source_files
 
 
@@ -140,12 +172,14 b' def generate_samples():'
 
 
 if hypothesis:
-    default_settings = hypothesis.settings()
+    default_settings = hypothesis.settings(deadline=10000)
     hypothesis.settings.register_profile('default', default_settings)
 
-    ci_settings = hypothesis.settings(max_examples=
-                                      max_iterations=2500)
+    ci_settings = hypothesis.settings(deadline=20000, max_examples=1000)
     hypothesis.settings.register_profile('ci', ci_settings)
 
+    expensive_settings = hypothesis.settings(deadline=None, max_examples=10000)
+    hypothesis.settings.register_profile('expensive', expensive_settings)
+
     hypothesis.settings.load_profile(
         os.environ.get('HYPOTHESIS_PROFILE', 'default'))
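The profile in effect is chosen at test time via the ``HYPOTHESIS_PROFILE`` environment variable read above; unset, the ``default`` profile applies. Selecting the heavier profile from Python (the variable must be set before ``common.py`` is imported)::

    import os
    os.environ['HYPOTHESIS_PROFILE'] = 'expensive'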
@@ -8,6 +8,9 b" ss = struct.Struct('=QQ')"
 
 class TestBufferWithSegments(unittest.TestCase):
     def test_arguments(self):
+        if not hasattr(zstd, 'BufferWithSegments'):
+            self.skipTest('BufferWithSegments not available')
+
         with self.assertRaises(TypeError):
             zstd.BufferWithSegments()
 
@@ -19,10 +22,16 b' class TestBufferWithSegments(unittest.Te'
             zstd.BufferWithSegments(b'foo', b'\x00\x00')
 
     def test_invalid_offset(self):
+        if not hasattr(zstd, 'BufferWithSegments'):
+            self.skipTest('BufferWithSegments not available')
+
         with self.assertRaisesRegexp(ValueError, 'offset within segments array references memory'):
             zstd.BufferWithSegments(b'foo', ss.pack(0, 4))
 
     def test_invalid_getitem(self):
+        if not hasattr(zstd, 'BufferWithSegments'):
+            self.skipTest('BufferWithSegments not available')
+
         b = zstd.BufferWithSegments(b'foo', ss.pack(0, 3))
 
         with self.assertRaisesRegexp(IndexError, 'offset must be non-negative'):
@@ -35,6 +44,9 b' class TestBufferWithSegments(unittest.Te'
             test = b[2]
 
     def test_single(self):
+        if not hasattr(zstd, 'BufferWithSegments'):
+            self.skipTest('BufferWithSegments not available')
+
         b = zstd.BufferWithSegments(b'foo', ss.pack(0, 3))
         self.assertEqual(len(b), 1)
         self.assertEqual(b.size, 3)
@@ -45,6 +57,9 b' class TestBufferWithSegments(unittest.Te'
         self.assertEqual(b[0].tobytes(), b'foo')
 
     def test_multiple(self):
+        if not hasattr(zstd, 'BufferWithSegments'):
+            self.skipTest('BufferWithSegments not available')
+
         b = zstd.BufferWithSegments(b'foofooxfooxy', b''.join([ss.pack(0, 3),
                                                                ss.pack(3, 4),
                                                                ss.pack(7, 5)]))
@@ -59,10 +74,16 b' class TestBufferWithSegments(unittest.Te'
 
 class TestBufferWithSegmentsCollection(unittest.TestCase):
     def test_empty_constructor(self):
+        if not hasattr(zstd, 'BufferWithSegmentsCollection'):
+            self.skipTest('BufferWithSegmentsCollection not available')
+
         with self.assertRaisesRegexp(ValueError, 'must pass at least 1 argument'):
             zstd.BufferWithSegmentsCollection()
 
     def test_argument_validation(self):
+        if not hasattr(zstd, 'BufferWithSegmentsCollection'):
+            self.skipTest('BufferWithSegmentsCollection not available')
+
         with self.assertRaisesRegexp(TypeError, 'arguments must be BufferWithSegments'):
             zstd.BufferWithSegmentsCollection(None)
 
@@ -74,6 +95,9 b' class TestBufferWithSegmentsCollection(u'
             zstd.BufferWithSegmentsCollection(zstd.BufferWithSegments(b'', b''))
 
     def test_length(self):
+        if not hasattr(zstd, 'BufferWithSegmentsCollection'):
+            self.skipTest('BufferWithSegmentsCollection not available')
+
         b1 = zstd.BufferWithSegments(b'foo', ss.pack(0, 3))
         b2 = zstd.BufferWithSegments(b'barbaz', b''.join([ss.pack(0, 3),
                                                           ss.pack(3, 3)]))
@@ -91,6 +115,9 b' class TestBufferWithSegmentsCollection(u'
         self.assertEqual(c.size(), 9)
 
     def test_getitem(self):
+        if not hasattr(zstd, 'BufferWithSegmentsCollection'):
+            self.skipTest('BufferWithSegmentsCollection not available')
+
         b1 = zstd.BufferWithSegments(b'foo', ss.pack(0, 3))
         b2 = zstd.BufferWithSegments(b'barbaz', b''.join([ss.pack(0, 3),
                                                           ss.pack(3, 3)]))
@@ -1,14 +1,17 b''
 import hashlib
 import io
+import os
 import struct
 import sys
 import tarfile
+import tempfile
 import unittest
 
 import zstandard as zstd
 
 from .common import (
     make_cffi,
+    NonClosingBytesIO,
     OpCountingBytesIO,
 )
 
@@ -272,7 +275,7 b' class TestCompressor_compressobj(unittes'
 
         params = zstd.get_frame_parameters(result)
         self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN)
-        self.assertEqual(params.window_size, 1048576)
+        self.assertEqual(params.window_size, 2097152)
         self.assertEqual(params.dict_id, 0)
         self.assertFalse(params.has_checksum)
 
@@ -321,7 +324,7 b' class TestCompressor_compressobj(unittes'
         cobj.compress(b'foo')
         cobj.flush()
 
-        with self.assertRaisesRegexp(zstd.ZstdError, 'cannot call compress\(\) after compressor'):
+        with self.assertRaisesRegexp(zstd.ZstdError, r'cannot call compress\(\) after compressor'):
             cobj.compress(b'foo')
 
         with self.assertRaisesRegexp(zstd.ZstdError, 'compressor object already finished'):
@@ -453,7 +456,7 b' class TestCompressor_copy_stream(unittes'
 
         params = zstd.get_frame_parameters(dest.getvalue())
         self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN)
-        self.assertEqual(params.window_size, 1048576)
+        self.assertEqual(params.window_size, 2097152)
         self.assertEqual(params.dict_id, 0)
         self.assertFalse(params.has_checksum)
 
@@ -605,10 +608,6 b' class TestCompressor_stream_reader(unitt'
         with self.assertRaises(io.UnsupportedOperation):
             reader.readlines()
 
-        # This could probably be implemented someday.
-        with self.assertRaises(NotImplementedError):
-            reader.readall()
-
         with self.assertRaises(io.UnsupportedOperation):
             iter(reader)
 
@@ -644,15 +643,16 b' class TestCompressor_stream_reader(unitt'
         with self.assertRaisesRegexp(ValueError, 'stream is closed'):
             reader.read(10)
 
-    def test_read_bad_size(self):
+    def test_read_sizes(self):
         cctx = zstd.ZstdCompressor()
+        foo = cctx.compress(b'foo')
 
         with cctx.stream_reader(b'foo') as reader:
-            with self.assertRaisesRegexp(ValueError, 'cannot read negative or size 0 amounts'):
-                reader.read(-1)
+            with self.assertRaisesRegexp(ValueError, 'cannot read negative amounts less than -1'):
+                reader.read(-2)
 
-            with self.assertRaisesRegexp(ValueError, 'cannot read negative or size 0 amounts'):
-                reader.read(0)
+            self.assertEqual(reader.read(0), b'')
+            self.assertEqual(reader.read(), foo)
 
     def test_read_buffer(self):
         cctx = zstd.ZstdCompressor()
@@ -746,11 +746,202 b' class TestCompressor_stream_reader(unitt'
         with cctx.stream_reader(source, size=42):
             pass
 
+    def test_readall(self):
+        cctx = zstd.ZstdCompressor()
+        frame = cctx.compress(b'foo' * 1024)
+
+        reader = cctx.stream_reader(b'foo' * 1024)
+        self.assertEqual(reader.readall(), frame)
+
+    def test_readinto(self):
+        cctx = zstd.ZstdCompressor()
+        foo = cctx.compress(b'foo')
+
+        reader = cctx.stream_reader(b'foo')
+        with self.assertRaises(Exception):
+            reader.readinto(b'foobar')
+
+        # readinto() with sufficiently large destination.
+        b = bytearray(1024)
+        reader = cctx.stream_reader(b'foo')
+        self.assertEqual(reader.readinto(b), len(foo))
+        self.assertEqual(b[0:len(foo)], foo)
+        self.assertEqual(reader.readinto(b), 0)
+        self.assertEqual(b[0:len(foo)], foo)
+
+        # readinto() with small reads.
+        b = bytearray(1024)
+        reader = cctx.stream_reader(b'foo', read_size=1)
+        self.assertEqual(reader.readinto(b), len(foo))
+        self.assertEqual(b[0:len(foo)], foo)
+
+        # Too small destination buffer.
+        b = bytearray(2)
+        reader = cctx.stream_reader(b'foo')
+        self.assertEqual(reader.readinto(b), 2)
+        self.assertEqual(b[:], foo[0:2])
+        self.assertEqual(reader.readinto(b), 2)
+        self.assertEqual(b[:], foo[2:4])
+        self.assertEqual(reader.readinto(b), 2)
+        self.assertEqual(b[:], foo[4:6])
+
+    def test_readinto1(self):
+        cctx = zstd.ZstdCompressor()
+        foo = b''.join(cctx.read_to_iter(io.BytesIO(b'foo')))
+
+        reader = cctx.stream_reader(b'foo')
+        with self.assertRaises(Exception):
+            reader.readinto1(b'foobar')
+
+        b = bytearray(1024)
+        source = OpCountingBytesIO(b'foo')
+        reader = cctx.stream_reader(source)
+        self.assertEqual(reader.readinto1(b), len(foo))
+        self.assertEqual(b[0:len(foo)], foo)
+        self.assertEqual(source._read_count, 2)
+
+        # readinto1() with small reads.
+        b = bytearray(1024)
+        source = OpCountingBytesIO(b'foo')
+        reader = cctx.stream_reader(source, read_size=1)
+        self.assertEqual(reader.readinto1(b), len(foo))
+        self.assertEqual(b[0:len(foo)], foo)
+        self.assertEqual(source._read_count, 4)
+
+    def test_read1(self):
+        cctx = zstd.ZstdCompressor()
+        foo = b''.join(cctx.read_to_iter(io.BytesIO(b'foo')))
+
+        b = OpCountingBytesIO(b'foo')
+        reader = cctx.stream_reader(b)
+
+        self.assertEqual(reader.read1(), foo)
+        self.assertEqual(b._read_count, 2)
+
+        b = OpCountingBytesIO(b'foo')
+        reader = cctx.stream_reader(b)
+
+        self.assertEqual(reader.read1(0), b'')
+        self.assertEqual(reader.read1(2), foo[0:2])
+        self.assertEqual(b._read_count, 2)
+        self.assertEqual(reader.read1(2), foo[2:4])
+        self.assertEqual(reader.read1(1024), foo[4:])
+
 
 @make_cffi
 class TestCompressor_stream_writer(unittest.TestCase):
+    def test_io_api(self):
+        buffer = io.BytesIO()
+        cctx = zstd.ZstdCompressor()
+        writer = cctx.stream_writer(buffer)
+
+        self.assertFalse(writer.isatty())
+        self.assertFalse(writer.readable())
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readline()
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readline(42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readline(size=42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readlines()
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readlines(42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readlines(hint=42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.seek(0)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.seek(10, os.SEEK_SET)
+
+        self.assertFalse(writer.seekable())
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.truncate()
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.truncate(42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.truncate(size=42)
+
+        self.assertTrue(writer.writable())
+
+        with self.assertRaises(NotImplementedError):
+            writer.writelines([])
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.read()
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.read(42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.read(size=42)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readall()
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.readinto(None)
+
+        with self.assertRaises(io.UnsupportedOperation):
+            writer.fileno()
+
+        self.assertFalse(writer.closed)
+
+    def test_fileno_file(self):
+        with tempfile.TemporaryFile('wb') as tf:
+            cctx = zstd.ZstdCompressor()
+            writer = cctx.stream_writer(tf)
+
+            self.assertEqual(writer.fileno(), tf.fileno())
+
+    def test_close(self):
+        buffer = NonClosingBytesIO()
+        cctx = zstd.ZstdCompressor(level=1)
+        writer = cctx.stream_writer(buffer)
+
+        writer.write(b'foo' * 1024)
+        self.assertFalse(writer.closed)
+        self.assertFalse(buffer.closed)
+        writer.close()
+        self.assertTrue(writer.closed)
+        self.assertTrue(buffer.closed)
+
+        with self.assertRaisesRegexp(ValueError, 'stream is closed'):
+            writer.write(b'foo')
+
+        with self.assertRaisesRegexp(ValueError, 'stream is closed'):
+            writer.flush()
+
+        with self.assertRaisesRegexp(ValueError, 'stream is closed'):
+            with writer:
+                pass
+
+        self.assertEqual(buffer.getvalue(),
+                         b'\x28\xb5\x2f\xfd\x00\x48\x55\x00\x00\x18\x66\x6f'
+                         b'\x6f\x01\x00\xfa\xd3\x77\x43')
+
+        # Context manager exit should close stream.
+        buffer = io.BytesIO()
+        writer = cctx.stream_writer(buffer)
+
+        with writer:
+            writer.write(b'foo')
+
+        self.assertTrue(writer.closed)
+
     def test_empty(self):
-        buffer = io.BytesIO()
+        buffer = NonClosingBytesIO()
         cctx = zstd.ZstdCompressor(level=1, write_content_size=False)
         with cctx.stream_writer(buffer) as compressor:
             compressor.write(b'')
@@ -764,6 +955,25 b' class TestCompressor_stream_writer(unitt' | |||
|
764 | 955 | self.assertEqual(params.dict_id, 0) |
|
765 | 956 | self.assertFalse(params.has_checksum) |
|
766 | 957 | |
|
958 | # Test without context manager. | |
|
959 | buffer = io.BytesIO() | |
|
960 | compressor = cctx.stream_writer(buffer) | |
|
961 | self.assertEqual(compressor.write(b''), 0) | |
|
962 | self.assertEqual(buffer.getvalue(), b'') | |
|
963 | self.assertEqual(compressor.flush(zstd.FLUSH_FRAME), 9) | |
|
964 | result = buffer.getvalue() | |
|
965 | self.assertEqual(result, b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') | |
|
966 | ||
|
967 | params = zstd.get_frame_parameters(result) | |
|
968 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) | |
|
969 | self.assertEqual(params.window_size, 524288) | |
|
970 | self.assertEqual(params.dict_id, 0) | |
|
971 | self.assertFalse(params.has_checksum) | |
|
972 | ||
|
973 | # Test write_return_read=True | |
|
974 | compressor = cctx.stream_writer(buffer, write_return_read=True) | |
|
975 | self.assertEqual(compressor.write(b''), 0) | |
|
976 | ||
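
The context-manager-free flow added above relies on ``flush(zstd.FLUSH_FRAME)``
to finish the frame, then inspects the header with
``zstd.get_frame_parameters()``. Distilled (assuming the 0.11-era API this
diff targets)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor(level=1, write_content_size=False)
    buffer = io.BytesIO()
    compressor = cctx.stream_writer(buffer)
    compressor.write(b'')
    compressor.flush(zstd.FLUSH_FRAME)  # emits a complete (empty) frame

    params = zstd.get_frame_parameters(buffer.getvalue())
    assert params.content_size == zstd.CONTENTSIZE_UNKNOWN  # header omitted
    assert params.dict_id == 0
    assert not params.has_checksum
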
|
767 | 977 | def test_input_types(self): |
|
768 | 978 | expected = b'\x28\xb5\x2f\xfd\x00\x48\x19\x00\x00\x66\x6f\x6f' |
|
769 | 979 | cctx = zstd.ZstdCompressor(level=1) |
@@ -778,14 +988,17 b' class TestCompressor_stream_writer(unitt' | |||
|
778 | 988 | ] |
|
779 | 989 | |
|
780 | 990 | for source in sources: |
|
781 | buffer = io.BytesIO() | |

991 | buffer = NonClosingBytesIO() | |
|
782 | 992 | with cctx.stream_writer(buffer) as compressor: |
|
783 | 993 | compressor.write(source) |
|
784 | 994 | |
|
785 | 995 | self.assertEqual(buffer.getvalue(), expected) |
|
786 | 996 | |
|
997 | compressor = cctx.stream_writer(buffer, write_return_read=True) | |
|
998 | self.assertEqual(compressor.write(source), len(source)) | |
|
999 | ||
|
787 | 1000 | def test_multiple_compress(self): |
|
788 | buffer = io.BytesIO() | |

1001 | buffer = NonClosingBytesIO() | |
|
789 | 1002 | cctx = zstd.ZstdCompressor(level=5) |
|
790 | 1003 | with cctx.stream_writer(buffer) as compressor: |
|
791 | 1004 | self.assertEqual(compressor.write(b'foo'), 0) |
@@ -794,9 +1007,27 b' class TestCompressor_stream_writer(unitt' | |||
|
794 | 1007 | |
|
795 | 1008 | result = buffer.getvalue() |
|
796 | 1009 | self.assertEqual(result, |
|
797 | b'\x28\xb5\x2f\xfd\x00\x5 | |

1010 | b'\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x38\x66\x6f' | |
|
798 | 1011 | b'\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23') |
|
799 | 1012 | |
|
1013 | # Test without context manager. | |
|
1014 | buffer = io.BytesIO() | |
|
1015 | compressor = cctx.stream_writer(buffer) | |
|
1016 | self.assertEqual(compressor.write(b'foo'), 0) | |
|
1017 | self.assertEqual(compressor.write(b'bar'), 0) | |
|
1018 | self.assertEqual(compressor.write(b'x' * 8192), 0) | |
|
1019 | self.assertEqual(compressor.flush(zstd.FLUSH_FRAME), 23) | |
|
1020 | result = buffer.getvalue() | |
|
1021 | self.assertEqual(result, | |
|
1022 | b'\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x38\x66\x6f' | |
|
1023 | b'\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23') | |
|
1024 | ||
|
1025 | # Test with write_return_read=True. | |
|
1026 | compressor = cctx.stream_writer(buffer, write_return_read=True) | |
|
1027 | self.assertEqual(compressor.write(b'foo'), 3) | |
|
1028 | self.assertEqual(compressor.write(b'barbiz'), 6) | |
|
1029 | self.assertEqual(compressor.write(b'x' * 8192), 8192) | |
|
1030 | ||
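
The new ``write_return_read`` flag changes what ``write()`` returns: by
default it reports bytes emitted to the destination (frequently ``0`` while
data sits in zstd's internal buffers), whereas ``write_return_read=True``
reports bytes consumed from the input, matching the ``io.RawIOBase``
contract. For example::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor(level=5)

    writer = cctx.stream_writer(io.BytesIO())
    assert writer.write(b'foo') == 0  # bytes written downstream so far

    writer = cctx.stream_writer(io.BytesIO(), write_return_read=True)
    assert writer.write(b'foo') == 3  # bytes consumed from the input
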
|
800 | 1031 | def test_dictionary(self): |
|
801 | 1032 | samples = [] |
|
802 | 1033 | for i in range(128): |
@@ -807,9 +1038,9 b' class TestCompressor_stream_writer(unitt' | |||
|
807 | 1038 | d = zstd.train_dictionary(8192, samples) |
|
808 | 1039 | |
|
809 | 1040 | h = hashlib.sha1(d.as_bytes()).hexdigest() |
|
810 | self.assertEqual(h, '2b3b6428da5bf2c9cc9d4bb58ba0bc5990dd0e79') | |
|
1041 | self.assertEqual(h, '88ca0d38332aff379d4ced166a51c280a7679aad') | |
|
811 | 1042 | |
|
812 | buffer = io.BytesIO() | |

1043 | buffer = NonClosingBytesIO() | |
|
813 | 1044 | cctx = zstd.ZstdCompressor(level=9, dict_data=d) |
|
814 | 1045 | with cctx.stream_writer(buffer) as compressor: |
|
815 | 1046 | self.assertEqual(compressor.write(b'foo'), 0) |
@@ -825,7 +1056,7 b' class TestCompressor_stream_writer(unitt' | |||
|
825 | 1056 | self.assertFalse(params.has_checksum) |
|
826 | 1057 | |
|
827 | 1058 | h = hashlib.sha1(compressed).hexdigest() |
|
828 | self.assertEqual(h, ' | |

1059 | self.assertEqual(h, '8703b4316f274d26697ea5dd480f29c08e85d940') | |
|
829 | 1060 | |
|
830 | 1061 | source = b'foo' + b'bar' + (b'foo' * 16384) |
|
831 | 1062 | |
@@ -842,9 +1073,9 b' class TestCompressor_stream_writer(unitt' | |||
|
842 | 1073 | min_match=5, |
|
843 | 1074 | search_log=4, |
|
844 | 1075 | target_length=10, |
|
845 | compression_strategy=zstd.STRATEGY_FAST) | |

1076 | strategy=zstd.STRATEGY_FAST) | |
|
846 | 1077 | |
|
847 | buffer = io.BytesIO() | |

1078 | buffer = NonClosingBytesIO() | |
|
848 | 1079 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
849 | 1080 | with cctx.stream_writer(buffer) as compressor: |
|
850 | 1081 | self.assertEqual(compressor.write(b'foo'), 0) |
@@ -863,12 +1094,12 b' class TestCompressor_stream_writer(unitt' | |||
|
863 | 1094 | self.assertEqual(h, '2a8111d72eb5004cdcecbdac37da9f26720d30ef') |
|
864 | 1095 | |
|
865 | 1096 | def test_write_checksum(self): |
|
866 | no_checksum = io.BytesIO() | |

1097 | no_checksum = NonClosingBytesIO() | |
|
867 | 1098 | cctx = zstd.ZstdCompressor(level=1) |
|
868 | 1099 | with cctx.stream_writer(no_checksum) as compressor: |
|
869 | 1100 | self.assertEqual(compressor.write(b'foobar'), 0) |
|
870 | 1101 | |
|
871 | with_checksum = io.BytesIO() | |

1102 | with_checksum = NonClosingBytesIO() | |
|
872 | 1103 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
873 | 1104 | with cctx.stream_writer(with_checksum) as compressor: |
|
874 | 1105 | self.assertEqual(compressor.write(b'foobar'), 0) |
@@ -886,12 +1117,12 b' class TestCompressor_stream_writer(unitt' | |||
|
886 | 1117 | len(no_checksum.getvalue()) + 4) |
|
887 | 1118 | |
|
888 | 1119 | def test_write_content_size(self): |
|
889 | no_size = io.BytesIO() | |

1120 | no_size = NonClosingBytesIO() | |
|
890 | 1121 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
891 | 1122 | with cctx.stream_writer(no_size) as compressor: |
|
892 | 1123 | self.assertEqual(compressor.write(b'foobar' * 256), 0) |
|
893 | 1124 | |
|
894 | with_size = io.BytesIO() | |

1125 | with_size = NonClosingBytesIO() | |
|
895 | 1126 | cctx = zstd.ZstdCompressor(level=1) |
|
896 | 1127 | with cctx.stream_writer(with_size) as compressor: |
|
897 | 1128 | self.assertEqual(compressor.write(b'foobar' * 256), 0) |
@@ -902,7 +1133,7 b' class TestCompressor_stream_writer(unitt' | |||
|
902 | 1133 | len(no_size.getvalue())) |
|
903 | 1134 | |
|
904 | 1135 | # Declaring size will write the header. |
|
905 | with_size = io.BytesIO() | |

1136 | with_size = NonClosingBytesIO() | |
|
906 | 1137 | with cctx.stream_writer(with_size, size=len(b'foobar' * 256)) as compressor: |
|
907 | 1138 | self.assertEqual(compressor.write(b'foobar' * 256), 0) |
|
908 | 1139 | |
@@ -927,7 +1158,7 b' class TestCompressor_stream_writer(unitt' | |||
|
927 | 1158 | |
|
928 | 1159 | d = zstd.train_dictionary(1024, samples) |
|
929 | 1160 | |
|
930 | with_dict_id = io.BytesIO() | |

1161 | with_dict_id = NonClosingBytesIO() | |
|
931 | 1162 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
932 | 1163 | with cctx.stream_writer(with_dict_id) as compressor: |
|
933 | 1164 | self.assertEqual(compressor.write(b'foobarfoobar'), 0) |
@@ -935,7 +1166,7 b' class TestCompressor_stream_writer(unitt' | |||
|
935 | 1166 | self.assertEqual(with_dict_id.getvalue()[4:5], b'\x03') |
|
936 | 1167 | |
|
937 | 1168 | cctx = zstd.ZstdCompressor(level=1, dict_data=d, write_dict_id=False) |
|
938 | no_dict_id = io.BytesIO() | |

1169 | no_dict_id = NonClosingBytesIO() | |
|
939 | 1170 | with cctx.stream_writer(no_dict_id) as compressor: |
|
940 | 1171 | self.assertEqual(compressor.write(b'foobarfoobar'), 0) |
|
941 | 1172 | |
@@ -1009,8 +1240,32 b' class TestCompressor_stream_writer(unitt' | |||
|
1009 | 1240 | header = trailing[0:3] |
|
1010 | 1241 | self.assertEqual(header, b'\x01\x00\x00') |
|
1011 | 1242 | |
|
1243 | def test_flush_frame(self): | |
|
1244 | cctx = zstd.ZstdCompressor(level=3) | |
|
1245 | dest = OpCountingBytesIO() | |
|
1246 | ||
|
1247 | with cctx.stream_writer(dest) as compressor: | |
|
1248 | self.assertEqual(compressor.write(b'foobar' * 8192), 0) | |
|
1249 | self.assertEqual(compressor.flush(zstd.FLUSH_FRAME), 23) | |
|
1250 | compressor.write(b'biz' * 16384) | |
|
1251 | ||
|
1252 | self.assertEqual(dest.getvalue(), | |
|
1253 | # Frame 1. | |
|
1254 | b'\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x30\x66\x6f\x6f' | |
|
1255 | b'\x62\x61\x72\x01\x00\xf7\xbf\xe8\xa5\x08' | |
|
1256 | # Frame 2. | |
|
1257 | b'\x28\xb5\x2f\xfd\x00\x58\x5d\x00\x00\x18\x62\x69\x7a' | |
|
1258 | b'\x01\x00\xfa\x3f\x75\x37\x04') | |
|
1259 | ||
|
1260 | def test_bad_flush_mode(self): | |
|
1261 | cctx = zstd.ZstdCompressor() | |
|
1262 | dest = io.BytesIO() | |
|
1263 | with cctx.stream_writer(dest) as compressor: | |
|
1264 | with self.assertRaisesRegexp(ValueError, 'unknown flush_mode: 42'): | |
|
1265 | compressor.flush(flush_mode=42) | |
|
1266 | ||
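
``test_flush_frame`` above demonstrates the new ``FLUSH_FRAME`` mode: flushing
ends the current frame, and a subsequent ``write()`` begins a new one, so a
single writer can emit several independently decodable frames. In sketch
form::

    import io
    import zstandard as zstd

    dest = io.BytesIO()
    cctx = zstd.ZstdCompressor(level=3)

    with cctx.stream_writer(dest) as compressor:
        compressor.write(b'foobar' * 8192)
        compressor.flush(zstd.FLUSH_FRAME)  # frame 1 is now complete
        compressor.write(b'biz' * 16384)    # starts frame 2
    # leaving the block flushes frame 2 and closes dest
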
|
1012 | 1267 | def test_multithreaded(self): |
|
1013 | dest = io.BytesIO() | |

1268 | dest = NonClosingBytesIO() | |
|
1014 | 1269 | cctx = zstd.ZstdCompressor(threads=2) |
|
1015 | 1270 | with cctx.stream_writer(dest) as compressor: |
|
1016 | 1271 | compressor.write(b'a' * 1048576) |
@@ -1043,22 +1298,21 b' class TestCompressor_stream_writer(unitt' | |||
|
1043 | 1298 | pass |
|
1044 | 1299 | |
|
1045 | 1300 | def test_tarfile_compat(self): |
|
1046 | raise unittest.SkipTest('not yet fully working') | |
|
1047 | ||
|
1048 | dest = io.BytesIO() | |
|
1301 | dest = NonClosingBytesIO() | |
|
1049 | 1302 | cctx = zstd.ZstdCompressor() |
|
1050 | 1303 | with cctx.stream_writer(dest) as compressor: |
|
1051 | with tarfile.open('tf', mode='w', fileobj=compressor) as tf: | |
|
1304 | with tarfile.open('tf', mode='w|', fileobj=compressor) as tf: | |
|
1052 | 1305 | tf.add(__file__, 'test_compressor.py') |
|
1053 | 1306 | |
|
1054 | dest.seek(0) | |
|
1307 | dest = io.BytesIO(dest.getvalue()) | |
|
1055 | 1308 | |
|
1056 | 1309 | dctx = zstd.ZstdDecompressor() |
|
1057 | 1310 | with dctx.stream_reader(dest) as reader: |
|
1058 | with tarfile.open(mode='r', fileobj=reader) as tf: | |

1311 | with tarfile.open(mode='r|', fileobj=reader) as tf: | |
|
1059 | 1312 | for member in tf: |
|
1060 | 1313 | self.assertEqual(member.name, 'test_compressor.py') |
|
1061 | 1314 | |
|
1315 | ||
|
1062 | 1316 | @make_cffi |
|
1063 | 1317 | class TestCompressor_read_to_iter(unittest.TestCase): |
|
1064 | 1318 | def test_type_validation(self): |
@@ -1192,7 +1446,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1192 | 1446 | |
|
1193 | 1447 | it = chunker.finish() |
|
1194 | 1448 | |
|
1195 | self.assertEqual(next(it), b'\x28\xb5\x2f\xfd\x00\x5 | |

1449 | self.assertEqual(next(it), b'\x28\xb5\x2f\xfd\x00\x58\x01\x00\x00') | |
|
1196 | 1450 | |
|
1197 | 1451 | with self.assertRaises(StopIteration): |
|
1198 | 1452 | next(it) |
@@ -1214,7 +1468,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1214 | 1468 | it = chunker.finish() |
|
1215 | 1469 | |
|
1216 | 1470 | self.assertEqual(next(it), |
|
1217 | b'\x28\xb5\x2f\xfd\x00\x5 | |

1471 | b'\x28\xb5\x2f\xfd\x00\x58\x7d\x00\x00\x48\x66\x6f' | |
|
1218 | 1472 | b'\x6f\x62\x61\x72\x62\x61\x7a\x01\x00\xe4\xe4\x8e') |
|
1219 | 1473 | |
|
1220 | 1474 | with self.assertRaises(StopIteration): |
@@ -1258,7 +1512,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1258 | 1512 | |
|
1259 | 1513 | self.assertEqual( |
|
1260 | 1514 | b''.join(chunks), |
|
1261 | b'\x28\xb5\x2f\xfd\x00\x5 | |

1515 | b'\x28\xb5\x2f\xfd\x00\x58\x55\x00\x00\x18\x66\x6f\x6f\x01\x00' | |
|
1262 | 1516 | b'\xfa\xd3\x77\x43') |
|
1263 | 1517 | |
|
1264 | 1518 | dctx = zstd.ZstdDecompressor() |
@@ -1283,7 +1537,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1283 | 1537 | |
|
1284 | 1538 | self.assertEqual(list(chunker.compress(source)), []) |
|
1285 | 1539 | self.assertEqual(list(chunker.finish()), [ |
|
1286 | b'\x28\xb5\x2f\xfd\x00\x5 | |

1540 | b'\x28\xb5\x2f\xfd\x00\x58\x19\x00\x00\x66\x6f\x6f' | |
|
1287 | 1541 | ]) |
|
1288 | 1542 | |
|
1289 | 1543 | def test_flush(self): |
@@ -1296,7 +1550,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1296 | 1550 | chunks1 = list(chunker.flush()) |
|
1297 | 1551 | |
|
1298 | 1552 | self.assertEqual(chunks1, [ |
|
1299 | b'\x28\xb5\x2f\xfd\x00\x5 | |

1553 | b'\x28\xb5\x2f\xfd\x00\x58\x8c\x00\x00\x30\x66\x6f\x6f\x62\x61\x72' | |
|
1300 | 1554 | b'\x02\x00\xfa\x03\xfe\xd0\x9f\xbe\x1b\x02' |
|
1301 | 1555 | ]) |
|
1302 | 1556 | |
@@ -1326,7 +1580,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1326 | 1580 | |
|
1327 | 1581 | with self.assertRaisesRegexp( |
|
1328 | 1582 | zstd.ZstdError, |
|
1329 | 'cannot call compress\(\) after compression finished'): | |
|
1583 | r'cannot call compress\(\) after compression finished'): | |
|
1330 | 1584 | list(chunker.compress(b'foo')) |
|
1331 | 1585 | |
|
1332 | 1586 | def test_flush_after_finish(self): |
@@ -1338,7 +1592,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1338 | 1592 | |
|
1339 | 1593 | with self.assertRaisesRegexp( |
|
1340 | 1594 | zstd.ZstdError, |
|
1341 | 'cannot call flush\(\) after compression finished'): | |
|
1595 | r'cannot call flush\(\) after compression finished'): | |
|
1342 | 1596 | list(chunker.flush()) |
|
1343 | 1597 | |
|
1344 | 1598 | def test_finish_after_finish(self): |
@@ -1350,7 +1604,7 b' class TestCompressor_chunker(unittest.Te' | |||
|
1350 | 1604 | |
|
1351 | 1605 | with self.assertRaisesRegexp( |
|
1352 | 1606 | zstd.ZstdError, |
|
1353 | 'cannot call finish\(\) after compression finished'): | |
|
1607 | r'cannot call finish\(\) after compression finished'): | |
|
1354 | 1608 | list(chunker.finish()) |
|
1355 | 1609 | |
|
1356 | 1610 | |
@@ -1358,6 +1612,9 b' class TestCompressor_multi_compress_to_b' | |||
|
1358 | 1612 | def test_invalid_inputs(self): |
|
1359 | 1613 | cctx = zstd.ZstdCompressor() |
|
1360 | 1614 | |
|
1615 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1616 | self.skipTest('multi_compress_to_buffer not available') | |
|
1617 | ||
|
1361 | 1618 | with self.assertRaises(TypeError): |
|
1362 | 1619 | cctx.multi_compress_to_buffer(True) |
|
1363 | 1620 | |
@@ -1370,6 +1627,9 b' class TestCompressor_multi_compress_to_b' | |||
|
1370 | 1627 | def test_empty_input(self): |
|
1371 | 1628 | cctx = zstd.ZstdCompressor() |
|
1372 | 1629 | |
|
1630 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1631 | self.skipTest('multi_compress_to_buffer not available') | |
|
1632 | ||
|
1373 | 1633 | with self.assertRaisesRegexp(ValueError, 'no source elements found'): |
|
1374 | 1634 | cctx.multi_compress_to_buffer([]) |
|
1375 | 1635 | |
@@ -1379,6 +1639,9 b' class TestCompressor_multi_compress_to_b' | |||
|
1379 | 1639 | def test_list_input(self): |
|
1380 | 1640 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1381 | 1641 | |
|
1642 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1643 | self.skipTest('multi_compress_to_buffer not available') | |
|
1644 | ||
|
1382 | 1645 | original = [b'foo' * 12, b'bar' * 6] |
|
1383 | 1646 | frames = [cctx.compress(c) for c in original] |
|
1384 | 1647 | b = cctx.multi_compress_to_buffer(original) |
@@ -1394,6 +1657,9 b' class TestCompressor_multi_compress_to_b' | |||
|
1394 | 1657 | def test_buffer_with_segments_input(self): |
|
1395 | 1658 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1396 | 1659 | |
|
1660 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1661 | self.skipTest('multi_compress_to_buffer not available') | |
|
1662 | ||
|
1397 | 1663 | original = [b'foo' * 4, b'bar' * 6] |
|
1398 | 1664 | frames = [cctx.compress(c) for c in original] |
|
1399 | 1665 | |
@@ -1412,6 +1678,9 b' class TestCompressor_multi_compress_to_b' | |||
|
1412 | 1678 | def test_buffer_with_segments_collection_input(self): |
|
1413 | 1679 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1414 | 1680 | |
|
1681 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1682 | self.skipTest('multi_compress_to_buffer not available') | |
|
1683 | ||
|
1415 | 1684 | original = [ |
|
1416 | 1685 | b'foo1', |
|
1417 | 1686 | b'foo2' * 2, |
@@ -1449,6 +1718,9 b' class TestCompressor_multi_compress_to_b' | |||
|
1449 | 1718 | |
|
1450 | 1719 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1451 | 1720 | |
|
1721 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1722 | self.skipTest('multi_compress_to_buffer not available') | |
|
1723 | ||
|
1452 | 1724 | frames = [] |
|
1453 | 1725 | frames.extend(b'x' * 64 for i in range(256)) |
|
1454 | 1726 | frames.extend(b'y' * 64 for i in range(256)) |
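
Every ``multi_compress_to_buffer`` test now begins with a ``hasattr`` guard
because the API is not exposed by every backend (the CFFI backend in
particular may lack it). Callers can feature-detect the same way; a hedged
sketch::

    import zstandard as zstd

    cctx = zstd.ZstdCompressor()

    if hasattr(cctx, 'multi_compress_to_buffer'):
        result = cctx.multi_compress_to_buffer([b'foo' * 12, b'bar' * 6])
        assert len(result) == 2  # one compressed frame per input
    else:
        pass  # fall back to compressing the inputs one at a time
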
@@ -12,6 +12,7 b' import zstandard as zstd' | |||
|
12 | 12 | |
|
13 | 13 | from . common import ( |
|
14 | 14 | make_cffi, |
|
15 | NonClosingBytesIO, | |
|
15 | 16 | random_input_data, |
|
16 | 17 | ) |
|
17 | 18 | |
@@ -19,6 +20,62 b' from . common import (' | |||
|
19 | 20 | @unittest.skipUnless('ZSTD_SLOW_TESTS' in os.environ, 'ZSTD_SLOW_TESTS not set') |
|
20 | 21 | @make_cffi |
|
21 | 22 | class TestCompressor_stream_reader_fuzzing(unittest.TestCase): |
|
23 | @hypothesis.settings( | |
|
24 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
25 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
26 | level=strategies.integers(min_value=1, max_value=5), | |
|
27 | source_read_size=strategies.integers(1, 16384), | |
|
28 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
29 | def test_stream_source_read(self, original, level, source_read_size, | |
|
30 | read_size): | |
|
31 | if read_size == 0: | |
|
32 | read_size = -1 | |
|
33 | ||
|
34 | refctx = zstd.ZstdCompressor(level=level) | |
|
35 | ref_frame = refctx.compress(original) | |
|
36 | ||
|
37 | cctx = zstd.ZstdCompressor(level=level) | |
|
38 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
39 | read_size=source_read_size) as reader: | |
|
40 | chunks = [] | |
|
41 | while True: | |
|
42 | chunk = reader.read(read_size) | |
|
43 | if not chunk: | |
|
44 | break | |
|
45 | ||
|
46 | chunks.append(chunk) | |
|
47 | ||
|
48 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
49 | ||
|
50 | @hypothesis.settings( | |
|
51 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
52 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
53 | level=strategies.integers(min_value=1, max_value=5), | |
|
54 | source_read_size=strategies.integers(1, 16384), | |
|
55 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
56 | def test_buffer_source_read(self, original, level, source_read_size, | |
|
57 | read_size): | |
|
58 | if read_size == 0: | |
|
59 | read_size = -1 | |
|
60 | ||
|
61 | refctx = zstd.ZstdCompressor(level=level) | |
|
62 | ref_frame = refctx.compress(original) | |
|
63 | ||
|
64 | cctx = zstd.ZstdCompressor(level=level) | |
|
65 | with cctx.stream_reader(original, size=len(original), | |
|
66 | read_size=source_read_size) as reader: | |
|
67 | chunks = [] | |
|
68 | while True: | |
|
69 | chunk = reader.read(read_size) | |
|
70 | if not chunk: | |
|
71 | break | |
|
72 | ||
|
73 | chunks.append(chunk) | |
|
74 | ||
|
75 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
76 | ||
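
These fuzzing tests all check a single invariant: however the compressed
stream is pulled out of ``stream_reader()`` (any ``read_size``, including
``-1``), the concatenated chunks must equal the frame produced by one-shot
``compress()``. A drawn size of ``0`` is normalized to ``-1`` because
``read(0)`` is a legal no-op that would end the read loop early. The
invariant, distilled::

    import io
    import zstandard as zstd

    original = b'hello world' * 1000
    ref_frame = zstd.ZstdCompressor(level=3).compress(original)

    cctx = zstd.ZstdCompressor(level=3)
    chunks = []
    with cctx.stream_reader(io.BytesIO(original),
                            size=len(original)) as reader:
        while True:
            chunk = reader.read(4096)
            if not chunk:
                break
            chunks.append(chunk)

    assert b''.join(chunks) == ref_frame  # byte-identical to one-shot output
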
|
77 | @hypothesis.settings( | |
|
78 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
22 | 79 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), |
|
23 | 80 | level=strategies.integers(min_value=1, max_value=5), |
|
24 | 81 | source_read_size=strategies.integers(1, 16384), |
@@ -33,15 +90,17 b' class TestCompressor_stream_reader_fuzzi' | |||
|
33 | 90 | read_size=source_read_size) as reader: |
|
34 | 91 | chunks = [] |
|
35 | 92 | while True: |
|
36 | read_size = read_sizes.draw(strategies.integers(1, 16384)) | |
|
93 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) | |
|
37 | 94 | chunk = reader.read(read_size) |
|
95 | if not chunk and read_size: | |
|
96 | break | |
|
38 | 97 | |
|
39 | if not chunk: | |
|
40 | break | |
|
41 | 98 | chunks.append(chunk) |
|
42 | 99 | |
|
43 | 100 | self.assertEqual(b''.join(chunks), ref_frame) |
|
44 | 101 | |
|
102 | @hypothesis.settings( | |
|
103 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
45 | 104 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), |
|
46 | 105 | level=strategies.integers(min_value=1, max_value=5), |
|
47 | 106 | source_read_size=strategies.integers(1, 16384), |
@@ -57,14 +116,343 b' class TestCompressor_stream_reader_fuzzi' | |||
|
57 | 116 | read_size=source_read_size) as reader: |
|
58 | 117 | chunks = [] |
|
59 | 118 | while True: |
|
119 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) | |
|
120 | chunk = reader.read(read_size) | |
|
121 | if not chunk and read_size: | |
|
122 | break | |
|
123 | ||
|
124 | chunks.append(chunk) | |
|
125 | ||
|
126 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
127 | ||
|
128 | @hypothesis.settings( | |
|
129 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
130 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
131 | level=strategies.integers(min_value=1, max_value=5), | |
|
132 | source_read_size=strategies.integers(1, 16384), | |
|
133 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
134 | def test_stream_source_readinto(self, original, level, | |
|
135 | source_read_size, read_size): | |
|
136 | refctx = zstd.ZstdCompressor(level=level) | |
|
137 | ref_frame = refctx.compress(original) | |
|
138 | ||
|
139 | cctx = zstd.ZstdCompressor(level=level) | |
|
140 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
141 | read_size=source_read_size) as reader: | |
|
142 | chunks = [] | |
|
143 | while True: | |
|
144 | b = bytearray(read_size) | |
|
145 | count = reader.readinto(b) | |
|
146 | ||
|
147 | if not count: | |
|
148 | break | |
|
149 | ||
|
150 | chunks.append(bytes(b[0:count])) | |
|
151 | ||
|
152 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
153 | ||
|
154 | @hypothesis.settings( | |
|
155 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
156 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
157 | level=strategies.integers(min_value=1, max_value=5), | |
|
158 | source_read_size=strategies.integers(1, 16384), | |
|
159 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
160 | def test_buffer_source_readinto(self, original, level, | |
|
161 | source_read_size, read_size): | |
|
162 | ||
|
163 | refctx = zstd.ZstdCompressor(level=level) | |
|
164 | ref_frame = refctx.compress(original) | |
|
165 | ||
|
166 | cctx = zstd.ZstdCompressor(level=level) | |
|
167 | with cctx.stream_reader(original, size=len(original), | |
|
168 | read_size=source_read_size) as reader: | |
|
169 | chunks = [] | |
|
170 | while True: | |
|
171 | b = bytearray(read_size) | |
|
172 | count = reader.readinto(b) | |
|
173 | ||
|
174 | if not count: | |
|
175 | break | |
|
176 | ||
|
177 | chunks.append(bytes(b[0:count])) | |
|
178 | ||
|
179 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
180 | ||
|
181 | @hypothesis.settings( | |
|
182 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
183 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
184 | level=strategies.integers(min_value=1, max_value=5), | |
|
185 | source_read_size=strategies.integers(1, 16384), | |
|
186 | read_sizes=strategies.data()) | |
|
187 | def test_stream_source_readinto_variance(self, original, level, | |
|
188 | source_read_size, read_sizes): | |
|
189 | refctx = zstd.ZstdCompressor(level=level) | |
|
190 | ref_frame = refctx.compress(original) | |
|
191 | ||
|
192 | cctx = zstd.ZstdCompressor(level=level) | |
|
193 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
194 | read_size=source_read_size) as reader: | |
|
195 | chunks = [] | |
|
196 | while True: | |
|
60 | 197 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
|
61 | chunk = reader.read(read_size) | |

198 | b = bytearray(read_size) | |
|
199 | count = reader.readinto(b) | |
|
200 | ||
|
201 | if not count: | |
|
202 | break | |
|
203 | ||
|
204 | chunks.append(bytes(b[0:count])) | |
|
205 | ||
|
206 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
207 | ||
|
208 | @hypothesis.settings( | |
|
209 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
210 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
211 | level=strategies.integers(min_value=1, max_value=5), | |
|
212 | source_read_size=strategies.integers(1, 16384), | |
|
213 | read_sizes=strategies.data()) | |
|
214 | def test_buffer_source_readinto_variance(self, original, level, | |
|
215 | source_read_size, read_sizes): | |
|
216 | ||
|
217 | refctx = zstd.ZstdCompressor(level=level) | |
|
218 | ref_frame = refctx.compress(original) | |
|
219 | ||
|
220 | cctx = zstd.ZstdCompressor(level=level) | |
|
221 | with cctx.stream_reader(original, size=len(original), | |
|
222 | read_size=source_read_size) as reader: | |
|
223 | chunks = [] | |
|
224 | while True: | |
|
225 | read_size = read_sizes.draw(strategies.integers(1, 16384)) | |
|
226 | b = bytearray(read_size) | |
|
227 | count = reader.readinto(b) | |
|
228 | ||
|
229 | if not count: | |
|
230 | break | |
|
231 | ||
|
232 | chunks.append(bytes(b[0:count])) | |
|
233 | ||
|
234 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
235 | ||
|
236 | @hypothesis.settings( | |
|
237 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
238 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
239 | level=strategies.integers(min_value=1, max_value=5), | |
|
240 | source_read_size=strategies.integers(1, 16384), | |
|
241 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
242 | def test_stream_source_read1(self, original, level, source_read_size, | |
|
243 | read_size): | |
|
244 | if read_size == 0: | |
|
245 | read_size = -1 | |
|
246 | ||
|
247 | refctx = zstd.ZstdCompressor(level=level) | |
|
248 | ref_frame = refctx.compress(original) | |
|
249 | ||
|
250 | cctx = zstd.ZstdCompressor(level=level) | |
|
251 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
252 | read_size=source_read_size) as reader: | |
|
253 | chunks = [] | |
|
254 | while True: | |
|
255 | chunk = reader.read1(read_size) | |
|
62 | 256 | if not chunk: |
|
63 | 257 | break |
|
258 | ||
|
64 | 259 | chunks.append(chunk) |
|
65 | 260 | |
|
66 | 261 | self.assertEqual(b''.join(chunks), ref_frame) |
|
67 | 262 | |
|
263 | @hypothesis.settings( | |
|
264 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
265 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
266 | level=strategies.integers(min_value=1, max_value=5), | |
|
267 | source_read_size=strategies.integers(1, 16384), | |
|
268 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
269 | def test_buffer_source_read1(self, original, level, source_read_size, | |
|
270 | read_size): | |
|
271 | if read_size == 0: | |
|
272 | read_size = -1 | |
|
273 | ||
|
274 | refctx = zstd.ZstdCompressor(level=level) | |
|
275 | ref_frame = refctx.compress(original) | |
|
276 | ||
|
277 | cctx = zstd.ZstdCompressor(level=level) | |
|
278 | with cctx.stream_reader(original, size=len(original), | |
|
279 | read_size=source_read_size) as reader: | |
|
280 | chunks = [] | |
|
281 | while True: | |
|
282 | chunk = reader.read1(read_size) | |
|
283 | if not chunk: | |
|
284 | break | |
|
285 | ||
|
286 | chunks.append(chunk) | |
|
287 | ||
|
288 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
289 | ||
|
290 | @hypothesis.settings( | |
|
291 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
292 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
293 | level=strategies.integers(min_value=1, max_value=5), | |
|
294 | source_read_size=strategies.integers(1, 16384), | |
|
295 | read_sizes=strategies.data()) | |
|
296 | def test_stream_source_read1_variance(self, original, level, source_read_size, | |
|
297 | read_sizes): | |
|
298 | refctx = zstd.ZstdCompressor(level=level) | |
|
299 | ref_frame = refctx.compress(original) | |
|
300 | ||
|
301 | cctx = zstd.ZstdCompressor(level=level) | |
|
302 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
303 | read_size=source_read_size) as reader: | |
|
304 | chunks = [] | |
|
305 | while True: | |
|
306 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) | |
|
307 | chunk = reader.read1(read_size) | |
|
308 | if not chunk and read_size: | |
|
309 | break | |
|
310 | ||
|
311 | chunks.append(chunk) | |
|
312 | ||
|
313 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
314 | ||
|
315 | @hypothesis.settings( | |
|
316 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
317 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
318 | level=strategies.integers(min_value=1, max_value=5), | |
|
319 | source_read_size=strategies.integers(1, 16384), | |
|
320 | read_sizes=strategies.data()) | |
|
321 | def test_buffer_source_read1_variance(self, original, level, source_read_size, | |
|
322 | read_sizes): | |
|
323 | ||
|
324 | refctx = zstd.ZstdCompressor(level=level) | |
|
325 | ref_frame = refctx.compress(original) | |
|
326 | ||
|
327 | cctx = zstd.ZstdCompressor(level=level) | |
|
328 | with cctx.stream_reader(original, size=len(original), | |
|
329 | read_size=source_read_size) as reader: | |
|
330 | chunks = [] | |
|
331 | while True: | |
|
332 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) | |
|
333 | chunk = reader.read1(read_size) | |
|
334 | if not chunk and read_size: | |
|
335 | break | |
|
336 | ||
|
337 | chunks.append(chunk) | |
|
338 | ||
|
339 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
340 | ||
|
341 | ||
|
342 | @hypothesis.settings( | |
|
343 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
344 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
345 | level=strategies.integers(min_value=1, max_value=5), | |
|
346 | source_read_size=strategies.integers(1, 16384), | |
|
347 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
348 | def test_stream_source_readinto1(self, original, level, source_read_size, | |
|
349 | read_size): | |
|
350 | if read_size == 0: | |
|
351 | read_size = -1 | |
|
352 | ||
|
353 | refctx = zstd.ZstdCompressor(level=level) | |
|
354 | ref_frame = refctx.compress(original) | |
|
355 | ||
|
356 | cctx = zstd.ZstdCompressor(level=level) | |
|
357 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
358 | read_size=source_read_size) as reader: | |
|
359 | chunks = [] | |
|
360 | while True: | |
|
361 | b = bytearray(read_size) | |
|
362 | count = reader.readinto1(b) | |
|
363 | ||
|
364 | if not count: | |
|
365 | break | |
|
366 | ||
|
367 | chunks.append(bytes(b[0:count])) | |
|
368 | ||
|
369 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
370 | ||
|
371 | @hypothesis.settings( | |
|
372 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
373 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
374 | level=strategies.integers(min_value=1, max_value=5), | |
|
375 | source_read_size=strategies.integers(1, 16384), | |
|
376 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
377 | def test_buffer_source_readinto1(self, original, level, source_read_size, | |
|
378 | read_size): | |
|
379 | if read_size == 0: | |
|
380 | read_size = -1 | |
|
381 | ||
|
382 | refctx = zstd.ZstdCompressor(level=level) | |
|
383 | ref_frame = refctx.compress(original) | |
|
384 | ||
|
385 | cctx = zstd.ZstdCompressor(level=level) | |
|
386 | with cctx.stream_reader(original, size=len(original), | |
|
387 | read_size=source_read_size) as reader: | |
|
388 | chunks = [] | |
|
389 | while True: | |
|
390 | b = bytearray(read_size) | |
|
391 | count = reader.readinto1(b) | |
|
392 | ||
|
393 | if not count: | |
|
394 | break | |
|
395 | ||
|
396 | chunks.append(bytes(b[0:count])) | |
|
397 | ||
|
398 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
399 | ||
|
400 | @hypothesis.settings( | |
|
401 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
402 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
403 | level=strategies.integers(min_value=1, max_value=5), | |
|
404 | source_read_size=strategies.integers(1, 16384), | |
|
405 | read_sizes=strategies.data()) | |
|
406 | def test_stream_source_readinto1_variance(self, original, level, source_read_size, | |
|
407 | read_sizes): | |
|
408 | refctx = zstd.ZstdCompressor(level=level) | |
|
409 | ref_frame = refctx.compress(original) | |
|
410 | ||
|
411 | cctx = zstd.ZstdCompressor(level=level) | |
|
412 | with cctx.stream_reader(io.BytesIO(original), size=len(original), | |
|
413 | read_size=source_read_size) as reader: | |
|
414 | chunks = [] | |
|
415 | while True: | |
|
416 | read_size = read_sizes.draw(strategies.integers(1, 16384)) | |
|
417 | b = bytearray(read_size) | |
|
418 | count = reader.readinto1(b) | |
|
419 | ||
|
420 | if not count: | |
|
421 | break | |
|
422 | ||
|
423 | chunks.append(bytes(b[0:count])) | |
|
424 | ||
|
425 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
426 | ||
|
427 | @hypothesis.settings( | |
|
428 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
429 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
430 | level=strategies.integers(min_value=1, max_value=5), | |
|
431 | source_read_size=strategies.integers(1, 16384), | |
|
432 | read_sizes=strategies.data()) | |
|
433 | def test_buffer_source_readinto1_variance(self, original, level, source_read_size, | |
|
434 | read_sizes): | |
|
435 | ||
|
436 | refctx = zstd.ZstdCompressor(level=level) | |
|
437 | ref_frame = refctx.compress(original) | |
|
438 | ||
|
439 | cctx = zstd.ZstdCompressor(level=level) | |
|
440 | with cctx.stream_reader(original, size=len(original), | |
|
441 | read_size=source_read_size) as reader: | |
|
442 | chunks = [] | |
|
443 | while True: | |
|
444 | read_size = read_sizes.draw(strategies.integers(1, 16384)) | |
|
445 | b = bytearray(read_size) | |
|
446 | count = reader.readinto1(b) | |
|
447 | ||
|
448 | if not count: | |
|
449 | break | |
|
450 | ||
|
451 | chunks.append(bytes(b[0:count])) | |
|
452 | ||
|
453 | self.assertEqual(b''.join(chunks), ref_frame) | |
|
454 | ||
|
455 | ||
|
68 | 456 | |
|
69 | 457 | @unittest.skipUnless('ZSTD_SLOW_TESTS' in os.environ, 'ZSTD_SLOW_TESTS not set') |
|
70 | 458 | @make_cffi |
@@ -77,7 +465,7 b' class TestCompressor_stream_writer_fuzzi' | |||
|
77 | 465 | ref_frame = refctx.compress(original) |
|
78 | 466 | |
|
79 | 467 | cctx = zstd.ZstdCompressor(level=level) |
|
80 | b = io.BytesIO() | |

468 | b = NonClosingBytesIO() | |
|
81 | 469 | with cctx.stream_writer(b, size=len(original), write_size=write_size) as compressor: |
|
82 | 470 | compressor.write(original) |
|
83 | 471 | |
@@ -219,6 +607,9 b' class TestCompressor_multi_compress_to_b' | |||
|
219 | 607 | write_checksum=True, |
|
220 | 608 | **kwargs) |
|
221 | 609 | |
|
610 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
611 | self.skipTest('multi_compress_to_buffer not available') | |
|
612 | ||
|
222 | 613 | result = cctx.multi_compress_to_buffer(original, threads=-1) |
|
223 | 614 | |
|
224 | 615 | self.assertEqual(len(result), len(original)) |
@@ -15,17 +15,17 b' class TestCompressionParameters(unittest' | |||
|
15 | 15 | chain_log=zstd.CHAINLOG_MIN, |
|
16 | 16 | hash_log=zstd.HASHLOG_MIN, |
|
17 | 17 | search_log=zstd.SEARCHLOG_MIN, |
|
18 | min_match=zstd.SEARCHLENGTH_MIN + 1, | |

18 | min_match=zstd.MINMATCH_MIN + 1, | |
|
19 | 19 | target_length=zstd.TARGETLENGTH_MIN, |
|
20 | compression_strategy=zstd.STRATEGY_FAST) | |

20 | strategy=zstd.STRATEGY_FAST) | |
|
21 | 21 | |
|
22 | 22 | zstd.ZstdCompressionParameters(window_log=zstd.WINDOWLOG_MAX, |
|
23 | 23 | chain_log=zstd.CHAINLOG_MAX, |
|
24 | 24 | hash_log=zstd.HASHLOG_MAX, |
|
25 | 25 | search_log=zstd.SEARCHLOG_MAX, |
|
26 | min_match=zstd.SEARCHLENGTH_MAX - 1, | |

26 | min_match=zstd.MINMATCH_MAX - 1, | |
|
27 | 27 | target_length=zstd.TARGETLENGTH_MAX, |
|
28 | compression_strategy=zstd.STRATEGY_BTULTRA) | |

28 | strategy=zstd.STRATEGY_BTULTRA2) | |
|
29 | 29 | |
|
30 | 30 | def test_from_level(self): |
|
31 | 31 | p = zstd.ZstdCompressionParameters.from_level(1) |
@@ -43,7 +43,7 b' class TestCompressionParameters(unittest' | |||
|
43 | 43 | search_log=4, |
|
44 | 44 | min_match=5, |
|
45 | 45 | target_length=8, |
|
46 | compression_strategy=1) | |

46 | strategy=1) | |
|
47 | 47 | self.assertEqual(p.window_log, 10) |
|
48 | 48 | self.assertEqual(p.chain_log, 6) |
|
49 | 49 | self.assertEqual(p.hash_log, 7) |
@@ -59,9 +59,10 b' class TestCompressionParameters(unittest' | |||
|
59 | 59 | self.assertEqual(p.threads, 4) |
|
60 | 60 | |
|
61 | 61 | p = zstd.ZstdCompressionParameters(threads=2, job_size=1048576, |
|
62 | overlap_size_log=6) | |

62 | overlap_log=6) | |
|
63 | 63 | self.assertEqual(p.threads, 2) |
|
64 | 64 | self.assertEqual(p.job_size, 1048576) |
|
65 | self.assertEqual(p.overlap_log, 6) | |
|
65 | 66 | self.assertEqual(p.overlap_size_log, 6) |
|
66 | 67 | |
|
67 | 68 | p = zstd.ZstdCompressionParameters(compression_level=-1) |
@@ -85,8 +86,9 b' class TestCompressionParameters(unittest' | |||
|
85 | 86 | p = zstd.ZstdCompressionParameters(ldm_bucket_size_log=7) |
|
86 | 87 | self.assertEqual(p.ldm_bucket_size_log, 7) |
|
87 | 88 | |
|
88 | p = zstd.ZstdCompressionParameters(ldm_hash_every_log=8) | |

89 | p = zstd.ZstdCompressionParameters(ldm_hash_rate_log=8) | |
|
89 | 90 | self.assertEqual(p.ldm_hash_every_log, 8) |
|
91 | self.assertEqual(p.ldm_hash_rate_log, 8) | |
|
90 | 92 | |
|
91 | 93 | def test_estimated_compression_context_size(self): |
|
92 | 94 | p = zstd.ZstdCompressionParameters(window_log=20, |
@@ -95,12 +97,44 b' class TestCompressionParameters(unittest' | |||
|
95 | 97 | search_log=1, |
|
96 | 98 | min_match=5, |
|
97 | 99 | target_length=16, |
|
98 | compression_strategy=zstd.STRATEGY_DFAST) | |

100 | strategy=zstd.STRATEGY_DFAST) | |
|
99 | 101 | |
|
100 | 102 | # 32-bit has slightly different values from 64-bit. |
|
101 | 103 | self.assertAlmostEqual(p.estimated_compression_context_size(), 1294072, |
|
102 | 104 | delta=250) |
|
103 | 105 | |
|
106 | def test_strategy(self): | |
|
107 | with self.assertRaisesRegexp(ValueError, 'cannot specify both compression_strategy'): | |
|
108 | zstd.ZstdCompressionParameters(strategy=0, compression_strategy=0) | |
|
109 | ||
|
110 | p = zstd.ZstdCompressionParameters(strategy=2) | |
|
111 | self.assertEqual(p.compression_strategy, 2) | |
|
112 | ||
|
113 | p = zstd.ZstdCompressionParameters(strategy=3) | |
|
114 | self.assertEqual(p.compression_strategy, 3) | |
|
115 | ||
|
116 | def test_ldm_hash_rate_log(self): | |
|
117 | with self.assertRaisesRegexp(ValueError, 'cannot specify both ldm_hash_rate_log'): | |
|
118 | zstd.ZstdCompressionParameters(ldm_hash_rate_log=8, ldm_hash_every_log=4) | |
|
119 | ||
|
120 | p = zstd.ZstdCompressionParameters(ldm_hash_rate_log=8) | |
|
121 | self.assertEqual(p.ldm_hash_every_log, 8) | |
|
122 | ||
|
123 | p = zstd.ZstdCompressionParameters(ldm_hash_every_log=16) | |
|
124 | self.assertEqual(p.ldm_hash_every_log, 16) | |
|
125 | ||
|
126 | def test_overlap_log(self): | |
|
127 | with self.assertRaisesRegexp(ValueError, 'cannot specify both overlap_log'): | |
|
128 | zstd.ZstdCompressionParameters(overlap_log=1, overlap_size_log=9) | |
|
129 | ||
|
130 | p = zstd.ZstdCompressionParameters(overlap_log=2) | |
|
131 | self.assertEqual(p.overlap_log, 2) | |
|
132 | self.assertEqual(p.overlap_size_log, 2) | |
|
133 | ||
|
134 | p = zstd.ZstdCompressionParameters(overlap_size_log=4) | |
|
135 | self.assertEqual(p.overlap_log, 4) | |
|
136 | self.assertEqual(p.overlap_size_log, 4) | |
|
137 | ||
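
The three new tests above pin down the parameter renames in this release:
``strategy``/``compression_strategy``, ``ldm_hash_rate_log``/
``ldm_hash_every_log`` and ``overlap_log``/``overlap_size_log`` are aliases,
readable under either name, and supplying both spellings of one knob raises
``ValueError``. For instance::

    import zstandard as zstd

    p = zstd.ZstdCompressionParameters(strategy=zstd.STRATEGY_FAST,
                                       overlap_log=6,
                                       ldm_hash_rate_log=8)
    assert p.overlap_log == p.overlap_size_log == 6
    assert p.ldm_hash_rate_log == p.ldm_hash_every_log == 8

    try:
        zstd.ZstdCompressionParameters(overlap_log=1, overlap_size_log=9)
    except ValueError:
        pass  # both spellings of the same knob are rejected
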
|
104 | 138 | |
|
105 | 139 | @make_cffi |
|
106 | 140 | class TestFrameParameters(unittest.TestCase): |
@@ -24,8 +24,8 b' s_hashlog = strategies.integers(min_valu' | |||
|
24 | 24 | max_value=zstd.HASHLOG_MAX) |
|
25 | 25 | s_searchlog = strategies.integers(min_value=zstd.SEARCHLOG_MIN, |
|
26 | 26 | max_value=zstd.SEARCHLOG_MAX) |
|
27 | s_searchlength = strategies.integers(min_value=zstd.SEARCHLENGTH_MIN, | |

28 | max_value=zstd.SEARCHLENGTH_MAX) | |

27 | s_minmatch = strategies.integers(min_value=zstd.MINMATCH_MIN, | |

28 | max_value=zstd.MINMATCH_MAX) | |
|
29 | 29 | s_targetlength = strategies.integers(min_value=zstd.TARGETLENGTH_MIN, |
|
30 | 30 | max_value=zstd.TARGETLENGTH_MAX) |
|
31 | 31 | s_strategy = strategies.sampled_from((zstd.STRATEGY_FAST, |
@@ -35,41 +35,42 b' s_strategy = strategies.sampled_from((zs' | |||
|
35 | 35 | zstd.STRATEGY_LAZY2, |
|
36 | 36 | zstd.STRATEGY_BTLAZY2, |
|
37 | 37 | zstd.STRATEGY_BTOPT, |
|
38 | zstd.STRATEGY_BTULTRA)) | |

38 | zstd.STRATEGY_BTULTRA, | |

39 | zstd.STRATEGY_BTULTRA2)) | |
|
39 | 40 | |
|
40 | 41 | |
|
41 | 42 | @make_cffi |
|
42 | 43 | @unittest.skipUnless('ZSTD_SLOW_TESTS' in os.environ, 'ZSTD_SLOW_TESTS not set') |
|
43 | 44 | class TestCompressionParametersHypothesis(unittest.TestCase): |
|
44 | 45 | @hypothesis.given(s_windowlog, s_chainlog, s_hashlog, s_searchlog, |
|
45 | s_searchlength, s_targetlength, s_strategy) | |

46 | s_minmatch, s_targetlength, s_strategy) | |
|
46 | 47 | def test_valid_init(self, windowlog, chainlog, hashlog, searchlog, |
|
47 | searchlength, targetlength, strategy): | |

48 | minmatch, targetlength, strategy): | |
|
48 | 49 | zstd.ZstdCompressionParameters(window_log=windowlog, |
|
49 | 50 | chain_log=chainlog, |
|
50 | 51 | hash_log=hashlog, |
|
51 | 52 | search_log=searchlog, |
|
52 | min_match=searchlength, | |

53 | min_match=minmatch, | |
|
53 | 54 | target_length=targetlength, |
|
54 | compression_strategy=strategy) | |

55 | strategy=strategy) | |
|
55 | 56 | |
|
56 | 57 | @hypothesis.given(s_windowlog, s_chainlog, s_hashlog, s_searchlog, |
|
57 | s_searchlength, s_targetlength, s_strategy) | |

58 | s_minmatch, s_targetlength, s_strategy) | |
|
58 | 59 | def test_estimated_compression_context_size(self, windowlog, chainlog, |
|
59 | 60 | hashlog, searchlog, |
|
60 | searchlength, targetlength, | |

61 | minmatch, targetlength, | |
|
61 | 62 | strategy): |
|
62 | if searchlength == zstd.SEARCHLENGTH_MIN and strategy in (zstd.STRATEGY_FAST, zstd.STRATEGY_GREEDY): | |

63 | searchlength += 1 | |

64 | elif searchlength == zstd.SEARCHLENGTH_MAX and strategy != zstd.STRATEGY_FAST: | |

65 | searchlength -= 1 | |

63 | if minmatch == zstd.MINMATCH_MIN and strategy in (zstd.STRATEGY_FAST, zstd.STRATEGY_GREEDY): | |

64 | minmatch += 1 | |

65 | elif minmatch == zstd.MINMATCH_MAX and strategy != zstd.STRATEGY_FAST: | |

66 | minmatch -= 1 | |
|
66 | 67 | |
|
67 | 68 | p = zstd.ZstdCompressionParameters(window_log=windowlog, |
|
68 | 69 | chain_log=chainlog, |
|
69 | 70 | hash_log=hashlog, |
|
70 | 71 | search_log=searchlog, |
|
71 | min_match=searchlength, | |

72 | min_match=minmatch, | |
|
72 | 73 | target_length=targetlength, |
|
73 | compression_strategy=strategy) | |

74 | strategy=strategy) | |
|
74 | 75 | size = p.estimated_compression_context_size() |
|
75 | 76 |
@@ -3,6 +3,7 b' import os' | |||
|
3 | 3 | import random |
|
4 | 4 | import struct |
|
5 | 5 | import sys |
|
6 | import tempfile | |
|
6 | 7 | import unittest |
|
7 | 8 | |
|
8 | 9 | import zstandard as zstd |
@@ -10,6 +11,7 b' import zstandard as zstd' | |||
|
10 | 11 | from .common import ( |
|
11 | 12 | generate_samples, |
|
12 | 13 | make_cffi, |
|
14 | NonClosingBytesIO, | |
|
13 | 15 | OpCountingBytesIO, |
|
14 | 16 | ) |
|
15 | 17 | |
@@ -219,7 +221,7 b' class TestDecompressor_decompress(unitte' | |||
|
219 | 221 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
220 | 222 | frame = cctx.compress(source) |
|
221 | 223 | |
|
222 | dctx = zstd.ZstdDecompressor(max_window_size= | |

224 | dctx = zstd.ZstdDecompressor(max_window_size=2**zstd.WINDOWLOG_MIN) | |
|
223 | 225 | |
|
224 | 226 | with self.assertRaisesRegexp( |
|
225 | 227 | zstd.ZstdError, 'decompression error: Frame requires too much memory'): |
@@ -302,19 +304,16 b' class TestDecompressor_stream_reader(uni' | |||
|
302 | 304 | dctx = zstd.ZstdDecompressor() |
|
303 | 305 | |
|
304 | 306 | with dctx.stream_reader(b'foo') as reader: |
|
305 | with self.assertRaises(NotImplementedError): | |

307 | with self.assertRaises(io.UnsupportedOperation): | |
|
306 | 308 | reader.readline() |
|
307 | 309 | |
|
308 | with self.assertRaises(NotImplementedError): | |

310 | with self.assertRaises(io.UnsupportedOperation): | |
|
309 | 311 | reader.readlines() |
|
310 | 312 | |
|
311 | with self.assertRaises(NotImplementedError): | |
|
|
312 | reader.readall() | |
|
313 | ||
|
314 | with self.assertRaises(NotImplementedError): | |
|
313 | with self.assertRaises(io.UnsupportedOperation): | |
|
315 | 314 | iter(reader) |
|
316 | 315 | |
|
317 | with self.assertRaises(NotImplementedError): | |

316 | with self.assertRaises(io.UnsupportedOperation): | |
|
318 | 317 | next(reader) |
|
319 | 318 | |
|
320 | 319 | with self.assertRaises(io.UnsupportedOperation): |
@@ -347,15 +346,18 b' class TestDecompressor_stream_reader(uni' | |||
|
347 | 346 | with self.assertRaisesRegexp(ValueError, 'stream is closed'): |
|
348 | 347 | reader.read(1) |
|
349 | 348 | |
|
350 | def test_bad_read_size(self): | |

349 | def test_read_sizes(self): | |
|
350 | cctx = zstd.ZstdCompressor() | |
|
351 | foo = cctx.compress(b'foo') | |
|
352 | ||
|
351 | 353 | dctx = zstd.ZstdDecompressor() |
|
352 | 354 | |
|
353 | with dctx.stream_reader(b'foo') as reader: | |

354 | with self.assertRaisesRegexp(ValueError, 'cannot read negative or size 0 amounts'): | |

355 | reader.read(-1) | |

355 | with dctx.stream_reader(foo) as reader: | |
|
356 | with self.assertRaisesRegexp(ValueError, 'cannot read negative amounts less than -1'): | |
|
357 | reader.read(-2) | |
|
356 | 358 | |
|
357 | with self.assertRaisesRegexp(ValueError, 'cannot read negative or size 0 amounts'): | |
|
358 | reader.read(0) | |

359 | self.assertEqual(reader.read(0), b'') | |
|
360 | self.assertEqual(reader.read(), b'foo') | |
|
359 | 361 | |
|
360 | 362 | def test_read_buffer(self): |
|
361 | 363 | cctx = zstd.ZstdCompressor() |
@@ -524,13 +526,243 b' class TestDecompressor_stream_reader(uni' | |||
|
524 | 526 | reader = dctx.stream_reader(source) |
|
525 | 527 | |
|
526 | 528 | with reader: |
|
527 | with self.assertRaises(TypeError): | |
|
528 | reader.read() | |
|
529 | reader.read(0) | |
|
529 | 530 | |
|
530 | 531 | with reader: |
|
531 | 532 | with self.assertRaisesRegexp(ValueError, 'stream is closed'): |
|
532 | 533 | reader.read(100) |
|
533 | 534 | |
|
535 | def test_partial_read(self): | |
|
536 | # Inspired by https://github.com/indygreg/python-zstandard/issues/71. | |
|
537 | buffer = io.BytesIO() | |
|
538 | cctx = zstd.ZstdCompressor() | |
|
539 | writer = cctx.stream_writer(buffer) | |
|
540 | writer.write(bytearray(os.urandom(1000000))) | |
|
541 | writer.flush(zstd.FLUSH_FRAME) | |
|
542 | buffer.seek(0) | |
|
543 | ||
|
544 | dctx = zstd.ZstdDecompressor() | |
|
545 | reader = dctx.stream_reader(buffer) | |
|
546 | ||
|
547 | while True: | |
|
548 | chunk = reader.read(8192) | |
|
549 | if not chunk: | |
|
550 | break | |
|
551 | ||
|
552 | def test_read_multiple_frames(self): | |
|
553 | cctx = zstd.ZstdCompressor() | |
|
554 | source = io.BytesIO() | |
|
555 | writer = cctx.stream_writer(source) | |
|
556 | writer.write(b'foo') | |
|
557 | writer.flush(zstd.FLUSH_FRAME) | |
|
558 | writer.write(b'bar') | |
|
559 | writer.flush(zstd.FLUSH_FRAME) | |
|
560 | ||
|
561 | dctx = zstd.ZstdDecompressor() | |
|
562 | ||
|
563 | reader = dctx.stream_reader(source.getvalue()) | |
|
564 | self.assertEqual(reader.read(2), b'fo') | |
|
565 | self.assertEqual(reader.read(2), b'o') | |
|
566 | self.assertEqual(reader.read(2), b'ba') | |
|
567 | self.assertEqual(reader.read(2), b'r') | |
|
568 | ||
|
569 | source.seek(0) | |
|
570 | reader = dctx.stream_reader(source) | |
|
571 | self.assertEqual(reader.read(2), b'fo') | |
|
572 | self.assertEqual(reader.read(2), b'o') | |
|
573 | self.assertEqual(reader.read(2), b'ba') | |
|
574 | self.assertEqual(reader.read(2), b'r') | |
|
575 | ||
|
576 | reader = dctx.stream_reader(source.getvalue()) | |
|
577 | self.assertEqual(reader.read(3), b'foo') | |
|
578 | self.assertEqual(reader.read(3), b'bar') | |
|
579 | ||
|
580 | source.seek(0) | |
|
581 | reader = dctx.stream_reader(source) | |
|
582 | self.assertEqual(reader.read(3), b'foo') | |
|
583 | self.assertEqual(reader.read(3), b'bar') | |
|
584 | ||
|
585 | reader = dctx.stream_reader(source.getvalue()) | |
|
586 | self.assertEqual(reader.read(4), b'foo') | |
|
587 | self.assertEqual(reader.read(4), b'bar') | |
|
588 | ||
|
589 | source.seek(0) | |
|
590 | reader = dctx.stream_reader(source) | |
|
591 | self.assertEqual(reader.read(4), b'foo') | |
|
592 | self.assertEqual(reader.read(4), b'bar') | |
|
593 | ||
|
594 | reader = dctx.stream_reader(source.getvalue()) | |
|
595 | self.assertEqual(reader.read(128), b'foo') | |
|
596 | self.assertEqual(reader.read(128), b'bar') | |
|
597 | ||
|
598 | source.seek(0) | |
|
599 | reader = dctx.stream_reader(source) | |
|
600 | self.assertEqual(reader.read(128), b'foo') | |
|
601 | self.assertEqual(reader.read(128), b'bar') | |
|
602 | ||
|
603 | # Now tests for reads spanning frames. | |
|
604 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) | |
|
605 | self.assertEqual(reader.read(3), b'foo') | |
|
606 | self.assertEqual(reader.read(3), b'bar') | |
|
607 | ||
|
608 | source.seek(0) | |
|
609 | reader = dctx.stream_reader(source, read_across_frames=True) | |
|
610 | self.assertEqual(reader.read(3), b'foo') | |
|
611 | self.assertEqual(reader.read(3), b'bar') | |
|
612 | ||
|
613 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) | |
|
614 | self.assertEqual(reader.read(6), b'foobar') | |
|
615 | ||
|
616 | source.seek(0) | |
|
617 | reader = dctx.stream_reader(source, read_across_frames=True) | |
|
618 | self.assertEqual(reader.read(6), b'foobar') | |
|
619 | ||
|
620 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) | |
|
621 | self.assertEqual(reader.read(7), b'foobar') | |
|
622 | ||
|
623 | source.seek(0) | |
|
624 | reader = dctx.stream_reader(source, read_across_frames=True) | |
|
625 | self.assertEqual(reader.read(7), b'foobar') | |
|
626 | ||
|
627 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) | |
|
628 | self.assertEqual(reader.read(128), b'foobar') | |
|
629 | ||
|
630 | source.seek(0) | |
|
631 | reader = dctx.stream_reader(source, read_across_frames=True) | |
|
632 | self.assertEqual(reader.read(128), b'foobar') | |
|
633 | ||
|
634 | def test_readinto(self): | |
|
635 | cctx = zstd.ZstdCompressor() | |
|
636 | foo = cctx.compress(b'foo') | |
|
637 | ||
|
638 | dctx = zstd.ZstdDecompressor() | |
|
639 | ||
|
640 | # Attempting to readinto() a non-writable buffer fails. | |
|
641 | # The exact exception varies based on the backend. | |
|
642 | reader = dctx.stream_reader(foo) | |
|
643 | with self.assertRaises(Exception): | |
|
644 | reader.readinto(b'foobar') | |
|
645 | ||
|
646 | # readinto() with sufficiently large destination. | |
|
647 | b = bytearray(1024) | |
|
648 | reader = dctx.stream_reader(foo) | |
|
649 | self.assertEqual(reader.readinto(b), 3) | |
|
650 | self.assertEqual(b[0:3], b'foo') | |
|
651 | self.assertEqual(reader.readinto(b), 0) | |
|
652 | self.assertEqual(b[0:3], b'foo') | |
|
653 | ||
|
654 | # readinto() with small reads. | |
|
655 | b = bytearray(1024) | |
|
656 | reader = dctx.stream_reader(foo, read_size=1) | |
|
657 | self.assertEqual(reader.readinto(b), 3) | |
|
658 | self.assertEqual(b[0:3], b'foo') | |
|
659 | ||
|
660 | # Too small destination buffer. | |
|
661 | b = bytearray(2) | |
|
662 | reader = dctx.stream_reader(foo) | |
|
663 | self.assertEqual(reader.readinto(b), 2) | |
|
664 | self.assertEqual(b[:], b'fo') | |
|
665 | ||
|
666 | def test_readinto1(self): | |
|
667 | cctx = zstd.ZstdCompressor() | |
|
668 | foo = cctx.compress(b'foo') | |
|
669 | ||
|
670 | dctx = zstd.ZstdDecompressor() | |
|
671 | ||
|
672 | reader = dctx.stream_reader(foo) | |
|
673 | with self.assertRaises(Exception): | |
|
674 | reader.readinto1(b'foobar') | |
|
675 | ||
|
676 | # Sufficiently large destination. | |
|
677 | b = bytearray(1024) | |
|
678 | reader = dctx.stream_reader(foo) | |
|
679 | self.assertEqual(reader.readinto1(b), 3) | |
|
680 | self.assertEqual(b[0:3], b'foo') | |
|
681 | self.assertEqual(reader.readinto1(b), 0) | |
|
682 | self.assertEqual(b[0:3], b'foo') | |
|
683 | ||
|
684 | # readinto() with small reads. | |
|
685 | b = bytearray(1024) | |
|
686 | reader = dctx.stream_reader(foo, read_size=1) | |
|
687 | self.assertEqual(reader.readinto1(b), 3) | |
|
688 | self.assertEqual(b[0:3], b'foo') | |
|
689 | ||
|
690 | # Too small destination buffer. | |
|
691 | b = bytearray(2) | |
|
692 | reader = dctx.stream_reader(foo) | |
|
693 | self.assertEqual(reader.readinto1(b), 2) | |
|
694 | self.assertEqual(b[:], b'fo') | |
|
695 | ||
|
696 | def test_readall(self): | |
|
697 | cctx = zstd.ZstdCompressor() | |
|
698 | foo = cctx.compress(b'foo') | |
|
699 | ||
|
700 | dctx = zstd.ZstdDecompressor() | |
|
701 | reader = dctx.stream_reader(foo) | |
|
702 | ||
|
703 | self.assertEqual(reader.readall(), b'foo') | |
|
704 | ||
|
705 | def test_read1(self): | |
|
706 | cctx = zstd.ZstdCompressor() | |
|
707 | foo = cctx.compress(b'foo') | |
|
708 | ||
|
709 | dctx = zstd.ZstdDecompressor() | |
|
710 | ||
|
711 | b = OpCountingBytesIO(foo) | |
|
712 | reader = dctx.stream_reader(b) | |
|
713 | ||
|
714 | self.assertEqual(reader.read1(), b'foo') | |
|
715 | self.assertEqual(b._read_count, 1) | |
|
716 | ||
|
717 | b = OpCountingBytesIO(foo) | |
|
718 | reader = dctx.stream_reader(b) | |
|
719 | ||
|
720 | self.assertEqual(reader.read1(0), b'') | |
|
721 | self.assertEqual(reader.read1(2), b'fo') | |
|
722 | self.assertEqual(b._read_count, 1) | |
|
723 | self.assertEqual(reader.read1(1), b'o') | |
|
724 | self.assertEqual(b._read_count, 1) | |
|
725 | self.assertEqual(reader.read1(1), b'') | |
|
726 | self.assertEqual(b._read_count, 2) | |
|
727 | ||
|
728 | def test_read_lines(self): | |
|
729 | cctx = zstd.ZstdCompressor() | |
|
730 | source = b'\n'.join(('line %d' % i).encode('ascii') for i in range(1024)) | |
|
731 | ||
|
732 | frame = cctx.compress(source) | |
|
733 | ||
|
734 | dctx = zstd.ZstdDecompressor() | |
|
735 | reader = dctx.stream_reader(frame) | |
|
736 | tr = io.TextIOWrapper(reader, encoding='utf-8') | |
|
737 | ||
|
738 | lines = [] | |
|
739 | for line in tr: | |
|
740 | lines.append(line.encode('utf-8')) | |
|
741 | ||
|
742 | self.assertEqual(len(lines), 1024) | |
|
743 | self.assertEqual(b''.join(lines), source) | |
|
744 | ||
|
745 | reader = dctx.stream_reader(frame) | |
|
746 | tr = io.TextIOWrapper(reader, encoding='utf-8') | |
|
747 | ||
|
748 | lines = tr.readlines() | |
|
749 | self.assertEqual(len(lines), 1024) | |
|
750 | self.assertEqual(''.join(lines).encode('utf-8'), source) | |
|
751 | ||
|
752 | reader = dctx.stream_reader(frame) | |
|
753 | tr = io.TextIOWrapper(reader, encoding='utf-8') | |
|
754 | ||
|
755 | lines = [] | |
|
756 | while True: | |
|
757 | line = tr.readline() | |
|
758 | if not line: | |
|
759 | break | |
|
760 | ||
|
761 | lines.append(line.encode('utf-8')) | |
|
762 | ||
|
763 | self.assertEqual(len(lines), 1024) | |
|
764 | self.assertEqual(b''.join(lines), source) | |
|
765 | ||
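
Because the decompressing reader now implements enough of the ``io``
protocol, it can be wrapped in ``io.TextIOWrapper`` for line-oriented reads,
as the test above exercises three ways. Shortest form::

    import io
    import zstandard as zstd

    source = b'\n'.join(('line %d' % i).encode('ascii') for i in range(1024))
    frame = zstd.ZstdCompressor().compress(source)

    reader = zstd.ZstdDecompressor().stream_reader(frame)
    text = io.TextIOWrapper(reader, encoding='utf-8')

    assert len(text.readlines()) == 1024
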
|
534 | 766 | |
|
535 | 767 | @make_cffi |
|
536 | 768 | class TestDecompressor_decompressobj(unittest.TestCase): |
@@ -540,6 +772,9 b' class TestDecompressor_decompressobj(uni' | |||
|
540 | 772 | dctx = zstd.ZstdDecompressor() |
|
541 | 773 | dobj = dctx.decompressobj() |
|
542 | 774 | self.assertEqual(dobj.decompress(data), b'foobar') |
|
775 | self.assertIsNone(dobj.flush()) | |
|
776 | self.assertIsNone(dobj.flush(10)) | |
|
777 | self.assertIsNone(dobj.flush(length=100)) | |
|
543 | 778 | |
|
544 | 779 | def test_input_types(self): |
|
545 | 780 | compressed = zstd.ZstdCompressor(level=1).compress(b'foo') |
@@ -557,7 +792,11 b' class TestDecompressor_decompressobj(uni' | |||
|
557 | 792 | |
|
558 | 793 | for source in sources: |
|
559 | 794 | dobj = dctx.decompressobj() |
|
795 | self.assertIsNone(dobj.flush()) | |
|
796 | self.assertIsNone(dobj.flush(10)) | |
|
797 | self.assertIsNone(dobj.flush(length=100)) | |
|
560 | 798 | self.assertEqual(dobj.decompress(source), b'foo') |
|
799 | self.assertIsNone(dobj.flush()) | |
|
561 | 800 | |
|
562 | 801 | def test_reuse(self): |
|
563 | 802 | data = zstd.ZstdCompressor(level=1).compress(b'foobar') |
@@ -568,6 +807,7 b' class TestDecompressor_decompressobj(uni' | |||
|
568 | 807 | |
|
569 | 808 | with self.assertRaisesRegexp(zstd.ZstdError, 'cannot use a decompressobj'): |
|
570 | 809 | dobj.decompress(data) |
|
810 | self.assertIsNone(dobj.flush()) | |
|
571 | 811 | |
|
572 | 812 | def test_bad_write_size(self): |
|
573 | 813 | dctx = zstd.ZstdDecompressor() |
@@ -585,16 +825,141 b' class TestDecompressor_decompressobj(uni' | |||
|
585 | 825 | dobj = dctx.decompressobj(write_size=i + 1) |
|
586 | 826 | self.assertEqual(dobj.decompress(data), source) |
|
587 | 827 | |
|
828 | ||
|
588 | 829 | def decompress_via_writer(data): |
|
589 | 830 | buffer = io.BytesIO() |
|
590 | 831 | dctx = zstd.ZstdDecompressor() |
|
591 | with dctx.stream_writer(buffer) as decompressor: |
|
592 | decompressor.write(data) |
|
832 | decompressor = dctx.stream_writer(buffer) | |
|
833 | decompressor.write(data) | |
|
834 | ||
|
593 | 835 | return buffer.getvalue() |
|
594 | 836 | |
|
595 | 837 | |
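
The rewritten ``decompress_via_writer()`` helper reflects that ``stream_writer()`` objects no longer need to be used as context managers: ``write()`` takes effect immediately, and the context manager form now merely ``close()``s the stream on exit. A sketch of both styles, assuming ``frame`` holds a complete zstd frame::

    import io
    import zstandard as zstd

    dctx = zstd.ZstdDecompressor()

    # Unmanaged: write() pushes decompressed bytes straight through.
    buffer = io.BytesIO()
    writer = dctx.stream_writer(buffer)
    writer.write(frame)
    result = buffer.getvalue()

    # Managed: __exit__ closes the writer (and the wrapped stream).
    with dctx.stream_writer(io.BytesIO()) as writer:
        writer.write(frame)
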
|
596 | 838 | @make_cffi |
|
597 | 839 | class TestDecompressor_stream_writer(unittest.TestCase): |
|
840 | def test_io_api(self): | |
|
841 | buffer = io.BytesIO() | |
|
842 | dctx = zstd.ZstdDecompressor() | |
|
843 | writer = dctx.stream_writer(buffer) | |
|
844 | ||
|
845 | self.assertFalse(writer.closed) | |
|
846 | self.assertFalse(writer.isatty()) | |
|
847 | self.assertFalse(writer.readable()) | |
|
848 | ||
|
849 | with self.assertRaises(io.UnsupportedOperation): | |
|
850 | writer.readline() | |
|
851 | ||
|
852 | with self.assertRaises(io.UnsupportedOperation): | |
|
853 | writer.readline(42) | |
|
854 | ||
|
855 | with self.assertRaises(io.UnsupportedOperation): | |
|
856 | writer.readline(size=42) | |
|
857 | ||
|
858 | with self.assertRaises(io.UnsupportedOperation): | |
|
859 | writer.readlines() | |
|
860 | ||
|
861 | with self.assertRaises(io.UnsupportedOperation): | |
|
862 | writer.readlines(42) | |
|
863 | ||
|
864 | with self.assertRaises(io.UnsupportedOperation): | |
|
865 | writer.readlines(hint=42) | |
|
866 | ||
|
867 | with self.assertRaises(io.UnsupportedOperation): | |
|
868 | writer.seek(0) | |
|
869 | ||
|
870 | with self.assertRaises(io.UnsupportedOperation): | |
|
871 | writer.seek(10, os.SEEK_SET) | |
|
872 | ||
|
873 | self.assertFalse(writer.seekable()) | |
|
874 | ||
|
875 | with self.assertRaises(io.UnsupportedOperation): | |
|
876 | writer.tell() | |
|
877 | ||
|
878 | with self.assertRaises(io.UnsupportedOperation): | |
|
879 | writer.truncate() | |
|
880 | ||
|
881 | with self.assertRaises(io.UnsupportedOperation): | |
|
882 | writer.truncate(42) | |
|
883 | ||
|
884 | with self.assertRaises(io.UnsupportedOperation): | |
|
885 | writer.truncate(size=42) | |
|
886 | ||
|
887 | self.assertTrue(writer.writable()) | |
|
888 | ||
|
889 | with self.assertRaises(io.UnsupportedOperation): | |
|
890 | writer.writelines([]) | |
|
891 | ||
|
892 | with self.assertRaises(io.UnsupportedOperation): | |
|
893 | writer.read() | |
|
894 | ||
|
895 | with self.assertRaises(io.UnsupportedOperation): | |
|
896 | writer.read(42) | |
|
897 | ||
|
898 | with self.assertRaises(io.UnsupportedOperation): | |
|
899 | writer.read(size=42) | |
|
900 | ||
|
901 | with self.assertRaises(io.UnsupportedOperation): | |
|
902 | writer.readall() | |
|
903 | ||
|
904 | with self.assertRaises(io.UnsupportedOperation): | |
|
905 | writer.readinto(None) | |
|
906 | ||
|
907 | with self.assertRaises(io.UnsupportedOperation): | |
|
908 | writer.fileno() | |
|
909 | ||
|
910 | def test_fileno_file(self): | |
|
911 | with tempfile.TemporaryFile('wb') as tf: | |
|
912 | dctx = zstd.ZstdDecompressor() | |
|
913 | writer = dctx.stream_writer(tf) | |
|
914 | ||
|
915 | self.assertEqual(writer.fileno(), tf.fileno()) | |
|
916 | ||
|
917 | def test_close(self): | |
|
918 | foo = zstd.ZstdCompressor().compress(b'foo') | |
|
919 | ||
|
920 | buffer = NonClosingBytesIO() | |
|
921 | dctx = zstd.ZstdDecompressor() | |
|
922 | writer = dctx.stream_writer(buffer) | |
|
923 | ||
|
924 | writer.write(foo) | |
|
925 | self.assertFalse(writer.closed) | |
|
926 | self.assertFalse(buffer.closed) | |
|
927 | writer.close() | |
|
928 | self.assertTrue(writer.closed) | |
|
929 | self.assertTrue(buffer.closed) | |
|
930 | ||
|
931 | with self.assertRaisesRegexp(ValueError, 'stream is closed'): | |
|
932 | writer.write(b'') | |
|
933 | ||
|
934 | with self.assertRaisesRegexp(ValueError, 'stream is closed'): | |
|
935 | writer.flush() | |
|
936 | ||
|
937 | with self.assertRaisesRegexp(ValueError, 'stream is closed'): | |
|
938 | with writer: | |
|
939 | pass | |
|
940 | ||
|
941 | self.assertEqual(buffer.getvalue(), b'foo') | |
|
942 | ||
|
943 | # Context manager exit should close stream. | |
|
944 | buffer = NonClosingBytesIO() | |
|
945 | writer = dctx.stream_writer(buffer) | |
|
946 | ||
|
947 | with writer: | |
|
948 | writer.write(foo) | |
|
949 | ||
|
950 | self.assertTrue(writer.closed) | |
|
951 | self.assertEqual(buffer.getvalue(), b'foo') | |
|
952 | ||
|
953 | def test_flush(self): | |
|
954 | buffer = OpCountingBytesIO() | |
|
955 | dctx = zstd.ZstdDecompressor() | |
|
956 | writer = dctx.stream_writer(buffer) | |
|
957 | ||
|
958 | writer.flush() | |
|
959 | self.assertEqual(buffer._flush_count, 1) | |
|
960 | writer.flush() | |
|
961 | self.assertEqual(buffer._flush_count, 2) | |
|
962 | ||
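
``test_close`` and ``test_flush`` capture the writer's new ``io.RawIOBase``-style contract: ``close()`` (including via ``__exit__``) also closes the wrapped stream, subsequent operations raise ``ValueError('stream is closed')``, and ``flush()`` is forwarded to the inner stream. Roughly::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'foo')

    inner = io.BytesIO()
    writer = zstd.ZstdDecompressor().stream_writer(inner)

    writer.write(frame)
    writer.close()                     # also closes ``inner``
    assert writer.closed and inner.closed
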
|
598 | 963 | def test_empty_roundtrip(self): |
|
599 | 964 | cctx = zstd.ZstdCompressor() |
|
600 | 965 | empty = cctx.compress(b'') |
@@ -616,9 +981,21 b' class TestDecompressor_stream_writer(uni' | |||
|
616 | 981 | dctx = zstd.ZstdDecompressor() |
|
617 | 982 | for source in sources: |
|
618 | 983 | buffer = io.BytesIO() |
|
984 | ||
|
985 | decompressor = dctx.stream_writer(buffer) | |
|
986 | decompressor.write(source) | |
|
987 | self.assertEqual(buffer.getvalue(), b'foo') | |
|
988 | ||
|
989 | buffer = NonClosingBytesIO() | |
|
990 | ||
|
619 | 991 | with dctx.stream_writer(buffer) as decompressor: |
|
620 | decompressor.write(source) | |
|
992 | self.assertEqual(decompressor.write(source), 3) | |
|
993 | ||
|
994 | self.assertEqual(buffer.getvalue(), b'foo') | |
|
621 | 995 | |
|
996 | buffer = io.BytesIO() | |
|
997 | writer = dctx.stream_writer(buffer, write_return_read=True) | |
|
998 | self.assertEqual(writer.write(source), len(source)) | |
|
622 | 999 | self.assertEqual(buffer.getvalue(), b'foo') |
|
623 | 1000 | |
|
624 | 1001 | def test_large_roundtrip(self): |
@@ -641,7 +1018,7 b' class TestDecompressor_stream_writer(uni' | |||
|
641 | 1018 | cctx = zstd.ZstdCompressor() |
|
642 | 1019 | compressed = cctx.compress(orig) |
|
643 | 1020 | |
|
644 | buffer = io.BytesIO() |
|
1021 | buffer = NonClosingBytesIO() | |
|
645 | 1022 | dctx = zstd.ZstdDecompressor() |
|
646 | 1023 | with dctx.stream_writer(buffer) as decompressor: |
|
647 | 1024 | pos = 0 |
@@ -651,6 +1028,17 b' class TestDecompressor_stream_writer(uni' | |||
|
651 | 1028 | pos += 8192 |
|
652 | 1029 | self.assertEqual(buffer.getvalue(), orig) |
|
653 | 1030 | |
|
1031 | # Again with write_return_read=True | |
|
1032 | buffer = io.BytesIO() | |
|
1033 | writer = dctx.stream_writer(buffer, write_return_read=True) | |
|
1034 | pos = 0 | |
|
1035 | while pos < len(compressed): | |
|
1036 | pos2 = pos + 8192 | |
|
1037 | chunk = compressed[pos:pos2] | |
|
1038 | self.assertEqual(writer.write(chunk), len(chunk)) | |
|
1039 | pos += 8192 | |
|
1040 | self.assertEqual(buffer.getvalue(), orig) | |
|
1041 | ||
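
The second loop exercises the new ``write_return_read=True`` mode: ``write()`` then returns how many *input* (compressed) bytes were consumed rather than how many decompressed bytes were emitted, matching ``io.RawIOBase.write()`` semantics. A sketch::

    import io
    import zstandard as zstd

    orig = b'foobar' * 16384
    frame = zstd.ZstdCompressor().compress(orig)

    buffer = io.BytesIO()
    dctx = zstd.ZstdDecompressor()
    writer = dctx.stream_writer(buffer, write_return_read=True)

    assert writer.write(frame) == len(frame)   # bytes consumed, not emitted
    assert buffer.getvalue() == orig
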
|
654 | 1042 | def test_dictionary(self): |
|
655 | 1043 | samples = [] |
|
656 | 1044 | for i in range(128): |
@@ -661,7 +1049,7 b' class TestDecompressor_stream_writer(uni' | |||
|
661 | 1049 | d = zstd.train_dictionary(8192, samples) |
|
662 | 1050 | |
|
663 | 1051 | orig = b'foobar' * 16384 |
|
664 | buffer = io.BytesIO() |
|
1052 | buffer = NonClosingBytesIO() | |
|
665 | 1053 | cctx = zstd.ZstdCompressor(dict_data=d) |
|
666 | 1054 | with cctx.stream_writer(buffer) as compressor: |
|
667 | 1055 | self.assertEqual(compressor.write(orig), 0) |
@@ -670,6 +1058,12 b' class TestDecompressor_stream_writer(uni' | |||
|
670 | 1058 | buffer = io.BytesIO() |
|
671 | 1059 | |
|
672 | 1060 | dctx = zstd.ZstdDecompressor(dict_data=d) |
|
1061 | decompressor = dctx.stream_writer(buffer) | |
|
1062 | self.assertEqual(decompressor.write(compressed), len(orig)) | |
|
1063 | self.assertEqual(buffer.getvalue(), orig) | |
|
1064 | ||
|
1065 | buffer = NonClosingBytesIO() | |
|
1066 | ||
|
673 | 1067 | with dctx.stream_writer(buffer) as decompressor: |
|
674 | 1068 | self.assertEqual(decompressor.write(compressed), len(orig)) |
|
675 | 1069 | |
@@ -678,6 +1072,11 b' class TestDecompressor_stream_writer(uni' | |||
|
678 | 1072 | def test_memory_size(self): |
|
679 | 1073 | dctx = zstd.ZstdDecompressor() |
|
680 | 1074 | buffer = io.BytesIO() |
|
1075 | ||
|
1076 | decompressor = dctx.stream_writer(buffer) | |
|
1077 | size = decompressor.memory_size() | |
|
1078 | self.assertGreater(size, 100000) | |
|
1079 | ||
|
681 | 1080 | with dctx.stream_writer(buffer) as decompressor: |
|
682 | 1081 | size = decompressor.memory_size() |
|
683 | 1082 | |
@@ -810,7 +1209,7 b' class TestDecompressor_read_to_iter(unit' | |||
|
810 | 1209 | @unittest.skipUnless('ZSTD_SLOW_TESTS' in os.environ, 'ZSTD_SLOW_TESTS not set') |
|
811 | 1210 | def test_large_input(self): |
|
812 | 1211 | bytes = list(struct.Struct('>B').pack(i) for i in range(256)) |
|
813 | compressed = io.BytesIO() |
|
1212 | compressed = NonClosingBytesIO() | |
|
814 | 1213 | input_size = 0 |
|
815 | 1214 | cctx = zstd.ZstdCompressor(level=1) |
|
816 | 1215 | with cctx.stream_writer(compressed) as compressor: |
@@ -823,7 +1222,7 b' class TestDecompressor_read_to_iter(unit' | |||
|
823 | 1222 | if have_compressed and have_raw: |
|
824 | 1223 | break |
|
825 | 1224 | |
|
826 | compressed.seek(0) |
|
1225 | compressed = io.BytesIO(compressed.getvalue()) | |
|
827 | 1226 | self.assertGreater(len(compressed.getvalue()), |
|
828 | 1227 | zstd.DECOMPRESSION_RECOMMENDED_INPUT_SIZE) |
|
829 | 1228 | |
@@ -861,7 +1260,7 b' class TestDecompressor_read_to_iter(unit' | |||
|
861 | 1260 | |
|
862 | 1261 | source = io.BytesIO() |
|
863 | 1262 | |
|
864 | compressed = io.BytesIO() |
|
1263 | compressed = NonClosingBytesIO() | |
|
865 | 1264 | with cctx.stream_writer(compressed) as compressor: |
|
866 | 1265 | for i in range(256): |
|
867 | 1266 | chunk = b'\0' * 1024 |
@@ -874,7 +1273,7 b' class TestDecompressor_read_to_iter(unit' | |||
|
874 | 1273 | max_output_size=len(source.getvalue())) |
|
875 | 1274 | self.assertEqual(simple, source.getvalue()) |
|
876 | 1275 | |
|
877 | compressed.seek(0) |
|
1276 | compressed = io.BytesIO(compressed.getvalue()) | |
|
878 | 1277 | streamed = b''.join(dctx.read_to_iter(compressed)) |
|
879 | 1278 | self.assertEqual(streamed, source.getvalue()) |
|
880 | 1279 | |
@@ -1001,6 +1400,9 b' class TestDecompressor_multi_decompress_' | |||
|
1001 | 1400 | def test_invalid_inputs(self): |
|
1002 | 1401 | dctx = zstd.ZstdDecompressor() |
|
1003 | 1402 | |
|
1403 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1404 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1405 | ||
|
1004 | 1406 | with self.assertRaises(TypeError): |
|
1005 | 1407 | dctx.multi_decompress_to_buffer(True) |
|
1006 | 1408 | |
@@ -1020,6 +1422,10 b' class TestDecompressor_multi_decompress_' | |||
|
1020 | 1422 | frames = [cctx.compress(d) for d in original] |
|
1021 | 1423 | |
|
1022 | 1424 | dctx = zstd.ZstdDecompressor() |
|
1425 | ||
|
1426 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1427 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1428 | ||
|
1023 | 1429 | result = dctx.multi_decompress_to_buffer(frames) |
|
1024 | 1430 | |
|
1025 | 1431 | self.assertEqual(len(result), len(frames)) |
@@ -1041,6 +1447,10 b' class TestDecompressor_multi_decompress_' | |||
|
1041 | 1447 | sizes = struct.pack('=' + 'Q' * len(original), *map(len, original)) |
|
1042 | 1448 | |
|
1043 | 1449 | dctx = zstd.ZstdDecompressor() |
|
1450 | ||
|
1451 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1452 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1453 | ||
|
1044 | 1454 | result = dctx.multi_decompress_to_buffer(frames, decompressed_sizes=sizes) |
|
1045 | 1455 | |
|
1046 | 1456 | self.assertEqual(len(result), len(frames)) |
@@ -1057,6 +1467,9 b' class TestDecompressor_multi_decompress_' | |||
|
1057 | 1467 | |
|
1058 | 1468 | dctx = zstd.ZstdDecompressor() |
|
1059 | 1469 | |
|
1470 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1471 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1472 | ||
|
1060 | 1473 | segments = struct.pack('=QQQQ', 0, len(frames[0]), len(frames[0]), len(frames[1])) |
|
1061 | 1474 | b = zstd.BufferWithSegments(b''.join(frames), segments) |
|
1062 | 1475 | |
@@ -1074,12 +1487,16 b' class TestDecompressor_multi_decompress_' | |||
|
1074 | 1487 | frames = [cctx.compress(d) for d in original] |
|
1075 | 1488 | sizes = struct.pack('=' + 'Q' * len(original), *map(len, original)) |
|
1076 | 1489 | |
|
1490 | dctx = zstd.ZstdDecompressor() | |
|
1491 | ||
|
1492 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1493 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1494 | ||
|
1077 | 1495 | segments = struct.pack('=QQQQQQ', 0, len(frames[0]), |
|
1078 | 1496 | len(frames[0]), len(frames[1]), |
|
1079 | 1497 | len(frames[0]) + len(frames[1]), len(frames[2])) |
|
1080 | 1498 | b = zstd.BufferWithSegments(b''.join(frames), segments) |
|
1081 | 1499 | |
|
1082 | dctx = zstd.ZstdDecompressor() | |
|
1083 | 1500 | result = dctx.multi_decompress_to_buffer(b, decompressed_sizes=sizes) |
|
1084 | 1501 | |
|
1085 | 1502 | self.assertEqual(len(result), len(frames)) |
@@ -1099,10 +1516,14 b' class TestDecompressor_multi_decompress_' | |||
|
1099 | 1516 | b'foo4' * 6, |
|
1100 | 1517 | ] |
|
1101 | 1518 | |
|
1519 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
1520 | self.skipTest('multi_compress_to_buffer not available') | |
|
1521 | ||
|
1102 | 1522 | frames = cctx.multi_compress_to_buffer(original) |
|
1103 | 1523 | |
|
1104 | 1524 | # Check round trip. |
|
1105 | 1525 | dctx = zstd.ZstdDecompressor() |
|
1526 | ||
|
1106 | 1527 | decompressed = dctx.multi_decompress_to_buffer(frames, threads=3) |
|
1107 | 1528 | |
|
1108 | 1529 | self.assertEqual(len(decompressed), len(original)) |
@@ -1138,7 +1559,12 b' class TestDecompressor_multi_decompress_' | |||
|
1138 | 1559 | frames = [cctx.compress(s) for s in generate_samples()] |
|
1139 | 1560 | |
|
1140 | 1561 | dctx = zstd.ZstdDecompressor(dict_data=d) |
|
1562 | ||
|
1563 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1564 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1565 | ||
|
1141 | 1566 | result = dctx.multi_decompress_to_buffer(frames) |
|
1567 | ||
|
1142 | 1568 | self.assertEqual([o.tobytes() for o in result], generate_samples()) |
|
1143 | 1569 | |
|
1144 | 1570 | def test_multiple_threads(self): |
@@ -1149,6 +1575,10 b' class TestDecompressor_multi_decompress_' | |||
|
1149 | 1575 | frames.extend(cctx.compress(b'y' * 64) for i in range(256)) |
|
1150 | 1576 | |
|
1151 | 1577 | dctx = zstd.ZstdDecompressor() |
|
1578 | ||
|
1579 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1580 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1581 | ||
|
1152 | 1582 | result = dctx.multi_decompress_to_buffer(frames, threads=-1) |
|
1153 | 1583 | |
|
1154 | 1584 | self.assertEqual(len(result), len(frames)) |
@@ -1164,6 +1594,9 b' class TestDecompressor_multi_decompress_' | |||
|
1164 | 1594 | |
|
1165 | 1595 | dctx = zstd.ZstdDecompressor() |
|
1166 | 1596 | |
|
1597 | if not hasattr(dctx, 'multi_decompress_to_buffer'): | |
|
1598 | self.skipTest('multi_decompress_to_buffer not available') | |
|
1599 | ||
|
1167 | 1600 | with self.assertRaisesRegexp(zstd.ZstdError, |
|
1168 | 1601 | 'error decompressing item 1: (' |
|
1169 | 1602 | 'Corrupted block|' |
@@ -12,6 +12,7 b' import zstandard as zstd' | |||
|
12 | 12 | |
|
13 | 13 | from . common import ( |
|
14 | 14 | make_cffi, |
|
15 | NonClosingBytesIO, | |
|
15 | 16 | random_input_data, |
|
16 | 17 | ) |
|
17 | 18 | |
@@ -23,22 +24,200 b' class TestDecompressor_stream_reader_fuz' | |||
|
23 | 24 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) |
|
24 | 25 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), |
|
25 | 26 | level=strategies.integers(min_value=1, max_value=5), |
|
26 | source_read_size=strategies.integers(1, 16384), |
|
27 | streaming=strategies.booleans(), | |
|
28 | source_read_size=strategies.integers(1, 1048576), | |
|
27 | 29 | read_sizes=strategies.data()) |
|
28 | def test_stream_source_read_variance(self, original, level, source_read_size, |
|
29 | read_sizes): | |
|
30 | def test_stream_source_read_variance(self, original, level, streaming, | |
|
31 | source_read_size, read_sizes): | |
|
30 | 32 | cctx = zstd.ZstdCompressor(level=level) |
|
31 | frame = cctx.compress(original) | |
|
33 | ||
|
34 | if streaming: | |
|
35 | source = io.BytesIO() | |
|
36 | writer = cctx.stream_writer(source) | |
|
37 | writer.write(original) | |
|
38 | writer.flush(zstd.FLUSH_FRAME) | |
|
39 | source.seek(0) | |
|
40 | else: | |
|
41 | frame = cctx.compress(original) | |
|
42 | source = io.BytesIO(frame) | |
|
32 | 43 | |
|
33 | 44 | dctx = zstd.ZstdDecompressor() |
|
34 | source = io.BytesIO(frame) | |
|
35 | 45 | |
|
36 | 46 | chunks = [] |
|
37 | 47 | with dctx.stream_reader(source, read_size=source_read_size) as reader: |
|
38 | 48 | while True: |
|
39 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
|
49 | read_size = read_sizes.draw(strategies.integers(-1, 131072)) | |
|
50 | chunk = reader.read(read_size) | |
|
51 | if not chunk and read_size: | |
|
52 | break | |
|
53 | ||
|
54 | chunks.append(chunk) | |
|
55 | ||
|
56 | self.assertEqual(b''.join(chunks), original) | |
|
57 | ||
|
58 | # Similar to above except we have a constant read() size. | |
|
59 | @hypothesis.settings( | |
|
60 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
61 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
62 | level=strategies.integers(min_value=1, max_value=5), | |
|
63 | streaming=strategies.booleans(), | |
|
64 | source_read_size=strategies.integers(1, 1048576), | |
|
65 | read_size=strategies.integers(-1, 131072)) | |
|
66 | def test_stream_source_read_size(self, original, level, streaming, | |
|
67 | source_read_size, read_size): | |
|
68 | if read_size == 0: | |
|
69 | read_size = 1 | |
|
70 | ||
|
71 | cctx = zstd.ZstdCompressor(level=level) | |
|
72 | ||
|
73 | if streaming: | |
|
74 | source = io.BytesIO() | |
|
75 | writer = cctx.stream_writer(source) | |
|
76 | writer.write(original) | |
|
77 | writer.flush(zstd.FLUSH_FRAME) | |
|
78 | source.seek(0) | |
|
79 | else: | |
|
80 | frame = cctx.compress(original) | |
|
81 | source = io.BytesIO(frame) | |
|
82 | ||
|
83 | dctx = zstd.ZstdDecompressor() | |
|
84 | ||
|
85 | chunks = [] | |
|
86 | reader = dctx.stream_reader(source, read_size=source_read_size) | |
|
87 | while True: | |
|
88 | chunk = reader.read(read_size) | |
|
89 | if not chunk and read_size: | |
|
90 | break | |
|
91 | ||
|
92 | chunks.append(chunk) | |
|
93 | ||
|
94 | self.assertEqual(b''.join(chunks), original) | |
|
95 | ||
|
96 | @hypothesis.settings( | |
|
97 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
98 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
99 | level=strategies.integers(min_value=1, max_value=5), | |
|
100 | streaming=strategies.booleans(), | |
|
101 | source_read_size=strategies.integers(1, 1048576), | |
|
102 | read_sizes=strategies.data()) | |
|
103 | def test_buffer_source_read_variance(self, original, level, streaming, | |
|
104 | source_read_size, read_sizes): | |
|
105 | cctx = zstd.ZstdCompressor(level=level) | |
|
106 | ||
|
107 | if streaming: | |
|
108 | source = io.BytesIO() | |
|
109 | writer = cctx.stream_writer(source) | |
|
110 | writer.write(original) | |
|
111 | writer.flush(zstd.FLUSH_FRAME) | |
|
112 | frame = source.getvalue() | |
|
113 | else: | |
|
114 | frame = cctx.compress(original) | |
|
115 | ||
|
116 | dctx = zstd.ZstdDecompressor() | |
|
117 | chunks = [] | |
|
118 | ||
|
119 | with dctx.stream_reader(frame, read_size=source_read_size) as reader: | |
|
120 | while True: | |
|
121 | read_size = read_sizes.draw(strategies.integers(-1, 131072)) | |
|
40 | 122 | chunk = reader.read(read_size) |
|
41 | if not chunk: | |
|
123 | if not chunk and read_size: | |
|
124 | break | |
|
125 | ||
|
126 | chunks.append(chunk) | |
|
127 | ||
|
128 | self.assertEqual(b''.join(chunks), original) | |
|
129 | ||
|
130 | # Similar to above except we have a constant read() size. | |
|
131 | @hypothesis.settings( | |
|
132 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
133 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
134 | level=strategies.integers(min_value=1, max_value=5), | |
|
135 | streaming=strategies.booleans(), | |
|
136 | source_read_size=strategies.integers(1, 1048576), | |
|
137 | read_size=strategies.integers(-1, 131072)) | |
|
138 | def test_buffer_source_constant_read_size(self, original, level, streaming, | |
|
139 | source_read_size, read_size): | |
|
140 | if read_size == 0: | |
|
141 | read_size = -1 | |
|
142 | ||
|
143 | cctx = zstd.ZstdCompressor(level=level) | |
|
144 | ||
|
145 | if streaming: | |
|
146 | source = io.BytesIO() | |
|
147 | writer = cctx.stream_writer(source) | |
|
148 | writer.write(original) | |
|
149 | writer.flush(zstd.FLUSH_FRAME) | |
|
150 | frame = source.getvalue() | |
|
151 | else: | |
|
152 | frame = cctx.compress(original) | |
|
153 | ||
|
154 | dctx = zstd.ZstdDecompressor() | |
|
155 | chunks = [] | |
|
156 | ||
|
157 | reader = dctx.stream_reader(frame, read_size=source_read_size) | |
|
158 | while True: | |
|
159 | chunk = reader.read(read_size) | |
|
160 | if not chunk and read_size: | |
|
161 | break | |
|
162 | ||
|
163 | chunks.append(chunk) | |
|
164 | ||
|
165 | self.assertEqual(b''.join(chunks), original) | |
|
166 | ||
|
167 | @hypothesis.settings( | |
|
168 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
169 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
170 | level=strategies.integers(min_value=1, max_value=5), | |
|
171 | streaming=strategies.booleans(), | |
|
172 | source_read_size=strategies.integers(1, 1048576)) | |
|
173 | def test_stream_source_readall(self, original, level, streaming, | |
|
174 | source_read_size): | |
|
175 | cctx = zstd.ZstdCompressor(level=level) | |
|
176 | ||
|
177 | if streaming: | |
|
178 | source = io.BytesIO() | |
|
179 | writer = cctx.stream_writer(source) | |
|
180 | writer.write(original) | |
|
181 | writer.flush(zstd.FLUSH_FRAME) | |
|
182 | source.seek(0) | |
|
183 | else: | |
|
184 | frame = cctx.compress(original) | |
|
185 | source = io.BytesIO(frame) | |
|
186 | ||
|
187 | dctx = zstd.ZstdDecompressor() | |
|
188 | ||
|
189 | data = dctx.stream_reader(source, read_size=source_read_size).readall() | |
|
190 | self.assertEqual(data, original) | |
|
191 | ||
|
192 | @hypothesis.settings( | |
|
193 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
194 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
195 | level=strategies.integers(min_value=1, max_value=5), | |
|
196 | streaming=strategies.booleans(), | |
|
197 | source_read_size=strategies.integers(1, 1048576), | |
|
198 | read_sizes=strategies.data()) | |
|
199 | def test_stream_source_read1_variance(self, original, level, streaming, | |
|
200 | source_read_size, read_sizes): | |
|
201 | cctx = zstd.ZstdCompressor(level=level) | |
|
202 | ||
|
203 | if streaming: | |
|
204 | source = io.BytesIO() | |
|
205 | writer = cctx.stream_writer(source) | |
|
206 | writer.write(original) | |
|
207 | writer.flush(zstd.FLUSH_FRAME) | |
|
208 | source.seek(0) | |
|
209 | else: | |
|
210 | frame = cctx.compress(original) | |
|
211 | source = io.BytesIO(frame) | |
|
212 | ||
|
213 | dctx = zstd.ZstdDecompressor() | |
|
214 | ||
|
215 | chunks = [] | |
|
216 | with dctx.stream_reader(source, read_size=source_read_size) as reader: | |
|
217 | while True: | |
|
218 | read_size = read_sizes.draw(strategies.integers(-1, 131072)) | |
|
219 | chunk = reader.read1(read_size) | |
|
220 | if not chunk and read_size: | |
|
42 | 221 | break |
|
43 | 222 | |
|
44 | 223 | chunks.append(chunk) |
@@ -49,24 +228,36 b' class TestDecompressor_stream_reader_fuz' | |||
|
49 | 228 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) |
|
50 | 229 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), |
|
51 | 230 | level=strategies.integers(min_value=1, max_value=5), |
|
52 | source_read_size=strategies.integers(1, 16384), |
|
231 | streaming=strategies.booleans(), | |
|
232 | source_read_size=strategies.integers(1, 1048576), | |
|
53 | 233 | read_sizes=strategies.data()) |
|
54 | def test_buffer_source_read_variance(self, original, level, source_read_size, |
|
55 | read_sizes): | |
|
234 | def test_stream_source_readinto1_variance(self, original, level, streaming, | |
|
235 | source_read_size, read_sizes): | |
|
56 | 236 | cctx = zstd.ZstdCompressor(level=level) |
|
57 | frame = cctx.compress(original) | |
|
237 | ||
|
238 | if streaming: | |
|
239 | source = io.BytesIO() | |
|
240 | writer = cctx.stream_writer(source) | |
|
241 | writer.write(original) | |
|
242 | writer.flush(zstd.FLUSH_FRAME) | |
|
243 | source.seek(0) | |
|
244 | else: | |
|
245 | frame = cctx.compress(original) | |
|
246 | source = io.BytesIO(frame) | |
|
58 | 247 | |
|
59 | 248 | dctx = zstd.ZstdDecompressor() |
|
249 | ||
|
60 | 250 | chunks = [] |
|
61 | ||
|
62 | with dctx.stream_reader(frame, read_size=source_read_size) as reader: | |
|
251 | with dctx.stream_reader(source, read_size=source_read_size) as reader: | |
|
63 | 252 | while True: |
|
64 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
|
65 | chunk = reader.read(read_size) |
|
66 | if not chunk: | |
|
253 | read_size = read_sizes.draw(strategies.integers(1, 131072)) | |
|
254 | b = bytearray(read_size) | |
|
255 | count = reader.readinto1(b) | |
|
256 | ||
|
257 | if not count: | |
|
67 | 258 | break |
|
68 | 259 | |
|
69 | chunks.append(chunk) |
|
260 | chunks.append(bytes(b[0:count])) | |
|
70 | 261 | |
|
71 | 262 | self.assertEqual(b''.join(chunks), original) |
|
72 | 263 | |
@@ -75,7 +266,7 b' class TestDecompressor_stream_reader_fuz' | |||
|
75 | 266 | @hypothesis.given( |
|
76 | 267 | original=strategies.sampled_from(random_input_data()), |
|
77 | 268 | level=strategies.integers(min_value=1, max_value=5), |
|
78 | source_read_size=strategies.integers(1, 16384), |
|
269 | source_read_size=strategies.integers(1, 1048576), | |
|
79 | 270 | seek_amounts=strategies.data(), |
|
80 | 271 | read_sizes=strategies.data()) |
|
81 | 272 | def test_relative_seeks(self, original, level, source_read_size, seek_amounts, |
@@ -99,6 +290,46 b' class TestDecompressor_stream_reader_fuz' | |||
|
99 | 290 | |
|
100 | 291 | self.assertEqual(original[offset:offset + len(chunk)], chunk) |
|
101 | 292 | |
|
293 | @hypothesis.settings( | |
|
294 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
295 | @hypothesis.given( | |
|
296 | originals=strategies.data(), | |
|
297 | frame_count=strategies.integers(min_value=2, max_value=10), | |
|
298 | level=strategies.integers(min_value=1, max_value=5), | |
|
299 | source_read_size=strategies.integers(1, 1048576), | |
|
300 | read_sizes=strategies.data()) | |
|
301 | def test_multiple_frames(self, originals, frame_count, level, | |
|
302 | source_read_size, read_sizes): | |
|
303 | ||
|
304 | cctx = zstd.ZstdCompressor(level=level) | |
|
305 | source = io.BytesIO() | |
|
306 | buffer = io.BytesIO() | |
|
307 | writer = cctx.stream_writer(buffer) | |
|
308 | ||
|
309 | for i in range(frame_count): | |
|
310 | data = originals.draw(strategies.sampled_from(random_input_data())) | |
|
311 | source.write(data) | |
|
312 | writer.write(data) | |
|
313 | writer.flush(zstd.FLUSH_FRAME) | |
|
314 | ||
|
315 | dctx = zstd.ZstdDecompressor() | |
|
316 | buffer.seek(0) | |
|
317 | reader = dctx.stream_reader(buffer, read_size=source_read_size, | |
|
318 | read_across_frames=True) | |
|
319 | ||
|
320 | chunks = [] | |
|
321 | ||
|
322 | while True: | |
|
323 | read_amount = read_sizes.draw(strategies.integers(-1, 16384)) | |
|
324 | chunk = reader.read(read_amount) | |
|
325 | ||
|
326 | if not chunk and read_amount: | |
|
327 | break | |
|
328 | ||
|
329 | chunks.append(chunk) | |
|
330 | ||
|
331 | self.assertEqual(source.getvalue(), b''.join(chunks)) | |
|
332 | ||
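
``test_multiple_frames`` covers the new ``read_across_frames=True`` argument to ``stream_reader()``: instead of stopping at the first frame boundary, the reader keeps decompressing subsequent frames until the source is exhausted. A minimal sketch using ``FLUSH_FRAME`` to end each frame::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    buffer = io.BytesIO()
    writer = cctx.stream_writer(buffer)
    for chunk in (b'foo', b'bar'):
        writer.write(chunk)
        writer.flush(zstd.FLUSH_FRAME)     # one frame per chunk

    buffer.seek(0)
    reader = zstd.ZstdDecompressor().stream_reader(buffer,
                                                   read_across_frames=True)
    chunks = []
    while True:
        chunk = reader.read(8192)
        if not chunk:
            break
        chunks.append(chunk)
    assert b''.join(chunks) == b'foobar'
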
|
102 | 333 | |
|
103 | 334 | @unittest.skipUnless('ZSTD_SLOW_TESTS' in os.environ, 'ZSTD_SLOW_TESTS not set') |
|
104 | 335 | @make_cffi |
@@ -113,7 +344,7 b' class TestDecompressor_stream_writer_fuz' | |||
|
113 | 344 | |
|
114 | 345 | dctx = zstd.ZstdDecompressor() |
|
115 | 346 | source = io.BytesIO(frame) |
|
116 | dest = io.BytesIO() |
|
347 | dest = NonClosingBytesIO() | |
|
117 | 348 | |
|
118 | 349 | with dctx.stream_writer(dest, write_size=write_size) as decompressor: |
|
119 | 350 | while True: |
@@ -234,10 +465,12 b' class TestDecompressor_multi_decompress_' | |||
|
234 | 465 | write_checksum=True, |
|
235 | 466 | **kwargs) |
|
236 | 467 | |
|
468 | if not hasattr(cctx, 'multi_compress_to_buffer'): | |
|
469 | self.skipTest('multi_compress_to_buffer not available') | |
|
470 | ||
|
237 | 471 | frames_buffer = cctx.multi_compress_to_buffer(original, threads=-1) |
|
238 | 472 | |
|
239 | 473 | dctx = zstd.ZstdDecompressor(**kwargs) |
|
240 | ||
|
241 | 474 | result = dctx.multi_decompress_to_buffer(frames_buffer) |
|
242 | 475 | |
|
243 | 476 | self.assertEqual(len(result), len(original)) |
@@ -12,9 +12,9 b' from . common import (' | |||
|
12 | 12 | @make_cffi |
|
13 | 13 | class TestModuleAttributes(unittest.TestCase): |
|
14 | 14 | def test_version(self): |
|
15 | self.assertEqual(zstd.ZSTD_VERSION, (1, 3, 6)) |
|
15 | self.assertEqual(zstd.ZSTD_VERSION, (1, 3, 8)) | |
|
16 | 16 | |
|
17 | self.assertEqual(zstd.__version__, '0.10.1') |
|
17 | self.assertEqual(zstd.__version__, '0.11.0') | |
|
18 | 18 | |
|
19 | 19 | def test_constants(self): |
|
20 | 20 | self.assertEqual(zstd.MAX_COMPRESSION_LEVEL, 22) |
@@ -29,6 +29,8 b' class TestModuleAttributes(unittest.Test' | |||
|
29 | 29 | 'DECOMPRESSION_RECOMMENDED_INPUT_SIZE', |
|
30 | 30 | 'DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE', |
|
31 | 31 | 'MAGIC_NUMBER', |
|
32 | 'FLUSH_BLOCK', | |
|
33 | 'FLUSH_FRAME', | |
|
32 | 34 | 'BLOCKSIZELOG_MAX', |
|
33 | 35 | 'BLOCKSIZE_MAX', |
|
34 | 36 | 'WINDOWLOG_MIN', |
@@ -38,6 +40,8 b' class TestModuleAttributes(unittest.Test' | |||
|
38 | 40 | 'HASHLOG_MIN', |
|
39 | 41 | 'HASHLOG_MAX', |
|
40 | 42 | 'HASHLOG3_MAX', |
|
43 | 'MINMATCH_MIN', | |
|
44 | 'MINMATCH_MAX', | |
|
41 | 45 | 'SEARCHLOG_MIN', |
|
42 | 46 | 'SEARCHLOG_MAX', |
|
43 | 47 | 'SEARCHLENGTH_MIN', |
@@ -55,6 +59,7 b' class TestModuleAttributes(unittest.Test' | |||
|
55 | 59 | 'STRATEGY_BTLAZY2', |
|
56 | 60 | 'STRATEGY_BTOPT', |
|
57 | 61 | 'STRATEGY_BTULTRA', |
|
62 | 'STRATEGY_BTULTRA2', | |
|
58 | 63 | 'DICT_TYPE_AUTO', |
|
59 | 64 | 'DICT_TYPE_RAWCONTENT', |
|
60 | 65 | 'DICT_TYPE_FULLDICT', |
@@ -35,31 +35,31 b" if _module_policy == 'default':" | |||
|
35 | 35 | from zstd import * |
|
36 | 36 | backend = 'cext' |
|
37 | 37 | elif platform.python_implementation() in ('PyPy',): |
|
38 | from zstd_cffi import * |
|
38 | from .cffi import * | |
|
39 | 39 | backend = 'cffi' |
|
40 | 40 | else: |
|
41 | 41 | try: |
|
42 | 42 | from zstd import * |
|
43 | 43 | backend = 'cext' |
|
44 | 44 | except ImportError: |
|
45 | from zstd_cffi import * |
|
45 | from .cffi import * | |
|
46 | 46 | backend = 'cffi' |
|
47 | 47 | elif _module_policy == 'cffi_fallback': |
|
48 | 48 | try: |
|
49 | 49 | from zstd import * |
|
50 | 50 | backend = 'cext' |
|
51 | 51 | except ImportError: |
|
52 | from zstd_cffi import * |
|
52 | from .cffi import * | |
|
53 | 53 | backend = 'cffi' |
|
54 | 54 | elif _module_policy == 'cext': |
|
55 | 55 | from zstd import * |
|
56 | 56 | backend = 'cext' |
|
57 | 57 | elif _module_policy == 'cffi': |
|
58 | from zstd_cffi import * |
|
58 | from .cffi import * | |
|
59 | 59 | backend = 'cffi' |
|
60 | 60 | else: |
|
61 | 61 | raise ImportError('unknown module import policy: %s; use default, cffi_fallback, ' |
|
62 | 62 | 'cext, or cffi' % _module_policy) |
|
63 | 63 | |
|
64 | 64 | # Keep this in sync with python-zstandard.h. |
|
65 | __version__ = '0.10.1' |
|
65 | __version__ = '0.11.0' |
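
For context, the branches above are selected by the ``PYTHON_ZSTANDARD_IMPORT_POLICY`` environment variable (read into ``_module_policy`` earlier in the module); the 0.11 change is only that the cffi fallback is now the bundled ``.cffi`` submodule instead of a top-level ``zstd_cffi`` module. Usage sketch::

    import os
    # Must be set before the first import of zstandard.
    os.environ['PYTHON_ZSTANDARD_IMPORT_POLICY'] = 'cffi'

    import zstandard
    print(zstandard.backend)   # 'cffi' here; 'cext' under the default policy
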
@@ -28,6 +28,8 b' from __future__ import absolute_import, ' | |||
|
28 | 28 | 'train_dictionary', |
|
29 | 29 | |
|
30 | 30 | # Constants. |
|
31 | 'FLUSH_BLOCK', | |
|
32 | 'FLUSH_FRAME', | |
|
31 | 33 | 'COMPRESSOBJ_FLUSH_FINISH', |
|
32 | 34 | 'COMPRESSOBJ_FLUSH_BLOCK', |
|
33 | 35 | 'ZSTD_VERSION', |
@@ -49,6 +51,8 b' from __future__ import absolute_import, ' | |||
|
49 | 51 | 'HASHLOG_MIN', |
|
50 | 52 | 'HASHLOG_MAX', |
|
51 | 53 | 'HASHLOG3_MAX', |
|
54 | 'MINMATCH_MIN', | |
|
55 | 'MINMATCH_MAX', | |
|
52 | 56 | 'SEARCHLOG_MIN', |
|
53 | 57 | 'SEARCHLOG_MAX', |
|
54 | 58 | 'SEARCHLENGTH_MIN', |
@@ -66,6 +70,7 b' from __future__ import absolute_import, ' | |||
|
66 | 70 | 'STRATEGY_BTLAZY2', |
|
67 | 71 | 'STRATEGY_BTOPT', |
|
68 | 72 | 'STRATEGY_BTULTRA', |
|
73 | 'STRATEGY_BTULTRA2', | |
|
69 | 74 | 'DICT_TYPE_AUTO', |
|
70 | 75 | 'DICT_TYPE_RAWCONTENT', |
|
71 | 76 | 'DICT_TYPE_FULLDICT', |
@@ -114,10 +119,12 b' CHAINLOG_MAX = lib.ZSTD_CHAINLOG_MAX' | |||
|
114 | 119 | HASHLOG_MIN = lib.ZSTD_HASHLOG_MIN |
|
115 | 120 | HASHLOG_MAX = lib.ZSTD_HASHLOG_MAX |
|
116 | 121 | HASHLOG3_MAX = lib.ZSTD_HASHLOG3_MAX |
|
122 | MINMATCH_MIN = lib.ZSTD_MINMATCH_MIN | |
|
123 | MINMATCH_MAX = lib.ZSTD_MINMATCH_MAX | |
|
117 | 124 | SEARCHLOG_MIN = lib.ZSTD_SEARCHLOG_MIN |
|
118 | 125 | SEARCHLOG_MAX = lib.ZSTD_SEARCHLOG_MAX |
|
119 | SEARCHLENGTH_MIN = lib.ZSTD_SEARCHLENGTH_MIN |
|
120 | SEARCHLENGTH_MAX = lib.ZSTD_SEARCHLENGTH_MAX |
|
126 | SEARCHLENGTH_MIN = lib.ZSTD_MINMATCH_MIN | |
|
127 | SEARCHLENGTH_MAX = lib.ZSTD_MINMATCH_MAX | |
|
121 | 128 | TARGETLENGTH_MIN = lib.ZSTD_TARGETLENGTH_MIN |
|
122 | 129 | TARGETLENGTH_MAX = lib.ZSTD_TARGETLENGTH_MAX |
|
123 | 130 | LDM_MINMATCH_MIN = lib.ZSTD_LDM_MINMATCH_MIN |
@@ -132,6 +139,7 b' STRATEGY_LAZY2 = lib.ZSTD_lazy2' | |||
|
132 | 139 | STRATEGY_BTLAZY2 = lib.ZSTD_btlazy2 |
|
133 | 140 | STRATEGY_BTOPT = lib.ZSTD_btopt |
|
134 | 141 | STRATEGY_BTULTRA = lib.ZSTD_btultra |
|
142 | STRATEGY_BTULTRA2 = lib.ZSTD_btultra2 | |
|
135 | 143 | |
|
136 | 144 | DICT_TYPE_AUTO = lib.ZSTD_dct_auto |
|
137 | 145 | DICT_TYPE_RAWCONTENT = lib.ZSTD_dct_rawContent |
@@ -140,6 +148,9 b' DICT_TYPE_FULLDICT = lib.ZSTD_dct_fullDi' | |||
|
140 | 148 | FORMAT_ZSTD1 = lib.ZSTD_f_zstd1 |
|
141 | 149 | FORMAT_ZSTD1_MAGICLESS = lib.ZSTD_f_zstd1_magicless |
|
142 | 150 | |
|
151 | FLUSH_BLOCK = 0 | |
|
152 | FLUSH_FRAME = 1 | |
|
153 | ||
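
``FLUSH_BLOCK`` and ``FLUSH_FRAME`` are the arguments understood by the reworked ``ZstdCompressionWriter.flush()`` later in this file: the former emits any buffered data as a complete zstd block while keeping the frame open, the latter ends the current frame. A sketch::

    import io
    import zstandard as zstd

    dest = io.BytesIO()
    writer = zstd.ZstdCompressor().stream_writer(dest)

    writer.write(b'chunk one')
    writer.flush(zstd.FLUSH_BLOCK)   # data becomes decodable; frame stays open
    writer.write(b'chunk two')
    writer.flush(zstd.FLUSH_FRAME)   # frame is finalized

    dctx = zstd.ZstdDecompressor()
    out = dctx.decompress(dest.getvalue(), max_output_size=64)
    assert out == b'chunk onechunk two'
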
|
143 | 154 | COMPRESSOBJ_FLUSH_FINISH = 0 |
|
144 | 155 | COMPRESSOBJ_FLUSH_BLOCK = 1 |
|
145 | 156 | |
@@ -182,27 +193,27 b' def _make_cctx_params(params):' | |||
|
182 | 193 | res = ffi.gc(res, lib.ZSTD_freeCCtxParams) |
|
183 | 194 | |
|
184 | 195 | attrs = [ |
|
185 | (lib.ZSTD_p_format, params.format), |
|
186 | (lib.ZSTD_p_compressionLevel, params.compression_level), |
|
187 | (lib.ZSTD_p_windowLog, params.window_log), |
|
188 | (lib.ZSTD_p_hashLog, params.hash_log), |
|
189 | (lib.ZSTD_p_chainLog, params.chain_log), |
|
190 | (lib.ZSTD_p_searchLog, params.search_log), |
|
191 | (lib.ZSTD_p_minMatch, params.min_match), |
|
192 | (lib.ZSTD_p_targetLength, params.target_length), |
|
193 | (lib.ZSTD_p_compressionStrategy, params.compression_strategy), |
|
194 | (lib.ZSTD_p_contentSizeFlag, params.write_content_size), |
|
195 | (lib.ZSTD_p_checksumFlag, params.write_checksum), |
|
196 | (lib.ZSTD_p_dictIDFlag, params.write_dict_id), |
|
197 | (lib.ZSTD_p_nbWorkers, params.threads), |
|
198 | (lib.ZSTD_p_jobSize, params.job_size), |
|
199 | (lib.ZSTD_p_overlapSizeLog, params.overlap_size_log), |
|
200 | (lib.ZSTD_p_forceMaxWindow, params.force_max_window), |
|
201 | (lib.ZSTD_p_enableLongDistanceMatching, params.enable_ldm), |
|
202 | (lib.ZSTD_p_ldmHashLog, params.ldm_hash_log), |
|
203 | (lib.ZSTD_p_ldmMinMatch, params.ldm_min_match), |
|
204 | (lib.ZSTD_p_ldmBucketSizeLog, params.ldm_bucket_size_log), |
|
205 | (lib.ZSTD_p_ldmHashEveryLog, params.ldm_hash_every_log), |
|
196 | (lib.ZSTD_c_format, params.format), | |
|
197 | (lib.ZSTD_c_compressionLevel, params.compression_level), | |
|
198 | (lib.ZSTD_c_windowLog, params.window_log), | |
|
199 | (lib.ZSTD_c_hashLog, params.hash_log), | |
|
200 | (lib.ZSTD_c_chainLog, params.chain_log), | |
|
201 | (lib.ZSTD_c_searchLog, params.search_log), | |
|
202 | (lib.ZSTD_c_minMatch, params.min_match), | |
|
203 | (lib.ZSTD_c_targetLength, params.target_length), | |
|
204 | (lib.ZSTD_c_strategy, params.compression_strategy), | |
|
205 | (lib.ZSTD_c_contentSizeFlag, params.write_content_size), | |
|
206 | (lib.ZSTD_c_checksumFlag, params.write_checksum), | |
|
207 | (lib.ZSTD_c_dictIDFlag, params.write_dict_id), | |
|
208 | (lib.ZSTD_c_nbWorkers, params.threads), | |
|
209 | (lib.ZSTD_c_jobSize, params.job_size), | |
|
210 | (lib.ZSTD_c_overlapLog, params.overlap_log), | |
|
211 | (lib.ZSTD_c_forceMaxWindow, params.force_max_window), | |
|
212 | (lib.ZSTD_c_enableLongDistanceMatching, params.enable_ldm), | |
|
213 | (lib.ZSTD_c_ldmHashLog, params.ldm_hash_log), | |
|
214 | (lib.ZSTD_c_ldmMinMatch, params.ldm_min_match), | |
|
215 | (lib.ZSTD_c_ldmBucketSizeLog, params.ldm_bucket_size_log), | |
|
216 | (lib.ZSTD_c_ldmHashRateLog, params.ldm_hash_rate_log), | |
|
206 | 217 | ] |
|
207 | 218 | |
|
208 | 219 | for param, value in attrs: |
@@ -220,7 +231,7 b' class ZstdCompressionParameters(object):' | |||
|
220 | 231 | 'chain_log': 'chainLog', |
|
221 | 232 | 'hash_log': 'hashLog', |
|
222 | 233 | 'search_log': 'searchLog', |
|
223 | 'min_match': 'searchLength', |
|
234 | 'min_match': 'minMatch', | |
|
224 | 235 | 'target_length': 'targetLength', |
|
225 | 236 | 'compression_strategy': 'strategy', |
|
226 | 237 | } |
@@ -233,41 +244,170 b' class ZstdCompressionParameters(object):' | |||
|
233 | 244 | |
|
234 | 245 | def __init__(self, format=0, compression_level=0, window_log=0, hash_log=0, |
|
235 | 246 | chain_log=0, search_log=0, min_match=0, target_length=0, |
|
236 | compression_strategy=0, write_content_size=1, write_checksum=0, | |
|
237 | write_dict_id=1, job_size=0, overlap_size_log=0, |
|
238 | force_max_window=0, enable_ldm=0, ldm_hash_log=0, |
|
239 | ldm_min_match=0, ldm_bucket_size_log=0, ldm_hash_every_log=0, | |
|
240 | threads=0): | |
|
247 | strategy=-1, compression_strategy=-1, | |
|
248 | write_content_size=1, write_checksum=0, | |
|
249 | write_dict_id=0, job_size=0, overlap_log=-1, | |
|
250 | overlap_size_log=-1, force_max_window=0, enable_ldm=0, | |
|
251 | ldm_hash_log=0, ldm_min_match=0, ldm_bucket_size_log=0, | |
|
252 | ldm_hash_rate_log=-1, ldm_hash_every_log=-1, threads=0): | |
|
253 | ||
|
254 | params = lib.ZSTD_createCCtxParams() | |
|
255 | if params == ffi.NULL: | |
|
256 | raise MemoryError() | |
|
257 | ||
|
258 | params = ffi.gc(params, lib.ZSTD_freeCCtxParams) | |
|
259 | ||
|
260 | self._params = params | |
|
241 | 261 | |
|
242 | 262 | if threads < 0: |
|
243 | 263 | threads = _cpu_count() |
|
244 | 264 | |
|
245 | self.format = format | |
|
246 | self.compression_level = compression_level | |
|
247 | self.window_log = window_log | |
|
248 | self.hash_log = hash_log | |
|
249 | self.chain_log = chain_log | |
|
250 | self.search_log = search_log | |
|
251 | self.min_match = min_match | |
|
252 | self.target_length = target_length | |
|
253 | self.compression_strategy = compression_strategy | |
|
254 | self.write_content_size = write_content_size | |
|
255 | self.write_checksum = write_checksum | |
|
256 | self.write_dict_id = write_dict_id | |
|
257 | self.job_size = job_size | |
|
258 | self.overlap_size_log = overlap_size_log | |
|
259 | self.force_max_window = force_max_window | |
|
260 | self.enable_ldm = enable_ldm | |
|
261 | self.ldm_hash_log = ldm_hash_log | |
|
262 | self.ldm_min_match = ldm_min_match | |
|
263 | self.ldm_bucket_size_log = ldm_bucket_size_log | |
|
264 | self.ldm_hash_every_log = ldm_hash_every_log | |
|
265 | self.threads = threads | |
|
266 | ||
|
267 | self.params = _make_cctx_params(self) | |
|
265 | # We need to set ZSTD_c_nbWorkers before ZSTD_c_jobSize and ZSTD_c_overlapLog | |
|
266 | # because setting ZSTD_c_nbWorkers resets the other parameters. | |
|
267 | _set_compression_parameter(params, lib.ZSTD_c_nbWorkers, threads) | |
|
268 | ||
|
269 | _set_compression_parameter(params, lib.ZSTD_c_format, format) | |
|
270 | _set_compression_parameter(params, lib.ZSTD_c_compressionLevel, compression_level) | |
|
271 | _set_compression_parameter(params, lib.ZSTD_c_windowLog, window_log) | |
|
272 | _set_compression_parameter(params, lib.ZSTD_c_hashLog, hash_log) | |
|
273 | _set_compression_parameter(params, lib.ZSTD_c_chainLog, chain_log) | |
|
274 | _set_compression_parameter(params, lib.ZSTD_c_searchLog, search_log) | |
|
275 | _set_compression_parameter(params, lib.ZSTD_c_minMatch, min_match) | |
|
276 | _set_compression_parameter(params, lib.ZSTD_c_targetLength, target_length) | |
|
277 | ||
|
278 | if strategy != -1 and compression_strategy != -1: | |
|
279 | raise ValueError('cannot specify both compression_strategy and strategy') | |
|
280 | ||
|
281 | if compression_strategy != -1: | |
|
282 | strategy = compression_strategy | |
|
283 | elif strategy == -1: | |
|
284 | strategy = 0 | |
|
285 | ||
|
286 | _set_compression_parameter(params, lib.ZSTD_c_strategy, strategy) | |
|
287 | _set_compression_parameter(params, lib.ZSTD_c_contentSizeFlag, write_content_size) | |
|
288 | _set_compression_parameter(params, lib.ZSTD_c_checksumFlag, write_checksum) | |
|
289 | _set_compression_parameter(params, lib.ZSTD_c_dictIDFlag, write_dict_id) | |
|
290 | _set_compression_parameter(params, lib.ZSTD_c_jobSize, job_size) | |
|
291 | ||
|
292 | if overlap_log != -1 and overlap_size_log != -1: | |
|
293 | raise ValueError('cannot specify both overlap_log and overlap_size_log') | |
|
294 | ||
|
295 | if overlap_size_log != -1: | |
|
296 | overlap_log = overlap_size_log | |
|
297 | elif overlap_log == -1: | |
|
298 | overlap_log = 0 | |
|
299 | ||
|
300 | _set_compression_parameter(params, lib.ZSTD_c_overlapLog, overlap_log) | |
|
301 | _set_compression_parameter(params, lib.ZSTD_c_forceMaxWindow, force_max_window) | |
|
302 | _set_compression_parameter(params, lib.ZSTD_c_enableLongDistanceMatching, enable_ldm) | |
|
303 | _set_compression_parameter(params, lib.ZSTD_c_ldmHashLog, ldm_hash_log) | |
|
304 | _set_compression_parameter(params, lib.ZSTD_c_ldmMinMatch, ldm_min_match) | |
|
305 | _set_compression_parameter(params, lib.ZSTD_c_ldmBucketSizeLog, ldm_bucket_size_log) | |
|
306 | ||
|
307 | if ldm_hash_rate_log != -1 and ldm_hash_every_log != -1: | |
|
308 | raise ValueError('cannot specify both ldm_hash_rate_log and ldm_hash_every_log') | |
|
309 | ||
|
310 | if ldm_hash_every_log != -1: | |
|
311 | ldm_hash_rate_log = ldm_hash_every_log | |
|
312 | elif ldm_hash_rate_log == -1: | |
|
313 | ldm_hash_rate_log = 0 | |
|
314 | ||
|
315 | _set_compression_parameter(params, lib.ZSTD_c_ldmHashRateLog, ldm_hash_rate_log) | |
|
316 | ||
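
The constructor above now stores every setting directly in a native ``ZSTD_CCtxParams`` object and accepts both the old and the new spelling of each renamed knob (``compression_strategy``/``strategy``, ``overlap_size_log``/``overlap_log``, ``ldm_hash_every_log``/``ldm_hash_rate_log``), raising ``ValueError`` when both are supplied. A sketch::

    import zstandard as zstd

    # New spellings, preferred in 0.11:
    params = zstd.ZstdCompressionParameters(strategy=zstd.STRATEGY_BTULTRA2,
                                            overlap_log=3,
                                            ldm_hash_rate_log=4)

    # Old spellings still work and map to the same parameters:
    legacy = zstd.ZstdCompressionParameters(compression_strategy=zstd.STRATEGY_LAZY,
                                            overlap_size_log=3,
                                            ldm_hash_every_log=4)

    # Passing e.g. both strategy= and compression_strategy= raises ValueError.
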
|
317 | @property | |
|
318 | def format(self): | |
|
319 | return _get_compression_parameter(self._params, lib.ZSTD_c_format) | |
|
320 | ||
|
321 | @property | |
|
322 | def compression_level(self): | |
|
323 | return _get_compression_parameter(self._params, lib.ZSTD_c_compressionLevel) | |
|
324 | ||
|
325 | @property | |
|
326 | def window_log(self): | |
|
327 | return _get_compression_parameter(self._params, lib.ZSTD_c_windowLog) | |
|
328 | ||
|
329 | @property | |
|
330 | def hash_log(self): | |
|
331 | return _get_compression_parameter(self._params, lib.ZSTD_c_hashLog) | |
|
332 | ||
|
333 | @property | |
|
334 | def chain_log(self): | |
|
335 | return _get_compression_parameter(self._params, lib.ZSTD_c_chainLog) | |
|
336 | ||
|
337 | @property | |
|
338 | def search_log(self): | |
|
339 | return _get_compression_parameter(self._params, lib.ZSTD_c_searchLog) | |
|
340 | ||
|
341 | @property | |
|
342 | def min_match(self): | |
|
343 | return _get_compression_parameter(self._params, lib.ZSTD_c_minMatch) | |
|
344 | ||
|
345 | @property | |
|
346 | def target_length(self): | |
|
347 | return _get_compression_parameter(self._params, lib.ZSTD_c_targetLength) | |
|
348 | ||
|
349 | @property | |
|
350 | def compression_strategy(self): | |
|
351 | return _get_compression_parameter(self._params, lib.ZSTD_c_strategy) | |
|
352 | ||
|
353 | @property | |
|
354 | def write_content_size(self): | |
|
355 | return _get_compression_parameter(self._params, lib.ZSTD_c_contentSizeFlag) | |
|
356 | ||
|
357 | @property | |
|
358 | def write_checksum(self): | |
|
359 | return _get_compression_parameter(self._params, lib.ZSTD_c_checksumFlag) | |
|
360 | ||
|
361 | @property | |
|
362 | def write_dict_id(self): | |
|
363 | return _get_compression_parameter(self._params, lib.ZSTD_c_dictIDFlag) | |
|
364 | ||
|
365 | @property | |
|
366 | def job_size(self): | |
|
367 | return _get_compression_parameter(self._params, lib.ZSTD_c_jobSize) | |
|
368 | ||
|
369 | @property | |
|
370 | def overlap_log(self): | |
|
371 | return _get_compression_parameter(self._params, lib.ZSTD_c_overlapLog) | |
|
372 | ||
|
373 | @property | |
|
374 | def overlap_size_log(self): | |
|
375 | return self.overlap_log | |
|
376 | ||
|
377 | @property | |
|
378 | def force_max_window(self): | |
|
379 | return _get_compression_parameter(self._params, lib.ZSTD_c_forceMaxWindow) | |
|
380 | ||
|
381 | @property | |
|
382 | def enable_ldm(self): | |
|
383 | return _get_compression_parameter(self._params, lib.ZSTD_c_enableLongDistanceMatching) | |
|
384 | ||
|
385 | @property | |
|
386 | def ldm_hash_log(self): | |
|
387 | return _get_compression_parameter(self._params, lib.ZSTD_c_ldmHashLog) | |
|
388 | ||
|
389 | @property | |
|
390 | def ldm_min_match(self): | |
|
391 | return _get_compression_parameter(self._params, lib.ZSTD_c_ldmMinMatch) | |
|
392 | ||
|
393 | @property | |
|
394 | def ldm_bucket_size_log(self): | |
|
395 | return _get_compression_parameter(self._params, lib.ZSTD_c_ldmBucketSizeLog) | |
|
396 | ||
|
397 | @property | |
|
398 | def ldm_hash_rate_log(self): | |
|
399 | return _get_compression_parameter(self._params, lib.ZSTD_c_ldmHashRateLog) | |
|
400 | ||
|
401 | @property | |
|
402 | def ldm_hash_every_log(self): | |
|
403 | return self.ldm_hash_rate_log | |
|
404 | ||
|
405 | @property | |
|
406 | def threads(self): | |
|
407 | return _get_compression_parameter(self._params, lib.ZSTD_c_nbWorkers) | |
|
268 | 408 | |
|
269 | 409 | def estimated_compression_context_size(self): |
|
270 | return lib.ZSTD_estimateCCtxSize_usingCCtxParams(self.params) | |
|
410 | return lib.ZSTD_estimateCCtxSize_usingCCtxParams(self._params) | |
|
271 | 411 | |
|
272 | 412 | CompressionParameters = ZstdCompressionParameters |
|
273 | 413 | |
@@ -276,31 +416,53 b' def estimate_decompression_context_size(' | |||
|
276 | 416 | |
|
277 | 417 | |
|
278 | 418 | def _set_compression_parameter(params, param, value): |
|
279 | zresult = lib.ZSTD_CCtxParam_setParameter(params, param, | |
|
280 | ffi.cast('unsigned', value)) | |
|
419 | zresult = lib.ZSTD_CCtxParam_setParameter(params, param, value) | |
|
281 | 420 | if lib.ZSTD_isError(zresult): |
|
282 | 421 | raise ZstdError('unable to set compression context parameter: %s' % |
|
283 | 422 | _zstd_error(zresult)) |
|
284 | 423 | |
|
424 | ||
|
425 | def _get_compression_parameter(params, param): | |
|
426 | result = ffi.new('int *') | |
|
427 | ||
|
428 | zresult = lib.ZSTD_CCtxParam_getParameter(params, param, result) | |
|
429 | if lib.ZSTD_isError(zresult): | |
|
430 | raise ZstdError('unable to get compression context parameter: %s' % | |
|
431 | _zstd_error(zresult)) | |
|
432 | ||
|
433 | return result[0] | |
|
434 | ||
|
435 | ||
|
285 | 436 | class ZstdCompressionWriter(object): |
|
286 | def __init__(self, compressor, writer, source_size, write_size): |
|
437 | def __init__(self, compressor, writer, source_size, write_size, | |
|
438 | write_return_read): | |
|
287 | 439 | self._compressor = compressor |
|
288 | 440 | self._writer = writer |
|
289 | self._source_size = source_size | |
|
290 | 441 | self._write_size = write_size |
|
442 | self._write_return_read = bool(write_return_read) | |
|
291 | 443 | self._entered = False |
|
444 | self._closed = False | |
|
292 | 445 | self._bytes_compressed = 0 |
|
293 | 446 | |
|
294 | def __enter__(self): | |
|
295 | if self._entered: | |
|
296 | raise ZstdError('cannot __enter__ multiple times') | |
|
297 | ||
|
298 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._compressor._cctx, | |
|
299 | self._source_size) | |
|
447 | self._dst_buffer = ffi.new('char[]', write_size) | |
|
448 | self._out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
449 | self._out_buffer.dst = self._dst_buffer | |
|
450 | self._out_buffer.size = len(self._dst_buffer) | |
|
451 | self._out_buffer.pos = 0 | |
|
452 | ||
|
453 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(compressor._cctx, | |
|
454 | source_size) | |
|
300 | 455 | if lib.ZSTD_isError(zresult): |
|
301 | 456 | raise ZstdError('error setting source size: %s' % |
|
302 | 457 | _zstd_error(zresult)) |
|
303 | 458 | |
|
459 | def __enter__(self): | |
|
460 | if self._closed: | |
|
461 | raise ValueError('stream is closed') | |
|
462 | ||
|
463 | if self._entered: | |
|
464 | raise ZstdError('cannot __enter__ multiple times') | |
|
465 | ||
|
304 | 466 | self._entered = True |
|
305 | 467 | return self |
|
306 | 468 | |
@@ -308,50 +470,79 b' class ZstdCompressionWriter(object):' | |||
|
308 | 470 | self._entered = False |
|
309 | 471 | |
|
310 | 472 | if not exc_type and not exc_value and not exc_tb: |
|
311 | dst_buffer = ffi.new('char[]', self._write_size) | |
|
312 | ||
|
313 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
314 | in_buffer = ffi.new('ZSTD_inBuffer *') | |
|
315 | ||
|
316 | out_buffer.dst = dst_buffer | |
|
317 | out_buffer.size = len(dst_buffer) | |
|
318 | out_buffer.pos = 0 | |
|
319 | ||
|
320 | in_buffer.src = ffi.NULL | |
|
321 | in_buffer.size = 0 | |
|
322 | in_buffer.pos = 0 | |
|
323 | ||
|
324 | while True: | |
|
325 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, | |
|
326 | out_buffer, in_buffer, | |
|
327 | lib.ZSTD_e_end) | |
|
328 | ||
|
329 | if lib.ZSTD_isError(zresult): | |
|
330 | raise ZstdError('error ending compression stream: %s' % | |
|
331 | _zstd_error(zresult)) | |
|
332 | ||
|
333 | if out_buffer.pos: | |
|
334 | self._writer.write(ffi.buffer(out_buffer.dst, out_buffer.pos)[:]) | |
|
335 | out_buffer.pos = 0 | |
|
336 | ||
|
337 | if zresult == 0: | |
|
338 | break | |
|
473 | self.close() | |
|
339 | 474 | |
|
340 | 475 | self._compressor = None |
|
341 | 476 | |
|
342 | 477 | return False |
|
343 | 478 | |
|
344 | 479 | def memory_size(self): |
|
345 | if not self._entered: | |
|
346 | raise ZstdError('cannot determine size of an inactive compressor; ' | |
|
347 | 'call when a context manager is active') | |
|
348 | ||
|
349 | 480 | return lib.ZSTD_sizeof_CCtx(self._compressor._cctx) |
|
350 | 481 | |
|
482 | def fileno(self): | |
|
483 | f = getattr(self._writer, 'fileno', None) | |
|
484 | if f: | |
|
485 | return f() | |
|
486 | else: | |
|
487 | raise OSError('fileno not available on underlying writer') | |
|
488 | ||
|
489 | def close(self): | |
|
490 | if self._closed: | |
|
491 | return | |
|
492 | ||
|
493 | try: | |
|
494 | self.flush(FLUSH_FRAME) | |
|
495 | finally: | |
|
496 | self._closed = True | |
|
497 | ||
|
498 | # Call close() on underlying stream as well. | |
|
499 | f = getattr(self._writer, 'close', None) | |
|
500 | if f: | |
|
501 | f() | |
|
502 | ||
|
503 | @property | |
|
504 | def closed(self): | |
|
505 | return self._closed | |
|
506 | ||
|
507 | def isatty(self): | |
|
508 | return False | |
|
509 | ||
|
510 | def readable(self): | |
|
511 | return False | |
|
512 | ||
|
513 | def readline(self, size=-1): | |
|
514 | raise io.UnsupportedOperation() | |
|
515 | ||
|
516 | def readlines(self, hint=-1): | |
|
517 | raise io.UnsupportedOperation() | |
|
518 | ||
|
519 | def seek(self, offset, whence=None): | |
|
520 | raise io.UnsupportedOperation() | |
|
521 | ||
|
522 | def seekable(self): | |
|
523 | return False | |
|
524 | ||
|
525 | def truncate(self, size=None): | |
|
526 | raise io.UnsupportedOperation() | |
|
527 | ||
|
528 | def writable(self): | |
|
529 | return True | |
|
530 | ||
|
531 | def writelines(self, lines): | |
|
532 | raise NotImplementedError('writelines() is not yet implemented') | |
|
533 | ||
|
534 | def read(self, size=-1): | |
|
535 | raise io.UnsupportedOperation() | |
|
536 | ||
|
537 | def readall(self): | |
|
538 | raise io.UnsupportedOperation() | |
|
539 | ||
|
540 | def readinto(self, b): | |
|
541 | raise io.UnsupportedOperation() | |
|
542 | ||
|
351 | 543 | def write(self, data): |
|
352 | if not self._entered: |
|
353 | raise ZstdError('write() must be called from an active context ' | |
|
354 | 'manager') | |
|
544 | if self._closed: | |
|
545 | raise ValueError('stream is closed') | |
|
355 | 546 | |
|
356 | 547 | total_write = 0 |
|
357 | 548 | |
@@ -362,16 +553,13 b' class ZstdCompressionWriter(object):' | |||
|
362 | 553 | in_buffer.size = len(data_buffer) |
|
363 | 554 | in_buffer.pos = 0 |
|
364 | 555 | |
|
365 | out_buffer = ffi.new('ZSTD_outBuffer *') |
|
366 | dst_buffer = ffi.new('char[]', self._write_size) | |
|
367 | out_buffer.dst = dst_buffer | |
|
368 | out_buffer.size = self._write_size | |
|
556 | out_buffer = self._out_buffer | |
|
369 | 557 | out_buffer.pos = 0 |
|
370 | 558 | |
|
371 | 559 | while in_buffer.pos < in_buffer.size: |
|
372 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
373 | out_buffer, in_buffer, |
|
374 | lib.ZSTD_e_continue) |
|
560 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
561 | out_buffer, in_buffer, | |
|
562 | lib.ZSTD_e_continue) | |
|
375 | 563 | if lib.ZSTD_isError(zresult): |
|
376 | 564 | raise ZstdError('zstd compress error: %s' % |
|
377 | 565 | _zstd_error(zresult)) |
@@ -382,18 +570,25 b' class ZstdCompressionWriter(object):' | |||
|
382 | 570 | self._bytes_compressed += out_buffer.pos |
|
383 | 571 | out_buffer.pos = 0 |
|
384 | 572 | |
|
385 | return total_write | |
|
386 | ||
|
387 | def flush(self): | |
|
388 | if not self._entered: | |
|
389 | raise ZstdError('flush must be called from an active context manager') | |
|
573 | if self._write_return_read: | |
|
574 | return in_buffer.pos | |
|
575 | else: | |
|
576 | return total_write | |
|
577 | ||
|
578 | def flush(self, flush_mode=FLUSH_BLOCK): | |
|
579 | if flush_mode == FLUSH_BLOCK: | |
|
580 | flush = lib.ZSTD_e_flush | |
|
581 | elif flush_mode == FLUSH_FRAME: | |
|
582 | flush = lib.ZSTD_e_end | |
|
583 | else: | |
|
584 | raise ValueError('unknown flush_mode: %r' % flush_mode) | |
|
585 | ||
|
586 | if self._closed: | |
|
587 | raise ValueError('stream is closed') | |
|
390 | 588 | |
|
391 | 589 | total_write = 0 |
|
392 | 590 | |
|
393 | out_buffer = ffi.new('ZSTD_outBuffer *') |
|
394 | dst_buffer = ffi.new('char[]', self._write_size) | |
|
395 | out_buffer.dst = dst_buffer | |
|
396 | out_buffer.size = self._write_size | |
|
591 | out_buffer = self._out_buffer | |
|
397 | 592 | out_buffer.pos = 0 |
|
398 | 593 | |
|
399 | 594 | in_buffer = ffi.new('ZSTD_inBuffer *') |
@@ -402,9 +597,9 b' class ZstdCompressionWriter(object):' | |||
|
402 | 597 | in_buffer.pos = 0 |
|
403 | 598 | |
|
404 | 599 | while True: |
|
405 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
406 | out_buffer, in_buffer, |
|
407 | lib.ZSTD_e_flush) |
|
600 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
601 | out_buffer, in_buffer, | |
|
602 | flush) | |
|
408 | 603 | if lib.ZSTD_isError(zresult): |
|
409 | 604 | raise ZstdError('zstd compress error: %s' % |
|
410 | 605 | _zstd_error(zresult)) |
@@ -438,10 +633,10 b' class ZstdCompressionObj(object):' | |||
|
438 | 633 | chunks = [] |
|
439 | 634 | |
|
440 | 635 | while source.pos < len(data): |
|
441 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
442 | self._out, |
|
443 | source, |
|
444 | lib.ZSTD_e_continue) |
|
636 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
637 | self._out, | |
|
638 | source, | |
|
639 | lib.ZSTD_e_continue) | |
|
445 | 640 | if lib.ZSTD_isError(zresult): |
|
446 | 641 | raise ZstdError('zstd compress error: %s' % |
|
447 | 642 | _zstd_error(zresult)) |
@@ -477,10 +672,10 b' class ZstdCompressionObj(object):' | |||
|
477 | 672 | chunks = [] |
|
478 | 673 | |
|
479 | 674 | while True: |
|
480 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
481 | self._out, |
|
482 | in_buffer, |
|
483 | z_flush_mode) |
|
675 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
676 | self._out, | |
|
677 | in_buffer, | |
|
678 | z_flush_mode) | |
|
484 | 679 | if lib.ZSTD_isError(zresult): |
|
485 | 680 | raise ZstdError('error ending compression stream: %s' % |
|
486 | 681 | _zstd_error(zresult)) |
@@ -528,10 +723,10 b' class ZstdCompressionChunker(object):' | |||
|
528 | 723 | self._in.pos = 0 |
|
529 | 724 | |
|
530 | 725 | while self._in.pos < self._in.size: |
|
531 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
532 | self._out, |
|
533 | self._in, |
|
534 | lib.ZSTD_e_continue) |
|
726 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
727 | self._out, | |
|
728 | self._in, | |
|
729 | lib.ZSTD_e_continue) | |
|
535 | 730 | |
|
536 | 731 | if self._in.pos == self._in.size: |
|
537 | 732 | self._in.src = ffi.NULL |
@@ -555,9 +750,9 b' class ZstdCompressionChunker(object):' | |||
|
555 | 750 | 'previous operation') |
|
556 | 751 | |
|
557 | 752 | while True: |
|
558 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
559 | self._out, self._in, |
|
560 | lib.ZSTD_e_flush) |
|
753 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
754 | self._out, self._in, | |
|
755 | lib.ZSTD_e_flush) | |
|
561 | 756 | if lib.ZSTD_isError(zresult): |
|
562 | 757 | raise ZstdError('zstd compress error: %s' % _zstd_error(zresult)) |
|
563 | 758 | |
@@ -577,9 +772,9 b' class ZstdCompressionChunker(object):' | |||
|
577 | 772 | 'previous operation') |
|
578 | 773 | |
|
579 | 774 | while True: |
|
580 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, |
|
581 | self._out, self._in, |
|
582 | lib.ZSTD_e_end) |
|
775 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
776 | self._out, self._in, | |
|
777 | lib.ZSTD_e_end) | |
|
583 | 778 | if lib.ZSTD_isError(zresult): |
|
584 | 779 | raise ZstdError('zstd compress error: %s' % _zstd_error(zresult)) |
|
585 | 780 | |
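
For context, the chunker API touched above produces fixed-size compressed chunks from arbitrary writes. A short sketch of how it is driven, assuming the conventional ``zstandard`` import name (input values are illustrative)::

    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    chunker = cctx.chunker(chunk_size=32768)

    out = []
    for piece in (b'foo' * 10000, b'bar' * 10000):
        out.extend(chunker.compress(piece))  # yields fixed-size chunks
    out.extend(chunker.finish())             # flushes with ZSTD_e_end
    compressed = b''.join(out)
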
@@ -592,7 +787,7 b' class ZstdCompressionChunker(object):' | |||
|
592 | 787 | return |
|
593 | 788 | |
|
594 | 789 | |
|
595 | class CompressionReader(object): | |
|
790 | class ZstdCompressionReader(object): | |
|
596 | 791 | def __init__(self, compressor, source, read_size): |
|
597 | 792 | self._compressor = compressor |
|
598 | 793 | self._source = source |
@@ -661,7 +856,16 b' class CompressionReader(object):' | |||
|
661 | 856 | return self._bytes_compressed |
|
662 | 857 | |
|
663 | 858 | def readall(self): |
|
664 | raise NotImplementedError() | |
|
859 | chunks = [] | |
|
860 | ||
|
861 | while True: | |
|
862 | chunk = self.read(1048576) | |
|
863 | if not chunk: | |
|
864 | break | |
|
865 | ||
|
866 | chunks.append(chunk) | |
|
867 | ||
|
868 | return b''.join(chunks) | |
|
665 | 869 | |
|
666 | 870 | def __iter__(self): |
|
667 | 871 | raise io.UnsupportedOperation() |
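
The newly implemented ``readall()`` drains the source through repeated 1 MiB ``read()`` calls. A minimal sketch of what this enables on the compressing reader (import name and data are illustrative)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    reader = cctx.stream_reader(io.BytesIO(b'data to compress' * 1024))
    compressed = reader.readall()  # full frame, accumulated 1 MiB at a time
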
@@ -671,16 +875,67 b' class CompressionReader(object):' | |||
|
671 | 875 | |
|
672 | 876 | next = __next__ |
|
673 | 877 | |
|
878 | def _read_input(self): | |
|
879 | if self._finished_input: | |
|
880 | return | |
|
881 | ||
|
882 | if hasattr(self._source, 'read'): | |
|
883 | data = self._source.read(self._read_size) | |
|
884 | ||
|
885 | if not data: | |
|
886 | self._finished_input = True | |
|
887 | return | |
|
888 | ||
|
889 | self._source_buffer = ffi.from_buffer(data) | |
|
890 | self._in_buffer.src = self._source_buffer | |
|
891 | self._in_buffer.size = len(self._source_buffer) | |
|
892 | self._in_buffer.pos = 0 | |
|
893 | else: | |
|
894 | self._source_buffer = ffi.from_buffer(self._source) | |
|
895 | self._in_buffer.src = self._source_buffer | |
|
896 | self._in_buffer.size = len(self._source_buffer) | |
|
897 | self._in_buffer.pos = 0 | |
|
898 | ||
|
899 | def _compress_into_buffer(self, out_buffer): | |
|
900 | if self._in_buffer.pos >= self._in_buffer.size: | |
|
901 | return | |
|
902 | ||
|
903 | old_pos = out_buffer.pos | |
|
904 | ||
|
905 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
906 | out_buffer, self._in_buffer, | |
|
907 | lib.ZSTD_e_continue) | |
|
908 | ||
|
909 | self._bytes_compressed += out_buffer.pos - old_pos | |
|
910 | ||
|
911 | if self._in_buffer.pos == self._in_buffer.size: | |
|
912 | self._in_buffer.src = ffi.NULL | |
|
913 | self._in_buffer.pos = 0 | |
|
914 | self._in_buffer.size = 0 | |
|
915 | self._source_buffer = None | |
|
916 | ||
|
917 | if not hasattr(self._source, 'read'): | |
|
918 | self._finished_input = True | |
|
919 | ||
|
920 | if lib.ZSTD_isError(zresult): | |
|
921 | raise ZstdError('zstd compress error: %s' % | |
|
922 | _zstd_error(zresult)) | |
|
923 | ||
|
924 | return out_buffer.pos and out_buffer.pos == out_buffer.size | |
|
925 | ||
|
674 | 926 | def read(self, size=-1): |
|
675 | 927 | if self._closed: |
|
676 | 928 | raise ValueError('stream is closed') |
|
677 | 929 | |
|
678 | if self._finished_output: | |
|
930 | if size < -1: | |
|
931 | raise ValueError('cannot read negative amounts less than -1') | |
|
932 | ||
|
933 | if size == -1: | |
|
934 | return self.readall() | |
|
935 | ||
|
936 | if self._finished_output or size == 0: | |
|
679 | 937 | return b'' |
|
680 | 938 | |
|
681 | if size < 1: | |
|
682 | raise ValueError('cannot read negative or size 0 amounts') | |
|
683 | ||
|
684 | 939 | # Need a dedicated ref to dest buffer otherwise it gets collected. |
|
685 | 940 | dst_buffer = ffi.new('char[]', size) |
|
686 | 941 | out_buffer = ffi.new('ZSTD_outBuffer *') |
@@ -688,71 +943,21 b' class CompressionReader(object):' | |||
|
688 | 943 | out_buffer.size = size |
|
689 | 944 | out_buffer.pos = 0 |
|
690 | 945 | |
|
691 | def compress_input(): | |
|
692 | if self._in_buffer.pos >= self._in_buffer.size: | |
|
693 | return | |
|
694 | ||
|
695 | old_pos = out_buffer.pos | |
|
696 | ||
|
697 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, | |
|
698 | out_buffer, self._in_buffer, | |
|
699 | lib.ZSTD_e_continue) | |
|
700 | ||
|
701 | self._bytes_compressed += out_buffer.pos - old_pos | |
|
702 | ||
|
703 | if self._in_buffer.pos == self._in_buffer.size: | |
|
704 | self._in_buffer.src = ffi.NULL | |
|
705 | self._in_buffer.pos = 0 | |
|
706 | self._in_buffer.size = 0 | |
|
707 | self._source_buffer = None | |
|
708 | ||
|
709 | if not hasattr(self._source, 'read'): | |
|
710 | self._finished_input = True | |
|
711 | ||
|
712 | if lib.ZSTD_isError(zresult): | |
|
713 | raise ZstdError('zstd compress error: %s', | |
|
714 | _zstd_error(zresult)) | |
|
715 | ||
|
716 | if out_buffer.pos and out_buffer.pos == out_buffer.size: | |
|
717 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
718 | ||
|
719 | def get_input(): | |
|
720 | if self._finished_input: | |
|
721 | return | |
|
722 | ||
|
723 | if hasattr(self._source, 'read'): | |
|
724 | data = self._source.read(self._read_size) | |
|
725 | ||
|
726 | if not data: | |
|
727 | self._finished_input = True | |
|
728 | return | |
|
729 | ||
|
730 | self._source_buffer = ffi.from_buffer(data) | |
|
731 | self._in_buffer.src = self._source_buffer | |
|
732 | self._in_buffer.size = len(self._source_buffer) | |
|
733 | self._in_buffer.pos = 0 | |
|
734 | else: | |
|
735 | self._source_buffer = ffi.from_buffer(self._source) | |
|
736 | self._in_buffer.src = self._source_buffer | |
|
737 | self._in_buffer.size = len(self._source_buffer) | |
|
738 | self._in_buffer.pos = 0 | |
|
739 | ||
|
740 | result = compress_input() | |
|
741 | if result: | |
|
742 | return result | |
|
946 | if self._compress_into_buffer(out_buffer): | |
|
947 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
743 | 948 | |
|
744 | 949 | while not self._finished_input: |
|
745 | get_input() | |
|
746 | result = compress_input() | |
|
747 | if result: | |
|
748 | return result | |
|
950 | self._read_input() | |
|
951 | ||
|
952 | if self._compress_into_buffer(out_buffer): | |
|
953 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
749 | 954 | |
|
750 | 955 | # EOF |
|
751 | 956 | old_pos = out_buffer.pos |
|
752 | 957 | |
|
753 | zresult = lib.ZSTD_compress_generic(self._compressor._cctx, | |
|
754 | out_buffer, self._in_buffer, | |
|
755 | lib.ZSTD_e_end) | |
|
958 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
959 | out_buffer, self._in_buffer, | |
|
960 | lib.ZSTD_e_end) | |
|
756 | 961 | |
|
757 | 962 | self._bytes_compressed += out_buffer.pos - old_pos |
|
758 | 963 | |
@@ -765,6 +970,159 b' class CompressionReader(object):' | |||
|
765 | 970 | |
|
766 | 971 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] |
|
767 | 972 | |
|
973 | def read1(self, size=-1): | |
|
974 | if self._closed: | |
|
975 | raise ValueError('stream is closed') | |
|
976 | ||
|
977 | if size < -1: | |
|
978 | raise ValueError('cannot read negative amounts less than -1') | |
|
979 | ||
|
980 | if self._finished_output or size == 0: | |
|
981 | return b'' | |
|
982 | ||
|
983 | # -1 returns arbitrary number of bytes. | |
|
984 | if size == -1: | |
|
985 | size = COMPRESSION_RECOMMENDED_OUTPUT_SIZE | |
|
986 | ||
|
987 | dst_buffer = ffi.new('char[]', size) | |
|
988 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
989 | out_buffer.dst = dst_buffer | |
|
990 | out_buffer.size = size | |
|
991 | out_buffer.pos = 0 | |
|
992 | ||
|
993 | # read1() dictates that we can perform at most 1 call to the | |
|
994 | # underlying stream to get input. However, we can't satisfy this | |
|
995 | # restriction with compression because not all input generates output. | |
|
996 | # It is possible to perform a block flush in order to ensure output. | |
|
997 | # But this may not be desirable behavior. So we allow multiple read() | |
|
998 | # to the underlying stream. But unlike read(), we stop once we have | |
|
999 | # any output. | |
|
1000 | ||
|
1001 | self._compress_into_buffer(out_buffer) | |
|
1002 | if out_buffer.pos: | |
|
1003 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1004 | ||
|
1005 | while not self._finished_input: | |
|
1006 | self._read_input() | |
|
1007 | ||
|
1008 | # If we've filled the output buffer, return immediately. | |
|
1009 | if self._compress_into_buffer(out_buffer): | |
|
1010 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1011 | ||
|
1012 | # If we've populated the output buffer and we're not at EOF, | |
|
1013 | # also return, as we've satisfied the read1() limits. | |
|
1014 | if out_buffer.pos and not self._finished_input: | |
|
1015 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1016 | ||
|
1017 | # Else if we're at EOS and we have room left in the buffer, | |
|
1018 | # fall through to below and try to add more data to the output. | |
|
1019 | ||
|
1020 | # EOF. | |
|
1021 | old_pos = out_buffer.pos | |
|
1022 | ||
|
1023 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
1024 | out_buffer, self._in_buffer, | |
|
1025 | lib.ZSTD_e_end) | |
|
1026 | ||
|
1027 | self._bytes_compressed += out_buffer.pos - old_pos | |
|
1028 | ||
|
1029 | if lib.ZSTD_isError(zresult): | |
|
1030 | raise ZstdError('error ending compression stream: %s' % | |
|
1031 | _zstd_error(zresult)) | |
|
1032 | ||
|
1033 | if zresult == 0: | |
|
1034 | self._finished_output = True | |
|
1035 | ||
|
1036 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1037 | ||
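
As the comment above notes, ``read1()`` stops as soon as any compressed output exists rather than filling the requested size. A hedged sketch (conventional ``zstandard`` import assumed; data illustrative)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    reader = cctx.stream_reader(io.BytesIO(b'payload' * 4096))
    first = reader.read1()   # may return fewer bytes than a full read() would
    rest = reader.readall()  # drain whatever remains
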
|
1038 | def readinto(self, b): | |
|
1039 | if self._closed: | |
|
1040 | raise ValueError('stream is closed') | |
|
1041 | ||
|
1042 | if self._finished_output: | |
|
1043 | return 0 | |
|
1044 | ||
|
1045 | # TODO use writable=True once we require CFFI >= 1.12. | |
|
1046 | dest_buffer = ffi.from_buffer(b) | |
|
1047 | ffi.memmove(b, b'', 0) | |
|
1048 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
1049 | out_buffer.dst = dest_buffer | |
|
1050 | out_buffer.size = len(dest_buffer) | |
|
1051 | out_buffer.pos = 0 | |
|
1052 | ||
|
1053 | if self._compress_into_buffer(out_buffer): | |
|
1054 | return out_buffer.pos | |
|
1055 | ||
|
1056 | while not self._finished_input: | |
|
1057 | self._read_input() | |
|
1058 | if self._compress_into_buffer(out_buffer): | |
|
1059 | return out_buffer.pos | |
|
1060 | ||
|
1061 | # EOF. | |
|
1062 | old_pos = out_buffer.pos | |
|
1063 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
1064 | out_buffer, self._in_buffer, | |
|
1065 | lib.ZSTD_e_end) | |
|
1066 | ||
|
1067 | self._bytes_compressed += out_buffer.pos - old_pos | |
|
1068 | ||
|
1069 | if lib.ZSTD_isError(zresult): | |
|
1070 | raise ZstdError('error ending compression stream: %s' % | |
|
1071 | _zstd_error(zresult)) | |
|
1072 | ||
|
1073 | if zresult == 0: | |
|
1074 | self._finished_output = True | |
|
1075 | ||
|
1076 | return out_buffer.pos | |
|
1077 | ||
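
``readinto()`` writes compressed bytes directly into a caller-supplied buffer instead of allocating a new one. A minimal sketch (import name and sizes are illustrative)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    reader = cctx.stream_reader(io.BytesIO(b'x' * 65536))
    buf = bytearray(16384)
    n = reader.readinto(buf)  # compressed frame bytes land in buf[:n]
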
|
1078 | def readinto1(self, b): | |
|
1079 | if self._closed: | |
|
1080 | raise ValueError('stream is closed') | |
|
1081 | ||
|
1082 | if self._finished_output: | |
|
1083 | return 0 | |
|
1084 | ||
|
1085 | # TODO use writable=True once we require CFFI >= 1.12. | |
|
1086 | dest_buffer = ffi.from_buffer(b) | |
|
1087 | ffi.memmove(b, b'', 0) | |
|
1088 | ||
|
1089 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
1090 | out_buffer.dst = dest_buffer | |
|
1091 | out_buffer.size = len(dest_buffer) | |
|
1092 | out_buffer.pos = 0 | |
|
1093 | ||
|
1094 | self._compress_into_buffer(out_buffer) | |
|
1095 | if out_buffer.pos: | |
|
1096 | return out_buffer.pos | |
|
1097 | ||
|
1098 | while not self._finished_input: | |
|
1099 | self._read_input() | |
|
1100 | ||
|
1101 | if self._compress_into_buffer(out_buffer): | |
|
1102 | return out_buffer.pos | |
|
1103 | ||
|
1104 | if out_buffer.pos and not self._finished_input: | |
|
1105 | return out_buffer.pos | |
|
1106 | ||
|
1107 | # EOF. | |
|
1108 | old_pos = out_buffer.pos | |
|
1109 | ||
|
1110 | zresult = lib.ZSTD_compressStream2(self._compressor._cctx, | |
|
1111 | out_buffer, self._in_buffer, | |
|
1112 | lib.ZSTD_e_end) | |
|
1113 | ||
|
1114 | self._bytes_compressed += out_buffer.pos - old_pos | |
|
1115 | ||
|
1116 | if lib.ZSTD_isError(zresult): | |
|
1117 | raise ZstdError('error ending compression stream: %s' % | |
|
1118 | _zstd_error(zresult)) | |
|
1119 | ||
|
1120 | if zresult == 0: | |
|
1121 | self._finished_output = True | |
|
1122 | ||
|
1123 | return out_buffer.pos | |
|
1124 | ||
|
1125 | ||
|
768 | 1126 | class ZstdCompressor(object): |
|
769 | 1127 | def __init__(self, level=3, dict_data=None, compression_params=None, |
|
770 | 1128 | write_checksum=None, write_content_size=None, |
@@ -803,25 +1161,25 b' class ZstdCompressor(object):' | |||
|
803 | 1161 | self._params = ffi.gc(params, lib.ZSTD_freeCCtxParams) |
|
804 | 1162 | |
|
805 | 1163 | _set_compression_parameter(self._params, |
|
806 | lib.ZSTD_p_compressionLevel, | |
|
1164 | lib.ZSTD_c_compressionLevel, | |
|
807 | 1165 | level) |
|
808 | 1166 | |
|
809 | 1167 | _set_compression_parameter( |
|
810 | 1168 | self._params, |
|
811 | lib.ZSTD_p_contentSizeFlag, | |
|
1169 | lib.ZSTD_c_contentSizeFlag, | |
|
812 | 1170 | write_content_size if write_content_size is not None else 1) |
|
813 | 1171 | |
|
814 | 1172 | _set_compression_parameter(self._params, |
|
815 | lib.ZSTD_p_checksumFlag, | |
|
1173 | lib.ZSTD_c_checksumFlag, | |
|
816 | 1174 | 1 if write_checksum else 0) |
|
817 | 1175 | |
|
818 | 1176 | _set_compression_parameter(self._params, |
|
819 | lib.ZSTD_p_dictIDFlag, | |
|
1177 | lib.ZSTD_c_dictIDFlag, | |
|
820 | 1178 | 1 if write_dict_id else 0) |
|
821 | 1179 | |
|
822 | 1180 | if threads: |
|
823 | 1181 | _set_compression_parameter(self._params, |
|
824 | lib.ZSTD_p_nbWorkers, | |
|
1182 | lib.ZSTD_c_nbWorkers, | |
|
825 | 1183 | threads) |
|
826 | 1184 | |
|
827 | 1185 | cctx = lib.ZSTD_createCCtx() |
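
The renamed ``ZSTD_c_*`` parameters above correspond to keyword arguments on the public constructor. A sketch of the mapping, assuming the conventional ``zstandard`` import name (values are illustrative)::

    import zstandard as zstd

    cctx = zstd.ZstdCompressor(
        level=19,                 # ZSTD_c_compressionLevel
        write_checksum=True,      # ZSTD_c_checksumFlag
        write_content_size=True,  # ZSTD_c_contentSizeFlag (defaults to on)
        write_dict_id=True,       # ZSTD_c_dictIDFlag
        threads=2,                # ZSTD_c_nbWorkers
    )
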
@@ -864,7 +1222,7 b' class ZstdCompressor(object):' | |||
|
864 | 1222 | return lib.ZSTD_sizeof_CCtx(self._cctx) |
|
865 | 1223 | |
|
866 | 1224 | def compress(self, data): |
|
867 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1225 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
868 | 1226 | |
|
869 | 1227 | data_buffer = ffi.from_buffer(data) |
|
870 | 1228 | |
@@ -887,10 +1245,10 b' class ZstdCompressor(object):' | |||
|
887 | 1245 | in_buffer.size = len(data_buffer) |
|
888 | 1246 | in_buffer.pos = 0 |
|
889 | 1247 | |
|
890 | zresult = lib.ZSTD_compress_generic(self._cctx, | |
|
891 | out_buffer, | |
|
892 | in_buffer, | |
|
893 | lib.ZSTD_e_end) | |
|
1248 | zresult = lib.ZSTD_compressStream2(self._cctx, | |
|
1249 | out_buffer, | |
|
1250 | in_buffer, | |
|
1251 | lib.ZSTD_e_end) | |
|
894 | 1252 | |
|
895 | 1253 | if lib.ZSTD_isError(zresult): |
|
896 | 1254 | raise ZstdError('cannot compress: %s' % |
@@ -901,7 +1259,7 b' class ZstdCompressor(object):' | |||
|
901 | 1259 | return ffi.buffer(out, out_buffer.pos)[:] |
|
902 | 1260 | |
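
One-shot ``compress()`` now emits a whole frame via a single ``ZSTD_e_end`` call, so a simple round trip looks like the following sketch (import name assumed, data illustrative)::

    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    dctx = zstd.ZstdDecompressor()
    frame = cctx.compress(b'hello world')        # content size is recorded
    assert dctx.decompress(frame) == b'hello world'
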
|
903 | 1261 | def compressobj(self, size=-1): |
|
904 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1262 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
905 | 1263 | |
|
906 | 1264 | if size < 0: |
|
907 | 1265 | size = lib.ZSTD_CONTENTSIZE_UNKNOWN |
@@ -923,7 +1281,7 b' class ZstdCompressor(object):' | |||
|
923 | 1281 | return cobj |
|
924 | 1282 | |
|
925 | 1283 | def chunker(self, size=-1, chunk_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE): |
|
926 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1284 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
927 | 1285 | |
|
928 | 1286 | if size < 0: |
|
929 | 1287 | size = lib.ZSTD_CONTENTSIZE_UNKNOWN |
@@ -944,7 +1302,7 b' class ZstdCompressor(object):' | |||
|
944 | 1302 | if not hasattr(ofh, 'write'): |
|
945 | 1303 | raise ValueError('second argument must have a write() method') |
|
946 | 1304 | |
|
947 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1305 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
948 | 1306 | |
|
949 | 1307 | if size < 0: |
|
950 | 1308 | size = lib.ZSTD_CONTENTSIZE_UNKNOWN |
@@ -976,10 +1334,10 b' class ZstdCompressor(object):' | |||
|
976 | 1334 | in_buffer.pos = 0 |
|
977 | 1335 | |
|
978 | 1336 | while in_buffer.pos < in_buffer.size: |
|
979 | zresult = lib.ZSTD_compress_generic(self._cctx, | |
|
980 | out_buffer, | |
|
981 | in_buffer, | |
|
982 | lib.ZSTD_e_continue) | |
|
1337 | zresult = lib.ZSTD_compressStream2(self._cctx, | |
|
1338 | out_buffer, | |
|
1339 | in_buffer, | |
|
1340 | lib.ZSTD_e_continue) | |
|
983 | 1341 | if lib.ZSTD_isError(zresult): |
|
984 | 1342 | raise ZstdError('zstd compress error: %s' % |
|
985 | 1343 | _zstd_error(zresult)) |
@@ -991,10 +1349,10 b' class ZstdCompressor(object):' | |||
|
991 | 1349 | |
|
992 | 1350 | # We've finished reading. Flush the compressor. |
|
993 | 1351 | while True: |
|
994 | zresult = lib.ZSTD_compress_generic(self._cctx, | |
|
995 | out_buffer, | |
|
996 | in_buffer, | |
|
997 | lib.ZSTD_e_end) | |
|
1352 | zresult = lib.ZSTD_compressStream2(self._cctx, | |
|
1353 | out_buffer, | |
|
1354 | in_buffer, | |
|
1355 | lib.ZSTD_e_end) | |
|
998 | 1356 | if lib.ZSTD_isError(zresult): |
|
999 | 1357 | raise ZstdError('error ending compression stream: %s' % |
|
1000 | 1358 | _zstd_error(zresult)) |
@@ -1011,7 +1369,7 b' class ZstdCompressor(object):' | |||
|
1011 | 1369 | |
|
1012 | 1370 | def stream_reader(self, source, size=-1, |
|
1013 | 1371 | read_size=COMPRESSION_RECOMMENDED_INPUT_SIZE): |
|
1014 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1372 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
1015 | 1373 | |
|
1016 | 1374 | try: |
|
1017 | 1375 | size = len(source) |
@@ -1026,20 +1384,22 b' class ZstdCompressor(object):' | |||
|
1026 | 1384 | raise ZstdError('error setting source size: %s' % |
|
1027 | 1385 | _zstd_error(zresult)) |
|
1028 | 1386 | |
|
1029 | return CompressionReader(self, source, read_size) | |
|
1387 | return ZstdCompressionReader(self, source, read_size) | |
|
1030 | 1388 | |
|
1031 | 1389 | def stream_writer(self, writer, size=-1, |
|
1032 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE): | |
|
1390 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
1391 | write_return_read=False): | |
|
1033 | 1392 | |
|
1034 | 1393 | if not hasattr(writer, 'write'): |
|
1035 | 1394 | raise ValueError('must pass an object with a write() method') |
|
1036 | 1395 | |
|
1037 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1396 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
1038 | 1397 | |
|
1039 | 1398 | if size < 0: |
|
1040 | 1399 | size = lib.ZSTD_CONTENTSIZE_UNKNOWN |
|
1041 | 1400 | |
|
1042 | return ZstdCompressionWriter(self, writer, size, write_size) | |
|
1401 | return ZstdCompressionWriter(self, writer, size, write_size, | |
|
1402 | write_return_read) | |
|
1043 | 1403 | |
|
1044 | 1404 | write_to = stream_writer |
|
1045 | 1405 | |
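
The new ``write_return_read`` flag changes what ``write()`` reports: the default (``False``) keeps returning the number of compressed bytes written out, for backwards compatibility, while ``True`` returns the input bytes consumed, matching ``io.RawIOBase`` semantics. A sketch (import name assumed, data illustrative)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    dest = io.BytesIO()
    writer = cctx.stream_writer(dest, write_return_read=True)
    n = writer.write(b'data to compress')  # == len(b'data to compress')
    writer.flush(zstd.FLUSH_FRAME)
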
@@ -1056,7 +1416,7 b' class ZstdCompressor(object):' | |||
|
1056 | 1416 | raise ValueError('must pass an object with a read() method or ' |
|
1057 | 1417 | 'conforms to buffer protocol') |
|
1058 | 1418 | |
|
1059 | lib.ZSTD_CCtx_reset(self._cctx) | |
|
1419 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) | |
|
1060 | 1420 | |
|
1061 | 1421 | if size < 0: |
|
1062 | 1422 | size = lib.ZSTD_CONTENTSIZE_UNKNOWN |
@@ -1104,8 +1464,8 b' class ZstdCompressor(object):' | |||
|
1104 | 1464 | in_buffer.pos = 0 |
|
1105 | 1465 | |
|
1106 | 1466 | while in_buffer.pos < in_buffer.size: |
|
1107 | zresult = lib.ZSTD_compress_generic(self._cctx, out_buffer, in_buffer, | |
|
1108 | lib.ZSTD_e_continue) | |
|
1467 | zresult = lib.ZSTD_compressStream2(self._cctx, out_buffer, in_buffer, | |
|
1468 | lib.ZSTD_e_continue) | |
|
1109 | 1469 | if lib.ZSTD_isError(zresult): |
|
1110 | 1470 | raise ZstdError('zstd compress error: %s' % |
|
1111 | 1471 | _zstd_error(zresult)) |
@@ -1124,10 +1484,10 b' class ZstdCompressor(object):' | |||
|
1124 | 1484 | # remains. |
|
1125 | 1485 | while True: |
|
1126 | 1486 | assert out_buffer.pos == 0 |
|
1127 | zresult = lib.ZSTD_compress_generic(self._cctx, | |
|
1128 | out_buffer, | |
|
1129 | in_buffer, | |
|
1130 | lib.ZSTD_e_end) | |
|
1487 | zresult = lib.ZSTD_compressStream2(self._cctx, | |
|
1488 | out_buffer, | |
|
1489 | in_buffer, | |
|
1490 | lib.ZSTD_e_end) | |
|
1131 | 1491 | if lib.ZSTD_isError(zresult): |
|
1132 | 1492 | raise ZstdError('error ending compression stream: %s' % |
|
1133 | 1493 | _zstd_error(zresult)) |
@@ -1234,7 +1594,7 b' class ZstdCompressionDict(object):' | |||
|
1234 | 1594 | cparams = ffi.new('ZSTD_compressionParameters') |
|
1235 | 1595 | cparams.chainLog = compression_params.chain_log |
|
1236 | 1596 | cparams.hashLog = compression_params.hash_log |
|
1237 | cparams.searchLength = compression_params.min_match | |
|
1597 | cparams.minMatch = compression_params.min_match | |
|
1238 | 1598 | cparams.searchLog = compression_params.search_log |
|
1239 | 1599 | cparams.strategy = compression_params.compression_strategy |
|
1240 | 1600 | cparams.targetLength = compression_params.target_length |
@@ -1345,6 +1705,10 b' class ZstdDecompressionObj(object):' | |||
|
1345 | 1705 | out_buffer = ffi.new('ZSTD_outBuffer *') |
|
1346 | 1706 | |
|
1347 | 1707 | data_buffer = ffi.from_buffer(data) |
|
1708 | ||
|
1709 | if len(data_buffer) == 0: | |
|
1710 | return b'' | |
|
1711 | ||
|
1348 | 1712 | in_buffer.src = data_buffer |
|
1349 | 1713 | in_buffer.size = len(data_buffer) |
|
1350 | 1714 | in_buffer.pos = 0 |
@@ -1357,8 +1721,8 b' class ZstdDecompressionObj(object):' | |||
|
1357 | 1721 | chunks = [] |
|
1358 | 1722 | |
|
1359 | 1723 | while True: |
|
1360 | zresult = lib.ZSTD_decompress_generic(self._decompressor._dctx, | |
|
1361 | out_buffer, in_buffer) | |
|
1724 | zresult = lib.ZSTD_decompressStream(self._decompressor._dctx, | |
|
1725 | out_buffer, in_buffer) | |
|
1362 | 1726 | if lib.ZSTD_isError(zresult): |
|
1363 | 1727 | raise ZstdError('zstd decompressor error: %s' % |
|
1364 | 1728 | _zstd_error(zresult)) |
@@ -1378,12 +1742,16 b' class ZstdDecompressionObj(object):' | |||
|
1378 | 1742 | |
|
1379 | 1743 | return b''.join(chunks) |
|
1380 | 1744 | |
|
1381 | ||
|
1382 | class DecompressionReader(object): | |
|
1383 | def __init__(self, decompressor, source, read_size): | |
|
1745 | def flush(self, length=0): | |
|
1746 | pass | |
|
1747 | ||
|
1748 | ||
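
The no-op ``flush()`` added above exists so callers written against the zlib/bz2 decompression object API do not break. A minimal sketch (import name assumed, data illustrative)::

    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'some data')

    dctx = zstd.ZstdDecompressor()
    dobj = dctx.decompressobj()
    data = dobj.decompress(frame)
    dobj.flush()  # does nothing; kept for zlib-style API compatibility
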
|
1749 | class ZstdDecompressionReader(object): | |
|
1750 | def __init__(self, decompressor, source, read_size, read_across_frames): | |
|
1384 | 1751 | self._decompressor = decompressor |
|
1385 | 1752 | self._source = source |
|
1386 | 1753 | self._read_size = read_size |
|
1754 | self._read_across_frames = bool(read_across_frames) | |
|
1387 | 1755 | self._entered = False |
|
1388 | 1756 | self._closed = False |
|
1389 | 1757 | self._bytes_decompressed = 0 |
@@ -1418,10 +1786,10 b' class DecompressionReader(object):' | |||
|
1418 | 1786 | return True |
|
1419 | 1787 | |
|
1420 | 1788 | def readline(self): |
|
1421 | raise NotImplementedError() | |
|
1789 | raise io.UnsupportedOperation() | |
|
1422 | 1790 | |
|
1423 | 1791 | def readlines(self): |
|
1424 | raise NotImplementedError() | |
|
1792 | raise io.UnsupportedOperation() | |
|
1425 | 1793 | |
|
1426 | 1794 | def write(self, data): |
|
1427 | 1795 | raise io.UnsupportedOperation() |
@@ -1447,25 +1815,158 b' class DecompressionReader(object):' | |||
|
1447 | 1815 | return self._bytes_decompressed |
|
1448 | 1816 | |
|
1449 | 1817 | def readall(self): |
|
1450 | raise NotImplementedError() | |
|
1818 | chunks = [] | |
|
1819 | ||
|
1820 | while True: | |
|
1821 | chunk = self.read(1048576) | |
|
1822 | if not chunk: | |
|
1823 | break | |
|
1824 | ||
|
1825 | chunks.append(chunk) | |
|
1826 | ||
|
1827 | return b''.join(chunks) | |
|
1451 | 1828 | |
|
1452 | 1829 | def __iter__(self): |
|
1453 | raise NotImplementedError() | |
|
1830 | raise io.UnsupportedOperation() | |
|
1454 | 1831 | |
|
1455 | 1832 | def __next__(self): |
|
1456 | raise NotImplementedError() | |
|
1833 | raise io.UnsupportedOperation() | |
|
1457 | 1834 | |
|
1458 | 1835 | next = __next__ |
|
1459 | 1836 | |
|
1460 | def read(self, size=-1): | |
|
1837 | def _read_input(self): | |
|
1838 | # We have data left over in the input buffer. Use it. | |
|
1839 | if self._in_buffer.pos < self._in_buffer.size: | |
|
1840 | return | |
|
1841 | ||
|
1842 | # All input data exhausted. Nothing to do. | |
|
1843 | if self._finished_input: | |
|
1844 | return | |
|
1845 | ||
|
1846 | # Else populate the input buffer from our source. | |
|
1847 | if hasattr(self._source, 'read'): | |
|
1848 | data = self._source.read(self._read_size) | |
|
1849 | ||
|
1850 | if not data: | |
|
1851 | self._finished_input = True | |
|
1852 | return | |
|
1853 | ||
|
1854 | self._source_buffer = ffi.from_buffer(data) | |
|
1855 | self._in_buffer.src = self._source_buffer | |
|
1856 | self._in_buffer.size = len(self._source_buffer) | |
|
1857 | self._in_buffer.pos = 0 | |
|
1858 | else: | |
|
1859 | self._source_buffer = ffi.from_buffer(self._source) | |
|
1860 | self._in_buffer.src = self._source_buffer | |
|
1861 | self._in_buffer.size = len(self._source_buffer) | |
|
1862 | self._in_buffer.pos = 0 | |
|
1863 | ||
|
1864 | def _decompress_into_buffer(self, out_buffer): | |
|
1865 | """Decompress available input into an output buffer. | |
|
1866 | ||
|
1867 | Returns True if data in output buffer should be emitted. | |
|
1868 | """ | |
|
1869 | zresult = lib.ZSTD_decompressStream(self._decompressor._dctx, | |
|
1870 | out_buffer, self._in_buffer) | |
|
1871 | ||
|
1872 | if self._in_buffer.pos == self._in_buffer.size: | |
|
1873 | self._in_buffer.src = ffi.NULL | |
|
1874 | self._in_buffer.pos = 0 | |
|
1875 | self._in_buffer.size = 0 | |
|
1876 | self._source_buffer = None | |
|
1877 | ||
|
1878 | if not hasattr(self._source, 'read'): | |
|
1879 | self._finished_input = True | |
|
1880 | ||
|
1881 | if lib.ZSTD_isError(zresult): | |
|
1882 | raise ZstdError('zstd decompress error: %s' % | |
|
1883 | _zstd_error(zresult)) | |
|
1884 | ||
|
1885 | # Emit data if there is data AND either: | |
|
1886 | # a) output buffer is full (read amount is satisfied) | |
|
1887 | # b) we're at end of a frame and not in frame spanning mode | |
|
1888 | return (out_buffer.pos and | |
|
1889 | (out_buffer.pos == out_buffer.size or | |
|
1890 | zresult == 0 and not self._read_across_frames)) | |
|
1891 | ||
|
1892 | def read(self, size=-1): | |
|
1893 | if self._closed: | |
|
1894 | raise ValueError('stream is closed') | |
|
1895 | ||
|
1896 | if size < -1: | |
|
1897 | raise ValueError('cannot read negative amounts less than -1') | |
|
1898 | ||
|
1899 | if size == -1: | |
|
1900 | # This is recursive. But it gets the job done. | |
|
1901 | return self.readall() | |
|
1902 | ||
|
1903 | if self._finished_output or size == 0: | |
|
1904 | return b'' | |
|
1905 | ||
|
1906 | # We /could/ call into readinto() here. But that introduces more | |
|
1907 | # overhead. | |
|
1908 | dst_buffer = ffi.new('char[]', size) | |
|
1909 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
1910 | out_buffer.dst = dst_buffer | |
|
1911 | out_buffer.size = size | |
|
1912 | out_buffer.pos = 0 | |
|
1913 | ||
|
1914 | self._read_input() | |
|
1915 | if self._decompress_into_buffer(out_buffer): | |
|
1916 | self._bytes_decompressed += out_buffer.pos | |
|
1917 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1918 | ||
|
1919 | while not self._finished_input: | |
|
1920 | self._read_input() | |
|
1921 | if self._decompress_into_buffer(out_buffer): | |
|
1922 | self._bytes_decompressed += out_buffer.pos | |
|
1923 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1924 | ||
|
1925 | self._bytes_decompressed += out_buffer.pos | |
|
1926 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1927 | ||
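
Per the emit logic in ``_decompress_into_buffer()``, a reader stops at a frame boundary by default; the new ``read_across_frames`` flag lets one ``read()`` span concatenated frames. A sketch (import name assumed, data illustrative)::

    import io
    import zstandard as zstd

    cctx = zstd.ZstdCompressor()
    frames = cctx.compress(b'foo') + cctx.compress(b'bar')

    dctx = zstd.ZstdDecompressor()
    reader = dctx.stream_reader(io.BytesIO(frames), read_across_frames=True)
    assert reader.read(16) == b'foobar'  # the default (False) stops at b'foo'
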
|
1928 | def readinto(self, b): | |
|
1461 | 1929 | if self._closed: |
|
1462 | 1930 | raise ValueError('stream is closed') |
|
1463 | 1931 | |
|
1464 | 1932 | if self._finished_output: |
|
1933 | return 0 | |
|
1934 | ||
|
1935 | # TODO use writable=True once we require CFFI >= 1.12. | |
|
1936 | dest_buffer = ffi.from_buffer(b) | |
|
1937 | ffi.memmove(b, b'', 0) | |
|
1938 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
1939 | out_buffer.dst = dest_buffer | |
|
1940 | out_buffer.size = len(dest_buffer) | |
|
1941 | out_buffer.pos = 0 | |
|
1942 | ||
|
1943 | self._read_input() | |
|
1944 | if self._decompress_into_buffer(out_buffer): | |
|
1945 | self._bytes_decompressed += out_buffer.pos | |
|
1946 | return out_buffer.pos | |
|
1947 | ||
|
1948 | while not self._finished_input: | |
|
1949 | self._read_input() | |
|
1950 | if self._decompress_into_buffer(out_buffer): | |
|
1951 | self._bytes_decompressed += out_buffer.pos | |
|
1952 | return out_buffer.pos | |
|
1953 | ||
|
1954 | self._bytes_decompressed += out_buffer.pos | |
|
1955 | return out_buffer.pos | |
|
1956 | ||
|
1957 | def read1(self, size=-1): | |
|
1958 | if self._closed: | |
|
1959 | raise ValueError('stream is closed') | |
|
1960 | ||
|
1961 | if size < -1: | |
|
1962 | raise ValueError('cannot read negative amounts less than -1') | |
|
1963 | ||
|
1964 | if self._finished_output or size == 0: | |
|
1465 | 1965 | return b'' |
|
1466 | 1966 | |
|
1467 | if size < 1: | |
|
1468 | raise ValueError('cannot read negative or size 0 amounts') | |
|
1967 | # -1 returns arbitrary number of bytes. | |
|
1968 | if size == -1: | |
|
1969 | size = DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE | |
|
1469 | 1970 | |
|
1470 | 1971 | dst_buffer = ffi.new('char[]', size) |
|
1471 | 1972 | out_buffer = ffi.new('ZSTD_outBuffer *') |
@@ -1473,64 +1974,46 b' class DecompressionReader(object):' | |||
|
1473 | 1974 | out_buffer.size = size |
|
1474 | 1975 | out_buffer.pos = 0 |
|
1475 | 1976 | |
|
1476 | def decompress(): | |
|
1477 | zresult = lib.ZSTD_decompress_generic(self._decompressor._dctx, | |
|
1478 | out_buffer, self._in_buffer) | |
|
1479 | ||
|
1480 | if self._in_buffer.pos == self._in_buffer.size: | |
|
1481 | self._in_buffer.src = ffi.NULL | |
|
1482 | self._in_buffer.pos = 0 | |
|
1483 | self._in_buffer.size = 0 | |
|
1484 | self._source_buffer = None | |
|
1485 | ||
|
1486 | if not hasattr(self._source, 'read'): | |
|
1487 | self._finished_input = True | |
|
1488 | ||
|
1489 | if lib.ZSTD_isError(zresult): | |
|
1490 | raise ZstdError('zstd decompress error: %s', | |
|
1491 | _zstd_error(zresult)) | |
|
1492 | elif zresult == 0: | |
|
1493 | self._finished_output = True | |
|
1494 | ||
|
1495 | if out_buffer.pos and out_buffer.pos == out_buffer.size: | |
|
1496 | self._bytes_decompressed += out_buffer.size | |
|
1497 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] | |
|
1498 | ||
|
1499 | def get_input(): | |
|
1500 | if self._finished_input: | |
|
1501 | return | |
|
1502 | ||
|
1503 | if hasattr(self._source, 'read'): | |
|
1504 | data = self._source.read(self._read_size) | |
|
1505 | ||
|
1506 | if not data: | |
|
1507 | self._finished_input = True | |
|
1508 | return | |
|
1509 | ||
|
1510 | self._source_buffer = ffi.from_buffer(data) | |
|
1511 | self._in_buffer.src = self._source_buffer | |
|
1512 | self._in_buffer.size = len(self._source_buffer) | |
|
1513 | self._in_buffer.pos = 0 | |
|
1514 | else: | |
|
1515 | self._source_buffer = ffi.from_buffer(self._source) | |
|
1516 | self._in_buffer.src = self._source_buffer | |
|
1517 | self._in_buffer.size = len(self._source_buffer) | |
|
1518 | self._in_buffer.pos = 0 | |
|
1519 | ||
|
1520 | get_input() | |
|
1521 | result = decompress() | |
|
1522 | if result: | |
|
1523 | return result | |
|
1524 | ||
|
1977 | # read1() dictates that we can perform at most 1 call to underlying | |
|
1978 | # stream to get input. However, we can't satisfy this restriction with | |
|
1979 | # decompression because not all input generates output. So we allow | |
|
1980 | # multiple read(). But unlike read(), we stop once we have any output. | |
|
1525 | 1981 | while not self._finished_input: |
|
1526 | get_input() | |
|
1527 | result = decompress() | |
|
1528 | if result: | |
|
1529 | return result | |
|
1982 | self._read_input() | |
|
1983 | self._decompress_into_buffer(out_buffer) | |
|
1984 | ||
|
1985 | if out_buffer.pos: | |
|
1986 | break | |
|
1530 | 1987 | |
|
1531 | 1988 | self._bytes_decompressed += out_buffer.pos |
|
1532 | 1989 | return ffi.buffer(out_buffer.dst, out_buffer.pos)[:] |
|
1533 | 1990 | |
|
1991 | def readinto1(self, b): | |
|
1992 | if self._closed: | |
|
1993 | raise ValueError('stream is closed') | |
|
1994 | ||
|
1995 | if self._finished_output: | |
|
1996 | return 0 | |
|
1997 | ||
|
1998 | # TODO use writable=True once we require CFFI >= 1.12. | |
|
1999 | dest_buffer = ffi.from_buffer(b) | |
|
2000 | ffi.memmove(b, b'', 0) | |
|
2001 | ||
|
2002 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
2003 | out_buffer.dst = dest_buffer | |
|
2004 | out_buffer.size = len(dest_buffer) | |
|
2005 | out_buffer.pos = 0 | |
|
2006 | ||
|
2007 | while not self._finished_input and not self._finished_output: | |
|
2008 | self._read_input() | |
|
2009 | self._decompress_into_buffer(out_buffer) | |
|
2010 | ||
|
2011 | if out_buffer.pos: | |
|
2012 | break | |
|
2013 | ||
|
2014 | self._bytes_decompressed += out_buffer.pos | |
|
2015 | return out_buffer.pos | |
|
2016 | ||
|
1534 | 2017 | def seek(self, pos, whence=os.SEEK_SET): |
|
1535 | 2018 | if self._closed: |
|
1536 | 2019 | raise ValueError('stream is closed') |
@@ -1569,34 +2052,108 b' class DecompressionReader(object):' | |||
|
1569 | 2052 | return self._bytes_decompressed |
|
1570 | 2053 | |
|
1571 | 2054 | class ZstdDecompressionWriter(object): |
|
1572 | def __init__(self, decompressor, writer, write_size): | |
|
2055 | def __init__(self, decompressor, writer, write_size, write_return_read): | |
|
2056 | decompressor._ensure_dctx() | |
|
2057 | ||
|
1573 | 2058 | self._decompressor = decompressor |
|
1574 | 2059 | self._writer = writer |
|
1575 | 2060 | self._write_size = write_size |
|
2061 | self._write_return_read = bool(write_return_read) | |
|
1576 | 2062 | self._entered = False |
|
2063 | self._closed = False | |
|
1577 | 2064 | |
|
1578 | 2065 | def __enter__(self): |
|
2066 | if self._closed: | |
|
2067 | raise ValueError('stream is closed') | |
|
2068 | ||
|
1579 | 2069 | if self._entered: |
|
1580 | 2070 | raise ZstdError('cannot __enter__ multiple times') |
|
1581 | 2071 | |
|
1582 | self._decompressor._ensure_dctx() | |
|
1583 | 2072 | self._entered = True |
|
1584 | 2073 | |
|
1585 | 2074 | return self |
|
1586 | 2075 | |
|
1587 | 2076 | def __exit__(self, exc_type, exc_value, exc_tb): |
|
1588 | 2077 | self._entered = False |
|
2078 | self.close() | |
|
1589 | 2079 | |
|
1590 | 2080 | def memory_size(self): |
|
1591 | if not self._decompressor._dctx: | |
|
1592 | raise ZstdError('cannot determine size of inactive decompressor ' | |
|
1593 | 'call when context manager is active') | |
|
1594 | ||
|
1595 | 2081 | return lib.ZSTD_sizeof_DCtx(self._decompressor._dctx) |
|
1596 | 2082 | |
|
2083 | def close(self): | |
|
2084 | if self._closed: | |
|
2085 | return | |
|
2086 | ||
|
2087 | try: | |
|
2088 | self.flush() | |
|
2089 | finally: | |
|
2090 | self._closed = True | |
|
2091 | ||
|
2092 | f = getattr(self._writer, 'close', None) | |
|
2093 | if f: | |
|
2094 | f() | |
|
2095 | ||
|
2096 | @property | |
|
2097 | def closed(self): | |
|
2098 | return self._closed | |
|
2099 | ||
|
2100 | def fileno(self): | |
|
2101 | f = getattr(self._writer, 'fileno', None) | |
|
2102 | if f: | |
|
2103 | return f() | |
|
2104 | else: | |
|
2105 | raise OSError('fileno not available on underlying writer') | |
|
2106 | ||
|
2107 | def flush(self): | |
|
2108 | if self._closed: | |
|
2109 | raise ValueError('stream is closed') | |
|
2110 | ||
|
2111 | f = getattr(self._writer, 'flush', None) | |
|
2112 | if f: | |
|
2113 | return f() | |
|
2114 | ||
|
2115 | def isatty(self): | |
|
2116 | return False | |
|
2117 | ||
|
2118 | def readable(self): | |
|
2119 | return False | |
|
2120 | ||
|
2121 | def readline(self, size=-1): | |
|
2122 | raise io.UnsupportedOperation() | |
|
2123 | ||
|
2124 | def readlines(self, hint=-1): | |
|
2125 | raise io.UnsupportedOperation() | |
|
2126 | ||
|
2127 | def seek(self, offset, whence=None): | |
|
2128 | raise io.UnsupportedOperation() | |
|
2129 | ||
|
2130 | def seekable(self): | |
|
2131 | return False | |
|
2132 | ||
|
2133 | def tell(self): | |
|
2134 | raise io.UnsupportedOperation() | |
|
2135 | ||
|
2136 | def truncate(self, size=None): | |
|
2137 | raise io.UnsupportedOperation() | |
|
2138 | ||
|
2139 | def writable(self): | |
|
2140 | return True | |
|
2141 | ||
|
2142 | def writelines(self, lines): | |
|
2143 | raise io.UnsupportedOperation() | |
|
2144 | ||
|
2145 | def read(self, size=-1): | |
|
2146 | raise io.UnsupportedOperation() | |
|
2147 | ||
|
2148 | def readall(self): | |
|
2149 | raise io.UnsupportedOperation() | |
|
2150 | ||
|
2151 | def readinto(self, b): | |
|
2152 | raise io.UnsupportedOperation() | |
|
2153 | ||
|
1597 | 2154 | def write(self, data): |
|
1598 |
if |
|
|
1599 | raise ZstdError('write must be called from an active context manager') | |
|
2155 | if self._closed: | |
|
2156 | raise ValueError('stream is closed') | |
|
1600 | 2157 | |
|
1601 | 2158 | total_write = 0 |
|
1602 | 2159 | |
@@ -1616,7 +2173,7 b' class ZstdDecompressionWriter(object):' | |||
|
1616 | 2173 | dctx = self._decompressor._dctx |
|
1617 | 2174 | |
|
1618 | 2175 | while in_buffer.pos < in_buffer.size: |
|
1619 | zresult = lib.ZSTD_decompress_generic(dctx, out_buffer, in_buffer) | |
|
2176 | zresult = lib.ZSTD_decompressStream(dctx, out_buffer, in_buffer) | |
|
1620 | 2177 | if lib.ZSTD_isError(zresult): |
|
1621 | 2178 | raise ZstdError('zstd decompress error: %s' % |
|
1622 | 2179 | _zstd_error(zresult)) |
@@ -1626,7 +2183,10 b' class ZstdDecompressionWriter(object):' | |||
|
1626 | 2183 | total_write += out_buffer.pos |
|
1627 | 2184 | out_buffer.pos = 0 |
|
1628 | 2185 | |
|
1629 | return total_write | |
|
2186 | if self._write_return_read: | |
|
2187 | return in_buffer.pos | |
|
2188 | else: | |
|
2189 | return total_write | |
|
1630 | 2190 | |
|
1631 | 2191 | |
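
With this change the decompression writer is a regular closeable stream rather than a context-manager-only object, and ``close()`` also closes the wrapped writer, so results must be read out first. A sketch (import name assumed, data illustrative)::

    import io
    import zstandard as zstd

    frame = zstd.ZstdCompressor().compress(b'some data')
    dctx = zstd.ZstdDecompressor()

    dest = io.BytesIO()
    writer = dctx.stream_writer(dest, write_return_read=True)
    consumed = writer.write(frame)  # compressed input bytes consumed
    result = dest.getvalue()        # grab before close(), which closes dest too
    writer.close()
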
|
1632 | 2192 | class ZstdDecompressor(object): |
@@ -1684,7 +2244,7 b' class ZstdDecompressor(object):' | |||
|
1684 | 2244 | in_buffer.size = len(data_buffer) |
|
1685 | 2245 | in_buffer.pos = 0 |
|
1686 | 2246 | |
|
1687 | zresult = lib.ZSTD_decompress_generic(self._dctx, out_buffer, in_buffer) | |
|
2247 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) | |
|
1688 | 2248 | if lib.ZSTD_isError(zresult): |
|
1689 | 2249 | raise ZstdError('decompression error: %s' % |
|
1690 | 2250 | _zstd_error(zresult)) |
@@ -1696,9 +2256,10 b' class ZstdDecompressor(object):' | |||
|
1696 | 2256 | |
|
1697 | 2257 | return ffi.buffer(result_buffer, out_buffer.pos)[:] |
|
1698 | 2258 | |
|
1699 | def stream_reader(self, source, read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE): | |
|
2259 | def stream_reader(self, source, read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2260 | read_across_frames=False): | |
|
1700 | 2261 | self._ensure_dctx() |
|
1701 | return DecompressionReader(self, source, read_size) | |
|
2262 | return ZstdDecompressionReader(self, source, read_size, read_across_frames) | |
|
1702 | 2263 | |
|
1703 | 2264 | def decompressobj(self, write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE): |
|
1704 | 2265 | if write_size < 1: |
@@ -1767,7 +2328,7 b' class ZstdDecompressor(object):' | |||
|
1767 | 2328 | while in_buffer.pos < in_buffer.size: |
|
1768 | 2329 | assert out_buffer.pos == 0 |
|
1769 | 2330 | |
|
1770 | zresult = lib.ZSTD_decompress_generic(self._dctx, out_buffer, in_buffer) | |
|
2331 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) | |
|
1771 | 2332 | if lib.ZSTD_isError(zresult): |
|
1772 | 2333 | raise ZstdError('zstd decompress error: %s' % |
|
1773 | 2334 | _zstd_error(zresult)) |
@@ -1787,11 +2348,13 b' class ZstdDecompressor(object):' | |||
|
1787 | 2348 | |
|
1788 | 2349 | read_from = read_to_iter |
|
1789 | 2350 | |
|
1790 | def stream_writer(self, writer, write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE): | |
|
2351 | def stream_writer(self, writer, write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
2352 | write_return_read=False): | |
|
1791 | 2353 | if not hasattr(writer, 'write'): |
|
1792 | 2354 | raise ValueError('must pass an object with a write() method') |
|
1793 | 2355 | |
|
1794 | return ZstdDecompressionWriter(self, writer, write_size) | |
|
2356 | return ZstdDecompressionWriter(self, writer, write_size, | |
|
2357 | write_return_read) | |
|
1795 | 2358 | |
|
1796 | 2359 | write_to = stream_writer |
|
1797 | 2360 | |
@@ -1829,7 +2392,7 b' class ZstdDecompressor(object):' | |||
|
1829 | 2392 | |
|
1830 | 2393 | # Flush all read data to output. |
|
1831 | 2394 | while in_buffer.pos < in_buffer.size: |
|
1832 | zresult = lib.ZSTD_decompress_generic(self._dctx, out_buffer, in_buffer) | |
|
2395 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) | |
|
1833 | 2396 | if lib.ZSTD_isError(zresult): |
|
1834 | 2397 | raise ZstdError('zstd decompressor error: %s' % |
|
1835 | 2398 | _zstd_error(zresult)) |
@@ -1881,7 +2444,7 b' class ZstdDecompressor(object):' | |||
|
1881 | 2444 | in_buffer.size = len(chunk_buffer) |
|
1882 | 2445 | in_buffer.pos = 0 |
|
1883 | 2446 | |
|
1884 | zresult = lib.ZSTD_decompress_generic(self._dctx, out_buffer, in_buffer) | |
|
2447 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) | |
|
1885 | 2448 | if lib.ZSTD_isError(zresult): |
|
1886 | 2449 | raise ZstdError('could not decompress chunk 0: %s' % |
|
1887 | 2450 | _zstd_error(zresult)) |
@@ -1918,7 +2481,7 b' class ZstdDecompressor(object):' | |||
|
1918 | 2481 | in_buffer.size = len(chunk_buffer) |
|
1919 | 2482 | in_buffer.pos = 0 |
|
1920 | 2483 | |
|
1921 | zresult = lib.ZSTD_decompress_generic(self._dctx, out_buffer, in_buffer) | |
|
2484 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) | |
|
1922 | 2485 | if lib.ZSTD_isError(zresult): |
|
1923 | 2486 | raise ZstdError('could not decompress chunk %d: %s' % |
|
1924 | 2487 | _zstd_error(zresult)) |
@@ -1931,7 +2494,7 b' class ZstdDecompressor(object):' | |||
|
1931 | 2494 | return ffi.buffer(last_buffer, len(last_buffer))[:] |
|
1932 | 2495 | |
|
1933 | 2496 | def _ensure_dctx(self, load_dict=True): |
|
1934 | lib.ZSTD_DCtx_reset(self._dctx) | |
|
2497 | lib.ZSTD_DCtx_reset(self._dctx, lib.ZSTD_reset_session_only) | |
|
1935 | 2498 | |
|
1936 | 2499 | if self._max_window_size: |
|
1937 | 2500 | zresult = lib.ZSTD_DCtx_setMaxWindowSize(self._dctx, |
@@ -210,7 +210,7 b' void zstd_module_init(PyObject* m) {' | |||
|
210 | 210 | We detect this mismatch here and refuse to load the module if this |
|
211 | 211 | scenario is detected. |
|
212 | 212 | */ |
|
213 | if (ZSTD_VERSION_NUMBER != 10306 || ZSTD_versionNumber() != 10306) { | |
|
213 | if (ZSTD_VERSION_NUMBER != 10308 || ZSTD_versionNumber() != 10308) { | |
|
214 | 214 | PyErr_SetString(PyExc_ImportError, "zstd C API mismatch; Python bindings not compiled against expected zstd version"); |
|
215 | 215 | return; |
|
216 | 216 | } |
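
The guard above refuses to import when the headers and the linked libzstd disagree (here both must report 1.3.8). From Python, the bundled library version can be sanity-checked the same way; this sketch assumes the bindings expose the version tuple as ``ZSTD_VERSION``::

    import zstandard as zstd

    # Assumption: zstandard exposes the bundled libzstd version as a tuple.
    assert zstd.ZSTD_VERSION == (1, 3, 8)
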
@@ -339,17 +339,10 b' MEM_STATIC size_t BIT_getUpperBits(size_' | |||
|
339 | 339 | |
|
340 | 340 | MEM_STATIC size_t BIT_getMiddleBits(size_t bitContainer, U32 const start, U32 const nbBits) |
|
341 | 341 | { |
|
342 | #if defined(__BMI__) && defined(__GNUC__) && __GNUC__*1000+__GNUC_MINOR__ >= 4008 /* experimental */ | |
|
343 | # if defined(__x86_64__) | |
|
344 | if (sizeof(bitContainer)==8) | |
|
345 | return _bextr_u64(bitContainer, start, nbBits); | |
|
346 | else | |
|
347 | # endif | |
|
348 | return _bextr_u32(bitContainer, start, nbBits); | |
|
349 | #else | |
|
342 | U32 const regMask = sizeof(bitContainer)*8 - 1; | |
|
343 | /* if start > regMask, bitstream is corrupted, and result is undefined */ | |
|
350 | 344 | assert(nbBits < BIT_MASK_SIZE); |
|
351 | return (bitContainer >> start) & BIT_mask[nbBits]; | |
|
352 | #endif | |
|
345 | return (bitContainer >> (start & regMask)) & BIT_mask[nbBits]; | |
|
353 | 346 | } |
|
354 | 347 | |
|
355 | 348 | MEM_STATIC size_t BIT_getLowerBits(size_t bitContainer, U32 const nbBits) |
@@ -366,9 +359,13 b' MEM_STATIC size_t BIT_getLowerBits(size_' | |||
|
366 | 359 | * @return : value extracted */ |
|
367 | 360 | MEM_STATIC size_t BIT_lookBits(const BIT_DStream_t* bitD, U32 nbBits) |
|
368 | 361 | { |
|
369 | #if defined(__BMI__) && defined(__GNUC__) /* experimental; fails if bitD->bitsConsumed + nbBits > sizeof(bitD->bitContainer)*8 */ | |
|
362 | /* arbitrate between double-shift and shift+mask */ | |
|
363 | #if 1 | |
|
364 | /* if bitD->bitsConsumed + nbBits > sizeof(bitD->bitContainer)*8, | |
|
365 | * bitstream is likely corrupted, and result is undefined */ | |
|
370 | 366 | return BIT_getMiddleBits(bitD->bitContainer, (sizeof(bitD->bitContainer)*8) - bitD->bitsConsumed - nbBits, nbBits); |
|
371 | 367 | #else |
|
368 | /* this code path is slower on my os-x laptop */ | |
|
372 | 369 | U32 const regMask = sizeof(bitD->bitContainer)*8 - 1; |
|
373 | 370 | return ((bitD->bitContainer << (bitD->bitsConsumed & regMask)) >> 1) >> ((regMask-nbBits) & regMask); |
|
374 | 371 | #endif |
@@ -392,7 +389,7 b' MEM_STATIC void BIT_skipBits(BIT_DStream' | |||
|
392 | 389 | * Read (consume) next n bits from local register and update. |
|
393 | 390 | * Pay attention to not read more than nbBits contained into local register. |
|
394 | 391 | * @return : extracted value. */ |
|
395 | MEM_STATIC size_t BIT_readBits(BIT_DStream_t* bitD, U32 nbBits) | |
|
392 | MEM_STATIC size_t BIT_readBits(BIT_DStream_t* bitD, unsigned nbBits) | |
|
396 | 393 | { |
|
397 | 394 | size_t const value = BIT_lookBits(bitD, nbBits); |
|
398 | 395 | BIT_skipBits(bitD, nbBits); |
@@ -401,7 +398,7 b' MEM_STATIC size_t BIT_readBits(BIT_DStre' | |||
|
401 | 398 | |
|
402 | 399 | /*! BIT_readBitsFast() : |
|
403 | 400 | * unsafe version; only works only if nbBits >= 1 */ |
|
404 | MEM_STATIC size_t BIT_readBitsFast(BIT_DStream_t* bitD, U32 nbBits) | |
|
401 | MEM_STATIC size_t BIT_readBitsFast(BIT_DStream_t* bitD, unsigned nbBits) | |
|
405 | 402 | { |
|
406 | 403 | size_t const value = BIT_lookBitsFast(bitD, nbBits); |
|
407 | 404 | assert(nbBits >= 1); |
@@ -15,6 +15,8 b'' | |||
|
15 | 15 | * Compiler specifics |
|
16 | 16 | *********************************************************/ |
|
17 | 17 | /* force inlining */ |
|
18 | ||
|
19 | #if !defined(ZSTD_NO_INLINE) | |
|
18 | 20 | #if defined (__GNUC__) || defined(__cplusplus) || defined(__STDC_VERSION__) && __STDC_VERSION__ >= 199901L /* C99 */ |
|
19 | 21 | # define INLINE_KEYWORD inline |
|
20 | 22 | #else |
@@ -29,6 +31,13 b'' | |||
|
29 | 31 | # define FORCE_INLINE_ATTR |
|
30 | 32 | #endif |
|
31 | 33 | |
|
34 | #else | |
|
35 | ||
|
36 | #define INLINE_KEYWORD | |
|
37 | #define FORCE_INLINE_ATTR | |
|
38 | ||
|
39 | #endif | |
|
40 | ||
|
32 | 41 | /** |
|
33 | 42 | * FORCE_INLINE_TEMPLATE is used to define C "templates", which take constant |
|
34 | 43 | parameters. They must be inlined for the compiler to eliminate the constant |
@@ -89,23 +98,21 b'' | |||
|
89 | 98 | #endif |
|
90 | 99 | |
|
91 | 100 | /* prefetch |
|
92 | * can be disabled, by declaring NO_PREFETCH macro | |
|
93 | * All prefetch invocations use a single default locality 2, | |
|
94 | * generating instruction prefetcht1, | |
|
95 | * which, according to Intel, means "load data into L2 cache". | |
|
96 | * This is a good enough "middle ground" for the time being, | |
|
97 | * though in theory, it would be better to specialize locality depending on data being prefetched. | |
|
98 | * Tests could not determine any sensible difference based on locality value. */ | |
|
101 | * can be disabled, by declaring NO_PREFETCH build macro */ | |
|
99 | 102 | #if defined(NO_PREFETCH) |
|
100 | # define PREFETCH(ptr) (void)(ptr)  /* disabled */ | |
|
103 | # define PREFETCH_L1(ptr) (void)(ptr) /* disabled */ | |
|
104 | # define PREFETCH_L2(ptr) (void)(ptr) /* disabled */ | |
|
101 | 105 | #else |
|
102 | 106 | # if defined(_MSC_VER) && (defined(_M_X64) || defined(_M_I86)) /* _mm_prefetch() is not defined outside of x86/x64 */ |
|
103 | 107 | # include <mmintrin.h> /* https://msdn.microsoft.com/fr-fr/library/84szxsww(v=vs.90).aspx */ |
|
104 | # define PREFETCH(ptr) _mm_prefetch((const char*)ptr, _MM_HINT_T1) | |
|
108 | # define PREFETCH_L1(ptr) _mm_prefetch((const char*)(ptr), _MM_HINT_T0) | |
|
109 | # define PREFETCH_L2(ptr) _mm_prefetch((const char*)(ptr), _MM_HINT_T1) | |
|
105 | 110 | # elif defined(__GNUC__) && ( (__GNUC__ >= 4) || ( (__GNUC__ == 3) && (__GNUC_MINOR__ >= 1) ) ) |
|
106 | # define PREFETCH(ptr) __builtin_prefetch(ptr, 0, 2) | |
|
111 | # define PREFETCH_L1(ptr) __builtin_prefetch((ptr), 0 /* rw==read */, 3 /* locality */) | |
|
112 | # define PREFETCH_L2(ptr) __builtin_prefetch((ptr), 0 /* rw==read */, 2 /* locality */) | |
|
107 | 113 | # else |
|
108 | # define PREFETCH(ptr) (void)(ptr)  /* disabled */ | |
|
114 | # define PREFETCH_L1(ptr) (void)(ptr) /* disabled */ | |
|
115 | # define PREFETCH_L2(ptr) (void)(ptr) /* disabled */ | |
|
109 | 116 | # endif |
|
110 | 117 | #endif /* NO_PREFETCH */ |
|
111 | 118 | |
@@ -116,7 +123,7 b'' | |||
|
116 | 123 | size_t const _size = (size_t)(s); \ |
|
117 | 124 | size_t _pos; \ |
|
118 | 125 | for (_pos=0; _pos<_size; _pos+=CACHELINE_SIZE) { \ |
|
119 | PREFETCH(_ptr + _pos); \ | |
|
126 | PREFETCH_L2(_ptr + _pos); \ | |
|
120 | 127 | } \ |
|
121 | 128 | } |
|
122 | 129 |
@@ -78,7 +78,7 b' MEM_STATIC ZSTD_cpuid_t ZSTD_cpuid(void)' | |||
|
78 | 78 | __asm__( |
|
79 | 79 | "pushl %%ebx\n\t" |
|
80 | 80 | "cpuid\n\t" |
|
81 | "movl %%ebx, %%eax\n" | |
|
81 | "movl %%ebx, %%eax\n\t" | |
|
82 | 82 | "popl %%ebx" |
|
83 | 83 | : "=a"(f7b), "=c"(f7c) |
|
84 | 84 | : "a"(7), "c"(0) |
@@ -57,9 +57,9 b' extern "C" {' | |||
|
57 | 57 | #endif |
|
58 | 58 | |
|
59 | 59 | |
|
60 | /* static assert is triggered at compile time, leaving no runtime artefact, | |
|
61 |  * but can only work with compile-time constants. | |
|
62 |  * This variant can only be used inside a function. */ | |
|
60 | /* static assert is triggered at compile time, leaving no runtime artefact. | |
|
61 | * static assert only works with compile-time constants. | |
|
62 | * Also, this variant can only be used inside a function. */ | |
|
63 | 63 | #define DEBUG_STATIC_ASSERT(c) (void)sizeof(char[(c) ? 1 : -1]) |
|
64 | 64 | |
|
65 | 65 | |
@@ -70,9 +70,19 b' extern "C" {' | |||
|
70 | 70 | # define DEBUGLEVEL 0 |
|
71 | 71 | #endif |
|
72 | 72 | |
|
73 | ||
|
74 | /* DEBUGFILE can be defined externally, | |
|
75 | * typically through compiler command line. | |
|
76 | * note : currently useless. | |
|
77 | * Value must be stderr or stdout */ | |
|
78 | #ifndef DEBUGFILE | |
|
79 | # define DEBUGFILE stderr | |
|
80 | #endif | |
|
81 | ||
|
82 | ||
|
73 | 83 | /* recommended values for DEBUGLEVEL : |
|
74 |  * 0 : no debug, all run-time checks disabled | |
|
75 |  * 1 : no display, enables assert() only | |
|
84 | * 0 : release mode, no debug, all run-time checks disabled | |
|
85 | * 1 : enables assert() only, no display | |
|
76 | 86 | * 2 : reserved, for currently active debug path |
|
77 | 87 | * 3 : events once per object lifetime (CCtx, CDict, etc.) |
|
78 | 88 | * 4 : events once per frame |
@@ -81,7 +91,7 b' extern "C" {' | |||
|
81 | 91 | * 7+: events at every position (*very* verbose) |
|
82 | 92 | * |
|
83 | 93 | * It's generally inconvenient to output traces > 5. |
|
84 |  * In which case, it's possible to selectively enable higher verbosity levels | |
|
94 | * In which case, it's possible to selectively trigger high verbosity levels | |
|
85 | 95 | * by modifying g_debug_level. |
|
86 | 96 | */ |
|
87 | 97 | |
@@ -95,11 +105,12 b' extern "C" {' | |||
|
95 | 105 | |
|
96 | 106 | #if (DEBUGLEVEL>=2) |
|
97 | 107 | # include <stdio.h> |
|
98 | extern int g_debuglevel; /* here, this variable is only declared, | |
|
99 | it actually lives in debug.c, | |
|
100 | and is shared by the whole process. | |
|
101 | It's typically used to enable very verbose levels | |
|
102 | on selective conditions (such as position in src) */ | |
|
108 | extern int g_debuglevel; /* the variable is only declared, | |
|
109 | it actually lives in debug.c, | |
|
110 | and is shared by the whole process. | |
|
111 | It's not thread-safe. | |
|
112 | It's useful when enabling very verbose levels | |
|
113 | on selective conditions (such as position in src) */ | |
|
103 | 114 | |
|
104 | 115 | # define RAWLOG(l, ...) { \ |
|
105 | 116 | if (l<=g_debuglevel) { \ |
@@ -14,6 +14,10 b'' | |||
|
14 | 14 | |
|
15 | 15 | const char* ERR_getErrorString(ERR_enum code) |
|
16 | 16 | { |
|
17 | #ifdef ZSTD_STRIP_ERROR_STRINGS | |
|
18 | (void)code; | |
|
19 | return "Error strings stripped"; | |
|
20 | #else | |
|
17 | 21 | static const char* const notErrorCode = "Unspecified error code"; |
|
18 | 22 | switch( code ) |
|
19 | 23 | { |
@@ -39,10 +43,12 b' const char* ERR_getErrorString(ERR_enum ' | |||
|
39 | 43 | case PREFIX(dictionaryCreation_failed): return "Cannot create Dictionary from provided samples"; |
|
40 | 44 | case PREFIX(dstSize_tooSmall): return "Destination buffer is too small"; |
|
41 | 45 | case PREFIX(srcSize_wrong): return "Src size is incorrect"; |
|
46 | case PREFIX(dstBuffer_null): return "Operation on NULL destination buffer"; | |
|
42 | 47 | /* following error codes are not stable and may be removed or changed in a future version */ |
|
43 | 48 | case PREFIX(frameIndex_tooLarge): return "Frame index is too large"; |
|
44 | 49 | case PREFIX(seekableIO): return "An I/O error occurred when reading/seeking"; |
|
45 | 50 | case PREFIX(maxCode): |
|
46 | 51 | default: return notErrorCode; |
|
47 | 52 | } |
|
53 | #endif | |
|
48 | 54 | } |
@@ -512,7 +512,7 b' MEM_STATIC void FSE_initCState(FSE_CStat' | |||
|
512 | 512 | const U32 tableLog = MEM_read16(ptr); |
|
513 | 513 | statePtr->value = (ptrdiff_t)1<<tableLog; |
|
514 | 514 | statePtr->stateTable = u16ptr+2; |
|
515 | statePtr->symbolTT = ((const U32*)ct + 1 + (tableLog ? (1<<(tableLog-1)) : 1)); | |
|
515 | statePtr->symbolTT = ct + 1 + (tableLog ? (1<<(tableLog-1)) : 1); | |
|
516 | 516 | statePtr->stateLog = tableLog; |
|
517 | 517 | } |
|
518 | 518 | |
@@ -531,7 +531,7 b' MEM_STATIC void FSE_initCState2(FSE_CSta' | |||
|
531 | 531 | } |
|
532 | 532 | } |
|
533 | 533 | |
|
534 | MEM_STATIC void FSE_encodeSymbol(BIT_CStream_t* bitC, FSE_CState_t* statePtr, U32 symbol) | |
|
534 | MEM_STATIC void FSE_encodeSymbol(BIT_CStream_t* bitC, FSE_CState_t* statePtr, unsigned symbol) | |
|
535 | 535 | { |
|
536 | 536 | FSE_symbolCompressionTransform const symbolTT = ((const FSE_symbolCompressionTransform*)(statePtr->symbolTT))[symbol]; |
|
537 | 537 | const U16* const stateTable = (const U16*)(statePtr->stateTable); |
@@ -173,15 +173,19 b' typedef U32 HUF_DTable;' | |||
|
173 | 173 | * Advanced decompression functions |
|
174 | 174 | ******************************************/ |
|
175 | 175 | size_t HUF_decompress4X1 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< single-symbol decoder */ |
|
176 | #ifndef HUF_FORCE_DECOMPRESS_X1 | |
|
176 | 177 | size_t HUF_decompress4X2 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< double-symbols decoder */ |
|
178 | #endif | |
|
177 | 179 | |
|
178 | 180 | size_t HUF_decompress4X_DCtx (HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< decodes RLE and uncompressed */ |
|
179 | 181 | size_t HUF_decompress4X_hufOnly(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< considers RLE and uncompressed as errors */ |
|
180 | 182 | size_t HUF_decompress4X_hufOnly_wksp(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize); /**< considers RLE and uncompressed as errors */ |
|
181 | 183 | size_t HUF_decompress4X1_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< single-symbol decoder */ |
|
182 | 184 | size_t HUF_decompress4X1_DCtx_wksp(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize); /**< single-symbol decoder */ |
|
185 | #ifndef HUF_FORCE_DECOMPRESS_X1 | |
|
183 | 186 | size_t HUF_decompress4X2_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< double-symbols decoder */ |
|
184 | 187 | size_t HUF_decompress4X2_DCtx_wksp(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize); /**< double-symbols decoder */ |
|
188 | #endif | |
|
185 | 189 | |
|
186 | 190 | |
|
187 | 191 | /* **************************************** |
@@ -228,7 +232,7 b' size_t HUF_compress4X_repeat(void* dst, ' | |||
|
228 | 232 | #define HUF_CTABLE_WORKSPACE_SIZE_U32 (2*HUF_SYMBOLVALUE_MAX +1 +1) |
|
229 | 233 | #define HUF_CTABLE_WORKSPACE_SIZE (HUF_CTABLE_WORKSPACE_SIZE_U32 * sizeof(unsigned)) |
|
230 | 234 | size_t HUF_buildCTable_wksp (HUF_CElt* tree, |
|
231 | const U32* count, U32 maxSymbolValue, U32 maxNbBits, | |
|
235 | const unsigned* count, U32 maxSymbolValue, U32 maxNbBits, | |
|
232 | 236 | void* workSpace, size_t wkspSize); |
|
233 | 237 | |
|
234 | 238 | /*! HUF_readStats() : |
@@ -277,14 +281,22 @@ U32 HUF_selectDecoder (size_t dstSize, s
 #define HUF_DECOMPRESS_WORKSPACE_SIZE (2 << 10)
 #define HUF_DECOMPRESS_WORKSPACE_SIZE_U32 (HUF_DECOMPRESS_WORKSPACE_SIZE / sizeof(U32))

+#ifndef HUF_FORCE_DECOMPRESS_X2
 size_t HUF_readDTableX1 (HUF_DTable* DTable, const void* src, size_t srcSize);
 size_t HUF_readDTableX1_wksp (HUF_DTable* DTable, const void* src, size_t srcSize, void* workSpace, size_t wkspSize);
+#endif
+#ifndef HUF_FORCE_DECOMPRESS_X1
 size_t HUF_readDTableX2 (HUF_DTable* DTable, const void* src, size_t srcSize);
 size_t HUF_readDTableX2_wksp (HUF_DTable* DTable, const void* src, size_t srcSize, void* workSpace, size_t wkspSize);
+#endif

 size_t HUF_decompress4X_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable);
+#ifndef HUF_FORCE_DECOMPRESS_X2
 size_t HUF_decompress4X1_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable);
+#endif
+#ifndef HUF_FORCE_DECOMPRESS_X1
 size_t HUF_decompress4X2_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable);
+#endif


 /* ====================== */
@@ -306,24 +318,36 @@ size_t HUF_compress1X_repeat(void* dst, 
                       HUF_CElt* hufTable, HUF_repeat* repeat, int preferRepeat, int bmi2);

 size_t HUF_decompress1X1 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize);   /* single-symbol decoder */
+#ifndef HUF_FORCE_DECOMPRESS_X1
 size_t HUF_decompress1X2 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize);   /* double-symbol decoder */
+#endif

 size_t HUF_decompress1X_DCtx (HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize);
 size_t HUF_decompress1X_DCtx_wksp (HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize);
+#ifndef HUF_FORCE_DECOMPRESS_X2
 size_t HUF_decompress1X1_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize);   /**< single-symbol decoder */
 size_t HUF_decompress1X1_DCtx_wksp(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize);   /**< single-symbol decoder */
+#endif
+#ifndef HUF_FORCE_DECOMPRESS_X1
 size_t HUF_decompress1X2_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize);   /**< double-symbols decoder */
 size_t HUF_decompress1X2_DCtx_wksp(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize);   /**< double-symbols decoder */
+#endif

 size_t HUF_decompress1X_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable);   /**< automatic selection of sing or double symbol decoder, based on DTable */
+#ifndef HUF_FORCE_DECOMPRESS_X2
 size_t HUF_decompress1X1_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable);
+#endif
+#ifndef HUF_FORCE_DECOMPRESS_X1
 size_t HUF_decompress1X2_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable);
+#endif

 /* BMI2 variants.
  * If the CPU has BMI2 support, pass bmi2=1, otherwise pass bmi2=0.
  */
 size_t HUF_decompress1X_usingDTable_bmi2(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable, int bmi2);
+#ifndef HUF_FORCE_DECOMPRESS_X2
 size_t HUF_decompress1X1_DCtx_wksp_bmi2(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize, int bmi2);
+#endif
 size_t HUF_decompress4X_usingDTable_bmi2(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable, int bmi2);
 size_t HUF_decompress4X_hufOnly_wksp_bmi2(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize, void* workSpace, size_t wkspSize, int bmi2);

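The ``*_bmi2`` variants take an explicit flag rather than probing the CPU
themselves. A hedged sketch of how a caller might derive that flag; zstd
ships its own detection helper, and ``__builtin_cpu_supports`` (a GCC/clang
builtin) is used here purely for illustration::

    #include <stdio.h>

    /* Illustrative probe for the bmi2 argument of the *_bmi2 variants. */
    static int cpu_has_bmi2(void)
    {
    #if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
        return __builtin_cpu_supports("bmi2");
    #else
        return 0;   /* conservative default on other toolchains/ISAs */
    #endif
    }

    int main(void)
    {
        int const bmi2 = cpu_has_bmi2();
        printf("pass bmi2=%d to the *_bmi2 decoders\n", bmi2);
        return 0;
    }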
@@ -39,6 +39,10 @@ extern "C" {
 #  define MEM_STATIC static  /* this version may generate warnings for unused static functions; disable the relevant warning */
 #endif

+#ifndef __has_builtin
+#  define __has_builtin(x) 0  /* compat. with non-clang compilers */
+#endif
+
 /* code only tested on 32 and 64 bits systems */
 #define MEM_STATIC_ASSERT(c)   { enum { MEM_static_assert = 1/(int)(!!(c)) }; }
 MEM_STATIC void MEM_check(void) { MEM_STATIC_ASSERT((sizeof(size_t)==4) || (sizeof(size_t)==8)); }
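With the ``__has_builtin`` fallback in place, feature probes degrade
gracefully on compilers that lack the operator (older GCC, MSVC): the probe
simply evaluates to 0 and the portable branch is taken. A minimal sketch of
the same pattern; the ``ASSUME_UNREACHABLE`` macro name is illustrative, not
part of zstd::

    #ifndef __has_builtin
    #  define __has_builtin(x) 0   /* same fallback as the hunk above */
    #endif

    /* Safe on every compiler: clang takes the builtin branch, others
     * fall back to a portable no-op. */
    #if __has_builtin(__builtin_unreachable)
    #  define ASSUME_UNREACHABLE() __builtin_unreachable()
    #else
    #  define ASSUME_UNREACHABLE() ((void)0)
    #endif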
@@ -198,7 +202,8 @@ MEM_STATIC U32 MEM_swap32(U32 in)
 {
 #if defined(_MSC_VER)     /* Visual Studio */
     return _byteswap_ulong(in);
-#elif defined (__GNUC__) && (__GNUC__ * 100 + __GNUC_MINOR__ >= 403)
+#elif (defined (__GNUC__) && (__GNUC__ * 100 + __GNUC_MINOR__ >= 403)) \
+  || (defined(__clang__) && __has_builtin(__builtin_bswap32))
     return __builtin_bswap32(in);
 #else
     return  ((in << 24) & 0xff000000 ) |
@@ -212,7 +217,8 @@ MEM_STATIC U64 MEM_swap64(U64 in)
 {
 #if defined(_MSC_VER)     /* Visual Studio */
     return _byteswap_uint64(in);
-#elif defined (__GNUC__) && (__GNUC__ * 100 + __GNUC_MINOR__ >= 403)
+#elif (defined (__GNUC__) && (__GNUC__ * 100 + __GNUC_MINOR__ >= 403)) \
+  || (defined(__clang__) && __has_builtin(__builtin_bswap64))
     return __builtin_bswap64(in);
 #else
     return  ((in << 56) & 0xff00000000000000ULL) |
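Both hunks only widen the builtin branch; the ``#else`` branch they fall back
to is plain shift-and-mask code. A quick self-check, where
``swap32_fallback`` mirrors the 32-bit fallback and the test value is
arbitrary::

    #include <assert.h>
    #include <stdint.h>
    #include <stdio.h>

    /* Mirrors the #else branch of MEM_swap32 above. */
    static uint32_t swap32_fallback(uint32_t in)
    {
        return ((in << 24) & 0xff000000) |
               ((in <<  8) & 0x00ff0000) |
               ((in >>  8) & 0x0000ff00) |
               ((in >> 24) & 0x000000ff);
    }

    int main(void)
    {
        assert(swap32_fallback(0x11223344u) == 0x44332211u);
        printf("fallback byteswap OK\n");
        return 0;
    }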
@@ -88,8 +88,8 @@ static void* POOL_thread(void* opaque) {
         ctx->numThreadsBusy++;
         ctx->queueEmpty = ctx->queueHead == ctx->queueTail;
         /* Unlock the mutex, signal a pusher, and run the job */
+        ZSTD_pthread_cond_signal(&ctx->queuePushCond);
         ZSTD_pthread_mutex_unlock(&ctx->queueMutex);
-        ZSTD_pthread_cond_signal(&ctx->queuePushCond);

         job.function(job.opaque);
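This pool.c hunk swaps the signal and the unlock so the condition variable is
signalled while the mutex is still held. Signalling after the unlock is also
legal for POSIX condition variables, but signalling under the lock is the
conservative pattern: it rules out a racing thread waking, draining the
queue, and tearing the pool down between the unlock and the signal. A
minimal sketch of the resulting shape, with illustrative names rather than
zstd's::

    #include <pthread.h>

    static pthread_mutex_t mtx  = PTHREAD_MUTEX_INITIALIZER;
    static pthread_cond_t  cond = PTHREAD_COND_INITIALIZER;
    static int queue_len;

    static void pop_one(void)
    {
        pthread_mutex_lock(&mtx);
        queue_len--;                  /* consume a job slot            */
        pthread_cond_signal(&cond);   /* wake a pusher under the lock, */
        pthread_mutex_unlock(&mtx);   /* then release the mutex        */
    }

    int main(void)
    {
        queue_len = 1;
        pop_one();    /* single-threaded smoke test; build with -pthread */
        return 0;
    }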
@@ -30,8 +30,10 @@ const char* ZSTD_versionString(void) { r
 /*-****************************************
 *  ZSTD Error Management
 ******************************************/
+#undef ZSTD_isError   /* defined within zstd_internal.h */
 /*! ZSTD_isError() :
- *  tells if a return value is an error code */
+ *  tells if a return value is an error code
+ *  symbol is required for external callers */
 unsigned ZSTD_isError(size_t code) { return ERR_isError(code); }

 /*! ZSTD_getErrorName() :
@@ -72,6 +72,7 @@ typedef enum {
   ZSTD_error_workSpace_tooSmall= 66,
   ZSTD_error_dstSize_tooSmall = 70,
   ZSTD_error_srcSize_wrong    = 72,
+  ZSTD_error_dstBuffer_null   = 74,
   /* following error codes are __NOT STABLE__, they can be removed or changed in future versions */
   ZSTD_error_frameIndex_tooLarge = 100,
   ZSTD_error_seekableIO          = 102,
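Codes in the stable part of this enum are meant to be matched with
``ZSTD_getErrorCode()`` from ``zstd_errors.h`` rather than by comparing error
strings. A sketch of matching the new code; whether a particular call site
actually raises ``ZSTD_error_dstBuffer_null`` depends on the library
version::

    #include <stdio.h>
    #include <zstd.h>
    #include <zstd_errors.h>

    static void report(size_t ret)
    {
        if (!ZSTD_isError(ret)) { printf("ok: %zu bytes\n", ret); return; }
        switch (ZSTD_getErrorCode(ret)) {
        case ZSTD_error_dstBuffer_null:   printf("dst was NULL\n");   break;
        case ZSTD_error_dstSize_tooSmall: printf("dst too small\n");  break;
        default: printf("error: %s\n", ZSTD_getErrorName(ret));
        }
    }

    int main(void)
    {
        char src[64] = {0}, dst[1];
        /* deliberately undersized destination to trigger an error */
        report(ZSTD_compress(dst, sizeof dst, src, sizeof src, 1));
        return 0;
    }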
@@ -41,6 +41,9 @@ extern "C" {

 /* ---- static assert (debug) --- */
 #define ZSTD_STATIC_ASSERT(c) DEBUG_STATIC_ASSERT(c)
+#define ZSTD_isError ERR_isError /* for inlining */
+#define FSE_isError  ERR_isError
+#define HUF_isError  ERR_isError


 /*-*************************************
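Taken together with the ``zstd_common.c`` hunk above, this implements a small
linkage trick: internal call sites compile ``ZSTD_isError`` down to the
inlinable ``ERR_isError``, while one translation unit ``#undef``-s the macro
and still exports the real symbol for external callers. A reduced sketch of
the pattern; the ``MYLIB_``/``ERR_`` names and the error threshold are
illustrative::

    #include <stddef.h>
    #include <stdio.h>

    /* Inlinable internal predicate (stands in for ERR_isError). */
    static unsigned ERR_isError(size_t code) { return code > (size_t)-100; }

    /* Every internal caller sees the macro and inlines the check... */
    #define MYLIB_isError ERR_isError

    /* ...except the one file that must export a real symbol: */
    #undef MYLIB_isError
    unsigned MYLIB_isError(size_t code) { return ERR_isError(code); }

    int main(void)
    {
        printf("error? %u\n", MYLIB_isError((size_t)-1));   /* prints 1 */
        return 0;
    }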
@@ -75,7 +78,6 @@ static const U32 repStartValue[ZSTD_REP_
 #define BIT0   1

 #define ZSTD_WINDOWLOG_ABSOLUTEMIN 10
-#define ZSTD_WINDOWLOG_DEFAULTMAX 27 /* Default maximum allowed window log */
 static const size_t ZSTD_fcs_fieldSize[4] = { 0, 2, 4, 8 };
 static const size_t ZSTD_did_fieldSize[4] = { 0, 1, 2, 4 };
@@ -242,7 +244,7 @@ typedef struct {
     blockType_e blockType;
     U32 lastBlock;
     U32 origSize;
-} blockProperties_t;
+} blockProperties_t;   /* declared here for decompress and fullbench */

 /*! ZSTD_getcBlockSize() :
  *  Provides the size of compressed block from block header `src` */
@@ -250,6 +252,13 @@ typedef struct {
 size_t ZSTD_getcBlockSize(const void* src, size_t srcSize,
                           blockProperties_t* bpPtr);

+/*! ZSTD_decodeSeqHeaders() :
+ *  decode sequence header from src */
+/*  Used by: decompress, fullbench (does not get its definition from here) */
+size_t ZSTD_decodeSeqHeaders(ZSTD_DCtx* dctx, int* nbSeqPtr,
+                             const void* src, size_t srcSize);
+
+
 #if defined (__cplusplus)
 }
 #endif
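The ``blockProperties_t`` fields above map directly onto the on-wire zstd
block header, which is three little-endian bytes: bit 0 is the last-block
flag, bits 1-2 the block type (0 raw, 1 RLE, 2 compressed), and bits 3-23 the
block size (per RFC 8878). A standalone sketch of that decoding;
``parse_block_header`` is illustrative, not the library's
``ZSTD_getcBlockSize``::

    #include <stdint.h>
    #include <stdio.h>

    static void parse_block_header(const uint8_t hdr[3])
    {
        /* assemble the 3 bytes little-endian, then slice the fields */
        uint32_t const v = (uint32_t)hdr[0]
                         | ((uint32_t)hdr[1] << 8)
                         | ((uint32_t)hdr[2] << 16);
        printf("lastBlock=%u type=%u size=%u\n",
               v & 1, (v >> 1) & 3, v >> 3);
    }

    int main(void)
    {
        uint8_t const hdr[3] = { 0x21, 0x00, 0x00 };  /* last=1, raw, size=4 */
        parse_block_header(hdr);
        return 0;
    }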
@@ -115,7 +115,7 @@ size_t FSE_buildCTable_wksp(FSE_CTable* 
     /* symbol start positions */
     {   U32 u;
         cumul[0] = 0;
-        for (u=1; u<=maxSymbolValue+1; u++) {
+        for (u=1; u <= maxSymbolValue+1; u++) {
             if (normalizedCounter[u-1]==-1) {  /* Low proba symbol */
                 cumul[u] = cumul[u-1] + 1;
                 tableSymbol[highThreshold--] = (FSE_FUNCTION_TYPE)(u-1);
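For context, this loop builds the cumulative symbol start positions that seed
the FSE table: ``cumul[u]`` is a running total of normalized counts, and a
``-1`` count marks a low-probability symbol that occupies exactly one slot.
A standalone rendering with made-up counts::

    #include <stdio.h>

    int main(void)
    {
        /* -1 counts as one slot, so the counts sum to table size 8 (2^3) */
        short    normalizedCounter[4] = { 5, -1, 1, 1 };
        unsigned maxSymbolValue = 3;
        unsigned cumul[5];
        unsigned u;
        cumul[0] = 0;
        for (u = 1; u <= maxSymbolValue + 1; u++) {
            if (normalizedCounter[u-1] == -1)   /* low-probability symbol */
                cumul[u] = cumul[u-1] + 1;
            else
                cumul[u] = cumul[u-1] + (unsigned)normalizedCounter[u-1];
        }
        for (u = 0; u <= maxSymbolValue + 1; u++)
            printf("cumul[%u] = %u\n", u, cumul[u]);   /* 0 5 6 7 8 */
        return 0;
    }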
@@ -658,7 +658,7 @@ size_t FSE_compress_wksp (void* dst, siz
     BYTE* op = ostart;
     BYTE* const oend = ostart + dstSize;

-    U32   count[FSE_MAX_SYMBOL_VALUE+1];
+    unsigned count[FSE_MAX_SYMBOL_VALUE+1];
     S16   norm[FSE_MAX_SYMBOL_VALUE+1];
     FSE_CTable* CTable = (FSE_CTable*)workSpace;
     size_t const CTableSize = FSE_CTABLE_SIZE_U32(tableLog, maxSymbolValue);
@@ -672,7 +672,7 @@ size_t FSE_compress_wksp (void* dst, siz
     if (!tableLog) tableLog = FSE_DEFAULT_TABLELOG;

     /* Scan input and build symbol stats */
-    {   CHECK_V_F(maxCount, HIST_count_wksp(count, &maxSymbolValue, src, srcSize, scratchBuffer) );
+    {   CHECK_V_F(maxCount, HIST_count_wksp(count, &maxSymbolValue, src, srcSize, scratchBuffer, scratchBufferSize) );
         if (maxCount == srcSize) return 1;   /* only a single symbol in src : rle */
         if (maxCount == 1) return 0;         /* each symbol present maximum once => not compressible */
         if (maxCount < (srcSize >> 7)) return 0;   /* Heuristic : not compressible enough */
[diff truncated by the viewer: the remaining modified files in this commit
were elided as too big to render; among them, one file was copied from
mercurial/util.py to mercurial/utils/compression.py and two files were
removed.]