Show More
This diff has been collapsed as it changes many lines, (535 lines changed) Show them Hide them | |||
@@ -0,0 +1,535 | |||
|
1 | /* | |
|
2 | * Copyright (c) 2016-present, Yann Collet, Facebook, Inc. | |
|
3 | * All rights reserved. | |
|
4 | * | |
|
5 | * This source code is licensed under both the BSD-style license (found in the | |
|
6 | * LICENSE file in the root directory of this source tree) and the GPLv2 (found | |
|
7 | * in the COPYING file in the root directory of this source tree). | |
|
8 | * You may select, at your option, one of the above-listed licenses. | |
|
9 | */ | |
|
10 | ||
|
11 | #ifndef ZSTD_CWKSP_H | |
|
12 | #define ZSTD_CWKSP_H | |
|
13 | ||
|
14 | /*-************************************* | |
|
15 | * Dependencies | |
|
16 | ***************************************/ | |
|
17 | #include "zstd_internal.h" | |
|
18 | ||
|
19 | #if defined (__cplusplus) | |
|
20 | extern "C" { | |
|
21 | #endif | |
|
22 | ||
|
23 | /*-************************************* | |
|
24 | * Constants | |
|
25 | ***************************************/ | |
|
26 | ||
|
27 | /* define "workspace is too large" as this number of times larger than needed */ | |
|
28 | #define ZSTD_WORKSPACETOOLARGE_FACTOR 3 | |
|
29 | ||
|
30 | /* when workspace is continuously too large | |
|
31 | * during at least this number of times, | |
|
32 | * context's memory usage is considered wasteful, | |
|
33 | * because it's sized to handle a worst case scenario which rarely happens. | |
|
34 | * In which case, resize it down to free some memory */ | |
|
35 | #define ZSTD_WORKSPACETOOLARGE_MAXDURATION 128 | |
|
36 | ||
|
37 | /* Since the workspace is effectively its own little malloc implementation / | |
|
38 | * arena, when we run under ASAN, we should similarly insert redzones between | |
|
39 | * each internal element of the workspace, so ASAN will catch overruns that | |
|
40 | * reach outside an object but that stay inside the workspace. | |
|
41 | * | |
|
42 | * This defines the size of that redzone. | |
|
43 | */ | |
|
44 | #ifndef ZSTD_CWKSP_ASAN_REDZONE_SIZE | |
|
45 | #define ZSTD_CWKSP_ASAN_REDZONE_SIZE 128 | |
|
46 | #endif | |
|
47 | ||
|
48 | /*-************************************* | |
|
49 | * Structures | |
|
50 | ***************************************/ | |
|
/* Allocation phases. The workspace hands out memory in this fixed order;
 * the phase only ever advances (see ZSTD_cwksp_internal_advance_phase). */
typedef enum {
    ZSTD_cwksp_alloc_objects,  /* whole-lifetime objects, packed from the bottom */
    ZSTD_cwksp_alloc_buffers,  /* unaligned buffers, packed down from the top */
    ZSTD_cwksp_alloc_aligned   /* U32-aligned buffers and tables */
} ZSTD_cwksp_alloc_phase_e;
|
56 | ||
|
57 | /** | |
|
58 | * Zstd fits all its internal datastructures into a single continuous buffer, | |
|
59 | * so that it only needs to perform a single OS allocation (or so that a buffer | |
|
60 | * can be provided to it and it can perform no allocations at all). This buffer | |
|
61 | * is called the workspace. | |
|
62 | * | |
|
63 | * Several optimizations complicate that process of allocating memory ranges | |
|
64 | * from this workspace for each internal datastructure: | |
|
65 | * | |
|
66 | * - These different internal datastructures have different setup requirements: | |
|
67 | * | |
|
68 | * - The static objects need to be cleared once and can then be trivially | |
|
69 | * reused for each compression. | |
|
70 | * | |
|
71 | * - Various buffers don't need to be initialized at all--they are always | |
|
72 | * written into before they're read. | |
|
73 | * | |
|
74 | * - The matchstate tables have a unique requirement that they don't need | |
|
75 | * their memory to be totally cleared, but they do need the memory to have | |
|
76 | * some bound, i.e., a guarantee that all values in the memory they've been | |
|
77 | * allocated is less than some maximum value (which is the starting value | |
|
78 | * for the indices that they will then use for compression). When this | |
|
79 | * guarantee is provided to them, they can use the memory without any setup | |
|
80 | * work. When it can't, they have to clear the area. | |
|
81 | * | |
|
82 | * - These buffers also have different alignment requirements. | |
|
83 | * | |
|
84 | * - We would like to reuse the objects in the workspace for multiple | |
|
85 | * compressions without having to perform any expensive reallocation or | |
|
86 | * reinitialization work. | |
|
87 | * | |
|
88 | * - We would like to be able to efficiently reuse the workspace across | |
|
89 | * multiple compressions **even when the compression parameters change** and | |
|
90 | * we need to resize some of the objects (where possible). | |
|
91 | * | |
|
92 | * To attempt to manage this buffer, given these constraints, the ZSTD_cwksp | |
|
93 | * abstraction was created. It works as follows: | |
|
94 | * | |
|
95 | * Workspace Layout: | |
|
96 | * | |
|
97 | * [ ... workspace ... ] | |
|
98 | * [objects][tables ... ->] free space [<- ... aligned][<- ... buffers] | |
|
99 | * | |
|
100 | * The various objects that live in the workspace are divided into the | |
|
101 | * following categories, and are allocated separately: | |
|
102 | * | |
|
103 | * - Static objects: this is optionally the enclosing ZSTD_CCtx or ZSTD_CDict, | |
|
104 | * so that literally everything fits in a single buffer. Note: if present, | |
|
105 | * this must be the first object in the workspace, since ZSTD_free{CCtx, | |
|
106 | * CDict}() rely on a pointer comparison to see whether one or two frees are | |
|
107 | * required. | |
|
108 | * | |
|
109 | * - Fixed size objects: these are fixed-size, fixed-count objects that are | |
|
110 | * nonetheless "dynamically" allocated in the workspace so that we can | |
|
111 | * control how they're initialized separately from the broader ZSTD_CCtx. | |
|
112 | * Examples: | |
|
113 | * - Entropy Workspace | |
|
114 | * - 2 x ZSTD_compressedBlockState_t | |
|
115 | * - CDict dictionary contents | |
|
116 | * | |
|
117 | * - Tables: these are any of several different datastructures (hash tables, | |
|
118 | * chain tables, binary trees) that all respect a common format: they are | |
|
119 | * uint32_t arrays, all of whose values are between 0 and (nextSrc - base). | |
|
120 | * Their sizes depend on the cparams. | |
|
121 | * | |
|
122 | * - Aligned: these buffers are used for various purposes that require 4 byte | |
|
123 | * alignment, but don't require any initialization before they're used. | |
|
124 | * | |
|
125 | * - Buffers: these buffers are used for various purposes that don't require | |
|
126 | * any alignment or initialization before they're used. This means they can | |
|
127 | * be moved around at no cost for a new compression. | |
|
128 | * | |
|
129 | * Allocating Memory: | |
|
130 | * | |
|
131 | * The various types of objects must be allocated in order, so they can be | |
|
132 | * correctly packed into the workspace buffer. That order is: | |
|
133 | * | |
|
134 | * 1. Objects | |
|
135 | * 2. Buffers | |
|
136 | * 3. Aligned | |
|
137 | * 4. Tables | |
|
138 | * | |
|
139 | * Attempts to reserve objects of different types out of order will fail. | |
|
140 | */ | |
|
typedef struct {
    void* workspace;        /* start of the managed buffer */
    void* workspaceEnd;     /* one past the end of the managed buffer */

    void* objectEnd;        /* end of the bottom-allocated objects region */
    void* tableEnd;         /* end of the table region (grows up from objectEnd) */
    void* tableValidEnd;    /* high-water mark of table space whose contents are known bounded */
    void* allocStart;       /* bottom of the top-down (buffer/aligned) allocations */

    int allocFailed;                 /* sticky flag: set when any reservation fails */
    int workspaceOversizedDuration;  /* consecutive uses during which the ws was judged too large */
    ZSTD_cwksp_alloc_phase_e phase;  /* current allocation phase (advances monotonically) */
} ZSTD_cwksp;
|
154 | ||
|
155 | /*-************************************* | |
|
156 | * Functions | |
|
157 | ***************************************/ | |
|
158 | ||
|
159 | MEM_STATIC size_t ZSTD_cwksp_available_space(ZSTD_cwksp* ws); | |
|
160 | ||
|
/**
 * Asserts the ordering invariants between the workspace's internal pointers:
 *   workspace <= objectEnd <= {tableEnd, tableValidEnd} <= allocStart <= workspaceEnd
 * Compiles to nothing when asserts are disabled; (void)ws avoids an
 * unused-parameter warning in that configuration.
 */
MEM_STATIC void ZSTD_cwksp_assert_internal_consistency(ZSTD_cwksp* ws) {
    (void)ws;
    assert(ws->workspace <= ws->objectEnd);
    assert(ws->objectEnd <= ws->tableEnd);
    assert(ws->objectEnd <= ws->tableValidEnd);
    assert(ws->tableEnd <= ws->allocStart);
    assert(ws->tableValidEnd <= ws->allocStart);
    assert(ws->allocStart <= ws->workspaceEnd);
}
|
170 | ||
|
171 | /** | |
|
172 | * Align must be a power of 2. | |
|
173 | */ | |
|
174 | MEM_STATIC size_t ZSTD_cwksp_align(size_t size, size_t const align) { | |
|
175 | size_t const mask = align - 1; | |
|
176 | assert((align & mask) == 0); | |
|
177 | return (size + mask) & ~mask; | |
|
178 | } | |
|
179 | ||
|
/**
 * Use this to determine how much space in the workspace we will consume to
 * allocate this object. (Normally it should be exactly the size of the object,
 * but under special conditions, like ASAN, where we pad each object, it might
 * be larger.)
 *
 * Since tables aren't currently redzoned, you don't need to call through this
 * to figure out how much space you need for the matchState tables. Everything
 * else is though.
 */
MEM_STATIC size_t ZSTD_cwksp_alloc_size(size_t size) {
#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* one redzone before the object and one after it */
    return size + 2 * ZSTD_CWKSP_ASAN_REDZONE_SIZE;
#else
    return size;
#endif
}
|
197 | ||
|
/**
 * Moves the workspace into the given allocation phase, performing the
 * one-time bookkeeping each transition requires. Phases only advance
 * (objects -> buffers -> aligned); requesting an earlier phase is a bug.
 */
MEM_STATIC void ZSTD_cwksp_internal_advance_phase(
        ZSTD_cwksp* ws, ZSTD_cwksp_alloc_phase_e phase) {
    assert(phase >= ws->phase);
    if (phase > ws->phase) {
        if (ws->phase < ZSTD_cwksp_alloc_buffers &&
                phase >= ZSTD_cwksp_alloc_buffers) {
            /* entering the buffers phase: no table contents are valid yet */
            ws->tableValidEnd = ws->objectEnd;
        }
        if (ws->phase < ZSTD_cwksp_alloc_aligned &&
                phase >= ZSTD_cwksp_alloc_aligned) {
            /* If unaligned allocations down from a too-large top have left us
             * unaligned, we need to realign our alloc ptr. Technically, this
             * can consume space that is unaccounted for in the neededSpace
             * calculation. However, I believe this can only happen when the
             * workspace is too large, and specifically when it is too large
             * by a larger margin than the space that will be consumed. */
            /* TODO: cleaner, compiler warning friendly way to do this??? */
            ws->allocStart = (BYTE*)ws->allocStart - ((size_t)ws->allocStart & (sizeof(U32)-1));
            if (ws->allocStart < ws->tableValidEnd) {
                /* realignment ate into previously-valid table space */
                ws->tableValidEnd = ws->allocStart;
            }
        }
        ws->phase = phase;
    }
}
|
223 | ||
|
224 | /** | |
|
225 | * Returns whether this object/buffer/etc was allocated in this workspace. | |
|
226 | */ | |
|
227 | MEM_STATIC int ZSTD_cwksp_owns_buffer(const ZSTD_cwksp* ws, const void* ptr) { | |
|
228 | return (ptr != NULL) && (ws->workspace <= ptr) && (ptr <= ws->workspaceEnd); | |
|
229 | } | |
|
230 | ||
|
/**
 * Internal function. Do not use directly.
 *
 * Reserves `bytes` from the top-down allocation area after advancing to the
 * requested phase. Returns NULL (and sets ws->allocFailed) when the new
 * allocation would collide with the upward-growing table region. Under ASAN,
 * the reservation is padded with a redzone on each side and only the interior
 * `bytes` are unpoisoned.
 */
MEM_STATIC void* ZSTD_cwksp_reserve_internal(
        ZSTD_cwksp* ws, size_t bytes, ZSTD_cwksp_alloc_phase_e phase) {
    void* alloc;
    void* bottom = ws->tableEnd;
    ZSTD_cwksp_internal_advance_phase(ws, phase);
    alloc = (BYTE *)ws->allocStart - bytes;

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* over-reserve space */
    alloc = (BYTE *)alloc - 2 * ZSTD_CWKSP_ASAN_REDZONE_SIZE;
#endif

    DEBUGLOG(5, "cwksp: reserving %p %zd bytes, %zd bytes remaining",
        alloc, bytes, ZSTD_cwksp_available_space(ws) - bytes);
    ZSTD_cwksp_assert_internal_consistency(ws);
    assert(alloc >= bottom);
    if (alloc < bottom) {
        DEBUGLOG(4, "cwksp: alloc failed!");
        ws->allocFailed = 1;
        return NULL;
    }
    /* table space overlapped by the new allocation is no longer valid */
    if (alloc < ws->tableValidEnd) {
        ws->tableValidEnd = alloc;
    }
    ws->allocStart = alloc;

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* Move alloc so there's ZSTD_CWKSP_ASAN_REDZONE_SIZE unused space on
     * either side. */
    alloc = (BYTE *)alloc + ZSTD_CWKSP_ASAN_REDZONE_SIZE;
    __asan_unpoison_memory_region(alloc, bytes);
#endif

    return alloc;
}
|
269 | ||
|
270 | /** | |
|
271 | * Reserves and returns unaligned memory. | |
|
272 | */ | |
|
273 | MEM_STATIC BYTE* ZSTD_cwksp_reserve_buffer(ZSTD_cwksp* ws, size_t bytes) { | |
|
274 | return (BYTE*)ZSTD_cwksp_reserve_internal(ws, bytes, ZSTD_cwksp_alloc_buffers); | |
|
275 | } | |
|
276 | ||
|
277 | /** | |
|
278 | * Reserves and returns memory sized on and aligned on sizeof(unsigned). | |
|
279 | */ | |
|
280 | MEM_STATIC void* ZSTD_cwksp_reserve_aligned(ZSTD_cwksp* ws, size_t bytes) { | |
|
281 | assert((bytes & (sizeof(U32)-1)) == 0); | |
|
282 | return ZSTD_cwksp_reserve_internal(ws, ZSTD_cwksp_align(bytes, sizeof(U32)), ZSTD_cwksp_alloc_aligned); | |
|
283 | } | |
|
284 | ||
|
/**
 * Aligned on sizeof(unsigned). These buffers have the special property that
 * their values remain constrained, allowing us to re-use them without
 * memset()-ing them.
 *
 * Tables grow upward from tableEnd; the reservation fails (NULL, allocFailed
 * set) when it would collide with the downward-growing allocation area.
 */
MEM_STATIC void* ZSTD_cwksp_reserve_table(ZSTD_cwksp* ws, size_t bytes) {
    const ZSTD_cwksp_alloc_phase_e phase = ZSTD_cwksp_alloc_aligned;
    void* alloc = ws->tableEnd;
    void* end = (BYTE *)alloc + bytes;
    void* top = ws->allocStart;

    DEBUGLOG(5, "cwksp: reserving %p table %zd bytes, %zd bytes remaining",
        alloc, bytes, ZSTD_cwksp_available_space(ws) - bytes);
    assert((bytes & (sizeof(U32)-1)) == 0);  /* caller must pass a U32 multiple */
    ZSTD_cwksp_internal_advance_phase(ws, phase);
    ZSTD_cwksp_assert_internal_consistency(ws);
    assert(end <= top);
    if (end > top) {
        DEBUGLOG(4, "cwksp: table alloc failed!");
        ws->allocFailed = 1;
        return NULL;
    }
    ws->tableEnd = end;

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* tables aren't redzoned; just make the reserved range addressable */
    __asan_unpoison_memory_region(alloc, bytes);
#endif

    return alloc;
}
|
315 | ||
|
/**
 * Aligned on sizeof(void*).
 *
 * Objects can only be reserved while still in the first (objects) phase;
 * once any buffer/aligned/table reservation has happened, this fails.
 */
MEM_STATIC void* ZSTD_cwksp_reserve_object(ZSTD_cwksp* ws, size_t bytes) {
    size_t roundedBytes = ZSTD_cwksp_align(bytes, sizeof(void*));
    void* alloc = ws->objectEnd;
    void* end = (BYTE*)alloc + roundedBytes;

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* over-reserve space */
    end = (BYTE *)end + 2 * ZSTD_CWKSP_ASAN_REDZONE_SIZE;
#endif

    DEBUGLOG(5,
        "cwksp: reserving %p object %zd bytes (rounded to %zd), %zd bytes remaining",
        alloc, bytes, roundedBytes, ZSTD_cwksp_available_space(ws) - roundedBytes);
    assert(((size_t)alloc & (sizeof(void*)-1)) == 0);
    assert((bytes & (sizeof(void*)-1)) == 0);
    ZSTD_cwksp_assert_internal_consistency(ws);
    /* we must be in the first phase, no advance is possible */
    if (ws->phase != ZSTD_cwksp_alloc_objects || end > ws->workspaceEnd) {
        DEBUGLOG(4, "cwksp: object alloc failed!");
        ws->allocFailed = 1;
        return NULL;
    }
    /* tables and buffers start where the objects end */
    ws->objectEnd = end;
    ws->tableEnd = end;
    ws->tableValidEnd = end;

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* Move alloc so there's ZSTD_CWKSP_ASAN_REDZONE_SIZE unused space on
     * either side. */
    alloc = (BYTE *)alloc + ZSTD_CWKSP_ASAN_REDZONE_SIZE;
    __asan_unpoison_memory_region(alloc, bytes);
#endif

    return alloc;
}
|
354 | ||
|
/**
 * Marks all table space as dirty: its contents can no longer be assumed
 * bounded, so it must be cleaned (zeroed) before the bound guarantee is
 * relied upon again.
 */
MEM_STATIC void ZSTD_cwksp_mark_tables_dirty(ZSTD_cwksp* ws) {
    DEBUGLOG(4, "cwksp: ZSTD_cwksp_mark_tables_dirty");

#if defined (MEMORY_SANITIZER) && !defined (ZSTD_MSAN_DONT_POISON_WORKSPACE)
    /* To validate that the table re-use logic is sound, and that we don't
     * access table space that we haven't cleaned, we re-"poison" the table
     * space every time we mark it dirty. */
    {
        size_t size = (BYTE*)ws->tableValidEnd - (BYTE*)ws->objectEnd;
        /* -1 from __msan_test_shadow means the whole range is readable */
        assert(__msan_test_shadow(ws->objectEnd, size) == -1);
        __msan_poison(ws->objectEnd, size);
    }
#endif

    assert(ws->tableValidEnd >= ws->objectEnd);
    assert(ws->tableValidEnd <= ws->allocStart);
    ws->tableValidEnd = ws->objectEnd;
    ZSTD_cwksp_assert_internal_consistency(ws);
}
|
374 | ||
|
/**
 * Marks all currently-reserved table space as clean: tableValidEnd is raised
 * to cover tableEnd, so subsequent users may rely on the bound guarantee
 * without re-zeroing.
 */
MEM_STATIC void ZSTD_cwksp_mark_tables_clean(ZSTD_cwksp* ws) {
    DEBUGLOG(4, "cwksp: ZSTD_cwksp_mark_tables_clean");
    assert(ws->tableValidEnd >= ws->objectEnd);
    assert(ws->tableValidEnd <= ws->allocStart);
    if (ws->tableValidEnd < ws->tableEnd) {
        ws->tableValidEnd = ws->tableEnd;
    }
    ZSTD_cwksp_assert_internal_consistency(ws);
}
|
384 | ||
|
385 | /** | |
|
386 | * Zero the part of the allocated tables not already marked clean. | |
|
387 | */ | |
|
388 | MEM_STATIC void ZSTD_cwksp_clean_tables(ZSTD_cwksp* ws) { | |
|
389 | DEBUGLOG(4, "cwksp: ZSTD_cwksp_clean_tables"); | |
|
390 | assert(ws->tableValidEnd >= ws->objectEnd); | |
|
391 | assert(ws->tableValidEnd <= ws->allocStart); | |
|
392 | if (ws->tableValidEnd < ws->tableEnd) { | |
|
393 | memset(ws->tableValidEnd, 0, (BYTE*)ws->tableEnd - (BYTE*)ws->tableValidEnd); | |
|
394 | } | |
|
395 | ZSTD_cwksp_mark_tables_clean(ws); | |
|
396 | } | |
|
397 | ||
|
/**
 * Invalidates table allocations.
 * All other allocations remain valid.
 */
MEM_STATIC void ZSTD_cwksp_clear_tables(ZSTD_cwksp* ws) {
    DEBUGLOG(4, "cwksp: clearing tables!");

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* make the freed table space trip ASAN if touched before re-reservation */
    {
        size_t size = (BYTE*)ws->tableValidEnd - (BYTE*)ws->objectEnd;
        __asan_poison_memory_region(ws->objectEnd, size);
    }
#endif

    /* tables region collapses back to the end of the objects */
    ws->tableEnd = ws->objectEnd;
    ZSTD_cwksp_assert_internal_consistency(ws);
}
|
415 | ||
|
/**
 * Invalidates all buffer, aligned, and table allocations.
 * Object allocations remain valid.
 */
MEM_STATIC void ZSTD_cwksp_clear(ZSTD_cwksp* ws) {
    DEBUGLOG(4, "cwksp: clearing!");

#if defined (MEMORY_SANITIZER) && !defined (ZSTD_MSAN_DONT_POISON_WORKSPACE)
    /* To validate that the context re-use logic is sound, and that we don't
     * access stuff that this compression hasn't initialized, we re-"poison"
     * the workspace (or at least the non-static, non-table parts of it)
     * every time we start a new compression. */
    {
        size_t size = (BYTE*)ws->workspaceEnd - (BYTE*)ws->tableValidEnd;
        __msan_poison(ws->tableValidEnd, size);
    }
#endif

#if defined (ADDRESS_SANITIZER) && !defined (ZSTD_ASAN_DONT_POISON_WORKSPACE)
    /* everything past the objects becomes off-limits until re-reserved */
    {
        size_t size = (BYTE*)ws->workspaceEnd - (BYTE*)ws->objectEnd;
        __asan_poison_memory_region(ws->objectEnd, size);
    }
#endif

    ws->tableEnd = ws->objectEnd;
    ws->allocStart = ws->workspaceEnd;
    ws->allocFailed = 0;
    /* fall back to the buffers phase, but never behind the objects phase */
    if (ws->phase > ZSTD_cwksp_alloc_buffers) {
        ws->phase = ZSTD_cwksp_alloc_buffers;
    }
    ZSTD_cwksp_assert_internal_consistency(ws);
}
|
449 | ||
|
/**
 * The provided workspace takes ownership of the buffer [start, start+size).
 * Any existing values in the workspace are ignored (the previously managed
 * buffer, if present, must be separately freed).
 */
MEM_STATIC void ZSTD_cwksp_init(ZSTD_cwksp* ws, void* start, size_t size) {
    DEBUGLOG(4, "cwksp: init'ing workspace with %zd bytes", size);
    assert(((size_t)start & (sizeof(void*)-1)) == 0); /* ensure correct alignment */
    ws->workspace = start;
    ws->workspaceEnd = (BYTE*)start + size;
    ws->objectEnd = ws->workspace;
    ws->tableValidEnd = ws->objectEnd;
    ws->phase = ZSTD_cwksp_alloc_objects;
    /* clear() initializes tableEnd, allocStart, and allocFailed */
    ZSTD_cwksp_clear(ws);
    ws->workspaceOversizedDuration = 0;
    ZSTD_cwksp_assert_internal_consistency(ws);
}
|
467 | ||
|
468 | MEM_STATIC size_t ZSTD_cwksp_create(ZSTD_cwksp* ws, size_t size, ZSTD_customMem customMem) { | |
|
469 | void* workspace = ZSTD_malloc(size, customMem); | |
|
470 | DEBUGLOG(4, "cwksp: creating new workspace with %zd bytes", size); | |
|
471 | RETURN_ERROR_IF(workspace == NULL, memory_allocation); | |
|
472 | ZSTD_cwksp_init(ws, workspace, size); | |
|
473 | return 0; | |
|
474 | } | |
|
475 | ||
|
476 | MEM_STATIC void ZSTD_cwksp_free(ZSTD_cwksp* ws, ZSTD_customMem customMem) { | |
|
477 | void *ptr = ws->workspace; | |
|
478 | DEBUGLOG(4, "cwksp: freeing workspace"); | |
|
479 | memset(ws, 0, sizeof(ZSTD_cwksp)); | |
|
480 | ZSTD_free(ptr, customMem); | |
|
481 | } | |
|
482 | ||
|
483 | /** | |
|
484 | * Moves the management of a workspace from one cwksp to another. The src cwksp | |
|
485 | * is left in an invalid state (src must be re-init()'ed before its used again). | |
|
486 | */ | |
|
487 | MEM_STATIC void ZSTD_cwksp_move(ZSTD_cwksp* dst, ZSTD_cwksp* src) { | |
|
488 | *dst = *src; | |
|
489 | memset(src, 0, sizeof(ZSTD_cwksp)); | |
|
490 | } | |
|
491 | ||
|
492 | MEM_STATIC size_t ZSTD_cwksp_sizeof(const ZSTD_cwksp* ws) { | |
|
493 | return (size_t)((BYTE*)ws->workspaceEnd - (BYTE*)ws->workspace); | |
|
494 | } | |
|
495 | ||
|
496 | MEM_STATIC int ZSTD_cwksp_reserve_failed(const ZSTD_cwksp* ws) { | |
|
497 | return ws->allocFailed; | |
|
498 | } | |
|
499 | ||
|
500 | /*-************************************* | |
|
501 | * Functions Checking Free Space | |
|
502 | ***************************************/ | |
|
503 | ||
|
504 | MEM_STATIC size_t ZSTD_cwksp_available_space(ZSTD_cwksp* ws) { | |
|
505 | return (size_t)((BYTE*)ws->allocStart - (BYTE*)ws->tableEnd); | |
|
506 | } | |
|
507 | ||
|
508 | MEM_STATIC int ZSTD_cwksp_check_available(ZSTD_cwksp* ws, size_t additionalNeededSpace) { | |
|
509 | return ZSTD_cwksp_available_space(ws) >= additionalNeededSpace; | |
|
510 | } | |
|
511 | ||
|
512 | MEM_STATIC int ZSTD_cwksp_check_too_large(ZSTD_cwksp* ws, size_t additionalNeededSpace) { | |
|
513 | return ZSTD_cwksp_check_available( | |
|
514 | ws, additionalNeededSpace * ZSTD_WORKSPACETOOLARGE_FACTOR); | |
|
515 | } | |
|
516 | ||
|
517 | MEM_STATIC int ZSTD_cwksp_check_wasteful(ZSTD_cwksp* ws, size_t additionalNeededSpace) { | |
|
518 | return ZSTD_cwksp_check_too_large(ws, additionalNeededSpace) | |
|
519 | && ws->workspaceOversizedDuration > ZSTD_WORKSPACETOOLARGE_MAXDURATION; | |
|
520 | } | |
|
521 | ||
|
522 | MEM_STATIC void ZSTD_cwksp_bump_oversized_duration( | |
|
523 | ZSTD_cwksp* ws, size_t additionalNeededSpace) { | |
|
524 | if (ZSTD_cwksp_check_too_large(ws, additionalNeededSpace)) { | |
|
525 | ws->workspaceOversizedDuration++; | |
|
526 | } else { | |
|
527 | ws->workspaceOversizedDuration = 0; | |
|
528 | } | |
|
529 | } | |
|
530 | ||
|
531 | #if defined (__cplusplus) | |
|
532 | } | |
|
533 | #endif | |
|
534 | ||
|
535 | #endif /* ZSTD_CWKSP_H */ |
@@ -52,6 +52,7 contrib/python-zstandard/zstd/compress/z | |||
|
52 | 52 | contrib/python-zstandard/zstd/compress/zstd_compress_literals.h |
|
53 | 53 | contrib/python-zstandard/zstd/compress/zstd_compress_sequences.c |
|
54 | 54 | contrib/python-zstandard/zstd/compress/zstd_compress_sequences.h |
|
55 | contrib/python-zstandard/zstd/compress/zstd_cwksp.h | |
|
55 | 56 | contrib/python-zstandard/zstd/compress/zstd_double_fast.c |
|
56 | 57 | contrib/python-zstandard/zstd/compress/zstd_double_fast.h |
|
57 | 58 | contrib/python-zstandard/zstd/compress/zstd_fast.c |
@@ -43,13 +43,18 Actions Blocking Release | |||
|
43 | 43 | * Support modifying compression parameters mid operation when supported by |
|
44 | 44 | zstd API. |
|
45 | 45 | * Expose ``ZSTD_CLEVEL_DEFAULT`` constant. |
|
46 | * Expose ``ZSTD_SRCSIZEHINT_{MIN,MAX}`` constants. | |
|
46 | 47 | * Support ``ZSTD_p_forceAttachDict`` compression parameter. |
|
47 |
* Support ``ZSTD_ |
|
|
48 | * Support ``ZSTD_dictForceLoad`` dictionary compression parameter. | |
|
49 | * Support ``ZSTD_c_targetCBlockSize`` compression parameter. | |
|
50 | * Support ``ZSTD_c_literalCompressionMode`` compression parameter. | |
|
51 | * Support ``ZSTD_c_srcSizeHint`` compression parameter. | |
|
48 | 52 | * Use ``ZSTD_CCtx_getParameter()``/``ZSTD_CCtxParam_getParameter()`` for retrieving |
|
49 | 53 | compression parameters. |
|
50 | 54 | * Consider exposing ``ZSTDMT_toFlushNow()``. |
|
51 | 55 | * Expose ``ZDICT_trainFromBuffer_fastCover()``, |
|
52 | 56 | ``ZDICT_optimizeTrainFromBuffer_fastCover``. |
|
57 | * Expose ``ZSTD_Sequence`` struct and related ``ZSTD_getSequences()`` API. | |
|
53 | 58 | * Expose and enforce ``ZSTD_minCLevel()`` for minimum compression level. |
|
54 | 59 | * Consider a ``chunker()`` API for decompression. |
|
55 | 60 | * Consider stats for ``chunker()`` API, including finding the last consumed |
@@ -67,6 +72,20 Other Actions Not Blocking Release | |||
|
67 | 72 | * API for ensuring max memory ceiling isn't exceeded. |
|
68 | 73 | * Move off nose for testing. |
|
69 | 74 | |
|
75 | 0.13.0 (released 2019-12-28) | |
|
76 | ============================ | |
|
77 | ||
|
78 | Changes | |
|
79 | ------- | |
|
80 | ||
|
81 | * ``pytest-xdist`` ``pytest`` extension is now installed so tests can be | |
|
82 | run in parallel. | |
|
83 | * CI now builds ``manylinux2010`` and ``manylinux2014`` binary wheels | |
|
84 | instead of a mix of ``manylinux2010`` and ``manylinux1``. | |
|
85 | * Official support for Python 3.8 has been added. | |
|
86 | * Bundled zstandard library upgraded from 1.4.3 to 1.4.4. | |
|
87 | * Python code has been reformatted with black. | |
|
88 | ||
|
70 | 89 | 0.12.0 (released 2019-09-15) |
|
71 | 90 | ============================ |
|
72 | 91 |
@@ -20,7 +20,7 https://github.com/indygreg/python-zstan | |||
|
20 | 20 | Requirements |
|
21 | 21 | ============ |
|
22 | 22 | |
|
23 |
This extension is designed to run with Python 2.7, 3. |
|
|
23 | This extension is designed to run with Python 2.7, 3.5, 3.6, 3.7, and 3.8 | |
|
24 | 24 | on common platforms (Linux, Windows, and OS X). On PyPy (both PyPy2 and PyPy3) we support version 6.0.0 and above. |
|
25 | 25 | x86 and x86_64 are well-tested on Windows. Only x86_64 is well-tested on Linux and macOS. |
|
26 | 26 |
@@ -16,7 +16,7 | |||
|
16 | 16 | #include <zdict.h> |
|
17 | 17 | |
|
18 | 18 | /* Remember to change the string in zstandard/__init__ as well */ |
|
19 |
#define PYTHON_ZSTANDARD_VERSION "0.1 |
|
|
19 | #define PYTHON_ZSTANDARD_VERSION "0.13.0" | |
|
20 | 20 | |
|
21 | 21 | typedef enum { |
|
22 | 22 | compressorobj_flush_finish, |
@@ -16,80 +16,82 import tempfile | |||
|
16 | 16 | |
|
17 | 17 | HERE = os.path.abspath(os.path.dirname(__file__)) |
|
18 | 18 | |
|
19 | SOURCES = ['zstd/%s' % p for p in ( | |
|
20 | 'common/debug.c', | |
|
21 | 'common/entropy_common.c', | |
|
22 | 'common/error_private.c', | |
|
23 | 'common/fse_decompress.c', | |
|
24 | 'common/pool.c', | |
|
25 | 'common/threading.c', | |
|
26 |
|
|
|
27 |
|
|
|
28 | 'compress/fse_compress.c', | |
|
29 | 'compress/hist.c', | |
|
30 |
|
|
|
31 |
|
|
|
32 |
|
|
|
33 |
|
|
|
34 |
|
|
|
35 |
|
|
|
36 |
|
|
|
37 |
|
|
|
38 |
|
|
|
39 |
|
|
|
40 | 'decompress/huf_decompress.c', | |
|
41 |
|
|
|
42 |
|
|
|
43 |
|
|
|
44 | 'dictBuilder/cover.c', | |
|
45 | 'dictBuilder/fastcover.c', | |
|
46 |
|
|
|
47 |
|
|
|
48 | )] | |
|
19 | SOURCES = [ | |
|
20 | "zstd/%s" % p | |
|
21 | for p in ( | |
|
22 | "common/debug.c", | |
|
23 | "common/entropy_common.c", | |
|
24 | "common/error_private.c", | |
|
25 | "common/fse_decompress.c", | |
|
26 | "common/pool.c", | |
|
27 | "common/threading.c", | |
|
28 | "common/xxhash.c", | |
|
29 | "common/zstd_common.c", | |
|
30 | "compress/fse_compress.c", | |
|
31 | "compress/hist.c", | |
|
32 | "compress/huf_compress.c", | |
|
33 | "compress/zstd_compress.c", | |
|
34 | "compress/zstd_compress_literals.c", | |
|
35 | "compress/zstd_compress_sequences.c", | |
|
36 | "compress/zstd_double_fast.c", | |
|
37 | "compress/zstd_fast.c", | |
|
38 | "compress/zstd_lazy.c", | |
|
39 | "compress/zstd_ldm.c", | |
|
40 | "compress/zstd_opt.c", | |
|
41 | "compress/zstdmt_compress.c", | |
|
42 | "decompress/huf_decompress.c", | |
|
43 | "decompress/zstd_ddict.c", | |
|
44 | "decompress/zstd_decompress.c", | |
|
45 | "decompress/zstd_decompress_block.c", | |
|
46 | "dictBuilder/cover.c", | |
|
47 | "dictBuilder/fastcover.c", | |
|
48 | "dictBuilder/divsufsort.c", | |
|
49 | "dictBuilder/zdict.c", | |
|
50 | ) | |
|
51 | ] | |
|
49 | 52 | |
|
50 | 53 | # Headers whose preprocessed output will be fed into cdef(). |
|
51 | HEADERS = [os.path.join(HERE, 'zstd', *p) for p in ( | |
|
52 | ('zstd.h',), | |
|
53 | ('dictBuilder', 'zdict.h'), | |
|
54 | )] | |
|
54 | HEADERS = [ | |
|
55 | os.path.join(HERE, "zstd", *p) for p in (("zstd.h",), ("dictBuilder", "zdict.h"),) | |
|
56 | ] | |
|
55 | 57 | |
|
56 | INCLUDE_DIRS = [os.path.join(HERE, d) for d in ( | |
|
57 | 'zstd', | |
|
58 | 'zstd/common', | |
|
59 | 'zstd/compress', | |
|
60 | 'zstd/decompress', | |
|
61 | 'zstd/dictBuilder', | |
|
62 | )] | |
|
58 | INCLUDE_DIRS = [ | |
|
59 | os.path.join(HERE, d) | |
|
60 | for d in ( | |
|
61 | "zstd", | |
|
62 | "zstd/common", | |
|
63 | "zstd/compress", | |
|
64 | "zstd/decompress", | |
|
65 | "zstd/dictBuilder", | |
|
66 | ) | |
|
67 | ] | |
|
63 | 68 | |
|
64 | 69 | # cffi can't parse some of the primitives in zstd.h. So we invoke the |
|
65 | 70 | # preprocessor and feed its output into cffi. |
|
66 | 71 | compiler = distutils.ccompiler.new_compiler() |
|
67 | 72 | |
|
68 | 73 | # Needed for MSVC. |
|
69 |
if hasattr(compiler, |
|
|
74 | if hasattr(compiler, "initialize"): | |
|
70 | 75 | compiler.initialize() |
|
71 | 76 | |
|
72 | 77 | # Distutils doesn't set compiler.preprocessor, so invoke the preprocessor |
|
73 | 78 | # manually. |
|
74 |
if compiler.compiler_type == |
|
|
75 |
args = list(compiler.executables[ |
|
|
76 |
args.extend( |
|
|
77 | '-E', | |
|
78 | '-DZSTD_STATIC_LINKING_ONLY', | |
|
79 | '-DZDICT_STATIC_LINKING_ONLY', | |
|
80 | ]) | |
|
81 | elif compiler.compiler_type == 'msvc': | |
|
79 | if compiler.compiler_type == "unix": | |
|
80 | args = list(compiler.executables["compiler"]) | |
|
81 | args.extend( | |
|
82 | ["-E", "-DZSTD_STATIC_LINKING_ONLY", "-DZDICT_STATIC_LINKING_ONLY",] | |
|
83 | ) | |
|
84 | elif compiler.compiler_type == "msvc": | |
|
82 | 85 | args = [compiler.cc] |
|
83 |
args.extend( |
|
|
84 | '/EP', | |
|
85 | '/DZSTD_STATIC_LINKING_ONLY', | |
|
86 | '/DZDICT_STATIC_LINKING_ONLY', | |
|
87 | ]) | |
|
86 | args.extend( | |
|
87 | ["/EP", "/DZSTD_STATIC_LINKING_ONLY", "/DZDICT_STATIC_LINKING_ONLY",] | |
|
88 | ) | |
|
88 | 89 | else: |
|
89 |
raise Exception( |
|
|
90 | raise Exception("unsupported compiler type: %s" % compiler.compiler_type) | |
|
91 | ||
|
90 | 92 | |
|
91 | 93 | def preprocess(path): |
|
92 |
with open(path, |
|
|
94 | with open(path, "rb") as fh: | |
|
93 | 95 | lines = [] |
|
94 | 96 | it = iter(fh) |
|
95 | 97 | |
@@ -104,32 +106,44 def preprocess(path): | |||
|
104 | 106 | # We define ZSTD_STATIC_LINKING_ONLY, which is redundant with the inline |
|
105 | 107 | # #define in zstdmt_compress.h and results in a compiler warning. So drop |
|
106 | 108 | # the inline #define. |
|
107 |
if l.startswith( |
|
|
108 | b'#include "zstd.h"', | |
|
109 | b'#define ZSTD_STATIC_LINKING_ONLY')): | |
|
109 | if l.startswith( | |
|
110 | ( | |
|
111 | b"#include <stddef.h>", | |
|
112 | b'#include "zstd.h"', | |
|
113 | b"#define ZSTD_STATIC_LINKING_ONLY", | |
|
114 | ) | |
|
115 | ): | |
|
110 | 116 | continue |
|
111 | 117 | |
|
118 | # The preprocessor environment on Windows doesn't define include | |
|
119 | # paths, so the #include of limits.h fails. We work around this | |
|
120 | # by removing that import and defining INT_MAX ourselves. This is | |
|
121 | # a bit hacky. But it gets the job done. | |
|
122 | # TODO make limits.h work on Windows so we ensure INT_MAX is | |
|
123 | # correct. | |
|
124 | if l.startswith(b"#include <limits.h>"): | |
|
125 | l = b"#define INT_MAX 2147483647\n" | |
|
126 | ||
|
112 | 127 | # ZSTDLIB_API may not be defined if we dropped zstd.h. It isn't |
|
113 | 128 | # important so just filter it out. |
|
114 |
if l.startswith(b |
|
|
115 |
l = l[len(b |
|
|
129 | if l.startswith(b"ZSTDLIB_API"): | |
|
130 | l = l[len(b"ZSTDLIB_API ") :] | |
|
116 | 131 | |
|
117 | 132 | lines.append(l) |
|
118 | 133 | |
|
119 |
fd, input_file = tempfile.mkstemp(suffix= |
|
|
120 |
os.write(fd, b |
|
|
134 | fd, input_file = tempfile.mkstemp(suffix=".h") | |
|
135 | os.write(fd, b"".join(lines)) | |
|
121 | 136 | os.close(fd) |
|
122 | 137 | |
|
123 | 138 | try: |
|
124 | 139 | env = dict(os.environ) |
|
125 |
if getattr(compiler, |
|
|
126 |
env[ |
|
|
127 | process = subprocess.Popen(args + [input_file], stdout=subprocess.PIPE, | |
|
128 | env=env) | |
|
140 | if getattr(compiler, "_paths", None): | |
|
141 | env["PATH"] = compiler._paths | |
|
142 | process = subprocess.Popen(args + [input_file], stdout=subprocess.PIPE, env=env) | |
|
129 | 143 | output = process.communicate()[0] |
|
130 | 144 | ret = process.poll() |
|
131 | 145 | if ret: |
|
132 |
raise Exception( |
|
|
146 | raise Exception("preprocessor exited with error") | |
|
133 | 147 | |
|
134 | 148 | return output |
|
135 | 149 | finally: |
@@ -141,16 +155,16 def normalize_output(output): | |||
|
141 | 155 | for line in output.splitlines(): |
|
142 | 156 | # CFFI's parser doesn't like __attribute__ on UNIX compilers. |
|
143 | 157 | if line.startswith(b'__attribute__ ((visibility ("default"))) '): |
|
144 | line = line[len(b'__attribute__ ((visibility ("default"))) '):] | |
|
158 | line = line[len(b'__attribute__ ((visibility ("default"))) ') :] | |
|
145 | 159 | |
|
146 |
if line.startswith(b |
|
|
160 | if line.startswith(b"__attribute__((deprecated("): | |
|
147 | 161 | continue |
|
148 |
elif b |
|
|
162 | elif b"__declspec(deprecated(" in line: | |
|
149 | 163 | continue |
|
150 | 164 | |
|
151 | 165 | lines.append(line) |
|
152 | 166 | |
|
153 |
return b |
|
|
167 | return b"\n".join(lines) | |
|
154 | 168 | |
|
155 | 169 | |
|
156 | 170 | ffi = cffi.FFI() |
@@ -159,18 +173,22 ffi = cffi.FFI() | |||
|
159 | 173 | # *_DISABLE_DEPRECATE_WARNINGS prevents the compiler from emitting a warning |
|
160 | 174 | # when cffi uses the function. Since we statically link against zstd, even |
|
161 | 175 | # if we use the deprecated functions it shouldn't be a huge problem. |
|
162 |
ffi.set_source( |
|
|
176 | ffi.set_source( | |
|
177 | "_zstd_cffi", | |
|
178 | """ | |
|
163 | 179 |
|
|
164 | 180 |
|
|
165 | 181 |
|
|
166 | 182 |
|
|
167 | 183 |
|
|
168 | 184 |
|
|
169 | ''', sources=SOURCES, | |
|
170 | include_dirs=INCLUDE_DIRS, | |
|
171 | extra_compile_args=['-DZSTD_MULTITHREAD']) | |
|
185 | """, | |
|
186 | sources=SOURCES, | |
|
187 | include_dirs=INCLUDE_DIRS, | |
|
188 | extra_compile_args=["-DZSTD_MULTITHREAD"], | |
|
189 | ) | |
|
172 | 190 | |
|
173 |
DEFINE = re.compile(b |
|
|
191 | DEFINE = re.compile(b"^\\#define ([a-zA-Z0-9_]+) ") | |
|
174 | 192 | |
|
175 | 193 | sources = [] |
|
176 | 194 | |
@@ -181,27 +199,27 for header in HEADERS: | |||
|
181 | 199 | |
|
182 | 200 | # #define's are effectively erased as part of going through preprocessor. |
|
183 | 201 | # So perform a manual pass to re-add those to the cdef source. |
|
184 |
with open(header, |
|
|
202 | with open(header, "rb") as fh: | |
|
185 | 203 | for line in fh: |
|
186 | 204 | line = line.strip() |
|
187 | 205 | m = DEFINE.match(line) |
|
188 | 206 | if not m: |
|
189 | 207 | continue |
|
190 | 208 | |
|
191 |
if m.group(1) == b |
|
|
209 | if m.group(1) == b"ZSTD_STATIC_LINKING_ONLY": | |
|
192 | 210 | continue |
|
193 | 211 | |
|
194 | 212 | # The parser doesn't like some constants with complex values. |
|
195 |
if m.group(1) in (b |
|
|
213 | if m.group(1) in (b"ZSTD_LIB_VERSION", b"ZSTD_VERSION_STRING"): | |
|
196 | 214 | continue |
|
197 | 215 | |
|
198 | 216 | # The ... is magic syntax by the cdef parser to resolve the |
|
199 | 217 | # value at compile time. |
|
200 |
sources.append(m.group(0) + b |
|
|
218 | sources.append(m.group(0) + b" ...") | |
|
201 | 219 | |
|
202 |
cdeflines = b |
|
|
220 | cdeflines = b"\n".join(sources).splitlines() | |
|
203 | 221 | cdeflines = [l for l in cdeflines if l.strip()] |
|
204 |
ffi.cdef(b |
|
|
222 | ffi.cdef(b"\n".join(cdeflines).decode("latin1")) | |
|
205 | 223 | |
|
206 |
if __name__ == |
|
|
224 | if __name__ == "__main__": | |
|
207 | 225 | ffi.compile() |
@@ -16,7 +16,7 from setuptools import setup | |||
|
16 | 16 | # (like memoryview). |
|
17 | 17 | # Need feature in 1.11 for ffi.gc() to declare size of objects so we avoid |
|
18 | 18 | # garbage collection pitfalls. |
|
19 |
MINIMUM_CFFI_VERSION = |
|
|
19 | MINIMUM_CFFI_VERSION = "1.11" | |
|
20 | 20 | |
|
21 | 21 | try: |
|
22 | 22 | import cffi |
@@ -26,9 +26,11 try: | |||
|
26 | 26 | # out the CFFI version here and reject CFFI if it is too old. |
|
27 | 27 | cffi_version = LooseVersion(cffi.__version__) |
|
28 | 28 | if cffi_version < LooseVersion(MINIMUM_CFFI_VERSION): |
|
29 | print('CFFI 1.11 or newer required (%s found); ' | |
|
30 | 'not building CFFI backend' % cffi_version, | |
|
31 | file=sys.stderr) | |
|
29 | print( | |
|
30 | "CFFI 1.11 or newer required (%s found); " | |
|
31 | "not building CFFI backend" % cffi_version, | |
|
32 | file=sys.stderr, | |
|
33 | ) | |
|
32 | 34 | cffi = None |
|
33 | 35 | |
|
34 | 36 | except ImportError: |
@@ -40,73 +42,77 SUPPORT_LEGACY = False | |||
|
40 | 42 | SYSTEM_ZSTD = False |
|
41 | 43 | WARNINGS_AS_ERRORS = False |
|
42 | 44 | |
|
43 |
if os.environ.get( |
|
|
45 | if os.environ.get("ZSTD_WARNINGS_AS_ERRORS", ""): | |
|
44 | 46 | WARNINGS_AS_ERRORS = True |
|
45 | 47 | |
|
46 |
if |
|
|
48 | if "--legacy" in sys.argv: | |
|
47 | 49 | SUPPORT_LEGACY = True |
|
48 |
sys.argv.remove( |
|
|
50 | sys.argv.remove("--legacy") | |
|
49 | 51 | |
|
50 |
if |
|
|
52 | if "--system-zstd" in sys.argv: | |
|
51 | 53 | SYSTEM_ZSTD = True |
|
52 |
sys.argv.remove( |
|
|
54 | sys.argv.remove("--system-zstd") | |
|
53 | 55 | |
|
54 |
if |
|
|
56 | if "--warnings-as-errors" in sys.argv: | |
|
55 | 57 | WARNINGS_AS_ERRORS = True |
|
56 |
sys.argv.remove( |
|
|
58 | sys.argv.remove("--warning-as-errors") | |
|
57 | 59 | |
|
58 | 60 | # Code for obtaining the Extension instance is in its own module to |
|
59 | 61 | # facilitate reuse in other projects. |
|
60 | 62 | extensions = [ |
|
61 |
setup_zstd.get_c_extension( |
|
|
62 | support_legacy=SUPPORT_LEGACY, | |
|
63 | system_zstd=SYSTEM_ZSTD, | |
|
64 | warnings_as_errors=WARNINGS_AS_ERRORS), | |
|
63 | setup_zstd.get_c_extension( | |
|
64 | name="zstd", | |
|
65 | support_legacy=SUPPORT_LEGACY, | |
|
66 | system_zstd=SYSTEM_ZSTD, | |
|
67 | warnings_as_errors=WARNINGS_AS_ERRORS, | |
|
68 | ), | |
|
65 | 69 | ] |
|
66 | 70 | |
|
67 | 71 | install_requires = [] |
|
68 | 72 | |
|
69 | 73 | if cffi: |
|
70 | 74 | import make_cffi |
|
75 | ||
|
71 | 76 | extensions.append(make_cffi.ffi.distutils_extension()) |
|
72 |
install_requires.append( |
|
|
77 | install_requires.append("cffi>=%s" % MINIMUM_CFFI_VERSION) | |
|
73 | 78 | |
|
74 | 79 | version = None |
|
75 | 80 | |
|
76 |
with open( |
|
|
81 | with open("c-ext/python-zstandard.h", "r") as fh: | |
|
77 | 82 | for line in fh: |
|
78 |
if not line.startswith( |
|
|
83 | if not line.startswith("#define PYTHON_ZSTANDARD_VERSION"): | |
|
79 | 84 | continue |
|
80 | 85 | |
|
81 | 86 | version = line.split()[2][1:-1] |
|
82 | 87 | break |
|
83 | 88 | |
|
84 | 89 | if not version: |
|
85 |
raise Exception( |
|
|
86 | 'this should never happen') | |
|
90 | raise Exception("could not resolve package version; " "this should never happen") | |
|
87 | 91 | |
|
88 | 92 | setup( |
|
89 |
name= |
|
|
93 | name="zstandard", | |
|
90 | 94 | version=version, |
|
91 |
description= |
|
|
92 |
long_description=open( |
|
|
93 |
url= |
|
|
94 |
author= |
|
|
95 |
author_email= |
|
|
96 |
license= |
|
|
95 | description="Zstandard bindings for Python", | |
|
96 | long_description=open("README.rst", "r").read(), | |
|
97 | url="https://github.com/indygreg/python-zstandard", | |
|
98 | author="Gregory Szorc", | |
|
99 | author_email="gregory.szorc@gmail.com", | |
|
100 | license="BSD", | |
|
97 | 101 | classifiers=[ |
|
98 |
|
|
|
99 |
|
|
|
100 |
|
|
|
101 |
|
|
|
102 |
|
|
|
103 |
|
|
|
104 |
|
|
|
105 |
|
|
|
102 | "Development Status :: 4 - Beta", | |
|
103 | "Intended Audience :: Developers", | |
|
104 | "License :: OSI Approved :: BSD License", | |
|
105 | "Programming Language :: C", | |
|
106 | "Programming Language :: Python :: 2.7", | |
|
107 | "Programming Language :: Python :: 3.5", | |
|
108 | "Programming Language :: Python :: 3.6", | |
|
109 | "Programming Language :: Python :: 3.7", | |
|
110 | "Programming Language :: Python :: 3.8", | |
|
106 | 111 | ], |
|
107 |
keywords= |
|
|
108 |
packages=[ |
|
|
112 | keywords="zstandard zstd compression", | |
|
113 | packages=["zstandard"], | |
|
109 | 114 | ext_modules=extensions, |
|
110 |
test_suite= |
|
|
115 | test_suite="tests", | |
|
111 | 116 | install_requires=install_requires, |
|
117 | tests_require=["hypothesis"], | |
|
112 | 118 | ) |
@@ -10,97 +10,110 import os | |||
|
10 | 10 | from distutils.extension import Extension |
|
11 | 11 | |
|
12 | 12 | |
|
13 | zstd_sources = ['zstd/%s' % p for p in ( | |
|
14 | 'common/debug.c', | |
|
15 | 'common/entropy_common.c', | |
|
16 | 'common/error_private.c', | |
|
17 | 'common/fse_decompress.c', | |
|
18 | 'common/pool.c', | |
|
19 | 'common/threading.c', | |
|
20 |
|
|
|
21 |
|
|
|
22 | 'compress/fse_compress.c', | |
|
23 | 'compress/hist.c', | |
|
24 |
|
|
|
25 | 'compress/zstd_compress_literals.c', | |
|
26 |
|
|
|
27 |
|
|
|
28 |
|
|
|
29 |
|
|
|
30 |
|
|
|
31 |
|
|
|
32 |
|
|
|
33 |
|
|
|
34 | 'decompress/huf_decompress.c', | |
|
35 |
|
|
|
36 |
|
|
|
37 |
|
|
|
38 | 'dictBuilder/cover.c', | |
|
39 | 'dictBuilder/divsufsort.c', | |
|
40 |
|
|
|
41 |
|
|
|
42 | )] | |
|
13 | zstd_sources = [ | |
|
14 | "zstd/%s" % p | |
|
15 | for p in ( | |
|
16 | "common/debug.c", | |
|
17 | "common/entropy_common.c", | |
|
18 | "common/error_private.c", | |
|
19 | "common/fse_decompress.c", | |
|
20 | "common/pool.c", | |
|
21 | "common/threading.c", | |
|
22 | "common/xxhash.c", | |
|
23 | "common/zstd_common.c", | |
|
24 | "compress/fse_compress.c", | |
|
25 | "compress/hist.c", | |
|
26 | "compress/huf_compress.c", | |
|
27 | "compress/zstd_compress_literals.c", | |
|
28 | "compress/zstd_compress_sequences.c", | |
|
29 | "compress/zstd_compress.c", | |
|
30 | "compress/zstd_double_fast.c", | |
|
31 | "compress/zstd_fast.c", | |
|
32 | "compress/zstd_lazy.c", | |
|
33 | "compress/zstd_ldm.c", | |
|
34 | "compress/zstd_opt.c", | |
|
35 | "compress/zstdmt_compress.c", | |
|
36 | "decompress/huf_decompress.c", | |
|
37 | "decompress/zstd_ddict.c", | |
|
38 | "decompress/zstd_decompress.c", | |
|
39 | "decompress/zstd_decompress_block.c", | |
|
40 | "dictBuilder/cover.c", | |
|
41 | "dictBuilder/divsufsort.c", | |
|
42 | "dictBuilder/fastcover.c", | |
|
43 | "dictBuilder/zdict.c", | |
|
44 | ) | |
|
45 | ] | |
|
43 | 46 | |
|
44 |
zstd_sources_legacy = [ |
|
|
45 | 'deprecated/zbuff_common.c', | |
|
46 | 'deprecated/zbuff_compress.c', | |
|
47 |
|
|
|
48 | 'legacy/zstd_v01.c', | |
|
49 | 'legacy/zstd_v02.c', | |
|
50 |
|
|
|
51 |
|
|
|
52 |
|
|
|
53 |
|
|
|
54 |
|
|
|
55 | )] | |
|
47 | zstd_sources_legacy = [ | |
|
48 | "zstd/%s" % p | |
|
49 | for p in ( | |
|
50 | "deprecated/zbuff_common.c", | |
|
51 | "deprecated/zbuff_compress.c", | |
|
52 | "deprecated/zbuff_decompress.c", | |
|
53 | "legacy/zstd_v01.c", | |
|
54 | "legacy/zstd_v02.c", | |
|
55 | "legacy/zstd_v03.c", | |
|
56 | "legacy/zstd_v04.c", | |
|
57 | "legacy/zstd_v05.c", | |
|
58 | "legacy/zstd_v06.c", | |
|
59 | "legacy/zstd_v07.c", | |
|
60 | ) | |
|
61 | ] | |
|
56 | 62 | |
|
57 | 63 | zstd_includes = [ |
|
58 |
|
|
|
59 |
|
|
|
60 |
|
|
|
61 |
|
|
|
62 |
|
|
|
64 | "zstd", | |
|
65 | "zstd/common", | |
|
66 | "zstd/compress", | |
|
67 | "zstd/decompress", | |
|
68 | "zstd/dictBuilder", | |
|
63 | 69 | ] |
|
64 | 70 | |
|
65 | 71 | zstd_includes_legacy = [ |
|
66 |
|
|
|
67 |
|
|
|
72 | "zstd/deprecated", | |
|
73 | "zstd/legacy", | |
|
68 | 74 | ] |
|
69 | 75 | |
|
70 | 76 | ext_includes = [ |
|
71 |
|
|
|
72 |
|
|
|
77 | "c-ext", | |
|
78 | "zstd/common", | |
|
73 | 79 | ] |
|
74 | 80 | |
|
75 | 81 | ext_sources = [ |
|
76 |
|
|
|
77 |
|
|
|
78 | 'zstd.c', | |
|
79 | 'c-ext/bufferutil.c', | |
|
80 | 'c-ext/compressiondict.c', | |
|
81 | 'c-ext/compressobj.c', | |
|
82 |
|
|
|
83 |
|
|
|
84 |
|
|
|
85 |
|
|
|
86 |
|
|
|
87 |
|
|
|
88 |
|
|
|
89 |
|
|
|
90 |
|
|
|
91 |
|
|
|
92 |
|
|
|
93 |
|
|
|
94 | 'c-ext/frameparams.c', | |
|
82 | "zstd/common/error_private.c", | |
|
83 | "zstd/common/pool.c", | |
|
84 | "zstd/common/threading.c", | |
|
85 | "zstd/common/zstd_common.c", | |
|
86 | "zstd.c", | |
|
87 | "c-ext/bufferutil.c", | |
|
88 | "c-ext/compressiondict.c", | |
|
89 | "c-ext/compressobj.c", | |
|
90 | "c-ext/compressor.c", | |
|
91 | "c-ext/compressoriterator.c", | |
|
92 | "c-ext/compressionchunker.c", | |
|
93 | "c-ext/compressionparams.c", | |
|
94 | "c-ext/compressionreader.c", | |
|
95 | "c-ext/compressionwriter.c", | |
|
96 | "c-ext/constants.c", | |
|
97 | "c-ext/decompressobj.c", | |
|
98 | "c-ext/decompressor.c", | |
|
99 | "c-ext/decompressoriterator.c", | |
|
100 | "c-ext/decompressionreader.c", | |
|
101 | "c-ext/decompressionwriter.c", | |
|
102 | "c-ext/frameparams.c", | |
|
95 | 103 | ] |
|
96 | 104 | |
|
97 | 105 | zstd_depends = [ |
|
98 |
|
|
|
106 | "c-ext/python-zstandard.h", | |
|
99 | 107 | ] |
|
100 | 108 | |
|
101 | 109 | |
|
102 | def get_c_extension(support_legacy=False, system_zstd=False, name='zstd', | |
|
103 | warnings_as_errors=False, root=None): | |
|
110 | def get_c_extension( | |
|
111 | support_legacy=False, | |
|
112 | system_zstd=False, | |
|
113 | name="zstd", | |
|
114 | warnings_as_errors=False, | |
|
115 | root=None, | |
|
116 | ): | |
|
104 | 117 | """Obtain a distutils.extension.Extension for the C extension. |
|
105 | 118 | |
|
106 | 119 | ``support_legacy`` controls whether to compile in legacy zstd format support. |
@@ -125,17 +138,16 def get_c_extension(support_legacy=False | |||
|
125 | 138 | if not system_zstd: |
|
126 | 139 | sources.update([os.path.join(actual_root, p) for p in zstd_sources]) |
|
127 | 140 | if support_legacy: |
|
128 | sources.update([os.path.join(actual_root, p) | |
|
129 | for p in zstd_sources_legacy]) | |
|
141 | sources.update([os.path.join(actual_root, p) for p in zstd_sources_legacy]) | |
|
130 | 142 | sources = list(sources) |
|
131 | 143 | |
|
132 | 144 | include_dirs = set([os.path.join(actual_root, d) for d in ext_includes]) |
|
133 | 145 | if not system_zstd: |
|
134 | include_dirs.update([os.path.join(actual_root, d) | |
|
135 | for d in zstd_includes]) | |
|
146 | include_dirs.update([os.path.join(actual_root, d) for d in zstd_includes]) | |
|
136 | 147 | if support_legacy: |
|
137 |
include_dirs.update( |
|
|
138 |
|
|
|
148 | include_dirs.update( | |
|
149 | [os.path.join(actual_root, d) for d in zstd_includes_legacy] | |
|
150 | ) | |
|
139 | 151 | include_dirs = list(include_dirs) |
|
140 | 152 | |
|
141 | 153 | depends = [os.path.join(actual_root, p) for p in zstd_depends] |
@@ -143,41 +155,40 def get_c_extension(support_legacy=False | |||
|
143 | 155 | compiler = distutils.ccompiler.new_compiler() |
|
144 | 156 | |
|
145 | 157 | # Needed for MSVC. |
|
146 |
if hasattr(compiler, |
|
|
158 | if hasattr(compiler, "initialize"): | |
|
147 | 159 | compiler.initialize() |
|
148 | 160 | |
|
149 |
if compiler.compiler_type == |
|
|
150 |
compiler_type = |
|
|
151 |
elif compiler.compiler_type == |
|
|
152 |
compiler_type = |
|
|
153 |
elif compiler.compiler_type == |
|
|
154 |
compiler_type = |
|
|
161 | if compiler.compiler_type == "unix": | |
|
162 | compiler_type = "unix" | |
|
163 | elif compiler.compiler_type == "msvc": | |
|
164 | compiler_type = "msvc" | |
|
165 | elif compiler.compiler_type == "mingw32": | |
|
166 | compiler_type = "mingw32" | |
|
155 | 167 | else: |
|
156 |
raise Exception( |
|
|
157 | compiler.compiler_type) | |
|
168 | raise Exception("unhandled compiler type: %s" % compiler.compiler_type) | |
|
158 | 169 | |
|
159 |
extra_args = [ |
|
|
170 | extra_args = ["-DZSTD_MULTITHREAD"] | |
|
160 | 171 | |
|
161 | 172 | if not system_zstd: |
|
162 |
extra_args.append( |
|
|
163 |
extra_args.append( |
|
|
164 |
extra_args.append( |
|
|
173 | extra_args.append("-DZSTDLIB_VISIBILITY=") | |
|
174 | extra_args.append("-DZDICTLIB_VISIBILITY=") | |
|
175 | extra_args.append("-DZSTDERRORLIB_VISIBILITY=") | |
|
165 | 176 | |
|
166 |
if compiler_type == |
|
|
167 |
extra_args.append( |
|
|
177 | if compiler_type == "unix": | |
|
178 | extra_args.append("-fvisibility=hidden") | |
|
168 | 179 | |
|
169 | 180 | if not system_zstd and support_legacy: |
|
170 |
extra_args.append( |
|
|
181 | extra_args.append("-DZSTD_LEGACY_SUPPORT=1") | |
|
171 | 182 | |
|
172 | 183 | if warnings_as_errors: |
|
173 |
if compiler_type in ( |
|
|
174 |
extra_args.append( |
|
|
175 |
elif compiler_type == |
|
|
176 |
extra_args.append( |
|
|
184 | if compiler_type in ("unix", "mingw32"): | |
|
185 | extra_args.append("-Werror") | |
|
186 | elif compiler_type == "msvc": | |
|
187 | extra_args.append("/WX") | |
|
177 | 188 | else: |
|
178 | 189 | assert False |
|
179 | 190 | |
|
180 |
libraries = [ |
|
|
191 | libraries = ["zstd"] if system_zstd else [] | |
|
181 | 192 | |
|
182 | 193 | # Python 3.7 doesn't like absolute paths. So normalize to relative. |
|
183 | 194 | sources = [os.path.relpath(p, root) for p in sources] |
@@ -185,8 +196,11 def get_c_extension(support_legacy=False | |||
|
185 | 196 | depends = [os.path.relpath(p, root) for p in depends] |
|
186 | 197 | |
|
187 | 198 | # TODO compile with optimizations. |
|
188 |
return Extension( |
|
|
189 | include_dirs=include_dirs, | |
|
190 | depends=depends, | |
|
191 | extra_compile_args=extra_args, | |
|
192 | libraries=libraries) | |
|
199 | return Extension( | |
|
200 | name, | |
|
201 | sources, | |
|
202 | include_dirs=include_dirs, | |
|
203 | depends=depends, | |
|
204 | extra_compile_args=extra_args, | |
|
205 | libraries=libraries, | |
|
206 | ) |
@@ -3,6 +3,7 import inspect | |||
|
3 | 3 | import io |
|
4 | 4 | import os |
|
5 | 5 | import types |
|
6 | import unittest | |
|
6 | 7 | |
|
7 | 8 | try: |
|
8 | 9 | import hypothesis |
@@ -10,39 +11,46 except ImportError: | |||
|
10 | 11 | hypothesis = None |
|
11 | 12 | |
|
12 | 13 | |
|
14 | class TestCase(unittest.TestCase): | |
|
15 | if not getattr(unittest.TestCase, "assertRaisesRegex", False): | |
|
16 | assertRaisesRegex = unittest.TestCase.assertRaisesRegexp | |
|
17 | ||
|
18 | ||
|
13 | 19 | def make_cffi(cls): |
|
14 | 20 | """Decorator to add CFFI versions of each test method.""" |
|
15 | 21 | |
|
16 | 22 | # The module containing this class definition should |
|
17 | 23 | # `import zstandard as zstd`. Otherwise things may blow up. |
|
18 | 24 | mod = inspect.getmodule(cls) |
|
19 |
if not hasattr(mod, |
|
|
25 | if not hasattr(mod, "zstd"): | |
|
20 | 26 | raise Exception('test module does not contain "zstd" symbol') |
|
21 | 27 | |
|
22 |
if not hasattr(mod.zstd, |
|
|
23 | raise Exception('zstd symbol does not have "backend" attribute; did ' | |
|
24 | 'you `import zstandard as zstd`?') | |
|
28 | if not hasattr(mod.zstd, "backend"): | |
|
29 | raise Exception( | |
|
30 | 'zstd symbol does not have "backend" attribute; did ' | |
|
31 | "you `import zstandard as zstd`?" | |
|
32 | ) | |
|
25 | 33 | |
|
26 | 34 | # If `import zstandard` already chose the cffi backend, there is nothing |
|
27 | 35 | # for us to do: we only add the cffi variation if the default backend |
|
28 | 36 | # is the C extension. |
|
29 |
if mod.zstd.backend == |
|
|
37 | if mod.zstd.backend == "cffi": | |
|
30 | 38 | return cls |
|
31 | 39 | |
|
32 | 40 | old_env = dict(os.environ) |
|
33 |
os.environ[ |
|
|
41 | os.environ["PYTHON_ZSTANDARD_IMPORT_POLICY"] = "cffi" | |
|
34 | 42 | try: |
|
35 | 43 | try: |
|
36 |
mod_info = imp.find_module( |
|
|
37 |
mod = imp.load_module( |
|
|
44 | mod_info = imp.find_module("zstandard") | |
|
45 | mod = imp.load_module("zstandard_cffi", *mod_info) | |
|
38 | 46 | except ImportError: |
|
39 | 47 | return cls |
|
40 | 48 | finally: |
|
41 | 49 | os.environ.clear() |
|
42 | 50 | os.environ.update(old_env) |
|
43 | 51 | |
|
44 |
if mod.backend != |
|
|
45 |
raise Exception( |
|
|
52 | if mod.backend != "cffi": | |
|
53 | raise Exception("got the zstandard %s backend instead of cffi" % mod.backend) | |
|
46 | 54 | |
|
47 | 55 | # If CFFI version is available, dynamically construct test methods |
|
48 | 56 | # that use it. |
@@ -52,27 +60,31 def make_cffi(cls): | |||
|
52 | 60 | if not inspect.ismethod(fn) and not inspect.isfunction(fn): |
|
53 | 61 | continue |
|
54 | 62 | |
|
55 |
if not fn.__name__.startswith( |
|
|
63 | if not fn.__name__.startswith("test_"): | |
|
56 | 64 | continue |
|
57 | 65 | |
|
58 |
name = |
|
|
66 | name = "%s_cffi" % fn.__name__ | |
|
59 | 67 | |
|
60 | 68 | # Replace the "zstd" symbol with the CFFI module instance. Then copy |
|
61 | 69 | # the function object and install it in a new attribute. |
|
62 | 70 | if isinstance(fn, types.FunctionType): |
|
63 | 71 | globs = dict(fn.__globals__) |
|
64 |
globs[ |
|
|
65 |
new_fn = types.FunctionType( |
|
|
66 |
|
|
|
72 | globs["zstd"] = mod | |
|
73 | new_fn = types.FunctionType( | |
|
74 | fn.__code__, globs, name, fn.__defaults__, fn.__closure__ | |
|
75 | ) | |
|
67 | 76 | new_method = new_fn |
|
68 | 77 | else: |
|
69 | 78 | globs = dict(fn.__func__.func_globals) |
|
70 |
globs[ |
|
|
71 |
new_fn = types.FunctionType( |
|
|
72 |
|
|
|
73 | fn.__func__.func_closure) | |
|
74 | new_method = types.UnboundMethodType(new_fn, fn.im_self, | |
|
75 | fn.im_class) | |
|
79 | globs["zstd"] = mod | |
|
80 | new_fn = types.FunctionType( | |
|
81 | fn.__func__.func_code, | |
|
82 | globs, | |
|
83 | name, | |
|
84 | fn.__func__.func_defaults, | |
|
85 | fn.__func__.func_closure, | |
|
86 | ) | |
|
87 | new_method = types.UnboundMethodType(new_fn, fn.im_self, fn.im_class) | |
|
76 | 88 | |
|
77 | 89 | setattr(cls, name, new_method) |
|
78 | 90 | |
@@ -84,6 +96,7 class NonClosingBytesIO(io.BytesIO): | |||
|
84 | 96 | |
|
85 | 97 | This allows us to access written data after close(). |
|
86 | 98 | """ |
|
99 | ||
|
87 | 100 | def __init__(self, *args, **kwargs): |
|
88 | 101 | super(NonClosingBytesIO, self).__init__(*args, **kwargs) |
|
89 | 102 | self._saved_buffer = None |
@@ -135,7 +148,7 def random_input_data(): | |||
|
135 | 148 | dirs[:] = list(sorted(dirs)) |
|
136 | 149 | for f in sorted(files): |
|
137 | 150 | try: |
|
138 |
with open(os.path.join(root, f), |
|
|
151 | with open(os.path.join(root, f), "rb") as fh: | |
|
139 | 152 | data = fh.read() |
|
140 | 153 | if data: |
|
141 | 154 | _source_files.append(data) |
@@ -154,11 +167,11 def random_input_data(): | |||
|
154 | 167 | |
|
155 | 168 | def generate_samples(): |
|
156 | 169 | inputs = [ |
|
157 |
b |
|
|
158 |
b |
|
|
159 |
b |
|
|
160 |
b |
|
|
161 |
b |
|
|
170 | b"foo", | |
|
171 | b"bar", | |
|
172 | b"abcdef", | |
|
173 | b"sometext", | |
|
174 | b"baz", | |
|
162 | 175 | ] |
|
163 | 176 | |
|
164 | 177 | samples = [] |
@@ -173,13 +186,12 def generate_samples(): | |||
|
173 | 186 | |
|
174 | 187 | if hypothesis: |
|
175 | 188 | default_settings = hypothesis.settings(deadline=10000) |
|
176 |
hypothesis.settings.register_profile( |
|
|
189 | hypothesis.settings.register_profile("default", default_settings) | |
|
177 | 190 | |
|
178 | 191 | ci_settings = hypothesis.settings(deadline=20000, max_examples=1000) |
|
179 |
hypothesis.settings.register_profile( |
|
|
192 | hypothesis.settings.register_profile("ci", ci_settings) | |
|
180 | 193 | |
|
181 | 194 | expensive_settings = hypothesis.settings(deadline=None, max_examples=10000) |
|
182 |
hypothesis.settings.register_profile( |
|
|
195 | hypothesis.settings.register_profile("expensive", expensive_settings) | |
|
183 | 196 | |
|
184 | hypothesis.settings.load_profile( | |
|
185 | os.environ.get('HYPOTHESIS_PROFILE', 'default')) | |
|
197 | hypothesis.settings.load_profile(os.environ.get("HYPOTHESIS_PROFILE", "default")) |
@@ -3,104 +3,114 import unittest | |||
|
3 | 3 | |
|
4 | 4 | import zstandard as zstd |
|
5 | 5 | |
|
6 | ss = struct.Struct('=QQ') | |
|
6 | from .common import TestCase | |
|
7 | ||
|
8 | ss = struct.Struct("=QQ") | |
|
7 | 9 | |
|
8 | 10 | |
|
9 |
class TestBufferWithSegments( |
|
|
11 | class TestBufferWithSegments(TestCase): | |
|
10 | 12 | def test_arguments(self): |
|
11 |
if not hasattr(zstd, |
|
|
12 |
self.skipTest( |
|
|
13 | if not hasattr(zstd, "BufferWithSegments"): | |
|
14 | self.skipTest("BufferWithSegments not available") | |
|
13 | 15 | |
|
14 | 16 | with self.assertRaises(TypeError): |
|
15 | 17 | zstd.BufferWithSegments() |
|
16 | 18 | |
|
17 | 19 | with self.assertRaises(TypeError): |
|
18 |
zstd.BufferWithSegments(b |
|
|
20 | zstd.BufferWithSegments(b"foo") | |
|
19 | 21 | |
|
20 | 22 | # Segments data should be a multiple of 16. |
|
21 | with self.assertRaisesRegexp(ValueError, 'segments array size is not a multiple of 16'): | |
|
22 | zstd.BufferWithSegments(b'foo', b'\x00\x00') | |
|
23 | with self.assertRaisesRegex( | |
|
24 | ValueError, "segments array size is not a multiple of 16" | |
|
25 | ): | |
|
26 | zstd.BufferWithSegments(b"foo", b"\x00\x00") | |
|
23 | 27 | |
|
24 | 28 | def test_invalid_offset(self): |
|
25 |
if not hasattr(zstd, |
|
|
26 |
self.skipTest( |
|
|
29 | if not hasattr(zstd, "BufferWithSegments"): | |
|
30 | self.skipTest("BufferWithSegments not available") | |
|
27 | 31 | |
|
28 | with self.assertRaisesRegexp(ValueError, 'offset within segments array references memory'): | |
|
29 | zstd.BufferWithSegments(b'foo', ss.pack(0, 4)) | |
|
32 | with self.assertRaisesRegex( | |
|
33 | ValueError, "offset within segments array references memory" | |
|
34 | ): | |
|
35 | zstd.BufferWithSegments(b"foo", ss.pack(0, 4)) | |
|
30 | 36 | |
|
31 | 37 | def test_invalid_getitem(self): |
|
32 |
if not hasattr(zstd, |
|
|
33 |
self.skipTest( |
|
|
38 | if not hasattr(zstd, "BufferWithSegments"): | |
|
39 | self.skipTest("BufferWithSegments not available") | |
|
34 | 40 | |
|
35 |
b = zstd.BufferWithSegments(b |
|
|
41 | b = zstd.BufferWithSegments(b"foo", ss.pack(0, 3)) | |
|
36 | 42 | |
|
37 |
with self.assertRaisesRegex |
|
|
43 | with self.assertRaisesRegex(IndexError, "offset must be non-negative"): | |
|
38 | 44 | test = b[-10] |
|
39 | 45 | |
|
40 |
with self.assertRaisesRegex |
|
|
46 | with self.assertRaisesRegex(IndexError, "offset must be less than 1"): | |
|
41 | 47 | test = b[1] |
|
42 | 48 | |
|
43 |
with self.assertRaisesRegex |
|
|
49 | with self.assertRaisesRegex(IndexError, "offset must be less than 1"): | |
|
44 | 50 | test = b[2] |
|
45 | 51 | |
|
46 | 52 | def test_single(self): |
|
47 |
if not hasattr(zstd, |
|
|
48 |
self.skipTest( |
|
|
53 | if not hasattr(zstd, "BufferWithSegments"): | |
|
54 | self.skipTest("BufferWithSegments not available") | |
|
49 | 55 | |
|
50 |
b = zstd.BufferWithSegments(b |
|
|
56 | b = zstd.BufferWithSegments(b"foo", ss.pack(0, 3)) | |
|
51 | 57 | self.assertEqual(len(b), 1) |
|
52 | 58 | self.assertEqual(b.size, 3) |
|
53 |
self.assertEqual(b.tobytes(), b |
|
|
59 | self.assertEqual(b.tobytes(), b"foo") | |
|
54 | 60 | |
|
55 | 61 | self.assertEqual(len(b[0]), 3) |
|
56 | 62 | self.assertEqual(b[0].offset, 0) |
|
57 |
self.assertEqual(b[0].tobytes(), b |
|
|
63 | self.assertEqual(b[0].tobytes(), b"foo") | |
|
58 | 64 | |
|
59 | 65 | def test_multiple(self): |
|
60 |
if not hasattr(zstd, |
|
|
61 |
self.skipTest( |
|
|
66 | if not hasattr(zstd, "BufferWithSegments"): | |
|
67 | self.skipTest("BufferWithSegments not available") | |
|
62 | 68 | |
|
63 |
b = zstd.BufferWithSegments( |
|
|
64 | ss.pack(3, 4), | |
|
65 | ss.pack(7, 5)])) | |
|
69 | b = zstd.BufferWithSegments( | |
|
70 | b"foofooxfooxy", b"".join([ss.pack(0, 3), ss.pack(3, 4), ss.pack(7, 5)]) | |
|
71 | ) | |
|
66 | 72 | self.assertEqual(len(b), 3) |
|
67 | 73 | self.assertEqual(b.size, 12) |
|
68 |
self.assertEqual(b.tobytes(), b |
|
|
74 | self.assertEqual(b.tobytes(), b"foofooxfooxy") | |
|
69 | 75 | |
|
70 |
self.assertEqual(b[0].tobytes(), b |
|
|
71 |
self.assertEqual(b[1].tobytes(), b |
|
|
72 |
self.assertEqual(b[2].tobytes(), b |
|
|
76 | self.assertEqual(b[0].tobytes(), b"foo") | |
|
77 | self.assertEqual(b[1].tobytes(), b"foox") | |
|
78 | self.assertEqual(b[2].tobytes(), b"fooxy") | |
|
73 | 79 | |
|
74 | 80 | |
|
75 |
class TestBufferWithSegmentsCollection( |
|
|
81 | class TestBufferWithSegmentsCollection(TestCase): | |
|
76 | 82 | def test_empty_constructor(self): |
|
77 |
if not hasattr(zstd, |
|
|
78 |
self.skipTest( |
|
|
83 | if not hasattr(zstd, "BufferWithSegmentsCollection"): | |
|
84 | self.skipTest("BufferWithSegmentsCollection not available") | |
|
79 | 85 | |
|
80 |
with self.assertRaisesRegex |
|
|
86 | with self.assertRaisesRegex(ValueError, "must pass at least 1 argument"): | |
|
81 | 87 | zstd.BufferWithSegmentsCollection() |
|
82 | 88 | |
|
83 | 89 | def test_argument_validation(self): |
|
84 |
if not hasattr(zstd, |
|
|
85 |
self.skipTest( |
|
|
90 | if not hasattr(zstd, "BufferWithSegmentsCollection"): | |
|
91 | self.skipTest("BufferWithSegmentsCollection not available") | |
|
86 | 92 | |
|
87 |
with self.assertRaisesRegex |
|
|
93 | with self.assertRaisesRegex(TypeError, "arguments must be BufferWithSegments"): | |
|
88 | 94 | zstd.BufferWithSegmentsCollection(None) |
|
89 | 95 | |
|
90 |
with self.assertRaisesRegex |
|
|
91 |
zstd.BufferWithSegmentsCollection( |
|
|
92 | None) | |
|
96 | with self.assertRaisesRegex(TypeError, "arguments must be BufferWithSegments"): | |
|
97 | zstd.BufferWithSegmentsCollection( | |
|
98 | zstd.BufferWithSegments(b"foo", ss.pack(0, 3)), None | |
|
99 | ) | |
|
93 | 100 | |
|
94 | with self.assertRaisesRegexp(ValueError, 'ZstdBufferWithSegments cannot be empty'): | |
|
95 | zstd.BufferWithSegmentsCollection(zstd.BufferWithSegments(b'', b'')) | |
|
101 | with self.assertRaisesRegex( | |
|
102 | ValueError, "ZstdBufferWithSegments cannot be empty" | |
|
103 | ): | |
|
104 | zstd.BufferWithSegmentsCollection(zstd.BufferWithSegments(b"", b"")) | |
|
96 | 105 | |
|
97 | 106 | def test_length(self): |
|
98 |
if not hasattr(zstd, |
|
|
99 |
self.skipTest( |
|
|
107 | if not hasattr(zstd, "BufferWithSegmentsCollection"): | |
|
108 | self.skipTest("BufferWithSegmentsCollection not available") | |
|
100 | 109 | |
|
101 |
b1 = zstd.BufferWithSegments(b |
|
|
102 |
b2 = zstd.BufferWithSegments( |
|
|
103 | ss.pack(3, 3)])) | |
|
110 | b1 = zstd.BufferWithSegments(b"foo", ss.pack(0, 3)) | |
|
111 | b2 = zstd.BufferWithSegments( | |
|
112 | b"barbaz", b"".join([ss.pack(0, 3), ss.pack(3, 3)]) | |
|
113 | ) | |
|
104 | 114 | |
|
105 | 115 | c = zstd.BufferWithSegmentsCollection(b1) |
|
106 | 116 | self.assertEqual(len(c), 1) |
@@ -115,21 +125,22 class TestBufferWithSegmentsCollection(u | |||
|
115 | 125 | self.assertEqual(c.size(), 9) |
|
116 | 126 | |
|
117 | 127 | def test_getitem(self): |
|
118 |
if not hasattr(zstd, |
|
|
119 |
self.skipTest( |
|
|
128 | if not hasattr(zstd, "BufferWithSegmentsCollection"): | |
|
129 | self.skipTest("BufferWithSegmentsCollection not available") | |
|
120 | 130 | |
|
121 |
b1 = zstd.BufferWithSegments(b |
|
|
122 |
b2 = zstd.BufferWithSegments( |
|
|
123 | ss.pack(3, 3)])) | |
|
131 | b1 = zstd.BufferWithSegments(b"foo", ss.pack(0, 3)) | |
|
132 | b2 = zstd.BufferWithSegments( | |
|
133 | b"barbaz", b"".join([ss.pack(0, 3), ss.pack(3, 3)]) | |
|
134 | ) | |
|
124 | 135 | |
|
125 | 136 | c = zstd.BufferWithSegmentsCollection(b1, b2) |
|
126 | 137 | |
|
127 |
with self.assertRaisesRegex |
|
|
138 | with self.assertRaisesRegex(IndexError, "offset must be less than 3"): | |
|
128 | 139 | c[3] |
|
129 | 140 | |
|
130 |
with self.assertRaisesRegex |
|
|
141 | with self.assertRaisesRegex(IndexError, "offset must be less than 3"): | |
|
131 | 142 | c[4] |
|
132 | 143 | |
|
133 |
self.assertEqual(c[0].tobytes(), b |
|
|
134 |
self.assertEqual(c[1].tobytes(), b |
|
|
135 |
self.assertEqual(c[2].tobytes(), b |
|
|
144 | self.assertEqual(c[0].tobytes(), b"foo") | |
|
145 | self.assertEqual(c[1].tobytes(), b"bar") | |
|
146 | self.assertEqual(c[2].tobytes(), b"baz") |
This diff has been collapsed as it changes many lines, (811 lines changed) Show them Hide them | |||
@@ -13,6 +13,7 from .common import ( | |||
|
13 | 13 | make_cffi, |
|
14 | 14 | NonClosingBytesIO, |
|
15 | 15 | OpCountingBytesIO, |
|
16 | TestCase, | |
|
16 | 17 | ) |
|
17 | 18 | |
|
18 | 19 | |
@@ -23,14 +24,13 else: | |||
|
23 | 24 | |
|
24 | 25 | |
|
25 | 26 | def multithreaded_chunk_size(level, source_size=0): |
|
26 | params = zstd.ZstdCompressionParameters.from_level(level, | |
|
27 | source_size=source_size) | |
|
27 | params = zstd.ZstdCompressionParameters.from_level(level, source_size=source_size) | |
|
28 | 28 | |
|
29 | 29 | return 1 << (params.window_log + 2) |
|
30 | 30 | |
|
31 | 31 | |
|
32 | 32 | @make_cffi |
|
33 |
class TestCompressor( |
|
|
33 | class TestCompressor(TestCase): | |
|
34 | 34 | def test_level_bounds(self): |
|
35 | 35 | with self.assertRaises(ValueError): |
|
36 | 36 | zstd.ZstdCompressor(level=23) |
@@ -41,11 +41,11 class TestCompressor(unittest.TestCase): | |||
|
41 | 41 | |
|
42 | 42 | |
|
43 | 43 | @make_cffi |
|
44 |
class TestCompressor_compress( |
|
|
44 | class TestCompressor_compress(TestCase): | |
|
45 | 45 | def test_compress_empty(self): |
|
46 | 46 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
47 |
result = cctx.compress(b |
|
|
48 |
self.assertEqual(result, b |
|
|
47 | result = cctx.compress(b"") | |
|
48 | self.assertEqual(result, b"\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00") | |
|
49 | 49 | params = zstd.get_frame_parameters(result) |
|
50 | 50 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
51 | 51 | self.assertEqual(params.window_size, 524288) |
@@ -53,21 +53,21 class TestCompressor_compress(unittest.T | |||
|
53 | 53 | self.assertFalse(params.has_checksum, 0) |
|
54 | 54 | |
|
55 | 55 | cctx = zstd.ZstdCompressor() |
|
56 |
result = cctx.compress(b |
|
|
57 |
self.assertEqual(result, b |
|
|
56 | result = cctx.compress(b"") | |
|
57 | self.assertEqual(result, b"\x28\xb5\x2f\xfd\x20\x00\x01\x00\x00") | |
|
58 | 58 | params = zstd.get_frame_parameters(result) |
|
59 | 59 | self.assertEqual(params.content_size, 0) |
|
60 | 60 | |
|
61 | 61 | def test_input_types(self): |
|
62 | 62 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
63 |
expected = b |
|
|
63 | expected = b"\x28\xb5\x2f\xfd\x00\x00\x19\x00\x00\x66\x6f\x6f" | |
|
64 | 64 | |
|
65 | 65 | mutable_array = bytearray(3) |
|
66 |
mutable_array[:] = b |
|
|
66 | mutable_array[:] = b"foo" | |
|
67 | 67 | |
|
68 | 68 | sources = [ |
|
69 |
memoryview(b |
|
|
70 |
bytearray(b |
|
|
69 | memoryview(b"foo"), | |
|
70 | bytearray(b"foo"), | |
|
71 | 71 | mutable_array, |
|
72 | 72 | ] |
|
73 | 73 | |
@@ -77,43 +77,46 class TestCompressor_compress(unittest.T | |||
|
77 | 77 | def test_compress_large(self): |
|
78 | 78 | chunks = [] |
|
79 | 79 | for i in range(255): |
|
80 |
chunks.append(struct.Struct( |
|
|
80 | chunks.append(struct.Struct(">B").pack(i) * 16384) | |
|
81 | 81 | |
|
82 | 82 | cctx = zstd.ZstdCompressor(level=3, write_content_size=False) |
|
83 |
result = cctx.compress(b |
|
|
83 | result = cctx.compress(b"".join(chunks)) | |
|
84 | 84 | self.assertEqual(len(result), 999) |
|
85 |
self.assertEqual(result[0:4], b |
|
|
85 | self.assertEqual(result[0:4], b"\x28\xb5\x2f\xfd") | |
|
86 | 86 | |
|
87 | 87 | # This matches the test for read_to_iter() below. |
|
88 | 88 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
89 |
result = cctx.compress(b |
|
|
90 | self.assertEqual(result, b'\x28\xb5\x2f\xfd\x00\x40\x54\x00\x00' | |
|
91 | b'\x10\x66\x66\x01\x00\xfb\xff\x39\xc0' | |
|
92 | b'\x02\x09\x00\x00\x6f') | |
|
89 | result = cctx.compress(b"f" * zstd.COMPRESSION_RECOMMENDED_INPUT_SIZE + b"o") | |
|
90 | self.assertEqual( | |
|
91 | result, | |
|
92 | b"\x28\xb5\x2f\xfd\x00\x40\x54\x00\x00" | |
|
93 | b"\x10\x66\x66\x01\x00\xfb\xff\x39\xc0" | |
|
94 | b"\x02\x09\x00\x00\x6f", | |
|
95 | ) | |
|
93 | 96 | |
|
94 | 97 | def test_negative_level(self): |
|
95 | 98 | cctx = zstd.ZstdCompressor(level=-4) |
|
96 |
result = cctx.compress(b |
|
|
99 | result = cctx.compress(b"foo" * 256) | |
|
97 | 100 | |
|
98 | 101 | def test_no_magic(self): |
|
99 | params = zstd.ZstdCompressionParameters.from_level( | |
|
100 | 1, format=zstd.FORMAT_ZSTD1) | |
|
102 | params = zstd.ZstdCompressionParameters.from_level(1, format=zstd.FORMAT_ZSTD1) | |
|
101 | 103 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
102 |
magic = cctx.compress(b |
|
|
104 | magic = cctx.compress(b"foobar") | |
|
103 | 105 | |
|
104 | 106 | params = zstd.ZstdCompressionParameters.from_level( |
|
105 |
1, format=zstd.FORMAT_ZSTD1_MAGICLESS |
|
|
107 | 1, format=zstd.FORMAT_ZSTD1_MAGICLESS | |
|
108 | ) | |
|
106 | 109 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
107 |
no_magic = cctx.compress(b |
|
|
110 | no_magic = cctx.compress(b"foobar") | |
|
108 | 111 | |
|
109 |
self.assertEqual(magic[0:4], b |
|
|
112 | self.assertEqual(magic[0:4], b"\x28\xb5\x2f\xfd") | |
|
110 | 113 | self.assertEqual(magic[4:], no_magic) |
|
111 | 114 | |
|
112 | 115 | def test_write_checksum(self): |
|
113 | 116 | cctx = zstd.ZstdCompressor(level=1) |
|
114 |
no_checksum = cctx.compress(b |
|
|
117 | no_checksum = cctx.compress(b"foobar") | |
|
115 | 118 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
116 |
with_checksum = cctx.compress(b |
|
|
119 | with_checksum = cctx.compress(b"foobar") | |
|
117 | 120 | |
|
118 | 121 | self.assertEqual(len(with_checksum), len(no_checksum) + 4) |
|
119 | 122 | |
@@ -125,9 +128,9 class TestCompressor_compress(unittest.T | |||
|
125 | 128 | |
|
126 | 129 | def test_write_content_size(self): |
|
127 | 130 | cctx = zstd.ZstdCompressor(level=1) |
|
128 |
with_size = cctx.compress(b |
|
|
131 | with_size = cctx.compress(b"foobar" * 256) | |
|
129 | 132 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
130 |
no_size = cctx.compress(b |
|
|
133 | no_size = cctx.compress(b"foobar" * 256) | |
|
131 | 134 | |
|
132 | 135 | self.assertEqual(len(with_size), len(no_size) + 1) |
|
133 | 136 | |
@@ -139,17 +142,17 class TestCompressor_compress(unittest.T | |||
|
139 | 142 | def test_no_dict_id(self): |
|
140 | 143 | samples = [] |
|
141 | 144 | for i in range(128): |
|
142 |
samples.append(b |
|
|
143 |
samples.append(b |
|
|
144 |
samples.append(b |
|
|
145 | samples.append(b"foo" * 64) | |
|
146 | samples.append(b"bar" * 64) | |
|
147 | samples.append(b"foobar" * 64) | |
|
145 | 148 | |
|
146 | 149 | d = zstd.train_dictionary(1024, samples) |
|
147 | 150 | |
|
148 | 151 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
149 |
with_dict_id = cctx.compress(b |
|
|
152 | with_dict_id = cctx.compress(b"foobarfoobar") | |
|
150 | 153 | |
|
151 | 154 | cctx = zstd.ZstdCompressor(level=1, dict_data=d, write_dict_id=False) |
|
152 |
no_dict_id = cctx.compress(b |
|
|
155 | no_dict_id = cctx.compress(b"foobarfoobar") | |
|
153 | 156 | |
|
154 | 157 | self.assertEqual(len(with_dict_id), len(no_dict_id) + 4) |
|
155 | 158 | |
@@ -161,23 +164,23 class TestCompressor_compress(unittest.T | |||
|
161 | 164 | def test_compress_dict_multiple(self): |
|
162 | 165 | samples = [] |
|
163 | 166 | for i in range(128): |
|
164 |
samples.append(b |
|
|
165 |
samples.append(b |
|
|
166 |
samples.append(b |
|
|
167 | samples.append(b"foo" * 64) | |
|
168 | samples.append(b"bar" * 64) | |
|
169 | samples.append(b"foobar" * 64) | |
|
167 | 170 | |
|
168 | 171 | d = zstd.train_dictionary(8192, samples) |
|
169 | 172 | |
|
170 | 173 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
171 | 174 | |
|
172 | 175 | for i in range(32): |
|
173 |
cctx.compress(b |
|
|
176 | cctx.compress(b"foo bar foobar foo bar foobar") | |
|
174 | 177 | |
|
175 | 178 | def test_dict_precompute(self): |
|
176 | 179 | samples = [] |
|
177 | 180 | for i in range(128): |
|
178 |
samples.append(b |
|
|
179 |
samples.append(b |
|
|
180 |
samples.append(b |
|
|
181 | samples.append(b"foo" * 64) | |
|
182 | samples.append(b"bar" * 64) | |
|
183 | samples.append(b"foobar" * 64) | |
|
181 | 184 | |
|
182 | 185 | d = zstd.train_dictionary(8192, samples) |
|
183 | 186 | d.precompute_compress(level=1) |
@@ -185,11 +188,11 class TestCompressor_compress(unittest.T | |||
|
185 | 188 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
186 | 189 | |
|
187 | 190 | for i in range(32): |
|
188 |
cctx.compress(b |
|
|
191 | cctx.compress(b"foo bar foobar foo bar foobar") | |
|
189 | 192 | |
|
190 | 193 | def test_multithreaded(self): |
|
191 | 194 | chunk_size = multithreaded_chunk_size(1) |
|
192 |
source = b |
|
|
195 | source = b"".join([b"x" * chunk_size, b"y" * chunk_size]) | |
|
193 | 196 | |
|
194 | 197 | cctx = zstd.ZstdCompressor(level=1, threads=2) |
|
195 | 198 | compressed = cctx.compress(source) |
@@ -205,73 +208,72 class TestCompressor_compress(unittest.T | |||
|
205 | 208 | def test_multithreaded_dict(self): |
|
206 | 209 | samples = [] |
|
207 | 210 | for i in range(128): |
|
208 |
samples.append(b |
|
|
209 |
samples.append(b |
|
|
210 |
samples.append(b |
|
|
211 | samples.append(b"foo" * 64) | |
|
212 | samples.append(b"bar" * 64) | |
|
213 | samples.append(b"foobar" * 64) | |
|
211 | 214 | |
|
212 | 215 | d = zstd.train_dictionary(1024, samples) |
|
213 | 216 | |
|
214 | 217 | cctx = zstd.ZstdCompressor(dict_data=d, threads=2) |
|
215 | 218 | |
|
216 |
result = cctx.compress(b |
|
|
217 |
params = zstd.get_frame_parameters(result) |
|
|
218 |
self.assertEqual(params.content_size, 3) |
|
|
219 | result = cctx.compress(b"foo") | |
|
220 | params = zstd.get_frame_parameters(result) | |
|
221 | self.assertEqual(params.content_size, 3) | |
|
219 | 222 | self.assertEqual(params.dict_id, d.dict_id()) |
|
220 | 223 | |
|
221 |
self.assertEqual( |
|
|
222 | b'\x28\xb5\x2f\xfd\x23\x8f\x55\x0f\x70\x03\x19\x00\x00' | |
|
223 | b'\x66\x6f\x6f') | |
|
224 | self.assertEqual( | |
|
225 | result, | |
|
226 | b"\x28\xb5\x2f\xfd\x23\x8f\x55\x0f\x70\x03\x19\x00\x00" b"\x66\x6f\x6f", | |
|
227 | ) | |
|
224 | 228 | |
|
225 | 229 | def test_multithreaded_compression_params(self): |
|
226 | 230 | params = zstd.ZstdCompressionParameters.from_level(0, threads=2) |
|
227 | 231 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
228 | 232 | |
|
229 |
result = cctx.compress(b |
|
|
230 |
params = zstd.get_frame_parameters(result) |
|
|
231 |
self.assertEqual(params.content_size, 3) |
|
|
233 | result = cctx.compress(b"foo") | |
|
234 | params = zstd.get_frame_parameters(result) | |
|
235 | self.assertEqual(params.content_size, 3) | |
|
232 | 236 | |
|
233 | self.assertEqual(result, | |
|
234 | b'\x28\xb5\x2f\xfd\x20\x03\x19\x00\x00\x66\x6f\x6f') | |
|
237 | self.assertEqual(result, b"\x28\xb5\x2f\xfd\x20\x03\x19\x00\x00\x66\x6f\x6f") | |
|
235 | 238 | |
|
236 | 239 | |
|
237 | 240 | @make_cffi |
|
238 |
class TestCompressor_compressobj( |
|
|
241 | class TestCompressor_compressobj(TestCase): | |
|
239 | 242 | def test_compressobj_empty(self): |
|
240 | 243 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
241 | 244 | cobj = cctx.compressobj() |
|
242 |
self.assertEqual(cobj.compress(b |
|
|
243 | self.assertEqual(cobj.flush(), | |
|
244 | b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') | |
|
245 | self.assertEqual(cobj.compress(b""), b"") | |
|
246 | self.assertEqual(cobj.flush(), b"\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00") | |
|
245 | 247 | |
|
246 | 248 | def test_input_types(self): |
|
247 |
expected = b |
|
|
249 | expected = b"\x28\xb5\x2f\xfd\x00\x48\x19\x00\x00\x66\x6f\x6f" | |
|
248 | 250 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
249 | 251 | |
|
250 | 252 | mutable_array = bytearray(3) |
|
251 |
mutable_array[:] = b |
|
|
253 | mutable_array[:] = b"foo" | |
|
252 | 254 | |
|
253 | 255 | sources = [ |
|
254 |
memoryview(b |
|
|
255 |
bytearray(b |
|
|
256 | memoryview(b"foo"), | |
|
257 | bytearray(b"foo"), | |
|
256 | 258 | mutable_array, |
|
257 | 259 | ] |
|
258 | 260 | |
|
259 | 261 | for source in sources: |
|
260 | 262 | cobj = cctx.compressobj() |
|
261 |
self.assertEqual(cobj.compress(source), b |
|
|
263 | self.assertEqual(cobj.compress(source), b"") | |
|
262 | 264 | self.assertEqual(cobj.flush(), expected) |
|
263 | 265 | |
|
264 | 266 | def test_compressobj_large(self): |
|
265 | 267 | chunks = [] |
|
266 | 268 | for i in range(255): |
|
267 |
chunks.append(struct.Struct( |
|
|
269 | chunks.append(struct.Struct(">B").pack(i) * 16384) | |
|
268 | 270 | |
|
269 | 271 | cctx = zstd.ZstdCompressor(level=3) |
|
270 | 272 | cobj = cctx.compressobj() |
|
271 | 273 | |
|
272 |
result = cobj.compress(b |
|
|
274 | result = cobj.compress(b"".join(chunks)) + cobj.flush() | |
|
273 | 275 | self.assertEqual(len(result), 999) |
|
274 |
self.assertEqual(result[0:4], b |
|
|
276 | self.assertEqual(result[0:4], b"\x28\xb5\x2f\xfd") | |
|
275 | 277 | |
|
276 | 278 | params = zstd.get_frame_parameters(result) |
|
277 | 279 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
@@ -282,10 +284,10 class TestCompressor_compressobj(unittes | |||
|
282 | 284 | def test_write_checksum(self): |
|
283 | 285 | cctx = zstd.ZstdCompressor(level=1) |
|
284 | 286 | cobj = cctx.compressobj() |
|
285 |
no_checksum = cobj.compress(b |
|
|
287 | no_checksum = cobj.compress(b"foobar") + cobj.flush() | |
|
286 | 288 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
287 | 289 | cobj = cctx.compressobj() |
|
288 |
with_checksum = cobj.compress(b |
|
|
290 | with_checksum = cobj.compress(b"foobar") + cobj.flush() | |
|
289 | 291 | |
|
290 | 292 | no_params = zstd.get_frame_parameters(no_checksum) |
|
291 | 293 | with_params = zstd.get_frame_parameters(with_checksum) |
@@ -300,11 +302,11 class TestCompressor_compressobj(unittes | |||
|
300 | 302 | |
|
301 | 303 | def test_write_content_size(self): |
|
302 | 304 | cctx = zstd.ZstdCompressor(level=1) |
|
303 |
cobj = cctx.compressobj(size=len(b |
|
|
304 |
with_size = cobj.compress(b |
|
|
305 | cobj = cctx.compressobj(size=len(b"foobar" * 256)) | |
|
306 | with_size = cobj.compress(b"foobar" * 256) + cobj.flush() | |
|
305 | 307 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
306 |
cobj = cctx.compressobj(size=len(b |
|
|
307 |
no_size = cobj.compress(b |
|
|
308 | cobj = cctx.compressobj(size=len(b"foobar" * 256)) | |
|
309 | no_size = cobj.compress(b"foobar" * 256) + cobj.flush() | |
|
308 | 310 | |
|
309 | 311 | no_params = zstd.get_frame_parameters(no_size) |
|
310 | 312 | with_params = zstd.get_frame_parameters(with_size) |
@@ -321,48 +323,53 class TestCompressor_compressobj(unittes | |||
|
321 | 323 | cctx = zstd.ZstdCompressor() |
|
322 | 324 | cobj = cctx.compressobj() |
|
323 | 325 | |
|
324 |
cobj.compress(b |
|
|
326 | cobj.compress(b"foo") | |
|
325 | 327 | cobj.flush() |
|
326 | 328 | |
|
327 | with self.assertRaisesRegexp(zstd.ZstdError, r'cannot call compress\(\) after compressor'): | |
|
328 | cobj.compress(b'foo') | |
|
329 | with self.assertRaisesRegex( | |
|
330 | zstd.ZstdError, r"cannot call compress\(\) after compressor" | |
|
331 | ): | |
|
332 | cobj.compress(b"foo") | |
|
329 | 333 | |
|
330 | with self.assertRaisesRegexp(zstd.ZstdError, 'compressor object already finished'): | |
|
334 | with self.assertRaisesRegex( | |
|
335 | zstd.ZstdError, "compressor object already finished" | |
|
336 | ): | |
|
331 | 337 | cobj.flush() |
|
332 | 338 | |
|
333 | 339 | def test_flush_block_repeated(self): |
|
334 | 340 | cctx = zstd.ZstdCompressor(level=1) |
|
335 | 341 | cobj = cctx.compressobj() |
|
336 | 342 | |
|
337 |
self.assertEqual(cobj.compress(b |
|
|
338 | self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), | |
|
339 | b'\x28\xb5\x2f\xfd\x00\x48\x18\x00\x00foo') | |
|
340 | self.assertEqual(cobj.compress(b'bar'), b'') | |
|
343 | self.assertEqual(cobj.compress(b"foo"), b"") | |
|
344 | self.assertEqual( | |
|
345 | cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), | |
|
346 | b"\x28\xb5\x2f\xfd\x00\x48\x18\x00\x00foo", | |
|
347 | ) | |
|
348 | self.assertEqual(cobj.compress(b"bar"), b"") | |
|
341 | 349 | # 3 byte header plus content. |
|
342 | self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), | |
|
343 | b'\x18\x00\x00bar') | |
|
344 | self.assertEqual(cobj.flush(), b'\x01\x00\x00') | |
|
350 | self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), b"\x18\x00\x00bar") | |
|
351 | self.assertEqual(cobj.flush(), b"\x01\x00\x00") | |
|
345 | 352 | |
|
346 | 353 | def test_flush_empty_block(self): |
|
347 | 354 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
348 | 355 | cobj = cctx.compressobj() |
|
349 | 356 | |
|
350 |
cobj.compress(b |
|
|
357 | cobj.compress(b"foobar") | |
|
351 | 358 | cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK) |
|
352 | 359 | # No-op if no block is active (this is internal to zstd). |
|
353 |
self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), b |
|
|
360 | self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), b"") | |
|
354 | 361 | |
|
355 | 362 | trailing = cobj.flush() |
|
356 | 363 | # 3 bytes block header + 4 bytes frame checksum |
|
357 | 364 | self.assertEqual(len(trailing), 7) |
|
358 | 365 | header = trailing[0:3] |
|
359 |
self.assertEqual(header, b |
|
|
366 | self.assertEqual(header, b"\x01\x00\x00") | |
|
360 | 367 | |
|
361 | 368 | def test_multithreaded(self): |
|
362 | 369 | source = io.BytesIO() |
|
363 |
source.write(b |
|
|
364 |
source.write(b |
|
|
365 |
source.write(b |
|
|
370 | source.write(b"a" * 1048576) | |
|
371 | source.write(b"b" * 1048576) | |
|
372 | source.write(b"c" * 1048576) | |
|
366 | 373 | source.seek(0) |
|
367 | 374 | |
|
368 | 375 | cctx = zstd.ZstdCompressor(level=1, threads=2) |
@@ -378,9 +385,9 class TestCompressor_compressobj(unittes | |||
|
378 | 385 | |
|
379 | 386 | chunks.append(cobj.flush()) |
|
380 | 387 | |
|
381 |
compressed = b |
|
|
388 | compressed = b"".join(chunks) | |
|
382 | 389 | |
|
383 |
self.assertEqual(len(compressed), |
|
|
390 | self.assertEqual(len(compressed), 119) | |
|
384 | 391 | |
|
385 | 392 | def test_frame_progression(self): |
|
386 | 393 | cctx = zstd.ZstdCompressor() |
@@ -389,7 +396,7 class TestCompressor_compressobj(unittes | |||
|
389 | 396 | |
|
390 | 397 | cobj = cctx.compressobj() |
|
391 | 398 | |
|
392 |
cobj.compress(b |
|
|
399 | cobj.compress(b"foobar") | |
|
393 | 400 | self.assertEqual(cctx.frame_progression(), (6, 0, 0)) |
|
394 | 401 | |
|
395 | 402 | cobj.flush() |
@@ -399,20 +406,20 class TestCompressor_compressobj(unittes | |||
|
399 | 406 | cctx = zstd.ZstdCompressor() |
|
400 | 407 | |
|
401 | 408 | cobj = cctx.compressobj(size=2) |
|
402 |
with self.assertRaisesRegex |
|
|
403 |
cobj.compress(b |
|
|
409 | with self.assertRaisesRegex(zstd.ZstdError, "Src size is incorrect"): | |
|
410 | cobj.compress(b"foo") | |
|
404 | 411 | |
|
405 | 412 | # Try another operation on this instance. |
|
406 |
with self.assertRaisesRegex |
|
|
407 |
cobj.compress(b |
|
|
413 | with self.assertRaisesRegex(zstd.ZstdError, "Src size is incorrect"): | |
|
414 | cobj.compress(b"aa") | |
|
408 | 415 | |
|
409 | 416 | # Try another operation on the compressor. |
|
410 | 417 | cctx.compressobj(size=4) |
|
411 |
cctx.compress(b |
|
|
418 | cctx.compress(b"foobar") | |
|
412 | 419 | |
|
413 | 420 | |
|
414 | 421 | @make_cffi |
|
415 |
class TestCompressor_copy_stream( |
|
|
422 | class TestCompressor_copy_stream(TestCase): | |
|
416 | 423 | def test_no_read(self): |
|
417 | 424 | source = object() |
|
418 | 425 | dest = io.BytesIO() |
@@ -438,13 +445,12 class TestCompressor_copy_stream(unittes | |||
|
438 | 445 | self.assertEqual(int(r), 0) |
|
439 | 446 | self.assertEqual(w, 9) |
|
440 | 447 | |
|
441 | self.assertEqual(dest.getvalue(), | |
|
442 | b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') | |
|
448 | self.assertEqual(dest.getvalue(), b"\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00") | |
|
443 | 449 | |
|
444 | 450 | def test_large_data(self): |
|
445 | 451 | source = io.BytesIO() |
|
446 | 452 | for i in range(255): |
|
447 |
source.write(struct.Struct( |
|
|
453 | source.write(struct.Struct(">B").pack(i) * 16384) | |
|
448 | 454 | source.seek(0) |
|
449 | 455 | |
|
450 | 456 | dest = io.BytesIO() |
@@ -461,7 +467,7 class TestCompressor_copy_stream(unittes | |||
|
461 | 467 | self.assertFalse(params.has_checksum) |
|
462 | 468 | |
|
463 | 469 | def test_write_checksum(self): |
|
464 |
source = io.BytesIO(b |
|
|
470 | source = io.BytesIO(b"foobar") | |
|
465 | 471 | no_checksum = io.BytesIO() |
|
466 | 472 | |
|
467 | 473 | cctx = zstd.ZstdCompressor(level=1) |
@@ -472,8 +478,7 class TestCompressor_copy_stream(unittes | |||
|
472 | 478 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
473 | 479 | cctx.copy_stream(source, with_checksum) |
|
474 | 480 | |
|
475 | self.assertEqual(len(with_checksum.getvalue()), | |
|
476 | len(no_checksum.getvalue()) + 4) | |
|
481 | self.assertEqual(len(with_checksum.getvalue()), len(no_checksum.getvalue()) + 4) | |
|
477 | 482 | |
|
478 | 483 | no_params = zstd.get_frame_parameters(no_checksum.getvalue()) |
|
479 | 484 | with_params = zstd.get_frame_parameters(with_checksum.getvalue()) |
@@ -485,7 +490,7 class TestCompressor_copy_stream(unittes | |||
|
485 | 490 | self.assertTrue(with_params.has_checksum) |
|
486 | 491 | |
|
487 | 492 | def test_write_content_size(self): |
|
488 |
source = io.BytesIO(b |
|
|
493 | source = io.BytesIO(b"foobar" * 256) | |
|
489 | 494 | no_size = io.BytesIO() |
|
490 | 495 | |
|
491 | 496 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
@@ -497,16 +502,14 class TestCompressor_copy_stream(unittes | |||
|
497 | 502 | cctx.copy_stream(source, with_size) |
|
498 | 503 | |
|
499 | 504 | # Source content size is unknown, so no content size written. |
|
500 | self.assertEqual(len(with_size.getvalue()), | |
|
501 | len(no_size.getvalue())) | |
|
505 | self.assertEqual(len(with_size.getvalue()), len(no_size.getvalue())) | |
|
502 | 506 | |
|
503 | 507 | source.seek(0) |
|
504 | 508 | with_size = io.BytesIO() |
|
505 | 509 | cctx.copy_stream(source, with_size, size=len(source.getvalue())) |
|
506 | 510 | |
|
507 | 511 | # We specified source size, so content size header is present. |
|
508 | self.assertEqual(len(with_size.getvalue()), | |
|
509 | len(no_size.getvalue()) + 1) | |
|
512 | self.assertEqual(len(with_size.getvalue()), len(no_size.getvalue()) + 1) | |
|
510 | 513 | |
|
511 | 514 | no_params = zstd.get_frame_parameters(no_size.getvalue()) |
|
512 | 515 | with_params = zstd.get_frame_parameters(with_size.getvalue()) |
@@ -518,7 +521,7 class TestCompressor_copy_stream(unittes | |||
|
518 | 521 | self.assertFalse(with_params.has_checksum) |
|
519 | 522 | |
|
520 | 523 | def test_read_write_size(self): |
|
521 |
source = OpCountingBytesIO(b |
|
|
524 | source = OpCountingBytesIO(b"foobarfoobar") | |
|
522 | 525 | dest = OpCountingBytesIO() |
|
523 | 526 | cctx = zstd.ZstdCompressor() |
|
524 | 527 | r, w = cctx.copy_stream(source, dest, read_size=1, write_size=1) |
@@ -530,16 +533,16 class TestCompressor_copy_stream(unittes | |||
|
530 | 533 | |
|
531 | 534 | def test_multithreaded(self): |
|
532 | 535 | source = io.BytesIO() |
|
533 |
source.write(b |
|
|
534 |
source.write(b |
|
|
535 |
source.write(b |
|
|
536 | source.write(b"a" * 1048576) | |
|
537 | source.write(b"b" * 1048576) | |
|
538 | source.write(b"c" * 1048576) | |
|
536 | 539 | source.seek(0) |
|
537 | 540 | |
|
538 | 541 | dest = io.BytesIO() |
|
539 | 542 | cctx = zstd.ZstdCompressor(threads=2, write_content_size=False) |
|
540 | 543 | r, w = cctx.copy_stream(source, dest) |
|
541 | 544 | self.assertEqual(r, 3145728) |
|
542 |
self.assertEqual(w, |
|
|
545 | self.assertEqual(w, 111) | |
|
543 | 546 | |
|
544 | 547 | params = zstd.get_frame_parameters(dest.getvalue()) |
|
545 | 548 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
@@ -559,15 +562,15 class TestCompressor_copy_stream(unittes | |||
|
559 | 562 | |
|
560 | 563 | def test_bad_size(self): |
|
561 | 564 | source = io.BytesIO() |
|
562 |
source.write(b |
|
|
563 |
source.write(b |
|
|
565 | source.write(b"a" * 32768) | |
|
566 | source.write(b"b" * 32768) | |
|
564 | 567 | source.seek(0) |
|
565 | 568 | |
|
566 | 569 | dest = io.BytesIO() |
|
567 | 570 | |
|
568 | 571 | cctx = zstd.ZstdCompressor() |
|
569 | 572 | |
|
570 |
with self.assertRaisesRegex |
|
|
573 | with self.assertRaisesRegex(zstd.ZstdError, "Src size is incorrect"): | |
|
571 | 574 | cctx.copy_stream(source, dest, size=42) |
|
572 | 575 | |
|
573 | 576 | # Try another operation on this compressor. |
@@ -577,31 +580,31 class TestCompressor_copy_stream(unittes | |||
|
577 | 580 | |
|
578 | 581 | |
|
579 | 582 | @make_cffi |
|
580 |
class TestCompressor_stream_reader( |
|
|
583 | class TestCompressor_stream_reader(TestCase): | |
|
581 | 584 | def test_context_manager(self): |
|
582 | 585 | cctx = zstd.ZstdCompressor() |
|
583 | 586 | |
|
584 |
with cctx.stream_reader(b |
|
|
585 |
with self.assertRaisesRegex |
|
|
587 | with cctx.stream_reader(b"foo") as reader: | |
|
588 | with self.assertRaisesRegex(ValueError, "cannot __enter__ multiple times"): | |
|
586 | 589 | with reader as reader2: |
|
587 | 590 | pass |
|
588 | 591 | |
|
589 | 592 | def test_no_context_manager(self): |
|
590 | 593 | cctx = zstd.ZstdCompressor() |
|
591 | 594 | |
|
592 |
reader = cctx.stream_reader(b |
|
|
595 | reader = cctx.stream_reader(b"foo") | |
|
593 | 596 | reader.read(4) |
|
594 | 597 | self.assertFalse(reader.closed) |
|
595 | 598 | |
|
596 | 599 | reader.close() |
|
597 | 600 | self.assertTrue(reader.closed) |
|
598 |
with self.assertRaisesRegex |
|
|
601 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
599 | 602 | reader.read(1) |
|
600 | 603 | |
|
601 | 604 | def test_not_implemented(self): |
|
602 | 605 | cctx = zstd.ZstdCompressor() |
|
603 | 606 | |
|
604 |
with cctx.stream_reader(b |
|
|
607 | with cctx.stream_reader(b"foo" * 60) as reader: | |
|
605 | 608 | with self.assertRaises(io.UnsupportedOperation): |
|
606 | 609 | reader.readline() |
|
607 | 610 | |
@@ -618,12 +621,12 class TestCompressor_stream_reader(unitt | |||
|
618 | 621 | reader.writelines([]) |
|
619 | 622 | |
|
620 | 623 | with self.assertRaises(OSError): |
|
621 |
reader.write(b |
|
|
624 | reader.write(b"foo") | |
|
622 | 625 | |
|
623 | 626 | def test_constant_methods(self): |
|
624 | 627 | cctx = zstd.ZstdCompressor() |
|
625 | 628 | |
|
626 |
with cctx.stream_reader(b |
|
|
629 | with cctx.stream_reader(b"boo") as reader: | |
|
627 | 630 | self.assertTrue(reader.readable()) |
|
628 | 631 | self.assertFalse(reader.writable()) |
|
629 | 632 | self.assertFalse(reader.seekable()) |
@@ -637,27 +640,29 class TestCompressor_stream_reader(unitt | |||
|
637 | 640 | def test_read_closed(self): |
|
638 | 641 | cctx = zstd.ZstdCompressor() |
|
639 | 642 | |
|
640 |
with cctx.stream_reader(b |
|
|
643 | with cctx.stream_reader(b"foo" * 60) as reader: | |
|
641 | 644 | reader.close() |
|
642 | 645 | self.assertTrue(reader.closed) |
|
643 |
with self.assertRaisesRegex |
|
|
646 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
644 | 647 | reader.read(10) |
|
645 | 648 | |
|
646 | 649 | def test_read_sizes(self): |
|
647 | 650 | cctx = zstd.ZstdCompressor() |
|
648 |
foo = cctx.compress(b |
|
|
651 | foo = cctx.compress(b"foo") | |
|
649 | 652 | |
|
650 |
with cctx.stream_reader(b |
|
|
651 | with self.assertRaisesRegexp(ValueError, 'cannot read negative amounts less than -1'): | |
|
653 | with cctx.stream_reader(b"foo") as reader: | |
|
654 | with self.assertRaisesRegex( | |
|
655 | ValueError, "cannot read negative amounts less than -1" | |
|
656 | ): | |
|
652 | 657 | reader.read(-2) |
|
653 | 658 | |
|
654 |
self.assertEqual(reader.read(0), b |
|
|
659 | self.assertEqual(reader.read(0), b"") | |
|
655 | 660 | self.assertEqual(reader.read(), foo) |
|
656 | 661 | |
|
657 | 662 | def test_read_buffer(self): |
|
658 | 663 | cctx = zstd.ZstdCompressor() |
|
659 | 664 | |
|
660 |
source = b |
|
|
665 | source = b"".join([b"foo" * 60, b"bar" * 60, b"baz" * 60]) | |
|
661 | 666 | frame = cctx.compress(source) |
|
662 | 667 | |
|
663 | 668 | with cctx.stream_reader(source) as reader: |
@@ -667,13 +672,13 class TestCompressor_stream_reader(unitt | |||
|
667 | 672 | result = reader.read(8192) |
|
668 | 673 | self.assertEqual(result, frame) |
|
669 | 674 | self.assertEqual(reader.tell(), len(result)) |
|
670 |
self.assertEqual(reader.read(), b |
|
|
675 | self.assertEqual(reader.read(), b"") | |
|
671 | 676 | self.assertEqual(reader.tell(), len(result)) |
|
672 | 677 | |
|
673 | 678 | def test_read_buffer_small_chunks(self): |
|
674 | 679 | cctx = zstd.ZstdCompressor() |
|
675 | 680 | |
|
676 |
source = b |
|
|
681 | source = b"foo" * 60 | |
|
677 | 682 | chunks = [] |
|
678 | 683 | |
|
679 | 684 | with cctx.stream_reader(source) as reader: |
@@ -687,12 +692,12 class TestCompressor_stream_reader(unitt | |||
|
687 | 692 | chunks.append(chunk) |
|
688 | 693 | self.assertEqual(reader.tell(), sum(map(len, chunks))) |
|
689 | 694 | |
|
690 |
self.assertEqual(b |
|
|
695 | self.assertEqual(b"".join(chunks), cctx.compress(source)) | |
|
691 | 696 | |
|
692 | 697 | def test_read_stream(self): |
|
693 | 698 | cctx = zstd.ZstdCompressor() |
|
694 | 699 | |
|
695 |
source = b |
|
|
700 | source = b"".join([b"foo" * 60, b"bar" * 60, b"baz" * 60]) | |
|
696 | 701 | frame = cctx.compress(source) |
|
697 | 702 | |
|
698 | 703 | with cctx.stream_reader(io.BytesIO(source), size=len(source)) as reader: |
@@ -701,13 +706,13 class TestCompressor_stream_reader(unitt | |||
|
701 | 706 | chunk = reader.read(8192) |
|
702 | 707 | self.assertEqual(chunk, frame) |
|
703 | 708 | self.assertEqual(reader.tell(), len(chunk)) |
|
704 |
self.assertEqual(reader.read(), b |
|
|
709 | self.assertEqual(reader.read(), b"") | |
|
705 | 710 | self.assertEqual(reader.tell(), len(chunk)) |
|
706 | 711 | |
|
707 | 712 | def test_read_stream_small_chunks(self): |
|
708 | 713 | cctx = zstd.ZstdCompressor() |
|
709 | 714 | |
|
710 |
source = b |
|
|
715 | source = b"foo" * 60 | |
|
711 | 716 | chunks = [] |
|
712 | 717 | |
|
713 | 718 | with cctx.stream_reader(io.BytesIO(source), size=len(source)) as reader: |
@@ -721,25 +726,25 class TestCompressor_stream_reader(unitt | |||
|
721 | 726 | chunks.append(chunk) |
|
722 | 727 | self.assertEqual(reader.tell(), sum(map(len, chunks))) |
|
723 | 728 | |
|
724 |
self.assertEqual(b |
|
|
729 | self.assertEqual(b"".join(chunks), cctx.compress(source)) | |
|
725 | 730 | |
|
726 | 731 | def test_read_after_exit(self): |
|
727 | 732 | cctx = zstd.ZstdCompressor() |
|
728 | 733 | |
|
729 |
with cctx.stream_reader(b |
|
|
734 | with cctx.stream_reader(b"foo" * 60) as reader: | |
|
730 | 735 | while reader.read(8192): |
|
731 | 736 | pass |
|
732 | 737 | |
|
733 |
with self.assertRaisesRegex |
|
|
738 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
734 | 739 | reader.read(10) |
|
735 | 740 | |
|
736 | 741 | def test_bad_size(self): |
|
737 | 742 | cctx = zstd.ZstdCompressor() |
|
738 | 743 | |
|
739 |
source = io.BytesIO(b |
|
|
744 | source = io.BytesIO(b"foobar") | |
|
740 | 745 | |
|
741 | 746 | with cctx.stream_reader(source, size=2) as reader: |
|
742 |
with self.assertRaisesRegex |
|
|
747 | with self.assertRaisesRegex(zstd.ZstdError, "Src size is incorrect"): | |
|
743 | 748 | reader.read(10) |
|
744 | 749 | |
|
745 | 750 | # Try another compression operation. |
@@ -748,36 +753,36 class TestCompressor_stream_reader(unitt | |||
|
748 | 753 | |
|
749 | 754 | def test_readall(self): |
|
750 | 755 | cctx = zstd.ZstdCompressor() |
|
751 |
frame = cctx.compress(b |
|
|
756 | frame = cctx.compress(b"foo" * 1024) | |
|
752 | 757 | |
|
753 |
reader = cctx.stream_reader(b |
|
|
758 | reader = cctx.stream_reader(b"foo" * 1024) | |
|
754 | 759 | self.assertEqual(reader.readall(), frame) |
|
755 | 760 | |
|
756 | 761 | def test_readinto(self): |
|
757 | 762 | cctx = zstd.ZstdCompressor() |
|
758 |
foo = cctx.compress(b |
|
|
763 | foo = cctx.compress(b"foo") | |
|
759 | 764 | |
|
760 |
reader = cctx.stream_reader(b |
|
|
765 | reader = cctx.stream_reader(b"foo") | |
|
761 | 766 | with self.assertRaises(Exception): |
|
762 |
reader.readinto(b |
|
|
767 | reader.readinto(b"foobar") | |
|
763 | 768 | |
|
764 | 769 | # readinto() with sufficiently large destination. |
|
765 | 770 | b = bytearray(1024) |
|
766 |
reader = cctx.stream_reader(b |
|
|
771 | reader = cctx.stream_reader(b"foo") | |
|
767 | 772 | self.assertEqual(reader.readinto(b), len(foo)) |
|
768 | self.assertEqual(b[0:len(foo)], foo) | |
|
773 | self.assertEqual(b[0 : len(foo)], foo) | |
|
769 | 774 | self.assertEqual(reader.readinto(b), 0) |
|
770 | self.assertEqual(b[0:len(foo)], foo) | |
|
775 | self.assertEqual(b[0 : len(foo)], foo) | |
|
771 | 776 | |
|
772 | 777 | # readinto() with small reads. |
|
773 | 778 | b = bytearray(1024) |
|
774 |
reader = cctx.stream_reader(b |
|
|
779 | reader = cctx.stream_reader(b"foo", read_size=1) | |
|
775 | 780 | self.assertEqual(reader.readinto(b), len(foo)) |
|
776 | self.assertEqual(b[0:len(foo)], foo) | |
|
781 | self.assertEqual(b[0 : len(foo)], foo) | |
|
777 | 782 | |
|
778 | 783 | # Too small destination buffer. |
|
779 | 784 | b = bytearray(2) |
|
780 |
reader = cctx.stream_reader(b |
|
|
785 | reader = cctx.stream_reader(b"foo") | |
|
781 | 786 | self.assertEqual(reader.readinto(b), 2) |
|
782 | 787 | self.assertEqual(b[:], foo[0:2]) |
|
783 | 788 | self.assertEqual(reader.readinto(b), 2) |
@@ -787,41 +792,41 class TestCompressor_stream_reader(unitt | |||
|
787 | 792 | |
|
788 | 793 | def test_readinto1(self): |
|
789 | 794 | cctx = zstd.ZstdCompressor() |
|
790 |
foo = b |
|
|
795 | foo = b"".join(cctx.read_to_iter(io.BytesIO(b"foo"))) | |
|
791 | 796 | |
|
792 |
reader = cctx.stream_reader(b |
|
|
797 | reader = cctx.stream_reader(b"foo") | |
|
793 | 798 | with self.assertRaises(Exception): |
|
794 |
reader.readinto1(b |
|
|
799 | reader.readinto1(b"foobar") | |
|
795 | 800 | |
|
796 | 801 | b = bytearray(1024) |
|
797 |
source = OpCountingBytesIO(b |
|
|
802 | source = OpCountingBytesIO(b"foo") | |
|
798 | 803 | reader = cctx.stream_reader(source) |
|
799 | 804 | self.assertEqual(reader.readinto1(b), len(foo)) |
|
800 | self.assertEqual(b[0:len(foo)], foo) | |
|
805 | self.assertEqual(b[0 : len(foo)], foo) | |
|
801 | 806 | self.assertEqual(source._read_count, 2) |
|
802 | 807 | |
|
803 | 808 | # readinto1() with small reads. |
|
804 | 809 | b = bytearray(1024) |
|
805 |
source = OpCountingBytesIO(b |
|
|
810 | source = OpCountingBytesIO(b"foo") | |
|
806 | 811 | reader = cctx.stream_reader(source, read_size=1) |
|
807 | 812 | self.assertEqual(reader.readinto1(b), len(foo)) |
|
808 | self.assertEqual(b[0:len(foo)], foo) | |
|
813 | self.assertEqual(b[0 : len(foo)], foo) | |
|
809 | 814 | self.assertEqual(source._read_count, 4) |
|
810 | 815 | |
|
811 | 816 | def test_read1(self): |
|
812 | 817 | cctx = zstd.ZstdCompressor() |
|
813 |
foo = b |
|
|
818 | foo = b"".join(cctx.read_to_iter(io.BytesIO(b"foo"))) | |
|
814 | 819 | |
|
815 |
b = OpCountingBytesIO(b |
|
|
820 | b = OpCountingBytesIO(b"foo") | |
|
816 | 821 | reader = cctx.stream_reader(b) |
|
817 | 822 | |
|
818 | 823 | self.assertEqual(reader.read1(), foo) |
|
819 | 824 | self.assertEqual(b._read_count, 2) |
|
820 | 825 | |
|
821 |
b = OpCountingBytesIO(b |
|
|
826 | b = OpCountingBytesIO(b"foo") | |
|
822 | 827 | reader = cctx.stream_reader(b) |
|
823 | 828 | |
|
824 |
self.assertEqual(reader.read1(0), b |
|
|
829 | self.assertEqual(reader.read1(0), b"") | |
|
825 | 830 | self.assertEqual(reader.read1(2), foo[0:2]) |
|
826 | 831 | self.assertEqual(b._read_count, 2) |
|
827 | 832 | self.assertEqual(reader.read1(2), foo[2:4]) |
@@ -829,7 +834,7 class TestCompressor_stream_reader(unitt | |||
|
829 | 834 | |
|
830 | 835 | |
|
831 | 836 | @make_cffi |
|
832 |
class TestCompressor_stream_writer( |
|
|
837 | class TestCompressor_stream_writer(TestCase): | |
|
833 | 838 | def test_io_api(self): |
|
834 | 839 | buffer = io.BytesIO() |
|
835 | 840 | cctx = zstd.ZstdCompressor() |
@@ -899,7 +904,7 class TestCompressor_stream_writer(unitt | |||
|
899 | 904 | self.assertFalse(writer.closed) |
|
900 | 905 | |
|
901 | 906 | def test_fileno_file(self): |
|
902 |
with tempfile.TemporaryFile( |
|
|
907 | with tempfile.TemporaryFile("wb") as tf: | |
|
903 | 908 | cctx = zstd.ZstdCompressor() |
|
904 | 909 | writer = cctx.stream_writer(tf) |
|
905 | 910 | |
@@ -910,33 +915,35 class TestCompressor_stream_writer(unitt | |||
|
910 | 915 | cctx = zstd.ZstdCompressor(level=1) |
|
911 | 916 | writer = cctx.stream_writer(buffer) |
|
912 | 917 | |
|
913 |
writer.write(b |
|
|
918 | writer.write(b"foo" * 1024) | |
|
914 | 919 | self.assertFalse(writer.closed) |
|
915 | 920 | self.assertFalse(buffer.closed) |
|
916 | 921 | writer.close() |
|
917 | 922 | self.assertTrue(writer.closed) |
|
918 | 923 | self.assertTrue(buffer.closed) |
|
919 | 924 | |
|
920 |
with self.assertRaisesRegex |
|
|
921 |
writer.write(b |
|
|
925 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
926 | writer.write(b"foo") | |
|
922 | 927 | |
|
923 |
with self.assertRaisesRegex |
|
|
928 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
924 | 929 | writer.flush() |
|
925 | 930 | |
|
926 |
with self.assertRaisesRegex |
|
|
931 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
927 | 932 | with writer: |
|
928 | 933 | pass |
|
929 | 934 | |
|
930 |
self.assertEqual( |
|
|
931 | b'\x28\xb5\x2f\xfd\x00\x48\x55\x00\x00\x18\x66\x6f' | |
|
932 | b'\x6f\x01\x00\xfa\xd3\x77\x43') | |
|
935 | self.assertEqual( | |
|
936 | buffer.getvalue(), | |
|
937 | b"\x28\xb5\x2f\xfd\x00\x48\x55\x00\x00\x18\x66\x6f" | |
|
938 | b"\x6f\x01\x00\xfa\xd3\x77\x43", | |
|
939 | ) | |
|
933 | 940 | |
|
934 | 941 | # Context manager exit should close stream. |
|
935 | 942 | buffer = io.BytesIO() |
|
936 | 943 | writer = cctx.stream_writer(buffer) |
|
937 | 944 | |
|
938 | 945 | with writer: |
|
939 |
writer.write(b |
|
|
946 | writer.write(b"foo") | |
|
940 | 947 | |
|
941 | 948 | self.assertTrue(writer.closed) |
|
942 | 949 | |
@@ -944,10 +951,10 class TestCompressor_stream_writer(unitt | |||
|
944 | 951 | buffer = NonClosingBytesIO() |
|
945 | 952 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
946 | 953 | with cctx.stream_writer(buffer) as compressor: |
|
947 |
compressor.write(b |
|
|
954 | compressor.write(b"") | |
|
948 | 955 | |
|
949 | 956 | result = buffer.getvalue() |
|
950 |
self.assertEqual(result, b |
|
|
957 | self.assertEqual(result, b"\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00") | |
|
951 | 958 | |
|
952 | 959 | params = zstd.get_frame_parameters(result) |
|
953 | 960 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
@@ -958,11 +965,11 class TestCompressor_stream_writer(unitt | |||
|
958 | 965 | # Test without context manager. |
|
959 | 966 | buffer = io.BytesIO() |
|
960 | 967 | compressor = cctx.stream_writer(buffer) |
|
961 |
self.assertEqual(compressor.write(b |
|
|
962 |
self.assertEqual(buffer.getvalue(), b |
|
|
968 | self.assertEqual(compressor.write(b""), 0) | |
|
969 | self.assertEqual(buffer.getvalue(), b"") | |
|
963 | 970 | self.assertEqual(compressor.flush(zstd.FLUSH_FRAME), 9) |
|
964 | 971 | result = buffer.getvalue() |
|
965 |
self.assertEqual(result, b |
|
|
972 | self.assertEqual(result, b"\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00") | |
|
966 | 973 | |
|
967 | 974 | params = zstd.get_frame_parameters(result) |
|
968 | 975 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
@@ -972,18 +979,18 class TestCompressor_stream_writer(unitt | |||
|
972 | 979 | |
|
973 | 980 | # Test write_return_read=True |
|
974 | 981 | compressor = cctx.stream_writer(buffer, write_return_read=True) |
|
975 |
self.assertEqual(compressor.write(b |
|
|
982 | self.assertEqual(compressor.write(b""), 0) | |
|
976 | 983 | |
|
977 | 984 | def test_input_types(self): |
|
978 |
expected = b |
|
|
985 | expected = b"\x28\xb5\x2f\xfd\x00\x48\x19\x00\x00\x66\x6f\x6f" | |
|
979 | 986 | cctx = zstd.ZstdCompressor(level=1) |
|
980 | 987 | |
|
981 | 988 | mutable_array = bytearray(3) |
|
982 |
mutable_array[:] = b |
|
|
989 | mutable_array[:] = b"foo" | |
|
983 | 990 | |
|
984 | 991 | sources = [ |
|
985 |
memoryview(b |
|
|
986 |
bytearray(b |
|
|
992 | memoryview(b"foo"), | |
|
993 | bytearray(b"foo"), | |
|
987 | 994 | mutable_array, |
|
988 | 995 | ] |
|
989 | 996 | |
@@ -1001,51 +1008,55 class TestCompressor_stream_writer(unitt | |||
|
1001 | 1008 | buffer = NonClosingBytesIO() |
|
1002 | 1009 | cctx = zstd.ZstdCompressor(level=5) |
|
1003 | 1010 | with cctx.stream_writer(buffer) as compressor: |
|
1004 |
self.assertEqual(compressor.write(b |
|
|
1005 |
self.assertEqual(compressor.write(b |
|
|
1006 |
self.assertEqual(compressor.write(b |
|
|
1011 | self.assertEqual(compressor.write(b"foo"), 0) | |
|
1012 | self.assertEqual(compressor.write(b"bar"), 0) | |
|
1013 | self.assertEqual(compressor.write(b"x" * 8192), 0) | |
|
1007 | 1014 | |
|
1008 | 1015 | result = buffer.getvalue() |
|
1009 |
self.assertEqual( |
|
|
1010 | b'\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x38\x66\x6f' | |
|
1011 | b'\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23') | |
|
1016 | self.assertEqual( | |
|
1017 | result, | |
|
1018 | b"\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x38\x66\x6f" | |
|
1019 | b"\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23", | |
|
1020 | ) | |
|
1012 | 1021 | |
|
1013 | 1022 | # Test without context manager. |
|
1014 | 1023 | buffer = io.BytesIO() |
|
1015 | 1024 | compressor = cctx.stream_writer(buffer) |
|
1016 |
self.assertEqual(compressor.write(b |
|
|
1017 |
self.assertEqual(compressor.write(b |
|
|
1018 |
self.assertEqual(compressor.write(b |
|
|
1025 | self.assertEqual(compressor.write(b"foo"), 0) | |
|
1026 | self.assertEqual(compressor.write(b"bar"), 0) | |
|
1027 | self.assertEqual(compressor.write(b"x" * 8192), 0) | |
|
1019 | 1028 | self.assertEqual(compressor.flush(zstd.FLUSH_FRAME), 23) |
|
1020 | 1029 | result = buffer.getvalue() |
|
1021 |
self.assertEqual( |
|
|
1022 | b'\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x38\x66\x6f' | |
|
1023 | b'\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23') | |
|
1030 | self.assertEqual( | |
|
1031 | result, | |
|
1032 | b"\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x38\x66\x6f" | |
|
1033 | b"\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23", | |
|
1034 | ) | |
|
1024 | 1035 | |
|
1025 | 1036 | # Test with write_return_read=True. |
|
1026 | 1037 | compressor = cctx.stream_writer(buffer, write_return_read=True) |
|
1027 |
self.assertEqual(compressor.write(b |
|
|
1028 |
self.assertEqual(compressor.write(b |
|
|
1029 |
self.assertEqual(compressor.write(b |
|
|
1038 | self.assertEqual(compressor.write(b"foo"), 3) | |
|
1039 | self.assertEqual(compressor.write(b"barbiz"), 6) | |
|
1040 | self.assertEqual(compressor.write(b"x" * 8192), 8192) | |
|
1030 | 1041 | |
|
1031 | 1042 | def test_dictionary(self): |
|
1032 | 1043 | samples = [] |
|
1033 | 1044 | for i in range(128): |
|
1034 |
samples.append(b |
|
|
1035 |
samples.append(b |
|
|
1036 |
samples.append(b |
|
|
1045 | samples.append(b"foo" * 64) | |
|
1046 | samples.append(b"bar" * 64) | |
|
1047 | samples.append(b"foobar" * 64) | |
|
1037 | 1048 | |
|
1038 | 1049 | d = zstd.train_dictionary(8192, samples) |
|
1039 | 1050 | |
|
1040 | 1051 | h = hashlib.sha1(d.as_bytes()).hexdigest() |
|
1041 |
self.assertEqual(h, |
|
|
1052 | self.assertEqual(h, "7a2e59a876db958f74257141045af8f912e00d4e") | |
|
1042 | 1053 | |
|
1043 | 1054 | buffer = NonClosingBytesIO() |
|
1044 | 1055 | cctx = zstd.ZstdCompressor(level=9, dict_data=d) |
|
1045 | 1056 | with cctx.stream_writer(buffer) as compressor: |
|
1046 |
self.assertEqual(compressor.write(b |
|
|
1047 |
self.assertEqual(compressor.write(b |
|
|
1048 |
self.assertEqual(compressor.write(b |
|
|
1057 | self.assertEqual(compressor.write(b"foo"), 0) | |
|
1058 | self.assertEqual(compressor.write(b"bar"), 0) | |
|
1059 | self.assertEqual(compressor.write(b"foo" * 16384), 0) | |
|
1049 | 1060 | |
|
1050 | 1061 | compressed = buffer.getvalue() |
|
1051 | 1062 | |
@@ -1056,14 +1067,15 class TestCompressor_stream_writer(unitt | |||
|
1056 | 1067 | self.assertFalse(params.has_checksum) |
|
1057 | 1068 | |
|
1058 | 1069 | h = hashlib.sha1(compressed).hexdigest() |
|
1059 |
self.assertEqual(h, |
|
|
1070 | self.assertEqual(h, "0a7c05635061f58039727cdbe76388c6f4cfef06") | |
|
1060 | 1071 | |
|
1061 |
source = b |
|
|
1072 | source = b"foo" + b"bar" + (b"foo" * 16384) | |
|
1062 | 1073 | |
|
1063 | 1074 | dctx = zstd.ZstdDecompressor(dict_data=d) |
|
1064 | 1075 | |
|
1065 | self.assertEqual(dctx.decompress(compressed, max_output_size=len(source)), | |
|
1066 | source) | |
|
1076 | self.assertEqual( | |
|
1077 | dctx.decompress(compressed, max_output_size=len(source)), source | |
|
1078 | ) | |
|
1067 | 1079 | |
|
1068 | 1080 | def test_compression_params(self): |
|
1069 | 1081 | params = zstd.ZstdCompressionParameters( |
@@ -1073,14 +1085,15 class TestCompressor_stream_writer(unitt | |||
|
1073 | 1085 | min_match=5, |
|
1074 | 1086 | search_log=4, |
|
1075 | 1087 | target_length=10, |
|
1076 |
strategy=zstd.STRATEGY_FAST |
|
|
1088 | strategy=zstd.STRATEGY_FAST, | |
|
1089 | ) | |
|
1077 | 1090 | |
|
1078 | 1091 | buffer = NonClosingBytesIO() |
|
1079 | 1092 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
1080 | 1093 | with cctx.stream_writer(buffer) as compressor: |
|
1081 |
self.assertEqual(compressor.write(b |
|
|
1082 |
self.assertEqual(compressor.write(b |
|
|
1083 |
self.assertEqual(compressor.write(b |
|
|
1094 | self.assertEqual(compressor.write(b"foo"), 0) | |
|
1095 | self.assertEqual(compressor.write(b"bar"), 0) | |
|
1096 | self.assertEqual(compressor.write(b"foobar" * 16384), 0) | |
|
1084 | 1097 | |
|
1085 | 1098 | compressed = buffer.getvalue() |
|
1086 | 1099 | |
@@ -1091,18 +1104,18 class TestCompressor_stream_writer(unitt | |||
|
1091 | 1104 | self.assertFalse(params.has_checksum) |
|
1092 | 1105 | |
|
1093 | 1106 | h = hashlib.sha1(compressed).hexdigest() |
|
1094 |
self.assertEqual(h, |
|
|
1107 | self.assertEqual(h, "dd4bb7d37c1a0235b38a2f6b462814376843ef0b") | |
|
1095 | 1108 | |
|
1096 | 1109 | def test_write_checksum(self): |
|
1097 | 1110 | no_checksum = NonClosingBytesIO() |
|
1098 | 1111 | cctx = zstd.ZstdCompressor(level=1) |
|
1099 | 1112 | with cctx.stream_writer(no_checksum) as compressor: |
|
1100 |
self.assertEqual(compressor.write(b |
|
|
1113 | self.assertEqual(compressor.write(b"foobar"), 0) | |
|
1101 | 1114 | |
|
1102 | 1115 | with_checksum = NonClosingBytesIO() |
|
1103 | 1116 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
1104 | 1117 | with cctx.stream_writer(with_checksum) as compressor: |
|
1105 |
self.assertEqual(compressor.write(b |
|
|
1118 | self.assertEqual(compressor.write(b"foobar"), 0) | |
|
1106 | 1119 | |
|
1107 | 1120 | no_params = zstd.get_frame_parameters(no_checksum.getvalue()) |
|
1108 | 1121 | with_params = zstd.get_frame_parameters(with_checksum.getvalue()) |
@@ -1113,29 +1126,27 class TestCompressor_stream_writer(unitt | |||
|
1113 | 1126 | self.assertFalse(no_params.has_checksum) |
|
1114 | 1127 | self.assertTrue(with_params.has_checksum) |
|
1115 | 1128 | |
|
1116 | self.assertEqual(len(with_checksum.getvalue()), | |
|
1117 | len(no_checksum.getvalue()) + 4) | |
|
1129 | self.assertEqual(len(with_checksum.getvalue()), len(no_checksum.getvalue()) + 4) | |
|
1118 | 1130 | |
|
1119 | 1131 | def test_write_content_size(self): |
|
1120 | 1132 | no_size = NonClosingBytesIO() |
|
1121 | 1133 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
1122 | 1134 | with cctx.stream_writer(no_size) as compressor: |
|
1123 |
self.assertEqual(compressor.write(b |
|
|
1135 | self.assertEqual(compressor.write(b"foobar" * 256), 0) | |
|
1124 | 1136 | |
|
1125 | 1137 | with_size = NonClosingBytesIO() |
|
1126 | 1138 | cctx = zstd.ZstdCompressor(level=1) |
|
1127 | 1139 | with cctx.stream_writer(with_size) as compressor: |
|
1128 |
self.assertEqual(compressor.write(b |
|
|
1140 | self.assertEqual(compressor.write(b"foobar" * 256), 0) | |
|
1129 | 1141 | |
|
1130 | 1142 | # Source size is not known in streaming mode, so header not |
|
1131 | 1143 | # written. |
|
1132 | self.assertEqual(len(with_size.getvalue()), | |
|
1133 | len(no_size.getvalue())) | |
|
1144 | self.assertEqual(len(with_size.getvalue()), len(no_size.getvalue())) | |
|
1134 | 1145 | |
|
1135 | 1146 | # Declaring size will write the header. |
|
1136 | 1147 | with_size = NonClosingBytesIO() |
|
1137 |
with cctx.stream_writer(with_size, size=len(b |
|
|
1138 |
self.assertEqual(compressor.write(b |
|
|
1148 | with cctx.stream_writer(with_size, size=len(b"foobar" * 256)) as compressor: | |
|
1149 | self.assertEqual(compressor.write(b"foobar" * 256), 0) | |
|
1139 | 1150 | |
|
1140 | 1151 | no_params = zstd.get_frame_parameters(no_size.getvalue()) |
|
1141 | 1152 | with_params = zstd.get_frame_parameters(with_size.getvalue()) |
@@ -1146,31 +1157,30 class TestCompressor_stream_writer(unitt | |||
|
1146 | 1157 | self.assertFalse(no_params.has_checksum) |
|
1147 | 1158 | self.assertFalse(with_params.has_checksum) |
|
1148 | 1159 | |
|
1149 | self.assertEqual(len(with_size.getvalue()), | |
|
1150 | len(no_size.getvalue()) + 1) | |
|
1160 | self.assertEqual(len(with_size.getvalue()), len(no_size.getvalue()) + 1) | |
|
1151 | 1161 | |
|
1152 | 1162 | def test_no_dict_id(self): |
|
1153 | 1163 | samples = [] |
|
1154 | 1164 | for i in range(128): |
|
1155 |
samples.append(b |
|
|
1156 |
samples.append(b |
|
|
1157 |
samples.append(b |
|
|
1165 | samples.append(b"foo" * 64) | |
|
1166 | samples.append(b"bar" * 64) | |
|
1167 | samples.append(b"foobar" * 64) | |
|
1158 | 1168 | |
|
1159 | 1169 | d = zstd.train_dictionary(1024, samples) |
|
1160 | 1170 | |
|
1161 | 1171 | with_dict_id = NonClosingBytesIO() |
|
1162 | 1172 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
1163 | 1173 | with cctx.stream_writer(with_dict_id) as compressor: |
|
1164 |
self.assertEqual(compressor.write(b |
|
|
1174 | self.assertEqual(compressor.write(b"foobarfoobar"), 0) | |
|
1165 | 1175 | |
|
1166 |
self.assertEqual(with_dict_id.getvalue()[4:5], b |
|
|
1176 | self.assertEqual(with_dict_id.getvalue()[4:5], b"\x03") | |
|
1167 | 1177 | |
|
1168 | 1178 | cctx = zstd.ZstdCompressor(level=1, dict_data=d, write_dict_id=False) |
|
1169 | 1179 | no_dict_id = NonClosingBytesIO() |
|
1170 | 1180 | with cctx.stream_writer(no_dict_id) as compressor: |
|
1171 |
self.assertEqual(compressor.write(b |
|
|
1181 | self.assertEqual(compressor.write(b"foobarfoobar"), 0) | |
|
1172 | 1182 | |
|
1173 |
self.assertEqual(no_dict_id.getvalue()[4:5], b |
|
|
1183 | self.assertEqual(no_dict_id.getvalue()[4:5], b"\x00") | |
|
1174 | 1184 | |
|
1175 | 1185 | no_params = zstd.get_frame_parameters(no_dict_id.getvalue()) |
|
1176 | 1186 | with_params = zstd.get_frame_parameters(with_dict_id.getvalue()) |
@@ -1181,14 +1191,13 class TestCompressor_stream_writer(unitt | |||
|
1181 | 1191 | self.assertFalse(no_params.has_checksum) |
|
1182 | 1192 | self.assertFalse(with_params.has_checksum) |
|
1183 | 1193 | |
|
1184 | self.assertEqual(len(with_dict_id.getvalue()), | |
|
1185 | len(no_dict_id.getvalue()) + 4) | |
|
1194 | self.assertEqual(len(with_dict_id.getvalue()), len(no_dict_id.getvalue()) + 4) | |
|
1186 | 1195 | |
|
1187 | 1196 | def test_memory_size(self): |
|
1188 | 1197 | cctx = zstd.ZstdCompressor(level=3) |
|
1189 | 1198 | buffer = io.BytesIO() |
|
1190 | 1199 | with cctx.stream_writer(buffer) as compressor: |
|
1191 |
compressor.write(b |
|
|
1200 | compressor.write(b"foo") | |
|
1192 | 1201 | size = compressor.memory_size() |
|
1193 | 1202 | |
|
1194 | 1203 | self.assertGreater(size, 100000) |
@@ -1197,9 +1206,9 class TestCompressor_stream_writer(unitt | |||
|
1197 | 1206 | cctx = zstd.ZstdCompressor(level=3) |
|
1198 | 1207 | dest = OpCountingBytesIO() |
|
1199 | 1208 | with cctx.stream_writer(dest, write_size=1) as compressor: |
|
1200 |
self.assertEqual(compressor.write(b |
|
|
1201 |
self.assertEqual(compressor.write(b |
|
|
1202 |
self.assertEqual(compressor.write(b |
|
|
1209 | self.assertEqual(compressor.write(b"foo"), 0) | |
|
1210 | self.assertEqual(compressor.write(b"bar"), 0) | |
|
1211 | self.assertEqual(compressor.write(b"foobar"), 0) | |
|
1203 | 1212 | |
|
1204 | 1213 | self.assertEqual(len(dest.getvalue()), dest._write_count) |
|
1205 | 1214 | |
@@ -1207,15 +1216,15 class TestCompressor_stream_writer(unitt | |||
|
1207 | 1216 | cctx = zstd.ZstdCompressor(level=3) |
|
1208 | 1217 | dest = OpCountingBytesIO() |
|
1209 | 1218 | with cctx.stream_writer(dest) as compressor: |
|
1210 |
self.assertEqual(compressor.write(b |
|
|
1219 | self.assertEqual(compressor.write(b"foo"), 0) | |
|
1211 | 1220 | self.assertEqual(dest._write_count, 0) |
|
1212 | 1221 | self.assertEqual(compressor.flush(), 12) |
|
1213 | 1222 | self.assertEqual(dest._write_count, 1) |
|
1214 |
self.assertEqual(compressor.write(b |
|
|
1223 | self.assertEqual(compressor.write(b"bar"), 0) | |
|
1215 | 1224 | self.assertEqual(dest._write_count, 1) |
|
1216 | 1225 | self.assertEqual(compressor.flush(), 6) |
|
1217 | 1226 | self.assertEqual(dest._write_count, 2) |
|
1218 |
self.assertEqual(compressor.write(b |
|
|
1227 | self.assertEqual(compressor.write(b"baz"), 0) | |
|
1219 | 1228 | |
|
1220 | 1229 | self.assertEqual(dest._write_count, 3) |
|
1221 | 1230 | |
@@ -1223,7 +1232,7 class TestCompressor_stream_writer(unitt | |||
|
1223 | 1232 | cctx = zstd.ZstdCompressor(level=3, write_checksum=True) |
|
1224 | 1233 | dest = OpCountingBytesIO() |
|
1225 | 1234 | with cctx.stream_writer(dest) as compressor: |
|
1226 |
self.assertEqual(compressor.write(b |
|
|
1235 | self.assertEqual(compressor.write(b"foobar" * 8192), 0) | |
|
1227 | 1236 | count = dest._write_count |
|
1228 | 1237 | offset = dest.tell() |
|
1229 | 1238 | self.assertEqual(compressor.flush(), 23) |
@@ -1238,41 +1247,43 class TestCompressor_stream_writer(unitt | |||
|
1238 | 1247 | self.assertEqual(len(trailing), 7) |
|
1239 | 1248 | |
|
1240 | 1249 | header = trailing[0:3] |
|
1241 |
self.assertEqual(header, b |
|
|
1250 | self.assertEqual(header, b"\x01\x00\x00") | |
|
1242 | 1251 | |
|
1243 | 1252 | def test_flush_frame(self): |
|
1244 | 1253 | cctx = zstd.ZstdCompressor(level=3) |
|
1245 | 1254 | dest = OpCountingBytesIO() |
|
1246 | 1255 | |
|
1247 | 1256 | with cctx.stream_writer(dest) as compressor: |
|
1248 |
self.assertEqual(compressor.write(b |
|
|
1257 | self.assertEqual(compressor.write(b"foobar" * 8192), 0) | |
|
1249 | 1258 | self.assertEqual(compressor.flush(zstd.FLUSH_FRAME), 23) |
|
1250 |
compressor.write(b |
|
|
1259 | compressor.write(b"biz" * 16384) | |
|
1251 | 1260 | |
|
1252 |
self.assertEqual( |
|
|
1253 | # Frame 1. | |
|
1254 | b'\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x30\x66\x6f\x6f' | |
|
1255 | b'\x62\x61\x72\x01\x00\xf7\xbf\xe8\xa5\x08' | |
|
1256 | # Frame 2. | |
|
1257 | b'\x28\xb5\x2f\xfd\x00\x58\x5d\x00\x00\x18\x62\x69\x7a' | |
|
1258 | b'\x01\x00\xfa\x3f\x75\x37\x04') | |
|
1261 | self.assertEqual( | |
|
1262 | dest.getvalue(), | |
|
1263 | # Frame 1. | |
|
1264 | b"\x28\xb5\x2f\xfd\x00\x58\x75\x00\x00\x30\x66\x6f\x6f" | |
|
1265 | b"\x62\x61\x72\x01\x00\xf7\xbf\xe8\xa5\x08" | |
|
1266 | # Frame 2. | |
|
1267 | b"\x28\xb5\x2f\xfd\x00\x58\x5d\x00\x00\x18\x62\x69\x7a" | |
|
1268 | b"\x01\x00\xfa\x3f\x75\x37\x04", | |
|
1269 | ) | |
|
1259 | 1270 | |
|
1260 | 1271 | def test_bad_flush_mode(self): |
|
1261 | 1272 | cctx = zstd.ZstdCompressor() |
|
1262 | 1273 | dest = io.BytesIO() |
|
1263 | 1274 | with cctx.stream_writer(dest) as compressor: |
|
1264 |
with self.assertRaisesRegex |
|
|
1275 | with self.assertRaisesRegex(ValueError, "unknown flush_mode: 42"): | |
|
1265 | 1276 | compressor.flush(flush_mode=42) |
|
1266 | 1277 | |
|
1267 | 1278 | def test_multithreaded(self): |
|
1268 | 1279 | dest = NonClosingBytesIO() |
|
1269 | 1280 | cctx = zstd.ZstdCompressor(threads=2) |
|
1270 | 1281 | with cctx.stream_writer(dest) as compressor: |
|
1271 |
compressor.write(b |
|
|
1272 |
compressor.write(b |
|
|
1273 |
compressor.write(b |
|
|
1282 | compressor.write(b"a" * 1048576) | |
|
1283 | compressor.write(b"b" * 1048576) | |
|
1284 | compressor.write(b"c" * 1048576) | |
|
1274 | 1285 | |
|
1275 |
self.assertEqual(len(dest.getvalue()), |
|
|
1286 | self.assertEqual(len(dest.getvalue()), 111) | |
|
1276 | 1287 | |
|
1277 | 1288 | def test_tell(self): |
|
1278 | 1289 | dest = io.BytesIO() |
@@ -1281,7 +1292,7 class TestCompressor_stream_writer(unitt | |||
|
1281 | 1292 | self.assertEqual(compressor.tell(), 0) |
|
1282 | 1293 | |
|
1283 | 1294 | for i in range(256): |
|
1284 |
compressor.write(b |
|
|
1295 | compressor.write(b"foo" * (i + 1)) | |
|
1285 | 1296 | self.assertEqual(compressor.tell(), dest.tell()) |
|
1286 | 1297 | |
|
1287 | 1298 | def test_bad_size(self): |
@@ -1289,9 +1300,9 class TestCompressor_stream_writer(unitt | |||
|
1289 | 1300 | |
|
1290 | 1301 | dest = io.BytesIO() |
|
1291 | 1302 | |
|
1292 |
with self.assertRaisesRegex |
|
|
1303 | with self.assertRaisesRegex(zstd.ZstdError, "Src size is incorrect"): | |
|
1293 | 1304 | with cctx.stream_writer(dest, size=2) as compressor: |
|
1294 |
compressor.write(b |
|
|
1305 | compressor.write(b"foo") | |
|
1295 | 1306 | |
|
1296 | 1307 | # Test another operation. |
|
1297 | 1308 | with cctx.stream_writer(dest, size=42): |
@@ -1301,20 +1312,20 class TestCompressor_stream_writer(unitt | |||
|
1301 | 1312 | dest = NonClosingBytesIO() |
|
1302 | 1313 | cctx = zstd.ZstdCompressor() |
|
1303 | 1314 | with cctx.stream_writer(dest) as compressor: |
|
1304 |
with tarfile.open( |
|
|
1305 |
tf.add(__file__, |
|
|
1315 | with tarfile.open("tf", mode="w|", fileobj=compressor) as tf: | |
|
1316 | tf.add(__file__, "test_compressor.py") | |
|
1306 | 1317 | |
|
1307 | 1318 | dest = io.BytesIO(dest.getvalue()) |
|
1308 | 1319 | |
|
1309 | 1320 | dctx = zstd.ZstdDecompressor() |
|
1310 | 1321 | with dctx.stream_reader(dest) as reader: |
|
1311 |
with tarfile.open(mode= |
|
|
1322 | with tarfile.open(mode="r|", fileobj=reader) as tf: | |
|
1312 | 1323 | for member in tf: |
|
1313 |
self.assertEqual(member.name, |
|
|
1324 | self.assertEqual(member.name, "test_compressor.py") | |
|
1314 | 1325 | |
|
1315 | 1326 | |
|
1316 | 1327 | @make_cffi |
|
1317 |
class TestCompressor_read_to_iter( |
|
|
1328 | class TestCompressor_read_to_iter(TestCase): | |
|
1318 | 1329 | def test_type_validation(self): |
|
1319 | 1330 | cctx = zstd.ZstdCompressor() |
|
1320 | 1331 | |
@@ -1323,10 +1334,10 class TestCompressor_read_to_iter(unitte | |||
|
1323 | 1334 | pass |
|
1324 | 1335 | |
|
1325 | 1336 | # Buffer protocol works. |
|
1326 |
for chunk in cctx.read_to_iter(b |
|
|
1337 | for chunk in cctx.read_to_iter(b"foobar"): | |
|
1327 | 1338 | pass |
|
1328 | 1339 | |
|
1329 |
with self.assertRaisesRegex |
|
|
1340 | with self.assertRaisesRegex(ValueError, "must pass an object with a read"): | |
|
1330 | 1341 | for chunk in cctx.read_to_iter(True): |
|
1331 | 1342 | pass |
|
1332 | 1343 | |
@@ -1337,22 +1348,22 class TestCompressor_read_to_iter(unitte | |||
|
1337 | 1348 | it = cctx.read_to_iter(source) |
|
1338 | 1349 | chunks = list(it) |
|
1339 | 1350 | self.assertEqual(len(chunks), 1) |
|
1340 |
compressed = b |
|
|
1341 |
self.assertEqual(compressed, b |
|
|
1351 | compressed = b"".join(chunks) | |
|
1352 | self.assertEqual(compressed, b"\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00") | |
|
1342 | 1353 | |
|
1343 | 1354 | # And again with the buffer protocol. |
|
1344 |
it = cctx.read_to_iter(b |
|
|
1355 | it = cctx.read_to_iter(b"") | |
|
1345 | 1356 | chunks = list(it) |
|
1346 | 1357 | self.assertEqual(len(chunks), 1) |
|
1347 |
compressed2 = b |
|
|
1358 | compressed2 = b"".join(chunks) | |
|
1348 | 1359 | self.assertEqual(compressed2, compressed) |
|
1349 | 1360 | |
|
1350 | 1361 | def test_read_large(self): |
|
1351 | 1362 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
1352 | 1363 | |
|
1353 | 1364 | source = io.BytesIO() |
|
1354 |
source.write(b |
|
|
1355 |
source.write(b |
|
|
1365 | source.write(b"f" * zstd.COMPRESSION_RECOMMENDED_INPUT_SIZE) | |
|
1366 | source.write(b"o") | |
|
1356 | 1367 | source.seek(0) |
|
1357 | 1368 | |
|
1358 | 1369 | # Creating an iterator should not perform any compression until |
@@ -1380,9 +1391,9 class TestCompressor_read_to_iter(unitte | |||
|
1380 | 1391 | next(it) |
|
1381 | 1392 | |
|
1382 | 1393 | # We should get the same output as the one-shot compression mechanism. |
|
1383 |
self.assertEqual(b |
|
|
1394 | self.assertEqual(b"".join(chunks), cctx.compress(source.getvalue())) | |
|
1384 | 1395 | |
|
1385 |
params = zstd.get_frame_parameters(b |
|
|
1396 | params = zstd.get_frame_parameters(b"".join(chunks)) | |
|
1386 | 1397 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
1387 | 1398 | self.assertEqual(params.window_size, 262144) |
|
1388 | 1399 | self.assertEqual(params.dict_id, 0) |
@@ -1393,16 +1404,16 class TestCompressor_read_to_iter(unitte | |||
|
1393 | 1404 | chunks = list(it) |
|
1394 | 1405 | self.assertEqual(len(chunks), 2) |
|
1395 | 1406 | |
|
1396 |
params = zstd.get_frame_parameters(b |
|
|
1407 | params = zstd.get_frame_parameters(b"".join(chunks)) | |
|
1397 | 1408 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
1398 | #self.assertEqual(params.window_size, 262144) | |
|
1409 | # self.assertEqual(params.window_size, 262144) | |
|
1399 | 1410 | self.assertEqual(params.dict_id, 0) |
|
1400 | 1411 | self.assertFalse(params.has_checksum) |
|
1401 | 1412 | |
|
1402 |
self.assertEqual(b |
|
|
1413 | self.assertEqual(b"".join(chunks), cctx.compress(source.getvalue())) | |
|
1403 | 1414 | |
|
1404 | 1415 | def test_read_write_size(self): |
|
1405 |
source = OpCountingBytesIO(b |
|
|
1416 | source = OpCountingBytesIO(b"foobarfoobar") | |
|
1406 | 1417 | cctx = zstd.ZstdCompressor(level=3) |
|
1407 | 1418 | for chunk in cctx.read_to_iter(source, read_size=1, write_size=1): |
|
1408 | 1419 | self.assertEqual(len(chunk), 1) |
@@ -1411,42 +1422,42 class TestCompressor_read_to_iter(unitte | |||
|
1411 | 1422 | |
|
1412 | 1423 | def test_multithreaded(self): |
|
1413 | 1424 | source = io.BytesIO() |
|
1414 |
source.write(b |
|
|
1415 |
source.write(b |
|
|
1416 |
source.write(b |
|
|
1425 | source.write(b"a" * 1048576) | |
|
1426 | source.write(b"b" * 1048576) | |
|
1427 | source.write(b"c" * 1048576) | |
|
1417 | 1428 | source.seek(0) |
|
1418 | 1429 | |
|
1419 | 1430 | cctx = zstd.ZstdCompressor(threads=2) |
|
1420 | 1431 | |
|
1421 |
compressed = b |
|
|
1422 |
self.assertEqual(len(compressed), |
|
|
1432 | compressed = b"".join(cctx.read_to_iter(source)) | |
|
1433 | self.assertEqual(len(compressed), 111) | |
|
1423 | 1434 | |
|
1424 | 1435 | def test_bad_size(self): |
|
1425 | 1436 | cctx = zstd.ZstdCompressor() |
|
1426 | 1437 | |
|
1427 |
source = io.BytesIO(b |
|
|
1438 | source = io.BytesIO(b"a" * 42) | |
|
1428 | 1439 | |
|
1429 |
with self.assertRaisesRegex |
|
|
1430 |
b |
|
|
1440 | with self.assertRaisesRegex(zstd.ZstdError, "Src size is incorrect"): | |
|
1441 | b"".join(cctx.read_to_iter(source, size=2)) | |
|
1431 | 1442 | |
|
1432 | 1443 | # Test another operation on errored compressor. |
|
1433 |
b |
|
|
1444 | b"".join(cctx.read_to_iter(source)) | |
|
1434 | 1445 | |
|
1435 | 1446 | |
|
1436 | 1447 | @make_cffi |
|
1437 |
class TestCompressor_chunker( |
|
|
1448 | class TestCompressor_chunker(TestCase): | |
|
1438 | 1449 | def test_empty(self): |
|
1439 | 1450 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
1440 | 1451 | chunker = cctx.chunker() |
|
1441 | 1452 | |
|
1442 |
it = chunker.compress(b |
|
|
1453 | it = chunker.compress(b"") | |
|
1443 | 1454 | |
|
1444 | 1455 | with self.assertRaises(StopIteration): |
|
1445 | 1456 | next(it) |
|
1446 | 1457 | |
|
1447 | 1458 | it = chunker.finish() |
|
1448 | 1459 | |
|
1449 |
self.assertEqual(next(it), b |
|
|
1460 | self.assertEqual(next(it), b"\x28\xb5\x2f\xfd\x00\x58\x01\x00\x00") | |
|
1450 | 1461 | |
|
1451 | 1462 | with self.assertRaises(StopIteration): |
|
1452 | 1463 | next(it) |
@@ -1455,21 +1466,23 class TestCompressor_chunker(unittest.Te | |||
|
1455 | 1466 | cctx = zstd.ZstdCompressor() |
|
1456 | 1467 | chunker = cctx.chunker() |
|
1457 | 1468 | |
|
1458 |
it = chunker.compress(b |
|
|
1469 | it = chunker.compress(b"foobar") | |
|
1459 | 1470 | |
|
1460 | 1471 | with self.assertRaises(StopIteration): |
|
1461 | 1472 | next(it) |
|
1462 | 1473 | |
|
1463 |
it = chunker.compress(b |
|
|
1474 | it = chunker.compress(b"baz" * 30) | |
|
1464 | 1475 | |
|
1465 | 1476 | with self.assertRaises(StopIteration): |
|
1466 | 1477 | next(it) |
|
1467 | 1478 | |
|
1468 | 1479 | it = chunker.finish() |
|
1469 | 1480 | |
|
1470 |
self.assertEqual( |
|
|
1471 | b'\x28\xb5\x2f\xfd\x00\x58\x7d\x00\x00\x48\x66\x6f' | |
|
1472 | b'\x6f\x62\x61\x72\x62\x61\x7a\x01\x00\xe4\xe4\x8e') | |
|
1481 | self.assertEqual( | |
|
1482 | next(it), | |
|
1483 | b"\x28\xb5\x2f\xfd\x00\x58\x7d\x00\x00\x48\x66\x6f" | |
|
1484 | b"\x6f\x62\x61\x72\x62\x61\x7a\x01\x00\xe4\xe4\x8e", | |
|
1485 | ) | |
|
1473 | 1486 | |
|
1474 | 1487 | with self.assertRaises(StopIteration): |
|
1475 | 1488 | next(it) |
@@ -1478,57 +1491,60 class TestCompressor_chunker(unittest.Te | |||
|
1478 | 1491 | cctx = zstd.ZstdCompressor() |
|
1479 | 1492 | chunker = cctx.chunker(size=1024) |
|
1480 | 1493 | |
|
1481 |
it = chunker.compress(b |
|
|
1494 | it = chunker.compress(b"x" * 1000) | |
|
1482 | 1495 | |
|
1483 | 1496 | with self.assertRaises(StopIteration): |
|
1484 | 1497 | next(it) |
|
1485 | 1498 | |
|
1486 |
it = chunker.compress(b |
|
|
1499 | it = chunker.compress(b"y" * 24) | |
|
1487 | 1500 | |
|
1488 | 1501 | with self.assertRaises(StopIteration): |
|
1489 | 1502 | next(it) |
|
1490 | 1503 | |
|
1491 | 1504 | chunks = list(chunker.finish()) |
|
1492 | 1505 | |
|
1493 |
self.assertEqual( |
|
|
1494 | b'\x28\xb5\x2f\xfd\x60\x00\x03\x65\x00\x00\x18\x78\x78\x79\x02\x00' | |
|
1495 | b'\xa0\x16\xe3\x2b\x80\x05' | |
|
1496 | ]) | |
|
1506 | self.assertEqual( | |
|
1507 | chunks, | |
|
1508 | [ | |
|
1509 | b"\x28\xb5\x2f\xfd\x60\x00\x03\x65\x00\x00\x18\x78\x78\x79\x02\x00" | |
|
1510 | b"\xa0\x16\xe3\x2b\x80\x05" | |
|
1511 | ], | |
|
1512 | ) | |
|
1497 | 1513 | |
|
1498 | 1514 | dctx = zstd.ZstdDecompressor() |
|
1499 | 1515 | |
|
1500 |
self.assertEqual(dctx.decompress(b |
|
|
1501 | (b'x' * 1000) + (b'y' * 24)) | |
|
1516 | self.assertEqual(dctx.decompress(b"".join(chunks)), (b"x" * 1000) + (b"y" * 24)) | |
|
1502 | 1517 | |
|
1503 | 1518 | def test_small_chunk_size(self): |
|
1504 | 1519 | cctx = zstd.ZstdCompressor() |
|
1505 | 1520 | chunker = cctx.chunker(chunk_size=1) |
|
1506 | 1521 | |
|
1507 |
chunks = list(chunker.compress(b |
|
|
1522 | chunks = list(chunker.compress(b"foo" * 1024)) | |
|
1508 | 1523 | self.assertEqual(chunks, []) |
|
1509 | 1524 | |
|
1510 | 1525 | chunks = list(chunker.finish()) |
|
1511 | 1526 | self.assertTrue(all(len(chunk) == 1 for chunk in chunks)) |
|
1512 | 1527 | |
|
1513 | 1528 | self.assertEqual( |
|
1514 |
b |
|
|
1515 |
b |
|
|
1516 |
b |
|
|
1529 | b"".join(chunks), | |
|
1530 | b"\x28\xb5\x2f\xfd\x00\x58\x55\x00\x00\x18\x66\x6f\x6f\x01\x00" | |
|
1531 | b"\xfa\xd3\x77\x43", | |
|
1532 | ) | |
|
1517 | 1533 | |
|
1518 | 1534 | dctx = zstd.ZstdDecompressor() |
|
1519 |
self.assertEqual( |
|
|
1520 | max_output_size=10000), | |
|
1521 | b'foo' * 1024) | |
|
1535 | self.assertEqual( | |
|
1536 | dctx.decompress(b"".join(chunks), max_output_size=10000), b"foo" * 1024 | |
|
1537 | ) | |
|
1522 | 1538 | |
|
1523 | 1539 | def test_input_types(self): |
|
1524 | 1540 | cctx = zstd.ZstdCompressor() |
|
1525 | 1541 | |
|
1526 | 1542 | mutable_array = bytearray(3) |
|
1527 |
mutable_array[:] = b |
|
|
1543 | mutable_array[:] = b"foo" | |
|
1528 | 1544 | |
|
1529 | 1545 | sources = [ |
|
1530 |
memoryview(b |
|
|
1531 |
bytearray(b |
|
|
1546 | memoryview(b"foo"), | |
|
1547 | bytearray(b"foo"), | |
|
1532 | 1548 | mutable_array, |
|
1533 | 1549 | ] |
|
1534 | 1550 | |
@@ -1536,28 +1552,32 class TestCompressor_chunker(unittest.Te | |||
|
1536 | 1552 | chunker = cctx.chunker() |
|
1537 | 1553 | |
|
1538 | 1554 | self.assertEqual(list(chunker.compress(source)), []) |
|
1539 |
self.assertEqual( |
|
|
1540 | b'\x28\xb5\x2f\xfd\x00\x58\x19\x00\x00\x66\x6f\x6f' | |
|
1541 | ]) | |
|
1555 | self.assertEqual( | |
|
1556 | list(chunker.finish()), | |
|
1557 | [b"\x28\xb5\x2f\xfd\x00\x58\x19\x00\x00\x66\x6f\x6f"], | |
|
1558 | ) | |
|
1542 | 1559 | |
|
1543 | 1560 | def test_flush(self): |
|
1544 | 1561 | cctx = zstd.ZstdCompressor() |
|
1545 | 1562 | chunker = cctx.chunker() |
|
1546 | 1563 | |
|
1547 |
self.assertEqual(list(chunker.compress(b |
|
|
1548 |
self.assertEqual(list(chunker.compress(b |
|
|
1564 | self.assertEqual(list(chunker.compress(b"foo" * 1024)), []) | |
|
1565 | self.assertEqual(list(chunker.compress(b"bar" * 1024)), []) | |
|
1549 | 1566 | |
|
1550 | 1567 | chunks1 = list(chunker.flush()) |
|
1551 | 1568 | |
|
1552 |
self.assertEqual( |
|
|
1553 | b'\x28\xb5\x2f\xfd\x00\x58\x8c\x00\x00\x30\x66\x6f\x6f\x62\x61\x72' | |
|
1554 | b'\x02\x00\xfa\x03\xfe\xd0\x9f\xbe\x1b\x02' | |
|
1555 | ]) | |
|
1569 | self.assertEqual( | |
|
1570 | chunks1, | |
|
1571 | [ | |
|
1572 | b"\x28\xb5\x2f\xfd\x00\x58\x8c\x00\x00\x30\x66\x6f\x6f\x62\x61\x72" | |
|
1573 | b"\x02\x00\xfa\x03\xfe\xd0\x9f\xbe\x1b\x02" | |
|
1574 | ], | |
|
1575 | ) | |
|
1556 | 1576 | |
|
1557 | 1577 | self.assertEqual(list(chunker.flush()), []) |
|
1558 | 1578 | self.assertEqual(list(chunker.flush()), []) |
|
1559 | 1579 | |
|
1560 |
self.assertEqual(list(chunker.compress(b |
|
|
1580 | self.assertEqual(list(chunker.compress(b"baz" * 1024)), []) | |
|
1561 | 1581 | |
|
1562 | 1582 | chunks2 = list(chunker.flush()) |
|
1563 | 1583 | self.assertEqual(len(chunks2), 1) |
@@ -1567,53 +1587,56 class TestCompressor_chunker(unittest.Te | |||
|
1567 | 1587 | |
|
1568 | 1588 | dctx = zstd.ZstdDecompressor() |
|
1569 | 1589 | |
|
1570 | self.assertEqual(dctx.decompress(b''.join(chunks1 + chunks2 + chunks3), | |
|
1571 | max_output_size=10000), | |
|
1572 | (b'foo' * 1024) + (b'bar' * 1024) + (b'baz' * 1024)) | |
|
1590 | self.assertEqual( | |
|
1591 | dctx.decompress( | |
|
1592 | b"".join(chunks1 + chunks2 + chunks3), max_output_size=10000 | |
|
1593 | ), | |
|
1594 | (b"foo" * 1024) + (b"bar" * 1024) + (b"baz" * 1024), | |
|
1595 | ) | |
|
1573 | 1596 | |
|
1574 | 1597 | def test_compress_after_finish(self): |
|
1575 | 1598 | cctx = zstd.ZstdCompressor() |
|
1576 | 1599 | chunker = cctx.chunker() |
|
1577 | 1600 | |
|
1578 |
list(chunker.compress(b |
|
|
1601 | list(chunker.compress(b"foo")) | |
|
1579 | 1602 | list(chunker.finish()) |
|
1580 | 1603 | |
|
1581 |
with self.assertRaisesRegex |
|
|
1582 | zstd.ZstdError, | |
|
1583 | r'cannot call compress\(\) after compression finished'): | |
|
1584 |
list(chunker.compress(b |
|
|
1604 | with self.assertRaisesRegex( | |
|
1605 | zstd.ZstdError, r"cannot call compress\(\) after compression finished" | |
|
1606 | ): | |
|
1607 | list(chunker.compress(b"foo")) | |
|
1585 | 1608 | |
|
1586 | 1609 | def test_flush_after_finish(self): |
|
1587 | 1610 | cctx = zstd.ZstdCompressor() |
|
1588 | 1611 | chunker = cctx.chunker() |
|
1589 | 1612 | |
|
1590 |
list(chunker.compress(b |
|
|
1613 | list(chunker.compress(b"foo")) | |
|
1591 | 1614 | list(chunker.finish()) |
|
1592 | 1615 | |
|
1593 |
with self.assertRaisesRegex |
|
|
1594 | zstd.ZstdError, | |
|
1595 | r'cannot call flush\(\) after compression finished'): | |
|
1616 | with self.assertRaisesRegex( | |
|
1617 | zstd.ZstdError, r"cannot call flush\(\) after compression finished" | |
|
1618 | ): | |
|
1596 | 1619 | list(chunker.flush()) |
|
1597 | 1620 | |
|
1598 | 1621 | def test_finish_after_finish(self): |
|
1599 | 1622 | cctx = zstd.ZstdCompressor() |
|
1600 | 1623 | chunker = cctx.chunker() |
|
1601 | 1624 | |
|
1602 |
list(chunker.compress(b |
|
|
1625 | list(chunker.compress(b"foo")) | |
|
1603 | 1626 | list(chunker.finish()) |
|
1604 | 1627 | |
|
1605 |
with self.assertRaisesRegex |
|
|
1606 | zstd.ZstdError, | |
|
1607 | r'cannot call finish\(\) after compression finished'): | |
|
1628 | with self.assertRaisesRegex( | |
|
1629 | zstd.ZstdError, r"cannot call finish\(\) after compression finished" | |
|
1630 | ): | |
|
1608 | 1631 | list(chunker.finish()) |
|
1609 | 1632 | |
|
1610 | 1633 | |
|
1611 |
class TestCompressor_multi_compress_to_buffer( |
|
|
1634 | class TestCompressor_multi_compress_to_buffer(TestCase): | |
|
1612 | 1635 | def test_invalid_inputs(self): |
|
1613 | 1636 | cctx = zstd.ZstdCompressor() |
|
1614 | 1637 | |
|
1615 |
if not hasattr(cctx, |
|
|
1616 |
self.skipTest( |
|
|
1638 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1639 | self.skipTest("multi_compress_to_buffer not available") | |
|
1617 | 1640 | |
|
1618 | 1641 | with self.assertRaises(TypeError): |
|
1619 | 1642 | cctx.multi_compress_to_buffer(True) |
@@ -1621,28 +1644,28 class TestCompressor_multi_compress_to_b | |||
|
1621 | 1644 | with self.assertRaises(TypeError): |
|
1622 | 1645 | cctx.multi_compress_to_buffer((1, 2)) |
|
1623 | 1646 | |
|
1624 |
with self.assertRaisesRegex |
|
|
1625 |
cctx.multi_compress_to_buffer([u |
|
|
1647 | with self.assertRaisesRegex(TypeError, "item 0 not a bytes like object"): | |
|
1648 | cctx.multi_compress_to_buffer([u"foo"]) | |
|
1626 | 1649 | |
|
1627 | 1650 | def test_empty_input(self): |
|
1628 | 1651 | cctx = zstd.ZstdCompressor() |
|
1629 | 1652 | |
|
1630 |
if not hasattr(cctx, |
|
|
1631 |
self.skipTest( |
|
|
1653 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1654 | self.skipTest("multi_compress_to_buffer not available") | |
|
1632 | 1655 | |
|
1633 |
with self.assertRaisesRegex |
|
|
1656 | with self.assertRaisesRegex(ValueError, "no source elements found"): | |
|
1634 | 1657 | cctx.multi_compress_to_buffer([]) |
|
1635 | 1658 | |
|
1636 |
with self.assertRaisesRegex |
|
|
1637 |
cctx.multi_compress_to_buffer([b |
|
|
1659 | with self.assertRaisesRegex(ValueError, "source elements are empty"): | |
|
1660 | cctx.multi_compress_to_buffer([b"", b"", b""]) | |
|
1638 | 1661 | |
|
1639 | 1662 | def test_list_input(self): |
|
1640 | 1663 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1641 | 1664 | |
|
1642 |
if not hasattr(cctx, |
|
|
1643 |
self.skipTest( |
|
|
1665 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1666 | self.skipTest("multi_compress_to_buffer not available") | |
|
1644 | 1667 | |
|
1645 |
original = [b |
|
|
1668 | original = [b"foo" * 12, b"bar" * 6] | |
|
1646 | 1669 | frames = [cctx.compress(c) for c in original] |
|
1647 | 1670 | b = cctx.multi_compress_to_buffer(original) |
|
1648 | 1671 | |
@@ -1657,15 +1680,16 class TestCompressor_multi_compress_to_b | |||
|
1657 | 1680 | def test_buffer_with_segments_input(self): |
|
1658 | 1681 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1659 | 1682 | |
|
1660 |
if not hasattr(cctx, |
|
|
1661 |
self.skipTest( |
|
|
1683 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1684 | self.skipTest("multi_compress_to_buffer not available") | |
|
1662 | 1685 | |
|
1663 |
original = [b |
|
|
1686 | original = [b"foo" * 4, b"bar" * 6] | |
|
1664 | 1687 | frames = [cctx.compress(c) for c in original] |
|
1665 | 1688 | |
|
1666 |
offsets = struct.pack( |
|
|
1667 |
|
|
|
1668 | segments = zstd.BufferWithSegments(b''.join(original), offsets) | |
|
1689 | offsets = struct.pack( | |
|
1690 | "=QQQQ", 0, len(original[0]), len(original[0]), len(original[1]) | |
|
1691 | ) | |
|
1692 | segments = zstd.BufferWithSegments(b"".join(original), offsets) | |
|
1669 | 1693 | |
|
1670 | 1694 | result = cctx.multi_compress_to_buffer(segments) |
|
1671 | 1695 | |
@@ -1678,28 +1702,39 class TestCompressor_multi_compress_to_b | |||
|
1678 | 1702 | def test_buffer_with_segments_collection_input(self): |
|
1679 | 1703 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1680 | 1704 | |
|
1681 |
if not hasattr(cctx, |
|
|
1682 |
self.skipTest( |
|
|
1705 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1706 | self.skipTest("multi_compress_to_buffer not available") | |
|
1683 | 1707 | |
|
1684 | 1708 | original = [ |
|
1685 |
b |
|
|
1686 |
b |
|
|
1687 |
b |
|
|
1688 |
b |
|
|
1689 |
b |
|
|
1709 | b"foo1", | |
|
1710 | b"foo2" * 2, | |
|
1711 | b"foo3" * 3, | |
|
1712 | b"foo4" * 4, | |
|
1713 | b"foo5" * 5, | |
|
1690 | 1714 | ] |
|
1691 | 1715 | |
|
1692 | 1716 | frames = [cctx.compress(c) for c in original] |
|
1693 | 1717 | |
|
1694 |
b = b |
|
|
1695 |
b1 = zstd.BufferWithSegments( |
|
|
1696 | 0, len(original[0]), | |
|
1697 | len(original[0]), len(original[1]))) | |
|
1698 |
|
|
|
1699 | b2 = zstd.BufferWithSegments(b, struct.pack('=QQQQQQ', | |
|
1700 | 0, len(original[2]), | |
|
1701 | len(original[2]), len(original[3]), | |
|
1702 | len(original[2]) + len(original[3]), len(original[4]))) | |
|
1718 | b = b"".join([original[0], original[1]]) | |
|
1719 | b1 = zstd.BufferWithSegments( | |
|
1720 | b, | |
|
1721 | struct.pack( | |
|
1722 | "=QQQQ", 0, len(original[0]), len(original[0]), len(original[1]) | |
|
1723 | ), | |
|
1724 | ) | |
|
1725 | b = b"".join([original[2], original[3], original[4]]) | |
|
1726 | b2 = zstd.BufferWithSegments( | |
|
1727 | b, | |
|
1728 | struct.pack( | |
|
1729 | "=QQQQQQ", | |
|
1730 | 0, | |
|
1731 | len(original[2]), | |
|
1732 | len(original[2]), | |
|
1733 | len(original[3]), | |
|
1734 | len(original[2]) + len(original[3]), | |
|
1735 | len(original[4]), | |
|
1736 | ), | |
|
1737 | ) | |
|
1703 | 1738 | |
|
1704 | 1739 | c = zstd.BufferWithSegmentsCollection(b1, b2) |
|
1705 | 1740 | |
@@ -1714,16 +1749,16 class TestCompressor_multi_compress_to_b | |||
|
1714 | 1749 | # threads argument will cause multi-threaded ZSTD APIs to be used, which will |
|
1715 | 1750 | # make output different. |
|
1716 | 1751 | refcctx = zstd.ZstdCompressor(write_checksum=True) |
|
1717 |
reference = [refcctx.compress(b |
|
|
1752 | reference = [refcctx.compress(b"x" * 64), refcctx.compress(b"y" * 64)] | |
|
1718 | 1753 | |
|
1719 | 1754 | cctx = zstd.ZstdCompressor(write_checksum=True) |
|
1720 | 1755 | |
|
1721 |
if not hasattr(cctx, |
|
|
1722 |
self.skipTest( |
|
|
1756 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1757 | self.skipTest("multi_compress_to_buffer not available") | |
|
1723 | 1758 | |
|
1724 | 1759 | frames = [] |
|
1725 |
frames.extend(b |
|
|
1726 |
frames.extend(b |
|
|
1760 | frames.extend(b"x" * 64 for i in range(256)) | |
|
1761 | frames.extend(b"y" * 64 for i in range(256)) | |
|
1727 | 1762 | |
|
1728 | 1763 | result = cctx.multi_compress_to_buffer(frames, threads=-1) |
|
1729 | 1764 |
This diff has been collapsed as it changes many lines, (631 lines changed) Show them Hide them | |||
@@ -6,28 +6,31 try: | |||
|
6 | 6 | import hypothesis |
|
7 | 7 | import hypothesis.strategies as strategies |
|
8 | 8 | except ImportError: |
|
9 |
raise unittest.SkipTest( |
|
|
9 | raise unittest.SkipTest("hypothesis not available") | |
|
10 | 10 | |
|
11 | 11 | import zstandard as zstd |
|
12 | 12 | |
|
13 |
from . |
|
|
13 | from .common import ( | |
|
14 | 14 | make_cffi, |
|
15 | 15 | NonClosingBytesIO, |
|
16 | 16 | random_input_data, |
|
17 | TestCase, | |
|
17 | 18 | ) |
|
18 | 19 | |
|
19 | 20 | |
|
20 |
@unittest.skipUnless( |
|
|
21 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
21 | 22 | @make_cffi |
|
22 |
class TestCompressor_stream_reader_fuzzing( |
|
|
23 | class TestCompressor_stream_reader_fuzzing(TestCase): | |
|
23 | 24 | @hypothesis.settings( |
|
24 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
25 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
26 | level=strategies.integers(min_value=1, max_value=5), | |
|
27 | source_read_size=strategies.integers(1, 16384), | |
|
28 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
29 | def test_stream_source_read(self, original, level, source_read_size, | |
|
30 | read_size): | |
|
25 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
26 | ) | |
|
27 | @hypothesis.given( | |
|
28 | original=strategies.sampled_from(random_input_data()), | |
|
29 | level=strategies.integers(min_value=1, max_value=5), | |
|
30 | source_read_size=strategies.integers(1, 16384), | |
|
31 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
32 | ) | |
|
33 | def test_stream_source_read(self, original, level, source_read_size, read_size): | |
|
31 | 34 | if read_size == 0: |
|
32 | 35 | read_size = -1 |
|
33 | 36 | |
@@ -35,8 +38,9 class TestCompressor_stream_reader_fuzzi | |||
|
35 | 38 | ref_frame = refctx.compress(original) |
|
36 | 39 | |
|
37 | 40 | cctx = zstd.ZstdCompressor(level=level) |
|
38 |
with cctx.stream_reader( |
|
|
39 | read_size=source_read_size) as reader: | |
|
41 | with cctx.stream_reader( | |
|
42 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
43 | ) as reader: | |
|
40 | 44 | chunks = [] |
|
41 | 45 | while True: |
|
42 | 46 | chunk = reader.read(read_size) |
@@ -45,16 +49,18 class TestCompressor_stream_reader_fuzzi | |||
|
45 | 49 | |
|
46 | 50 | chunks.append(chunk) |
|
47 | 51 | |
|
48 |
self.assertEqual(b |
|
|
52 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
49 | 53 | |
|
50 | 54 | @hypothesis.settings( |
|
51 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
52 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
53 | level=strategies.integers(min_value=1, max_value=5), | |
|
54 | source_read_size=strategies.integers(1, 16384), | |
|
55 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
56 | def test_buffer_source_read(self, original, level, source_read_size, | |
|
57 | read_size): | |
|
55 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
56 | ) | |
|
57 | @hypothesis.given( | |
|
58 | original=strategies.sampled_from(random_input_data()), | |
|
59 | level=strategies.integers(min_value=1, max_value=5), | |
|
60 | source_read_size=strategies.integers(1, 16384), | |
|
61 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
62 | ) | |
|
63 | def test_buffer_source_read(self, original, level, source_read_size, read_size): | |
|
58 | 64 | if read_size == 0: |
|
59 | 65 | read_size = -1 |
|
60 | 66 | |
@@ -62,8 +68,9 class TestCompressor_stream_reader_fuzzi | |||
|
62 | 68 | ref_frame = refctx.compress(original) |
|
63 | 69 | |
|
64 | 70 | cctx = zstd.ZstdCompressor(level=level) |
|
65 |
with cctx.stream_reader( |
|
|
66 | read_size=source_read_size) as reader: | |
|
71 | with cctx.stream_reader( | |
|
72 | original, size=len(original), read_size=source_read_size | |
|
73 | ) as reader: | |
|
67 | 74 | chunks = [] |
|
68 | 75 | while True: |
|
69 | 76 | chunk = reader.read(read_size) |
@@ -72,22 +79,30 class TestCompressor_stream_reader_fuzzi | |||
|
72 | 79 | |
|
73 | 80 | chunks.append(chunk) |
|
74 | 81 | |
|
75 |
self.assertEqual(b |
|
|
82 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
76 | 83 | |
|
77 | 84 | @hypothesis.settings( |
|
78 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
79 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
80 | level=strategies.integers(min_value=1, max_value=5), | |
|
81 | source_read_size=strategies.integers(1, 16384), | |
|
82 | read_sizes=strategies.data()) | |
|
83 | def test_stream_source_read_variance(self, original, level, source_read_size, | |
|
84 | read_sizes): | |
|
85 | suppress_health_check=[ | |
|
86 | hypothesis.HealthCheck.large_base_example, | |
|
87 | hypothesis.HealthCheck.too_slow, | |
|
88 | ] | |
|
89 | ) | |
|
90 | @hypothesis.given( | |
|
91 | original=strategies.sampled_from(random_input_data()), | |
|
92 | level=strategies.integers(min_value=1, max_value=5), | |
|
93 | source_read_size=strategies.integers(1, 16384), | |
|
94 | read_sizes=strategies.data(), | |
|
95 | ) | |
|
96 | def test_stream_source_read_variance( | |
|
97 | self, original, level, source_read_size, read_sizes | |
|
98 | ): | |
|
85 | 99 | refctx = zstd.ZstdCompressor(level=level) |
|
86 | 100 | ref_frame = refctx.compress(original) |
|
87 | 101 | |
|
88 | 102 | cctx = zstd.ZstdCompressor(level=level) |
|
89 |
with cctx.stream_reader( |
|
|
90 | read_size=source_read_size) as reader: | |
|
103 | with cctx.stream_reader( | |
|
104 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
105 | ) as reader: | |
|
91 | 106 | chunks = [] |
|
92 | 107 | while True: |
|
93 | 108 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) |
@@ -97,23 +112,31 class TestCompressor_stream_reader_fuzzi | |||
|
97 | 112 | |
|
98 | 113 | chunks.append(chunk) |
|
99 | 114 | |
|
100 |
self.assertEqual(b |
|
|
115 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
101 | 116 | |
|
102 | 117 | @hypothesis.settings( |
|
103 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
104 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
105 | level=strategies.integers(min_value=1, max_value=5), | |
|
106 | source_read_size=strategies.integers(1, 16384), | |
|
107 | read_sizes=strategies.data()) | |
|
108 | def test_buffer_source_read_variance(self, original, level, source_read_size, | |
|
109 | read_sizes): | |
|
118 | suppress_health_check=[ | |
|
119 | hypothesis.HealthCheck.large_base_example, | |
|
120 | hypothesis.HealthCheck.too_slow, | |
|
121 | ] | |
|
122 | ) | |
|
123 | @hypothesis.given( | |
|
124 | original=strategies.sampled_from(random_input_data()), | |
|
125 | level=strategies.integers(min_value=1, max_value=5), | |
|
126 | source_read_size=strategies.integers(1, 16384), | |
|
127 | read_sizes=strategies.data(), | |
|
128 | ) | |
|
129 | def test_buffer_source_read_variance( | |
|
130 | self, original, level, source_read_size, read_sizes | |
|
131 | ): | |
|
110 | 132 | |
|
111 | 133 | refctx = zstd.ZstdCompressor(level=level) |
|
112 | 134 | ref_frame = refctx.compress(original) |
|
113 | 135 | |
|
114 | 136 | cctx = zstd.ZstdCompressor(level=level) |
|
115 |
with cctx.stream_reader( |
|
|
116 | read_size=source_read_size) as reader: | |
|
137 | with cctx.stream_reader( | |
|
138 | original, size=len(original), read_size=source_read_size | |
|
139 | ) as reader: | |
|
117 | 140 | chunks = [] |
|
118 | 141 | while True: |
|
119 | 142 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) |
@@ -123,22 +146,25 class TestCompressor_stream_reader_fuzzi | |||
|
123 | 146 | |
|
124 | 147 | chunks.append(chunk) |
|
125 | 148 | |
|
126 |
self.assertEqual(b |
|
|
149 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
127 | 150 | |
|
128 | 151 | @hypothesis.settings( |
|
129 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
130 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
131 | level=strategies.integers(min_value=1, max_value=5), | |
|
132 | source_read_size=strategies.integers(1, 16384), | |
|
133 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
134 | def test_stream_source_readinto(self, original, level, | |
|
135 | source_read_size, read_size): | |
|
152 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
153 | ) | |
|
154 | @hypothesis.given( | |
|
155 | original=strategies.sampled_from(random_input_data()), | |
|
156 | level=strategies.integers(min_value=1, max_value=5), | |
|
157 | source_read_size=strategies.integers(1, 16384), | |
|
158 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
159 | ) | |
|
160 | def test_stream_source_readinto(self, original, level, source_read_size, read_size): | |
|
136 | 161 | refctx = zstd.ZstdCompressor(level=level) |
|
137 | 162 | ref_frame = refctx.compress(original) |
|
138 | 163 | |
|
139 | 164 | cctx = zstd.ZstdCompressor(level=level) |
|
140 |
with cctx.stream_reader( |
|
|
141 | read_size=source_read_size) as reader: | |
|
165 | with cctx.stream_reader( | |
|
166 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
167 | ) as reader: | |
|
142 | 168 | chunks = [] |
|
143 | 169 | while True: |
|
144 | 170 | b = bytearray(read_size) |
@@ -149,23 +175,26 class TestCompressor_stream_reader_fuzzi | |||
|
149 | 175 | |
|
150 | 176 | chunks.append(bytes(b[0:count])) |
|
151 | 177 | |
|
152 |
self.assertEqual(b |
|
|
178 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
153 | 179 | |
|
154 | 180 | @hypothesis.settings( |
|
155 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
156 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
157 | level=strategies.integers(min_value=1, max_value=5), | |
|
158 | source_read_size=strategies.integers(1, 16384), | |
|
159 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
160 | def test_buffer_source_readinto(self, original, level, | |
|
161 | source_read_size, read_size): | |
|
181 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
182 | ) | |
|
183 | @hypothesis.given( | |
|
184 | original=strategies.sampled_from(random_input_data()), | |
|
185 | level=strategies.integers(min_value=1, max_value=5), | |
|
186 | source_read_size=strategies.integers(1, 16384), | |
|
187 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
188 | ) | |
|
189 | def test_buffer_source_readinto(self, original, level, source_read_size, read_size): | |
|
162 | 190 | |
|
163 | 191 | refctx = zstd.ZstdCompressor(level=level) |
|
164 | 192 | ref_frame = refctx.compress(original) |
|
165 | 193 | |
|
166 | 194 | cctx = zstd.ZstdCompressor(level=level) |
|
167 |
with cctx.stream_reader( |
|
|
168 | read_size=source_read_size) as reader: | |
|
195 | with cctx.stream_reader( | |
|
196 | original, size=len(original), read_size=source_read_size | |
|
197 | ) as reader: | |
|
169 | 198 | chunks = [] |
|
170 | 199 | while True: |
|
171 | 200 | b = bytearray(read_size) |
@@ -176,22 +205,30 class TestCompressor_stream_reader_fuzzi | |||
|
176 | 205 | |
|
177 | 206 | chunks.append(bytes(b[0:count])) |
|
178 | 207 | |
|
179 |
self.assertEqual(b |
|
|
208 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
180 | 209 | |
|
181 | 210 | @hypothesis.settings( |
|
182 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
183 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
184 | level=strategies.integers(min_value=1, max_value=5), | |
|
185 | source_read_size=strategies.integers(1, 16384), | |
|
186 | read_sizes=strategies.data()) | |
|
187 | def test_stream_source_readinto_variance(self, original, level, | |
|
188 | source_read_size, read_sizes): | |
|
211 | suppress_health_check=[ | |
|
212 | hypothesis.HealthCheck.large_base_example, | |
|
213 | hypothesis.HealthCheck.too_slow, | |
|
214 | ] | |
|
215 | ) | |
|
216 | @hypothesis.given( | |
|
217 | original=strategies.sampled_from(random_input_data()), | |
|
218 | level=strategies.integers(min_value=1, max_value=5), | |
|
219 | source_read_size=strategies.integers(1, 16384), | |
|
220 | read_sizes=strategies.data(), | |
|
221 | ) | |
|
222 | def test_stream_source_readinto_variance( | |
|
223 | self, original, level, source_read_size, read_sizes | |
|
224 | ): | |
|
189 | 225 | refctx = zstd.ZstdCompressor(level=level) |
|
190 | 226 | ref_frame = refctx.compress(original) |
|
191 | 227 | |
|
192 | 228 | cctx = zstd.ZstdCompressor(level=level) |
|
193 |
with cctx.stream_reader( |
|
|
194 | read_size=source_read_size) as reader: | |
|
229 | with cctx.stream_reader( | |
|
230 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
231 | ) as reader: | |
|
195 | 232 | chunks = [] |
|
196 | 233 | while True: |
|
197 | 234 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
@@ -203,23 +240,31 class TestCompressor_stream_reader_fuzzi | |||
|
203 | 240 | |
|
204 | 241 | chunks.append(bytes(b[0:count])) |
|
205 | 242 | |
|
206 |
self.assertEqual(b |
|
|
243 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
207 | 244 | |
|
208 | 245 | @hypothesis.settings( |
|
209 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
210 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
211 | level=strategies.integers(min_value=1, max_value=5), | |
|
212 | source_read_size=strategies.integers(1, 16384), | |
|
213 | read_sizes=strategies.data()) | |
|
214 | def test_buffer_source_readinto_variance(self, original, level, | |
|
215 | source_read_size, read_sizes): | |
|
246 | suppress_health_check=[ | |
|
247 | hypothesis.HealthCheck.large_base_example, | |
|
248 | hypothesis.HealthCheck.too_slow, | |
|
249 | ] | |
|
250 | ) | |
|
251 | @hypothesis.given( | |
|
252 | original=strategies.sampled_from(random_input_data()), | |
|
253 | level=strategies.integers(min_value=1, max_value=5), | |
|
254 | source_read_size=strategies.integers(1, 16384), | |
|
255 | read_sizes=strategies.data(), | |
|
256 | ) | |
|
257 | def test_buffer_source_readinto_variance( | |
|
258 | self, original, level, source_read_size, read_sizes | |
|
259 | ): | |
|
216 | 260 | |
|
217 | 261 | refctx = zstd.ZstdCompressor(level=level) |
|
218 | 262 | ref_frame = refctx.compress(original) |
|
219 | 263 | |
|
220 | 264 | cctx = zstd.ZstdCompressor(level=level) |
|
221 |
with cctx.stream_reader( |
|
|
222 | read_size=source_read_size) as reader: | |
|
265 | with cctx.stream_reader( | |
|
266 | original, size=len(original), read_size=source_read_size | |
|
267 | ) as reader: | |
|
223 | 268 | chunks = [] |
|
224 | 269 | while True: |
|
225 | 270 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
@@ -231,16 +276,18 class TestCompressor_stream_reader_fuzzi | |||
|
231 | 276 | |
|
232 | 277 | chunks.append(bytes(b[0:count])) |
|
233 | 278 | |
|
234 |
self.assertEqual(b |
|
|
279 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
235 | 280 | |
|
236 | 281 | @hypothesis.settings( |
|
237 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
238 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
239 | level=strategies.integers(min_value=1, max_value=5), | |
|
240 | source_read_size=strategies.integers(1, 16384), | |
|
241 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
242 | def test_stream_source_read1(self, original, level, source_read_size, | |
|
243 | read_size): | |
|
282 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
283 | ) | |
|
284 | @hypothesis.given( | |
|
285 | original=strategies.sampled_from(random_input_data()), | |
|
286 | level=strategies.integers(min_value=1, max_value=5), | |
|
287 | source_read_size=strategies.integers(1, 16384), | |
|
288 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
289 | ) | |
|
290 | def test_stream_source_read1(self, original, level, source_read_size, read_size): | |
|
244 | 291 | if read_size == 0: |
|
245 | 292 | read_size = -1 |
|
246 | 293 | |
@@ -248,8 +295,9 class TestCompressor_stream_reader_fuzzi | |||
|
248 | 295 | ref_frame = refctx.compress(original) |
|
249 | 296 | |
|
250 | 297 | cctx = zstd.ZstdCompressor(level=level) |
|
251 |
with cctx.stream_reader( |
|
|
252 | read_size=source_read_size) as reader: | |
|
298 | with cctx.stream_reader( | |
|
299 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
300 | ) as reader: | |
|
253 | 301 | chunks = [] |
|
254 | 302 | while True: |
|
255 | 303 | chunk = reader.read1(read_size) |
@@ -258,16 +306,18 class TestCompressor_stream_reader_fuzzi | |||
|
258 | 306 | |
|
259 | 307 | chunks.append(chunk) |
|
260 | 308 | |
|
261 |
self.assertEqual(b |
|
|
309 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
262 | 310 | |
|
263 | 311 | @hypothesis.settings( |
|
264 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
265 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
266 | level=strategies.integers(min_value=1, max_value=5), | |
|
267 | source_read_size=strategies.integers(1, 16384), | |
|
268 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
269 | def test_buffer_source_read1(self, original, level, source_read_size, | |
|
270 | read_size): | |
|
312 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
313 | ) | |
|
314 | @hypothesis.given( | |
|
315 | original=strategies.sampled_from(random_input_data()), | |
|
316 | level=strategies.integers(min_value=1, max_value=5), | |
|
317 | source_read_size=strategies.integers(1, 16384), | |
|
318 | read_size=strategies.integers(-1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
319 | ) | |
|
320 | def test_buffer_source_read1(self, original, level, source_read_size, read_size): | |
|
271 | 321 | if read_size == 0: |
|
272 | 322 | read_size = -1 |
|
273 | 323 | |
@@ -275,8 +325,9 class TestCompressor_stream_reader_fuzzi | |||
|
275 | 325 | ref_frame = refctx.compress(original) |
|
276 | 326 | |
|
277 | 327 | cctx = zstd.ZstdCompressor(level=level) |
|
278 |
with cctx.stream_reader( |
|
|
279 | read_size=source_read_size) as reader: | |
|
328 | with cctx.stream_reader( | |
|
329 | original, size=len(original), read_size=source_read_size | |
|
330 | ) as reader: | |
|
280 | 331 | chunks = [] |
|
281 | 332 | while True: |
|
282 | 333 | chunk = reader.read1(read_size) |
@@ -285,22 +336,30 class TestCompressor_stream_reader_fuzzi | |||
|
285 | 336 | |
|
286 | 337 | chunks.append(chunk) |
|
287 | 338 | |
|
288 |
self.assertEqual(b |
|
|
339 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
289 | 340 | |
|
290 | 341 | @hypothesis.settings( |
|
291 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
292 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
293 | level=strategies.integers(min_value=1, max_value=5), | |
|
294 | source_read_size=strategies.integers(1, 16384), | |
|
295 | read_sizes=strategies.data()) | |
|
296 | def test_stream_source_read1_variance(self, original, level, source_read_size, | |
|
297 | read_sizes): | |
|
342 | suppress_health_check=[ | |
|
343 | hypothesis.HealthCheck.large_base_example, | |
|
344 | hypothesis.HealthCheck.too_slow, | |
|
345 | ] | |
|
346 | ) | |
|
347 | @hypothesis.given( | |
|
348 | original=strategies.sampled_from(random_input_data()), | |
|
349 | level=strategies.integers(min_value=1, max_value=5), | |
|
350 | source_read_size=strategies.integers(1, 16384), | |
|
351 | read_sizes=strategies.data(), | |
|
352 | ) | |
|
353 | def test_stream_source_read1_variance( | |
|
354 | self, original, level, source_read_size, read_sizes | |
|
355 | ): | |
|
298 | 356 | refctx = zstd.ZstdCompressor(level=level) |
|
299 | 357 | ref_frame = refctx.compress(original) |
|
300 | 358 | |
|
301 | 359 | cctx = zstd.ZstdCompressor(level=level) |
|
302 |
with cctx.stream_reader( |
|
|
303 | read_size=source_read_size) as reader: | |
|
360 | with cctx.stream_reader( | |
|
361 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
362 | ) as reader: | |
|
304 | 363 | chunks = [] |
|
305 | 364 | while True: |
|
306 | 365 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) |
@@ -310,23 +369,31 class TestCompressor_stream_reader_fuzzi | |||
|
310 | 369 | |
|
311 | 370 | chunks.append(chunk) |
|
312 | 371 | |
|
313 |
self.assertEqual(b |
|
|
372 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
314 | 373 | |
|
315 | 374 | @hypothesis.settings( |
|
316 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
317 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
318 | level=strategies.integers(min_value=1, max_value=5), | |
|
319 | source_read_size=strategies.integers(1, 16384), | |
|
320 | read_sizes=strategies.data()) | |
|
321 | def test_buffer_source_read1_variance(self, original, level, source_read_size, | |
|
322 | read_sizes): | |
|
375 | suppress_health_check=[ | |
|
376 | hypothesis.HealthCheck.large_base_example, | |
|
377 | hypothesis.HealthCheck.too_slow, | |
|
378 | ] | |
|
379 | ) | |
|
380 | @hypothesis.given( | |
|
381 | original=strategies.sampled_from(random_input_data()), | |
|
382 | level=strategies.integers(min_value=1, max_value=5), | |
|
383 | source_read_size=strategies.integers(1, 16384), | |
|
384 | read_sizes=strategies.data(), | |
|
385 | ) | |
|
386 | def test_buffer_source_read1_variance( | |
|
387 | self, original, level, source_read_size, read_sizes | |
|
388 | ): | |
|
323 | 389 | |
|
324 | 390 | refctx = zstd.ZstdCompressor(level=level) |
|
325 | 391 | ref_frame = refctx.compress(original) |
|
326 | 392 | |
|
327 | 393 | cctx = zstd.ZstdCompressor(level=level) |
|
328 |
with cctx.stream_reader( |
|
|
329 | read_size=source_read_size) as reader: | |
|
394 | with cctx.stream_reader( | |
|
395 | original, size=len(original), read_size=source_read_size | |
|
396 | ) as reader: | |
|
330 | 397 | chunks = [] |
|
331 | 398 | while True: |
|
332 | 399 | read_size = read_sizes.draw(strategies.integers(-1, 16384)) |
@@ -336,17 +403,20 class TestCompressor_stream_reader_fuzzi | |||
|
336 | 403 | |
|
337 | 404 | chunks.append(chunk) |
|
338 | 405 | |
|
339 |
self.assertEqual(b |
|
|
340 | ||
|
406 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
341 | 407 | |
|
342 | 408 | @hypothesis.settings( |
|
343 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
344 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
345 | level=strategies.integers(min_value=1, max_value=5), | |
|
346 | source_read_size=strategies.integers(1, 16384), | |
|
347 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
348 | def test_stream_source_readinto1(self, original, level, source_read_size, | |
|
349 | read_size): | |
|
409 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
410 | ) | |
|
411 | @hypothesis.given( | |
|
412 | original=strategies.sampled_from(random_input_data()), | |
|
413 | level=strategies.integers(min_value=1, max_value=5), | |
|
414 | source_read_size=strategies.integers(1, 16384), | |
|
415 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
416 | ) | |
|
417 | def test_stream_source_readinto1( | |
|
418 | self, original, level, source_read_size, read_size | |
|
419 | ): | |
|
350 | 420 | if read_size == 0: |
|
351 | 421 | read_size = -1 |
|
352 | 422 | |
@@ -354,8 +424,9 class TestCompressor_stream_reader_fuzzi | |||
|
354 | 424 | ref_frame = refctx.compress(original) |
|
355 | 425 | |
|
356 | 426 | cctx = zstd.ZstdCompressor(level=level) |
|
357 |
with cctx.stream_reader( |
|
|
358 | read_size=source_read_size) as reader: | |
|
427 | with cctx.stream_reader( | |
|
428 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
429 | ) as reader: | |
|
359 | 430 | chunks = [] |
|
360 | 431 | while True: |
|
361 | 432 | b = bytearray(read_size) |
@@ -366,16 +437,20 class TestCompressor_stream_reader_fuzzi | |||
|
366 | 437 | |
|
367 | 438 | chunks.append(bytes(b[0:count])) |
|
368 | 439 | |
|
369 |
self.assertEqual(b |
|
|
440 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
370 | 441 | |
|
371 | 442 | @hypothesis.settings( |
|
372 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
373 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
374 | level=strategies.integers(min_value=1, max_value=5), | |
|
375 | source_read_size=strategies.integers(1, 16384), | |
|
376 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
377 | def test_buffer_source_readinto1(self, original, level, source_read_size, | |
|
378 | read_size): | |
|
443 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
444 | ) | |
|
445 | @hypothesis.given( | |
|
446 | original=strategies.sampled_from(random_input_data()), | |
|
447 | level=strategies.integers(min_value=1, max_value=5), | |
|
448 | source_read_size=strategies.integers(1, 16384), | |
|
449 | read_size=strategies.integers(1, zstd.COMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
450 | ) | |
|
451 | def test_buffer_source_readinto1( | |
|
452 | self, original, level, source_read_size, read_size | |
|
453 | ): | |
|
379 | 454 | if read_size == 0: |
|
380 | 455 | read_size = -1 |
|
381 | 456 | |
@@ -383,8 +458,9 class TestCompressor_stream_reader_fuzzi | |||
|
383 | 458 | ref_frame = refctx.compress(original) |
|
384 | 459 | |
|
385 | 460 | cctx = zstd.ZstdCompressor(level=level) |
|
386 |
with cctx.stream_reader( |
|
|
387 | read_size=source_read_size) as reader: | |
|
461 | with cctx.stream_reader( | |
|
462 | original, size=len(original), read_size=source_read_size | |
|
463 | ) as reader: | |
|
388 | 464 | chunks = [] |
|
389 | 465 | while True: |
|
390 | 466 | b = bytearray(read_size) |
@@ -395,22 +471,30 class TestCompressor_stream_reader_fuzzi | |||
|
395 | 471 | |
|
396 | 472 | chunks.append(bytes(b[0:count])) |
|
397 | 473 | |
|
398 |
self.assertEqual(b |
|
|
474 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
399 | 475 | |
|
400 | 476 | @hypothesis.settings( |
|
401 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
402 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
403 | level=strategies.integers(min_value=1, max_value=5), | |
|
404 | source_read_size=strategies.integers(1, 16384), | |
|
405 | read_sizes=strategies.data()) | |
|
406 | def test_stream_source_readinto1_variance(self, original, level, source_read_size, | |
|
407 | read_sizes): | |
|
477 | suppress_health_check=[ | |
|
478 | hypothesis.HealthCheck.large_base_example, | |
|
479 | hypothesis.HealthCheck.too_slow, | |
|
480 | ] | |
|
481 | ) | |
|
482 | @hypothesis.given( | |
|
483 | original=strategies.sampled_from(random_input_data()), | |
|
484 | level=strategies.integers(min_value=1, max_value=5), | |
|
485 | source_read_size=strategies.integers(1, 16384), | |
|
486 | read_sizes=strategies.data(), | |
|
487 | ) | |
|
488 | def test_stream_source_readinto1_variance( | |
|
489 | self, original, level, source_read_size, read_sizes | |
|
490 | ): | |
|
408 | 491 | refctx = zstd.ZstdCompressor(level=level) |
|
409 | 492 | ref_frame = refctx.compress(original) |
|
410 | 493 | |
|
411 | 494 | cctx = zstd.ZstdCompressor(level=level) |
|
412 |
with cctx.stream_reader( |
|
|
413 | read_size=source_read_size) as reader: | |
|
495 | with cctx.stream_reader( | |
|
496 | io.BytesIO(original), size=len(original), read_size=source_read_size | |
|
497 | ) as reader: | |
|
414 | 498 | chunks = [] |
|
415 | 499 | while True: |
|
416 | 500 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
@@ -422,23 +506,31 class TestCompressor_stream_reader_fuzzi | |||
|
422 | 506 | |
|
423 | 507 | chunks.append(bytes(b[0:count])) |
|
424 | 508 | |
|
425 |
self.assertEqual(b |
|
|
509 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
426 | 510 | |
|
427 | 511 | @hypothesis.settings( |
|
428 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
429 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
430 | level=strategies.integers(min_value=1, max_value=5), | |
|
431 | source_read_size=strategies.integers(1, 16384), | |
|
432 | read_sizes=strategies.data()) | |
|
433 | def test_buffer_source_readinto1_variance(self, original, level, source_read_size, | |
|
434 | read_sizes): | |
|
512 | suppress_health_check=[ | |
|
513 | hypothesis.HealthCheck.large_base_example, | |
|
514 | hypothesis.HealthCheck.too_slow, | |
|
515 | ] | |
|
516 | ) | |
|
517 | @hypothesis.given( | |
|
518 | original=strategies.sampled_from(random_input_data()), | |
|
519 | level=strategies.integers(min_value=1, max_value=5), | |
|
520 | source_read_size=strategies.integers(1, 16384), | |
|
521 | read_sizes=strategies.data(), | |
|
522 | ) | |
|
523 | def test_buffer_source_readinto1_variance( | |
|
524 | self, original, level, source_read_size, read_sizes | |
|
525 | ): | |
|
435 | 526 | |
|
436 | 527 | refctx = zstd.ZstdCompressor(level=level) |
|
437 | 528 | ref_frame = refctx.compress(original) |
|
438 | 529 | |
|
439 | 530 | cctx = zstd.ZstdCompressor(level=level) |
|
440 |
with cctx.stream_reader( |
|
|
441 | read_size=source_read_size) as reader: | |
|
531 | with cctx.stream_reader( | |
|
532 | original, size=len(original), read_size=source_read_size | |
|
533 | ) as reader: | |
|
442 | 534 | chunks = [] |
|
443 | 535 | while True: |
|
444 | 536 | read_size = read_sizes.draw(strategies.integers(1, 16384)) |
@@ -450,35 +542,40 class TestCompressor_stream_reader_fuzzi | |||
|
450 | 542 | |
|
451 | 543 | chunks.append(bytes(b[0:count])) |
|
452 | 544 | |
|
453 |
self.assertEqual(b |
|
|
454 | ||
|
545 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
455 | 546 | |
|
456 | 547 | |
|
457 |
@unittest.skipUnless( |
|
|
548 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
458 | 549 | @make_cffi |
|
459 |
class TestCompressor_stream_writer_fuzzing( |
|
|
460 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
461 | level=strategies.integers(min_value=1, max_value=5), | |
|
462 |
|
|
|
550 | class TestCompressor_stream_writer_fuzzing(TestCase): | |
|
551 | @hypothesis.given( | |
|
552 | original=strategies.sampled_from(random_input_data()), | |
|
553 | level=strategies.integers(min_value=1, max_value=5), | |
|
554 | write_size=strategies.integers(min_value=1, max_value=1048576), | |
|
555 | ) | |
|
463 | 556 | def test_write_size_variance(self, original, level, write_size): |
|
464 | 557 | refctx = zstd.ZstdCompressor(level=level) |
|
465 | 558 | ref_frame = refctx.compress(original) |
|
466 | 559 | |
|
467 | 560 | cctx = zstd.ZstdCompressor(level=level) |
|
468 | 561 | b = NonClosingBytesIO() |
|
469 | with cctx.stream_writer(b, size=len(original), write_size=write_size) as compressor: | |
|
562 | with cctx.stream_writer( | |
|
563 | b, size=len(original), write_size=write_size | |
|
564 | ) as compressor: | |
|
470 | 565 | compressor.write(original) |
|
471 | 566 | |
|
472 | 567 | self.assertEqual(b.getvalue(), ref_frame) |
|
473 | 568 | |
|
474 | 569 | |
|
475 |
@unittest.skipUnless( |
|
|
570 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
476 | 571 | @make_cffi |
|
477 |
class TestCompressor_copy_stream_fuzzing( |
|
|
478 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
479 | level=strategies.integers(min_value=1, max_value=5), | |
|
480 |
|
|
|
481 |
|
|
|
572 | class TestCompressor_copy_stream_fuzzing(TestCase): | |
|
573 | @hypothesis.given( | |
|
574 | original=strategies.sampled_from(random_input_data()), | |
|
575 | level=strategies.integers(min_value=1, max_value=5), | |
|
576 | read_size=strategies.integers(min_value=1, max_value=1048576), | |
|
577 | write_size=strategies.integers(min_value=1, max_value=1048576), | |
|
578 | ) | |
|
482 | 579 | def test_read_write_size_variance(self, original, level, read_size, write_size): |
|
483 | 580 | refctx = zstd.ZstdCompressor(level=level) |
|
484 | 581 | ref_frame = refctx.compress(original) |
@@ -487,20 +584,27 class TestCompressor_copy_stream_fuzzing | |||
|
487 | 584 | source = io.BytesIO(original) |
|
488 | 585 | dest = io.BytesIO() |
|
489 | 586 | |
|
490 | cctx.copy_stream(source, dest, size=len(original), read_size=read_size, | |
|
491 | write_size=write_size) | |
|
587 | cctx.copy_stream( | |
|
588 | source, dest, size=len(original), read_size=read_size, write_size=write_size | |
|
589 | ) | |
|
492 | 590 | |
|
493 | 591 | self.assertEqual(dest.getvalue(), ref_frame) |
|
494 | 592 | |
|
495 | 593 | |
|
496 |
@unittest.skipUnless( |
|
|
594 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
497 | 595 | @make_cffi |
|
498 |
class TestCompressor_compressobj_fuzzing( |
|
|
596 | class TestCompressor_compressobj_fuzzing(TestCase): | |
|
499 | 597 | @hypothesis.settings( |
|
500 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
501 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
502 | level=strategies.integers(min_value=1, max_value=5), | |
|
503 | chunk_sizes=strategies.data()) | |
|
598 | suppress_health_check=[ | |
|
599 | hypothesis.HealthCheck.large_base_example, | |
|
600 | hypothesis.HealthCheck.too_slow, | |
|
601 | ] | |
|
602 | ) | |
|
603 | @hypothesis.given( | |
|
604 | original=strategies.sampled_from(random_input_data()), | |
|
605 | level=strategies.integers(min_value=1, max_value=5), | |
|
606 | chunk_sizes=strategies.data(), | |
|
607 | ) | |
|
504 | 608 | def test_random_input_sizes(self, original, level, chunk_sizes): |
|
505 | 609 | refctx = zstd.ZstdCompressor(level=level) |
|
506 | 610 | ref_frame = refctx.compress(original) |
@@ -512,7 +616,7 class TestCompressor_compressobj_fuzzing | |||
|
512 | 616 | i = 0 |
|
513 | 617 | while True: |
|
514 | 618 | chunk_size = chunk_sizes.draw(strategies.integers(1, 4096)) |
|
515 | source = original[i:i + chunk_size] | |
|
619 | source = original[i : i + chunk_size] | |
|
516 | 620 | if not source: |
|
517 | 621 | break |
|
518 | 622 | |
@@ -521,14 +625,20 class TestCompressor_compressobj_fuzzing | |||
|
521 | 625 | |
|
522 | 626 | chunks.append(cobj.flush()) |
|
523 | 627 | |
|
524 |
self.assertEqual(b |
|
|
628 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
525 | 629 | |
|
526 | 630 | @hypothesis.settings( |
|
527 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
528 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
529 | level=strategies.integers(min_value=1, max_value=5), | |
|
530 | chunk_sizes=strategies.data(), | |
|
531 | flushes=strategies.data()) | |
|
631 | suppress_health_check=[ | |
|
632 | hypothesis.HealthCheck.large_base_example, | |
|
633 | hypothesis.HealthCheck.too_slow, | |
|
634 | ] | |
|
635 | ) | |
|
636 | @hypothesis.given( | |
|
637 | original=strategies.sampled_from(random_input_data()), | |
|
638 | level=strategies.integers(min_value=1, max_value=5), | |
|
639 | chunk_sizes=strategies.data(), | |
|
640 | flushes=strategies.data(), | |
|
641 | ) | |
|
532 | 642 | def test_flush_block(self, original, level, chunk_sizes, flushes): |
|
533 | 643 | cctx = zstd.ZstdCompressor(level=level) |
|
534 | 644 | cobj = cctx.compressobj() |
@@ -541,7 +651,7 class TestCompressor_compressobj_fuzzing | |||
|
541 | 651 | i = 0 |
|
542 | 652 | while True: |
|
543 | 653 | input_size = chunk_sizes.draw(strategies.integers(1, 4096)) |
|
544 | source = original[i:i + input_size] | |
|
654 | source = original[i : i + input_size] | |
|
545 | 655 | if not source: |
|
546 | 656 | break |
|
547 | 657 | |
@@ -558,24 +668,28 class TestCompressor_compressobj_fuzzing | |||
|
558 | 668 | compressed_chunks.append(chunk) |
|
559 | 669 | decompressed_chunks.append(dobj.decompress(chunk)) |
|
560 | 670 | |
|
561 |
self.assertEqual(b |
|
|
671 | self.assertEqual(b"".join(decompressed_chunks), original[0:i]) | |
|
562 | 672 | |
|
563 | 673 | chunk = cobj.flush(zstd.COMPRESSOBJ_FLUSH_FINISH) |
|
564 | 674 | compressed_chunks.append(chunk) |
|
565 | 675 | decompressed_chunks.append(dobj.decompress(chunk)) |
|
566 | 676 | |
|
567 | self.assertEqual(dctx.decompress(b''.join(compressed_chunks), | |
|
568 | max_output_size=len(original)), | |
|
569 |
|
|
|
570 | self.assertEqual(b''.join(decompressed_chunks), original) | |
|
677 | self.assertEqual( | |
|
678 | dctx.decompress(b"".join(compressed_chunks), max_output_size=len(original)), | |
|
679 | original, | |
|
680 | ) | |
|
681 | self.assertEqual(b"".join(decompressed_chunks), original) | |
|
682 | ||
|
571 | 683 | |
|
572 |
@unittest.skipUnless( |
|
|
684 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
573 | 685 | @make_cffi |
|
574 |
class TestCompressor_read_to_iter_fuzzing( |
|
|
575 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
576 | level=strategies.integers(min_value=1, max_value=5), | |
|
577 |
|
|
|
578 |
|
|
|
686 | class TestCompressor_read_to_iter_fuzzing(TestCase): | |
|
687 | @hypothesis.given( | |
|
688 | original=strategies.sampled_from(random_input_data()), | |
|
689 | level=strategies.integers(min_value=1, max_value=5), | |
|
690 | read_size=strategies.integers(min_value=1, max_value=4096), | |
|
691 | write_size=strategies.integers(min_value=1, max_value=4096), | |
|
692 | ) | |
|
579 | 693 | def test_read_write_size_variance(self, original, level, read_size, write_size): |
|
580 | 694 | refcctx = zstd.ZstdCompressor(level=level) |
|
581 | 695 | ref_frame = refcctx.compress(original) |
@@ -583,32 +697,35 class TestCompressor_read_to_iter_fuzzin | |||
|
583 | 697 | source = io.BytesIO(original) |
|
584 | 698 | |
|
585 | 699 | cctx = zstd.ZstdCompressor(level=level) |
|
586 | chunks = list(cctx.read_to_iter(source, size=len(original), | |
|
587 | read_size=read_size, | |
|
588 | write_size=write_size)) | |
|
700 | chunks = list( | |
|
701 | cctx.read_to_iter( | |
|
702 | source, size=len(original), read_size=read_size, write_size=write_size | |
|
703 | ) | |
|
704 | ) | |
|
589 | 705 | |
|
590 |
self.assertEqual(b |
|
|
706 | self.assertEqual(b"".join(chunks), ref_frame) | |
|
591 | 707 | |
|
592 | 708 | |
|
593 |
@unittest.skipUnless( |
|
|
594 |
class TestCompressor_multi_compress_to_buffer_fuzzing( |
|
|
595 | @hypothesis.given(original=strategies.lists(strategies.sampled_from(random_input_data()), | |
|
596 | min_size=1, max_size=1024), | |
|
597 | threads=strategies.integers(min_value=1, max_value=8), | |
|
598 | use_dict=strategies.booleans()) | |
|
709 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
710 | class TestCompressor_multi_compress_to_buffer_fuzzing(TestCase): | |
|
711 | @hypothesis.given( | |
|
712 | original=strategies.lists( | |
|
713 | strategies.sampled_from(random_input_data()), min_size=1, max_size=1024 | |
|
714 | ), | |
|
715 | threads=strategies.integers(min_value=1, max_value=8), | |
|
716 | use_dict=strategies.booleans(), | |
|
717 | ) | |
|
599 | 718 | def test_data_equivalence(self, original, threads, use_dict): |
|
600 | 719 | kwargs = {} |
|
601 | 720 | |
|
602 | 721 | # Use a content dictionary because it is cheap to create. |
|
603 | 722 | if use_dict: |
|
604 |
kwargs[ |
|
|
723 | kwargs["dict_data"] = zstd.ZstdCompressionDict(original[0]) | |
|
605 | 724 | |
|
606 | cctx = zstd.ZstdCompressor(level=1, | |
|
607 | write_checksum=True, | |
|
608 | **kwargs) | |
|
725 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True, **kwargs) | |
|
609 | 726 | |
|
610 |
if not hasattr(cctx, |
|
|
611 |
self.skipTest( |
|
|
727 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
728 | self.skipTest("multi_compress_to_buffer not available") | |
|
612 | 729 | |
|
613 | 730 | result = cctx.multi_compress_to_buffer(original, threads=-1) |
|
614 | 731 | |
@@ -624,17 +741,21 class TestCompressor_multi_compress_to_b | |||
|
624 | 741 | self.assertEqual(dctx.decompress(frame), original[i]) |
|
625 | 742 | |
|
626 | 743 | |
|
627 |
@unittest.skipUnless( |
|
|
744 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
628 | 745 | @make_cffi |
|
629 |
class TestCompressor_chunker_fuzzing( |
|
|
746 | class TestCompressor_chunker_fuzzing(TestCase): | |
|
630 | 747 | @hypothesis.settings( |
|
631 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
632 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
633 | level=strategies.integers(min_value=1, max_value=5), | |
|
634 | chunk_size=strategies.integers( | |
|
635 | min_value=1, | |
|
636 | max_value=32 * 1048576), | |
|
637 | input_sizes=strategies.data()) | |
|
748 | suppress_health_check=[ | |
|
749 | hypothesis.HealthCheck.large_base_example, | |
|
750 | hypothesis.HealthCheck.too_slow, | |
|
751 | ] | |
|
752 | ) | |
|
753 | @hypothesis.given( | |
|
754 | original=strategies.sampled_from(random_input_data()), | |
|
755 | level=strategies.integers(min_value=1, max_value=5), | |
|
756 | chunk_size=strategies.integers(min_value=1, max_value=32 * 1048576), | |
|
757 | input_sizes=strategies.data(), | |
|
758 | ) | |
|
638 | 759 | def test_random_input_sizes(self, original, level, chunk_size, input_sizes): |
|
639 | 760 | cctx = zstd.ZstdCompressor(level=level) |
|
640 | 761 | chunker = cctx.chunker(chunk_size=chunk_size) |
@@ -643,7 +764,7 class TestCompressor_chunker_fuzzing(uni | |||
|
643 | 764 | i = 0 |
|
644 | 765 | while True: |
|
645 | 766 | input_size = input_sizes.draw(strategies.integers(1, 4096)) |
|
646 | source = original[i:i + input_size] | |
|
767 | source = original[i : i + input_size] | |
|
647 | 768 | if not source: |
|
648 | 769 | break |
|
649 | 770 | |
@@ -654,23 +775,26 class TestCompressor_chunker_fuzzing(uni | |||
|
654 | 775 | |
|
655 | 776 | dctx = zstd.ZstdDecompressor() |
|
656 | 777 | |
|
657 |
self.assertEqual( |
|
|
658 | max_output_size=len(original)), | |
|
659 | original) | |
|
778 | self.assertEqual( | |
|
779 | dctx.decompress(b"".join(chunks), max_output_size=len(original)), original | |
|
780 | ) | |
|
660 | 781 | |
|
661 | 782 | self.assertTrue(all(len(chunk) == chunk_size for chunk in chunks[:-1])) |
|
662 | 783 | |
|
663 | 784 | @hypothesis.settings( |
|
664 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
665 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
666 | level=strategies.integers(min_value=1, max_value=5), | |
|
667 | chunk_size=strategies.integers( | |
|
668 | min_value=1, | |
|
669 | max_value=32 * 1048576), | |
|
670 | input_sizes=strategies.data(), | |
|
671 | flushes=strategies.data()) | |
|
672 | def test_flush_block(self, original, level, chunk_size, input_sizes, | |
|
673 | flushes): | |
|
785 | suppress_health_check=[ | |
|
786 | hypothesis.HealthCheck.large_base_example, | |
|
787 | hypothesis.HealthCheck.too_slow, | |
|
788 | ] | |
|
789 | ) | |
|
790 | @hypothesis.given( | |
|
791 | original=strategies.sampled_from(random_input_data()), | |
|
792 | level=strategies.integers(min_value=1, max_value=5), | |
|
793 | chunk_size=strategies.integers(min_value=1, max_value=32 * 1048576), | |
|
794 | input_sizes=strategies.data(), | |
|
795 | flushes=strategies.data(), | |
|
796 | ) | |
|
797 | def test_flush_block(self, original, level, chunk_size, input_sizes, flushes): | |
|
674 | 798 | cctx = zstd.ZstdCompressor(level=level) |
|
675 | 799 | chunker = cctx.chunker(chunk_size=chunk_size) |
|
676 | 800 | |
@@ -682,7 +806,7 class TestCompressor_chunker_fuzzing(uni | |||
|
682 | 806 | i = 0 |
|
683 | 807 | while True: |
|
684 | 808 | input_size = input_sizes.draw(strategies.integers(1, 4096)) |
|
685 | source = original[i:i + input_size] | |
|
809 | source = original[i : i + input_size] | |
|
686 | 810 | if not source: |
|
687 | 811 | break |
|
688 | 812 | |
@@ -690,22 +814,23 class TestCompressor_chunker_fuzzing(uni | |||
|
690 | 814 | |
|
691 | 815 | chunks = list(chunker.compress(source)) |
|
692 | 816 | compressed_chunks.extend(chunks) |
|
693 |
decompressed_chunks.append(dobj.decompress(b |
|
|
817 | decompressed_chunks.append(dobj.decompress(b"".join(chunks))) | |
|
694 | 818 | |
|
695 | 819 | if not flushes.draw(strategies.booleans()): |
|
696 | 820 | continue |
|
697 | 821 | |
|
698 | 822 | chunks = list(chunker.flush()) |
|
699 | 823 | compressed_chunks.extend(chunks) |
|
700 |
decompressed_chunks.append(dobj.decompress(b |
|
|
824 | decompressed_chunks.append(dobj.decompress(b"".join(chunks))) | |
|
701 | 825 | |
|
702 |
self.assertEqual(b |
|
|
826 | self.assertEqual(b"".join(decompressed_chunks), original[0:i]) | |
|
703 | 827 | |
|
704 | 828 | chunks = list(chunker.finish()) |
|
705 | 829 | compressed_chunks.extend(chunks) |
|
706 |
decompressed_chunks.append(dobj.decompress(b |
|
|
830 | decompressed_chunks.append(dobj.decompress(b"".join(chunks))) | |
|
707 | 831 | |
|
708 | self.assertEqual(dctx.decompress(b''.join(compressed_chunks), | |
|
709 | max_output_size=len(original)), | |
|
710 |
|
|
|
711 | self.assertEqual(b''.join(decompressed_chunks), original) No newline at end of file | |
|
832 | self.assertEqual( | |
|
833 | dctx.decompress(b"".join(compressed_chunks), max_output_size=len(original)), | |
|
834 | original, | |
|
835 | ) | |
|
836 | self.assertEqual(b"".join(decompressed_chunks), original) |
@@ -3,29 +3,34 import unittest | |||
|
3 | 3 | |
|
4 | 4 | import zstandard as zstd |
|
5 | 5 | |
|
6 |
from . |
|
|
6 | from .common import ( | |
|
7 | 7 | make_cffi, |
|
8 | TestCase, | |
|
8 | 9 | ) |
|
9 | 10 | |
|
10 | 11 | |
|
11 | 12 | @make_cffi |
|
12 |
class TestCompressionParameters( |
|
|
13 | class TestCompressionParameters(TestCase): | |
|
13 | 14 | def test_bounds(self): |
|
14 |
zstd.ZstdCompressionParameters( |
|
|
15 |
|
|
|
16 |
|
|
|
17 |
|
|
|
18 | min_match=zstd.MINMATCH_MIN + 1, | |
|
19 | target_length=zstd.TARGETLENGTH_MIN, | |
|
20 | strategy=zstd.STRATEGY_FAST) | |
|
15 | zstd.ZstdCompressionParameters( | |
|
16 | window_log=zstd.WINDOWLOG_MIN, | |
|
17 | chain_log=zstd.CHAINLOG_MIN, | |
|
18 | hash_log=zstd.HASHLOG_MIN, | |
|
19 | search_log=zstd.SEARCHLOG_MIN, | |
|
20 | min_match=zstd.MINMATCH_MIN + 1, | |
|
21 | target_length=zstd.TARGETLENGTH_MIN, | |
|
22 | strategy=zstd.STRATEGY_FAST, | |
|
23 | ) | |
|
21 | 24 | |
|
22 |
zstd.ZstdCompressionParameters( |
|
|
23 |
|
|
|
24 |
|
|
|
25 |
|
|
|
26 | min_match=zstd.MINMATCH_MAX - 1, | |
|
27 | target_length=zstd.TARGETLENGTH_MAX, | |
|
28 | strategy=zstd.STRATEGY_BTULTRA2) | |
|
25 | zstd.ZstdCompressionParameters( | |
|
26 | window_log=zstd.WINDOWLOG_MAX, | |
|
27 | chain_log=zstd.CHAINLOG_MAX, | |
|
28 | hash_log=zstd.HASHLOG_MAX, | |
|
29 | search_log=zstd.SEARCHLOG_MAX, | |
|
30 | min_match=zstd.MINMATCH_MAX - 1, | |
|
31 | target_length=zstd.TARGETLENGTH_MAX, | |
|
32 | strategy=zstd.STRATEGY_BTULTRA2, | |
|
33 | ) | |
|
29 | 34 | |
|
30 | 35 | def test_from_level(self): |
|
31 | 36 | p = zstd.ZstdCompressionParameters.from_level(1) |
@@ -37,13 +42,15 class TestCompressionParameters(unittest | |||
|
37 | 42 | self.assertEqual(p.window_log, 19) |
|
38 | 43 | |
|
39 | 44 | def test_members(self): |
|
40 |
p = zstd.ZstdCompressionParameters( |
|
|
41 | chain_log=6, | |
|
42 | hash_log=7, | |
|
43 | search_log=4, | |
|
44 | min_match=5, | |
|
45 | target_length=8, | |
|
46 | strategy=1) | |
|
45 | p = zstd.ZstdCompressionParameters( | |
|
46 | window_log=10, | |
|
47 | chain_log=6, | |
|
48 | hash_log=7, | |
|
49 | search_log=4, | |
|
50 | min_match=5, | |
|
51 | target_length=8, | |
|
52 | strategy=1, | |
|
53 | ) | |
|
47 | 54 | self.assertEqual(p.window_log, 10) |
|
48 | 55 | self.assertEqual(p.chain_log, 6) |
|
49 | 56 | self.assertEqual(p.hash_log, 7) |
@@ -58,8 +65,7 class TestCompressionParameters(unittest | |||
|
58 | 65 | p = zstd.ZstdCompressionParameters(threads=4) |
|
59 | 66 | self.assertEqual(p.threads, 4) |
|
60 | 67 | |
|
61 | p = zstd.ZstdCompressionParameters(threads=2, job_size=1048576, | |
|
62 | overlap_log=6) | |
|
68 | p = zstd.ZstdCompressionParameters(threads=2, job_size=1048576, overlap_log=6) | |
|
63 | 69 | self.assertEqual(p.threads, 2) |
|
64 | 70 | self.assertEqual(p.job_size, 1048576) |
|
65 | 71 | self.assertEqual(p.overlap_log, 6) |
@@ -91,20 +97,25 class TestCompressionParameters(unittest | |||
|
91 | 97 | self.assertEqual(p.ldm_hash_rate_log, 8) |
|
92 | 98 | |
|
93 | 99 | def test_estimated_compression_context_size(self): |
|
94 |
p = zstd.ZstdCompressionParameters( |
|
|
95 | chain_log=16, | |
|
96 | hash_log=17, | |
|
97 | search_log=1, | |
|
98 | min_match=5, | |
|
99 | target_length=16, | |
|
100 | strategy=zstd.STRATEGY_DFAST) | |
|
100 | p = zstd.ZstdCompressionParameters( | |
|
101 | window_log=20, | |
|
102 | chain_log=16, | |
|
103 | hash_log=17, | |
|
104 | search_log=1, | |
|
105 | min_match=5, | |
|
106 | target_length=16, | |
|
107 | strategy=zstd.STRATEGY_DFAST, | |
|
108 | ) | |
|
101 | 109 | |
|
102 | 110 | # 32-bit has slightly different values from 64-bit. |
|
103 | self.assertAlmostEqual(p.estimated_compression_context_size(), 1294144, | |
|
104 | delta=250) | |
|
111 | self.assertAlmostEqual( | |
|
112 | p.estimated_compression_context_size(), 1294464, delta=400 | |
|
113 | ) | |
|
105 | 114 | |
|
106 | 115 | def test_strategy(self): |
|
107 | with self.assertRaisesRegexp(ValueError, 'cannot specify both compression_strategy'): | |
|
116 | with self.assertRaisesRegex( | |
|
117 | ValueError, "cannot specify both compression_strategy" | |
|
118 | ): | |
|
108 | 119 | zstd.ZstdCompressionParameters(strategy=0, compression_strategy=0) |
|
109 | 120 | |
|
110 | 121 | p = zstd.ZstdCompressionParameters(strategy=2) |
@@ -114,7 +125,9 class TestCompressionParameters(unittest | |||
|
114 | 125 | self.assertEqual(p.compression_strategy, 3) |
|
115 | 126 | |
|
116 | 127 | def test_ldm_hash_rate_log(self): |
|
117 | with self.assertRaisesRegexp(ValueError, 'cannot specify both ldm_hash_rate_log'): | |
|
128 | with self.assertRaisesRegex( | |
|
129 | ValueError, "cannot specify both ldm_hash_rate_log" | |
|
130 | ): | |
|
118 | 131 | zstd.ZstdCompressionParameters(ldm_hash_rate_log=8, ldm_hash_every_log=4) |
|
119 | 132 | |
|
120 | 133 | p = zstd.ZstdCompressionParameters(ldm_hash_rate_log=8) |
@@ -124,7 +137,7 class TestCompressionParameters(unittest | |||
|
124 | 137 | self.assertEqual(p.ldm_hash_every_log, 16) |
|
125 | 138 | |
|
126 | 139 | def test_overlap_log(self): |
|
127 |
with self.assertRaisesRegex |
|
|
140 | with self.assertRaisesRegex(ValueError, "cannot specify both overlap_log"): | |
|
128 | 141 | zstd.ZstdCompressionParameters(overlap_log=1, overlap_size_log=9) |
|
129 | 142 | |
|
130 | 143 | p = zstd.ZstdCompressionParameters(overlap_log=2) |
@@ -137,7 +150,7 class TestCompressionParameters(unittest | |||
|
137 | 150 | |
|
138 | 151 | |
|
139 | 152 | @make_cffi |
|
140 |
class TestFrameParameters( |
|
|
153 | class TestFrameParameters(TestCase): | |
|
141 | 154 | def test_invalid_type(self): |
|
142 | 155 | with self.assertRaises(TypeError): |
|
143 | 156 | zstd.get_frame_parameters(None) |
@@ -145,71 +158,71 class TestFrameParameters(unittest.TestC | |||
|
145 | 158 | # Python 3 doesn't appear to convert unicode to Py_buffer. |
|
146 | 159 | if sys.version_info[0] >= 3: |
|
147 | 160 | with self.assertRaises(TypeError): |
|
148 |
zstd.get_frame_parameters(u |
|
|
161 | zstd.get_frame_parameters(u"foobarbaz") | |
|
149 | 162 | else: |
|
150 | 163 | # CPython will convert unicode to Py_buffer. But CFFI won't. |
|
151 |
if zstd.backend == |
|
|
164 | if zstd.backend == "cffi": | |
|
152 | 165 | with self.assertRaises(TypeError): |
|
153 |
zstd.get_frame_parameters(u |
|
|
166 | zstd.get_frame_parameters(u"foobarbaz") | |
|
154 | 167 | else: |
|
155 | 168 | with self.assertRaises(zstd.ZstdError): |
|
156 |
zstd.get_frame_parameters(u |
|
|
169 | zstd.get_frame_parameters(u"foobarbaz") | |
|
157 | 170 | |
|
158 | 171 | def test_invalid_input_sizes(self): |
|
159 |
with self.assertRaisesRegex |
|
|
160 |
zstd.get_frame_parameters(b |
|
|
172 | with self.assertRaisesRegex(zstd.ZstdError, "not enough data for frame"): | |
|
173 | zstd.get_frame_parameters(b"") | |
|
161 | 174 | |
|
162 |
with self.assertRaisesRegex |
|
|
175 | with self.assertRaisesRegex(zstd.ZstdError, "not enough data for frame"): | |
|
163 | 176 | zstd.get_frame_parameters(zstd.FRAME_HEADER) |
|
164 | 177 | |
|
165 | 178 | def test_invalid_frame(self): |
|
166 |
with self.assertRaisesRegex |
|
|
167 |
zstd.get_frame_parameters(b |
|
|
179 | with self.assertRaisesRegex(zstd.ZstdError, "Unknown frame descriptor"): | |
|
180 | zstd.get_frame_parameters(b"foobarbaz") | |
|
168 | 181 | |
|
169 | 182 | def test_attributes(self): |
|
170 |
params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b |
|
|
183 | params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b"\x00\x00") | |
|
171 | 184 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
172 | 185 | self.assertEqual(params.window_size, 1024) |
|
173 | 186 | self.assertEqual(params.dict_id, 0) |
|
174 | 187 | self.assertFalse(params.has_checksum) |
|
175 | 188 | |
|
176 | 189 | # Lowest 2 bits indicate a dictionary and length. Here, the dict id is 1 byte. |
|
177 |
params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b |
|
|
190 | params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b"\x01\x00\xff") | |
|
178 | 191 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
179 | 192 | self.assertEqual(params.window_size, 1024) |
|
180 | 193 | self.assertEqual(params.dict_id, 255) |
|
181 | 194 | self.assertFalse(params.has_checksum) |
|
182 | 195 | |
|
183 | 196 | # Lowest 3rd bit indicates if checksum is present. |
|
184 |
params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b |
|
|
197 | params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b"\x04\x00") | |
|
185 | 198 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
186 | 199 | self.assertEqual(params.window_size, 1024) |
|
187 | 200 | self.assertEqual(params.dict_id, 0) |
|
188 | 201 | self.assertTrue(params.has_checksum) |
|
189 | 202 | |
|
190 | 203 | # Upper 2 bits indicate content size. |
|
191 |
params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b |
|
|
204 | params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b"\x40\x00\xff\x00") | |
|
192 | 205 | self.assertEqual(params.content_size, 511) |
|
193 | 206 | self.assertEqual(params.window_size, 1024) |
|
194 | 207 | self.assertEqual(params.dict_id, 0) |
|
195 | 208 | self.assertFalse(params.has_checksum) |
|
196 | 209 | |
|
197 | 210 | # Window descriptor is 2nd byte after frame header. |
|
198 |
params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b |
|
|
211 | params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b"\x00\x40") | |
|
199 | 212 | self.assertEqual(params.content_size, zstd.CONTENTSIZE_UNKNOWN) |
|
200 | 213 | self.assertEqual(params.window_size, 262144) |
|
201 | 214 | self.assertEqual(params.dict_id, 0) |
|
202 | 215 | self.assertFalse(params.has_checksum) |
|
203 | 216 | |
|
204 | 217 | # Set multiple things. |
|
205 |
params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b |
|
|
218 | params = zstd.get_frame_parameters(zstd.FRAME_HEADER + b"\x45\x40\x0f\x10\x00") | |
|
206 | 219 | self.assertEqual(params.content_size, 272) |
|
207 | 220 | self.assertEqual(params.window_size, 262144) |
|
208 | 221 | self.assertEqual(params.dict_id, 15) |
|
209 | 222 | self.assertTrue(params.has_checksum) |
|
210 | 223 | |
|
211 | 224 | def test_input_types(self): |
|
212 |
v = zstd.FRAME_HEADER + b |
|
|
225 | v = zstd.FRAME_HEADER + b"\x00\x00" | |
|
213 | 226 | |
|
214 | 227 | mutable_array = bytearray(len(v)) |
|
215 | 228 | mutable_array[:] = v |
@@ -7,70 +7,99 try: | |||
|
7 | 7 | import hypothesis |
|
8 | 8 | import hypothesis.strategies as strategies |
|
9 | 9 | except ImportError: |
|
10 |
raise unittest.SkipTest( |
|
|
10 | raise unittest.SkipTest("hypothesis not available") | |
|
11 | 11 | |
|
12 | 12 | import zstandard as zstd |
|
13 | 13 | |
|
14 | 14 | from .common import ( |
|
15 | 15 | make_cffi, |
|
16 | TestCase, | |
|
17 | ) | |
|
18 | ||
|
19 | ||
|
20 | s_windowlog = strategies.integers( | |
|
21 | min_value=zstd.WINDOWLOG_MIN, max_value=zstd.WINDOWLOG_MAX | |
|
22 | ) | |
|
23 | s_chainlog = strategies.integers( | |
|
24 | min_value=zstd.CHAINLOG_MIN, max_value=zstd.CHAINLOG_MAX | |
|
25 | ) | |
|
26 | s_hashlog = strategies.integers(min_value=zstd.HASHLOG_MIN, max_value=zstd.HASHLOG_MAX) | |
|
27 | s_searchlog = strategies.integers( | |
|
28 | min_value=zstd.SEARCHLOG_MIN, max_value=zstd.SEARCHLOG_MAX | |
|
29 | ) | |
|
30 | s_minmatch = strategies.integers( | |
|
31 | min_value=zstd.MINMATCH_MIN, max_value=zstd.MINMATCH_MAX | |
|
32 | ) | |
|
33 | s_targetlength = strategies.integers( | |
|
34 | min_value=zstd.TARGETLENGTH_MIN, max_value=zstd.TARGETLENGTH_MAX | |
|
35 | ) | |
|
36 | s_strategy = strategies.sampled_from( | |
|
37 | ( | |
|
38 | zstd.STRATEGY_FAST, | |
|
39 | zstd.STRATEGY_DFAST, | |
|
40 | zstd.STRATEGY_GREEDY, | |
|
41 | zstd.STRATEGY_LAZY, | |
|
42 | zstd.STRATEGY_LAZY2, | |
|
43 | zstd.STRATEGY_BTLAZY2, | |
|
44 | zstd.STRATEGY_BTOPT, | |
|
45 | zstd.STRATEGY_BTULTRA, | |
|
46 | zstd.STRATEGY_BTULTRA2, | |
|
47 | ) | |
|
16 | 48 | ) |
|
17 | 49 | |
|
18 | 50 | |
|
19 | s_windowlog = strategies.integers(min_value=zstd.WINDOWLOG_MIN, | |
|
20 | max_value=zstd.WINDOWLOG_MAX) | |
|
21 | s_chainlog = strategies.integers(min_value=zstd.CHAINLOG_MIN, | |
|
22 | max_value=zstd.CHAINLOG_MAX) | |
|
23 | s_hashlog = strategies.integers(min_value=zstd.HASHLOG_MIN, | |
|
24 | max_value=zstd.HASHLOG_MAX) | |
|
25 | s_searchlog = strategies.integers(min_value=zstd.SEARCHLOG_MIN, | |
|
26 | max_value=zstd.SEARCHLOG_MAX) | |
|
27 | s_minmatch = strategies.integers(min_value=zstd.MINMATCH_MIN, | |
|
28 | max_value=zstd.MINMATCH_MAX) | |
|
29 | s_targetlength = strategies.integers(min_value=zstd.TARGETLENGTH_MIN, | |
|
30 | max_value=zstd.TARGETLENGTH_MAX) | |
|
31 | s_strategy = strategies.sampled_from((zstd.STRATEGY_FAST, | |
|
32 | zstd.STRATEGY_DFAST, | |
|
33 | zstd.STRATEGY_GREEDY, | |
|
34 | zstd.STRATEGY_LAZY, | |
|
35 | zstd.STRATEGY_LAZY2, | |
|
36 | zstd.STRATEGY_BTLAZY2, | |
|
37 | zstd.STRATEGY_BTOPT, | |
|
38 | zstd.STRATEGY_BTULTRA, | |
|
39 | zstd.STRATEGY_BTULTRA2)) | |
|
40 | ||
|
51 | @make_cffi | |
|
52 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
53 | class TestCompressionParametersHypothesis(TestCase): | |
|
54 | @hypothesis.given( | |
|
55 | s_windowlog, | |
|
56 | s_chainlog, | |
|
57 | s_hashlog, | |
|
58 | s_searchlog, | |
|
59 | s_minmatch, | |
|
60 | s_targetlength, | |
|
61 | s_strategy, | |
|
62 | ) | |
|
63 | def test_valid_init( | |
|
64 | self, windowlog, chainlog, hashlog, searchlog, minmatch, targetlength, strategy | |
|
65 | ): | |
|
66 | zstd.ZstdCompressionParameters( | |
|
67 | window_log=windowlog, | |
|
68 | chain_log=chainlog, | |
|
69 | hash_log=hashlog, | |
|
70 | search_log=searchlog, | |
|
71 | min_match=minmatch, | |
|
72 | target_length=targetlength, | |
|
73 | strategy=strategy, | |
|
74 | ) | |
|
41 | 75 | |
|
42 | @make_cffi | |
|
43 | @unittest.skipUnless('ZSTD_SLOW_TESTS' in os.environ, 'ZSTD_SLOW_TESTS not set') | |
|
44 | class TestCompressionParametersHypothesis(unittest.TestCase): | |
|
45 | @hypothesis.given(s_windowlog, s_chainlog, s_hashlog, s_searchlog, | |
|
46 | s_minmatch, s_targetlength, s_strategy) | |
|
47 | def test_valid_init(self, windowlog, chainlog, hashlog, searchlog, | |
|
48 | minmatch, targetlength, strategy): | |
|
49 | zstd.ZstdCompressionParameters(window_log=windowlog, | |
|
50 | chain_log=chainlog, | |
|
51 | hash_log=hashlog, | |
|
52 | search_log=searchlog, | |
|
53 | min_match=minmatch, | |
|
54 | target_length=targetlength, | |
|
55 | strategy=strategy) | |
|
56 | ||
|
57 | @hypothesis.given(s_windowlog, s_chainlog, s_hashlog, s_searchlog, | |
|
58 | s_minmatch, s_targetlength, s_strategy) | |
|
59 | def test_estimated_compression_context_size(self, windowlog, chainlog, | |
|
60 | hashlog, searchlog, | |
|
61 | minmatch, targetlength, | |
|
62 | strategy): | |
|
63 | if minmatch == zstd.MINMATCH_MIN and strategy in (zstd.STRATEGY_FAST, zstd.STRATEGY_GREEDY): | |
|
76 | @hypothesis.given( | |
|
77 | s_windowlog, | |
|
78 | s_chainlog, | |
|
79 | s_hashlog, | |
|
80 | s_searchlog, | |
|
81 | s_minmatch, | |
|
82 | s_targetlength, | |
|
83 | s_strategy, | |
|
84 | ) | |
|
85 | def test_estimated_compression_context_size( | |
|
86 | self, windowlog, chainlog, hashlog, searchlog, minmatch, targetlength, strategy | |
|
87 | ): | |
|
88 | if minmatch == zstd.MINMATCH_MIN and strategy in ( | |
|
89 | zstd.STRATEGY_FAST, | |
|
90 | zstd.STRATEGY_GREEDY, | |
|
91 | ): | |
|
64 | 92 | minmatch += 1 |
|
65 | 93 | elif minmatch == zstd.MINMATCH_MAX and strategy != zstd.STRATEGY_FAST: |
|
66 | 94 | minmatch -= 1 |
|
67 | 95 | |
|
68 |
p = zstd.ZstdCompressionParameters( |
|
|
69 | chain_log=chainlog, | |
|
70 | hash_log=hashlog, | |
|
71 | search_log=searchlog, | |
|
72 | min_match=minmatch, | |
|
73 | target_length=targetlength, | |
|
74 | strategy=strategy) | |
|
96 | p = zstd.ZstdCompressionParameters( | |
|
97 | window_log=windowlog, | |
|
98 | chain_log=chainlog, | |
|
99 | hash_log=hashlog, | |
|
100 | search_log=searchlog, | |
|
101 | min_match=minmatch, | |
|
102 | target_length=targetlength, | |
|
103 | strategy=strategy, | |
|
104 | ) | |
|
75 | 105 | size = p.estimated_compression_context_size() |
|
76 |
This diff has been collapsed as it changes many lines, (729 lines changed) Show them Hide them | |||
@@ -13,6 +13,7 from .common import ( | |||
|
13 | 13 | make_cffi, |
|
14 | 14 | NonClosingBytesIO, |
|
15 | 15 | OpCountingBytesIO, |
|
16 | TestCase, | |
|
16 | 17 | ) |
|
17 | 18 | |
|
18 | 19 | |
@@ -23,62 +24,67 else: | |||
|
23 | 24 | |
|
24 | 25 | |
|
25 | 26 | @make_cffi |
|
26 |
class TestFrameHeaderSize( |
|
|
27 | class TestFrameHeaderSize(TestCase): | |
|
27 | 28 | def test_empty(self): |
|
28 |
with self.assertRaisesRegex |
|
|
29 | zstd.ZstdError, 'could not determine frame header size: Src size ' | |
|
30 | 'is incorrect'): | |
|
31 | zstd.frame_header_size(b'') | |
|
29 | with self.assertRaisesRegex( | |
|
30 | zstd.ZstdError, | |
|
31 | "could not determine frame header size: Src size " "is incorrect", | |
|
32 | ): | |
|
33 | zstd.frame_header_size(b"") | |
|
32 | 34 | |
|
33 | 35 | def test_too_small(self): |
|
34 |
with self.assertRaisesRegex |
|
|
35 | zstd.ZstdError, 'could not determine frame header size: Src size ' | |
|
36 | 'is incorrect'): | |
|
37 | zstd.frame_header_size(b'foob') | |
|
36 | with self.assertRaisesRegex( | |
|
37 | zstd.ZstdError, | |
|
38 | "could not determine frame header size: Src size " "is incorrect", | |
|
39 | ): | |
|
40 | zstd.frame_header_size(b"foob") | |
|
38 | 41 | |
|
39 | 42 | def test_basic(self): |
|
40 | 43 | # It doesn't matter that it isn't a valid frame. |
|
41 |
self.assertEqual(zstd.frame_header_size(b |
|
|
44 | self.assertEqual(zstd.frame_header_size(b"long enough but no magic"), 6) | |
|
42 | 45 | |
|
43 | 46 | |
|
44 | 47 | @make_cffi |
|
45 |
class TestFrameContentSize( |
|
|
48 | class TestFrameContentSize(TestCase): | |
|
46 | 49 | def test_empty(self): |
|
47 |
with self.assertRaisesRegex |
|
|
48 |
|
|
|
49 | zstd.frame_content_size(b'') | |
|
50 | with self.assertRaisesRegex( | |
|
51 | zstd.ZstdError, "error when determining content size" | |
|
52 | ): | |
|
53 | zstd.frame_content_size(b"") | |
|
50 | 54 | |
|
51 | 55 | def test_too_small(self): |
|
52 |
with self.assertRaisesRegex |
|
|
53 |
|
|
|
54 | zstd.frame_content_size(b'foob') | |
|
56 | with self.assertRaisesRegex( | |
|
57 | zstd.ZstdError, "error when determining content size" | |
|
58 | ): | |
|
59 | zstd.frame_content_size(b"foob") | |
|
55 | 60 | |
|
56 | 61 | def test_bad_frame(self): |
|
57 |
with self.assertRaisesRegex |
|
|
58 |
|
|
|
59 | zstd.frame_content_size(b'invalid frame header') | |
|
62 | with self.assertRaisesRegex( | |
|
63 | zstd.ZstdError, "error when determining content size" | |
|
64 | ): | |
|
65 | zstd.frame_content_size(b"invalid frame header") | |
|
60 | 66 | |
|
61 | 67 | def test_unknown(self): |
|
62 | 68 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
63 |
frame = cctx.compress(b |
|
|
69 | frame = cctx.compress(b"foobar") | |
|
64 | 70 | |
|
65 | 71 | self.assertEqual(zstd.frame_content_size(frame), -1) |
|
66 | 72 | |
|
67 | 73 | def test_empty(self): |
|
68 | 74 | cctx = zstd.ZstdCompressor() |
|
69 |
frame = cctx.compress(b |
|
|
75 | frame = cctx.compress(b"") | |
|
70 | 76 | |
|
71 | 77 | self.assertEqual(zstd.frame_content_size(frame), 0) |
|
72 | 78 | |
|
73 | 79 | def test_basic(self): |
|
74 | 80 | cctx = zstd.ZstdCompressor() |
|
75 |
frame = cctx.compress(b |
|
|
81 | frame = cctx.compress(b"foobar") | |
|
76 | 82 | |
|
77 | 83 | self.assertEqual(zstd.frame_content_size(frame), 6) |
|
78 | 84 | |
|
79 | 85 | |
|
80 | 86 | @make_cffi |
|
81 |
class TestDecompressor( |
|
|
87 | class TestDecompressor(TestCase): | |
|
82 | 88 | def test_memory_size(self): |
|
83 | 89 | dctx = zstd.ZstdDecompressor() |
|
84 | 90 | |
@@ -86,22 +92,26 class TestDecompressor(unittest.TestCase | |||
|
86 | 92 | |
|
87 | 93 | |
|
88 | 94 | @make_cffi |
|
89 |
class TestDecompressor_decompress( |
|
|
95 | class TestDecompressor_decompress(TestCase): | |
|
90 | 96 | def test_empty_input(self): |
|
91 | 97 | dctx = zstd.ZstdDecompressor() |
|
92 | 98 | |
|
93 | with self.assertRaisesRegexp(zstd.ZstdError, 'error determining content size from frame header'): | |
|
94 | dctx.decompress(b'') | |
|
99 | with self.assertRaisesRegex( | |
|
100 | zstd.ZstdError, "error determining content size from frame header" | |
|
101 | ): | |
|
102 | dctx.decompress(b"") | |
|
95 | 103 | |
|
96 | 104 | def test_invalid_input(self): |
|
97 | 105 | dctx = zstd.ZstdDecompressor() |
|
98 | 106 | |
|
99 | with self.assertRaisesRegexp(zstd.ZstdError, 'error determining content size from frame header'): | |
|
100 | dctx.decompress(b'foobar') | |
|
107 | with self.assertRaisesRegex( | |
|
108 | zstd.ZstdError, "error determining content size from frame header" | |
|
109 | ): | |
|
110 | dctx.decompress(b"foobar") | |
|
101 | 111 | |
|
102 | 112 | def test_input_types(self): |
|
103 | 113 | cctx = zstd.ZstdCompressor(level=1) |
|
104 |
compressed = cctx.compress(b |
|
|
114 | compressed = cctx.compress(b"foo") | |
|
105 | 115 | |
|
106 | 116 | mutable_array = bytearray(len(compressed)) |
|
107 | 117 | mutable_array[:] = compressed |
@@ -114,36 +124,38 class TestDecompressor_decompress(unitte | |||
|
114 | 124 | |
|
115 | 125 | dctx = zstd.ZstdDecompressor() |
|
116 | 126 | for source in sources: |
|
117 |
self.assertEqual(dctx.decompress(source), b |
|
|
127 | self.assertEqual(dctx.decompress(source), b"foo") | |
|
118 | 128 | |
|
119 | 129 | def test_no_content_size_in_frame(self): |
|
120 | 130 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
121 |
compressed = cctx.compress(b |
|
|
131 | compressed = cctx.compress(b"foobar") | |
|
122 | 132 | |
|
123 | 133 | dctx = zstd.ZstdDecompressor() |
|
124 | with self.assertRaisesRegexp(zstd.ZstdError, 'could not determine content size in frame header'): | |
|
134 | with self.assertRaisesRegex( | |
|
135 | zstd.ZstdError, "could not determine content size in frame header" | |
|
136 | ): | |
|
125 | 137 | dctx.decompress(compressed) |
|
126 | 138 | |
|
127 | 139 | def test_content_size_present(self): |
|
128 | 140 | cctx = zstd.ZstdCompressor() |
|
129 |
compressed = cctx.compress(b |
|
|
141 | compressed = cctx.compress(b"foobar") | |
|
130 | 142 | |
|
131 | 143 | dctx = zstd.ZstdDecompressor() |
|
132 | 144 | decompressed = dctx.decompress(compressed) |
|
133 |
self.assertEqual(decompressed, b |
|
|
145 | self.assertEqual(decompressed, b"foobar") | |
|
134 | 146 | |
|
135 | 147 | def test_empty_roundtrip(self): |
|
136 | 148 | cctx = zstd.ZstdCompressor() |
|
137 |
compressed = cctx.compress(b |
|
|
149 | compressed = cctx.compress(b"") | |
|
138 | 150 | |
|
139 | 151 | dctx = zstd.ZstdDecompressor() |
|
140 | 152 | decompressed = dctx.decompress(compressed) |
|
141 | 153 | |
|
142 |
self.assertEqual(decompressed, b |
|
|
154 | self.assertEqual(decompressed, b"") | |
|
143 | 155 | |
|
144 | 156 | def test_max_output_size(self): |
|
145 | 157 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
146 |
source = b |
|
|
158 | source = b"foobar" * 256 | |
|
147 | 159 | compressed = cctx.compress(source) |
|
148 | 160 | |
|
149 | 161 | dctx = zstd.ZstdDecompressor() |
@@ -152,8 +164,9 class TestDecompressor_decompress(unitte | |||
|
152 | 164 | self.assertEqual(decompressed, source) |
|
153 | 165 | |
|
154 | 166 | # Input size - 1 fails |
|
155 |
with self.assertRaisesRegex |
|
|
156 |
|
|
|
167 | with self.assertRaisesRegex( | |
|
168 | zstd.ZstdError, "decompression error: did not decompress full frame" | |
|
169 | ): | |
|
157 | 170 | dctx.decompress(compressed, max_output_size=len(source) - 1) |
|
158 | 171 | |
|
159 | 172 | # Input size + 1 works |
@@ -166,24 +179,24 class TestDecompressor_decompress(unitte | |||
|
166 | 179 | |
|
167 | 180 | def test_stupidly_large_output_buffer(self): |
|
168 | 181 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
169 |
compressed = cctx.compress(b |
|
|
182 | compressed = cctx.compress(b"foobar" * 256) | |
|
170 | 183 | dctx = zstd.ZstdDecompressor() |
|
171 | 184 | |
|
172 | 185 | # Will get OverflowError on some Python distributions that can't |
|
173 | 186 | # handle really large integers. |
|
174 | 187 | with self.assertRaises((MemoryError, OverflowError)): |
|
175 | dctx.decompress(compressed, max_output_size=2**62) | |
|
188 | dctx.decompress(compressed, max_output_size=2 ** 62) | |
|
176 | 189 | |
|
177 | 190 | def test_dictionary(self): |
|
178 | 191 | samples = [] |
|
179 | 192 | for i in range(128): |
|
180 |
samples.append(b |
|
|
181 |
samples.append(b |
|
|
182 |
samples.append(b |
|
|
193 | samples.append(b"foo" * 64) | |
|
194 | samples.append(b"bar" * 64) | |
|
195 | samples.append(b"foobar" * 64) | |
|
183 | 196 | |
|
184 | 197 | d = zstd.train_dictionary(8192, samples) |
|
185 | 198 | |
|
186 |
orig = b |
|
|
199 | orig = b"foobar" * 16384 | |
|
187 | 200 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
188 | 201 | compressed = cctx.compress(orig) |
|
189 | 202 | |
@@ -195,13 +208,13 class TestDecompressor_decompress(unitte | |||
|
195 | 208 | def test_dictionary_multiple(self): |
|
196 | 209 | samples = [] |
|
197 | 210 | for i in range(128): |
|
198 |
samples.append(b |
|
|
199 |
samples.append(b |
|
|
200 |
samples.append(b |
|
|
211 | samples.append(b"foo" * 64) | |
|
212 | samples.append(b"bar" * 64) | |
|
213 | samples.append(b"foobar" * 64) | |
|
201 | 214 | |
|
202 | 215 | d = zstd.train_dictionary(8192, samples) |
|
203 | 216 | |
|
204 |
sources = (b |
|
|
217 | sources = (b"foobar" * 8192, b"foo" * 8192, b"bar" * 8192) | |
|
205 | 218 | compressed = [] |
|
206 | 219 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
207 | 220 | for source in sources: |
@@ -213,7 +226,7 class TestDecompressor_decompress(unitte | |||
|
213 | 226 | self.assertEqual(decompressed, sources[i]) |
|
214 | 227 | |
|
215 | 228 | def test_max_window_size(self): |
|
216 |
with open(__file__, |
|
|
229 | with open(__file__, "rb") as fh: | |
|
217 | 230 | source = fh.read() |
|
218 | 231 | |
|
219 | 232 | # If we write a content size, the decompressor engages single pass |
@@ -221,15 +234,16 class TestDecompressor_decompress(unitte | |||
|
221 | 234 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
222 | 235 | frame = cctx.compress(source) |
|
223 | 236 | |
|
224 | dctx = zstd.ZstdDecompressor(max_window_size=2**zstd.WINDOWLOG_MIN) | |
|
237 | dctx = zstd.ZstdDecompressor(max_window_size=2 ** zstd.WINDOWLOG_MIN) | |
|
225 | 238 | |
|
226 |
with self.assertRaisesRegex |
|
|
227 |
zstd.ZstdError, |
|
|
239 | with self.assertRaisesRegex( | |
|
240 | zstd.ZstdError, "decompression error: Frame requires too much memory" | |
|
241 | ): | |
|
228 | 242 | dctx.decompress(frame, max_output_size=len(source)) |
|
229 | 243 | |
|
230 | 244 | |
|
231 | 245 | @make_cffi |
|
232 |
class TestDecompressor_copy_stream( |
|
|
246 | class TestDecompressor_copy_stream(TestCase): | |
|
233 | 247 | def test_no_read(self): |
|
234 | 248 | source = object() |
|
235 | 249 | dest = io.BytesIO() |
@@ -256,12 +270,12 class TestDecompressor_copy_stream(unitt | |||
|
256 | 270 | |
|
257 | 271 | self.assertEqual(r, 0) |
|
258 | 272 | self.assertEqual(w, 0) |
|
259 |
self.assertEqual(dest.getvalue(), b |
|
|
273 | self.assertEqual(dest.getvalue(), b"") | |
|
260 | 274 | |
|
261 | 275 | def test_large_data(self): |
|
262 | 276 | source = io.BytesIO() |
|
263 | 277 | for i in range(255): |
|
264 |
source.write(struct.Struct( |
|
|
278 | source.write(struct.Struct(">B").pack(i) * 16384) | |
|
265 | 279 | source.seek(0) |
|
266 | 280 | |
|
267 | 281 | compressed = io.BytesIO() |
@@ -277,33 +291,32 class TestDecompressor_copy_stream(unitt | |||
|
277 | 291 | self.assertEqual(w, len(source.getvalue())) |
|
278 | 292 | |
|
279 | 293 | def test_read_write_size(self): |
|
280 | source = OpCountingBytesIO(zstd.ZstdCompressor().compress( | |
|
281 | b'foobarfoobar')) | |
|
294 | source = OpCountingBytesIO(zstd.ZstdCompressor().compress(b"foobarfoobar")) | |
|
282 | 295 | |
|
283 | 296 | dest = OpCountingBytesIO() |
|
284 | 297 | dctx = zstd.ZstdDecompressor() |
|
285 | 298 | r, w = dctx.copy_stream(source, dest, read_size=1, write_size=1) |
|
286 | 299 | |
|
287 | 300 | self.assertEqual(r, len(source.getvalue())) |
|
288 |
self.assertEqual(w, len(b |
|
|
301 | self.assertEqual(w, len(b"foobarfoobar")) | |
|
289 | 302 | self.assertEqual(source._read_count, len(source.getvalue()) + 1) |
|
290 | 303 | self.assertEqual(dest._write_count, len(dest.getvalue())) |
|
291 | 304 | |
|
292 | 305 | |
|
293 | 306 | @make_cffi |
|
294 |
class TestDecompressor_stream_reader( |
|
|
307 | class TestDecompressor_stream_reader(TestCase): | |
|
295 | 308 | def test_context_manager(self): |
|
296 | 309 | dctx = zstd.ZstdDecompressor() |
|
297 | 310 | |
|
298 |
with dctx.stream_reader(b |
|
|
299 |
with self.assertRaisesRegex |
|
|
311 | with dctx.stream_reader(b"foo") as reader: | |
|
312 | with self.assertRaisesRegex(ValueError, "cannot __enter__ multiple times"): | |
|
300 | 313 | with reader as reader2: |
|
301 | 314 | pass |
|
302 | 315 | |
|
303 | 316 | def test_not_implemented(self): |
|
304 | 317 | dctx = zstd.ZstdDecompressor() |
|
305 | 318 | |
|
306 |
with dctx.stream_reader(b |
|
|
319 | with dctx.stream_reader(b"foo") as reader: | |
|
307 | 320 | with self.assertRaises(io.UnsupportedOperation): |
|
308 | 321 | reader.readline() |
|
309 | 322 | |
@@ -317,7 +330,7 class TestDecompressor_stream_reader(uni | |||
|
317 | 330 | next(reader) |
|
318 | 331 | |
|
319 | 332 | with self.assertRaises(io.UnsupportedOperation): |
|
320 |
reader.write(b |
|
|
333 | reader.write(b"foo") | |
|
321 | 334 | |
|
322 | 335 | with self.assertRaises(io.UnsupportedOperation): |
|
323 | 336 | reader.writelines([]) |
@@ -325,7 +338,7 class TestDecompressor_stream_reader(uni | |||
|
325 | 338 | def test_constant_methods(self): |
|
326 | 339 | dctx = zstd.ZstdDecompressor() |
|
327 | 340 | |
|
328 |
with dctx.stream_reader(b |
|
|
341 | with dctx.stream_reader(b"foo") as reader: | |
|
329 | 342 | self.assertFalse(reader.closed) |
|
330 | 343 | self.assertTrue(reader.readable()) |
|
331 | 344 | self.assertFalse(reader.writable()) |
@@ -340,29 +353,31 class TestDecompressor_stream_reader(uni | |||
|
340 | 353 | def test_read_closed(self): |
|
341 | 354 | dctx = zstd.ZstdDecompressor() |
|
342 | 355 | |
|
343 |
with dctx.stream_reader(b |
|
|
356 | with dctx.stream_reader(b"foo") as reader: | |
|
344 | 357 | reader.close() |
|
345 | 358 | self.assertTrue(reader.closed) |
|
346 |
with self.assertRaisesRegex |
|
|
359 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
347 | 360 | reader.read(1) |
|
348 | 361 | |
|
349 | 362 | def test_read_sizes(self): |
|
350 | 363 | cctx = zstd.ZstdCompressor() |
|
351 |
foo = cctx.compress(b |
|
|
364 | foo = cctx.compress(b"foo") | |
|
352 | 365 | |
|
353 | 366 | dctx = zstd.ZstdDecompressor() |
|
354 | 367 | |
|
355 | 368 | with dctx.stream_reader(foo) as reader: |
|
356 | with self.assertRaisesRegexp(ValueError, 'cannot read negative amounts less than -1'): | |
|
369 | with self.assertRaisesRegex( | |
|
370 | ValueError, "cannot read negative amounts less than -1" | |
|
371 | ): | |
|
357 | 372 | reader.read(-2) |
|
358 | 373 | |
|
359 |
self.assertEqual(reader.read(0), b |
|
|
360 |
self.assertEqual(reader.read(), b |
|
|
374 | self.assertEqual(reader.read(0), b"") | |
|
375 | self.assertEqual(reader.read(), b"foo") | |
|
361 | 376 | |
|
362 | 377 | def test_read_buffer(self): |
|
363 | 378 | cctx = zstd.ZstdCompressor() |
|
364 | 379 | |
|
365 |
source = b |
|
|
380 | source = b"".join([b"foo" * 60, b"bar" * 60, b"baz" * 60]) | |
|
366 | 381 | frame = cctx.compress(source) |
|
367 | 382 | |
|
368 | 383 | dctx = zstd.ZstdDecompressor() |
@@ -376,14 +391,14 class TestDecompressor_stream_reader(uni | |||
|
376 | 391 | self.assertEqual(reader.tell(), len(source)) |
|
377 | 392 | |
|
378 | 393 | # Read after EOF should return empty bytes. |
|
379 |
self.assertEqual(reader.read(1), b |
|
|
394 | self.assertEqual(reader.read(1), b"") | |
|
380 | 395 | self.assertEqual(reader.tell(), len(result)) |
|
381 | 396 | |
|
382 | 397 | self.assertTrue(reader.closed) |
|
383 | 398 | |
|
384 | 399 | def test_read_buffer_small_chunks(self): |
|
385 | 400 | cctx = zstd.ZstdCompressor() |
|
386 |
source = b |
|
|
401 | source = b"".join([b"foo" * 60, b"bar" * 60, b"baz" * 60]) | |
|
387 | 402 | frame = cctx.compress(source) |
|
388 | 403 | |
|
389 | 404 | dctx = zstd.ZstdDecompressor() |
@@ -398,11 +413,11 class TestDecompressor_stream_reader(uni | |||
|
398 | 413 | chunks.append(chunk) |
|
399 | 414 | self.assertEqual(reader.tell(), sum(map(len, chunks))) |
|
400 | 415 | |
|
401 |
self.assertEqual(b |
|
|
416 | self.assertEqual(b"".join(chunks), source) | |
|
402 | 417 | |
|
403 | 418 | def test_read_stream(self): |
|
404 | 419 | cctx = zstd.ZstdCompressor() |
|
405 |
source = b |
|
|
420 | source = b"".join([b"foo" * 60, b"bar" * 60, b"baz" * 60]) | |
|
406 | 421 | frame = cctx.compress(source) |
|
407 | 422 | |
|
408 | 423 | dctx = zstd.ZstdDecompressor() |
@@ -412,7 +427,7 class TestDecompressor_stream_reader(uni | |||
|
412 | 427 | chunk = reader.read(8192) |
|
413 | 428 | self.assertEqual(chunk, source) |
|
414 | 429 | self.assertEqual(reader.tell(), len(source)) |
|
415 |
self.assertEqual(reader.read(1), b |
|
|
430 | self.assertEqual(reader.read(1), b"") | |
|
416 | 431 | self.assertEqual(reader.tell(), len(source)) |
|
417 | 432 | self.assertFalse(reader.closed) |
|
418 | 433 | |
@@ -420,7 +435,7 class TestDecompressor_stream_reader(uni | |||
|
420 | 435 | |
|
421 | 436 | def test_read_stream_small_chunks(self): |
|
422 | 437 | cctx = zstd.ZstdCompressor() |
|
423 |
source = b |
|
|
438 | source = b"".join([b"foo" * 60, b"bar" * 60, b"baz" * 60]) | |
|
424 | 439 | frame = cctx.compress(source) |
|
425 | 440 | |
|
426 | 441 | dctx = zstd.ZstdDecompressor() |
@@ -435,11 +450,11 class TestDecompressor_stream_reader(uni | |||
|
435 | 450 | chunks.append(chunk) |
|
436 | 451 | self.assertEqual(reader.tell(), sum(map(len, chunks))) |
|
437 | 452 | |
|
438 |
self.assertEqual(b |
|
|
453 | self.assertEqual(b"".join(chunks), source) | |
|
439 | 454 | |
|
440 | 455 | def test_read_after_exit(self): |
|
441 | 456 | cctx = zstd.ZstdCompressor() |
|
442 |
frame = cctx.compress(b |
|
|
457 | frame = cctx.compress(b"foo" * 60) | |
|
443 | 458 | |
|
444 | 459 | dctx = zstd.ZstdDecompressor() |
|
445 | 460 | |
@@ -449,45 +464,46 class TestDecompressor_stream_reader(uni | |||
|
449 | 464 | |
|
450 | 465 | self.assertTrue(reader.closed) |
|
451 | 466 | |
|
452 |
with self.assertRaisesRegex |
|
|
467 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
453 | 468 | reader.read(10) |
|
454 | 469 | |
|
455 | 470 | def test_illegal_seeks(self): |
|
456 | 471 | cctx = zstd.ZstdCompressor() |
|
457 |
frame = cctx.compress(b |
|
|
472 | frame = cctx.compress(b"foo" * 60) | |
|
458 | 473 | |
|
459 | 474 | dctx = zstd.ZstdDecompressor() |
|
460 | 475 | |
|
461 | 476 | with dctx.stream_reader(frame) as reader: |
|
462 |
with self.assertRaisesRegex |
|
|
463 | 'cannot seek to negative position'): | |
|
477 | with self.assertRaisesRegex(ValueError, "cannot seek to negative position"): | |
|
464 | 478 | reader.seek(-1, os.SEEK_SET) |
|
465 | 479 | |
|
466 | 480 | reader.read(1) |
|
467 | 481 | |
|
468 |
with self.assertRaisesRegex |
|
|
469 |
ValueError, |
|
|
482 | with self.assertRaisesRegex( | |
|
483 | ValueError, "cannot seek zstd decompression stream backwards" | |
|
484 | ): | |
|
470 | 485 | reader.seek(0, os.SEEK_SET) |
|
471 | 486 | |
|
472 |
with self.assertRaisesRegex |
|
|
473 |
ValueError, |
|
|
487 | with self.assertRaisesRegex( | |
|
488 | ValueError, "cannot seek zstd decompression stream backwards" | |
|
489 | ): | |
|
474 | 490 | reader.seek(-1, os.SEEK_CUR) |
|
475 | 491 | |
|
476 |
with self.assertRaisesRegex |
|
|
477 | ValueError, | |
|
478 | 'zstd decompression streams cannot be seeked with SEEK_END'): | |
|
492 | with self.assertRaisesRegex( | |
|
493 | ValueError, "zstd decompression streams cannot be seeked with SEEK_END" | |
|
494 | ): | |
|
479 | 495 | reader.seek(0, os.SEEK_END) |
|
480 | 496 | |
|
481 | 497 | reader.close() |
|
482 | 498 | |
|
483 |
with self.assertRaisesRegex |
|
|
499 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
484 | 500 | reader.seek(4, os.SEEK_SET) |
|
485 | 501 | |
|
486 |
with self.assertRaisesRegex |
|
|
502 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
487 | 503 | reader.seek(0) |
|
488 | 504 | |
|
489 | 505 | def test_seek(self): |
|
490 |
source = b |
|
|
506 | source = b"foobar" * 60 | |
|
491 | 507 | cctx = zstd.ZstdCompressor() |
|
492 | 508 | frame = cctx.compress(source) |
|
493 | 509 | |
@@ -495,32 +511,32 class TestDecompressor_stream_reader(uni | |||
|
495 | 511 | |
|
496 | 512 | with dctx.stream_reader(frame) as reader: |
|
497 | 513 | reader.seek(3) |
|
498 |
self.assertEqual(reader.read(3), b |
|
|
514 | self.assertEqual(reader.read(3), b"bar") | |
|
499 | 515 | |
|
500 | 516 | reader.seek(4, os.SEEK_CUR) |
|
501 |
self.assertEqual(reader.read(2), b |
|
|
517 | self.assertEqual(reader.read(2), b"ar") | |
|
502 | 518 | |
|
503 | 519 | def test_no_context_manager(self): |
|
504 |
source = b |
|
|
520 | source = b"foobar" * 60 | |
|
505 | 521 | cctx = zstd.ZstdCompressor() |
|
506 | 522 | frame = cctx.compress(source) |
|
507 | 523 | |
|
508 | 524 | dctx = zstd.ZstdDecompressor() |
|
509 | 525 | reader = dctx.stream_reader(frame) |
|
510 | 526 | |
|
511 |
self.assertEqual(reader.read(6), b |
|
|
512 |
self.assertEqual(reader.read(18), b |
|
|
527 | self.assertEqual(reader.read(6), b"foobar") | |
|
528 | self.assertEqual(reader.read(18), b"foobar" * 3) | |
|
513 | 529 | self.assertFalse(reader.closed) |
|
514 | 530 | |
|
515 | 531 | # Calling close prevents subsequent use. |
|
516 | 532 | reader.close() |
|
517 | 533 | self.assertTrue(reader.closed) |
|
518 | 534 | |
|
519 |
with self.assertRaisesRegex |
|
|
535 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
520 | 536 | reader.read(6) |
|
521 | 537 | |
|
522 | 538 | def test_read_after_error(self): |
|
523 |
source = io.BytesIO(b |
|
|
539 | source = io.BytesIO(b"") | |
|
524 | 540 | dctx = zstd.ZstdDecompressor() |
|
525 | 541 | |
|
526 | 542 | reader = dctx.stream_reader(source) |
@@ -529,7 +545,7 class TestDecompressor_stream_reader(uni | |||
|
529 | 545 | reader.read(0) |
|
530 | 546 | |
|
531 | 547 | with reader: |
|
532 |
with self.assertRaisesRegex |
|
|
548 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
533 | 549 | reader.read(100) |
|
534 | 550 | |
|
535 | 551 | def test_partial_read(self): |
@@ -553,87 +569,87 class TestDecompressor_stream_reader(uni | |||
|
553 | 569 | cctx = zstd.ZstdCompressor() |
|
554 | 570 | source = io.BytesIO() |
|
555 | 571 | writer = cctx.stream_writer(source) |
|
556 |
writer.write(b |
|
|
572 | writer.write(b"foo") | |
|
557 | 573 | writer.flush(zstd.FLUSH_FRAME) |
|
558 |
writer.write(b |
|
|
574 | writer.write(b"bar") | |
|
559 | 575 | writer.flush(zstd.FLUSH_FRAME) |
|
560 | 576 | |
|
561 | 577 | dctx = zstd.ZstdDecompressor() |
|
562 | 578 | |
|
563 | 579 | reader = dctx.stream_reader(source.getvalue()) |
|
564 |
self.assertEqual(reader.read(2), b |
|
|
565 |
self.assertEqual(reader.read(2), b |
|
|
566 |
self.assertEqual(reader.read(2), b |
|
|
567 |
self.assertEqual(reader.read(2), b |
|
|
580 | self.assertEqual(reader.read(2), b"fo") | |
|
581 | self.assertEqual(reader.read(2), b"o") | |
|
582 | self.assertEqual(reader.read(2), b"ba") | |
|
583 | self.assertEqual(reader.read(2), b"r") | |
|
568 | 584 | |
|
569 | 585 | source.seek(0) |
|
570 | 586 | reader = dctx.stream_reader(source) |
|
571 |
self.assertEqual(reader.read(2), b |
|
|
572 |
self.assertEqual(reader.read(2), b |
|
|
573 |
self.assertEqual(reader.read(2), b |
|
|
574 |
self.assertEqual(reader.read(2), b |
|
|
587 | self.assertEqual(reader.read(2), b"fo") | |
|
588 | self.assertEqual(reader.read(2), b"o") | |
|
589 | self.assertEqual(reader.read(2), b"ba") | |
|
590 | self.assertEqual(reader.read(2), b"r") | |
|
575 | 591 | |
|
576 | 592 | reader = dctx.stream_reader(source.getvalue()) |
|
577 |
self.assertEqual(reader.read(3), b |
|
|
578 |
self.assertEqual(reader.read(3), b |
|
|
593 | self.assertEqual(reader.read(3), b"foo") | |
|
594 | self.assertEqual(reader.read(3), b"bar") | |
|
579 | 595 | |
|
580 | 596 | source.seek(0) |
|
581 | 597 | reader = dctx.stream_reader(source) |
|
582 |
self.assertEqual(reader.read(3), b |
|
|
583 |
self.assertEqual(reader.read(3), b |
|
|
598 | self.assertEqual(reader.read(3), b"foo") | |
|
599 | self.assertEqual(reader.read(3), b"bar") | |
|
584 | 600 | |
|
585 | 601 | reader = dctx.stream_reader(source.getvalue()) |
|
586 |
self.assertEqual(reader.read(4), b |
|
|
587 |
self.assertEqual(reader.read(4), b |
|
|
602 | self.assertEqual(reader.read(4), b"foo") | |
|
603 | self.assertEqual(reader.read(4), b"bar") | |
|
588 | 604 | |
|
589 | 605 | source.seek(0) |
|
590 | 606 | reader = dctx.stream_reader(source) |
|
591 |
self.assertEqual(reader.read(4), b |
|
|
592 |
self.assertEqual(reader.read(4), b |
|
|
607 | self.assertEqual(reader.read(4), b"foo") | |
|
608 | self.assertEqual(reader.read(4), b"bar") | |
|
593 | 609 | |
|
594 | 610 | reader = dctx.stream_reader(source.getvalue()) |
|
595 |
self.assertEqual(reader.read(128), b |
|
|
596 |
self.assertEqual(reader.read(128), b |
|
|
611 | self.assertEqual(reader.read(128), b"foo") | |
|
612 | self.assertEqual(reader.read(128), b"bar") | |
|
597 | 613 | |
|
598 | 614 | source.seek(0) |
|
599 | 615 | reader = dctx.stream_reader(source) |
|
600 |
self.assertEqual(reader.read(128), b |
|
|
601 |
self.assertEqual(reader.read(128), b |
|
|
616 | self.assertEqual(reader.read(128), b"foo") | |
|
617 | self.assertEqual(reader.read(128), b"bar") | |
|
602 | 618 | |
|
603 | 619 | # Now tests for reads spanning frames. |
|
604 | 620 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) |
|
605 |
self.assertEqual(reader.read(3), b |
|
|
606 |
self.assertEqual(reader.read(3), b |
|
|
621 | self.assertEqual(reader.read(3), b"foo") | |
|
622 | self.assertEqual(reader.read(3), b"bar") | |
|
607 | 623 | |
|
608 | 624 | source.seek(0) |
|
609 | 625 | reader = dctx.stream_reader(source, read_across_frames=True) |
|
610 |
self.assertEqual(reader.read(3), b |
|
|
611 |
self.assertEqual(reader.read(3), b |
|
|
626 | self.assertEqual(reader.read(3), b"foo") | |
|
627 | self.assertEqual(reader.read(3), b"bar") | |
|
612 | 628 | |
|
613 | 629 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) |
|
614 |
self.assertEqual(reader.read(6), b |
|
|
630 | self.assertEqual(reader.read(6), b"foobar") | |
|
615 | 631 | |
|
616 | 632 | source.seek(0) |
|
617 | 633 | reader = dctx.stream_reader(source, read_across_frames=True) |
|
618 |
self.assertEqual(reader.read(6), b |
|
|
634 | self.assertEqual(reader.read(6), b"foobar") | |
|
619 | 635 | |
|
620 | 636 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) |
|
621 |
self.assertEqual(reader.read(7), b |
|
|
637 | self.assertEqual(reader.read(7), b"foobar") | |
|
622 | 638 | |
|
623 | 639 | source.seek(0) |
|
624 | 640 | reader = dctx.stream_reader(source, read_across_frames=True) |
|
625 |
self.assertEqual(reader.read(7), b |
|
|
641 | self.assertEqual(reader.read(7), b"foobar") | |
|
626 | 642 | |
|
627 | 643 | reader = dctx.stream_reader(source.getvalue(), read_across_frames=True) |
|
628 |
self.assertEqual(reader.read(128), b |
|
|
644 | self.assertEqual(reader.read(128), b"foobar") | |
|
629 | 645 | |
|
630 | 646 | source.seek(0) |
|
631 | 647 | reader = dctx.stream_reader(source, read_across_frames=True) |
|
632 |
self.assertEqual(reader.read(128), b |
|
|
648 | self.assertEqual(reader.read(128), b"foobar") | |
|
633 | 649 | |
|
634 | 650 | def test_readinto(self): |
|
635 | 651 | cctx = zstd.ZstdCompressor() |
|
636 |
foo = cctx.compress(b |
|
|
652 | foo = cctx.compress(b"foo") | |
|
637 | 653 | |
|
638 | 654 | dctx = zstd.ZstdDecompressor() |
|
639 | 655 | |
@@ -641,116 +657,116 class TestDecompressor_stream_reader(uni | |||
|
641 | 657 | # The exact exception varies based on the backend. |
|
642 | 658 | reader = dctx.stream_reader(foo) |
|
643 | 659 | with self.assertRaises(Exception): |
|
644 |
reader.readinto(b |
|
|
660 | reader.readinto(b"foobar") | |
|
645 | 661 | |
|
646 | 662 | # readinto() with sufficiently large destination. |
|
647 | 663 | b = bytearray(1024) |
|
648 | 664 | reader = dctx.stream_reader(foo) |
|
649 | 665 | self.assertEqual(reader.readinto(b), 3) |
|
650 |
self.assertEqual(b[0:3], b |
|
|
666 | self.assertEqual(b[0:3], b"foo") | |
|
651 | 667 | self.assertEqual(reader.readinto(b), 0) |
|
652 |
self.assertEqual(b[0:3], b |
|
|
668 | self.assertEqual(b[0:3], b"foo") | |
|
653 | 669 | |
|
654 | 670 | # readinto() with small reads. |
|
655 | 671 | b = bytearray(1024) |
|
656 | 672 | reader = dctx.stream_reader(foo, read_size=1) |
|
657 | 673 | self.assertEqual(reader.readinto(b), 3) |
|
658 |
self.assertEqual(b[0:3], b |
|
|
674 | self.assertEqual(b[0:3], b"foo") | |
|
659 | 675 | |
|
660 | 676 | # Too small destination buffer. |
|
661 | 677 | b = bytearray(2) |
|
662 | 678 | reader = dctx.stream_reader(foo) |
|
663 | 679 | self.assertEqual(reader.readinto(b), 2) |
|
664 |
self.assertEqual(b[:], b |
|
|
680 | self.assertEqual(b[:], b"fo") | |
|
665 | 681 | |
|
666 | 682 | def test_readinto1(self): |
|
667 | 683 | cctx = zstd.ZstdCompressor() |
|
668 |
foo = cctx.compress(b |
|
|
684 | foo = cctx.compress(b"foo") | |
|
669 | 685 | |
|
670 | 686 | dctx = zstd.ZstdDecompressor() |
|
671 | 687 | |
|
672 | 688 | reader = dctx.stream_reader(foo) |
|
673 | 689 | with self.assertRaises(Exception): |
|
674 |
reader.readinto1(b |
|
|
690 | reader.readinto1(b"foobar") | |
|
675 | 691 | |
|
676 | 692 | # Sufficiently large destination. |
|
677 | 693 | b = bytearray(1024) |
|
678 | 694 | reader = dctx.stream_reader(foo) |
|
679 | 695 | self.assertEqual(reader.readinto1(b), 3) |
|
680 |
self.assertEqual(b[0:3], b |
|
|
696 | self.assertEqual(b[0:3], b"foo") | |
|
681 | 697 | self.assertEqual(reader.readinto1(b), 0) |
|
682 |
self.assertEqual(b[0:3], b |
|
|
698 | self.assertEqual(b[0:3], b"foo") | |
|
683 | 699 | |
|
684 | 700 | # readinto() with small reads. |
|
685 | 701 | b = bytearray(1024) |
|
686 | 702 | reader = dctx.stream_reader(foo, read_size=1) |
|
687 | 703 | self.assertEqual(reader.readinto1(b), 3) |
|
688 |
self.assertEqual(b[0:3], b |
|
|
704 | self.assertEqual(b[0:3], b"foo") | |
|
689 | 705 | |
|
690 | 706 | # Too small destination buffer. |
|
691 | 707 | b = bytearray(2) |
|
692 | 708 | reader = dctx.stream_reader(foo) |
|
693 | 709 | self.assertEqual(reader.readinto1(b), 2) |
|
694 |
self.assertEqual(b[:], b |
|
|
710 | self.assertEqual(b[:], b"fo") | |
|
695 | 711 | |
|
696 | 712 | def test_readall(self): |
|
697 | 713 | cctx = zstd.ZstdCompressor() |
|
698 |
foo = cctx.compress(b |
|
|
714 | foo = cctx.compress(b"foo") | |
|
699 | 715 | |
|
700 | 716 | dctx = zstd.ZstdDecompressor() |
|
701 | 717 | reader = dctx.stream_reader(foo) |
|
702 | 718 | |
|
703 |
self.assertEqual(reader.readall(), b |
|
|
719 | self.assertEqual(reader.readall(), b"foo") | |
|
704 | 720 | |
|
705 | 721 | def test_read1(self): |
|
706 | 722 | cctx = zstd.ZstdCompressor() |
|
707 |
foo = cctx.compress(b |
|
|
723 | foo = cctx.compress(b"foo") | |
|
708 | 724 | |
|
709 | 725 | dctx = zstd.ZstdDecompressor() |
|
710 | 726 | |
|
711 | 727 | b = OpCountingBytesIO(foo) |
|
712 | 728 | reader = dctx.stream_reader(b) |
|
713 | 729 | |
|
714 |
self.assertEqual(reader.read1(), b |
|
|
730 | self.assertEqual(reader.read1(), b"foo") | |
|
715 | 731 | self.assertEqual(b._read_count, 1) |
|
716 | 732 | |
|
717 | 733 | b = OpCountingBytesIO(foo) |
|
718 | 734 | reader = dctx.stream_reader(b) |
|
719 | 735 | |
|
720 |
self.assertEqual(reader.read1(0), b |
|
|
721 |
self.assertEqual(reader.read1(2), b |
|
|
736 | self.assertEqual(reader.read1(0), b"") | |
|
737 | self.assertEqual(reader.read1(2), b"fo") | |
|
722 | 738 | self.assertEqual(b._read_count, 1) |
|
723 |
self.assertEqual(reader.read1(1), b |
|
|
739 | self.assertEqual(reader.read1(1), b"o") | |
|
724 | 740 | self.assertEqual(b._read_count, 1) |
|
725 |
self.assertEqual(reader.read1(1), b |
|
|
741 | self.assertEqual(reader.read1(1), b"") | |
|
726 | 742 | self.assertEqual(b._read_count, 2) |
|
727 | 743 | |
|
728 | 744 | def test_read_lines(self): |
|
729 | 745 | cctx = zstd.ZstdCompressor() |
|
730 |
source = b |
|
|
746 | source = b"\n".join(("line %d" % i).encode("ascii") for i in range(1024)) | |
|
731 | 747 | |
|
732 | 748 | frame = cctx.compress(source) |
|
733 | 749 | |
|
734 | 750 | dctx = zstd.ZstdDecompressor() |
|
735 | 751 | reader = dctx.stream_reader(frame) |
|
736 |
tr = io.TextIOWrapper(reader, encoding= |
|
|
752 | tr = io.TextIOWrapper(reader, encoding="utf-8") | |
|
737 | 753 | |
|
738 | 754 | lines = [] |
|
739 | 755 | for line in tr: |
|
740 |
lines.append(line.encode( |
|
|
756 | lines.append(line.encode("utf-8")) | |
|
741 | 757 | |
|
742 | 758 | self.assertEqual(len(lines), 1024) |
|
743 |
self.assertEqual(b |
|
|
759 | self.assertEqual(b"".join(lines), source) | |
|
744 | 760 | |
|
745 | 761 | reader = dctx.stream_reader(frame) |
|
746 |
tr = io.TextIOWrapper(reader, encoding= |
|
|
762 | tr = io.TextIOWrapper(reader, encoding="utf-8") | |
|
747 | 763 | |
|
748 | 764 | lines = tr.readlines() |
|
749 | 765 | self.assertEqual(len(lines), 1024) |
|
750 |
self.assertEqual( |
|
|
766 | self.assertEqual("".join(lines).encode("utf-8"), source) | |
|
751 | 767 | |
|
752 | 768 | reader = dctx.stream_reader(frame) |
|
753 |
tr = io.TextIOWrapper(reader, encoding= |
|
|
769 | tr = io.TextIOWrapper(reader, encoding="utf-8") | |
|
754 | 770 | |
|
755 | 771 | lines = [] |
|
756 | 772 | while True: |
@@ -758,26 +774,26 class TestDecompressor_stream_reader(uni | |||
|
758 | 774 | if not line: |
|
759 | 775 | break |
|
760 | 776 | |
|
761 |
lines.append(line.encode( |
|
|
777 | lines.append(line.encode("utf-8")) | |
|
762 | 778 | |
|
763 | 779 | self.assertEqual(len(lines), 1024) |
|
764 |
self.assertEqual(b |
|
|
780 | self.assertEqual(b"".join(lines), source) | |
|
765 | 781 | |
|
766 | 782 | |
|
767 | 783 | @make_cffi |
|
768 |
class TestDecompressor_decompressobj( |
|
|
784 | class TestDecompressor_decompressobj(TestCase): | |
|
769 | 785 | def test_simple(self): |
|
770 |
data = zstd.ZstdCompressor(level=1).compress(b |
|
|
786 | data = zstd.ZstdCompressor(level=1).compress(b"foobar") | |
|
771 | 787 | |
|
772 | 788 | dctx = zstd.ZstdDecompressor() |
|
773 | 789 | dobj = dctx.decompressobj() |
|
774 |
self.assertEqual(dobj.decompress(data), b |
|
|
790 | self.assertEqual(dobj.decompress(data), b"foobar") | |
|
775 | 791 | self.assertIsNone(dobj.flush()) |
|
776 | 792 | self.assertIsNone(dobj.flush(10)) |
|
777 | 793 | self.assertIsNone(dobj.flush(length=100)) |
|
778 | 794 | |
|
779 | 795 | def test_input_types(self): |
|
780 |
compressed = zstd.ZstdCompressor(level=1).compress(b |
|
|
796 | compressed = zstd.ZstdCompressor(level=1).compress(b"foo") | |
|
781 | 797 | |
|
782 | 798 | dctx = zstd.ZstdDecompressor() |
|
783 | 799 | |
@@ -795,28 +811,28 class TestDecompressor_decompressobj(uni | |||
|
795 | 811 | self.assertIsNone(dobj.flush()) |
|
796 | 812 | self.assertIsNone(dobj.flush(10)) |
|
797 | 813 | self.assertIsNone(dobj.flush(length=100)) |
|
798 |
self.assertEqual(dobj.decompress(source), b |
|
|
814 | self.assertEqual(dobj.decompress(source), b"foo") | |
|
799 | 815 | self.assertIsNone(dobj.flush()) |
|
800 | 816 | |
|
801 | 817 | def test_reuse(self): |
|
802 |
data = zstd.ZstdCompressor(level=1).compress(b |
|
|
818 | data = zstd.ZstdCompressor(level=1).compress(b"foobar") | |
|
803 | 819 | |
|
804 | 820 | dctx = zstd.ZstdDecompressor() |
|
805 | 821 | dobj = dctx.decompressobj() |
|
806 | 822 | dobj.decompress(data) |
|
807 | 823 | |
|
808 |
with self.assertRaisesRegex |
|
|
824 | with self.assertRaisesRegex(zstd.ZstdError, "cannot use a decompressobj"): | |
|
809 | 825 | dobj.decompress(data) |
|
810 | 826 | self.assertIsNone(dobj.flush()) |
|
811 | 827 | |
|
812 | 828 | def test_bad_write_size(self): |
|
813 | 829 | dctx = zstd.ZstdDecompressor() |
|
814 | 830 | |
|
815 |
with self.assertRaisesRegex |
|
|
831 | with self.assertRaisesRegex(ValueError, "write_size must be positive"): | |
|
816 | 832 | dctx.decompressobj(write_size=0) |
|
817 | 833 | |
|
818 | 834 | def test_write_size(self): |
|
819 |
source = b |
|
|
835 | source = b"foo" * 64 + b"bar" * 128 | |
|
820 | 836 | data = zstd.ZstdCompressor(level=1).compress(source) |
|
821 | 837 | |
|
822 | 838 | dctx = zstd.ZstdDecompressor() |
@@ -836,7 +852,7 def decompress_via_writer(data): | |||
|
836 | 852 | |
|
837 | 853 | |
|
838 | 854 | @make_cffi |
|
839 |
class TestDecompressor_stream_writer( |
|
|
855 | class TestDecompressor_stream_writer(TestCase): | |
|
840 | 856 | def test_io_api(self): |
|
841 | 857 | buffer = io.BytesIO() |
|
842 | 858 | dctx = zstd.ZstdDecompressor() |
@@ -908,14 +924,14 class TestDecompressor_stream_writer(uni | |||
|
908 | 924 | writer.fileno() |
|
909 | 925 | |
|
910 | 926 | def test_fileno_file(self): |
|
911 |
with tempfile.TemporaryFile( |
|
|
927 | with tempfile.TemporaryFile("wb") as tf: | |
|
912 | 928 | dctx = zstd.ZstdDecompressor() |
|
913 | 929 | writer = dctx.stream_writer(tf) |
|
914 | 930 | |
|
915 | 931 | self.assertEqual(writer.fileno(), tf.fileno()) |
|
916 | 932 | |
|
917 | 933 | def test_close(self): |
|
918 |
foo = zstd.ZstdCompressor().compress(b |
|
|
934 | foo = zstd.ZstdCompressor().compress(b"foo") | |
|
919 | 935 | |
|
920 | 936 | buffer = NonClosingBytesIO() |
|
921 | 937 | dctx = zstd.ZstdDecompressor() |
@@ -928,17 +944,17 class TestDecompressor_stream_writer(uni | |||
|
928 | 944 | self.assertTrue(writer.closed) |
|
929 | 945 | self.assertTrue(buffer.closed) |
|
930 | 946 | |
|
931 |
with self.assertRaisesRegex |
|
|
932 |
writer.write(b |
|
|
947 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
948 | writer.write(b"") | |
|
933 | 949 | |
|
934 |
with self.assertRaisesRegex |
|
|
950 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
935 | 951 | writer.flush() |
|
936 | 952 | |
|
937 |
with self.assertRaisesRegex |
|
|
953 | with self.assertRaisesRegex(ValueError, "stream is closed"): | |
|
938 | 954 | with writer: |
|
939 | 955 | pass |
|
940 | 956 | |
|
941 |
self.assertEqual(buffer.getvalue(), b |
|
|
957 | self.assertEqual(buffer.getvalue(), b"foo") | |
|
942 | 958 | |
|
943 | 959 | # Context manager exit should close stream. |
|
944 | 960 | buffer = NonClosingBytesIO() |
@@ -948,7 +964,7 class TestDecompressor_stream_writer(uni | |||
|
948 | 964 | writer.write(foo) |
|
949 | 965 | |
|
950 | 966 | self.assertTrue(writer.closed) |
|
951 |
self.assertEqual(buffer.getvalue(), b |
|
|
967 | self.assertEqual(buffer.getvalue(), b"foo") | |
|
952 | 968 | |
|
953 | 969 | def test_flush(self): |
|
954 | 970 | buffer = OpCountingBytesIO() |
@@ -962,12 +978,12 class TestDecompressor_stream_writer(uni | |||
|
962 | 978 | |
|
963 | 979 | def test_empty_roundtrip(self): |
|
964 | 980 | cctx = zstd.ZstdCompressor() |
|
965 |
empty = cctx.compress(b |
|
|
966 |
self.assertEqual(decompress_via_writer(empty), b |
|
|
981 | empty = cctx.compress(b"") | |
|
982 | self.assertEqual(decompress_via_writer(empty), b"") | |
|
967 | 983 | |
|
968 | 984 | def test_input_types(self): |
|
969 | 985 | cctx = zstd.ZstdCompressor(level=1) |
|
970 |
compressed = cctx.compress(b |
|
|
986 | compressed = cctx.compress(b"foo") | |
|
971 | 987 | |
|
972 | 988 | mutable_array = bytearray(len(compressed)) |
|
973 | 989 | mutable_array[:] = compressed |
@@ -984,25 +1000,25 class TestDecompressor_stream_writer(uni | |||
|
984 | 1000 | |
|
985 | 1001 | decompressor = dctx.stream_writer(buffer) |
|
986 | 1002 | decompressor.write(source) |
|
987 |
self.assertEqual(buffer.getvalue(), b |
|
|
1003 | self.assertEqual(buffer.getvalue(), b"foo") | |
|
988 | 1004 | |
|
989 | 1005 | buffer = NonClosingBytesIO() |
|
990 | 1006 | |
|
991 | 1007 | with dctx.stream_writer(buffer) as decompressor: |
|
992 | 1008 | self.assertEqual(decompressor.write(source), 3) |
|
993 | 1009 | |
|
994 |
self.assertEqual(buffer.getvalue(), b |
|
|
1010 | self.assertEqual(buffer.getvalue(), b"foo") | |
|
995 | 1011 | |
|
996 | 1012 | buffer = io.BytesIO() |
|
997 | 1013 | writer = dctx.stream_writer(buffer, write_return_read=True) |
|
998 | 1014 | self.assertEqual(writer.write(source), len(source)) |
|
999 |
self.assertEqual(buffer.getvalue(), b |
|
|
1015 | self.assertEqual(buffer.getvalue(), b"foo") | |
|
1000 | 1016 | |
|
1001 | 1017 | def test_large_roundtrip(self): |
|
1002 | 1018 | chunks = [] |
|
1003 | 1019 | for i in range(255): |
|
1004 |
chunks.append(struct.Struct( |
|
|
1005 |
orig = b |
|
|
1020 | chunks.append(struct.Struct(">B").pack(i) * 16384) | |
|
1021 | orig = b"".join(chunks) | |
|
1006 | 1022 | cctx = zstd.ZstdCompressor() |
|
1007 | 1023 | compressed = cctx.compress(orig) |
|
1008 | 1024 | |
@@ -1012,9 +1028,9 class TestDecompressor_stream_writer(uni | |||
|
1012 | 1028 | chunks = [] |
|
1013 | 1029 | for i in range(255): |
|
1014 | 1030 | for j in range(255): |
|
1015 |
chunks.append(struct.Struct( |
|
|
1031 | chunks.append(struct.Struct(">B").pack(j) * i) | |
|
1016 | 1032 | |
|
1017 |
orig = b |
|
|
1033 | orig = b"".join(chunks) | |
|
1018 | 1034 | cctx = zstd.ZstdCompressor() |
|
1019 | 1035 | compressed = cctx.compress(orig) |
|
1020 | 1036 | |
@@ -1042,13 +1058,13 class TestDecompressor_stream_writer(uni | |||
|
1042 | 1058 | def test_dictionary(self): |
|
1043 | 1059 | samples = [] |
|
1044 | 1060 | for i in range(128): |
|
1045 |
samples.append(b |
|
|
1046 |
samples.append(b |
|
|
1047 |
samples.append(b |
|
|
1061 | samples.append(b"foo" * 64) | |
|
1062 | samples.append(b"bar" * 64) | |
|
1063 | samples.append(b"foobar" * 64) | |
|
1048 | 1064 | |
|
1049 | 1065 | d = zstd.train_dictionary(8192, samples) |
|
1050 | 1066 | |
|
1051 |
orig = b |
|
|
1067 | orig = b"foobar" * 16384 | |
|
1052 | 1068 | buffer = NonClosingBytesIO() |
|
1053 | 1069 | cctx = zstd.ZstdCompressor(dict_data=d) |
|
1054 | 1070 | with cctx.stream_writer(buffer) as compressor: |
@@ -1083,22 +1099,22 class TestDecompressor_stream_writer(uni | |||
|
1083 | 1099 | self.assertGreater(size, 100000) |
|
1084 | 1100 | |
|
1085 | 1101 | def test_write_size(self): |
|
1086 |
source = zstd.ZstdCompressor().compress(b |
|
|
1102 | source = zstd.ZstdCompressor().compress(b"foobarfoobar") | |
|
1087 | 1103 | dest = OpCountingBytesIO() |
|
1088 | 1104 | dctx = zstd.ZstdDecompressor() |
|
1089 | 1105 | with dctx.stream_writer(dest, write_size=1) as decompressor: |
|
1090 |
s = struct.Struct( |
|
|
1106 | s = struct.Struct(">B") | |
|
1091 | 1107 | for c in source: |
|
1092 | 1108 | if not isinstance(c, str): |
|
1093 | 1109 | c = s.pack(c) |
|
1094 | 1110 | decompressor.write(c) |
|
1095 | 1111 | |
|
1096 |
self.assertEqual(dest.getvalue(), b |
|
|
1112 | self.assertEqual(dest.getvalue(), b"foobarfoobar") | |
|
1097 | 1113 | self.assertEqual(dest._write_count, len(dest.getvalue())) |
|
1098 | 1114 | |
|
1099 | 1115 | |
|
1100 | 1116 | @make_cffi |
|
1101 |
class TestDecompressor_read_to_iter( |
|
|
1117 | class TestDecompressor_read_to_iter(TestCase): | |
|
1102 | 1118 | def test_type_validation(self): |
|
1103 | 1119 | dctx = zstd.ZstdDecompressor() |
|
1104 | 1120 | |
@@ -1106,10 +1122,10 class TestDecompressor_read_to_iter(unit | |||
|
1106 | 1122 | dctx.read_to_iter(io.BytesIO()) |
|
1107 | 1123 | |
|
1108 | 1124 | # Buffer protocol works. |
|
1109 |
dctx.read_to_iter(b |
|
|
1125 | dctx.read_to_iter(b"foobar") | |
|
1110 | 1126 | |
|
1111 |
with self.assertRaisesRegex |
|
|
1112 |
b |
|
|
1127 | with self.assertRaisesRegex(ValueError, "must pass an object with a read"): | |
|
1128 | b"".join(dctx.read_to_iter(True)) | |
|
1113 | 1129 | |
|
1114 | 1130 | def test_empty_input(self): |
|
1115 | 1131 | dctx = zstd.ZstdDecompressor() |
@@ -1120,25 +1136,25 class TestDecompressor_read_to_iter(unit | |||
|
1120 | 1136 | with self.assertRaises(StopIteration): |
|
1121 | 1137 | next(it) |
|
1122 | 1138 | |
|
1123 |
it = dctx.read_to_iter(b |
|
|
1139 | it = dctx.read_to_iter(b"") | |
|
1124 | 1140 | with self.assertRaises(StopIteration): |
|
1125 | 1141 | next(it) |
|
1126 | 1142 | |
|
1127 | 1143 | def test_invalid_input(self): |
|
1128 | 1144 | dctx = zstd.ZstdDecompressor() |
|
1129 | 1145 | |
|
1130 |
source = io.BytesIO(b |
|
|
1146 | source = io.BytesIO(b"foobar") | |
|
1131 | 1147 | it = dctx.read_to_iter(source) |
|
1132 |
with self.assertRaisesRegex |
|
|
1148 | with self.assertRaisesRegex(zstd.ZstdError, "Unknown frame descriptor"): | |
|
1133 | 1149 | next(it) |
|
1134 | 1150 | |
|
1135 |
it = dctx.read_to_iter(b |
|
|
1136 |
with self.assertRaisesRegex |
|
|
1151 | it = dctx.read_to_iter(b"foobar") | |
|
1152 | with self.assertRaisesRegex(zstd.ZstdError, "Unknown frame descriptor"): | |
|
1137 | 1153 | next(it) |
|
1138 | 1154 | |
|
1139 | 1155 | def test_empty_roundtrip(self): |
|
1140 | 1156 | cctx = zstd.ZstdCompressor(level=1, write_content_size=False) |
|
1141 |
empty = cctx.compress(b |
|
|
1157 | empty = cctx.compress(b"") | |
|
1142 | 1158 | |
|
1143 | 1159 | source = io.BytesIO(empty) |
|
1144 | 1160 | source.seek(0) |
@@ -1157,24 +1173,28 class TestDecompressor_read_to_iter(unit | |||
|
1157 | 1173 | def test_skip_bytes_too_large(self): |
|
1158 | 1174 | dctx = zstd.ZstdDecompressor() |
|
1159 | 1175 | |
|
1160 | with self.assertRaisesRegexp(ValueError, 'skip_bytes must be smaller than read_size'): | |
|
1161 | b''.join(dctx.read_to_iter(b'', skip_bytes=1, read_size=1)) | |
|
1176 | with self.assertRaisesRegex( | |
|
1177 | ValueError, "skip_bytes must be smaller than read_size" | |
|
1178 | ): | |
|
1179 | b"".join(dctx.read_to_iter(b"", skip_bytes=1, read_size=1)) | |
|
1162 | 1180 | |
|
1163 | with self.assertRaisesRegexp(ValueError, 'skip_bytes larger than first input chunk'): | |
|
1164 | b''.join(dctx.read_to_iter(b'foobar', skip_bytes=10)) | |
|
1181 | with self.assertRaisesRegex( | |
|
1182 | ValueError, "skip_bytes larger than first input chunk" | |
|
1183 | ): | |
|
1184 | b"".join(dctx.read_to_iter(b"foobar", skip_bytes=10)) | |
|
1165 | 1185 | |
|
1166 | 1186 | def test_skip_bytes(self): |
|
1167 | 1187 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
1168 |
compressed = cctx.compress(b |
|
|
1188 | compressed = cctx.compress(b"foobar") | |
|
1169 | 1189 | |
|
1170 | 1190 | dctx = zstd.ZstdDecompressor() |
|
1171 |
output = b |
|
|
1172 |
self.assertEqual(output, b |
|
|
1191 | output = b"".join(dctx.read_to_iter(b"hdr" + compressed, skip_bytes=3)) | |
|
1192 | self.assertEqual(output, b"foobar") | |
|
1173 | 1193 | |
|
1174 | 1194 | def test_large_output(self): |
|
1175 | 1195 | source = io.BytesIO() |
|
1176 |
source.write(b |
|
|
1177 |
source.write(b |
|
|
1196 | source.write(b"f" * zstd.DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE) | |
|
1197 | source.write(b"o") | |
|
1178 | 1198 | source.seek(0) |
|
1179 | 1199 | |
|
1180 | 1200 | cctx = zstd.ZstdCompressor(level=1) |
@@ -1191,7 +1211,7 class TestDecompressor_read_to_iter(unit | |||
|
1191 | 1211 | with self.assertRaises(StopIteration): |
|
1192 | 1212 | next(it) |
|
1193 | 1213 | |
|
1194 |
decompressed = b |
|
|
1214 | decompressed = b"".join(chunks) | |
|
1195 | 1215 | self.assertEqual(decompressed, source.getvalue()) |
|
1196 | 1216 | |
|
1197 | 1217 | # And again with buffer protocol. |
@@ -1203,12 +1223,12 class TestDecompressor_read_to_iter(unit | |||
|
1203 | 1223 | with self.assertRaises(StopIteration): |
|
1204 | 1224 | next(it) |
|
1205 | 1225 | |
|
1206 |
decompressed = b |
|
|
1226 | decompressed = b"".join(chunks) | |
|
1207 | 1227 | self.assertEqual(decompressed, source.getvalue()) |
|
1208 | 1228 | |
|
1209 |
@unittest.skipUnless( |
|
|
1229 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
1210 | 1230 | def test_large_input(self): |
|
1211 |
bytes = list(struct.Struct( |
|
|
1231 | bytes = list(struct.Struct(">B").pack(i) for i in range(256)) | |
|
1212 | 1232 | compressed = NonClosingBytesIO() |
|
1213 | 1233 | input_size = 0 |
|
1214 | 1234 | cctx = zstd.ZstdCompressor(level=1) |
@@ -1217,14 +1237,18 class TestDecompressor_read_to_iter(unit | |||
|
1217 | 1237 | compressor.write(random.choice(bytes)) |
|
1218 | 1238 | input_size += 1 |
|
1219 | 1239 | |
|
1220 | have_compressed = len(compressed.getvalue()) > zstd.DECOMPRESSION_RECOMMENDED_INPUT_SIZE | |
|
1240 | have_compressed = ( | |
|
1241 | len(compressed.getvalue()) | |
|
1242 | > zstd.DECOMPRESSION_RECOMMENDED_INPUT_SIZE | |
|
1243 | ) | |
|
1221 | 1244 | have_raw = input_size > zstd.DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE * 2 |
|
1222 | 1245 | if have_compressed and have_raw: |
|
1223 | 1246 | break |
|
1224 | 1247 | |
|
1225 | 1248 | compressed = io.BytesIO(compressed.getvalue()) |
|
1226 |
self.assertGreater( |
|
|
1227 |
|
|
|
1249 | self.assertGreater( | |
|
1250 | len(compressed.getvalue()), zstd.DECOMPRESSION_RECOMMENDED_INPUT_SIZE | |
|
1251 | ) | |
|
1228 | 1252 | |
|
1229 | 1253 | dctx = zstd.ZstdDecompressor() |
|
1230 | 1254 | it = dctx.read_to_iter(compressed) |
@@ -1237,7 +1261,7 class TestDecompressor_read_to_iter(unit | |||
|
1237 | 1261 | with self.assertRaises(StopIteration): |
|
1238 | 1262 | next(it) |
|
1239 | 1263 | |
|
1240 |
decompressed = b |
|
|
1264 | decompressed = b"".join(chunks) | |
|
1241 | 1265 | self.assertEqual(len(decompressed), input_size) |
|
1242 | 1266 | |
|
1243 | 1267 | # And again with buffer protocol. |
@@ -1251,7 +1275,7 class TestDecompressor_read_to_iter(unit | |||
|
1251 | 1275 | with self.assertRaises(StopIteration): |
|
1252 | 1276 | next(it) |
|
1253 | 1277 | |
|
1254 |
decompressed = b |
|
|
1278 | decompressed = b"".join(chunks) | |
|
1255 | 1279 | self.assertEqual(len(decompressed), input_size) |
|
1256 | 1280 | |
|
1257 | 1281 | def test_interesting(self): |
@@ -1263,22 +1287,23 class TestDecompressor_read_to_iter(unit | |||
|
1263 | 1287 | compressed = NonClosingBytesIO() |
|
1264 | 1288 | with cctx.stream_writer(compressed) as compressor: |
|
1265 | 1289 | for i in range(256): |
|
1266 |
chunk = b |
|
|
1290 | chunk = b"\0" * 1024 | |
|
1267 | 1291 | compressor.write(chunk) |
|
1268 | 1292 | source.write(chunk) |
|
1269 | 1293 | |
|
1270 | 1294 | dctx = zstd.ZstdDecompressor() |
|
1271 | 1295 | |
|
1272 |
simple = dctx.decompress( |
|
|
1273 |
|
|
|
1296 | simple = dctx.decompress( | |
|
1297 | compressed.getvalue(), max_output_size=len(source.getvalue()) | |
|
1298 | ) | |
|
1274 | 1299 | self.assertEqual(simple, source.getvalue()) |
|
1275 | 1300 | |
|
1276 | 1301 | compressed = io.BytesIO(compressed.getvalue()) |
|
1277 |
streamed = b |
|
|
1302 | streamed = b"".join(dctx.read_to_iter(compressed)) | |
|
1278 | 1303 | self.assertEqual(streamed, source.getvalue()) |
|
1279 | 1304 | |
|
1280 | 1305 | def test_read_write_size(self): |
|
1281 |
source = OpCountingBytesIO(zstd.ZstdCompressor().compress(b |
|
|
1306 | source = OpCountingBytesIO(zstd.ZstdCompressor().compress(b"foobarfoobar")) | |
|
1282 | 1307 | dctx = zstd.ZstdDecompressor() |
|
1283 | 1308 | for chunk in dctx.read_to_iter(source, read_size=1, write_size=1): |
|
1284 | 1309 | self.assertEqual(len(chunk), 1) |
@@ -1287,97 +1312,110 class TestDecompressor_read_to_iter(unit | |||
|
1287 | 1312 | |
|
1288 | 1313 | def test_magic_less(self): |
|
1289 | 1314 | params = zstd.CompressionParameters.from_level( |
|
1290 |
1, format=zstd.FORMAT_ZSTD1_MAGICLESS |
|
|
1315 | 1, format=zstd.FORMAT_ZSTD1_MAGICLESS | |
|
1316 | ) | |
|
1291 | 1317 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
1292 |
frame = cctx.compress(b |
|
|
1318 | frame = cctx.compress(b"foobar") | |
|
1293 | 1319 | |
|
1294 |
self.assertNotEqual(frame[0:4], b |
|
|
1320 | self.assertNotEqual(frame[0:4], b"\x28\xb5\x2f\xfd") | |
|
1295 | 1321 | |
|
1296 | 1322 | dctx = zstd.ZstdDecompressor() |
|
1297 |
with self.assertRaisesRegex |
|
|
1298 |
zstd.ZstdError, |
|
|
1323 | with self.assertRaisesRegex( | |
|
1324 | zstd.ZstdError, "error determining content size from frame header" | |
|
1325 | ): | |
|
1299 | 1326 | dctx.decompress(frame) |
|
1300 | 1327 | |
|
1301 | 1328 | dctx = zstd.ZstdDecompressor(format=zstd.FORMAT_ZSTD1_MAGICLESS) |
|
1302 |
res = b |
|
|
1303 |
self.assertEqual(res, b |
|
|
1329 | res = b"".join(dctx.read_to_iter(frame)) | |
|
1330 | self.assertEqual(res, b"foobar") | |
|
1304 | 1331 | |
|
1305 | 1332 | |
|
1306 | 1333 | @make_cffi |
|
1307 |
class TestDecompressor_content_dict_chain( |
|
|
1334 | class TestDecompressor_content_dict_chain(TestCase): | |
|
1308 | 1335 | def test_bad_inputs_simple(self): |
|
1309 | 1336 | dctx = zstd.ZstdDecompressor() |
|
1310 | 1337 | |
|
1311 | 1338 | with self.assertRaises(TypeError): |
|
1312 |
dctx.decompress_content_dict_chain(b |
|
|
1339 | dctx.decompress_content_dict_chain(b"foo") | |
|
1313 | 1340 | |
|
1314 | 1341 | with self.assertRaises(TypeError): |
|
1315 |
dctx.decompress_content_dict_chain((b |
|
|
1342 | dctx.decompress_content_dict_chain((b"foo", b"bar")) | |
|
1316 | 1343 | |
|
1317 |
with self.assertRaisesRegex |
|
|
1344 | with self.assertRaisesRegex(ValueError, "empty input chain"): | |
|
1318 | 1345 | dctx.decompress_content_dict_chain([]) |
|
1319 | 1346 | |
|
1320 |
with self.assertRaisesRegex |
|
|
1321 |
dctx.decompress_content_dict_chain([u |
|
|
1347 | with self.assertRaisesRegex(ValueError, "chunk 0 must be bytes"): | |
|
1348 | dctx.decompress_content_dict_chain([u"foo"]) | |
|
1322 | 1349 | |
|
1323 |
with self.assertRaisesRegex |
|
|
1350 | with self.assertRaisesRegex(ValueError, "chunk 0 must be bytes"): | |
|
1324 | 1351 | dctx.decompress_content_dict_chain([True]) |
|
1325 | 1352 | |
|
1326 | with self.assertRaisesRegexp(ValueError, 'chunk 0 is too small to contain a zstd frame'): | |
|
1353 | with self.assertRaisesRegex( | |
|
1354 | ValueError, "chunk 0 is too small to contain a zstd frame" | |
|
1355 | ): | |
|
1327 | 1356 | dctx.decompress_content_dict_chain([zstd.FRAME_HEADER]) |
|
1328 | 1357 | |
|
1329 |
with self.assertRaisesRegex |
|
|
1330 |
dctx.decompress_content_dict_chain([b |
|
|
1358 | with self.assertRaisesRegex(ValueError, "chunk 0 is not a valid zstd frame"): | |
|
1359 | dctx.decompress_content_dict_chain([b"foo" * 8]) | |
|
1331 | 1360 | |
|
1332 |
no_size = zstd.ZstdCompressor(write_content_size=False).compress(b |
|
|
1361 | no_size = zstd.ZstdCompressor(write_content_size=False).compress(b"foo" * 64) | |
|
1333 | 1362 | |
|
1334 | with self.assertRaisesRegexp(ValueError, 'chunk 0 missing content size in frame'): | |
|
1363 | with self.assertRaisesRegex( | |
|
1364 | ValueError, "chunk 0 missing content size in frame" | |
|
1365 | ): | |
|
1335 | 1366 | dctx.decompress_content_dict_chain([no_size]) |
|
1336 | 1367 | |
|
1337 | 1368 | # Corrupt first frame. |
|
1338 |
frame = zstd.ZstdCompressor().compress(b |
|
|
1369 | frame = zstd.ZstdCompressor().compress(b"foo" * 64) | |
|
1339 | 1370 | frame = frame[0:12] + frame[15:] |
|
1340 |
with self.assertRaisesRegex |
|
|
1341 |
|
|
|
1371 | with self.assertRaisesRegex( | |
|
1372 | zstd.ZstdError, "chunk 0 did not decompress full frame" | |
|
1373 | ): | |
|
1342 | 1374 | dctx.decompress_content_dict_chain([frame]) |
|
1343 | 1375 | |
|
1344 | 1376 | def test_bad_subsequent_input(self): |
|
1345 |
initial = zstd.ZstdCompressor().compress(b |
|
|
1377 | initial = zstd.ZstdCompressor().compress(b"foo" * 64) | |
|
1346 | 1378 | |
|
1347 | 1379 | dctx = zstd.ZstdDecompressor() |
|
1348 | 1380 | |
|
1349 |
with self.assertRaisesRegex |
|
|
1350 |
dctx.decompress_content_dict_chain([initial, u |
|
|
1381 | with self.assertRaisesRegex(ValueError, "chunk 1 must be bytes"): | |
|
1382 | dctx.decompress_content_dict_chain([initial, u"foo"]) | |
|
1351 | 1383 | |
|
1352 |
with self.assertRaisesRegex |
|
|
1384 | with self.assertRaisesRegex(ValueError, "chunk 1 must be bytes"): | |
|
1353 | 1385 | dctx.decompress_content_dict_chain([initial, None]) |
|
1354 | 1386 | |
|
1355 | with self.assertRaisesRegexp(ValueError, 'chunk 1 is too small to contain a zstd frame'): | |
|
1387 | with self.assertRaisesRegex( | |
|
1388 | ValueError, "chunk 1 is too small to contain a zstd frame" | |
|
1389 | ): | |
|
1356 | 1390 | dctx.decompress_content_dict_chain([initial, zstd.FRAME_HEADER]) |
|
1357 | 1391 | |
|
1358 |
with self.assertRaisesRegex |
|
|
1359 |
dctx.decompress_content_dict_chain([initial, b |
|
|
1392 | with self.assertRaisesRegex(ValueError, "chunk 1 is not a valid zstd frame"): | |
|
1393 | dctx.decompress_content_dict_chain([initial, b"foo" * 8]) | |
|
1360 | 1394 | |
|
1361 |
no_size = zstd.ZstdCompressor(write_content_size=False).compress(b |
|
|
1395 | no_size = zstd.ZstdCompressor(write_content_size=False).compress(b"foo" * 64) | |
|
1362 | 1396 | |
|
1363 | with self.assertRaisesRegexp(ValueError, 'chunk 1 missing content size in frame'): | |
|
1397 | with self.assertRaisesRegex( | |
|
1398 | ValueError, "chunk 1 missing content size in frame" | |
|
1399 | ): | |
|
1364 | 1400 | dctx.decompress_content_dict_chain([initial, no_size]) |
|
1365 | 1401 | |
|
1366 | 1402 | # Corrupt second frame. |
|
1367 |
cctx = zstd.ZstdCompressor(dict_data=zstd.ZstdCompressionDict(b |
|
|
1368 |
frame = cctx.compress(b |
|
|
1403 | cctx = zstd.ZstdCompressor(dict_data=zstd.ZstdCompressionDict(b"foo" * 64)) | |
|
1404 | frame = cctx.compress(b"bar" * 64) | |
|
1369 | 1405 | frame = frame[0:12] + frame[15:] |
|
1370 | 1406 | |
|
1371 | with self.assertRaisesRegexp(zstd.ZstdError, 'chunk 1 did not decompress full frame'): | |
|
1407 | with self.assertRaisesRegex( | |
|
1408 | zstd.ZstdError, "chunk 1 did not decompress full frame" | |
|
1409 | ): | |
|
1372 | 1410 | dctx.decompress_content_dict_chain([initial, frame]) |
|
1373 | 1411 | |
|
1374 | 1412 | def test_simple(self): |
|
1375 | 1413 | original = [ |
|
1376 |
b |
|
|
1377 |
b |
|
|
1378 |
b |
|
|
1379 |
b |
|
|
1380 |
b |
|
|
1414 | b"foo" * 64, | |
|
1415 | b"foobar" * 64, | |
|
1416 | b"baz" * 64, | |
|
1417 | b"foobaz" * 64, | |
|
1418 | b"foobarbaz" * 64, | |
|
1381 | 1419 | ] |
|
1382 | 1420 | |
|
1383 | 1421 | chunks = [] |
@@ -1396,12 +1434,12 class TestDecompressor_content_dict_chai | |||
|
1396 | 1434 | |
|
1397 | 1435 | |
|
1398 | 1436 | # TODO enable for CFFI |
|
1399 |
class TestDecompressor_multi_decompress_to_buffer( |
|
|
1437 | class TestDecompressor_multi_decompress_to_buffer(TestCase): | |
|
1400 | 1438 | def test_invalid_inputs(self): |
|
1401 | 1439 | dctx = zstd.ZstdDecompressor() |
|
1402 | 1440 | |
|
1403 |
if not hasattr(dctx, |
|
|
1404 |
self.skipTest( |
|
|
1441 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1442 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1405 | 1443 | |
|
1406 | 1444 | with self.assertRaises(TypeError): |
|
1407 | 1445 | dctx.multi_decompress_to_buffer(True) |
@@ -1409,22 +1447,24 class TestDecompressor_multi_decompress_ | |||
|
1409 | 1447 | with self.assertRaises(TypeError): |
|
1410 | 1448 | dctx.multi_decompress_to_buffer((1, 2)) |
|
1411 | 1449 | |
|
1412 |
with self.assertRaisesRegex |
|
|
1413 |
dctx.multi_decompress_to_buffer([u |
|
|
1450 | with self.assertRaisesRegex(TypeError, "item 0 not a bytes like object"): | |
|
1451 | dctx.multi_decompress_to_buffer([u"foo"]) | |
|
1414 | 1452 | |
|
1415 | with self.assertRaisesRegexp(ValueError, 'could not determine decompressed size of item 0'): | |
|
1416 | dctx.multi_decompress_to_buffer([b'foobarbaz']) | |
|
1453 | with self.assertRaisesRegex( | |
|
1454 | ValueError, "could not determine decompressed size of item 0" | |
|
1455 | ): | |
|
1456 | dctx.multi_decompress_to_buffer([b"foobarbaz"]) | |
|
1417 | 1457 | |
|
1418 | 1458 | def test_list_input(self): |
|
1419 | 1459 | cctx = zstd.ZstdCompressor() |
|
1420 | 1460 | |
|
1421 |
original = [b |
|
|
1461 | original = [b"foo" * 4, b"bar" * 6] | |
|
1422 | 1462 | frames = [cctx.compress(d) for d in original] |
|
1423 | 1463 | |
|
1424 | 1464 | dctx = zstd.ZstdDecompressor() |
|
1425 | 1465 | |
|
1426 |
if not hasattr(dctx, |
|
|
1427 |
self.skipTest( |
|
|
1466 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1467 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1428 | 1468 | |
|
1429 | 1469 | result = dctx.multi_decompress_to_buffer(frames) |
|
1430 | 1470 | |
@@ -1442,14 +1482,14 class TestDecompressor_multi_decompress_ | |||
|
1442 | 1482 | def test_list_input_frame_sizes(self): |
|
1443 | 1483 | cctx = zstd.ZstdCompressor() |
|
1444 | 1484 | |
|
1445 |
original = [b |
|
|
1485 | original = [b"foo" * 4, b"bar" * 6, b"baz" * 8] | |
|
1446 | 1486 | frames = [cctx.compress(d) for d in original] |
|
1447 |
sizes = struct.pack( |
|
|
1487 | sizes = struct.pack("=" + "Q" * len(original), *map(len, original)) | |
|
1448 | 1488 | |
|
1449 | 1489 | dctx = zstd.ZstdDecompressor() |
|
1450 | 1490 | |
|
1451 |
if not hasattr(dctx, |
|
|
1452 |
self.skipTest( |
|
|
1491 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1492 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1453 | 1493 | |
|
1454 | 1494 | result = dctx.multi_decompress_to_buffer(frames, decompressed_sizes=sizes) |
|
1455 | 1495 | |
@@ -1462,16 +1502,18 class TestDecompressor_multi_decompress_ | |||
|
1462 | 1502 | def test_buffer_with_segments_input(self): |
|
1463 | 1503 | cctx = zstd.ZstdCompressor() |
|
1464 | 1504 | |
|
1465 |
original = [b |
|
|
1505 | original = [b"foo" * 4, b"bar" * 6] | |
|
1466 | 1506 | frames = [cctx.compress(d) for d in original] |
|
1467 | 1507 | |
|
1468 | 1508 | dctx = zstd.ZstdDecompressor() |
|
1469 | 1509 | |
|
1470 |
if not hasattr(dctx, |
|
|
1471 |
self.skipTest( |
|
|
1510 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1511 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1472 | 1512 | |
|
1473 | segments = struct.pack('=QQQQ', 0, len(frames[0]), len(frames[0]), len(frames[1])) | |
|
1474 | b = zstd.BufferWithSegments(b''.join(frames), segments) | |
|
1513 | segments = struct.pack( | |
|
1514 | "=QQQQ", 0, len(frames[0]), len(frames[0]), len(frames[1]) | |
|
1515 | ) | |
|
1516 | b = zstd.BufferWithSegments(b"".join(frames), segments) | |
|
1475 | 1517 | |
|
1476 | 1518 | result = dctx.multi_decompress_to_buffer(b) |
|
1477 | 1519 | |
@@ -1483,19 +1525,25 class TestDecompressor_multi_decompress_ | |||
|
1483 | 1525 | |
|
1484 | 1526 | def test_buffer_with_segments_sizes(self): |
|
1485 | 1527 | cctx = zstd.ZstdCompressor(write_content_size=False) |
|
1486 |
original = [b |
|
|
1528 | original = [b"foo" * 4, b"bar" * 6, b"baz" * 8] | |
|
1487 | 1529 | frames = [cctx.compress(d) for d in original] |
|
1488 |
sizes = struct.pack( |
|
|
1530 | sizes = struct.pack("=" + "Q" * len(original), *map(len, original)) | |
|
1489 | 1531 | |
|
1490 | 1532 | dctx = zstd.ZstdDecompressor() |
|
1491 | 1533 | |
|
1492 |
if not hasattr(dctx, |
|
|
1493 |
self.skipTest( |
|
|
1534 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1535 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1494 | 1536 | |
|
1495 |
segments = struct.pack( |
|
|
1496 | len(frames[0]), len(frames[1]), | |
|
1497 | len(frames[0]) + len(frames[1]), len(frames[2])) | |
|
1498 | b = zstd.BufferWithSegments(b''.join(frames), segments) | |
|
1537 | segments = struct.pack( | |
|
1538 | "=QQQQQQ", | |
|
1539 | 0, | |
|
1540 | len(frames[0]), | |
|
1541 | len(frames[0]), | |
|
1542 | len(frames[1]), | |
|
1543 | len(frames[0]) + len(frames[1]), | |
|
1544 | len(frames[2]), | |
|
1545 | ) | |
|
1546 | b = zstd.BufferWithSegments(b"".join(frames), segments) | |
|
1499 | 1547 | |
|
1500 | 1548 | result = dctx.multi_decompress_to_buffer(b, decompressed_sizes=sizes) |
|
1501 | 1549 | |
@@ -1509,15 +1557,15 class TestDecompressor_multi_decompress_ | |||
|
1509 | 1557 | cctx = zstd.ZstdCompressor() |
|
1510 | 1558 | |
|
1511 | 1559 | original = [ |
|
1512 |
b |
|
|
1513 |
b |
|
|
1514 |
b |
|
|
1515 |
b |
|
|
1516 |
b |
|
|
1560 | b"foo0" * 2, | |
|
1561 | b"foo1" * 3, | |
|
1562 | b"foo2" * 4, | |
|
1563 | b"foo3" * 5, | |
|
1564 | b"foo4" * 6, | |
|
1517 | 1565 | ] |
|
1518 | 1566 | |
|
1519 |
if not hasattr(cctx, |
|
|
1520 |
self.skipTest( |
|
|
1567 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
1568 | self.skipTest("multi_compress_to_buffer not available") | |
|
1521 | 1569 | |
|
1522 | 1570 | frames = cctx.multi_compress_to_buffer(original) |
|
1523 | 1571 | |
@@ -1532,16 +1580,24 class TestDecompressor_multi_decompress_ | |||
|
1532 | 1580 | self.assertEqual(data, decompressed[i].tobytes()) |
|
1533 | 1581 | |
|
1534 | 1582 | # And a manual mode. |
|
1535 |
b = b |
|
|
1536 |
b1 = zstd.BufferWithSegments( |
|
|
1537 | 0, len(frames[0]), | |
|
1538 | len(frames[0]), len(frames[1]))) | |
|
1583 | b = b"".join([frames[0].tobytes(), frames[1].tobytes()]) | |
|
1584 | b1 = zstd.BufferWithSegments( | |
|
1585 | b, struct.pack("=QQQQ", 0, len(frames[0]), len(frames[0]), len(frames[1])) | |
|
1586 | ) | |
|
1539 | 1587 | |
|
1540 |
b = b |
|
|
1541 |
b2 = zstd.BufferWithSegments( |
|
|
1542 | 0, len(frames[2]), | |
|
1543 | len(frames[2]), len(frames[3]), | |
|
1544 | len(frames[2]) + len(frames[3]), len(frames[4]))) | |
|
1588 | b = b"".join([frames[2].tobytes(), frames[3].tobytes(), frames[4].tobytes()]) | |
|
1589 | b2 = zstd.BufferWithSegments( | |
|
1590 | b, | |
|
1591 | struct.pack( | |
|
1592 | "=QQQQQQ", | |
|
1593 | 0, | |
|
1594 | len(frames[2]), | |
|
1595 | len(frames[2]), | |
|
1596 | len(frames[3]), | |
|
1597 | len(frames[2]) + len(frames[3]), | |
|
1598 | len(frames[4]), | |
|
1599 | ), | |
|
1600 | ) | |
|
1545 | 1601 | |
|
1546 | 1602 | c = zstd.BufferWithSegmentsCollection(b1, b2) |
|
1547 | 1603 | |
@@ -1560,8 +1616,8 class TestDecompressor_multi_decompress_ | |||
|
1560 | 1616 | |
|
1561 | 1617 | dctx = zstd.ZstdDecompressor(dict_data=d) |
|
1562 | 1618 | |
|
1563 |
if not hasattr(dctx, |
|
|
1564 |
self.skipTest( |
|
|
1619 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1620 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1565 | 1621 | |
|
1566 | 1622 | result = dctx.multi_decompress_to_buffer(frames) |
|
1567 | 1623 | |
@@ -1571,41 +1627,44 class TestDecompressor_multi_decompress_ | |||
|
1571 | 1627 | cctx = zstd.ZstdCompressor() |
|
1572 | 1628 | |
|
1573 | 1629 | frames = [] |
|
1574 |
frames.extend(cctx.compress(b |
|
|
1575 |
frames.extend(cctx.compress(b |
|
|
1630 | frames.extend(cctx.compress(b"x" * 64) for i in range(256)) | |
|
1631 | frames.extend(cctx.compress(b"y" * 64) for i in range(256)) | |
|
1576 | 1632 | |
|
1577 | 1633 | dctx = zstd.ZstdDecompressor() |
|
1578 | 1634 | |
|
1579 |
if not hasattr(dctx, |
|
|
1580 |
self.skipTest( |
|
|
1635 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1636 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1581 | 1637 | |
|
1582 | 1638 | result = dctx.multi_decompress_to_buffer(frames, threads=-1) |
|
1583 | 1639 | |
|
1584 | 1640 | self.assertEqual(len(result), len(frames)) |
|
1585 | 1641 | self.assertEqual(result.size(), 2 * 64 * 256) |
|
1586 |
self.assertEqual(result[0].tobytes(), b |
|
|
1587 |
self.assertEqual(result[256].tobytes(), b |
|
|
1642 | self.assertEqual(result[0].tobytes(), b"x" * 64) | |
|
1643 | self.assertEqual(result[256].tobytes(), b"y" * 64) | |
|
1588 | 1644 | |
|
1589 | 1645 | def test_item_failure(self): |
|
1590 | 1646 | cctx = zstd.ZstdCompressor() |
|
1591 |
frames = [cctx.compress(b |
|
|
1647 | frames = [cctx.compress(b"x" * 128), cctx.compress(b"y" * 128)] | |
|
1592 | 1648 | |
|
1593 |
frames[1] = frames[1][0:15] + b |
|
|
1649 | frames[1] = frames[1][0:15] + b"extra" + frames[1][15:] | |
|
1594 | 1650 | |
|
1595 | 1651 | dctx = zstd.ZstdDecompressor() |
|
1596 | 1652 | |
|
1597 |
if not hasattr(dctx, |
|
|
1598 |
self.skipTest( |
|
|
1653 | if not hasattr(dctx, "multi_decompress_to_buffer"): | |
|
1654 | self.skipTest("multi_decompress_to_buffer not available") | |
|
1599 | 1655 | |
|
1600 |
with self.assertRaisesRegex |
|
|
1601 | 'error decompressing item 1: (' | |
|
1602 | 'Corrupted block|' | |
|
1603 | 'Destination buffer is too small)'): | |
|
1656 | with self.assertRaisesRegex( | |
|
1657 | zstd.ZstdError, | |
|
1658 | "error decompressing item 1: (" | |
|
1659 | "Corrupted block|" | |
|
1660 | "Destination buffer is too small)", | |
|
1661 | ): | |
|
1604 | 1662 | dctx.multi_decompress_to_buffer(frames) |
|
1605 | 1663 | |
|
1606 |
with self.assertRaisesRegex |
|
|
1607 | 'error decompressing item 1: (' | |
|
1608 | 'Corrupted block|' | |
|
1609 | 'Destination buffer is too small)'): | |
|
1664 | with self.assertRaisesRegex( | |
|
1665 | zstd.ZstdError, | |
|
1666 | "error decompressing item 1: (" | |
|
1667 | "Corrupted block|" | |
|
1668 | "Destination buffer is too small)", | |
|
1669 | ): | |
|
1610 | 1670 | dctx.multi_decompress_to_buffer(frames, threads=2) |
|
1611 |
@@ -6,29 +6,37 try: | |||
|
6 | 6 | import hypothesis |
|
7 | 7 | import hypothesis.strategies as strategies |
|
8 | 8 | except ImportError: |
|
9 |
raise unittest.SkipTest( |
|
|
9 | raise unittest.SkipTest("hypothesis not available") | |
|
10 | 10 | |
|
11 | 11 | import zstandard as zstd |
|
12 | 12 | |
|
13 |
from . |
|
|
13 | from .common import ( | |
|
14 | 14 | make_cffi, |
|
15 | 15 | NonClosingBytesIO, |
|
16 | 16 | random_input_data, |
|
17 | TestCase, | |
|
17 | 18 | ) |
|
18 | 19 | |
|
19 | 20 | |
|
20 |
@unittest.skipUnless( |
|
|
21 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
21 | 22 | @make_cffi |
|
22 |
class TestDecompressor_stream_reader_fuzzing( |
|
|
23 | class TestDecompressor_stream_reader_fuzzing(TestCase): | |
|
23 | 24 | @hypothesis.settings( |
|
24 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
25 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
26 | level=strategies.integers(min_value=1, max_value=5), | |
|
27 | streaming=strategies.booleans(), | |
|
28 | source_read_size=strategies.integers(1, 1048576), | |
|
29 | read_sizes=strategies.data()) | |
|
30 | def test_stream_source_read_variance(self, original, level, streaming, | |
|
31 | source_read_size, read_sizes): | |
|
25 | suppress_health_check=[ | |
|
26 | hypothesis.HealthCheck.large_base_example, | |
|
27 | hypothesis.HealthCheck.too_slow, | |
|
28 | ] | |
|
29 | ) | |
|
30 | @hypothesis.given( | |
|
31 | original=strategies.sampled_from(random_input_data()), | |
|
32 | level=strategies.integers(min_value=1, max_value=5), | |
|
33 | streaming=strategies.booleans(), | |
|
34 | source_read_size=strategies.integers(1, 1048576), | |
|
35 | read_sizes=strategies.data(), | |
|
36 | ) | |
|
37 | def test_stream_source_read_variance( | |
|
38 | self, original, level, streaming, source_read_size, read_sizes | |
|
39 | ): | |
|
32 | 40 | cctx = zstd.ZstdCompressor(level=level) |
|
33 | 41 | |
|
34 | 42 | if streaming: |
@@ -53,18 +61,22 class TestDecompressor_stream_reader_fuz | |||
|
53 | 61 | |
|
54 | 62 | chunks.append(chunk) |
|
55 | 63 | |
|
56 |
self.assertEqual(b |
|
|
64 | self.assertEqual(b"".join(chunks), original) | |
|
57 | 65 | |
|
58 | 66 | # Similar to above except we have a constant read() size. |
|
59 | 67 | @hypothesis.settings( |
|
60 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
61 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
62 | level=strategies.integers(min_value=1, max_value=5), | |
|
63 | streaming=strategies.booleans(), | |
|
64 | source_read_size=strategies.integers(1, 1048576), | |
|
65 | read_size=strategies.integers(-1, 131072)) | |
|
66 | def test_stream_source_read_size(self, original, level, streaming, | |
|
67 | source_read_size, read_size): | |
|
68 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
69 | ) | |
|
70 | @hypothesis.given( | |
|
71 | original=strategies.sampled_from(random_input_data()), | |
|
72 | level=strategies.integers(min_value=1, max_value=5), | |
|
73 | streaming=strategies.booleans(), | |
|
74 | source_read_size=strategies.integers(1, 1048576), | |
|
75 | read_size=strategies.integers(-1, 131072), | |
|
76 | ) | |
|
77 | def test_stream_source_read_size( | |
|
78 | self, original, level, streaming, source_read_size, read_size | |
|
79 | ): | |
|
68 | 80 | if read_size == 0: |
|
69 | 81 | read_size = 1 |
|
70 | 82 | |
@@ -91,17 +103,24 class TestDecompressor_stream_reader_fuz | |||
|
91 | 103 | |
|
92 | 104 | chunks.append(chunk) |
|
93 | 105 | |
|
94 |
self.assertEqual(b |
|
|
106 | self.assertEqual(b"".join(chunks), original) | |
|
95 | 107 | |
|
96 | 108 | @hypothesis.settings( |
|
97 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
98 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
99 | level=strategies.integers(min_value=1, max_value=5), | |
|
100 | streaming=strategies.booleans(), | |
|
101 | source_read_size=strategies.integers(1, 1048576), | |
|
102 | read_sizes=strategies.data()) | |
|
103 | def test_buffer_source_read_variance(self, original, level, streaming, | |
|
104 | source_read_size, read_sizes): | |
|
109 | suppress_health_check=[ | |
|
110 | hypothesis.HealthCheck.large_base_example, | |
|
111 | hypothesis.HealthCheck.too_slow, | |
|
112 | ] | |
|
113 | ) | |
|
114 | @hypothesis.given( | |
|
115 | original=strategies.sampled_from(random_input_data()), | |
|
116 | level=strategies.integers(min_value=1, max_value=5), | |
|
117 | streaming=strategies.booleans(), | |
|
118 | source_read_size=strategies.integers(1, 1048576), | |
|
119 | read_sizes=strategies.data(), | |
|
120 | ) | |
|
121 | def test_buffer_source_read_variance( | |
|
122 | self, original, level, streaming, source_read_size, read_sizes | |
|
123 | ): | |
|
105 | 124 | cctx = zstd.ZstdCompressor(level=level) |
|
106 | 125 | |
|
107 | 126 | if streaming: |
@@ -125,18 +144,22 class TestDecompressor_stream_reader_fuz | |||
|
125 | 144 | |
|
126 | 145 | chunks.append(chunk) |
|
127 | 146 | |
|
128 |
self.assertEqual(b |
|
|
147 | self.assertEqual(b"".join(chunks), original) | |
|
129 | 148 | |
|
130 | 149 | # Similar to above except we have a constant read() size. |
|
131 | 150 | @hypothesis.settings( |
|
132 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
133 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
134 | level=strategies.integers(min_value=1, max_value=5), | |
|
135 | streaming=strategies.booleans(), | |
|
136 | source_read_size=strategies.integers(1, 1048576), | |
|
137 | read_size=strategies.integers(-1, 131072)) | |
|
138 | def test_buffer_source_constant_read_size(self, original, level, streaming, | |
|
139 | source_read_size, read_size): | |
|
151 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
152 | ) | |
|
153 | @hypothesis.given( | |
|
154 | original=strategies.sampled_from(random_input_data()), | |
|
155 | level=strategies.integers(min_value=1, max_value=5), | |
|
156 | streaming=strategies.booleans(), | |
|
157 | source_read_size=strategies.integers(1, 1048576), | |
|
158 | read_size=strategies.integers(-1, 131072), | |
|
159 | ) | |
|
160 | def test_buffer_source_constant_read_size( | |
|
161 | self, original, level, streaming, source_read_size, read_size | |
|
162 | ): | |
|
140 | 163 | if read_size == 0: |
|
141 | 164 | read_size = -1 |
|
142 | 165 | |
@@ -162,16 +185,18 class TestDecompressor_stream_reader_fuz | |||
|
162 | 185 | |
|
163 | 186 | chunks.append(chunk) |
|
164 | 187 | |
|
165 |
self.assertEqual(b |
|
|
188 | self.assertEqual(b"".join(chunks), original) | |
|
166 | 189 | |
|
167 | 190 | @hypothesis.settings( |
|
168 |
suppress_health_check=[hypothesis.HealthCheck.large_base_example] |
|
|
169 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
170 | level=strategies.integers(min_value=1, max_value=5), | |
|
171 | streaming=strategies.booleans(), | |
|
172 | source_read_size=strategies.integers(1, 1048576)) | |
|
173 | def test_stream_source_readall(self, original, level, streaming, | |
|
174 | source_read_size): | |
|
191 | suppress_health_check=[hypothesis.HealthCheck.large_base_example] | |
|
192 | ) | |
|
193 | @hypothesis.given( | |
|
194 | original=strategies.sampled_from(random_input_data()), | |
|
195 | level=strategies.integers(min_value=1, max_value=5), | |
|
196 | streaming=strategies.booleans(), | |
|
197 | source_read_size=strategies.integers(1, 1048576), | |
|
198 | ) | |
|
199 | def test_stream_source_readall(self, original, level, streaming, source_read_size): | |
|
175 | 200 | cctx = zstd.ZstdCompressor(level=level) |
|
176 | 201 | |
|
177 | 202 | if streaming: |
@@ -190,14 +215,21 class TestDecompressor_stream_reader_fuz | |||
|
190 | 215 | self.assertEqual(data, original) |
|
191 | 216 | |
|
192 | 217 | @hypothesis.settings( |
|
193 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
194 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
195 | level=strategies.integers(min_value=1, max_value=5), | |
|
196 | streaming=strategies.booleans(), | |
|
197 | source_read_size=strategies.integers(1, 1048576), | |
|
198 | read_sizes=strategies.data()) | |
|
199 | def test_stream_source_read1_variance(self, original, level, streaming, | |
|
200 | source_read_size, read_sizes): | |
|
218 | suppress_health_check=[ | |
|
219 | hypothesis.HealthCheck.large_base_example, | |
|
220 | hypothesis.HealthCheck.too_slow, | |
|
221 | ] | |
|
222 | ) | |
|
223 | @hypothesis.given( | |
|
224 | original=strategies.sampled_from(random_input_data()), | |
|
225 | level=strategies.integers(min_value=1, max_value=5), | |
|
226 | streaming=strategies.booleans(), | |
|
227 | source_read_size=strategies.integers(1, 1048576), | |
|
228 | read_sizes=strategies.data(), | |
|
229 | ) | |
|
230 | def test_stream_source_read1_variance( | |
|
231 | self, original, level, streaming, source_read_size, read_sizes | |
|
232 | ): | |
|
201 | 233 | cctx = zstd.ZstdCompressor(level=level) |
|
202 | 234 | |
|
203 | 235 | if streaming: |
@@ -222,17 +254,24 class TestDecompressor_stream_reader_fuz | |||
|
222 | 254 | |
|
223 | 255 | chunks.append(chunk) |
|
224 | 256 | |
|
225 |
self.assertEqual(b |
|
|
257 | self.assertEqual(b"".join(chunks), original) | |
|
226 | 258 | |
|
227 | 259 | @hypothesis.settings( |
|
228 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
229 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
230 | level=strategies.integers(min_value=1, max_value=5), | |
|
231 | streaming=strategies.booleans(), | |
|
232 | source_read_size=strategies.integers(1, 1048576), | |
|
233 | read_sizes=strategies.data()) | |
|
234 | def test_stream_source_readinto1_variance(self, original, level, streaming, | |
|
235 | source_read_size, read_sizes): | |
|
260 | suppress_health_check=[ | |
|
261 | hypothesis.HealthCheck.large_base_example, | |
|
262 | hypothesis.HealthCheck.too_slow, | |
|
263 | ] | |
|
264 | ) | |
|
265 | @hypothesis.given( | |
|
266 | original=strategies.sampled_from(random_input_data()), | |
|
267 | level=strategies.integers(min_value=1, max_value=5), | |
|
268 | streaming=strategies.booleans(), | |
|
269 | source_read_size=strategies.integers(1, 1048576), | |
|
270 | read_sizes=strategies.data(), | |
|
271 | ) | |
|
272 | def test_stream_source_readinto1_variance( | |
|
273 | self, original, level, streaming, source_read_size, read_sizes | |
|
274 | ): | |
|
236 | 275 | cctx = zstd.ZstdCompressor(level=level) |
|
237 | 276 | |
|
238 | 277 | if streaming: |
@@ -259,18 +298,24 class TestDecompressor_stream_reader_fuz | |||
|
259 | 298 | |
|
260 | 299 | chunks.append(bytes(b[0:count])) |
|
261 | 300 | |
|
262 |
self.assertEqual(b |
|
|
301 | self.assertEqual(b"".join(chunks), original) | |
|
263 | 302 | |
|
264 | 303 | @hypothesis.settings( |
|
265 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
304 | suppress_health_check=[ | |
|
305 | hypothesis.HealthCheck.large_base_example, | |
|
306 | hypothesis.HealthCheck.too_slow, | |
|
307 | ] | |
|
308 | ) | |
|
266 | 309 | @hypothesis.given( |
|
267 | 310 | original=strategies.sampled_from(random_input_data()), |
|
268 | 311 | level=strategies.integers(min_value=1, max_value=5), |
|
269 | 312 | source_read_size=strategies.integers(1, 1048576), |
|
270 | 313 | seek_amounts=strategies.data(), |
|
271 |
read_sizes=strategies.data() |
|
|
272 | def test_relative_seeks(self, original, level, source_read_size, seek_amounts, | |
|
273 | read_sizes): | |
|
314 | read_sizes=strategies.data(), | |
|
315 | ) | |
|
316 | def test_relative_seeks( | |
|
317 | self, original, level, source_read_size, seek_amounts, read_sizes | |
|
318 | ): | |
|
274 | 319 | cctx = zstd.ZstdCompressor(level=level) |
|
275 | 320 | frame = cctx.compress(original) |
|
276 | 321 | |
@@ -288,18 +333,24 class TestDecompressor_stream_reader_fuz | |||
|
288 | 333 | if not chunk: |
|
289 | 334 | break |
|
290 | 335 | |
|
291 | self.assertEqual(original[offset:offset + len(chunk)], chunk) | |
|
336 | self.assertEqual(original[offset : offset + len(chunk)], chunk) | |
|
292 | 337 | |
|
293 | 338 | @hypothesis.settings( |
|
294 | suppress_health_check=[hypothesis.HealthCheck.large_base_example]) | |
|
339 | suppress_health_check=[ | |
|
340 | hypothesis.HealthCheck.large_base_example, | |
|
341 | hypothesis.HealthCheck.too_slow, | |
|
342 | ] | |
|
343 | ) | |
|
295 | 344 | @hypothesis.given( |
|
296 | 345 | originals=strategies.data(), |
|
297 | 346 | frame_count=strategies.integers(min_value=2, max_value=10), |
|
298 | 347 | level=strategies.integers(min_value=1, max_value=5), |
|
299 | 348 | source_read_size=strategies.integers(1, 1048576), |
|
300 |
read_sizes=strategies.data() |
|
|
301 | def test_multiple_frames(self, originals, frame_count, level, | |
|
302 | source_read_size, read_sizes): | |
|
349 | read_sizes=strategies.data(), | |
|
350 | ) | |
|
351 | def test_multiple_frames( | |
|
352 | self, originals, frame_count, level, source_read_size, read_sizes | |
|
353 | ): | |
|
303 | 354 | |
|
304 | 355 | cctx = zstd.ZstdCompressor(level=level) |
|
305 | 356 | source = io.BytesIO() |
@@ -314,8 +365,9 class TestDecompressor_stream_reader_fuz | |||
|
314 | 365 | |
|
315 | 366 | dctx = zstd.ZstdDecompressor() |
|
316 | 367 | buffer.seek(0) |
|
317 |
reader = dctx.stream_reader( |
|
|
318 | read_across_frames=True) | |
|
368 | reader = dctx.stream_reader( | |
|
369 | buffer, read_size=source_read_size, read_across_frames=True | |
|
370 | ) | |
|
319 | 371 | |
|
320 | 372 | chunks = [] |
|
321 | 373 | |
@@ -328,16 +380,24 class TestDecompressor_stream_reader_fuz | |||
|
328 | 380 | |
|
329 | 381 | chunks.append(chunk) |
|
330 | 382 | |
|
331 |
self.assertEqual(source.getvalue(), b |
|
|
383 | self.assertEqual(source.getvalue(), b"".join(chunks)) | |
|
332 | 384 | |
|
333 | 385 | |
|
334 |
@unittest.skipUnless( |
|
|
386 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
335 | 387 | @make_cffi |
|
336 |
class TestDecompressor_stream_writer_fuzzing( |
|
|
337 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
338 | level=strategies.integers(min_value=1, max_value=5), | |
|
339 | write_size=strategies.integers(min_value=1, max_value=8192), | |
|
340 | input_sizes=strategies.data()) | |
|
388 | class TestDecompressor_stream_writer_fuzzing(TestCase): | |
|
389 | @hypothesis.settings( | |
|
390 | suppress_health_check=[ | |
|
391 | hypothesis.HealthCheck.large_base_example, | |
|
392 | hypothesis.HealthCheck.too_slow, | |
|
393 | ] | |
|
394 | ) | |
|
395 | @hypothesis.given( | |
|
396 | original=strategies.sampled_from(random_input_data()), | |
|
397 | level=strategies.integers(min_value=1, max_value=5), | |
|
398 | write_size=strategies.integers(min_value=1, max_value=8192), | |
|
399 | input_sizes=strategies.data(), | |
|
400 | ) | |
|
341 | 401 | def test_write_size_variance(self, original, level, write_size, input_sizes): |
|
342 | 402 | cctx = zstd.ZstdCompressor(level=level) |
|
343 | 403 | frame = cctx.compress(original) |
@@ -358,13 +418,21 class TestDecompressor_stream_writer_fuz | |||
|
358 | 418 | self.assertEqual(dest.getvalue(), original) |
|
359 | 419 | |
|
360 | 420 | |
|
361 |
@unittest.skipUnless( |
|
|
421 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
362 | 422 | @make_cffi |
|
363 |
class TestDecompressor_copy_stream_fuzzing( |
|
|
364 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
365 | level=strategies.integers(min_value=1, max_value=5), | |
|
366 | read_size=strategies.integers(min_value=1, max_value=8192), | |
|
367 | write_size=strategies.integers(min_value=1, max_value=8192)) | |
|
423 | class TestDecompressor_copy_stream_fuzzing(TestCase): | |
|
424 | @hypothesis.settings( | |
|
425 | suppress_health_check=[ | |
|
426 | hypothesis.HealthCheck.large_base_example, | |
|
427 | hypothesis.HealthCheck.too_slow, | |
|
428 | ] | |
|
429 | ) | |
|
430 | @hypothesis.given( | |
|
431 | original=strategies.sampled_from(random_input_data()), | |
|
432 | level=strategies.integers(min_value=1, max_value=5), | |
|
433 | read_size=strategies.integers(min_value=1, max_value=8192), | |
|
434 | write_size=strategies.integers(min_value=1, max_value=8192), | |
|
435 | ) | |
|
368 | 436 | def test_read_write_size_variance(self, original, level, read_size, write_size): |
|
369 | 437 | cctx = zstd.ZstdCompressor(level=level) |
|
370 | 438 | frame = cctx.compress(original) |
@@ -378,12 +446,20 class TestDecompressor_copy_stream_fuzzi | |||
|
378 | 446 | self.assertEqual(dest.getvalue(), original) |
|
379 | 447 | |
|
380 | 448 | |
|
381 |
@unittest.skipUnless( |
|
|
449 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
382 | 450 | @make_cffi |
|
383 |
class TestDecompressor_decompressobj_fuzzing( |
|
|
384 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
385 | level=strategies.integers(min_value=1, max_value=5), | |
|
386 | chunk_sizes=strategies.data()) | |
|
451 | class TestDecompressor_decompressobj_fuzzing(TestCase): | |
|
452 | @hypothesis.settings( | |
|
453 | suppress_health_check=[ | |
|
454 | hypothesis.HealthCheck.large_base_example, | |
|
455 | hypothesis.HealthCheck.too_slow, | |
|
456 | ] | |
|
457 | ) | |
|
458 | @hypothesis.given( | |
|
459 | original=strategies.sampled_from(random_input_data()), | |
|
460 | level=strategies.integers(min_value=1, max_value=5), | |
|
461 | chunk_sizes=strategies.data(), | |
|
462 | ) | |
|
387 | 463 | def test_random_input_sizes(self, original, level, chunk_sizes): |
|
388 | 464 | cctx = zstd.ZstdCompressor(level=level) |
|
389 | 465 | frame = cctx.compress(original) |
@@ -402,13 +478,22 class TestDecompressor_decompressobj_fuz | |||
|
402 | 478 | |
|
403 | 479 | chunks.append(dobj.decompress(chunk)) |
|
404 | 480 | |
|
405 |
self.assertEqual(b |
|
|
481 | self.assertEqual(b"".join(chunks), original) | |
|
406 | 482 | |
|
407 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
408 | level=strategies.integers(min_value=1, max_value=5), | |
|
409 | write_size=strategies.integers(min_value=1, | |
|
410 | max_value=4 * zstd.DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE), | |
|
411 | chunk_sizes=strategies.data()) | |
|
483 | @hypothesis.settings( | |
|
484 | suppress_health_check=[ | |
|
485 | hypothesis.HealthCheck.large_base_example, | |
|
486 | hypothesis.HealthCheck.too_slow, | |
|
487 | ] | |
|
488 | ) | |
|
489 | @hypothesis.given( | |
|
490 | original=strategies.sampled_from(random_input_data()), | |
|
491 | level=strategies.integers(min_value=1, max_value=5), | |
|
492 | write_size=strategies.integers( | |
|
493 | min_value=1, max_value=4 * zstd.DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE | |
|
494 | ), | |
|
495 | chunk_sizes=strategies.data(), | |
|
496 | ) | |
|
412 | 497 | def test_random_output_sizes(self, original, level, write_size, chunk_sizes): |
|
413 | 498 | cctx = zstd.ZstdCompressor(level=level) |
|
414 | 499 | frame = cctx.compress(original) |
@@ -427,16 +512,18 class TestDecompressor_decompressobj_fuz | |||
|
427 | 512 | |
|
428 | 513 | chunks.append(dobj.decompress(chunk)) |
|
429 | 514 | |
|
430 |
self.assertEqual(b |
|
|
515 | self.assertEqual(b"".join(chunks), original) | |
|
431 | 516 | |
|
432 | 517 | |
|
433 |
@unittest.skipUnless( |
|
|
518 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
434 | 519 | @make_cffi |
|
435 |
class TestDecompressor_read_to_iter_fuzzing( |
|
|
436 | @hypothesis.given(original=strategies.sampled_from(random_input_data()), | |
|
437 | level=strategies.integers(min_value=1, max_value=5), | |
|
438 |
|
|
|
439 |
|
|
|
520 | class TestDecompressor_read_to_iter_fuzzing(TestCase): | |
|
521 | @hypothesis.given( | |
|
522 | original=strategies.sampled_from(random_input_data()), | |
|
523 | level=strategies.integers(min_value=1, max_value=5), | |
|
524 | read_size=strategies.integers(min_value=1, max_value=4096), | |
|
525 | write_size=strategies.integers(min_value=1, max_value=4096), | |
|
526 | ) | |
|
440 | 527 | def test_read_write_size_variance(self, original, level, read_size, write_size): |
|
441 | 528 | cctx = zstd.ZstdCompressor(level=level) |
|
442 | 529 | frame = cctx.compress(original) |
@@ -444,29 +531,33 class TestDecompressor_read_to_iter_fuzz | |||
|
444 | 531 | source = io.BytesIO(frame) |
|
445 | 532 | |
|
446 | 533 | dctx = zstd.ZstdDecompressor() |
|
447 | chunks = list(dctx.read_to_iter(source, read_size=read_size, write_size=write_size)) | |
|
534 | chunks = list( | |
|
535 | dctx.read_to_iter(source, read_size=read_size, write_size=write_size) | |
|
536 | ) | |
|
448 | 537 | |
|
449 |
self.assertEqual(b |
|
|
538 | self.assertEqual(b"".join(chunks), original) | |
|
450 | 539 | |
|
451 | 540 | |
|
452 |
@unittest.skipUnless( |
|
|
453 |
class TestDecompressor_multi_decompress_to_buffer_fuzzing( |
|
|
454 | @hypothesis.given(original=strategies.lists(strategies.sampled_from(random_input_data()), | |
|
455 | min_size=1, max_size=1024), | |
|
456 | threads=strategies.integers(min_value=1, max_value=8), | |
|
457 | use_dict=strategies.booleans()) | |
|
541 | @unittest.skipUnless("ZSTD_SLOW_TESTS" in os.environ, "ZSTD_SLOW_TESTS not set") | |
|
542 | class TestDecompressor_multi_decompress_to_buffer_fuzzing(TestCase): | |
|
543 | @hypothesis.given( | |
|
544 | original=strategies.lists( | |
|
545 | strategies.sampled_from(random_input_data()), min_size=1, max_size=1024 | |
|
546 | ), | |
|
547 | threads=strategies.integers(min_value=1, max_value=8), | |
|
548 | use_dict=strategies.booleans(), | |
|
549 | ) | |
|
458 | 550 | def test_data_equivalence(self, original, threads, use_dict): |
|
459 | 551 | kwargs = {} |
|
460 | 552 | if use_dict: |
|
461 |
kwargs[ |
|
|
553 | kwargs["dict_data"] = zstd.ZstdCompressionDict(original[0]) | |
|
462 | 554 | |
|
463 |
cctx = zstd.ZstdCompressor( |
|
|
464 | write_content_size=True, | |
|
465 | write_checksum=True, | |
|
466 | **kwargs) | |
|
555 | cctx = zstd.ZstdCompressor( | |
|
556 | level=1, write_content_size=True, write_checksum=True, **kwargs | |
|
557 | ) | |
|
467 | 558 | |
|
468 |
if not hasattr(cctx, |
|
|
469 |
self.skipTest( |
|
|
559 | if not hasattr(cctx, "multi_compress_to_buffer"): | |
|
560 | self.skipTest("multi_compress_to_buffer not available") | |
|
470 | 561 | |
|
471 | 562 | frames_buffer = cctx.multi_compress_to_buffer(original, threads=-1) |
|
472 | 563 |
@@ -2,14 +2,14 import unittest | |||
|
2 | 2 | |
|
3 | 3 | import zstandard as zstd |
|
4 | 4 | |
|
5 |
from . |
|
|
5 | from .common import ( | |
|
6 | 6 | make_cffi, |
|
7 | TestCase, | |
|
7 | 8 | ) |
|
8 | 9 | |
|
9 | 10 | |
|
10 | 11 | @make_cffi |
|
11 |
class TestSizes( |
|
|
12 | class TestSizes(TestCase): | |
|
12 | 13 | def test_decompression_size(self): |
|
13 | 14 | size = zstd.estimate_decompression_context_size() |
|
14 | 15 | self.assertGreater(size, 100000) |
|
15 |
@@ -4,65 +4,66 import unittest | |||
|
4 | 4 | |
|
5 | 5 | import zstandard as zstd |
|
6 | 6 | |
|
7 |
from . |
|
|
7 | from .common import ( | |
|
8 | 8 | make_cffi, |
|
9 | TestCase, | |
|
9 | 10 | ) |
|
10 | 11 | |
|
11 | 12 | |
|
12 | 13 | @make_cffi |
|
13 |
class TestModuleAttributes( |
|
|
14 | class TestModuleAttributes(TestCase): | |
|
14 | 15 | def test_version(self): |
|
15 |
self.assertEqual(zstd.ZSTD_VERSION, (1, 4, |
|
|
16 | self.assertEqual(zstd.ZSTD_VERSION, (1, 4, 4)) | |
|
16 | 17 | |
|
17 |
self.assertEqual(zstd.__version__, |
|
|
18 | self.assertEqual(zstd.__version__, "0.13.0") | |
|
18 | 19 | |
|
19 | 20 | def test_constants(self): |
|
20 | 21 | self.assertEqual(zstd.MAX_COMPRESSION_LEVEL, 22) |
|
21 |
self.assertEqual(zstd.FRAME_HEADER, b |
|
|
22 | self.assertEqual(zstd.FRAME_HEADER, b"\x28\xb5\x2f\xfd") | |
|
22 | 23 | |
|
23 | 24 | def test_hasattr(self): |
|
24 | 25 | attrs = ( |
|
25 |
|
|
|
26 |
|
|
|
27 |
|
|
|
28 |
|
|
|
29 |
|
|
|
30 |
|
|
|
31 |
|
|
|
32 |
|
|
|
33 |
|
|
|
34 |
|
|
|
35 |
|
|
|
36 |
|
|
|
37 |
|
|
|
38 |
|
|
|
39 |
|
|
|
40 |
|
|
|
41 |
|
|
|
42 |
|
|
|
43 |
|
|
|
44 |
|
|
|
45 |
|
|
|
46 |
|
|
|
47 |
|
|
|
48 |
|
|
|
49 |
|
|
|
50 |
|
|
|
51 |
|
|
|
52 |
|
|
|
53 |
|
|
|
54 |
|
|
|
55 |
|
|
|
56 |
|
|
|
57 |
|
|
|
58 |
|
|
|
59 |
|
|
|
60 |
|
|
|
61 |
|
|
|
62 |
|
|
|
63 |
|
|
|
64 |
|
|
|
65 |
|
|
|
26 | "CONTENTSIZE_UNKNOWN", | |
|
27 | "CONTENTSIZE_ERROR", | |
|
28 | "COMPRESSION_RECOMMENDED_INPUT_SIZE", | |
|
29 | "COMPRESSION_RECOMMENDED_OUTPUT_SIZE", | |
|
30 | "DECOMPRESSION_RECOMMENDED_INPUT_SIZE", | |
|
31 | "DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE", | |
|
32 | "MAGIC_NUMBER", | |
|
33 | "FLUSH_BLOCK", | |
|
34 | "FLUSH_FRAME", | |
|
35 | "BLOCKSIZELOG_MAX", | |
|
36 | "BLOCKSIZE_MAX", | |
|
37 | "WINDOWLOG_MIN", | |
|
38 | "WINDOWLOG_MAX", | |
|
39 | "CHAINLOG_MIN", | |
|
40 | "CHAINLOG_MAX", | |
|
41 | "HASHLOG_MIN", | |
|
42 | "HASHLOG_MAX", | |
|
43 | "HASHLOG3_MAX", | |
|
44 | "MINMATCH_MIN", | |
|
45 | "MINMATCH_MAX", | |
|
46 | "SEARCHLOG_MIN", | |
|
47 | "SEARCHLOG_MAX", | |
|
48 | "SEARCHLENGTH_MIN", | |
|
49 | "SEARCHLENGTH_MAX", | |
|
50 | "TARGETLENGTH_MIN", | |
|
51 | "TARGETLENGTH_MAX", | |
|
52 | "LDM_MINMATCH_MIN", | |
|
53 | "LDM_MINMATCH_MAX", | |
|
54 | "LDM_BUCKETSIZELOG_MAX", | |
|
55 | "STRATEGY_FAST", | |
|
56 | "STRATEGY_DFAST", | |
|
57 | "STRATEGY_GREEDY", | |
|
58 | "STRATEGY_LAZY", | |
|
59 | "STRATEGY_LAZY2", | |
|
60 | "STRATEGY_BTLAZY2", | |
|
61 | "STRATEGY_BTOPT", | |
|
62 | "STRATEGY_BTULTRA", | |
|
63 | "STRATEGY_BTULTRA2", | |
|
64 | "DICT_TYPE_AUTO", | |
|
65 | "DICT_TYPE_RAWCONTENT", | |
|
66 | "DICT_TYPE_FULLDICT", | |
|
66 | 67 | ) |
|
67 | 68 | |
|
68 | 69 | for a in attrs: |
@@ -4,10 +4,11 import unittest | |||
|
4 | 4 | |
|
5 | 5 | import zstandard as zstd |
|
6 | 6 | |
|
7 |
from . |
|
|
7 | from .common import ( | |
|
8 | 8 | generate_samples, |
|
9 | 9 | make_cffi, |
|
10 | 10 | random_input_data, |
|
11 | TestCase, | |
|
11 | 12 | ) |
|
12 | 13 | |
|
13 | 14 | if sys.version_info[0] >= 3: |
@@ -17,24 +18,24 else: | |||
|
17 | 18 | |
|
18 | 19 | |
|
19 | 20 | @make_cffi |
|
20 |
class TestTrainDictionary( |
|
|
21 | class TestTrainDictionary(TestCase): | |
|
21 | 22 | def test_no_args(self): |
|
22 | 23 | with self.assertRaises(TypeError): |
|
23 | 24 | zstd.train_dictionary() |
|
24 | 25 | |
|
25 | 26 | def test_bad_args(self): |
|
26 | 27 | with self.assertRaises(TypeError): |
|
27 |
zstd.train_dictionary(8192, u |
|
|
28 | zstd.train_dictionary(8192, u"foo") | |
|
28 | 29 | |
|
29 | 30 | with self.assertRaises(ValueError): |
|
30 |
zstd.train_dictionary(8192, [u |
|
|
31 | zstd.train_dictionary(8192, [u"foo"]) | |
|
31 | 32 | |
|
32 | 33 | def test_no_params(self): |
|
33 | 34 | d = zstd.train_dictionary(8192, random_input_data()) |
|
34 | 35 | self.assertIsInstance(d.dict_id(), int_type) |
|
35 | 36 | |
|
36 | 37 | # The dictionary ID may be different across platforms. |
|
37 |
expected = b |
|
|
38 | expected = b"\x37\xa4\x30\xec" + struct.pack("<I", d.dict_id()) | |
|
38 | 39 | |
|
39 | 40 | data = d.as_bytes() |
|
40 | 41 | self.assertEqual(data[0:8], expected) |
@@ -44,46 +45,48 class TestTrainDictionary(unittest.TestC | |||
|
44 | 45 | self.assertIsInstance(d.dict_id(), int_type) |
|
45 | 46 | |
|
46 | 47 | data = d.as_bytes() |
|
47 |
self.assertEqual(data[0:4], b |
|
|
48 | self.assertEqual(data[0:4], b"\x37\xa4\x30\xec") | |
|
48 | 49 | |
|
49 | 50 | self.assertEqual(d.k, 64) |
|
50 | 51 | self.assertEqual(d.d, 16) |
|
51 | 52 | |
|
52 | 53 | def test_set_dict_id(self): |
|
53 | d = zstd.train_dictionary(8192, generate_samples(), k=64, d=16, | |
|
54 | dict_id=42) | |
|
54 | d = zstd.train_dictionary(8192, generate_samples(), k=64, d=16, dict_id=42) | |
|
55 | 55 | self.assertEqual(d.dict_id(), 42) |
|
56 | 56 | |
|
57 | 57 | def test_optimize(self): |
|
58 | d = zstd.train_dictionary(8192, generate_samples(), threads=-1, steps=1, | |
|
59 | d=16) | |
|
58 | d = zstd.train_dictionary(8192, generate_samples(), threads=-1, steps=1, d=16) | |
|
60 | 59 | |
|
61 | 60 | # This varies by platform. |
|
62 | 61 | self.assertIn(d.k, (50, 2000)) |
|
63 | 62 | self.assertEqual(d.d, 16) |
|
64 | 63 | |
|
64 | ||
|
65 | 65 | @make_cffi |
|
66 |
class TestCompressionDict( |
|
|
66 | class TestCompressionDict(TestCase): | |
|
67 | 67 | def test_bad_mode(self): |
|
68 |
with self.assertRaisesRegex |
|
|
69 |
zstd.ZstdCompressionDict(b |
|
|
68 | with self.assertRaisesRegex(ValueError, "invalid dictionary load mode"): | |
|
69 | zstd.ZstdCompressionDict(b"foo", dict_type=42) | |
|
70 | 70 | |
|
71 | 71 | def test_bad_precompute_compress(self): |
|
72 | 72 | d = zstd.train_dictionary(8192, generate_samples(), k=64, d=16) |
|
73 | 73 | |
|
74 |
with self.assertRaisesRegex |
|
|
74 | with self.assertRaisesRegex(ValueError, "must specify one of level or "): | |
|
75 | 75 | d.precompute_compress() |
|
76 | 76 | |
|
77 |
with self.assertRaisesRegex |
|
|
78 |
d.precompute_compress( |
|
|
79 |
|
|
|
77 | with self.assertRaisesRegex(ValueError, "must only specify one of level or "): | |
|
78 | d.precompute_compress( | |
|
79 | level=3, compression_params=zstd.CompressionParameters() | |
|
80 | ) | |
|
80 | 81 | |
|
81 | 82 | def test_precompute_compress_rawcontent(self): |
|
82 |
d = zstd.ZstdCompressionDict( |
|
|
83 |
|
|
|
83 | d = zstd.ZstdCompressionDict( | |
|
84 | b"dictcontent" * 64, dict_type=zstd.DICT_TYPE_RAWCONTENT | |
|
85 | ) | |
|
84 | 86 | d.precompute_compress(level=1) |
|
85 | 87 | |
|
86 |
d = zstd.ZstdCompressionDict( |
|
|
87 |
|
|
|
88 | with self.assertRaisesRegexp(zstd.ZstdError, 'unable to precompute dictionary'): | |
|
88 | d = zstd.ZstdCompressionDict( | |
|
89 | b"dictcontent" * 64, dict_type=zstd.DICT_TYPE_FULLDICT | |
|
90 | ) | |
|
91 | with self.assertRaisesRegex(zstd.ZstdError, "unable to precompute dictionary"): | |
|
89 | 92 | d.precompute_compress(level=1) |
@@ -28,38 +28,48 import platform | |||
|
28 | 28 | # defining a variable and `setup.py` could write the file with whatever |
|
29 | 29 | # policy was specified at build time. Until someone needs it, we go with |
|
30 | 30 | # the hacky but simple environment variable approach. |
|
31 |
_module_policy = os.environ.get( |
|
|
31 | _module_policy = os.environ.get("PYTHON_ZSTANDARD_IMPORT_POLICY", "default") | |
|
32 | 32 | |
|
33 |
if _module_policy == |
|
|
34 |
if platform.python_implementation() in ( |
|
|
33 | if _module_policy == "default": | |
|
34 | if platform.python_implementation() in ("CPython",): | |
|
35 | 35 | from zstd import * |
|
36 | backend = 'cext' | |
|
37 | elif platform.python_implementation() in ('PyPy',): | |
|
36 | ||
|
37 | backend = "cext" | |
|
38 | elif platform.python_implementation() in ("PyPy",): | |
|
38 | 39 | from .cffi import * |
|
39 | backend = 'cffi' | |
|
40 | ||
|
41 | backend = "cffi" | |
|
40 | 42 | else: |
|
41 | 43 | try: |
|
42 | 44 | from zstd import * |
|
43 | backend = 'cext' | |
|
45 | ||
|
46 | backend = "cext" | |
|
44 | 47 | except ImportError: |
|
45 | 48 | from .cffi import * |
|
46 | backend = 'cffi' | |
|
47 | elif _module_policy == 'cffi_fallback': | |
|
49 | ||
|
50 | backend = "cffi" | |
|
51 | elif _module_policy == "cffi_fallback": | |
|
48 | 52 | try: |
|
49 | 53 | from zstd import * |
|
50 | backend = 'cext' | |
|
54 | ||
|
55 | backend = "cext" | |
|
51 | 56 | except ImportError: |
|
52 | 57 | from .cffi import * |
|
53 | backend = 'cffi' | |
|
54 | elif _module_policy == 'cext': | |
|
58 | ||
|
59 | backend = "cffi" | |
|
60 | elif _module_policy == "cext": | |
|
55 | 61 | from zstd import * |
|
56 | backend = 'cext' | |
|
57 | elif _module_policy == 'cffi': | |
|
62 | ||
|
63 | backend = "cext" | |
|
64 | elif _module_policy == "cffi": | |
|
58 | 65 | from .cffi import * |
|
59 | backend = 'cffi' | |
|
66 | ||
|
67 | backend = "cffi" | |
|
60 | 68 | else: |
|
61 | raise ImportError('unknown module import policy: %s; use default, cffi_fallback, ' | |
|
62 | 'cext, or cffi' % _module_policy) | |
|
69 | raise ImportError( | |
|
70 | "unknown module import policy: %s; use default, cffi_fallback, " | |
|
71 | "cext, or cffi" % _module_policy | |
|
72 | ) | |
|
63 | 73 | |
|
64 | 74 | # Keep this in sync with python-zstandard.h. |
|
65 |
__version__ = |
|
|
75 | __version__ = "0.13.0" |
This diff has been collapsed as it changes many lines, (1196 lines changed) Show them Hide them | |||
@@ -14,68 +14,67 from __future__ import absolute_import, | |||
|
14 | 14 | #'BufferSegments', |
|
15 | 15 | #'BufferWithSegments', |
|
16 | 16 | #'BufferWithSegmentsCollection', |
|
17 |
|
|
|
18 |
|
|
|
19 |
|
|
|
20 |
|
|
|
21 |
|
|
|
22 |
|
|
|
23 |
|
|
|
24 |
|
|
|
25 |
|
|
|
26 |
|
|
|
27 |
|
|
|
28 |
|
|
|
29 | ||
|
17 | "CompressionParameters", | |
|
18 | "ZstdCompressionDict", | |
|
19 | "ZstdCompressionParameters", | |
|
20 | "ZstdCompressor", | |
|
21 | "ZstdError", | |
|
22 | "ZstdDecompressor", | |
|
23 | "FrameParameters", | |
|
24 | "estimate_decompression_context_size", | |
|
25 | "frame_content_size", | |
|
26 | "frame_header_size", | |
|
27 | "get_frame_parameters", | |
|
28 | "train_dictionary", | |
|
30 | 29 | # Constants. |
|
31 |
|
|
|
32 |
|
|
|
33 |
|
|
|
34 |
|
|
|
35 |
|
|
|
36 |
|
|
|
37 |
|
|
|
38 |
|
|
|
39 |
|
|
|
40 |
|
|
|
41 |
|
|
|
42 |
|
|
|
43 |
|
|
|
44 |
|
|
|
45 |
|
|
|
46 |
|
|
|
47 |
|
|
|
48 |
|
|
|
49 |
|
|
|
50 |
|
|
|
51 |
|
|
|
52 |
|
|
|
53 |
|
|
|
54 |
|
|
|
55 |
|
|
|
56 |
|
|
|
57 |
|
|
|
58 |
|
|
|
59 |
|
|
|
60 |
|
|
|
61 |
|
|
|
62 |
|
|
|
63 |
|
|
|
64 |
|
|
|
65 |
|
|
|
66 |
|
|
|
67 |
|
|
|
68 |
|
|
|
69 |
|
|
|
70 |
|
|
|
71 |
|
|
|
72 |
|
|
|
73 |
|
|
|
74 |
|
|
|
75 |
|
|
|
76 |
|
|
|
77 |
|
|
|
78 |
|
|
|
30 | "FLUSH_BLOCK", | |
|
31 | "FLUSH_FRAME", | |
|
32 | "COMPRESSOBJ_FLUSH_FINISH", | |
|
33 | "COMPRESSOBJ_FLUSH_BLOCK", | |
|
34 | "ZSTD_VERSION", | |
|
35 | "FRAME_HEADER", | |
|
36 | "CONTENTSIZE_UNKNOWN", | |
|
37 | "CONTENTSIZE_ERROR", | |
|
38 | "MAX_COMPRESSION_LEVEL", | |
|
39 | "COMPRESSION_RECOMMENDED_INPUT_SIZE", | |
|
40 | "COMPRESSION_RECOMMENDED_OUTPUT_SIZE", | |
|
41 | "DECOMPRESSION_RECOMMENDED_INPUT_SIZE", | |
|
42 | "DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE", | |
|
43 | "MAGIC_NUMBER", | |
|
44 | "BLOCKSIZELOG_MAX", | |
|
45 | "BLOCKSIZE_MAX", | |
|
46 | "WINDOWLOG_MIN", | |
|
47 | "WINDOWLOG_MAX", | |
|
48 | "CHAINLOG_MIN", | |
|
49 | "CHAINLOG_MAX", | |
|
50 | "HASHLOG_MIN", | |
|
51 | "HASHLOG_MAX", | |
|
52 | "HASHLOG3_MAX", | |
|
53 | "MINMATCH_MIN", | |
|
54 | "MINMATCH_MAX", | |
|
55 | "SEARCHLOG_MIN", | |
|
56 | "SEARCHLOG_MAX", | |
|
57 | "SEARCHLENGTH_MIN", | |
|
58 | "SEARCHLENGTH_MAX", | |
|
59 | "TARGETLENGTH_MIN", | |
|
60 | "TARGETLENGTH_MAX", | |
|
61 | "LDM_MINMATCH_MIN", | |
|
62 | "LDM_MINMATCH_MAX", | |
|
63 | "LDM_BUCKETSIZELOG_MAX", | |
|
64 | "STRATEGY_FAST", | |
|
65 | "STRATEGY_DFAST", | |
|
66 | "STRATEGY_GREEDY", | |
|
67 | "STRATEGY_LAZY", | |
|
68 | "STRATEGY_LAZY2", | |
|
69 | "STRATEGY_BTLAZY2", | |
|
70 | "STRATEGY_BTOPT", | |
|
71 | "STRATEGY_BTULTRA", | |
|
72 | "STRATEGY_BTULTRA2", | |
|
73 | "DICT_TYPE_AUTO", | |
|
74 | "DICT_TYPE_RAWCONTENT", | |
|
75 | "DICT_TYPE_FULLDICT", | |
|
76 | "FORMAT_ZSTD1", | |
|
77 | "FORMAT_ZSTD1_MAGICLESS", | |
|
79 | 78 | ] |
|
80 | 79 | |
|
81 | 80 | import io |
@@ -105,10 +104,14 new_nonzero = ffi.new_allocator(should_c | |||
|
105 | 104 | |
|
106 | 105 | MAX_COMPRESSION_LEVEL = lib.ZSTD_maxCLevel() |
|
107 | 106 | MAGIC_NUMBER = lib.ZSTD_MAGICNUMBER |
|
108 |
FRAME_HEADER = b |
|
|
107 | FRAME_HEADER = b"\x28\xb5\x2f\xfd" | |
|
109 | 108 | CONTENTSIZE_UNKNOWN = lib.ZSTD_CONTENTSIZE_UNKNOWN |
|
110 | 109 | CONTENTSIZE_ERROR = lib.ZSTD_CONTENTSIZE_ERROR |
|
111 | ZSTD_VERSION = (lib.ZSTD_VERSION_MAJOR, lib.ZSTD_VERSION_MINOR, lib.ZSTD_VERSION_RELEASE) | |
|
110 | ZSTD_VERSION = ( | |
|
111 | lib.ZSTD_VERSION_MAJOR, | |
|
112 | lib.ZSTD_VERSION_MINOR, | |
|
113 | lib.ZSTD_VERSION_RELEASE, | |
|
114 | ) | |
|
112 | 115 | |
|
113 | 116 | BLOCKSIZELOG_MAX = lib.ZSTD_BLOCKSIZELOG_MAX |
|
114 | 117 | BLOCKSIZE_MAX = lib.ZSTD_BLOCKSIZE_MAX |
@@ -165,9 +168,9 def _cpu_count(): | |||
|
165 | 168 | # Linux. |
|
166 | 169 | try: |
|
167 | 170 | if sys.version_info[0] == 2: |
|
168 |
return os.sysconf(b |
|
|
171 | return os.sysconf(b"SC_NPROCESSORS_ONLN") | |
|
169 | 172 | else: |
|
170 |
return os.sysconf( |
|
|
173 | return os.sysconf("SC_NPROCESSORS_ONLN") | |
|
171 | 174 | except (AttributeError, ValueError): |
|
172 | 175 | pass |
|
173 | 176 | |
@@ -183,7 +186,8 def _zstd_error(zresult): | |||
|
183 | 186 | # Resolves to bytes on Python 2 and 3. We use the string for formatting |
|
184 | 187 | # into error messages, which will be literal unicode. So convert it to |
|
185 | 188 | # unicode. |
|
186 |
return ffi.string(lib.ZSTD_getErrorName(zresult)).decode( |
|
|
189 | return ffi.string(lib.ZSTD_getErrorName(zresult)).decode("utf-8") | |
|
190 | ||
|
187 | 191 | |
|
188 | 192 | def _make_cctx_params(params): |
|
189 | 193 | res = lib.ZSTD_createCCtxParams() |
@@ -221,19 +225,20 def _make_cctx_params(params): | |||
|
221 | 225 | |
|
222 | 226 | return res |
|
223 | 227 | |
|
228 | ||
|
224 | 229 | class ZstdCompressionParameters(object): |
|
225 | 230 | @staticmethod |
|
226 | 231 | def from_level(level, source_size=0, dict_size=0, **kwargs): |
|
227 | 232 | params = lib.ZSTD_getCParams(level, source_size, dict_size) |
|
228 | 233 | |
|
229 | 234 | args = { |
|
230 |
|
|
|
231 |
|
|
|
232 |
|
|
|
233 |
|
|
|
234 |
|
|
|
235 |
|
|
|
236 |
|
|
|
235 | "window_log": "windowLog", | |
|
236 | "chain_log": "chainLog", | |
|
237 | "hash_log": "hashLog", | |
|
238 | "search_log": "searchLog", | |
|
239 | "min_match": "minMatch", | |
|
240 | "target_length": "targetLength", | |
|
241 | "compression_strategy": "strategy", | |
|
237 | 242 | } |
|
238 | 243 | |
|
239 | 244 | for arg, attr in args.items(): |
@@ -242,14 +247,33 class ZstdCompressionParameters(object): | |||
|
242 | 247 | |
|
243 | 248 | return ZstdCompressionParameters(**kwargs) |
|
244 | 249 | |
|
245 | def __init__(self, format=0, compression_level=0, window_log=0, hash_log=0, | |
|
246 | chain_log=0, search_log=0, min_match=0, target_length=0, | |
|
247 | strategy=-1, compression_strategy=-1, | |
|
248 | write_content_size=1, write_checksum=0, | |
|
249 | write_dict_id=0, job_size=0, overlap_log=-1, | |
|
250 | overlap_size_log=-1, force_max_window=0, enable_ldm=0, | |
|
251 | ldm_hash_log=0, ldm_min_match=0, ldm_bucket_size_log=0, | |
|
252 | ldm_hash_rate_log=-1, ldm_hash_every_log=-1, threads=0): | |
|
250 | def __init__( | |
|
251 | self, | |
|
252 | format=0, | |
|
253 | compression_level=0, | |
|
254 | window_log=0, | |
|
255 | hash_log=0, | |
|
256 | chain_log=0, | |
|
257 | search_log=0, | |
|
258 | min_match=0, | |
|
259 | target_length=0, | |
|
260 | strategy=-1, | |
|
261 | compression_strategy=-1, | |
|
262 | write_content_size=1, | |
|
263 | write_checksum=0, | |
|
264 | write_dict_id=0, | |
|
265 | job_size=0, | |
|
266 | overlap_log=-1, | |
|
267 | overlap_size_log=-1, | |
|
268 | force_max_window=0, | |
|
269 | enable_ldm=0, | |
|
270 | ldm_hash_log=0, | |
|
271 | ldm_min_match=0, | |
|
272 | ldm_bucket_size_log=0, | |
|
273 | ldm_hash_rate_log=-1, | |
|
274 | ldm_hash_every_log=-1, | |
|
275 | threads=0, | |
|
276 | ): | |
|
253 | 277 | |
|
254 | 278 | params = lib.ZSTD_createCCtxParams() |
|
255 | 279 | if params == ffi.NULL: |
@@ -267,7 +291,9 class ZstdCompressionParameters(object): | |||
|
267 | 291 | _set_compression_parameter(params, lib.ZSTD_c_nbWorkers, threads) |
|
268 | 292 | |
|
269 | 293 | _set_compression_parameter(params, lib.ZSTD_c_format, format) |
|
270 | _set_compression_parameter(params, lib.ZSTD_c_compressionLevel, compression_level) | |
|
294 | _set_compression_parameter( | |
|
295 | params, lib.ZSTD_c_compressionLevel, compression_level | |
|
296 | ) | |
|
271 | 297 | _set_compression_parameter(params, lib.ZSTD_c_windowLog, window_log) |
|
272 | 298 | _set_compression_parameter(params, lib.ZSTD_c_hashLog, hash_log) |
|
273 | 299 | _set_compression_parameter(params, lib.ZSTD_c_chainLog, chain_log) |
@@ -276,7 +302,7 class ZstdCompressionParameters(object): | |||
|
276 | 302 | _set_compression_parameter(params, lib.ZSTD_c_targetLength, target_length) |
|
277 | 303 | |
|
278 | 304 | if strategy != -1 and compression_strategy != -1: |
|
279 |
raise ValueError( |
|
|
305 | raise ValueError("cannot specify both compression_strategy and strategy") | |
|
280 | 306 | |
|
281 | 307 | if compression_strategy != -1: |
|
282 | 308 | strategy = compression_strategy |
@@ -284,13 +310,15 class ZstdCompressionParameters(object): | |||
|
284 | 310 | strategy = 0 |
|
285 | 311 | |
|
286 | 312 | _set_compression_parameter(params, lib.ZSTD_c_strategy, strategy) |
|
287 | _set_compression_parameter(params, lib.ZSTD_c_contentSizeFlag, write_content_size) | |
|
313 | _set_compression_parameter( | |
|
314 | params, lib.ZSTD_c_contentSizeFlag, write_content_size | |
|
315 | ) | |
|
288 | 316 | _set_compression_parameter(params, lib.ZSTD_c_checksumFlag, write_checksum) |
|
289 | 317 | _set_compression_parameter(params, lib.ZSTD_c_dictIDFlag, write_dict_id) |
|
290 | 318 | _set_compression_parameter(params, lib.ZSTD_c_jobSize, job_size) |
|
291 | 319 | |
|
292 | 320 | if overlap_log != -1 and overlap_size_log != -1: |
|
293 |
raise ValueError( |
|
|
321 | raise ValueError("cannot specify both overlap_log and overlap_size_log") | |
|
294 | 322 | |
|
295 | 323 | if overlap_size_log != -1: |
|
296 | 324 | overlap_log = overlap_size_log |
@@ -299,13 +327,19 class ZstdCompressionParameters(object): | |||
|
299 | 327 | |
|
300 | 328 | _set_compression_parameter(params, lib.ZSTD_c_overlapLog, overlap_log) |
|
301 | 329 | _set_compression_parameter(params, lib.ZSTD_c_forceMaxWindow, force_max_window) |
|
302 | _set_compression_parameter(params, lib.ZSTD_c_enableLongDistanceMatching, enable_ldm) | |
|
330 | _set_compression_parameter( | |
|
331 | params, lib.ZSTD_c_enableLongDistanceMatching, enable_ldm | |
|
332 | ) | |
|
303 | 333 | _set_compression_parameter(params, lib.ZSTD_c_ldmHashLog, ldm_hash_log) |
|
304 | 334 | _set_compression_parameter(params, lib.ZSTD_c_ldmMinMatch, ldm_min_match) |
|
305 | _set_compression_parameter(params, lib.ZSTD_c_ldmBucketSizeLog, ldm_bucket_size_log) | |
|
335 | _set_compression_parameter( | |
|
336 | params, lib.ZSTD_c_ldmBucketSizeLog, ldm_bucket_size_log | |
|
337 | ) | |
|
306 | 338 | |
|
307 | 339 | if ldm_hash_rate_log != -1 and ldm_hash_every_log != -1: |
|
308 | raise ValueError('cannot specify both ldm_hash_rate_log and ldm_hash_every_log') | |
|
340 | raise ValueError( | |
|
341 | "cannot specify both ldm_hash_rate_log and ldm_hash_every_log" | |
|
342 | ) | |
|
309 | 343 | |
|
310 | 344 | if ldm_hash_every_log != -1: |
|
311 | 345 | ldm_hash_rate_log = ldm_hash_every_log |
@@ -380,7 +414,9 class ZstdCompressionParameters(object): | |||
|
380 | 414 | |
|
381 | 415 | @property |
|
382 | 416 | def enable_ldm(self): |
|
383 |
return _get_compression_parameter( |
|
|
417 | return _get_compression_parameter( | |
|
418 | self._params, lib.ZSTD_c_enableLongDistanceMatching | |
|
419 | ) | |
|
384 | 420 | |
|
385 | 421 | @property |
|
386 | 422 | def ldm_hash_log(self): |
@@ -409,8 +445,10 class ZstdCompressionParameters(object): | |||
|
409 | 445 | def estimated_compression_context_size(self): |
|
410 | 446 | return lib.ZSTD_estimateCCtxSize_usingCCtxParams(self._params) |
|
411 | 447 | |
|
448 | ||
|
412 | 449 | CompressionParameters = ZstdCompressionParameters |
|
413 | 450 | |
|
451 | ||
|
414 | 452 | def estimate_decompression_context_size(): |
|
415 | 453 | return lib.ZSTD_estimateDCtxSize() |
|
416 | 454 | |
@@ -418,24 +456,25 def estimate_decompression_context_size( | |||
|
418 | 456 | def _set_compression_parameter(params, param, value): |
|
419 | 457 | zresult = lib.ZSTD_CCtxParams_setParameter(params, param, value) |
|
420 | 458 | if lib.ZSTD_isError(zresult): |
|
421 | raise ZstdError('unable to set compression context parameter: %s' % | |
|
422 |
|
|
|
459 | raise ZstdError( | |
|
460 | "unable to set compression context parameter: %s" % _zstd_error(zresult) | |
|
461 | ) | |
|
423 | 462 | |
|
424 | 463 | |
|
425 | 464 | def _get_compression_parameter(params, param): |
|
426 |
result = ffi.new( |
|
|
465 | result = ffi.new("int *") | |
|
427 | 466 | |
|
428 | 467 | zresult = lib.ZSTD_CCtxParams_getParameter(params, param, result) |
|
429 | 468 | if lib.ZSTD_isError(zresult): |
|
430 | raise ZstdError('unable to get compression context parameter: %s' % | |
|
431 |
|
|
|
469 | raise ZstdError( | |
|
470 | "unable to get compression context parameter: %s" % _zstd_error(zresult) | |
|
471 | ) | |
|
432 | 472 | |
|
433 | 473 | return result[0] |
|
434 | 474 | |
|
435 | 475 | |
|
436 | 476 | class ZstdCompressionWriter(object): |
|
437 | def __init__(self, compressor, writer, source_size, write_size, | |
|
438 | write_return_read): | |
|
477 | def __init__(self, compressor, writer, source_size, write_size, write_return_read): | |
|
439 | 478 | self._compressor = compressor |
|
440 | 479 | self._writer = writer |
|
441 | 480 | self._write_size = write_size |
@@ -444,24 +483,22 class ZstdCompressionWriter(object): | |||
|
444 | 483 | self._closed = False |
|
445 | 484 | self._bytes_compressed = 0 |
|
446 | 485 | |
|
447 |
self._dst_buffer = ffi.new( |
|
|
448 |
self._out_buffer = ffi.new( |
|
|
486 | self._dst_buffer = ffi.new("char[]", write_size) | |
|
487 | self._out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
449 | 488 | self._out_buffer.dst = self._dst_buffer |
|
450 | 489 | self._out_buffer.size = len(self._dst_buffer) |
|
451 | 490 | self._out_buffer.pos = 0 |
|
452 | 491 | |
|
453 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(compressor._cctx, | |
|
454 | source_size) | |
|
492 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(compressor._cctx, source_size) | |
|
455 | 493 | if lib.ZSTD_isError(zresult): |
|
456 |
raise ZstdError( |
|
|
457 | _zstd_error(zresult)) | |
|
494 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
458 | 495 | |
|
459 | 496 | def __enter__(self): |
|
460 | 497 | if self._closed: |
|
461 |
raise ValueError( |
|
|
498 | raise ValueError("stream is closed") | |
|
462 | 499 | |
|
463 | 500 | if self._entered: |
|
464 |
raise ZstdError( |
|
|
501 | raise ZstdError("cannot __enter__ multiple times") | |
|
465 | 502 | |
|
466 | 503 | self._entered = True |
|
467 | 504 | return self |
@@ -480,11 +517,11 class ZstdCompressionWriter(object): | |||
|
480 | 517 | return lib.ZSTD_sizeof_CCtx(self._compressor._cctx) |
|
481 | 518 | |
|
482 | 519 | def fileno(self): |
|
483 |
f = getattr(self._writer, |
|
|
520 | f = getattr(self._writer, "fileno", None) | |
|
484 | 521 | if f: |
|
485 | 522 | return f() |
|
486 | 523 | else: |
|
487 |
raise OSError( |
|
|
524 | raise OSError("fileno not available on underlying writer") | |
|
488 | 525 | |
|
489 | 526 | def close(self): |
|
490 | 527 | if self._closed: |
@@ -496,7 +533,7 class ZstdCompressionWriter(object): | |||
|
496 | 533 | self._closed = True |
|
497 | 534 | |
|
498 | 535 | # Call close() on underlying stream as well. |
|
499 |
f = getattr(self._writer, |
|
|
536 | f = getattr(self._writer, "close", None) | |
|
500 | 537 | if f: |
|
501 | 538 | f() |
|
502 | 539 | |
@@ -529,7 +566,7 class ZstdCompressionWriter(object): | |||
|
529 | 566 | return True |
|
530 | 567 | |
|
531 | 568 | def writelines(self, lines): |
|
532 |
raise NotImplementedError( |
|
|
569 | raise NotImplementedError("writelines() is not yet implemented") | |
|
533 | 570 | |
|
534 | 571 | def read(self, size=-1): |
|
535 | 572 | raise io.UnsupportedOperation() |
@@ -542,13 +579,13 class ZstdCompressionWriter(object): | |||
|
542 | 579 | |
|
543 | 580 | def write(self, data): |
|
544 | 581 | if self._closed: |
|
545 |
raise ValueError( |
|
|
582 | raise ValueError("stream is closed") | |
|
546 | 583 | |
|
547 | 584 | total_write = 0 |
|
548 | 585 | |
|
549 | 586 | data_buffer = ffi.from_buffer(data) |
|
550 | 587 | |
|
551 |
in_buffer = ffi.new( |
|
|
588 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
552 | 589 | in_buffer.src = data_buffer |
|
553 | 590 | in_buffer.size = len(data_buffer) |
|
554 | 591 | in_buffer.pos = 0 |
@@ -557,12 +594,11 class ZstdCompressionWriter(object): | |||
|
557 | 594 | out_buffer.pos = 0 |
|
558 | 595 | |
|
559 | 596 | while in_buffer.pos < in_buffer.size: |
|
560 |
zresult = lib.ZSTD_compressStream2( |
|
|
561 | out_buffer, in_buffer, | |
|
562 | lib.ZSTD_e_continue) | |
|
597 | zresult = lib.ZSTD_compressStream2( | |
|
598 | self._compressor._cctx, out_buffer, in_buffer, lib.ZSTD_e_continue | |
|
599 | ) | |
|
563 | 600 | if lib.ZSTD_isError(zresult): |
|
564 |
raise ZstdError( |
|
|
565 | _zstd_error(zresult)) | |
|
601 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
566 | 602 | |
|
567 | 603 | if out_buffer.pos: |
|
568 | 604 | self._writer.write(ffi.buffer(out_buffer.dst, out_buffer.pos)[:]) |
@@ -581,28 +617,27 class ZstdCompressionWriter(object): | |||
|
581 | 617 | elif flush_mode == FLUSH_FRAME: |
|
582 | 618 | flush = lib.ZSTD_e_end |
|
583 | 619 | else: |
|
584 |
raise ValueError( |
|
|
620 | raise ValueError("unknown flush_mode: %r" % flush_mode) | |
|
585 | 621 | |
|
586 | 622 | if self._closed: |
|
587 |
raise ValueError( |
|
|
623 | raise ValueError("stream is closed") | |
|
588 | 624 | |
|
589 | 625 | total_write = 0 |
|
590 | 626 | |
|
591 | 627 | out_buffer = self._out_buffer |
|
592 | 628 | out_buffer.pos = 0 |
|
593 | 629 | |
|
594 |
in_buffer = ffi.new( |
|
|
630 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
595 | 631 | in_buffer.src = ffi.NULL |
|
596 | 632 | in_buffer.size = 0 |
|
597 | 633 | in_buffer.pos = 0 |
|
598 | 634 | |
|
599 | 635 | while True: |
|
600 |
zresult = lib.ZSTD_compressStream2( |
|
|
601 | out_buffer, in_buffer, | |
|
602 | flush) | |
|
636 | zresult = lib.ZSTD_compressStream2( | |
|
637 | self._compressor._cctx, out_buffer, in_buffer, flush | |
|
638 | ) | |
|
603 | 639 | if lib.ZSTD_isError(zresult): |
|
604 |
raise ZstdError( |
|
|
605 | _zstd_error(zresult)) | |
|
640 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
606 | 641 | |
|
607 | 642 | if out_buffer.pos: |
|
608 | 643 | self._writer.write(ffi.buffer(out_buffer.dst, out_buffer.pos)[:]) |
@@ -622,10 +657,10 class ZstdCompressionWriter(object): | |||
|
622 | 657 | class ZstdCompressionObj(object): |
|
623 | 658 | def compress(self, data): |
|
624 | 659 | if self._finished: |
|
625 |
raise ZstdError( |
|
|
660 | raise ZstdError("cannot call compress() after compressor finished") | |
|
626 | 661 | |
|
627 | 662 | data_buffer = ffi.from_buffer(data) |
|
628 |
source = ffi.new( |
|
|
663 | source = ffi.new("ZSTD_inBuffer *") | |
|
629 | 664 | source.src = data_buffer |
|
630 | 665 | source.size = len(data_buffer) |
|
631 | 666 | source.pos = 0 |
@@ -633,26 +668,24 class ZstdCompressionObj(object): | |||
|
633 | 668 | chunks = [] |
|
634 | 669 | |
|
635 | 670 | while source.pos < len(data): |
|
636 |
zresult = lib.ZSTD_compressStream2( |
|
|
637 | self._out, | |
|
638 | source, | |
|
639 | lib.ZSTD_e_continue) | |
|
671 | zresult = lib.ZSTD_compressStream2( | |
|
672 | self._compressor._cctx, self._out, source, lib.ZSTD_e_continue | |
|
673 | ) | |
|
640 | 674 | if lib.ZSTD_isError(zresult): |
|
641 |
raise ZstdError( |
|
|
642 | _zstd_error(zresult)) | |
|
675 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
643 | 676 | |
|
644 | 677 | if self._out.pos: |
|
645 | 678 | chunks.append(ffi.buffer(self._out.dst, self._out.pos)[:]) |
|
646 | 679 | self._out.pos = 0 |
|
647 | 680 | |
|
648 |
return b |
|
|
681 | return b"".join(chunks) | |
|
649 | 682 | |
|
650 | 683 | def flush(self, flush_mode=COMPRESSOBJ_FLUSH_FINISH): |
|
651 | 684 | if flush_mode not in (COMPRESSOBJ_FLUSH_FINISH, COMPRESSOBJ_FLUSH_BLOCK): |
|
652 |
raise ValueError( |
|
|
685 | raise ValueError("flush mode not recognized") | |
|
653 | 686 | |
|
654 | 687 | if self._finished: |
|
655 |
raise ZstdError( |
|
|
688 | raise ZstdError("compressor object already finished") | |
|
656 | 689 | |
|
657 | 690 | if flush_mode == COMPRESSOBJ_FLUSH_BLOCK: |
|
658 | 691 | z_flush_mode = lib.ZSTD_e_flush |
@@ -660,11 +693,11 class ZstdCompressionObj(object): | |||
|
660 | 693 | z_flush_mode = lib.ZSTD_e_end |
|
661 | 694 | self._finished = True |
|
662 | 695 | else: |
|
663 |
raise ZstdError( |
|
|
696 | raise ZstdError("unhandled flush mode") | |
|
664 | 697 | |
|
665 | 698 | assert self._out.pos == 0 |
|
666 | 699 | |
|
667 |
in_buffer = ffi.new( |
|
|
700 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
668 | 701 | in_buffer.src = ffi.NULL |
|
669 | 702 | in_buffer.size = 0 |
|
670 | 703 | in_buffer.pos = 0 |
@@ -672,13 +705,13 class ZstdCompressionObj(object): | |||
|
672 | 705 | chunks = [] |
|
673 | 706 | |
|
674 | 707 | while True: |
|
675 |
zresult = lib.ZSTD_compressStream2( |
|
|
676 | self._out, | |
|
677 | in_buffer, | |
|
678 | z_flush_mode) | |
|
708 | zresult = lib.ZSTD_compressStream2( | |
|
709 | self._compressor._cctx, self._out, in_buffer, z_flush_mode | |
|
710 | ) | |
|
679 | 711 | if lib.ZSTD_isError(zresult): |
|
680 |
raise ZstdError( |
|
|
681 |
|
|
|
712 | raise ZstdError( | |
|
713 | "error ending compression stream: %s" % _zstd_error(zresult) | |
|
714 | ) | |
|
682 | 715 | |
|
683 | 716 | if self._out.pos: |
|
684 | 717 | chunks.append(ffi.buffer(self._out.dst, self._out.pos)[:]) |
@@ -687,19 +720,19 class ZstdCompressionObj(object): | |||
|
687 | 720 | if not zresult: |
|
688 | 721 | break |
|
689 | 722 | |
|
690 |
return b |
|
|
723 | return b"".join(chunks) | |
|
691 | 724 | |
|
692 | 725 | |
|
693 | 726 | class ZstdCompressionChunker(object): |
|
694 | 727 | def __init__(self, compressor, chunk_size): |
|
695 | 728 | self._compressor = compressor |
|
696 |
self._out = ffi.new( |
|
|
697 |
self._dst_buffer = ffi.new( |
|
|
729 | self._out = ffi.new("ZSTD_outBuffer *") | |
|
730 | self._dst_buffer = ffi.new("char[]", chunk_size) | |
|
698 | 731 | self._out.dst = self._dst_buffer |
|
699 | 732 | self._out.size = chunk_size |
|
700 | 733 | self._out.pos = 0 |
|
701 | 734 | |
|
702 |
self._in = ffi.new( |
|
|
735 | self._in = ffi.new("ZSTD_inBuffer *") | |
|
703 | 736 | self._in.src = ffi.NULL |
|
704 | 737 | self._in.size = 0 |
|
705 | 738 | self._in.pos = 0 |
@@ -707,11 +740,13 class ZstdCompressionChunker(object): | |||
|
707 | 740 | |
|
708 | 741 | def compress(self, data): |
|
709 | 742 | if self._finished: |
|
710 |
raise ZstdError( |
|
|
743 | raise ZstdError("cannot call compress() after compression finished") | |
|
711 | 744 | |
|
712 | 745 | if self._in.src != ffi.NULL: |
|
713 | raise ZstdError('cannot perform operation before consuming output ' | |
|
714 | 'from previous operation') | |
|
746 | raise ZstdError( | |
|
747 | "cannot perform operation before consuming output " | |
|
748 | "from previous operation" | |
|
749 | ) | |
|
715 | 750 | |
|
716 | 751 | data_buffer = ffi.from_buffer(data) |
|
717 | 752 | |
@@ -723,10 +758,9 class ZstdCompressionChunker(object): | |||
|
723 | 758 | self._in.pos = 0 |
|
724 | 759 | |
|
725 | 760 | while self._in.pos < self._in.size: |
|
726 |
zresult = lib.ZSTD_compressStream2( |
|
|
727 | self._out, | |
|
728 | self._in, | |
|
729 | lib.ZSTD_e_continue) | |
|
761 | zresult = lib.ZSTD_compressStream2( | |
|
762 | self._compressor._cctx, self._out, self._in, lib.ZSTD_e_continue | |
|
763 | ) | |
|
730 | 764 | |
|
731 | 765 | if self._in.pos == self._in.size: |
|
732 | 766 | self._in.src = ffi.NULL |
@@ -734,8 +768,7 class ZstdCompressionChunker(object): | |||
|
734 | 768 | self._in.pos = 0 |
|
735 | 769 | |
|
736 | 770 | if lib.ZSTD_isError(zresult): |
|
737 |
raise ZstdError( |
|
|
738 | _zstd_error(zresult)) | |
|
771 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
739 | 772 | |
|
740 | 773 | if self._out.pos == self._out.size: |
|
741 | 774 | yield ffi.buffer(self._out.dst, self._out.pos)[:] |
@@ -743,18 +776,19 class ZstdCompressionChunker(object): | |||
|
743 | 776 | |
|
744 | 777 | def flush(self): |
|
745 | 778 | if self._finished: |
|
746 |
raise ZstdError( |
|
|
779 | raise ZstdError("cannot call flush() after compression finished") | |
|
747 | 780 | |
|
748 | 781 | if self._in.src != ffi.NULL: |
|
749 | raise ZstdError('cannot call flush() before consuming output from ' | |
|
750 |
|
|
|
782 | raise ZstdError( | |
|
783 | "cannot call flush() before consuming output from " "previous operation" | |
|
784 | ) | |
|
751 | 785 | |
|
752 | 786 | while True: |
|
753 |
zresult = lib.ZSTD_compressStream2( |
|
|
754 | self._out, self._in, | |
|
755 | lib.ZSTD_e_flush) | |
|
787 | zresult = lib.ZSTD_compressStream2( | |
|
788 | self._compressor._cctx, self._out, self._in, lib.ZSTD_e_flush | |
|
789 | ) | |
|
756 | 790 | if lib.ZSTD_isError(zresult): |
|
757 |
raise ZstdError( |
|
|
791 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
758 | 792 | |
|
759 | 793 | if self._out.pos: |
|
760 | 794 | yield ffi.buffer(self._out.dst, self._out.pos)[:] |
@@ -765,18 +799,20 class ZstdCompressionChunker(object): | |||
|
765 | 799 | |
|
766 | 800 | def finish(self): |
|
767 | 801 | if self._finished: |
|
768 |
raise ZstdError( |
|
|
802 | raise ZstdError("cannot call finish() after compression finished") | |
|
769 | 803 | |
|
770 | 804 | if self._in.src != ffi.NULL: |
|
771 | raise ZstdError('cannot call finish() before consuming output from ' | |
|
772 | 'previous operation') | |
|
805 | raise ZstdError( | |
|
806 | "cannot call finish() before consuming output from " | |
|
807 | "previous operation" | |
|
808 | ) | |
|
773 | 809 | |
|
774 | 810 | while True: |
|
775 |
zresult = lib.ZSTD_compressStream2( |
|
|
776 | self._out, self._in, | |
|
777 | lib.ZSTD_e_end) | |
|
811 | zresult = lib.ZSTD_compressStream2( | |
|
812 | self._compressor._cctx, self._out, self._in, lib.ZSTD_e_end | |
|
813 | ) | |
|
778 | 814 | if lib.ZSTD_isError(zresult): |
|
779 |
raise ZstdError( |
|
|
815 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
780 | 816 | |
|
781 | 817 | if self._out.pos: |
|
782 | 818 | yield ffi.buffer(self._out.dst, self._out.pos)[:] |
@@ -798,13 +834,13 class ZstdCompressionReader(object): | |||
|
798 | 834 | self._finished_input = False |
|
799 | 835 | self._finished_output = False |
|
800 | 836 | |
|
801 |
self._in_buffer = ffi.new( |
|
|
837 | self._in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
802 | 838 | # Holds a ref so backing bytes in self._in_buffer stay alive. |
|
803 | 839 | self._source_buffer = None |
|
804 | 840 | |
|
805 | 841 | def __enter__(self): |
|
806 | 842 | if self._entered: |
|
807 |
raise ValueError( |
|
|
843 | raise ValueError("cannot __enter__ multiple times") | |
|
808 | 844 | |
|
809 | 845 | self._entered = True |
|
810 | 846 | return self |
@@ -833,10 +869,10 class ZstdCompressionReader(object): | |||
|
833 | 869 | raise io.UnsupportedOperation() |
|
834 | 870 | |
|
835 | 871 | def write(self, data): |
|
836 |
raise OSError( |
|
|
872 | raise OSError("stream is not writable") | |
|
837 | 873 | |
|
838 | 874 | def writelines(self, ignored): |
|
839 |
raise OSError( |
|
|
875 | raise OSError("stream is not writable") | |
|
840 | 876 | |
|
841 | 877 | def isatty(self): |
|
842 | 878 | return False |
@@ -865,7 +901,7 class ZstdCompressionReader(object): | |||
|
865 | 901 | |
|
866 | 902 | chunks.append(chunk) |
|
867 | 903 | |
|
868 |
return b |
|
|
904 | return b"".join(chunks) | |
|
869 | 905 | |
|
870 | 906 | def __iter__(self): |
|
871 | 907 | raise io.UnsupportedOperation() |
@@ -879,7 +915,7 class ZstdCompressionReader(object): | |||
|
879 | 915 | if self._finished_input: |
|
880 | 916 | return |
|
881 | 917 | |
|
882 |
if hasattr(self._source, |
|
|
918 | if hasattr(self._source, "read"): | |
|
883 | 919 | data = self._source.read(self._read_size) |
|
884 | 920 | |
|
885 | 921 | if not data: |
@@ -902,9 +938,9 class ZstdCompressionReader(object): | |||
|
902 | 938 | |
|
903 | 939 | old_pos = out_buffer.pos |
|
904 | 940 | |
|
905 |
zresult = lib.ZSTD_compressStream2( |
|
|
906 | out_buffer, self._in_buffer, | |
|
907 | lib.ZSTD_e_continue) | |
|
941 | zresult = lib.ZSTD_compressStream2( | |
|
942 | self._compressor._cctx, out_buffer, self._in_buffer, lib.ZSTD_e_continue | |
|
943 | ) | |
|
908 | 944 | |
|
909 | 945 | self._bytes_compressed += out_buffer.pos - old_pos |
|
910 | 946 | |
@@ -914,31 +950,30 class ZstdCompressionReader(object): | |||
|
914 | 950 | self._in_buffer.size = 0 |
|
915 | 951 | self._source_buffer = None |
|
916 | 952 | |
|
917 |
if not hasattr(self._source, |
|
|
953 | if not hasattr(self._source, "read"): | |
|
918 | 954 | self._finished_input = True |
|
919 | 955 | |
|
920 | 956 | if lib.ZSTD_isError(zresult): |
|
921 |
raise ZstdError( |
|
|
922 | _zstd_error(zresult)) | |
|
957 | raise ZstdError("zstd compress error: %s", _zstd_error(zresult)) | |
|
923 | 958 | |
|
924 | 959 | return out_buffer.pos and out_buffer.pos == out_buffer.size |
|
925 | 960 | |
|
926 | 961 | def read(self, size=-1): |
|
927 | 962 | if self._closed: |
|
928 |
raise ValueError( |
|
|
963 | raise ValueError("stream is closed") | |
|
929 | 964 | |
|
930 | 965 | if size < -1: |
|
931 |
raise ValueError( |
|
|
966 | raise ValueError("cannot read negative amounts less than -1") | |
|
932 | 967 | |
|
933 | 968 | if size == -1: |
|
934 | 969 | return self.readall() |
|
935 | 970 | |
|
936 | 971 | if self._finished_output or size == 0: |
|
937 |
return b |
|
|
972 | return b"" | |
|
938 | 973 | |
|
939 | 974 | # Need a dedicated ref to dest buffer otherwise it gets collected. |
|
940 |
dst_buffer = ffi.new( |
|
|
941 |
out_buffer = ffi.new( |
|
|
975 | dst_buffer = ffi.new("char[]", size) | |
|
976 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
942 | 977 | out_buffer.dst = dst_buffer |
|
943 | 978 | out_buffer.size = size |
|
944 | 979 | out_buffer.pos = 0 |
@@ -955,15 +990,14 class ZstdCompressionReader(object): | |||
|
955 | 990 | # EOF |
|
956 | 991 | old_pos = out_buffer.pos |
|
957 | 992 | |
|
958 |
zresult = lib.ZSTD_compressStream2( |
|
|
959 | out_buffer, self._in_buffer, | |
|
960 | lib.ZSTD_e_end) | |
|
993 | zresult = lib.ZSTD_compressStream2( | |
|
994 | self._compressor._cctx, out_buffer, self._in_buffer, lib.ZSTD_e_end | |
|
995 | ) | |
|
961 | 996 | |
|
962 | 997 | self._bytes_compressed += out_buffer.pos - old_pos |
|
963 | 998 | |
|
964 | 999 | if lib.ZSTD_isError(zresult): |
|
965 |
raise ZstdError( |
|
|
966 | _zstd_error(zresult)) | |
|
1000 | raise ZstdError("error ending compression stream: %s", _zstd_error(zresult)) | |
|
967 | 1001 | |
|
968 | 1002 | if zresult == 0: |
|
969 | 1003 | self._finished_output = True |
@@ -972,20 +1006,20 class ZstdCompressionReader(object): | |||
|
972 | 1006 | |
|
973 | 1007 | def read1(self, size=-1): |
|
974 | 1008 | if self._closed: |
|
975 |
raise ValueError( |
|
|
1009 | raise ValueError("stream is closed") | |
|
976 | 1010 | |
|
977 | 1011 | if size < -1: |
|
978 |
raise ValueError( |
|
|
1012 | raise ValueError("cannot read negative amounts less than -1") | |
|
979 | 1013 | |
|
980 | 1014 | if self._finished_output or size == 0: |
|
981 |
return b |
|
|
1015 | return b"" | |
|
982 | 1016 | |
|
983 | 1017 | # -1 returns arbitrary number of bytes. |
|
984 | 1018 | if size == -1: |
|
985 | 1019 | size = COMPRESSION_RECOMMENDED_OUTPUT_SIZE |
|
986 | 1020 | |
|
987 |
dst_buffer = ffi.new( |
|
|
988 |
out_buffer = ffi.new( |
|
|
1021 | dst_buffer = ffi.new("char[]", size) | |
|
1022 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
989 | 1023 | out_buffer.dst = dst_buffer |
|
990 | 1024 | out_buffer.size = size |
|
991 | 1025 | out_buffer.pos = 0 |
@@ -1020,15 +1054,16 class ZstdCompressionReader(object): | |||
|
1020 | 1054 | # EOF. |
|
1021 | 1055 | old_pos = out_buffer.pos |
|
1022 | 1056 | |
|
1023 |
zresult = lib.ZSTD_compressStream2( |
|
|
1024 | out_buffer, self._in_buffer, | |
|
1025 | lib.ZSTD_e_end) | |
|
1057 | zresult = lib.ZSTD_compressStream2( | |
|
1058 | self._compressor._cctx, out_buffer, self._in_buffer, lib.ZSTD_e_end | |
|
1059 | ) | |
|
1026 | 1060 | |
|
1027 | 1061 | self._bytes_compressed += out_buffer.pos - old_pos |
|
1028 | 1062 | |
|
1029 | 1063 | if lib.ZSTD_isError(zresult): |
|
1030 | raise ZstdError('error ending compression stream: %s' % | |
|
1031 |
|
|
|
1064 | raise ZstdError( | |
|
1065 | "error ending compression stream: %s" % _zstd_error(zresult) | |
|
1066 | ) | |
|
1032 | 1067 | |
|
1033 | 1068 | if zresult == 0: |
|
1034 | 1069 | self._finished_output = True |
@@ -1037,15 +1072,15 class ZstdCompressionReader(object): | |||
|
1037 | 1072 | |
|
1038 | 1073 | def readinto(self, b): |
|
1039 | 1074 | if self._closed: |
|
1040 |
raise ValueError( |
|
|
1075 | raise ValueError("stream is closed") | |
|
1041 | 1076 | |
|
1042 | 1077 | if self._finished_output: |
|
1043 | 1078 | return 0 |
|
1044 | 1079 | |
|
1045 | 1080 | # TODO use writable=True once we require CFFI >= 1.12. |
|
1046 | 1081 | dest_buffer = ffi.from_buffer(b) |
|
1047 |
ffi.memmove(b, b |
|
|
1048 |
out_buffer = ffi.new( |
|
|
1082 | ffi.memmove(b, b"", 0) | |
|
1083 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1049 | 1084 | out_buffer.dst = dest_buffer |
|
1050 | 1085 | out_buffer.size = len(dest_buffer) |
|
1051 | 1086 | out_buffer.pos = 0 |
@@ -1060,15 +1095,14 class ZstdCompressionReader(object): | |||
|
1060 | 1095 | |
|
1061 | 1096 | # EOF. |
|
1062 | 1097 | old_pos = out_buffer.pos |
|
1063 |
zresult = lib.ZSTD_compressStream2( |
|
|
1064 | out_buffer, self._in_buffer, | |
|
1065 | lib.ZSTD_e_end) | |
|
1098 | zresult = lib.ZSTD_compressStream2( | |
|
1099 | self._compressor._cctx, out_buffer, self._in_buffer, lib.ZSTD_e_end | |
|
1100 | ) | |
|
1066 | 1101 | |
|
1067 | 1102 | self._bytes_compressed += out_buffer.pos - old_pos |
|
1068 | 1103 | |
|
1069 | 1104 | if lib.ZSTD_isError(zresult): |
|
1070 |
raise ZstdError( |
|
|
1071 | _zstd_error(zresult)) | |
|
1105 | raise ZstdError("error ending compression stream: %s", _zstd_error(zresult)) | |
|
1072 | 1106 | |
|
1073 | 1107 | if zresult == 0: |
|
1074 | 1108 | self._finished_output = True |
@@ -1077,16 +1111,16 class ZstdCompressionReader(object): | |||
|
1077 | 1111 | |
|
1078 | 1112 | def readinto1(self, b): |
|
1079 | 1113 | if self._closed: |
|
1080 |
raise ValueError( |
|
|
1114 | raise ValueError("stream is closed") | |
|
1081 | 1115 | |
|
1082 | 1116 | if self._finished_output: |
|
1083 | 1117 | return 0 |
|
1084 | 1118 | |
|
1085 | 1119 | # TODO use writable=True once we require CFFI >= 1.12. |
|
1086 | 1120 | dest_buffer = ffi.from_buffer(b) |
|
1087 |
ffi.memmove(b, b |
|
|
1088 | ||
|
1089 |
out_buffer = ffi.new( |
|
|
1121 | ffi.memmove(b, b"", 0) | |
|
1122 | ||
|
1123 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1090 | 1124 | out_buffer.dst = dest_buffer |
|
1091 | 1125 | out_buffer.size = len(dest_buffer) |
|
1092 | 1126 | out_buffer.pos = 0 |
@@ -1107,15 +1141,16 class ZstdCompressionReader(object): | |||
|
1107 | 1141 | # EOF. |
|
1108 | 1142 | old_pos = out_buffer.pos |
|
1109 | 1143 | |
|
1110 |
zresult = lib.ZSTD_compressStream2( |
|
|
1111 | out_buffer, self._in_buffer, | |
|
1112 | lib.ZSTD_e_end) | |
|
1144 | zresult = lib.ZSTD_compressStream2( | |
|
1145 | self._compressor._cctx, out_buffer, self._in_buffer, lib.ZSTD_e_end | |
|
1146 | ) | |
|
1113 | 1147 | |
|
1114 | 1148 | self._bytes_compressed += out_buffer.pos - old_pos |
|
1115 | 1149 | |
|
1116 | 1150 | if lib.ZSTD_isError(zresult): |
|
1117 | raise ZstdError('error ending compression stream: %s' % | |
|
1118 |
|
|
|
1151 | raise ZstdError( | |
|
1152 | "error ending compression stream: %s" % _zstd_error(zresult) | |
|
1153 | ) | |
|
1119 | 1154 | |
|
1120 | 1155 | if zresult == 0: |
|
1121 | 1156 | self._finished_output = True |
@@ -1124,29 +1159,35 class ZstdCompressionReader(object): | |||
|
1124 | 1159 | |
|
1125 | 1160 | |
|
1126 | 1161 | class ZstdCompressor(object): |
|
1127 | def __init__(self, level=3, dict_data=None, compression_params=None, | |
|
1128 | write_checksum=None, write_content_size=None, | |
|
1129 | write_dict_id=None, threads=0): | |
|
1162 | def __init__( | |
|
1163 | self, | |
|
1164 | level=3, | |
|
1165 | dict_data=None, | |
|
1166 | compression_params=None, | |
|
1167 | write_checksum=None, | |
|
1168 | write_content_size=None, | |
|
1169 | write_dict_id=None, | |
|
1170 | threads=0, | |
|
1171 | ): | |
|
1130 | 1172 | if level > lib.ZSTD_maxCLevel(): |
|
1131 |
raise ValueError( |
|
|
1173 | raise ValueError("level must be less than %d" % lib.ZSTD_maxCLevel()) | |
|
1132 | 1174 | |
|
1133 | 1175 | if threads < 0: |
|
1134 | 1176 | threads = _cpu_count() |
|
1135 | 1177 | |
|
1136 | 1178 | if compression_params and write_checksum is not None: |
|
1137 |
raise ValueError( |
|
|
1138 | 'write_checksum') | |
|
1179 | raise ValueError("cannot define compression_params and " "write_checksum") | |
|
1139 | 1180 | |
|
1140 | 1181 | if compression_params and write_content_size is not None: |
|
1141 | raise ValueError('cannot define compression_params and ' | |
|
1142 |
|
|
|
1182 | raise ValueError( | |
|
1183 | "cannot define compression_params and " "write_content_size" | |
|
1184 | ) | |
|
1143 | 1185 | |
|
1144 | 1186 | if compression_params and write_dict_id is not None: |
|
1145 |
raise ValueError( |
|
|
1146 | 'write_dict_id') | |
|
1187 | raise ValueError("cannot define compression_params and " "write_dict_id") | |
|
1147 | 1188 | |
|
1148 | 1189 | if compression_params and threads: |
|
1149 |
raise ValueError( |
|
|
1190 | raise ValueError("cannot define compression_params and threads") | |
|
1150 | 1191 | |
|
1151 | 1192 | if compression_params: |
|
1152 | 1193 | self._params = _make_cctx_params(compression_params) |
@@ -1160,27 +1201,24 class ZstdCompressor(object): | |||
|
1160 | 1201 | |
|
1161 | 1202 | self._params = ffi.gc(params, lib.ZSTD_freeCCtxParams) |
|
1162 | 1203 | |
|
1163 | _set_compression_parameter(self._params, | |
|
1164 | lib.ZSTD_c_compressionLevel, | |
|
1165 | level) | |
|
1204 | _set_compression_parameter(self._params, lib.ZSTD_c_compressionLevel, level) | |
|
1166 | 1205 | |
|
1167 | 1206 | _set_compression_parameter( |
|
1168 | 1207 | self._params, |
|
1169 | 1208 | lib.ZSTD_c_contentSizeFlag, |
|
1170 |
write_content_size if write_content_size is not None else 1 |
|
|
1171 | ||
|
1172 | _set_compression_parameter(self._params, | |
|
1173 | lib.ZSTD_c_checksumFlag, | |
|
1174 | 1 if write_checksum else 0) | |
|
1175 | ||
|
1176 | _set_compression_parameter(self._params, | |
|
1177 | lib.ZSTD_c_dictIDFlag, | |
|
1178 |
|
|
|
1209 | write_content_size if write_content_size is not None else 1, | |
|
1210 | ) | |
|
1211 | ||
|
1212 | _set_compression_parameter( | |
|
1213 | self._params, lib.ZSTD_c_checksumFlag, 1 if write_checksum else 0 | |
|
1214 | ) | |
|
1215 | ||
|
1216 | _set_compression_parameter( | |
|
1217 | self._params, lib.ZSTD_c_dictIDFlag, 1 if write_dict_id else 0 | |
|
1218 | ) | |
|
1179 | 1219 | |
|
1180 | 1220 | if threads: |
|
1181 | _set_compression_parameter(self._params, | |
|
1182 | lib.ZSTD_c_nbWorkers, | |
|
1183 | threads) | |
|
1221 | _set_compression_parameter(self._params, lib.ZSTD_c_nbWorkers, threads) | |
|
1184 | 1222 | |
|
1185 | 1223 | cctx = lib.ZSTD_createCCtx() |
|
1186 | 1224 | if cctx == ffi.NULL: |
@@ -1194,15 +1232,16 class ZstdCompressor(object): | |||
|
1194 | 1232 | try: |
|
1195 | 1233 | self._setup_cctx() |
|
1196 | 1234 | finally: |
|
1197 |
self._cctx = ffi.gc( |
|
|
1198 |
|
|
|
1235 | self._cctx = ffi.gc( | |
|
1236 | cctx, lib.ZSTD_freeCCtx, size=lib.ZSTD_sizeof_CCtx(cctx) | |
|
1237 | ) | |
|
1199 | 1238 | |
|
1200 | 1239 | def _setup_cctx(self): |
|
1201 | zresult = lib.ZSTD_CCtx_setParametersUsingCCtxParams(self._cctx, | |
|
1202 | self._params) | |
|
1240 | zresult = lib.ZSTD_CCtx_setParametersUsingCCtxParams(self._cctx, self._params) | |
|
1203 | 1241 | if lib.ZSTD_isError(zresult): |
|
1204 | raise ZstdError('could not set compression parameters: %s' % | |
|
1205 |
|
|
|
1242 | raise ZstdError( | |
|
1243 | "could not set compression parameters: %s" % _zstd_error(zresult) | |
|
1244 | ) | |
|
1206 | 1245 | |
|
1207 | 1246 | dict_data = self._dict_data |
|
1208 | 1247 | |
@@ -1211,12 +1250,17 class ZstdCompressor(object): | |||
|
1211 | 1250 | zresult = lib.ZSTD_CCtx_refCDict(self._cctx, dict_data._cdict) |
|
1212 | 1251 | else: |
|
1213 | 1252 | zresult = lib.ZSTD_CCtx_loadDictionary_advanced( |
|
1214 |
self._cctx, |
|
|
1215 |
|
|
|
1253 | self._cctx, | |
|
1254 | dict_data.as_bytes(), | |
|
1255 | len(dict_data), | |
|
1256 | lib.ZSTD_dlm_byRef, | |
|
1257 | dict_data._dict_type, | |
|
1258 | ) | |
|
1216 | 1259 | |
|
1217 | 1260 | if lib.ZSTD_isError(zresult): |
|
1218 | raise ZstdError('could not load compression dictionary: %s' % | |
|
1219 |
|
|
|
1261 | raise ZstdError( | |
|
1262 | "could not load compression dictionary: %s" % _zstd_error(zresult) | |
|
1263 | ) | |
|
1220 | 1264 | |
|
1221 | 1265 | def memory_size(self): |
|
1222 | 1266 | return lib.ZSTD_sizeof_CCtx(self._cctx) |
@@ -1227,15 +1271,14 class ZstdCompressor(object): | |||
|
1227 | 1271 | data_buffer = ffi.from_buffer(data) |
|
1228 | 1272 | |
|
1229 | 1273 | dest_size = lib.ZSTD_compressBound(len(data_buffer)) |
|
1230 |
out = new_nonzero( |
|
|
1274 | out = new_nonzero("char[]", dest_size) | |
|
1231 | 1275 | |
|
1232 | 1276 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._cctx, len(data_buffer)) |
|
1233 | 1277 | if lib.ZSTD_isError(zresult): |
|
1234 |
raise ZstdError( |
|
|
1235 | _zstd_error(zresult)) | |
|
1236 | ||
|
1237 |
|
|
|
1238 | in_buffer = ffi.new('ZSTD_inBuffer *') | |
|
1278 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
1279 | ||
|
1280 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1281 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
1239 | 1282 | |
|
1240 | 1283 | out_buffer.dst = out |
|
1241 | 1284 | out_buffer.size = dest_size |
@@ -1245,16 +1288,14 class ZstdCompressor(object): | |||
|
1245 | 1288 | in_buffer.size = len(data_buffer) |
|
1246 | 1289 | in_buffer.pos = 0 |
|
1247 | 1290 | |
|
1248 |
zresult = lib.ZSTD_compressStream2( |
|
|
1249 | out_buffer, | |
|
1250 | in_buffer, | |
|
1251 | lib.ZSTD_e_end) | |
|
1291 | zresult = lib.ZSTD_compressStream2( | |
|
1292 | self._cctx, out_buffer, in_buffer, lib.ZSTD_e_end | |
|
1293 | ) | |
|
1252 | 1294 | |
|
1253 | 1295 | if lib.ZSTD_isError(zresult): |
|
1254 |
raise ZstdError( |
|
|
1255 | _zstd_error(zresult)) | |
|
1296 | raise ZstdError("cannot compress: %s" % _zstd_error(zresult)) | |
|
1256 | 1297 | elif zresult: |
|
1257 |
raise ZstdError( |
|
|
1298 | raise ZstdError("unexpected partial frame flush") | |
|
1258 | 1299 | |
|
1259 | 1300 | return ffi.buffer(out, out_buffer.pos)[:] |
|
1260 | 1301 | |
@@ -1266,12 +1307,11 class ZstdCompressor(object): | |||
|
1266 | 1307 | |
|
1267 | 1308 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._cctx, size) |
|
1268 | 1309 | if lib.ZSTD_isError(zresult): |
|
1269 |
raise ZstdError( |
|
|
1270 | _zstd_error(zresult)) | |
|
1310 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
1271 | 1311 | |
|
1272 | 1312 | cobj = ZstdCompressionObj() |
|
1273 |
cobj._out = ffi.new( |
|
|
1274 |
cobj._dst_buffer = ffi.new( |
|
|
1313 | cobj._out = ffi.new("ZSTD_outBuffer *") | |
|
1314 | cobj._dst_buffer = ffi.new("char[]", COMPRESSION_RECOMMENDED_OUTPUT_SIZE) | |
|
1275 | 1315 | cobj._out.dst = cobj._dst_buffer |
|
1276 | 1316 | cobj._out.size = COMPRESSION_RECOMMENDED_OUTPUT_SIZE |
|
1277 | 1317 | cobj._out.pos = 0 |
@@ -1288,19 +1328,23 class ZstdCompressor(object): | |||
|
1288 | 1328 | |
|
1289 | 1329 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._cctx, size) |
|
1290 | 1330 | if lib.ZSTD_isError(zresult): |
|
1291 |
raise ZstdError( |
|
|
1292 | _zstd_error(zresult)) | |
|
1331 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
1293 | 1332 | |
|
1294 | 1333 | return ZstdCompressionChunker(self, chunk_size=chunk_size) |
|
1295 | 1334 | |
|
1296 |
def copy_stream( |
|
|
1297 | read_size=COMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
1298 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE): | |
|
1299 | ||
|
1300 | if not hasattr(ifh, 'read'): | |
|
1301 | raise ValueError('first argument must have a read() method') | |
|
1302 | if not hasattr(ofh, 'write'): | |
|
1303 | raise ValueError('second argument must have a write() method') | |
|
1335 | def copy_stream( | |
|
1336 | self, | |
|
1337 | ifh, | |
|
1338 | ofh, | |
|
1339 | size=-1, | |
|
1340 | read_size=COMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
1341 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
1342 | ): | |
|
1343 | ||
|
1344 | if not hasattr(ifh, "read"): | |
|
1345 | raise ValueError("first argument must have a read() method") | |
|
1346 | if not hasattr(ofh, "write"): | |
|
1347 | raise ValueError("second argument must have a write() method") | |
|
1304 | 1348 | |
|
1305 | 1349 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) |
|
1306 | 1350 | |
@@ -1309,13 +1353,12 class ZstdCompressor(object): | |||
|
1309 | 1353 | |
|
1310 | 1354 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._cctx, size) |
|
1311 | 1355 | if lib.ZSTD_isError(zresult): |
|
1312 |
raise ZstdError( |
|
|
1313 | _zstd_error(zresult)) | |
|
1314 | ||
|
1315 |
|
|
|
1316 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
1317 | ||
|
1318 | dst_buffer = ffi.new('char[]', write_size) | |
|
1356 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
1357 | ||
|
1358 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
1359 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1360 | ||
|
1361 | dst_buffer = ffi.new("char[]", write_size) | |
|
1319 | 1362 | out_buffer.dst = dst_buffer |
|
1320 | 1363 | out_buffer.size = write_size |
|
1321 | 1364 | out_buffer.pos = 0 |
@@ -1334,13 +1377,11 class ZstdCompressor(object): | |||
|
1334 | 1377 | in_buffer.pos = 0 |
|
1335 | 1378 | |
|
1336 | 1379 | while in_buffer.pos < in_buffer.size: |
|
1337 |
zresult = lib.ZSTD_compressStream2( |
|
|
1338 | out_buffer, | |
|
1339 | in_buffer, | |
|
1340 | lib.ZSTD_e_continue) | |
|
1380 | zresult = lib.ZSTD_compressStream2( | |
|
1381 | self._cctx, out_buffer, in_buffer, lib.ZSTD_e_continue | |
|
1382 | ) | |
|
1341 | 1383 | if lib.ZSTD_isError(zresult): |
|
1342 |
raise ZstdError( |
|
|
1343 | _zstd_error(zresult)) | |
|
1384 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
1344 | 1385 | |
|
1345 | 1386 | if out_buffer.pos: |
|
1346 | 1387 | ofh.write(ffi.buffer(out_buffer.dst, out_buffer.pos)) |
@@ -1349,13 +1390,13 class ZstdCompressor(object): | |||
|
1349 | 1390 | |
|
1350 | 1391 | # We've finished reading. Flush the compressor. |
|
1351 | 1392 | while True: |
|
1352 |
zresult = lib.ZSTD_compressStream2( |
|
|
1353 | out_buffer, | |
|
1354 | in_buffer, | |
|
1355 | lib.ZSTD_e_end) | |
|
1393 | zresult = lib.ZSTD_compressStream2( | |
|
1394 | self._cctx, out_buffer, in_buffer, lib.ZSTD_e_end | |
|
1395 | ) | |
|
1356 | 1396 | if lib.ZSTD_isError(zresult): |
|
1357 |
raise ZstdError( |
|
|
1358 |
|
|
|
1397 | raise ZstdError( | |
|
1398 | "error ending compression stream: %s" % _zstd_error(zresult) | |
|
1399 | ) | |
|
1359 | 1400 | |
|
1360 | 1401 | if out_buffer.pos: |
|
1361 | 1402 | ofh.write(ffi.buffer(out_buffer.dst, out_buffer.pos)) |
@@ -1367,8 +1408,9 class ZstdCompressor(object): | |||
|
1367 | 1408 | |
|
1368 | 1409 | return total_read, total_write |
|
1369 | 1410 | |
|
1370 |
def stream_reader( |
|
|
1371 |
|
|
|
1411 | def stream_reader( | |
|
1412 | self, source, size=-1, read_size=COMPRESSION_RECOMMENDED_INPUT_SIZE | |
|
1413 | ): | |
|
1372 | 1414 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) |
|
1373 | 1415 | |
|
1374 | 1416 | try: |
@@ -1381,40 +1423,48 class ZstdCompressor(object): | |||
|
1381 | 1423 | |
|
1382 | 1424 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._cctx, size) |
|
1383 | 1425 | if lib.ZSTD_isError(zresult): |
|
1384 |
raise ZstdError( |
|
|
1385 | _zstd_error(zresult)) | |
|
1426 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
1386 | 1427 | |
|
1387 | 1428 | return ZstdCompressionReader(self, source, read_size) |
|
1388 | 1429 | |
|
1389 |
def stream_writer( |
|
|
1390 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
1391 | write_return_read=False): | |
|
1392 | ||
|
1393 | if not hasattr(writer, 'write'): | |
|
1394 | raise ValueError('must pass an object with a write() method') | |
|
1430 | def stream_writer( | |
|
1431 | self, | |
|
1432 | writer, | |
|
1433 | size=-1, | |
|
1434 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
1435 | write_return_read=False, | |
|
1436 | ): | |
|
1437 | ||
|
1438 | if not hasattr(writer, "write"): | |
|
1439 | raise ValueError("must pass an object with a write() method") | |
|
1395 | 1440 | |
|
1396 | 1441 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) |
|
1397 | 1442 | |
|
1398 | 1443 | if size < 0: |
|
1399 | 1444 | size = lib.ZSTD_CONTENTSIZE_UNKNOWN |
|
1400 | 1445 | |
|
1401 | return ZstdCompressionWriter(self, writer, size, write_size, | |
|
1402 | write_return_read) | |
|
1446 | return ZstdCompressionWriter(self, writer, size, write_size, write_return_read) | |
|
1403 | 1447 | |
|
1404 | 1448 | write_to = stream_writer |
|
1405 | 1449 | |
|
1406 |
def read_to_iter( |
|
|
1407 | read_size=COMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
1408 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE): | |
|
1409 | if hasattr(reader, 'read'): | |
|
1450 | def read_to_iter( | |
|
1451 | self, | |
|
1452 | reader, | |
|
1453 | size=-1, | |
|
1454 | read_size=COMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
1455 | write_size=COMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
1456 | ): | |
|
1457 | if hasattr(reader, "read"): | |
|
1410 | 1458 | have_read = True |
|
1411 |
elif hasattr(reader, |
|
|
1459 | elif hasattr(reader, "__getitem__"): | |
|
1412 | 1460 | have_read = False |
|
1413 | 1461 | buffer_offset = 0 |
|
1414 | 1462 | size = len(reader) |
|
1415 | 1463 | else: |
|
1416 | raise ValueError('must pass an object with a read() method or ' | |
|
1417 | 'conforms to buffer protocol') | |
|
1464 | raise ValueError( | |
|
1465 | "must pass an object with a read() method or " | |
|
1466 | "conforms to buffer protocol" | |
|
1467 | ) | |
|
1418 | 1468 | |
|
1419 | 1469 | lib.ZSTD_CCtx_reset(self._cctx, lib.ZSTD_reset_session_only) |
|
1420 | 1470 | |
@@ -1423,17 +1473,16 class ZstdCompressor(object): | |||
|
1423 | 1473 | |
|
1424 | 1474 | zresult = lib.ZSTD_CCtx_setPledgedSrcSize(self._cctx, size) |
|
1425 | 1475 | if lib.ZSTD_isError(zresult): |
|
1426 |
raise ZstdError( |
|
|
1427 | _zstd_error(zresult)) | |
|
1428 | ||
|
1429 |
|
|
|
1430 | out_buffer = ffi.new('ZSTD_outBuffer *') | |
|
1476 | raise ZstdError("error setting source size: %s" % _zstd_error(zresult)) | |
|
1477 | ||
|
1478 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
1479 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1431 | 1480 | |
|
1432 | 1481 | in_buffer.src = ffi.NULL |
|
1433 | 1482 | in_buffer.size = 0 |
|
1434 | 1483 | in_buffer.pos = 0 |
|
1435 | 1484 | |
|
1436 |
dst_buffer = ffi.new( |
|
|
1485 | dst_buffer = ffi.new("char[]", write_size) | |
|
1437 | 1486 | out_buffer.dst = dst_buffer |
|
1438 | 1487 | out_buffer.size = write_size |
|
1439 | 1488 | out_buffer.pos = 0 |
@@ -1449,7 +1498,7 class ZstdCompressor(object): | |||
|
1449 | 1498 | else: |
|
1450 | 1499 | remaining = len(reader) - buffer_offset |
|
1451 | 1500 | slice_size = min(remaining, read_size) |
|
1452 | read_result = reader[buffer_offset:buffer_offset + slice_size] | |
|
1501 | read_result = reader[buffer_offset : buffer_offset + slice_size] | |
|
1453 | 1502 | buffer_offset += slice_size |
|
1454 | 1503 | |
|
1455 | 1504 | # No new input data. Break out of the read loop. |
@@ -1464,11 +1513,11 class ZstdCompressor(object): | |||
|
1464 | 1513 | in_buffer.pos = 0 |
|
1465 | 1514 | |
|
1466 | 1515 | while in_buffer.pos < in_buffer.size: |
|
1467 |
zresult = lib.ZSTD_compressStream2( |
|
|
1468 | lib.ZSTD_e_continue) | |
|
1516 | zresult = lib.ZSTD_compressStream2( | |
|
1517 | self._cctx, out_buffer, in_buffer, lib.ZSTD_e_continue | |
|
1518 | ) | |
|
1469 | 1519 | if lib.ZSTD_isError(zresult): |
|
1470 |
raise ZstdError( |
|
|
1471 | _zstd_error(zresult)) | |
|
1520 | raise ZstdError("zstd compress error: %s" % _zstd_error(zresult)) | |
|
1472 | 1521 | |
|
1473 | 1522 | if out_buffer.pos: |
|
1474 | 1523 | data = ffi.buffer(out_buffer.dst, out_buffer.pos)[:] |
@@ -1484,13 +1533,13 class ZstdCompressor(object): | |||
|
1484 | 1533 | # remains. |
|
1485 | 1534 | while True: |
|
1486 | 1535 | assert out_buffer.pos == 0 |
|
1487 |
zresult = lib.ZSTD_compressStream2( |
|
|
1488 | out_buffer, | |
|
1489 | in_buffer, | |
|
1490 | lib.ZSTD_e_end) | |
|
1536 | zresult = lib.ZSTD_compressStream2( | |
|
1537 | self._cctx, out_buffer, in_buffer, lib.ZSTD_e_end | |
|
1538 | ) | |
|
1491 | 1539 | if lib.ZSTD_isError(zresult): |
|
1492 |
raise ZstdError( |
|
|
1493 |
|
|
|
1540 | raise ZstdError( | |
|
1541 | "error ending compression stream: %s" % _zstd_error(zresult) | |
|
1542 | ) | |
|
1494 | 1543 | |
|
1495 | 1544 | if out_buffer.pos: |
|
1496 | 1545 | data = ffi.buffer(out_buffer.dst, out_buffer.pos)[:] |
@@ -1522,7 +1571,7 def frame_content_size(data): | |||
|
1522 | 1571 | size = lib.ZSTD_getFrameContentSize(data_buffer, len(data_buffer)) |
|
1523 | 1572 | |
|
1524 | 1573 | if size == lib.ZSTD_CONTENTSIZE_ERROR: |
|
1525 |
raise ZstdError( |
|
|
1574 | raise ZstdError("error when determining content size") | |
|
1526 | 1575 | elif size == lib.ZSTD_CONTENTSIZE_UNKNOWN: |
|
1527 | 1576 | return -1 |
|
1528 | 1577 | else: |
@@ -1534,24 +1583,23 def frame_header_size(data): | |||
|
1534 | 1583 | |
|
1535 | 1584 | zresult = lib.ZSTD_frameHeaderSize(data_buffer, len(data_buffer)) |
|
1536 | 1585 | if lib.ZSTD_isError(zresult): |
|
1537 | raise ZstdError('could not determine frame header size: %s' % | |
|
1538 |
|
|
|
1586 | raise ZstdError( | |
|
1587 | "could not determine frame header size: %s" % _zstd_error(zresult) | |
|
1588 | ) | |
|
1539 | 1589 | |
|
1540 | 1590 | return zresult |
|
1541 | 1591 | |
|
1542 | 1592 | |
|
1543 | 1593 | def get_frame_parameters(data): |
|
1544 |
params = ffi.new( |
|
|
1594 | params = ffi.new("ZSTD_frameHeader *") | |
|
1545 | 1595 | |
|
1546 | 1596 | data_buffer = ffi.from_buffer(data) |
|
1547 | 1597 | zresult = lib.ZSTD_getFrameHeader(params, data_buffer, len(data_buffer)) |
|
1548 | 1598 | if lib.ZSTD_isError(zresult): |
|
1549 |
raise ZstdError( |
|
|
1550 | _zstd_error(zresult)) | |
|
1599 | raise ZstdError("cannot get frame parameters: %s" % _zstd_error(zresult)) | |
|
1551 | 1600 | |
|
1552 | 1601 | if zresult: |
|
1553 |
raise ZstdError( |
|
|
1554 | zresult) | |
|
1602 | raise ZstdError("not enough data for frame parameters; need %d bytes" % zresult) | |
|
1555 | 1603 | |
|
1556 | 1604 | return FrameParameters(params[0]) |
|
1557 | 1605 | |
@@ -1563,10 +1611,10 class ZstdCompressionDict(object): | |||
|
1563 | 1611 | self.k = k |
|
1564 | 1612 | self.d = d |
|
1565 | 1613 | |
|
1566 | if dict_type not in (DICT_TYPE_AUTO, DICT_TYPE_RAWCONTENT, | |
|
1567 | DICT_TYPE_FULLDICT): | |
|
1568 |
|
|
|
1569 | 'DICT_TYPE_* constants') | |
|
1614 | if dict_type not in (DICT_TYPE_AUTO, DICT_TYPE_RAWCONTENT, DICT_TYPE_FULLDICT): | |
|
1615 | raise ValueError( | |
|
1616 | "invalid dictionary load mode: %d; must use " "DICT_TYPE_* constants" | |
|
1617 | ) | |
|
1570 | 1618 | |
|
1571 | 1619 | self._dict_type = dict_type |
|
1572 | 1620 | self._cdict = None |
@@ -1582,16 +1630,15 class ZstdCompressionDict(object): | |||
|
1582 | 1630 | |
|
1583 | 1631 | def precompute_compress(self, level=0, compression_params=None): |
|
1584 | 1632 | if level and compression_params: |
|
1585 |
raise ValueError( |
|
|
1586 | 'compression_params') | |
|
1633 | raise ValueError("must only specify one of level or " "compression_params") | |
|
1587 | 1634 | |
|
1588 | 1635 | if not level and not compression_params: |
|
1589 |
raise ValueError( |
|
|
1636 | raise ValueError("must specify one of level or compression_params") | |
|
1590 | 1637 | |
|
1591 | 1638 | if level: |
|
1592 | 1639 | cparams = lib.ZSTD_getCParams(level, 0, len(self._data)) |
|
1593 | 1640 | else: |
|
1594 |
cparams = ffi.new( |
|
|
1641 | cparams = ffi.new("ZSTD_compressionParameters") | |
|
1595 | 1642 | cparams.chainLog = compression_params.chain_log |
|
1596 | 1643 | cparams.hashLog = compression_params.hash_log |
|
1597 | 1644 | cparams.minMatch = compression_params.min_match |
@@ -1600,59 +1647,75 class ZstdCompressionDict(object): | |||
|
1600 | 1647 | cparams.targetLength = compression_params.target_length |
|
1601 | 1648 | cparams.windowLog = compression_params.window_log |
|
1602 | 1649 | |
|
1603 |
cdict = lib.ZSTD_createCDict_advanced( |
|
|
1604 | lib.ZSTD_dlm_byRef, | |
|
1605 | self._dict_type, | |
|
1606 | cparams, | |
|
1607 | lib.ZSTD_defaultCMem) | |
|
1650 | cdict = lib.ZSTD_createCDict_advanced( | |
|
1651 | self._data, | |
|
1652 | len(self._data), | |
|
1653 | lib.ZSTD_dlm_byRef, | |
|
1654 | self._dict_type, | |
|
1655 | cparams, | |
|
1656 | lib.ZSTD_defaultCMem, | |
|
1657 | ) | |
|
1608 | 1658 | if cdict == ffi.NULL: |
|
1609 |
raise ZstdError( |
|
|
1610 | ||
|
1611 |
self._cdict = ffi.gc( |
|
|
1612 |
|
|
|
1659 | raise ZstdError("unable to precompute dictionary") | |
|
1660 | ||
|
1661 | self._cdict = ffi.gc( | |
|
1662 | cdict, lib.ZSTD_freeCDict, size=lib.ZSTD_sizeof_CDict(cdict) | |
|
1663 | ) | |
|
1613 | 1664 | |
|
1614 | 1665 | @property |
|
1615 | 1666 | def _ddict(self): |
|
1616 |
ddict = lib.ZSTD_createDDict_advanced( |
|
|
1617 | lib.ZSTD_dlm_byRef, | |
|
1618 | self._dict_type, | |
|
1619 | lib.ZSTD_defaultCMem) | |
|
1667 | ddict = lib.ZSTD_createDDict_advanced( | |
|
1668 | self._data, | |
|
1669 | len(self._data), | |
|
1670 | lib.ZSTD_dlm_byRef, | |
|
1671 | self._dict_type, | |
|
1672 | lib.ZSTD_defaultCMem, | |
|
1673 | ) | |
|
1620 | 1674 | |
|
1621 | 1675 | if ddict == ffi.NULL: |
|
1622 |
raise ZstdError( |
|
|
1623 | ||
|
1624 | ddict = ffi.gc(ddict, lib.ZSTD_freeDDict, | |
|
1625 | size=lib.ZSTD_sizeof_DDict(ddict)) | |
|
1626 | self.__dict__['_ddict'] = ddict | |
|
1676 | raise ZstdError("could not create decompression dict") | |
|
1677 | ||
|
1678 | ddict = ffi.gc(ddict, lib.ZSTD_freeDDict, size=lib.ZSTD_sizeof_DDict(ddict)) | |
|
1679 | self.__dict__["_ddict"] = ddict | |
|
1627 | 1680 | |
|
1628 | 1681 | return ddict |
|
1629 | 1682 | |
|
1630 | def train_dictionary(dict_size, samples, k=0, d=0, notifications=0, dict_id=0, | |
|
1631 | level=0, steps=0, threads=0): | |
|
1683 | ||
|
1684 | def train_dictionary( | |
|
1685 | dict_size, | |
|
1686 | samples, | |
|
1687 | k=0, | |
|
1688 | d=0, | |
|
1689 | notifications=0, | |
|
1690 | dict_id=0, | |
|
1691 | level=0, | |
|
1692 | steps=0, | |
|
1693 | threads=0, | |
|
1694 | ): | |
|
1632 | 1695 | if not isinstance(samples, list): |
|
1633 |
raise TypeError( |
|
|
1696 | raise TypeError("samples must be a list") | |
|
1634 | 1697 | |
|
1635 | 1698 | if threads < 0: |
|
1636 | 1699 | threads = _cpu_count() |
|
1637 | 1700 | |
|
1638 | 1701 | total_size = sum(map(len, samples)) |
|
1639 | 1702 | |
|
1640 |
samples_buffer = new_nonzero( |
|
|
1641 |
sample_sizes = new_nonzero( |
|
|
1703 | samples_buffer = new_nonzero("char[]", total_size) | |
|
1704 | sample_sizes = new_nonzero("size_t[]", len(samples)) | |
|
1642 | 1705 | |
|
1643 | 1706 | offset = 0 |
|
1644 | 1707 | for i, sample in enumerate(samples): |
|
1645 | 1708 | if not isinstance(sample, bytes_type): |
|
1646 |
raise ValueError( |
|
|
1709 | raise ValueError("samples must be bytes") | |
|
1647 | 1710 | |
|
1648 | 1711 | l = len(sample) |
|
1649 | 1712 | ffi.memmove(samples_buffer + offset, sample, l) |
|
1650 | 1713 | offset += l |
|
1651 | 1714 | sample_sizes[i] = l |
|
1652 | 1715 | |
|
1653 |
dict_data = new_nonzero( |
|
|
1654 | ||
|
1655 |
dparams = ffi.new( |
|
|
1716 | dict_data = new_nonzero("char[]", dict_size) | |
|
1717 | ||
|
1718 | dparams = ffi.new("ZDICT_cover_params_t *")[0] | |
|
1656 | 1719 | dparams.k = k |
|
1657 | 1720 | dparams.d = d |
|
1658 | 1721 | dparams.steps = steps |
@@ -1661,34 +1724,51 def train_dictionary(dict_size, samples, | |||
|
1661 | 1724 | dparams.zParams.dictID = dict_id |
|
1662 | 1725 | dparams.zParams.compressionLevel = level |
|
1663 | 1726 | |
|
1664 | if (not dparams.k and not dparams.d and not dparams.steps | |
|
1665 | and not dparams.nbThreads and not dparams.zParams.notificationLevel | |
|
1727 | if ( | |
|
1728 | not dparams.k | |
|
1729 | and not dparams.d | |
|
1730 | and not dparams.steps | |
|
1731 | and not dparams.nbThreads | |
|
1732 | and not dparams.zParams.notificationLevel | |
|
1666 | 1733 | and not dparams.zParams.dictID |
|
1667 |
and not dparams.zParams.compressionLevel |
|
|
1734 | and not dparams.zParams.compressionLevel | |
|
1735 | ): | |
|
1668 | 1736 | zresult = lib.ZDICT_trainFromBuffer( |
|
1669 |
ffi.addressof(dict_data), |
|
|
1737 | ffi.addressof(dict_data), | |
|
1738 | dict_size, | |
|
1670 | 1739 | ffi.addressof(samples_buffer), |
|
1671 |
ffi.addressof(sample_sizes, 0), |
|
|
1740 | ffi.addressof(sample_sizes, 0), | |
|
1741 | len(samples), | |
|
1742 | ) | |
|
1672 | 1743 | elif dparams.steps or dparams.nbThreads: |
|
1673 | 1744 | zresult = lib.ZDICT_optimizeTrainFromBuffer_cover( |
|
1674 |
ffi.addressof(dict_data), |
|
|
1745 | ffi.addressof(dict_data), | |
|
1746 | dict_size, | |
|
1675 | 1747 | ffi.addressof(samples_buffer), |
|
1676 |
ffi.addressof(sample_sizes, 0), |
|
|
1677 | ffi.addressof(dparams)) | |
|
1748 | ffi.addressof(sample_sizes, 0), | |
|
1749 | len(samples), | |
|
1750 | ffi.addressof(dparams), | |
|
1751 | ) | |
|
1678 | 1752 | else: |
|
1679 | 1753 | zresult = lib.ZDICT_trainFromBuffer_cover( |
|
1680 |
ffi.addressof(dict_data), |
|
|
1754 | ffi.addressof(dict_data), | |
|
1755 | dict_size, | |
|
1681 | 1756 | ffi.addressof(samples_buffer), |
|
1682 |
ffi.addressof(sample_sizes, 0), |
|
|
1683 |
|
|
|
1757 | ffi.addressof(sample_sizes, 0), | |
|
1758 | len(samples), | |
|
1759 | dparams, | |
|
1760 | ) | |
|
1684 | 1761 | |
|
1685 | 1762 | if lib.ZDICT_isError(zresult): |
|
1686 |
msg = ffi.string(lib.ZDICT_getErrorName(zresult)).decode( |
|
|
1687 |
raise ZstdError( |
|
|
1688 | ||
|
1689 |
return ZstdCompressionDict( |
|
|
1690 | dict_type=DICT_TYPE_FULLDICT, | |
|
1691 | k=dparams.k, d=dparams.d) | |
|
1763 | msg = ffi.string(lib.ZDICT_getErrorName(zresult)).decode("utf-8") | |
|
1764 | raise ZstdError("cannot train dict: %s" % msg) | |
|
1765 | ||
|
1766 | return ZstdCompressionDict( | |
|
1767 | ffi.buffer(dict_data, zresult)[:], | |
|
1768 | dict_type=DICT_TYPE_FULLDICT, | |
|
1769 | k=dparams.k, | |
|
1770 | d=dparams.d, | |
|
1771 | ) | |
|
1692 | 1772 | |
|
1693 | 1773 | |
|
1694 | 1774 | class ZstdDecompressionObj(object): |
@@ -1699,21 +1779,21 class ZstdDecompressionObj(object): | |||
|
1699 | 1779 | |
|
1700 | 1780 | def decompress(self, data): |
|
1701 | 1781 | if self._finished: |
|
1702 |
raise ZstdError( |
|
|
1703 | ||
|
1704 |
in_buffer = ffi.new( |
|
|
1705 |
out_buffer = ffi.new( |
|
|
1782 | raise ZstdError("cannot use a decompressobj multiple times") | |
|
1783 | ||
|
1784 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
1785 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1706 | 1786 | |
|
1707 | 1787 | data_buffer = ffi.from_buffer(data) |
|
1708 | 1788 | |
|
1709 | 1789 | if len(data_buffer) == 0: |
|
1710 |
return b |
|
|
1790 | return b"" | |
|
1711 | 1791 | |
|
1712 | 1792 | in_buffer.src = data_buffer |
|
1713 | 1793 | in_buffer.size = len(data_buffer) |
|
1714 | 1794 | in_buffer.pos = 0 |
|
1715 | 1795 | |
|
1716 |
dst_buffer = ffi.new( |
|
|
1796 | dst_buffer = ffi.new("char[]", self._write_size) | |
|
1717 | 1797 | out_buffer.dst = dst_buffer |
|
1718 | 1798 | out_buffer.size = len(dst_buffer) |
|
1719 | 1799 | out_buffer.pos = 0 |
@@ -1721,11 +1801,11 class ZstdDecompressionObj(object): | |||
|
1721 | 1801 | chunks = [] |
|
1722 | 1802 | |
|
1723 | 1803 | while True: |
|
1724 |
zresult = lib.ZSTD_decompressStream( |
|
|
1725 | out_buffer, in_buffer) | |
|
1804 | zresult = lib.ZSTD_decompressStream( | |
|
1805 | self._decompressor._dctx, out_buffer, in_buffer | |
|
1806 | ) | |
|
1726 | 1807 | if lib.ZSTD_isError(zresult): |
|
1727 |
raise ZstdError( |
|
|
1728 | _zstd_error(zresult)) | |
|
1808 | raise ZstdError("zstd decompressor error: %s" % _zstd_error(zresult)) | |
|
1729 | 1809 | |
|
1730 | 1810 | if zresult == 0: |
|
1731 | 1811 | self._finished = True |
@@ -1734,13 +1814,14 class ZstdDecompressionObj(object): | |||
|
1734 | 1814 | if out_buffer.pos: |
|
1735 | 1815 | chunks.append(ffi.buffer(out_buffer.dst, out_buffer.pos)[:]) |
|
1736 | 1816 | |
|
1737 |
if |
|
|
1738 |
|
|
|
1817 | if zresult == 0 or ( | |
|
1818 | in_buffer.pos == in_buffer.size and out_buffer.pos == 0 | |
|
1819 | ): | |
|
1739 | 1820 | break |
|
1740 | 1821 | |
|
1741 | 1822 | out_buffer.pos = 0 |
|
1742 | 1823 | |
|
1743 |
return b |
|
|
1824 | return b"".join(chunks) | |
|
1744 | 1825 | |
|
1745 | 1826 | def flush(self, length=0): |
|
1746 | 1827 | pass |
@@ -1757,13 +1838,13 class ZstdDecompressionReader(object): | |||
|
1757 | 1838 | self._bytes_decompressed = 0 |
|
1758 | 1839 | self._finished_input = False |
|
1759 | 1840 | self._finished_output = False |
|
1760 |
self._in_buffer = ffi.new( |
|
|
1841 | self._in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
1761 | 1842 | # Holds a ref to self._in_buffer.src. |
|
1762 | 1843 | self._source_buffer = None |
|
1763 | 1844 | |
|
1764 | 1845 | def __enter__(self): |
|
1765 | 1846 | if self._entered: |
|
1766 |
raise ValueError( |
|
|
1847 | raise ValueError("cannot __enter__ multiple times") | |
|
1767 | 1848 | |
|
1768 | 1849 | self._entered = True |
|
1769 | 1850 | return self |
@@ -1824,7 +1905,7 class ZstdDecompressionReader(object): | |||
|
1824 | 1905 | |
|
1825 | 1906 | chunks.append(chunk) |
|
1826 | 1907 | |
|
1827 |
return b |
|
|
1908 | return b"".join(chunks) | |
|
1828 | 1909 | |
|
1829 | 1910 | def __iter__(self): |
|
1830 | 1911 | raise io.UnsupportedOperation() |
@@ -1844,7 +1925,7 class ZstdDecompressionReader(object): | |||
|
1844 | 1925 | return |
|
1845 | 1926 | |
|
1846 | 1927 | # Else populate the input buffer from our source. |
|
1847 |
if hasattr(self._source, |
|
|
1928 | if hasattr(self._source, "read"): | |
|
1848 | 1929 | data = self._source.read(self._read_size) |
|
1849 | 1930 | |
|
1850 | 1931 | if not data: |
@@ -1866,8 +1947,9 class ZstdDecompressionReader(object): | |||
|
1866 | 1947 | |
|
1867 | 1948 | Returns True if data in output buffer should be emitted. |
|
1868 | 1949 | """ |
|
1869 |
zresult = lib.ZSTD_decompressStream( |
|
|
1870 | out_buffer, self._in_buffer) | |
|
1950 | zresult = lib.ZSTD_decompressStream( | |
|
1951 | self._decompressor._dctx, out_buffer, self._in_buffer | |
|
1952 | ) | |
|
1871 | 1953 | |
|
1872 | 1954 | if self._in_buffer.pos == self._in_buffer.size: |
|
1873 | 1955 | self._in_buffer.src = ffi.NULL |
@@ -1875,38 +1957,39 class ZstdDecompressionReader(object): | |||
|
1875 | 1957 | self._in_buffer.size = 0 |
|
1876 | 1958 | self._source_buffer = None |
|
1877 | 1959 | |
|
1878 |
if not hasattr(self._source, |
|
|
1960 | if not hasattr(self._source, "read"): | |
|
1879 | 1961 | self._finished_input = True |
|
1880 | 1962 | |
|
1881 | 1963 | if lib.ZSTD_isError(zresult): |
|
1882 |
raise ZstdError( |
|
|
1883 | _zstd_error(zresult)) | |
|
1964 | raise ZstdError("zstd decompress error: %s" % _zstd_error(zresult)) | |
|
1884 | 1965 | |
|
1885 | 1966 | # Emit data if there is data AND either: |
|
1886 | 1967 | # a) output buffer is full (read amount is satisfied) |
|
1887 | 1968 | # b) we're at end of a frame and not in frame spanning mode |
|
1888 |
return |
|
|
1889 |
|
|
|
1890 | zresult == 0 and not self._read_across_frames)) | |
|
1969 | return out_buffer.pos and ( | |
|
1970 | out_buffer.pos == out_buffer.size | |
|
1971 | or zresult == 0 | |
|
1972 | and not self._read_across_frames | |
|
1973 | ) | |
|
1891 | 1974 | |
|
1892 | 1975 | def read(self, size=-1): |
|
1893 | 1976 | if self._closed: |
|
1894 |
raise ValueError( |
|
|
1977 | raise ValueError("stream is closed") | |
|
1895 | 1978 | |
|
1896 | 1979 | if size < -1: |
|
1897 |
raise ValueError( |
|
|
1980 | raise ValueError("cannot read negative amounts less than -1") | |
|
1898 | 1981 | |
|
1899 | 1982 | if size == -1: |
|
1900 | 1983 | # This is recursive. But it gets the job done. |
|
1901 | 1984 | return self.readall() |
|
1902 | 1985 | |
|
1903 | 1986 | if self._finished_output or size == 0: |
|
1904 |
return b |
|
|
1987 | return b"" | |
|
1905 | 1988 | |
|
1906 | 1989 | # We /could/ call into readinto() here. But that introduces more |
|
1907 | 1990 | # overhead. |
|
1908 |
dst_buffer = ffi.new( |
|
|
1909 |
out_buffer = ffi.new( |
|
|
1991 | dst_buffer = ffi.new("char[]", size) | |
|
1992 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1910 | 1993 | out_buffer.dst = dst_buffer |
|
1911 | 1994 | out_buffer.size = size |
|
1912 | 1995 | out_buffer.pos = 0 |
@@ -1927,15 +2010,15 class ZstdDecompressionReader(object): | |||
|
1927 | 2010 | |
|
1928 | 2011 | def readinto(self, b): |
|
1929 | 2012 | if self._closed: |
|
1930 |
raise ValueError( |
|
|
2013 | raise ValueError("stream is closed") | |
|
1931 | 2014 | |
|
1932 | 2015 | if self._finished_output: |
|
1933 | 2016 | return 0 |
|
1934 | 2017 | |
|
1935 | 2018 | # TODO use writable=True once we require CFFI >= 1.12. |
|
1936 | 2019 | dest_buffer = ffi.from_buffer(b) |
|
1937 |
ffi.memmove(b, b |
|
|
1938 |
out_buffer = ffi.new( |
|
|
2020 | ffi.memmove(b, b"", 0) | |
|
2021 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1939 | 2022 | out_buffer.dst = dest_buffer |
|
1940 | 2023 | out_buffer.size = len(dest_buffer) |
|
1941 | 2024 | out_buffer.pos = 0 |
@@ -1956,20 +2039,20 class ZstdDecompressionReader(object): | |||
|
1956 | 2039 | |
|
1957 | 2040 | def read1(self, size=-1): |
|
1958 | 2041 | if self._closed: |
|
1959 |
raise ValueError( |
|
|
2042 | raise ValueError("stream is closed") | |
|
1960 | 2043 | |
|
1961 | 2044 | if size < -1: |
|
1962 |
raise ValueError( |
|
|
2045 | raise ValueError("cannot read negative amounts less than -1") | |
|
1963 | 2046 | |
|
1964 | 2047 | if self._finished_output or size == 0: |
|
1965 |
return b |
|
|
2048 | return b"" | |
|
1966 | 2049 | |
|
1967 | 2050 | # -1 returns arbitrary number of bytes. |
|
1968 | 2051 | if size == -1: |
|
1969 | 2052 | size = DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE |
|
1970 | 2053 | |
|
1971 |
dst_buffer = ffi.new( |
|
|
1972 |
out_buffer = ffi.new( |
|
|
2054 | dst_buffer = ffi.new("char[]", size) | |
|
2055 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
1973 | 2056 | out_buffer.dst = dst_buffer |
|
1974 | 2057 | out_buffer.size = size |
|
1975 | 2058 | out_buffer.pos = 0 |
@@ -1990,16 +2073,16 class ZstdDecompressionReader(object): | |||
|
1990 | 2073 | |
|
1991 | 2074 | def readinto1(self, b): |
|
1992 | 2075 | if self._closed: |
|
1993 |
raise ValueError( |
|
|
2076 | raise ValueError("stream is closed") | |
|
1994 | 2077 | |
|
1995 | 2078 | if self._finished_output: |
|
1996 | 2079 | return 0 |
|
1997 | 2080 | |
|
1998 | 2081 | # TODO use writable=True once we require CFFI >= 1.12. |
|
1999 | 2082 | dest_buffer = ffi.from_buffer(b) |
|
2000 |
ffi.memmove(b, b |
|
|
2001 | ||
|
2002 |
out_buffer = ffi.new( |
|
|
2083 | ffi.memmove(b, b"", 0) | |
|
2084 | ||
|
2085 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
2003 | 2086 | out_buffer.dst = dest_buffer |
|
2004 | 2087 | out_buffer.size = len(dest_buffer) |
|
2005 | 2088 | out_buffer.pos = 0 |
@@ -2016,33 +2099,31 class ZstdDecompressionReader(object): | |||
|
2016 | 2099 | |
|
2017 | 2100 | def seek(self, pos, whence=os.SEEK_SET): |
|
2018 | 2101 | if self._closed: |
|
2019 |
raise ValueError( |
|
|
2102 | raise ValueError("stream is closed") | |
|
2020 | 2103 | |
|
2021 | 2104 | read_amount = 0 |
|
2022 | 2105 | |
|
2023 | 2106 | if whence == os.SEEK_SET: |
|
2024 | 2107 | if pos < 0: |
|
2025 |
raise ValueError( |
|
|
2108 | raise ValueError("cannot seek to negative position with SEEK_SET") | |
|
2026 | 2109 | |
|
2027 | 2110 | if pos < self._bytes_decompressed: |
|
2028 |
raise ValueError( |
|
|
2029 | 'backwards') | |
|
2111 | raise ValueError("cannot seek zstd decompression stream " "backwards") | |
|
2030 | 2112 | |
|
2031 | 2113 | read_amount = pos - self._bytes_decompressed |
|
2032 | 2114 | |
|
2033 | 2115 | elif whence == os.SEEK_CUR: |
|
2034 | 2116 | if pos < 0: |
|
2035 |
raise ValueError( |
|
|
2036 | 'backwards') | |
|
2117 | raise ValueError("cannot seek zstd decompression stream " "backwards") | |
|
2037 | 2118 | |
|
2038 | 2119 | read_amount = pos |
|
2039 | 2120 | elif whence == os.SEEK_END: |
|
2040 | raise ValueError('zstd decompression streams cannot be seeked ' | |
|
2041 | 'with SEEK_END') | |
|
2121 | raise ValueError( | |
|
2122 | "zstd decompression streams cannot be seeked " "with SEEK_END" | |
|
2123 | ) | |
|
2042 | 2124 | |
|
2043 | 2125 | while read_amount: |
|
2044 | result = self.read(min(read_amount, | |
|
2045 | DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
2126 | result = self.read(min(read_amount, DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE)) | |
|
2046 | 2127 | |
|
2047 | 2128 | if not result: |
|
2048 | 2129 | break |
@@ -2051,6 +2132,7 class ZstdDecompressionReader(object): | |||
|
2051 | 2132 | |
|
2052 | 2133 | return self._bytes_decompressed |
|
2053 | 2134 | |
|
2135 | ||
|
2054 | 2136 | class ZstdDecompressionWriter(object): |
|
2055 | 2137 | def __init__(self, decompressor, writer, write_size, write_return_read): |
|
2056 | 2138 | decompressor._ensure_dctx() |
@@ -2064,10 +2146,10 class ZstdDecompressionWriter(object): | |||
|
2064 | 2146 | |
|
2065 | 2147 | def __enter__(self): |
|
2066 | 2148 | if self._closed: |
|
2067 |
raise ValueError( |
|
|
2149 | raise ValueError("stream is closed") | |
|
2068 | 2150 | |
|
2069 | 2151 | if self._entered: |
|
2070 |
raise ZstdError( |
|
|
2152 | raise ZstdError("cannot __enter__ multiple times") | |
|
2071 | 2153 | |
|
2072 | 2154 | self._entered = True |
|
2073 | 2155 | |
@@ -2089,7 +2171,7 class ZstdDecompressionWriter(object): | |||
|
2089 | 2171 | finally: |
|
2090 | 2172 | self._closed = True |
|
2091 | 2173 | |
|
2092 |
f = getattr(self._writer, |
|
|
2174 | f = getattr(self._writer, "close", None) | |
|
2093 | 2175 | if f: |
|
2094 | 2176 | f() |
|
2095 | 2177 | |
@@ -2098,17 +2180,17 class ZstdDecompressionWriter(object): | |||
|
2098 | 2180 | return self._closed |
|
2099 | 2181 | |
|
2100 | 2182 | def fileno(self): |
|
2101 |
f = getattr(self._writer, |
|
|
2183 | f = getattr(self._writer, "fileno", None) | |
|
2102 | 2184 | if f: |
|
2103 | 2185 | return f() |
|
2104 | 2186 | else: |
|
2105 |
raise OSError( |
|
|
2187 | raise OSError("fileno not available on underlying writer") | |
|
2106 | 2188 | |
|
2107 | 2189 | def flush(self): |
|
2108 | 2190 | if self._closed: |
|
2109 |
raise ValueError( |
|
|
2110 | ||
|
2111 |
f = getattr(self._writer, |
|
|
2191 | raise ValueError("stream is closed") | |
|
2192 | ||
|
2193 | f = getattr(self._writer, "flush", None) | |
|
2112 | 2194 | if f: |
|
2113 | 2195 | return f() |
|
2114 | 2196 | |
@@ -2153,19 +2235,19 class ZstdDecompressionWriter(object): | |||
|
2153 | 2235 | |
|
2154 | 2236 | def write(self, data): |
|
2155 | 2237 | if self._closed: |
|
2156 |
raise ValueError( |
|
|
2238 | raise ValueError("stream is closed") | |
|
2157 | 2239 | |
|
2158 | 2240 | total_write = 0 |
|
2159 | 2241 | |
|
2160 |
in_buffer = ffi.new( |
|
|
2161 |
out_buffer = ffi.new( |
|
|
2242 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
2243 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
2162 | 2244 | |
|
2163 | 2245 | data_buffer = ffi.from_buffer(data) |
|
2164 | 2246 | in_buffer.src = data_buffer |
|
2165 | 2247 | in_buffer.size = len(data_buffer) |
|
2166 | 2248 | in_buffer.pos = 0 |
|
2167 | 2249 | |
|
2168 |
dst_buffer = ffi.new( |
|
|
2250 | dst_buffer = ffi.new("char[]", self._write_size) | |
|
2169 | 2251 | out_buffer.dst = dst_buffer |
|
2170 | 2252 | out_buffer.size = len(dst_buffer) |
|
2171 | 2253 | out_buffer.pos = 0 |
@@ -2175,8 +2257,7 class ZstdDecompressionWriter(object): | |||
|
2175 | 2257 | while in_buffer.pos < in_buffer.size: |
|
2176 | 2258 | zresult = lib.ZSTD_decompressStream(dctx, out_buffer, in_buffer) |
|
2177 | 2259 | if lib.ZSTD_isError(zresult): |
|
2178 |
raise ZstdError( |
|
|
2179 | _zstd_error(zresult)) | |
|
2260 | raise ZstdError("zstd decompress error: %s" % _zstd_error(zresult)) | |
|
2180 | 2261 | |
|
2181 | 2262 | if out_buffer.pos: |
|
2182 | 2263 | self._writer.write(ffi.buffer(out_buffer.dst, out_buffer.pos)[:]) |
@@ -2206,8 +2287,9 class ZstdDecompressor(object): | |||
|
2206 | 2287 | try: |
|
2207 | 2288 | self._ensure_dctx() |
|
2208 | 2289 | finally: |
|
2209 |
self._dctx = ffi.gc( |
|
|
2210 |
|
|
|
2290 | self._dctx = ffi.gc( | |
|
2291 | dctx, lib.ZSTD_freeDCtx, size=lib.ZSTD_sizeof_DCtx(dctx) | |
|
2292 | ) | |
|
2211 | 2293 | |
|
2212 | 2294 | def memory_size(self): |
|
2213 | 2295 | return lib.ZSTD_sizeof_DCtx(self._dctx) |
@@ -2220,85 +2302,96 class ZstdDecompressor(object): | |||
|
2220 | 2302 | output_size = lib.ZSTD_getFrameContentSize(data_buffer, len(data_buffer)) |
|
2221 | 2303 | |
|
2222 | 2304 | if output_size == lib.ZSTD_CONTENTSIZE_ERROR: |
|
2223 |
raise ZstdError( |
|
|
2305 | raise ZstdError("error determining content size from frame header") | |
|
2224 | 2306 | elif output_size == 0: |
|
2225 |
return b |
|
|
2307 | return b"" | |
|
2226 | 2308 | elif output_size == lib.ZSTD_CONTENTSIZE_UNKNOWN: |
|
2227 | 2309 | if not max_output_size: |
|
2228 |
raise ZstdError( |
|
|
2229 | ||
|
2230 |
result_buffer = ffi.new( |
|
|
2310 | raise ZstdError("could not determine content size in frame header") | |
|
2311 | ||
|
2312 | result_buffer = ffi.new("char[]", max_output_size) | |
|
2231 | 2313 | result_size = max_output_size |
|
2232 | 2314 | output_size = 0 |
|
2233 | 2315 | else: |
|
2234 |
result_buffer = ffi.new( |
|
|
2316 | result_buffer = ffi.new("char[]", output_size) | |
|
2235 | 2317 | result_size = output_size |
|
2236 | 2318 | |
|
2237 |
out_buffer = ffi.new( |
|
|
2319 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
2238 | 2320 | out_buffer.dst = result_buffer |
|
2239 | 2321 | out_buffer.size = result_size |
|
2240 | 2322 | out_buffer.pos = 0 |
|
2241 | 2323 | |
|
2242 |
in_buffer = ffi.new( |
|
|
2324 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
2243 | 2325 | in_buffer.src = data_buffer |
|
2244 | 2326 | in_buffer.size = len(data_buffer) |
|
2245 | 2327 | in_buffer.pos = 0 |
|
2246 | 2328 | |
|
2247 | 2329 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) |
|
2248 | 2330 | if lib.ZSTD_isError(zresult): |
|
2249 |
raise ZstdError( |
|
|
2250 | _zstd_error(zresult)) | |
|
2331 | raise ZstdError("decompression error: %s" % _zstd_error(zresult)) | |
|
2251 | 2332 | elif zresult: |
|
2252 |
raise ZstdError( |
|
|
2333 | raise ZstdError("decompression error: did not decompress full frame") | |
|
2253 | 2334 | elif output_size and out_buffer.pos != output_size: |
|
2254 | raise ZstdError('decompression error: decompressed %d bytes; expected %d' % | |
|
2255 | (zresult, output_size)) | |
|
2335 | raise ZstdError( | |
|
2336 | "decompression error: decompressed %d bytes; expected %d" | |
|
2337 | % (zresult, output_size) | |
|
2338 | ) | |
|
2256 | 2339 | |
|
2257 | 2340 | return ffi.buffer(result_buffer, out_buffer.pos)[:] |
|
2258 | 2341 | |
|
2259 | def stream_reader(self, source, read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2260 | read_across_frames=False): | |
|
2342 | def stream_reader( | |
|
2343 | self, | |
|
2344 | source, | |
|
2345 | read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2346 | read_across_frames=False, | |
|
2347 | ): | |
|
2261 | 2348 | self._ensure_dctx() |
|
2262 | 2349 | return ZstdDecompressionReader(self, source, read_size, read_across_frames) |
|
2263 | 2350 | |
|
2264 | 2351 | def decompressobj(self, write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE): |
|
2265 | 2352 | if write_size < 1: |
|
2266 |
raise ValueError( |
|
|
2353 | raise ValueError("write_size must be positive") | |
|
2267 | 2354 | |
|
2268 | 2355 | self._ensure_dctx() |
|
2269 | 2356 | return ZstdDecompressionObj(self, write_size=write_size) |
|
2270 | 2357 | |
|
2271 | def read_to_iter(self, reader, read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2272 | write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
2273 | skip_bytes=0): | |
|
2358 | def read_to_iter( | |
|
2359 | self, | |
|
2360 | reader, | |
|
2361 | read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2362 | write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
2363 | skip_bytes=0, | |
|
2364 | ): | |
|
2274 | 2365 | if skip_bytes >= read_size: |
|
2275 |
raise ValueError( |
|
|
2276 | ||
|
2277 |
if hasattr(reader, |
|
|
2366 | raise ValueError("skip_bytes must be smaller than read_size") | |
|
2367 | ||
|
2368 | if hasattr(reader, "read"): | |
|
2278 | 2369 | have_read = True |
|
2279 |
elif hasattr(reader, |
|
|
2370 | elif hasattr(reader, "__getitem__"): | |
|
2280 | 2371 | have_read = False |
|
2281 | 2372 | buffer_offset = 0 |
|
2282 | 2373 | size = len(reader) |
|
2283 | 2374 | else: |
|
2284 | raise ValueError('must pass an object with a read() method or ' | |
|
2285 | 'conforms to buffer protocol') | |
|
2375 | raise ValueError( | |
|
2376 | "must pass an object with a read() method or " | |
|
2377 | "conforms to buffer protocol" | |
|
2378 | ) | |
|
2286 | 2379 | |
|
2287 | 2380 | if skip_bytes: |
|
2288 | 2381 | if have_read: |
|
2289 | 2382 | reader.read(skip_bytes) |
|
2290 | 2383 | else: |
|
2291 | 2384 | if skip_bytes > size: |
|
2292 |
raise ValueError( |
|
|
2385 | raise ValueError("skip_bytes larger than first input chunk") | |
|
2293 | 2386 | |
|
2294 | 2387 | buffer_offset = skip_bytes |
|
2295 | 2388 | |
|
2296 | 2389 | self._ensure_dctx() |
|
2297 | 2390 | |
|
2298 |
in_buffer = ffi.new( |
|
|
2299 |
out_buffer = ffi.new( |
|
|
2300 | ||
|
2301 |
dst_buffer = ffi.new( |
|
|
2391 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
2392 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
2393 | ||
|
2394 | dst_buffer = ffi.new("char[]", write_size) | |
|
2302 | 2395 | out_buffer.dst = dst_buffer |
|
2303 | 2396 | out_buffer.size = len(dst_buffer) |
|
2304 | 2397 | out_buffer.pos = 0 |
@@ -2311,7 +2404,7 class ZstdDecompressor(object): | |||
|
2311 | 2404 | else: |
|
2312 | 2405 | remaining = size - buffer_offset |
|
2313 | 2406 | slice_size = min(remaining, read_size) |
|
2314 | read_result = reader[buffer_offset:buffer_offset + slice_size] | |
|
2407 | read_result = reader[buffer_offset : buffer_offset + slice_size] | |
|
2315 | 2408 | buffer_offset += slice_size |
|
2316 | 2409 | |
|
2317 | 2410 | # No new input. Break out of read loop. |
@@ -2330,8 +2423,7 class ZstdDecompressor(object): | |||
|
2330 | 2423 | |
|
2331 | 2424 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) |
|
2332 | 2425 | if lib.ZSTD_isError(zresult): |
|
2333 |
raise ZstdError( |
|
|
2334 | _zstd_error(zresult)) | |
|
2426 | raise ZstdError("zstd decompress error: %s" % _zstd_error(zresult)) | |
|
2335 | 2427 | |
|
2336 | 2428 | if out_buffer.pos: |
|
2337 | 2429 | data = ffi.buffer(out_buffer.dst, out_buffer.pos)[:] |
@@ -2348,30 +2440,37 class ZstdDecompressor(object): | |||
|
2348 | 2440 | |
|
2349 | 2441 | read_from = read_to_iter |
|
2350 | 2442 | |
|
2351 | def stream_writer(self, writer, write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
2352 | write_return_read=False): | |
|
2353 | if not hasattr(writer, 'write'): | |
|
2354 | raise ValueError('must pass an object with a write() method') | |
|
2355 | ||
|
2356 | return ZstdDecompressionWriter(self, writer, write_size, | |
|
2357 | write_return_read) | |
|
2443 | def stream_writer( | |
|
2444 | self, | |
|
2445 | writer, | |
|
2446 | write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
2447 | write_return_read=False, | |
|
2448 | ): | |
|
2449 | if not hasattr(writer, "write"): | |
|
2450 | raise ValueError("must pass an object with a write() method") | |
|
2451 | ||
|
2452 | return ZstdDecompressionWriter(self, writer, write_size, write_return_read) | |
|
2358 | 2453 | |
|
2359 | 2454 | write_to = stream_writer |
|
2360 | 2455 | |
|
2361 |
def copy_stream( |
|
|
2362 | read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2363 | write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE): | |
|
2364 | if not hasattr(ifh, 'read'): | |
|
2365 | raise ValueError('first argument must have a read() method') | |
|
2366 | if not hasattr(ofh, 'write'): | |
|
2367 | raise ValueError('second argument must have a write() method') | |
|
2456 | def copy_stream( | |
|
2457 | self, | |
|
2458 | ifh, | |
|
2459 | ofh, | |
|
2460 | read_size=DECOMPRESSION_RECOMMENDED_INPUT_SIZE, | |
|
2461 | write_size=DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE, | |
|
2462 | ): | |
|
2463 | if not hasattr(ifh, "read"): | |
|
2464 | raise ValueError("first argument must have a read() method") | |
|
2465 | if not hasattr(ofh, "write"): | |
|
2466 | raise ValueError("second argument must have a write() method") | |
|
2368 | 2467 | |
|
2369 | 2468 | self._ensure_dctx() |
|
2370 | 2469 | |
|
2371 |
in_buffer = ffi.new( |
|
|
2372 |
out_buffer = ffi.new( |
|
|
2373 | ||
|
2374 |
dst_buffer = ffi.new( |
|
|
2470 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
2471 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
2472 | ||
|
2473 | dst_buffer = ffi.new("char[]", write_size) | |
|
2375 | 2474 | out_buffer.dst = dst_buffer |
|
2376 | 2475 | out_buffer.size = write_size |
|
2377 | 2476 | out_buffer.pos = 0 |
@@ -2394,8 +2493,9 class ZstdDecompressor(object): | |||
|
2394 | 2493 | while in_buffer.pos < in_buffer.size: |
|
2395 | 2494 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) |
|
2396 | 2495 | if lib.ZSTD_isError(zresult): |
|
2397 |
raise ZstdError( |
|
|
2398 |
|
|
|
2496 | raise ZstdError( | |
|
2497 | "zstd decompressor error: %s" % _zstd_error(zresult) | |
|
2498 | ) | |
|
2399 | 2499 | |
|
2400 | 2500 | if out_buffer.pos: |
|
2401 | 2501 | ofh.write(ffi.buffer(out_buffer.dst, out_buffer.pos)) |
@@ -2408,48 +2508,47 class ZstdDecompressor(object): | |||
|
2408 | 2508 | |
|
2409 | 2509 | def decompress_content_dict_chain(self, frames): |
|
2410 | 2510 | if not isinstance(frames, list): |
|
2411 |
raise TypeError( |
|
|
2511 | raise TypeError("argument must be a list") | |
|
2412 | 2512 | |
|
2413 | 2513 | if not frames: |
|
2414 |
raise ValueError( |
|
|
2514 | raise ValueError("empty input chain") | |
|
2415 | 2515 | |
|
2416 | 2516 | # First chunk should not be using a dictionary. We handle it specially. |
|
2417 | 2517 | chunk = frames[0] |
|
2418 | 2518 | if not isinstance(chunk, bytes_type): |
|
2419 |
raise ValueError( |
|
|
2519 | raise ValueError("chunk 0 must be bytes") | |
|
2420 | 2520 | |
|
2421 | 2521 | # All chunks should be zstd frames and should have content size set. |
|
2422 | 2522 | chunk_buffer = ffi.from_buffer(chunk) |
|
2423 |
params = ffi.new( |
|
|
2523 | params = ffi.new("ZSTD_frameHeader *") | |
|
2424 | 2524 | zresult = lib.ZSTD_getFrameHeader(params, chunk_buffer, len(chunk_buffer)) |
|
2425 | 2525 | if lib.ZSTD_isError(zresult): |
|
2426 |
raise ValueError( |
|
|
2526 | raise ValueError("chunk 0 is not a valid zstd frame") | |
|
2427 | 2527 | elif zresult: |
|
2428 |
raise ValueError( |
|
|
2528 | raise ValueError("chunk 0 is too small to contain a zstd frame") | |
|
2429 | 2529 | |
|
2430 | 2530 | if params.frameContentSize == lib.ZSTD_CONTENTSIZE_UNKNOWN: |
|
2431 |
raise ValueError( |
|
|
2531 | raise ValueError("chunk 0 missing content size in frame") | |
|
2432 | 2532 | |
|
2433 | 2533 | self._ensure_dctx(load_dict=False) |
|
2434 | 2534 | |
|
2435 |
last_buffer = ffi.new( |
|
|
2436 | ||
|
2437 |
out_buffer = ffi.new( |
|
|
2535 | last_buffer = ffi.new("char[]", params.frameContentSize) | |
|
2536 | ||
|
2537 | out_buffer = ffi.new("ZSTD_outBuffer *") | |
|
2438 | 2538 | out_buffer.dst = last_buffer |
|
2439 | 2539 | out_buffer.size = len(last_buffer) |
|
2440 | 2540 | out_buffer.pos = 0 |
|
2441 | 2541 | |
|
2442 |
in_buffer = ffi.new( |
|
|
2542 | in_buffer = ffi.new("ZSTD_inBuffer *") | |
|
2443 | 2543 | in_buffer.src = chunk_buffer |
|
2444 | 2544 | in_buffer.size = len(chunk_buffer) |
|
2445 | 2545 | in_buffer.pos = 0 |
|
2446 | 2546 | |
|
2447 | 2547 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) |
|
2448 | 2548 | if lib.ZSTD_isError(zresult): |
|
2449 |
raise ZstdError( |
|
|
2450 | _zstd_error(zresult)) | |
|
2549 | raise ZstdError("could not decompress chunk 0: %s" % _zstd_error(zresult)) | |
|
2451 | 2550 | elif zresult: |
|
2452 |
raise ZstdError( |
|
|
2551 | raise ZstdError("chunk 0 did not decompress full frame") | |
|
2453 | 2552 | |
|
2454 | 2553 | # Special case of chain length of 1 |
|
2455 | 2554 | if len(frames) == 1: |
@@ -2459,19 +2558,19 class ZstdDecompressor(object): | |||
|
2459 | 2558 | while i < len(frames): |
|
2460 | 2559 | chunk = frames[i] |
|
2461 | 2560 | if not isinstance(chunk, bytes_type): |
|
2462 |
raise ValueError( |
|
|
2561 | raise ValueError("chunk %d must be bytes" % i) | |
|
2463 | 2562 | |
|
2464 | 2563 | chunk_buffer = ffi.from_buffer(chunk) |
|
2465 | 2564 | zresult = lib.ZSTD_getFrameHeader(params, chunk_buffer, len(chunk_buffer)) |
|
2466 | 2565 | if lib.ZSTD_isError(zresult): |
|
2467 |
raise ValueError( |
|
|
2566 | raise ValueError("chunk %d is not a valid zstd frame" % i) | |
|
2468 | 2567 | elif zresult: |
|
2469 |
raise ValueError( |
|
|
2568 | raise ValueError("chunk %d is too small to contain a zstd frame" % i) | |
|
2470 | 2569 | |
|
2471 | 2570 | if params.frameContentSize == lib.ZSTD_CONTENTSIZE_UNKNOWN: |
|
2472 |
raise ValueError( |
|
|
2473 | ||
|
2474 |
dest_buffer = ffi.new( |
|
|
2571 | raise ValueError("chunk %d missing content size in frame" % i) | |
|
2572 | ||
|
2573 | dest_buffer = ffi.new("char[]", params.frameContentSize) | |
|
2475 | 2574 | |
|
2476 | 2575 | out_buffer.dst = dest_buffer |
|
2477 | 2576 | out_buffer.size = len(dest_buffer) |
@@ -2483,10 +2582,11 class ZstdDecompressor(object): | |||
|
2483 | 2582 | |
|
2484 | 2583 | zresult = lib.ZSTD_decompressStream(self._dctx, out_buffer, in_buffer) |
|
2485 | 2584 | if lib.ZSTD_isError(zresult): |
|
2486 |
raise ZstdError( |
|
|
2487 |
|
|
|
2585 | raise ZstdError( | |
|
2586 | "could not decompress chunk %d: %s" % _zstd_error(zresult) | |
|
2587 | ) | |
|
2488 | 2588 | elif zresult: |
|
2489 |
raise ZstdError( |
|
|
2589 | raise ZstdError("chunk %d did not decompress full frame" % i) | |
|
2490 | 2590 | |
|
2491 | 2591 | last_buffer = dest_buffer |
|
2492 | 2592 | i += 1 |
@@ -2497,19 +2597,19 class ZstdDecompressor(object): | |||
|
2497 | 2597 | lib.ZSTD_DCtx_reset(self._dctx, lib.ZSTD_reset_session_only) |
|
2498 | 2598 | |
|
2499 | 2599 | if self._max_window_size: |
|
2500 | zresult = lib.ZSTD_DCtx_setMaxWindowSize(self._dctx, | |
|
2501 | self._max_window_size) | |
|
2600 | zresult = lib.ZSTD_DCtx_setMaxWindowSize(self._dctx, self._max_window_size) | |
|
2502 | 2601 | if lib.ZSTD_isError(zresult): |
|
2503 |
raise ZstdError( |
|
|
2504 |
|
|
|
2602 | raise ZstdError( | |
|
2603 | "unable to set max window size: %s" % _zstd_error(zresult) | |
|
2604 | ) | |
|
2505 | 2605 | |
|
2506 | 2606 | zresult = lib.ZSTD_DCtx_setFormat(self._dctx, self._format) |
|
2507 | 2607 | if lib.ZSTD_isError(zresult): |
|
2508 |
raise ZstdError( |
|
|
2509 | _zstd_error(zresult)) | |
|
2608 | raise ZstdError("unable to set decoding format: %s" % _zstd_error(zresult)) | |
|
2510 | 2609 | |
|
2511 | 2610 | if self._dict_data and load_dict: |
|
2512 | 2611 | zresult = lib.ZSTD_DCtx_refDDict(self._dctx, self._dict_data._ddict) |
|
2513 | 2612 | if lib.ZSTD_isError(zresult): |
|
2514 | raise ZstdError('unable to reference prepared dictionary: %s' % | |
|
2515 |
|
|
|
2613 | raise ZstdError( | |
|
2614 | "unable to reference prepared dictionary: %s" % _zstd_error(zresult) | |
|
2615 | ) |
@@ -210,7 +210,7 void zstd_module_init(PyObject* m) { | |||
|
210 | 210 | We detect this mismatch here and refuse to load the module if this |
|
211 | 211 | scenario is detected. |
|
212 | 212 | */ |
|
213 |
if (ZSTD_VERSION_NUMBER != 1040 |
|
|
213 | if (ZSTD_VERSION_NUMBER != 10404 || ZSTD_versionNumber() != 10404) { | |
|
214 | 214 | PyErr_SetString(PyExc_ImportError, "zstd C API mismatch; Python bindings not compiled against expected zstd version"); |
|
215 | 215 | return; |
|
216 | 216 | } |
@@ -164,7 +164,7 MEM_STATIC unsigned BIT_highbit32 (U32 v | |||
|
164 | 164 | _BitScanReverse ( &r, val ); |
|
165 | 165 | return (unsigned) r; |
|
166 | 166 | # elif defined(__GNUC__) && (__GNUC__ >= 3) /* Use GCC Intrinsic */ |
|
167 |
return |
|
|
167 | return __builtin_clz (val) ^ 31; | |
|
168 | 168 | # elif defined(__ICCARM__) /* IAR Intrinsic */ |
|
169 | 169 | return 31 - __CLZ(val); |
|
170 | 170 | # else /* Software version */ |
@@ -244,9 +244,9 MEM_STATIC void BIT_flushBitsFast(BIT_CS | |||
|
244 | 244 | { |
|
245 | 245 | size_t const nbBytes = bitC->bitPos >> 3; |
|
246 | 246 | assert(bitC->bitPos < sizeof(bitC->bitContainer) * 8); |
|
247 | assert(bitC->ptr <= bitC->endPtr); | |
|
247 | 248 | MEM_writeLEST(bitC->ptr, bitC->bitContainer); |
|
248 | 249 | bitC->ptr += nbBytes; |
|
249 | assert(bitC->ptr <= bitC->endPtr); | |
|
250 | 250 | bitC->bitPos &= 7; |
|
251 | 251 | bitC->bitContainer >>= nbBytes*8; |
|
252 | 252 | } |
@@ -260,6 +260,7 MEM_STATIC void BIT_flushBits(BIT_CStrea | |||
|
260 | 260 | { |
|
261 | 261 | size_t const nbBytes = bitC->bitPos >> 3; |
|
262 | 262 | assert(bitC->bitPos < sizeof(bitC->bitContainer) * 8); |
|
263 | assert(bitC->ptr <= bitC->endPtr); | |
|
263 | 264 | MEM_writeLEST(bitC->ptr, bitC->bitContainer); |
|
264 | 265 | bitC->ptr += nbBytes; |
|
265 | 266 | if (bitC->ptr > bitC->endPtr) bitC->ptr = bitC->endPtr; |
@@ -61,6 +61,13 | |||
|
61 | 61 | # define HINT_INLINE static INLINE_KEYWORD FORCE_INLINE_ATTR |
|
62 | 62 | #endif |
|
63 | 63 | |
|
64 | /* UNUSED_ATTR tells the compiler it is okay if the function is unused. */ | |
|
65 | #if defined(__GNUC__) | |
|
66 | # define UNUSED_ATTR __attribute__((unused)) | |
|
67 | #else | |
|
68 | # define UNUSED_ATTR | |
|
69 | #endif | |
|
70 | ||
|
64 | 71 | /* force no inlining */ |
|
65 | 72 | #ifdef _MSC_VER |
|
66 | 73 | # define FORCE_NOINLINE static __declspec(noinline) |
@@ -127,9 +134,14 | |||
|
127 | 134 | } \ |
|
128 | 135 | } |
|
129 | 136 | |
|
130 |
/* vectorization |
|
|
137 | /* vectorization | |
|
138 | * older GCC (pre gcc-4.3 picked as the cutoff) uses a different syntax */ | |
|
131 | 139 | #if !defined(__clang__) && defined(__GNUC__) |
|
132 | # define DONT_VECTORIZE __attribute__((optimize("no-tree-vectorize"))) | |
|
140 | # if (__GNUC__ == 4 && __GNUC_MINOR__ > 3) || (__GNUC__ >= 5) | |
|
141 | # define DONT_VECTORIZE __attribute__((optimize("no-tree-vectorize"))) | |
|
142 | # else | |
|
143 | # define DONT_VECTORIZE _Pragma("GCC optimize(\"no-tree-vectorize\")") | |
|
144 | # endif | |
|
133 | 145 | #else |
|
134 | 146 | # define DONT_VECTORIZE |
|
135 | 147 | #endif |
@@ -308,7 +308,7 If there is an error, the function will | |||
|
308 | 308 | *******************************************/ |
|
309 | 309 | /* FSE buffer bounds */ |
|
310 | 310 | #define FSE_NCOUNTBOUND 512 |
|
311 | #define FSE_BLOCKBOUND(size) (size + (size>>7)) | |
|
311 | #define FSE_BLOCKBOUND(size) (size + (size>>7) + 4 /* fse states */ + sizeof(size_t) /* bitContainer */) | |
|
312 | 312 | #define FSE_COMPRESSBOUND(size) (FSE_NCOUNTBOUND + FSE_BLOCKBOUND(size)) /* Macro version, useful for static allocation */ |
|
313 | 313 | |
|
314 | 314 | /* It is possible to statically allocate FSE CTable/DTable as a table of FSE_CTable/FSE_DTable using below macros */ |
@@ -52,7 +52,9 | |||
|
52 | 52 | #define FSE_STATIC_ASSERT(c) DEBUG_STATIC_ASSERT(c) /* use only *after* variable declarations */ |
|
53 | 53 | |
|
54 | 54 | /* check and forward error code */ |
|
55 | #ifndef CHECK_F | |
|
55 | 56 | #define CHECK_F(f) { size_t const e = f; if (FSE_isError(e)) return e; } |
|
57 | #endif | |
|
56 | 58 | |
|
57 | 59 | |
|
58 | 60 | /* ************************************************************** |
@@ -47,6 +47,79 extern "C" { | |||
|
47 | 47 | #define MEM_STATIC_ASSERT(c) { enum { MEM_static_assert = 1/(int)(!!(c)) }; } |
|
48 | 48 | MEM_STATIC void MEM_check(void) { MEM_STATIC_ASSERT((sizeof(size_t)==4) || (sizeof(size_t)==8)); } |
|
49 | 49 | |
|
50 | /* detects whether we are being compiled under msan */ | |
|
51 | #if defined (__has_feature) | |
|
52 | # if __has_feature(memory_sanitizer) | |
|
53 | # define MEMORY_SANITIZER 1 | |
|
54 | # endif | |
|
55 | #endif | |
|
56 | ||
|
57 | #if defined (MEMORY_SANITIZER) | |
|
58 | /* Not all platforms that support msan provide sanitizers/msan_interface.h. | |
|
59 | * We therefore declare the functions we need ourselves, rather than trying to | |
|
60 | * include the header file... */ | |
|
61 | ||
|
62 | #include <stdint.h> /* intptr_t */ | |
|
63 | ||
|
64 | /* Make memory region fully initialized (without changing its contents). */ | |
|
65 | void __msan_unpoison(const volatile void *a, size_t size); | |
|
66 | ||
|
67 | /* Make memory region fully uninitialized (without changing its contents). | |
|
68 | This is a legacy interface that does not update origin information. Use | |
|
69 | __msan_allocated_memory() instead. */ | |
|
70 | void __msan_poison(const volatile void *a, size_t size); | |
|
71 | ||
|
72 | /* Returns the offset of the first (at least partially) poisoned byte in the | |
|
73 | memory range, or -1 if the whole range is good. */ | |
|
74 | intptr_t __msan_test_shadow(const volatile void *x, size_t size); | |
|
75 | #endif | |
|
76 | ||
|
77 | /* detects whether we are being compiled under asan */ | |
|
78 | #if defined (__has_feature) | |
|
79 | # if __has_feature(address_sanitizer) | |
|
80 | # define ADDRESS_SANITIZER 1 | |
|
81 | # endif | |
|
82 | #elif defined(__SANITIZE_ADDRESS__) | |
|
83 | # define ADDRESS_SANITIZER 1 | |
|
84 | #endif | |
|
85 | ||
|
86 | #if defined (ADDRESS_SANITIZER) | |
|
87 | /* Not all platforms that support asan provide sanitizers/asan_interface.h. | |
|
88 | * We therefore declare the functions we need ourselves, rather than trying to | |
|
89 | * include the header file... */ | |
|
90 | ||
|
91 | /** | |
|
92 | * Marks a memory region (<c>[addr, addr+size)</c>) as unaddressable. | |
|
93 | * | |
|
94 | * This memory must be previously allocated by your program. Instrumented | |
|
95 | * code is forbidden from accessing addresses in this region until it is | |
|
96 | * unpoisoned. This function is not guaranteed to poison the entire region - | |
|
97 | * it could poison only a subregion of <c>[addr, addr+size)</c> due to ASan | |
|
98 | * alignment restrictions. | |
|
99 | * | |
|
100 | * \note This function is not thread-safe because no two threads can poison or | |
|
101 | * unpoison memory in the same memory region simultaneously. | |
|
102 | * | |
|
103 | * \param addr Start of memory region. | |
|
104 | * \param size Size of memory region. */ | |
|
105 | void __asan_poison_memory_region(void const volatile *addr, size_t size); | |
|
106 | ||
|
107 | /** | |
|
108 | * Marks a memory region (<c>[addr, addr+size)</c>) as addressable. | |
|
109 | * | |
|
110 | * This memory must be previously allocated by your program. Accessing | |
|
111 | * addresses in this region is allowed until this region is poisoned again. | |
|
112 | * This function could unpoison a super-region of <c>[addr, addr+size)</c> due | |
|
113 | * to ASan alignment restrictions. | |
|
114 | * | |
|
115 | * \note This function is not thread-safe because no two threads can | |
|
116 | * poison or unpoison memory in the same memory region simultaneously. | |
|
117 | * | |
|
118 | * \param addr Start of memory region. | |
|
119 | * \param size Size of memory region. */ | |
|
120 | void __asan_unpoison_memory_region(void const volatile *addr, size_t size); | |
|
121 | #endif | |
|
122 | ||
|
50 | 123 | |
|
51 | 124 | /*-************************************************************** |
|
52 | 125 | * Basic Types |
@@ -127,9 +127,13 POOL_ctx* POOL_create_advanced(size_t nu | |||
|
127 | 127 | ctx->queueTail = 0; |
|
128 | 128 | ctx->numThreadsBusy = 0; |
|
129 | 129 | ctx->queueEmpty = 1; |
|
130 | (void)ZSTD_pthread_mutex_init(&ctx->queueMutex, NULL); | |
|
131 | (void)ZSTD_pthread_cond_init(&ctx->queuePushCond, NULL); | |
|
132 |
|
|
|
130 | { | |
|
131 | int error = 0; | |
|
132 | error |= ZSTD_pthread_mutex_init(&ctx->queueMutex, NULL); | |
|
133 | error |= ZSTD_pthread_cond_init(&ctx->queuePushCond, NULL); | |
|
134 | error |= ZSTD_pthread_cond_init(&ctx->queuePopCond, NULL); | |
|
135 | if (error) { POOL_free(ctx); return NULL; } | |
|
136 | } | |
|
133 | 137 | ctx->shutdown = 0; |
|
134 | 138 | /* Allocate space for the thread handles */ |
|
135 | 139 | ctx->threads = (ZSTD_pthread_t*)ZSTD_malloc(numThreads * sizeof(ZSTD_pthread_t), customMem); |
@@ -14,6 +14,8 | |||
|
14 | 14 | * This file will hold wrapper for systems, which do not support pthreads |
|
15 | 15 | */ |
|
16 | 16 | |
|
17 | #include "threading.h" | |
|
18 | ||
|
17 | 19 | /* create fake symbol to avoid empty translation unit warning */ |
|
18 | 20 | int g_ZSTD_threading_useless_symbol; |
|
19 | 21 | |
@@ -28,7 +30,6 int g_ZSTD_threading_useless_symbol; | |||
|
28 | 30 | /* === Dependencies === */ |
|
29 | 31 | #include <process.h> |
|
30 | 32 | #include <errno.h> |
|
31 | #include "threading.h" | |
|
32 | 33 | |
|
33 | 34 | |
|
34 | 35 | /* === Implementation === */ |
@@ -73,3 +74,47 int ZSTD_pthread_join(ZSTD_pthread_t thr | |||
|
73 | 74 | } |
|
74 | 75 | |
|
75 | 76 | #endif /* ZSTD_MULTITHREAD */ |
|
77 | ||
|
78 | #if defined(ZSTD_MULTITHREAD) && DEBUGLEVEL >= 1 && !defined(_WIN32) | |
|
79 | ||
|
80 | #include <stdlib.h> | |
|
81 | ||
|
82 | int ZSTD_pthread_mutex_init(ZSTD_pthread_mutex_t* mutex, pthread_mutexattr_t const* attr) | |
|
83 | { | |
|
84 | *mutex = (pthread_mutex_t*)malloc(sizeof(pthread_mutex_t)); | |
|
85 | if (!*mutex) | |
|
86 | return 1; | |
|
87 | return pthread_mutex_init(*mutex, attr); | |
|
88 | } | |
|
89 | ||
|
90 | int ZSTD_pthread_mutex_destroy(ZSTD_pthread_mutex_t* mutex) | |
|
91 | { | |
|
92 | if (!*mutex) | |
|
93 | return 0; | |
|
94 | { | |
|
95 | int const ret = pthread_mutex_destroy(*mutex); | |
|
96 | free(*mutex); | |
|
97 | return ret; | |
|
98 | } | |
|
99 | } | |
|
100 | ||
|
101 | int ZSTD_pthread_cond_init(ZSTD_pthread_cond_t* cond, pthread_condattr_t const* attr) | |
|
102 | { | |
|
103 | *cond = (pthread_cond_t*)malloc(sizeof(pthread_cond_t)); | |
|
104 | if (!*cond) | |
|
105 | return 1; | |
|
106 | return pthread_cond_init(*cond, attr); | |
|
107 | } | |
|
108 | ||
|
109 | int ZSTD_pthread_cond_destroy(ZSTD_pthread_cond_t* cond) | |
|
110 | { | |
|
111 | if (!*cond) | |
|
112 | return 0; | |
|
113 | { | |
|
114 | int const ret = pthread_cond_destroy(*cond); | |
|
115 | free(*cond); | |
|
116 | return ret; | |
|
117 | } | |
|
118 | } | |
|
119 | ||
|
120 | #endif |
@@ -13,6 +13,8 | |||
|
13 | 13 | #ifndef THREADING_H_938743 |
|
14 | 14 | #define THREADING_H_938743 |
|
15 | 15 | |
|
16 | #include "debug.h" | |
|
17 | ||
|
16 | 18 | #if defined (__cplusplus) |
|
17 | 19 | extern "C" { |
|
18 | 20 | #endif |
@@ -75,10 +77,12 int ZSTD_pthread_join(ZSTD_pthread_t thr | |||
|
75 | 77 | */ |
|
76 | 78 | |
|
77 | 79 | |
|
78 | #elif defined(ZSTD_MULTITHREAD) /* posix assumed ; need a better detection method */ | |
|
80 | #elif defined(ZSTD_MULTITHREAD) /* posix assumed ; need a better detection method */ | |
|
79 | 81 | /* === POSIX Systems === */ |
|
80 | 82 | # include <pthread.h> |
|
81 | 83 | |
|
84 | #if DEBUGLEVEL < 1 | |
|
85 | ||
|
82 | 86 | #define ZSTD_pthread_mutex_t pthread_mutex_t |
|
83 | 87 | #define ZSTD_pthread_mutex_init(a, b) pthread_mutex_init((a), (b)) |
|
84 | 88 | #define ZSTD_pthread_mutex_destroy(a) pthread_mutex_destroy((a)) |
@@ -96,6 +100,33 int ZSTD_pthread_join(ZSTD_pthread_t thr | |||
|
96 | 100 | #define ZSTD_pthread_create(a, b, c, d) pthread_create((a), (b), (c), (d)) |
|
97 | 101 | #define ZSTD_pthread_join(a, b) pthread_join((a),(b)) |
|
98 | 102 | |
|
103 | #else /* DEBUGLEVEL >= 1 */ | |
|
104 | ||
|
105 | /* Debug implementation of threading. | |
|
106 | * In this implementation we use pointers for mutexes and condition variables. | |
|
107 | * This way, if we forget to init/destroy them the program will crash or ASAN | |
|
108 | * will report leaks. | |
|
109 | */ | |
|
110 | ||
|
111 | #define ZSTD_pthread_mutex_t pthread_mutex_t* | |
|
112 | int ZSTD_pthread_mutex_init(ZSTD_pthread_mutex_t* mutex, pthread_mutexattr_t const* attr); | |
|
113 | int ZSTD_pthread_mutex_destroy(ZSTD_pthread_mutex_t* mutex); | |
|
114 | #define ZSTD_pthread_mutex_lock(a) pthread_mutex_lock(*(a)) | |
|
115 | #define ZSTD_pthread_mutex_unlock(a) pthread_mutex_unlock(*(a)) | |
|
116 | ||
|
117 | #define ZSTD_pthread_cond_t pthread_cond_t* | |
|
118 | int ZSTD_pthread_cond_init(ZSTD_pthread_cond_t* cond, pthread_condattr_t const* attr); | |
|
119 | int ZSTD_pthread_cond_destroy(ZSTD_pthread_cond_t* cond); | |
|
120 | #define ZSTD_pthread_cond_wait(a, b) pthread_cond_wait(*(a), *(b)) | |
|
121 | #define ZSTD_pthread_cond_signal(a) pthread_cond_signal(*(a)) | |
|
122 | #define ZSTD_pthread_cond_broadcast(a) pthread_cond_broadcast(*(a)) | |
|
123 | ||
|
124 | #define ZSTD_pthread_t pthread_t | |
|
125 | #define ZSTD_pthread_create(a, b, c, d) pthread_create((a), (b), (c), (d)) | |
|
126 | #define ZSTD_pthread_join(a, b) pthread_join((a),(b)) | |
|
127 | ||
|
128 | #endif | |
|
129 | ||
|
99 | 130 | #else /* ZSTD_MULTITHREAD not defined */ |
|
100 | 131 | /* No multithreading support */ |
|
101 | 132 |
@@ -197,79 +197,56 static void ZSTD_copy8(void* dst, const | |||
|
197 | 197 | static void ZSTD_copy16(void* dst, const void* src) { memcpy(dst, src, 16); } |
|
198 | 198 | #define COPY16(d,s) { ZSTD_copy16(d,s); d+=16; s+=16; } |
|
199 | 199 | |
|
200 |
#define WILDCOPY_OVERLENGTH |
|
|
201 | #define VECLEN 16 | |
|
200 | #define WILDCOPY_OVERLENGTH 32 | |
|
201 | #define WILDCOPY_VECLEN 16 | |
|
202 | 202 | |
|
203 | 203 | typedef enum { |
|
204 | 204 | ZSTD_no_overlap, |
|
205 |
ZSTD_overlap_src_before_dst |
|
|
205 | ZSTD_overlap_src_before_dst | |
|
206 | 206 | /* ZSTD_overlap_dst_before_src, */ |
|
207 | 207 | } ZSTD_overlap_e; |
|
208 | 208 | |
|
209 | 209 | /*! ZSTD_wildcopy() : |
|
210 |
* |
|
|
210 | * Custom version of memcpy(), can over read/write up to WILDCOPY_OVERLENGTH bytes (if length==0) | |
|
211 | * @param ovtype controls the overlap detection | |
|
212 | * - ZSTD_no_overlap: The source and destination are guaranteed to be at least WILDCOPY_VECLEN bytes apart. | |
|
213 | * - ZSTD_overlap_src_before_dst: The src and dst may overlap, but they MUST be at least 8 bytes apart. | |
|
214 | * The src buffer must be before the dst buffer. | |
|
215 | */ | |
|
211 | 216 | MEM_STATIC FORCE_INLINE_ATTR DONT_VECTORIZE |
|
212 | void ZSTD_wildcopy(void* dst, const void* src, ptrdiff_t length, ZSTD_overlap_e ovtype) | |
|
217 | void ZSTD_wildcopy(void* dst, const void* src, ptrdiff_t length, ZSTD_overlap_e const ovtype) | |
|
213 | 218 | { |
|
214 | 219 | ptrdiff_t diff = (BYTE*)dst - (const BYTE*)src; |
|
215 | 220 | const BYTE* ip = (const BYTE*)src; |
|
216 | 221 | BYTE* op = (BYTE*)dst; |
|
217 | 222 | BYTE* const oend = op + length; |
|
218 | 223 | |
|
219 |
assert(diff >= 8 || (ovtype == ZSTD_no_overlap && diff < - |
|
|
220 | if (length < VECLEN || (ovtype == ZSTD_overlap_src_before_dst && diff < VECLEN)) { | |
|
221 | do | |
|
222 | COPY8(op, ip) | |
|
223 | while (op < oend); | |
|
224 | } | |
|
225 | else { | |
|
226 | if ((length & 8) == 0) | |
|
227 | COPY8(op, ip); | |
|
228 | do { | |
|
229 | COPY16(op, ip); | |
|
230 | } | |
|
231 | while (op < oend); | |
|
232 | } | |
|
233 | } | |
|
234 | ||
|
235 | /*! ZSTD_wildcopy_16min() : | |
|
236 | * same semantics as ZSTD_wilcopy() except guaranteed to be able to copy 16 bytes at the start */ | |
|
237 | MEM_STATIC FORCE_INLINE_ATTR DONT_VECTORIZE | |
|
238 | void ZSTD_wildcopy_16min(void* dst, const void* src, ptrdiff_t length, ZSTD_overlap_e ovtype) | |
|
239 | { | |
|
240 | ptrdiff_t diff = (BYTE*)dst - (const BYTE*)src; | |
|
241 | const BYTE* ip = (const BYTE*)src; | |
|
242 | BYTE* op = (BYTE*)dst; | |
|
243 | BYTE* const oend = op + length; | |
|
224 | assert(diff >= 8 || (ovtype == ZSTD_no_overlap && diff <= -WILDCOPY_VECLEN)); | |
|
244 | 225 | |
|
245 | assert(length >= 8); | |
|
246 | assert(diff >= 8 || (ovtype == ZSTD_no_overlap && diff < -8)); | |
|
247 | ||
|
248 | if (ovtype == ZSTD_overlap_src_before_dst && diff < VECLEN) { | |
|
249 | do | |
|
250 | COPY8(op, ip) | |
|
251 | while (op < oend); | |
|
252 | } | |
|
253 | else { | |
|
254 | if ((length & 8) == 0) | |
|
255 | COPY8(op, ip); | |
|
256 | do { | |
|
226 | if (ovtype == ZSTD_overlap_src_before_dst && diff < WILDCOPY_VECLEN) { | |
|
227 | /* Handle short offset copies. */ | |
|
228 | do { | |
|
229 | COPY8(op, ip) | |
|
230 | } while (op < oend); | |
|
231 | } else { | |
|
232 | assert(diff >= WILDCOPY_VECLEN || diff <= -WILDCOPY_VECLEN); | |
|
233 | /* Separate out the first two COPY16() calls because the copy length is | |
|
234 | * almost certain to be short, so the branches have different | |
|
235 | * probabilities. | |
|
236 | * On gcc-9 unrolling once is +1.6%, twice is +2%, thrice is +1.8%. | |
|
237 | * On clang-8 unrolling once is +1.4%, twice is +3.3%, thrice is +3%. | |
|
238 | */ | |
|
257 | 239 | COPY16(op, ip); |
|
258 | } | |
|
259 |
|
|
|
240 | COPY16(op, ip); | |
|
241 | if (op >= oend) return; | |
|
242 | do { | |
|
243 | COPY16(op, ip); | |
|
244 | COPY16(op, ip); | |
|
245 | } | |
|
246 | while (op < oend); | |
|
260 | 247 | } |
|
261 | 248 | } |
|
262 | 249 | |
|
263 | MEM_STATIC void ZSTD_wildcopy_e(void* dst, const void* src, void* dstEnd) /* should be faster for decoding, but strangely, not verified on all platform */ | |
|
264 | { | |
|
265 | const BYTE* ip = (const BYTE*)src; | |
|
266 | BYTE* op = (BYTE*)dst; | |
|
267 | BYTE* const oend = (BYTE*)dstEnd; | |
|
268 | do | |
|
269 | COPY8(op, ip) | |
|
270 | while (op < oend); | |
|
271 | } | |
|
272 | ||
|
273 | 250 | |
|
274 | 251 | /*-******************************************* |
|
275 | 252 | * Private declarations |
@@ -323,7 +300,7 MEM_STATIC U32 ZSTD_highbit32(U32 val) | |||
|
323 | 300 | _BitScanReverse(&r, val); |
|
324 | 301 | return (unsigned)r; |
|
325 | 302 | # elif defined(__GNUC__) && (__GNUC__ >= 3) /* GCC Intrinsic */ |
|
326 |
return |
|
|
303 | return __builtin_clz (val) ^ 31; | |
|
327 | 304 | # elif defined(__ICCARM__) /* IAR Intrinsic */ |
|
328 | 305 | return 31 - __CLZ(val); |
|
329 | 306 | # else /* Software version */ |
This diff has been collapsed as it changes many lines, (1099 lines changed) Show them Hide them | |||
@@ -42,15 +42,15 size_t ZSTD_compressBound(size_t srcSize | |||
|
42 | 42 | * Context memory management |
|
43 | 43 | ***************************************/ |
|
44 | 44 | struct ZSTD_CDict_s { |
|
45 | void* dictBuffer; | |
|
46 | 45 | const void* dictContent; |
|
47 | 46 | size_t dictContentSize; |
|
48 | void* workspace; | |
|
49 |
|
|
|
47 | U32* entropyWorkspace; /* entropy workspace of HUF_WORKSPACE_SIZE bytes */ | |
|
48 | ZSTD_cwksp workspace; | |
|
50 | 49 | ZSTD_matchState_t matchState; |
|
51 | 50 | ZSTD_compressedBlockState_t cBlockState; |
|
52 | 51 | ZSTD_customMem customMem; |
|
53 | 52 | U32 dictID; |
|
53 | int compressionLevel; /* 0 indicates that advanced API was used to select CDict params */ | |
|
54 | 54 | }; /* typedef'd to ZSTD_CDict within "zstd.h" */ |
|
55 | 55 | |
|
56 | 56 | ZSTD_CCtx* ZSTD_createCCtx(void) |
@@ -84,23 +84,26 ZSTD_CCtx* ZSTD_createCCtx_advanced(ZSTD | |||
|
84 | 84 | |
|
85 | 85 | ZSTD_CCtx* ZSTD_initStaticCCtx(void *workspace, size_t workspaceSize) |
|
86 | 86 | { |
|
87 | ZSTD_CCtx* const cctx = (ZSTD_CCtx*) workspace; | |
|
87 | ZSTD_cwksp ws; | |
|
88 | ZSTD_CCtx* cctx; | |
|
88 | 89 | if (workspaceSize <= sizeof(ZSTD_CCtx)) return NULL; /* minimum size */ |
|
89 | 90 | if ((size_t)workspace & 7) return NULL; /* must be 8-aligned */ |
|
90 | memset(workspace, 0, workspaceSize); /* may be a bit generous, could memset be smaller ? */ | |
|
91 | ZSTD_cwksp_init(&ws, workspace, workspaceSize); | |
|
92 | ||
|
93 | cctx = (ZSTD_CCtx*)ZSTD_cwksp_reserve_object(&ws, sizeof(ZSTD_CCtx)); | |
|
94 | if (cctx == NULL) { | |
|
95 | return NULL; | |
|
96 | } | |
|
97 | memset(cctx, 0, sizeof(ZSTD_CCtx)); | |
|
98 | ZSTD_cwksp_move(&cctx->workspace, &ws); | |
|
91 | 99 | cctx->staticSize = workspaceSize; |
|
92 | cctx->workSpace = (void*)(cctx+1); | |
|
93 | cctx->workSpaceSize = workspaceSize - sizeof(ZSTD_CCtx); | |
|
94 | 100 | |
|
95 | 101 | /* statically sized space. entropyWorkspace never moves (but prev/next block swap places) */ |
|
96 |
if (cctx->work |
|
|
97 | assert(((size_t)cctx->workSpace & (sizeof(void*)-1)) == 0); /* ensure correct alignment */ | |
|
98 |
cctx->blockState. |
|
|
99 | cctx->blockState.nextCBlock = cctx->blockState.prevCBlock + 1; | |
|
100 | { | |
|
101 | void* const ptr = cctx->blockState.nextCBlock + 1; | |
|
102 | cctx->entropyWorkspace = (U32*)ptr; | |
|
103 | } | |
|
102 | if (!ZSTD_cwksp_check_available(&cctx->workspace, HUF_WORKSPACE_SIZE + 2 * sizeof(ZSTD_compressedBlockState_t))) return NULL; | |
|
103 | cctx->blockState.prevCBlock = (ZSTD_compressedBlockState_t*)ZSTD_cwksp_reserve_object(&cctx->workspace, sizeof(ZSTD_compressedBlockState_t)); | |
|
104 | cctx->blockState.nextCBlock = (ZSTD_compressedBlockState_t*)ZSTD_cwksp_reserve_object(&cctx->workspace, sizeof(ZSTD_compressedBlockState_t)); | |
|
105 | cctx->entropyWorkspace = (U32*)ZSTD_cwksp_reserve_object( | |
|
106 | &cctx->workspace, HUF_WORKSPACE_SIZE); | |
|
104 | 107 | cctx->bmi2 = ZSTD_cpuid_bmi2(ZSTD_cpuid()); |
|
105 | 108 | return cctx; |
|
106 | 109 | } |
@@ -128,11 +131,11 static void ZSTD_freeCCtxContent(ZSTD_CC | |||
|
128 | 131 | { |
|
129 | 132 | assert(cctx != NULL); |
|
130 | 133 | assert(cctx->staticSize == 0); |
|
131 | ZSTD_free(cctx->workSpace, cctx->customMem); cctx->workSpace = NULL; | |
|
132 | 134 | ZSTD_clearAllDicts(cctx); |
|
133 | 135 | #ifdef ZSTD_MULTITHREAD |
|
134 | 136 | ZSTDMT_freeCCtx(cctx->mtctx); cctx->mtctx = NULL; |
|
135 | 137 | #endif |
|
138 | ZSTD_cwksp_free(&cctx->workspace, cctx->customMem); | |
|
136 | 139 | } |
|
137 | 140 | |
|
138 | 141 | size_t ZSTD_freeCCtx(ZSTD_CCtx* cctx) |
@@ -140,8 +143,13 size_t ZSTD_freeCCtx(ZSTD_CCtx* cctx) | |||
|
140 | 143 | if (cctx==NULL) return 0; /* support free on NULL */ |
|
141 | 144 | RETURN_ERROR_IF(cctx->staticSize, memory_allocation, |
|
142 | 145 | "not compatible with static CCtx"); |
|
143 | ZSTD_freeCCtxContent(cctx); | |
|
144 | ZSTD_free(cctx, cctx->customMem); | |
|
146 | { | |
|
147 | int cctxInWorkspace = ZSTD_cwksp_owns_buffer(&cctx->workspace, cctx); | |
|
148 | ZSTD_freeCCtxContent(cctx); | |
|
149 | if (!cctxInWorkspace) { | |
|
150 | ZSTD_free(cctx, cctx->customMem); | |
|
151 | } | |
|
152 | } | |
|
145 | 153 | return 0; |
|
146 | 154 | } |
|
147 | 155 | |
@@ -160,7 +168,9 static size_t ZSTD_sizeof_mtctx(const ZS | |||
|
160 | 168 | size_t ZSTD_sizeof_CCtx(const ZSTD_CCtx* cctx) |
|
161 | 169 | { |
|
162 | 170 | if (cctx==NULL) return 0; /* support sizeof on NULL */ |
|
163 | return sizeof(*cctx) + cctx->workSpaceSize | |
|
171 | /* cctx may be in the workspace */ | |
|
172 | return (cctx->workspace.workspace == cctx ? 0 : sizeof(*cctx)) | |
|
173 | + ZSTD_cwksp_sizeof(&cctx->workspace) | |
|
164 | 174 | + ZSTD_sizeof_localDict(cctx->localDict) |
|
165 | 175 | + ZSTD_sizeof_mtctx(cctx); |
|
166 | 176 | } |
@@ -229,23 +239,23 size_t ZSTD_CCtxParams_init_advanced(ZST | |||
|
229 | 239 | RETURN_ERROR_IF(!cctxParams, GENERIC); |
|
230 | 240 | FORWARD_IF_ERROR( ZSTD_checkCParams(params.cParams) ); |
|
231 | 241 | memset(cctxParams, 0, sizeof(*cctxParams)); |
|
242 | assert(!ZSTD_checkCParams(params.cParams)); | |
|
232 | 243 | cctxParams->cParams = params.cParams; |
|
233 | 244 | cctxParams->fParams = params.fParams; |
|
234 | 245 | cctxParams->compressionLevel = ZSTD_CLEVEL_DEFAULT; /* should not matter, as all cParams are presumed properly defined */ |
|
235 | assert(!ZSTD_checkCParams(params.cParams)); | |
|
236 | 246 | return 0; |
|
237 | 247 | } |
|
238 | 248 | |
|
239 | 249 | /* ZSTD_assignParamsToCCtxParams() : |
|
240 | 250 | * params is presumed valid at this stage */ |
|
241 | 251 | static ZSTD_CCtx_params ZSTD_assignParamsToCCtxParams( |
|
242 | ZSTD_CCtx_params cctxParams, ZSTD_parameters params) | |
|
252 | const ZSTD_CCtx_params* cctxParams, ZSTD_parameters params) | |
|
243 | 253 | { |
|
244 | ZSTD_CCtx_params ret = cctxParams; | |
|
254 | ZSTD_CCtx_params ret = *cctxParams; | |
|
255 | assert(!ZSTD_checkCParams(params.cParams)); | |
|
245 | 256 | ret.cParams = params.cParams; |
|
246 | 257 | ret.fParams = params.fParams; |
|
247 | 258 | ret.compressionLevel = ZSTD_CLEVEL_DEFAULT; /* should not matter, as all cParams are presumed properly defined */ |
|
248 | assert(!ZSTD_checkCParams(params.cParams)); | |
|
249 | 259 | return ret; |
|
250 | 260 | } |
|
251 | 261 | |
@@ -378,7 +388,7 ZSTD_bounds ZSTD_cParam_getBounds(ZSTD_c | |||
|
378 | 388 | case ZSTD_c_forceAttachDict: |
|
379 | 389 | ZSTD_STATIC_ASSERT(ZSTD_dictDefaultAttach < ZSTD_dictForceCopy); |
|
380 | 390 | bounds.lowerBound = ZSTD_dictDefaultAttach; |
|
381 |
bounds.upperBound = ZSTD_dictForce |
|
|
391 | bounds.upperBound = ZSTD_dictForceLoad; /* note : how to ensure at compile time that this is the highest value enum ? */ | |
|
382 | 392 | return bounds; |
|
383 | 393 | |
|
384 | 394 | case ZSTD_c_literalCompressionMode: |
@@ -392,6 +402,11 ZSTD_bounds ZSTD_cParam_getBounds(ZSTD_c | |||
|
392 | 402 | bounds.upperBound = ZSTD_TARGETCBLOCKSIZE_MAX; |
|
393 | 403 | return bounds; |
|
394 | 404 | |
|
405 | case ZSTD_c_srcSizeHint: | |
|
406 | bounds.lowerBound = ZSTD_SRCSIZEHINT_MIN; | |
|
407 | bounds.upperBound = ZSTD_SRCSIZEHINT_MAX; | |
|
408 | return bounds; | |
|
409 | ||
|
395 | 410 | default: |
|
396 | 411 | { ZSTD_bounds const boundError = { ERROR(parameter_unsupported), 0, 0 }; |
|
397 | 412 | return boundError; |
@@ -448,6 +463,7 static int ZSTD_isUpdateAuthorized(ZSTD_ | |||
|
448 | 463 | case ZSTD_c_forceAttachDict: |
|
449 | 464 | case ZSTD_c_literalCompressionMode: |
|
450 | 465 | case ZSTD_c_targetCBlockSize: |
|
466 | case ZSTD_c_srcSizeHint: | |
|
451 | 467 | default: |
|
452 | 468 | return 0; |
|
453 | 469 | } |
@@ -494,6 +510,7 size_t ZSTD_CCtx_setParameter(ZSTD_CCtx* | |||
|
494 | 510 | case ZSTD_c_ldmMinMatch: |
|
495 | 511 | case ZSTD_c_ldmBucketSizeLog: |
|
496 | 512 | case ZSTD_c_targetCBlockSize: |
|
513 | case ZSTD_c_srcSizeHint: | |
|
497 | 514 | break; |
|
498 | 515 | |
|
499 | 516 | default: RETURN_ERROR(parameter_unsupported); |
@@ -517,33 +534,33 size_t ZSTD_CCtxParams_setParameter(ZSTD | |||
|
517 | 534 | if (value) { /* 0 : does not change current level */ |
|
518 | 535 | CCtxParams->compressionLevel = value; |
|
519 | 536 | } |
|
520 | if (CCtxParams->compressionLevel >= 0) return CCtxParams->compressionLevel; | |
|
537 | if (CCtxParams->compressionLevel >= 0) return (size_t)CCtxParams->compressionLevel; | |
|
521 | 538 | return 0; /* return type (size_t) cannot represent negative values */ |
|
522 | 539 | } |
|
523 | 540 | |
|
524 | 541 | case ZSTD_c_windowLog : |
|
525 | 542 | if (value!=0) /* 0 => use default */ |
|
526 | 543 | BOUNDCHECK(ZSTD_c_windowLog, value); |
|
527 | CCtxParams->cParams.windowLog = value; | |
|
544 | CCtxParams->cParams.windowLog = (U32)value; | |
|
528 | 545 | return CCtxParams->cParams.windowLog; |
|
529 | 546 | |
|
530 | 547 | case ZSTD_c_hashLog : |
|
531 | 548 | if (value!=0) /* 0 => use default */ |
|
532 | 549 | BOUNDCHECK(ZSTD_c_hashLog, value); |
|
533 | CCtxParams->cParams.hashLog = value; | |
|
550 | CCtxParams->cParams.hashLog = (U32)value; | |
|
534 | 551 | return CCtxParams->cParams.hashLog; |
|
535 | 552 | |
|
536 | 553 | case ZSTD_c_chainLog : |
|
537 | 554 | if (value!=0) /* 0 => use default */ |
|
538 | 555 | BOUNDCHECK(ZSTD_c_chainLog, value); |
|
539 | CCtxParams->cParams.chainLog = value; | |
|
556 | CCtxParams->cParams.chainLog = (U32)value; | |
|
540 | 557 | return CCtxParams->cParams.chainLog; |
|
541 | 558 | |
|
542 | 559 | case ZSTD_c_searchLog : |
|
543 | 560 | if (value!=0) /* 0 => use default */ |
|
544 | 561 | BOUNDCHECK(ZSTD_c_searchLog, value); |
|
545 | CCtxParams->cParams.searchLog = value; | |
|
546 | return value; | |
|
562 | CCtxParams->cParams.searchLog = (U32)value; | |
|
563 | return (size_t)value; | |
|
547 | 564 | |
|
548 | 565 | case ZSTD_c_minMatch : |
|
549 | 566 | if (value!=0) /* 0 => use default */ |
@@ -674,6 +691,12 size_t ZSTD_CCtxParams_setParameter(ZSTD | |||
|
674 | 691 | CCtxParams->targetCBlockSize = value; |
|
675 | 692 | return CCtxParams->targetCBlockSize; |
|
676 | 693 | |
|
694 | case ZSTD_c_srcSizeHint : | |
|
695 | if (value!=0) /* 0 ==> default */ | |
|
696 | BOUNDCHECK(ZSTD_c_srcSizeHint, value); | |
|
697 | CCtxParams->srcSizeHint = value; | |
|
698 | return CCtxParams->srcSizeHint; | |
|
699 | ||
|
677 | 700 | default: RETURN_ERROR(parameter_unsupported, "unknown parameter"); |
|
678 | 701 | } |
|
679 | 702 | } |
@@ -779,6 +802,9 size_t ZSTD_CCtxParams_getParameter( | |||
|
779 | 802 | case ZSTD_c_targetCBlockSize : |
|
780 | 803 | *value = (int)CCtxParams->targetCBlockSize; |
|
781 | 804 | break; |
|
805 | case ZSTD_c_srcSizeHint : | |
|
806 | *value = (int)CCtxParams->srcSizeHint; | |
|
807 | break; | |
|
782 | 808 | default: RETURN_ERROR(parameter_unsupported, "unknown parameter"); |
|
783 | 809 | } |
|
784 | 810 | return 0; |
@@ -1029,7 +1055,11 ZSTD_adjustCParams(ZSTD_compressionParam | |||
|
1029 | 1055 | ZSTD_compressionParameters ZSTD_getCParamsFromCCtxParams( |
|
1030 | 1056 | const ZSTD_CCtx_params* CCtxParams, U64 srcSizeHint, size_t dictSize) |
|
1031 | 1057 | { |
|
1032 | ZSTD_compressionParameters cParams = ZSTD_getCParams(CCtxParams->compressionLevel, srcSizeHint, dictSize); | |
|
1058 | ZSTD_compressionParameters cParams; | |
|
1059 | if (srcSizeHint == ZSTD_CONTENTSIZE_UNKNOWN && CCtxParams->srcSizeHint > 0) { | |
|
1060 | srcSizeHint = CCtxParams->srcSizeHint; | |
|
1061 | } | |
|
1062 | cParams = ZSTD_getCParams(CCtxParams->compressionLevel, srcSizeHint, dictSize); | |
|
1033 | 1063 | if (CCtxParams->ldmParams.enableLdm) cParams.windowLog = ZSTD_LDM_DEFAULT_WINDOW_LOG; |
|
1034 | 1064 | if (CCtxParams->cParams.windowLog) cParams.windowLog = CCtxParams->cParams.windowLog; |
|
1035 | 1065 | if (CCtxParams->cParams.hashLog) cParams.hashLog = CCtxParams->cParams.hashLog; |
@@ -1049,10 +1079,19 ZSTD_sizeof_matchState(const ZSTD_compre | |||
|
1049 | 1079 | size_t const chainSize = (cParams->strategy == ZSTD_fast) ? 0 : ((size_t)1 << cParams->chainLog); |
|
1050 | 1080 | size_t const hSize = ((size_t)1) << cParams->hashLog; |
|
1051 | 1081 | U32 const hashLog3 = (forCCtx && cParams->minMatch==3) ? MIN(ZSTD_HASHLOG3_MAX, cParams->windowLog) : 0; |
|
1052 | size_t const h3Size = ((size_t)1) << hashLog3; | |
|
1053 | size_t const tableSpace = (chainSize + hSize + h3Size) * sizeof(U32); | |
|
1054 | size_t const optPotentialSpace = ((MaxML+1) + (MaxLL+1) + (MaxOff+1) + (1<<Litbits)) * sizeof(U32) | |
|
1055 | + (ZSTD_OPT_NUM+1) * (sizeof(ZSTD_match_t)+sizeof(ZSTD_optimal_t)); | |
|
1082 | size_t const h3Size = hashLog3 ? ((size_t)1) << hashLog3 : 0; | |
|
1083 | /* We don't use ZSTD_cwksp_alloc_size() here because the tables aren't | |
|
1084 | * surrounded by redzones in ASAN. */ | |
|
1085 | size_t const tableSpace = chainSize * sizeof(U32) | |
|
1086 | + hSize * sizeof(U32) | |
|
1087 | + h3Size * sizeof(U32); | |
|
1088 | size_t const optPotentialSpace = | |
|
1089 | ZSTD_cwksp_alloc_size((MaxML+1) * sizeof(U32)) | |
|
1090 | + ZSTD_cwksp_alloc_size((MaxLL+1) * sizeof(U32)) | |
|
1091 | + ZSTD_cwksp_alloc_size((MaxOff+1) * sizeof(U32)) | |
|
1092 | + ZSTD_cwksp_alloc_size((1<<Litbits) * sizeof(U32)) | |
|
1093 | + ZSTD_cwksp_alloc_size((ZSTD_OPT_NUM+1) * sizeof(ZSTD_match_t)) | |
|
1094 | + ZSTD_cwksp_alloc_size((ZSTD_OPT_NUM+1) * sizeof(ZSTD_optimal_t)); | |
|
1056 | 1095 | size_t const optSpace = (forCCtx && (cParams->strategy >= ZSTD_btopt)) |
|
1057 | 1096 | ? optPotentialSpace |
|
1058 | 1097 | : 0; |
@@ -1069,20 +1108,23 size_t ZSTD_estimateCCtxSize_usingCCtxPa | |||
|
1069 | 1108 | size_t const blockSize = MIN(ZSTD_BLOCKSIZE_MAX, (size_t)1 << cParams.windowLog); |
|
1070 | 1109 | U32 const divider = (cParams.minMatch==3) ? 3 : 4; |
|
1071 | 1110 | size_t const maxNbSeq = blockSize / divider; |
|
1072 |
size_t const tokenSpace = WILDCOPY_OVERLENGTH + blockSize |
|
|
1073 | size_t const entropySpace = HUF_WORKSPACE_SIZE; | |
|
1074 | size_t const blockStateSpace = 2 * sizeof(ZSTD_compressedBlockState_t); | |
|
1111 | size_t const tokenSpace = ZSTD_cwksp_alloc_size(WILDCOPY_OVERLENGTH + blockSize) | |
|
1112 | + ZSTD_cwksp_alloc_size(maxNbSeq * sizeof(seqDef)) | |
|
1113 | + 3 * ZSTD_cwksp_alloc_size(maxNbSeq * sizeof(BYTE)); | |
|
1114 | size_t const entropySpace = ZSTD_cwksp_alloc_size(HUF_WORKSPACE_SIZE); | |
|
1115 | size_t const blockStateSpace = 2 * ZSTD_cwksp_alloc_size(sizeof(ZSTD_compressedBlockState_t)); | |
|
1075 | 1116 | size_t const matchStateSize = ZSTD_sizeof_matchState(&cParams, /* forCCtx */ 1); |
|
1076 | 1117 | |
|
1077 | 1118 | size_t const ldmSpace = ZSTD_ldm_getTableSize(params->ldmParams); |
|
1078 | size_t const ldmSeqSpace = ZSTD_ldm_getMaxNbSeq(params->ldmParams, blockSize) * sizeof(rawSeq); | |
|
1119 | size_t const ldmSeqSpace = ZSTD_cwksp_alloc_size(ZSTD_ldm_getMaxNbSeq(params->ldmParams, blockSize) * sizeof(rawSeq)); | |
|
1079 | 1120 | |
|
1080 | 1121 | size_t const neededSpace = entropySpace + blockStateSpace + tokenSpace + |
|
1081 | 1122 | matchStateSize + ldmSpace + ldmSeqSpace; |
|
1082 | ||
|
1083 | DEBUGLOG(5, "sizeof(ZSTD_CCtx) : %u", (U32)sizeof(ZSTD_CCtx)); | |
|
1084 |
DEBUGLOG(5, " |
|
|
1085 | return sizeof(ZSTD_CCtx) + neededSpace; | |
|
1123 | size_t const cctxSpace = ZSTD_cwksp_alloc_size(sizeof(ZSTD_CCtx)); | |
|
1124 | ||
|
1125 | DEBUGLOG(5, "sizeof(ZSTD_CCtx) : %u", (U32)cctxSpace); | |
|
1126 | DEBUGLOG(5, "estimate workspace : %u", (U32)neededSpace); | |
|
1127 | return cctxSpace + neededSpace; | |
|
1086 | 1128 | } |
|
1087 | 1129 | } |
|
1088 | 1130 | |
@@ -1118,7 +1160,8 size_t ZSTD_estimateCStreamSize_usingCCt | |||
|
1118 | 1160 | size_t const blockSize = MIN(ZSTD_BLOCKSIZE_MAX, (size_t)1 << cParams.windowLog); |
|
1119 | 1161 | size_t const inBuffSize = ((size_t)1 << cParams.windowLog) + blockSize; |
|
1120 | 1162 | size_t const outBuffSize = ZSTD_compressBound(blockSize) + 1; |
|
1121 |
size_t const streamingSize = inBuffSize |
|
|
1163 | size_t const streamingSize = ZSTD_cwksp_alloc_size(inBuffSize) | |
|
1164 | + ZSTD_cwksp_alloc_size(outBuffSize); | |
|
1122 | 1165 | |
|
1123 | 1166 | return CCtxSize + streamingSize; |
|
1124 | 1167 | } |
@@ -1186,17 +1229,6 size_t ZSTD_toFlushNow(ZSTD_CCtx* cctx) | |||
|
1186 | 1229 | return 0; /* over-simplification; could also check if context is currently running in streaming mode, and in which case, report how many bytes are left to be flushed within output buffer */ |
|
1187 | 1230 | } |
|
1188 | 1231 | |
|
1189 | ||
|
1190 | ||
|
1191 | static U32 ZSTD_equivalentCParams(ZSTD_compressionParameters cParams1, | |
|
1192 | ZSTD_compressionParameters cParams2) | |
|
1193 | { | |
|
1194 | return (cParams1.hashLog == cParams2.hashLog) | |
|
1195 | & (cParams1.chainLog == cParams2.chainLog) | |
|
1196 | & (cParams1.strategy == cParams2.strategy) /* opt parser space */ | |
|
1197 | & ((cParams1.minMatch==3) == (cParams2.minMatch==3)); /* hashlog3 space */ | |
|
1198 | } | |
|
1199 | ||
|
1200 | 1232 | static void ZSTD_assertEqualCParams(ZSTD_compressionParameters cParams1, |
|
1201 | 1233 | ZSTD_compressionParameters cParams2) |
|
1202 | 1234 | { |
@@ -1211,71 +1243,6 static void ZSTD_assertEqualCParams(ZSTD | |||
|
1211 | 1243 | assert(cParams1.strategy == cParams2.strategy); |
|
1212 | 1244 | } |
|
1213 | 1245 | |
|
1214 | /** The parameters are equivalent if ldm is not enabled in both sets or | |
|
1215 | * all the parameters are equivalent. */ | |
|
1216 | static U32 ZSTD_equivalentLdmParams(ldmParams_t ldmParams1, | |
|
1217 | ldmParams_t ldmParams2) | |
|
1218 | { | |
|
1219 | return (!ldmParams1.enableLdm && !ldmParams2.enableLdm) || | |
|
1220 | (ldmParams1.enableLdm == ldmParams2.enableLdm && | |
|
1221 | ldmParams1.hashLog == ldmParams2.hashLog && | |
|
1222 | ldmParams1.bucketSizeLog == ldmParams2.bucketSizeLog && | |
|
1223 | ldmParams1.minMatchLength == ldmParams2.minMatchLength && | |
|
1224 | ldmParams1.hashRateLog == ldmParams2.hashRateLog); | |
|
1225 | } | |
|
1226 | ||
|
1227 | typedef enum { ZSTDb_not_buffered, ZSTDb_buffered } ZSTD_buffered_policy_e; | |
|
1228 | ||
|
1229 | /* ZSTD_sufficientBuff() : | |
|
1230 | * check internal buffers exist for streaming if buffPol == ZSTDb_buffered . | |
|
1231 | * Note : they are assumed to be correctly sized if ZSTD_equivalentCParams()==1 */ | |
|
1232 | static U32 ZSTD_sufficientBuff(size_t bufferSize1, size_t maxNbSeq1, | |
|
1233 | size_t maxNbLit1, | |
|
1234 | ZSTD_buffered_policy_e buffPol2, | |
|
1235 | ZSTD_compressionParameters cParams2, | |
|
1236 | U64 pledgedSrcSize) | |
|
1237 | { | |
|
1238 | size_t const windowSize2 = MAX(1, (size_t)MIN(((U64)1 << cParams2.windowLog), pledgedSrcSize)); | |
|
1239 | size_t const blockSize2 = MIN(ZSTD_BLOCKSIZE_MAX, windowSize2); | |
|
1240 | size_t const maxNbSeq2 = blockSize2 / ((cParams2.minMatch == 3) ? 3 : 4); | |
|
1241 | size_t const maxNbLit2 = blockSize2; | |
|
1242 | size_t const neededBufferSize2 = (buffPol2==ZSTDb_buffered) ? windowSize2 + blockSize2 : 0; | |
|
1243 | DEBUGLOG(4, "ZSTD_sufficientBuff: is neededBufferSize2=%u <= bufferSize1=%u", | |
|
1244 | (U32)neededBufferSize2, (U32)bufferSize1); | |
|
1245 | DEBUGLOG(4, "ZSTD_sufficientBuff: is maxNbSeq2=%u <= maxNbSeq1=%u", | |
|
1246 | (U32)maxNbSeq2, (U32)maxNbSeq1); | |
|
1247 | DEBUGLOG(4, "ZSTD_sufficientBuff: is maxNbLit2=%u <= maxNbLit1=%u", | |
|
1248 | (U32)maxNbLit2, (U32)maxNbLit1); | |
|
1249 | return (maxNbLit2 <= maxNbLit1) | |
|
1250 | & (maxNbSeq2 <= maxNbSeq1) | |
|
1251 | & (neededBufferSize2 <= bufferSize1); | |
|
1252 | } | |
|
1253 | ||
|
1254 | /** Equivalence for resetCCtx purposes */ | |
|
1255 | static U32 ZSTD_equivalentParams(ZSTD_CCtx_params params1, | |
|
1256 | ZSTD_CCtx_params params2, | |
|
1257 | size_t buffSize1, | |
|
1258 | size_t maxNbSeq1, size_t maxNbLit1, | |
|
1259 | ZSTD_buffered_policy_e buffPol2, | |
|
1260 | U64 pledgedSrcSize) | |
|
1261 | { | |
|
1262 | DEBUGLOG(4, "ZSTD_equivalentParams: pledgedSrcSize=%u", (U32)pledgedSrcSize); | |
|
1263 | if (!ZSTD_equivalentCParams(params1.cParams, params2.cParams)) { | |
|
1264 | DEBUGLOG(4, "ZSTD_equivalentCParams() == 0"); | |
|
1265 | return 0; | |
|
1266 | } | |
|
1267 | if (!ZSTD_equivalentLdmParams(params1.ldmParams, params2.ldmParams)) { | |
|
1268 | DEBUGLOG(4, "ZSTD_equivalentLdmParams() == 0"); | |
|
1269 | return 0; | |
|
1270 | } | |
|
1271 | if (!ZSTD_sufficientBuff(buffSize1, maxNbSeq1, maxNbLit1, buffPol2, | |
|
1272 | params2.cParams, pledgedSrcSize)) { | |
|
1273 | DEBUGLOG(4, "ZSTD_sufficientBuff() == 0"); | |
|
1274 | return 0; | |
|
1275 | } | |
|
1276 | return 1; | |
|
1277 | } | |
|
1278 | ||
|
1279 | 1246 | static void ZSTD_reset_compressedBlockState(ZSTD_compressedBlockState_t* bs) |
|
1280 | 1247 | { |
|
1281 | 1248 | int i; |
@@ -1301,87 +1268,104 static void ZSTD_invalidateMatchState(ZS | |||
|
1301 | 1268 | ms->dictMatchState = NULL; |
|
1302 | 1269 | } |
|
1303 | 1270 | |
|
1304 | /*! ZSTD_continueCCtx() : | |
|
1305 | * reuse CCtx without reset (note : requires no dictionary) */ | |
|
1306 | static size_t ZSTD_continueCCtx(ZSTD_CCtx* cctx, ZSTD_CCtx_params params, U64 pledgedSrcSize) | |
|
1307 | { | |
|
1308 | size_t const windowSize = MAX(1, (size_t)MIN(((U64)1 << params.cParams.windowLog), pledgedSrcSize)); | |
|
1309 | size_t const blockSize = MIN(ZSTD_BLOCKSIZE_MAX, windowSize); | |
|
1310 | DEBUGLOG(4, "ZSTD_continueCCtx: re-use context in place"); | |
|
1311 | ||
|
1312 | cctx->blockSize = blockSize; /* previous block size could be different even for same windowLog, due to pledgedSrcSize */ | |
|
1313 | cctx->appliedParams = params; | |
|
1314 | cctx->blockState.matchState.cParams = params.cParams; | |
|
1315 | cctx->pledgedSrcSizePlusOne = pledgedSrcSize+1; | |
|
1316 | cctx->consumedSrcSize = 0; | |
|
1317 | cctx->producedCSize = 0; | |
|
1318 | if (pledgedSrcSize == ZSTD_CONTENTSIZE_UNKNOWN) | |
|
1319 | cctx->appliedParams.fParams.contentSizeFlag = 0; | |
|
1320 | DEBUGLOG(4, "pledged content size : %u ; flag : %u", | |
|
1321 | (U32)pledgedSrcSize, cctx->appliedParams.fParams.contentSizeFlag); | |
|
1322 | cctx->stage = ZSTDcs_init; | |
|
1323 | cctx->dictID = 0; | |
|
1324 | if (params.ldmParams.enableLdm) | |
|
1325 | ZSTD_window_clear(&cctx->ldmState.window); | |
|
1326 | ZSTD_referenceExternalSequences(cctx, NULL, 0); | |
|
1327 | ZSTD_invalidateMatchState(&cctx->blockState.matchState); | |
|
1328 | ZSTD_reset_compressedBlockState(cctx->blockState.prevCBlock); | |
|
1329 | XXH64_reset(&cctx->xxhState, 0); | |
|
1330 | return 0; | |
|
1331 | } | |
|
1332 | ||
|
1333 | typedef enum { ZSTDcrp_continue, ZSTDcrp_noMemset } ZSTD_compResetPolicy_e; | |
|
1334 | ||
|
1335 | typedef enum { ZSTD_resetTarget_CDict, ZSTD_resetTarget_CCtx } ZSTD_resetTarget_e; | |
|
1336 | ||
|
1337 | static void* | |
|
1271 | /** | |
|
1272 | * Indicates whether this compression proceeds directly from user-provided | |
|
1273 | * source buffer to user-provided destination buffer (ZSTDb_not_buffered), or | |
|
1274 | * whether the context needs to buffer the input/output (ZSTDb_buffered). | |
|
1275 | */ | |
|
1276 | typedef enum { | |
|
1277 | ZSTDb_not_buffered, | |
|
1278 | ZSTDb_buffered | |
|
1279 | } ZSTD_buffered_policy_e; | |
|
1280 | ||
|
1281 | /** | |
|
1282 | * Controls, for this matchState reset, whether the tables need to be cleared / | |
|
1283 | * prepared for the coming compression (ZSTDcrp_makeClean), or whether the | |
|
1284 | * tables can be left unclean (ZSTDcrp_leaveDirty), because we know that a | |
|
1285 | * subsequent operation will overwrite the table space anyways (e.g., copying | |
|
1286 | * the matchState contents in from a CDict). | |
|
1287 | */ | |
|
1288 | typedef enum { | |
|
1289 | ZSTDcrp_makeClean, | |
|
1290 | ZSTDcrp_leaveDirty | |
|
1291 | } ZSTD_compResetPolicy_e; | |
|
1292 | ||
|
1293 | /** | |
|
1294 | * Controls, for this matchState reset, whether indexing can continue where it | |
|
1295 | * left off (ZSTDirp_continue), or whether it needs to be restarted from zero | |
|
1296 | * (ZSTDirp_reset). | |
|
1297 | */ | |
|
1298 | typedef enum { | |
|
1299 | ZSTDirp_continue, | |
|
1300 | ZSTDirp_reset | |
|
1301 | } ZSTD_indexResetPolicy_e; | |
|
1302 | ||
|
1303 | typedef enum { | |
|
1304 | ZSTD_resetTarget_CDict, | |
|
1305 | ZSTD_resetTarget_CCtx | |
|
1306 | } ZSTD_resetTarget_e; | |
|
1307 | ||
|
1308 | static size_t | |
|
1338 | 1309 | ZSTD_reset_matchState(ZSTD_matchState_t* ms, |
|
1339 |
|
|
|
1310 | ZSTD_cwksp* ws, | |
|
1340 | 1311 | const ZSTD_compressionParameters* cParams, |
|
1341 |
|
|
|
1312 | const ZSTD_compResetPolicy_e crp, | |
|
1313 | const ZSTD_indexResetPolicy_e forceResetIndex, | |
|
1314 | const ZSTD_resetTarget_e forWho) | |
|
1342 | 1315 | { |
|
1343 | 1316 | size_t const chainSize = (cParams->strategy == ZSTD_fast) ? 0 : ((size_t)1 << cParams->chainLog); |
|
1344 | 1317 | size_t const hSize = ((size_t)1) << cParams->hashLog; |
|
1345 | 1318 | U32 const hashLog3 = ((forWho == ZSTD_resetTarget_CCtx) && cParams->minMatch==3) ? MIN(ZSTD_HASHLOG3_MAX, cParams->windowLog) : 0; |
|
1346 | size_t const h3Size = ((size_t)1) << hashLog3; | |
|
1347 | size_t const tableSpace = (chainSize + hSize + h3Size) * sizeof(U32); | |
|
1348 | ||
|
1349 | assert(((size_t)ptr & 3) == 0); | |
|
1319 | size_t const h3Size = hashLog3 ? ((size_t)1) << hashLog3 : 0; | |
|
1320 | ||
|
1321 | DEBUGLOG(4, "reset indices : %u", forceResetIndex == ZSTDirp_reset); | |
|
1322 | if (forceResetIndex == ZSTDirp_reset) { | |
|
1323 | memset(&ms->window, 0, sizeof(ms->window)); | |
|
1324 | ms->window.dictLimit = 1; /* start from 1, so that 1st position is valid */ | |
|
1325 | ms->window.lowLimit = 1; /* it ensures first and later CCtx usages compress the same */ | |
|
1326 | ms->window.nextSrc = ms->window.base + 1; /* see issue #1241 */ | |
|
1327 | ZSTD_cwksp_mark_tables_dirty(ws); | |
|
1328 | } | |
|
1350 | 1329 | |
|
1351 | 1330 | ms->hashLog3 = hashLog3; |
|
1352 | memset(&ms->window, 0, sizeof(ms->window)); | |
|
1353 | ms->window.dictLimit = 1; /* start from 1, so that 1st position is valid */ | |
|
1354 | ms->window.lowLimit = 1; /* it ensures first and later CCtx usages compress the same */ | |
|
1355 | ms->window.nextSrc = ms->window.base + 1; /* see issue #1241 */ | |
|
1331 | ||
|
1356 | 1332 | ZSTD_invalidateMatchState(ms); |
|
1357 | 1333 | |
|
1334 | assert(!ZSTD_cwksp_reserve_failed(ws)); /* check that allocation hasn't already failed */ | |
|
1335 | ||
|
1336 | ZSTD_cwksp_clear_tables(ws); | |
|
1337 | ||
|
1338 | DEBUGLOG(5, "reserving table space"); | |
|
1339 | /* table Space */ | |
|
1340 | ms->hashTable = (U32*)ZSTD_cwksp_reserve_table(ws, hSize * sizeof(U32)); | |
|
1341 | ms->chainTable = (U32*)ZSTD_cwksp_reserve_table(ws, chainSize * sizeof(U32)); | |
|
1342 | ms->hashTable3 = (U32*)ZSTD_cwksp_reserve_table(ws, h3Size * sizeof(U32)); | |
|
1343 | RETURN_ERROR_IF(ZSTD_cwksp_reserve_failed(ws), memory_allocation, | |
|
1344 | "failed a workspace allocation in ZSTD_reset_matchState"); | |
|
1345 | ||
|
1346 | DEBUGLOG(4, "reset table : %u", crp!=ZSTDcrp_leaveDirty); | |
|
1347 | if (crp!=ZSTDcrp_leaveDirty) { | |
|
1348 | /* reset tables only */ | |
|
1349 | ZSTD_cwksp_clean_tables(ws); | |
|
1350 | } | |
|
1351 | ||
|
1358 | 1352 | /* opt parser space */ |
|
1359 | 1353 | if ((forWho == ZSTD_resetTarget_CCtx) && (cParams->strategy >= ZSTD_btopt)) { |
|
1360 | 1354 | DEBUGLOG(4, "reserving optimal parser space"); |
|
1361 | ms->opt.litFreq = (unsigned*)ptr; | |
|
1362 | ms->opt.litLengthFreq = ms->opt.litFreq + (1<<Litbits); | |
|
1363 | ms->opt.matchLengthFreq = ms->opt.litLengthFreq + (MaxLL+1); | |
|
1364 | ms->opt.offCodeFreq = ms->opt.matchLengthFreq + (MaxML+1); | |
|
1365 | ptr = ms->opt.offCodeFreq + (MaxOff+1); | |
|
1366 | ms->opt.matchTable = (ZSTD_match_t*)ptr; | |
|
1367 | ptr = ms->opt.matchTable + ZSTD_OPT_NUM+1; | |
|
1368 | ms->opt.priceTable = (ZSTD_optimal_t*)ptr; | |
|
1369 | ptr = ms->opt.priceTable + ZSTD_OPT_NUM+1; | |
|
1355 | ms->opt.litFreq = (unsigned*)ZSTD_cwksp_reserve_aligned(ws, (1<<Litbits) * sizeof(unsigned)); | |
|
1356 | ms->opt.litLengthFreq = (unsigned*)ZSTD_cwksp_reserve_aligned(ws, (MaxLL+1) * sizeof(unsigned)); | |
|
1357 | ms->opt.matchLengthFreq = (unsigned*)ZSTD_cwksp_reserve_aligned(ws, (MaxML+1) * sizeof(unsigned)); | |
|
1358 | ms->opt.offCodeFreq = (unsigned*)ZSTD_cwksp_reserve_aligned(ws, (MaxOff+1) * sizeof(unsigned)); | |
|
1359 | ms->opt.matchTable = (ZSTD_match_t*)ZSTD_cwksp_reserve_aligned(ws, (ZSTD_OPT_NUM+1) * sizeof(ZSTD_match_t)); | |
|
1360 | ms->opt.priceTable = (ZSTD_optimal_t*)ZSTD_cwksp_reserve_aligned(ws, (ZSTD_OPT_NUM+1) * sizeof(ZSTD_optimal_t)); | |
|
1370 | 1361 | } |
|
1371 | 1362 | |
|
1372 | /* table Space */ | |
|
1373 | DEBUGLOG(4, "reset table : %u", crp!=ZSTDcrp_noMemset); | |
|
1374 | assert(((size_t)ptr & 3) == 0); /* ensure ptr is properly aligned */ | |
|
1375 | if (crp!=ZSTDcrp_noMemset) memset(ptr, 0, tableSpace); /* reset tables only */ | |
|
1376 | ms->hashTable = (U32*)(ptr); | |
|
1377 | ms->chainTable = ms->hashTable + hSize; | |
|
1378 | ms->hashTable3 = ms->chainTable + chainSize; | |
|
1379 | ptr = ms->hashTable3 + h3Size; | |
|
1380 | ||
|
1381 | 1363 | ms->cParams = *cParams; |
|
1382 | 1364 | |
|
1383 | assert(((size_t)ptr & 3) == 0); | |
|
1384 | return ptr; | |
|
1365 | RETURN_ERROR_IF(ZSTD_cwksp_reserve_failed(ws), memory_allocation, | |
|
1366 | "failed a workspace allocation in ZSTD_reset_matchState"); | |
|
1367 | ||
|
1368 | return 0; | |
|
1385 | 1369 | } |
|
1386 | 1370 | |
|
1387 | 1371 | /* ZSTD_indexTooCloseToMax() : |
@@ -1397,13 +1381,6 static int ZSTD_indexTooCloseToMax(ZSTD_ | |||
|
1397 | 1381 | return (size_t)(w.nextSrc - w.base) > (ZSTD_CURRENT_MAX - ZSTD_INDEXOVERFLOW_MARGIN); |
|
1398 | 1382 | } |
|
1399 | 1383 | |
|
1400 | #define ZSTD_WORKSPACETOOLARGE_FACTOR 3 /* define "workspace is too large" as this number of times larger than needed */ | |
|
1401 | #define ZSTD_WORKSPACETOOLARGE_MAXDURATION 128 /* when workspace is continuously too large | |
|
1402 | * during at least this number of times, | |
|
1403 | * context's memory usage is considered wasteful, | |
|
1404 | * because it's sized to handle a worst case scenario which rarely happens. | |
|
1405 | * In which case, resize it down to free some memory */ | |
|
1406 | ||
|
1407 | 1384 | /*! ZSTD_resetCCtx_internal() : |
|
1408 | 1385 | note : `params` are assumed fully validated at this stage */ |
|
1409 | 1386 | static size_t ZSTD_resetCCtx_internal(ZSTD_CCtx* zc, |
@@ -1412,30 +1389,12 static size_t ZSTD_resetCCtx_internal(ZS | |||
|
1412 | 1389 | ZSTD_compResetPolicy_e const crp, |
|
1413 | 1390 | ZSTD_buffered_policy_e const zbuff) |
|
1414 | 1391 | { |
|
1392 | ZSTD_cwksp* const ws = &zc->workspace; | |
|
1415 | 1393 | DEBUGLOG(4, "ZSTD_resetCCtx_internal: pledgedSrcSize=%u, wlog=%u", |
|
1416 | 1394 | (U32)pledgedSrcSize, params.cParams.windowLog); |
|
1417 | 1395 | assert(!ZSTD_isError(ZSTD_checkCParams(params.cParams))); |
|
1418 | 1396 | |
|
1419 | if (crp == ZSTDcrp_continue) { | |
|
1420 | if (ZSTD_equivalentParams(zc->appliedParams, params, | |
|
1421 | zc->inBuffSize, | |
|
1422 | zc->seqStore.maxNbSeq, zc->seqStore.maxNbLit, | |
|
1423 | zbuff, pledgedSrcSize) ) { | |
|
1424 | DEBUGLOG(4, "ZSTD_equivalentParams()==1 -> consider continue mode"); | |
|
1425 | zc->workSpaceOversizedDuration += (zc->workSpaceOversizedDuration > 0); /* if it was too large, it still is */ | |
|
1426 | if (zc->workSpaceOversizedDuration <= ZSTD_WORKSPACETOOLARGE_MAXDURATION) { | |
|
1427 | DEBUGLOG(4, "continue mode confirmed (wLog1=%u, blockSize1=%zu)", | |
|
1428 | zc->appliedParams.cParams.windowLog, zc->blockSize); | |
|
1429 | if (ZSTD_indexTooCloseToMax(zc->blockState.matchState.window)) { | |
|
1430 | /* prefer a reset, faster than a rescale */ | |
|
1431 | ZSTD_reset_matchState(&zc->blockState.matchState, | |
|
1432 | zc->entropyWorkspace + HUF_WORKSPACE_SIZE_U32, | |
|
1433 | ¶ms.cParams, | |
|
1434 | crp, ZSTD_resetTarget_CCtx); | |
|
1435 | } | |
|
1436 | return ZSTD_continueCCtx(zc, params, pledgedSrcSize); | |
|
1437 | } } } | |
|
1438 | DEBUGLOG(4, "ZSTD_equivalentParams()==0 -> reset CCtx"); | |
|
1397 | zc->isFirstBlock = 1; | |
|
1439 | 1398 | |
|
1440 | 1399 | if (params.ldmParams.enableLdm) { |
|
1441 | 1400 | /* Adjust long distance matching parameters */ |
@@ -1449,58 +1408,74 static size_t ZSTD_resetCCtx_internal(ZS | |||
|
1449 | 1408 | size_t const blockSize = MIN(ZSTD_BLOCKSIZE_MAX, windowSize); |
|
1450 | 1409 | U32 const divider = (params.cParams.minMatch==3) ? 3 : 4; |
|
1451 | 1410 | size_t const maxNbSeq = blockSize / divider; |
|
1452 |
size_t const tokenSpace = WILDCOPY_OVERLENGTH + blockSize |
|
|
1411 | size_t const tokenSpace = ZSTD_cwksp_alloc_size(WILDCOPY_OVERLENGTH + blockSize) | |
|
1412 | + ZSTD_cwksp_alloc_size(maxNbSeq * sizeof(seqDef)) | |
|
1413 | + 3 * ZSTD_cwksp_alloc_size(maxNbSeq * sizeof(BYTE)); | |
|
1453 | 1414 | size_t const buffOutSize = (zbuff==ZSTDb_buffered) ? ZSTD_compressBound(blockSize)+1 : 0; |
|
1454 | 1415 | size_t const buffInSize = (zbuff==ZSTDb_buffered) ? windowSize + blockSize : 0; |
|
1455 | 1416 | size_t const matchStateSize = ZSTD_sizeof_matchState(¶ms.cParams, /* forCCtx */ 1); |
|
1456 | 1417 | size_t const maxNbLdmSeq = ZSTD_ldm_getMaxNbSeq(params.ldmParams, blockSize); |
|
1457 | void* ptr; /* used to partition workSpace */ | |
|
1458 | ||
|
1459 | /* Check if workSpace is large enough, alloc a new one if needed */ | |
|
1460 | { size_t const entropySpace = HUF_WORKSPACE_SIZE; | |
|
1461 | size_t const blockStateSpace = 2 * sizeof(ZSTD_compressedBlockState_t); | |
|
1462 | size_t const bufferSpace = buffInSize + buffOutSize; | |
|
1418 | ||
|
1419 | ZSTD_indexResetPolicy_e needsIndexReset = ZSTDirp_continue; | |
|
1420 | ||
|
1421 | if (ZSTD_indexTooCloseToMax(zc->blockState.matchState.window)) { | |
|
1422 | needsIndexReset = ZSTDirp_reset; | |
|
1423 | } | |
|
1424 | ||
|
1425 | ZSTD_cwksp_bump_oversized_duration(ws, 0); | |
|
1426 | ||
|
1427 | /* Check if workspace is large enough, alloc a new one if needed */ | |
|
1428 | { size_t const cctxSpace = zc->staticSize ? ZSTD_cwksp_alloc_size(sizeof(ZSTD_CCtx)) : 0; | |
|
1429 | size_t const entropySpace = ZSTD_cwksp_alloc_size(HUF_WORKSPACE_SIZE); | |
|
1430 | size_t const blockStateSpace = 2 * ZSTD_cwksp_alloc_size(sizeof(ZSTD_compressedBlockState_t)); | |
|
1431 | size_t const bufferSpace = ZSTD_cwksp_alloc_size(buffInSize) + ZSTD_cwksp_alloc_size(buffOutSize); | |
|
1463 | 1432 | size_t const ldmSpace = ZSTD_ldm_getTableSize(params.ldmParams); |
|
1464 | size_t const ldmSeqSpace = maxNbLdmSeq * sizeof(rawSeq); | |
|
1465 | ||
|
1466 |
size_t const neededSpace = |
|
|
1467 | ldmSeqSpace + matchStateSize + tokenSpace + | |
|
1468 | bufferSpace; | |
|
1469 | ||
|
1470 | int const workSpaceTooSmall = zc->workSpaceSize < neededSpace; | |
|
1471 | int const workSpaceTooLarge = zc->workSpaceSize > ZSTD_WORKSPACETOOLARGE_FACTOR * neededSpace; | |
|
1472 | int const workSpaceWasteful = workSpaceTooLarge && (zc->workSpaceOversizedDuration > ZSTD_WORKSPACETOOLARGE_MAXDURATION); | |
|
1473 | zc->workSpaceOversizedDuration = workSpaceTooLarge ? zc->workSpaceOversizedDuration+1 : 0; | |
|
1433 | size_t const ldmSeqSpace = ZSTD_cwksp_alloc_size(maxNbLdmSeq * sizeof(rawSeq)); | |
|
1434 | ||
|
1435 | size_t const neededSpace = | |
|
1436 | cctxSpace + | |
|
1437 | entropySpace + | |
|
1438 | blockStateSpace + | |
|
1439 | ldmSpace + | |
|
1440 | ldmSeqSpace + | |
|
1441 | matchStateSize + | |
|
1442 | tokenSpace + | |
|
1443 | bufferSpace; | |
|
1444 | ||
|
1445 | int const workspaceTooSmall = ZSTD_cwksp_sizeof(ws) < neededSpace; | |
|
1446 | int const workspaceWasteful = ZSTD_cwksp_check_wasteful(ws, neededSpace); | |
|
1474 | 1447 | |
|
1475 | 1448 | DEBUGLOG(4, "Need %zuKB workspace, including %zuKB for match state, and %zuKB for buffers", |
|
1476 | 1449 | neededSpace>>10, matchStateSize>>10, bufferSpace>>10); |
|
1477 | 1450 | DEBUGLOG(4, "windowSize: %zu - blockSize: %zu", windowSize, blockSize); |
|
1478 | 1451 | |
|
1479 |
if (work |
|
|
1480 |
DEBUGLOG(4, "Resize work |
|
|
1481 |
|
|
|
1452 | if (workspaceTooSmall || workspaceWasteful) { | |
|
1453 | DEBUGLOG(4, "Resize workspaceSize from %zuKB to %zuKB", | |
|
1454 | ZSTD_cwksp_sizeof(ws) >> 10, | |
|
1482 | 1455 | neededSpace >> 10); |
|
1483 | 1456 | |
|
1484 | 1457 | RETURN_ERROR_IF(zc->staticSize, memory_allocation, "static cctx : no resize"); |
|
1485 | 1458 | |
|
1486 | zc->workSpaceSize = 0; | |
|
1487 | ZSTD_free(zc->workSpace, zc->customMem); | |
|
1488 |
|
|
|
1489 | RETURN_ERROR_IF(zc->workSpace == NULL, memory_allocation); | |
|
1490 | zc->workSpaceSize = neededSpace; | |
|
1491 | zc->workSpaceOversizedDuration = 0; | |
|
1492 | ||
|
1459 | needsIndexReset = ZSTDirp_reset; | |
|
1460 | ||
|
1461 | ZSTD_cwksp_free(ws, zc->customMem); | |
|
1462 | FORWARD_IF_ERROR(ZSTD_cwksp_create(ws, neededSpace, zc->customMem)); | |
|
1463 | ||
|
1464 | DEBUGLOG(5, "reserving object space"); | |
|
1493 | 1465 | /* Statically sized space. |
|
1494 | 1466 | * entropyWorkspace never moves, |
|
1495 | 1467 | * though prev/next block swap places */ |
|
1496 | assert(((size_t)zc->workSpace & 3) == 0); /* ensure correct alignment */ | |
|
1497 | assert(zc->workSpaceSize >= 2 * sizeof(ZSTD_compressedBlockState_t)); | |
|
1498 | zc->blockState.prevCBlock = (ZSTD_compressedBlockState_t*)zc->workSpace; | |
|
1499 | zc->blockState.nextCBlock = zc->blockState.prevCBlock + 1; | |
|
1500 | ptr = zc->blockState.nextCBlock + 1; | |
|
1501 |
zc->entropyWorkspace = (U32*) |
|
|
1468 | assert(ZSTD_cwksp_check_available(ws, 2 * sizeof(ZSTD_compressedBlockState_t))); | |
|
1469 | zc->blockState.prevCBlock = (ZSTD_compressedBlockState_t*) ZSTD_cwksp_reserve_object(ws, sizeof(ZSTD_compressedBlockState_t)); | |
|
1470 | RETURN_ERROR_IF(zc->blockState.prevCBlock == NULL, memory_allocation, "couldn't allocate prevCBlock"); | |
|
1471 | zc->blockState.nextCBlock = (ZSTD_compressedBlockState_t*) ZSTD_cwksp_reserve_object(ws, sizeof(ZSTD_compressedBlockState_t)); | |
|
1472 | RETURN_ERROR_IF(zc->blockState.nextCBlock == NULL, memory_allocation, "couldn't allocate nextCBlock"); | |
|
1473 | zc->entropyWorkspace = (U32*) ZSTD_cwksp_reserve_object(ws, HUF_WORKSPACE_SIZE); | |
|
1474 | RETURN_ERROR_IF(zc->blockState.nextCBlock == NULL, memory_allocation, "couldn't allocate entropyWorkspace"); | |
|
1502 | 1475 | } } |
|
1503 | 1476 | |
|
1477 | ZSTD_cwksp_clear(ws); | |
|
1478 | ||
|
1504 | 1479 | /* init params */ |
|
1505 | 1480 | zc->appliedParams = params; |
|
1506 | 1481 | zc->blockState.matchState.cParams = params.cParams; |
@@ -1519,58 +1494,58 static size_t ZSTD_resetCCtx_internal(ZS | |||
|
1519 | 1494 | |
|
1520 | 1495 | ZSTD_reset_compressedBlockState(zc->blockState.prevCBlock); |
|
1521 | 1496 | |
|
1522 | ptr = ZSTD_reset_matchState(&zc->blockState.matchState, | |
|
1523 | zc->entropyWorkspace + HUF_WORKSPACE_SIZE_U32, | |
|
1524 | ¶ms.cParams, | |
|
1525 | crp, ZSTD_resetTarget_CCtx); | |
|
1497 | /* ZSTD_wildcopy() is used to copy into the literals buffer, | |
|
1498 | * so we have to oversize the buffer by WILDCOPY_OVERLENGTH bytes. | |
|
1499 | */ | |
|
1500 | zc->seqStore.litStart = ZSTD_cwksp_reserve_buffer(ws, blockSize + WILDCOPY_OVERLENGTH); | |
|
1501 | zc->seqStore.maxNbLit = blockSize; | |
|
1502 | ||
|
1503 | /* buffers */ | |
|
1504 | zc->inBuffSize = buffInSize; | |
|
1505 | zc->inBuff = (char*)ZSTD_cwksp_reserve_buffer(ws, buffInSize); | |
|
1506 | zc->outBuffSize = buffOutSize; | |
|
1507 | zc->outBuff = (char*)ZSTD_cwksp_reserve_buffer(ws, buffOutSize); | |
|
1508 | ||
|
1509 | /* ldm bucketOffsets table */ | |
|
1510 | if (params.ldmParams.enableLdm) { | |
|
1511 | /* TODO: avoid memset? */ | |
|
1512 | size_t const ldmBucketSize = | |
|
1513 | ((size_t)1) << (params.ldmParams.hashLog - | |
|
1514 | params.ldmParams.bucketSizeLog); | |
|
1515 | zc->ldmState.bucketOffsets = ZSTD_cwksp_reserve_buffer(ws, ldmBucketSize); | |
|
1516 | memset(zc->ldmState.bucketOffsets, 0, ldmBucketSize); | |
|
1517 | } | |
|
1518 | ||
|
1519 | /* sequences storage */ | |
|
1520 | ZSTD_referenceExternalSequences(zc, NULL, 0); | |
|
1521 | zc->seqStore.maxNbSeq = maxNbSeq; | |
|
1522 | zc->seqStore.llCode = ZSTD_cwksp_reserve_buffer(ws, maxNbSeq * sizeof(BYTE)); | |
|
1523 | zc->seqStore.mlCode = ZSTD_cwksp_reserve_buffer(ws, maxNbSeq * sizeof(BYTE)); | |
|
1524 | zc->seqStore.ofCode = ZSTD_cwksp_reserve_buffer(ws, maxNbSeq * sizeof(BYTE)); | |
|
1525 | zc->seqStore.sequencesStart = (seqDef*)ZSTD_cwksp_reserve_aligned(ws, maxNbSeq * sizeof(seqDef)); | |
|
1526 | ||
|
1527 | FORWARD_IF_ERROR(ZSTD_reset_matchState( | |
|
1528 | &zc->blockState.matchState, | |
|
1529 | ws, | |
|
1530 | ¶ms.cParams, | |
|
1531 | crp, | |
|
1532 | needsIndexReset, | |
|
1533 | ZSTD_resetTarget_CCtx)); | |
|
1526 | 1534 | |
|
1527 | 1535 | /* ldm hash table */ |
|
1528 | /* initialize bucketOffsets table later for pointer alignment */ | |
|
1529 | 1536 | if (params.ldmParams.enableLdm) { |
|
1537 | /* TODO: avoid memset? */ | |
|
1530 | 1538 | size_t const ldmHSize = ((size_t)1) << params.ldmParams.hashLog; |
|
1531 |
|
|
|
1532 | assert(((size_t)ptr & 3) == 0); /* ensure ptr is properly aligned */ | |
|
1533 | zc->ldmState.hashTable = (ldmEntry_t*)ptr; | |
|
1534 | ptr = zc->ldmState.hashTable + ldmHSize; | |
|
1535 | zc->ldmSequences = (rawSeq*)ptr; | |
|
1536 | ptr = zc->ldmSequences + maxNbLdmSeq; | |
|
1539 | zc->ldmState.hashTable = (ldmEntry_t*)ZSTD_cwksp_reserve_aligned(ws, ldmHSize * sizeof(ldmEntry_t)); | |
|
1540 | memset(zc->ldmState.hashTable, 0, ldmHSize * sizeof(ldmEntry_t)); | |
|
1541 | zc->ldmSequences = (rawSeq*)ZSTD_cwksp_reserve_aligned(ws, maxNbLdmSeq * sizeof(rawSeq)); | |
|
1537 | 1542 | zc->maxNbLdmSequences = maxNbLdmSeq; |
|
1538 | 1543 | |
|
1539 | 1544 | memset(&zc->ldmState.window, 0, sizeof(zc->ldmState.window)); |
|
1540 | } | |
|
1541 | assert(((size_t)ptr & 3) == 0); /* ensure ptr is properly aligned */ | |
|
1542 | ||
|
1543 | /* sequences storage */ | |
|
1544 | zc->seqStore.maxNbSeq = maxNbSeq; | |
|
1545 | zc->seqStore.sequencesStart = (seqDef*)ptr; | |
|
1546 | ptr = zc->seqStore.sequencesStart + maxNbSeq; | |
|
1547 | zc->seqStore.llCode = (BYTE*) ptr; | |
|
1548 | zc->seqStore.mlCode = zc->seqStore.llCode + maxNbSeq; | |
|
1549 | zc->seqStore.ofCode = zc->seqStore.mlCode + maxNbSeq; | |
|
1550 | zc->seqStore.litStart = zc->seqStore.ofCode + maxNbSeq; | |
|
1551 | /* ZSTD_wildcopy() is used to copy into the literals buffer, | |
|
1552 | * so we have to oversize the buffer by WILDCOPY_OVERLENGTH bytes. | |
|
1553 | */ | |
|
1554 | zc->seqStore.maxNbLit = blockSize; | |
|
1555 | ptr = zc->seqStore.litStart + blockSize + WILDCOPY_OVERLENGTH; | |
|
1556 | ||
|
1557 | /* ldm bucketOffsets table */ | |
|
1558 | if (params.ldmParams.enableLdm) { | |
|
1559 | size_t const ldmBucketSize = | |
|
1560 | ((size_t)1) << (params.ldmParams.hashLog - | |
|
1561 | params.ldmParams.bucketSizeLog); | |
|
1562 | memset(ptr, 0, ldmBucketSize); | |
|
1563 | zc->ldmState.bucketOffsets = (BYTE*)ptr; | |
|
1564 | ptr = zc->ldmState.bucketOffsets + ldmBucketSize; | |
|
1565 | 1545 | ZSTD_window_clear(&zc->ldmState.window); |
|
1566 | 1546 | } |
|
1567 | ZSTD_referenceExternalSequences(zc, NULL, 0); | |
|
1568 | ||
|
1569 | /* buffers */ | |
|
1570 | zc->inBuffSize = buffInSize; | |
|
1571 | zc->inBuff = (char*)ptr; | |
|
1572 | zc->outBuffSize = buffOutSize; | |
|
1573 | zc->outBuff = zc->inBuff + buffInSize; | |
|
1547 | ||
|
1548 | DEBUGLOG(3, "wksp: finished allocating, %zd bytes remain available", ZSTD_cwksp_available_space(ws)); | |
|
1574 | 1549 | |
|
1575 | 1550 | return 0; |
|
1576 | 1551 | } |
@@ -1604,15 +1579,15 static const size_t attachDictSizeCutoff | |||
|
1604 | 1579 | }; |
|
1605 | 1580 | |
|
1606 | 1581 | static int ZSTD_shouldAttachDict(const ZSTD_CDict* cdict, |
|
1607 | ZSTD_CCtx_params params, | |
|
1582 | const ZSTD_CCtx_params* params, | |
|
1608 | 1583 | U64 pledgedSrcSize) |
|
1609 | 1584 | { |
|
1610 | 1585 | size_t cutoff = attachDictSizeCutoffs[cdict->matchState.cParams.strategy]; |
|
1611 | 1586 | return ( pledgedSrcSize <= cutoff |
|
1612 | 1587 | || pledgedSrcSize == ZSTD_CONTENTSIZE_UNKNOWN |
|
1613 |
|| params |
|
|
1614 |
&& params |
|
|
1615 |
&& !params |
|
|
1588 | || params->attachDictPref == ZSTD_dictForceAttach ) | |
|
1589 | && params->attachDictPref != ZSTD_dictForceCopy | |
|
1590 | && !params->forceWindow; /* dictMatchState isn't correctly | |
|
1616 | 1591 | * handled in _enforceMaxDist */ |
|
1617 | 1592 | } |
|
1618 | 1593 | |
@@ -1630,8 +1605,8 ZSTD_resetCCtx_byAttachingCDict(ZSTD_CCt | |||
|
1630 | 1605 | * has its own tables. */ |
|
1631 | 1606 | params.cParams = ZSTD_adjustCParams_internal(*cdict_cParams, pledgedSrcSize, 0); |
|
1632 | 1607 | params.cParams.windowLog = windowLog; |
|
1633 | ZSTD_resetCCtx_internal(cctx, params, pledgedSrcSize, | |
|
1634 |
ZSTDcrp_ |
|
|
1608 | FORWARD_IF_ERROR(ZSTD_resetCCtx_internal(cctx, params, pledgedSrcSize, | |
|
1609 | ZSTDcrp_makeClean, zbuff)); | |
|
1635 | 1610 | assert(cctx->appliedParams.cParams.strategy == cdict_cParams->strategy); |
|
1636 | 1611 | } |
|
1637 | 1612 | |
@@ -1679,30 +1654,36 static size_t ZSTD_resetCCtx_byCopyingCD | |||
|
1679 | 1654 | /* Copy only compression parameters related to tables. */ |
|
1680 | 1655 | params.cParams = *cdict_cParams; |
|
1681 | 1656 | params.cParams.windowLog = windowLog; |
|
1682 | ZSTD_resetCCtx_internal(cctx, params, pledgedSrcSize, | |
|
1683 |
ZSTDcrp_ |
|
|
1657 | FORWARD_IF_ERROR(ZSTD_resetCCtx_internal(cctx, params, pledgedSrcSize, | |
|
1658 | ZSTDcrp_leaveDirty, zbuff)); | |
|
1684 | 1659 | assert(cctx->appliedParams.cParams.strategy == cdict_cParams->strategy); |
|
1685 | 1660 | assert(cctx->appliedParams.cParams.hashLog == cdict_cParams->hashLog); |
|
1686 | 1661 | assert(cctx->appliedParams.cParams.chainLog == cdict_cParams->chainLog); |
|
1687 | 1662 | } |
|
1688 | 1663 | |
|
1664 | ZSTD_cwksp_mark_tables_dirty(&cctx->workspace); | |
|
1665 | ||
|
1689 | 1666 | /* copy tables */ |
|
1690 | 1667 | { size_t const chainSize = (cdict_cParams->strategy == ZSTD_fast) ? 0 : ((size_t)1 << cdict_cParams->chainLog); |
|
1691 | 1668 | size_t const hSize = (size_t)1 << cdict_cParams->hashLog; |
|
1692 | size_t const tableSpace = (chainSize + hSize) * sizeof(U32); | |
|
1693 | assert((U32*)cctx->blockState.matchState.chainTable == (U32*)cctx->blockState.matchState.hashTable + hSize); /* chainTable must follow hashTable */ | |
|
1694 | assert((U32*)cctx->blockState.matchState.hashTable3 == (U32*)cctx->blockState.matchState.chainTable + chainSize); | |
|
1695 | assert((U32*)cdict->matchState.chainTable == (U32*)cdict->matchState.hashTable + hSize); /* chainTable must follow hashTable */ | |
|
1696 | assert((U32*)cdict->matchState.hashTable3 == (U32*)cdict->matchState.chainTable + chainSize); | |
|
1697 | memcpy(cctx->blockState.matchState.hashTable, cdict->matchState.hashTable, tableSpace); /* presumes all tables follow each other */ | |
|
1669 | ||
|
1670 | memcpy(cctx->blockState.matchState.hashTable, | |
|
1671 | cdict->matchState.hashTable, | |
|
1672 | hSize * sizeof(U32)); | |
|
1673 | memcpy(cctx->blockState.matchState.chainTable, | |
|
1674 | cdict->matchState.chainTable, | |
|
1675 | chainSize * sizeof(U32)); | |
|
1698 | 1676 | } |
|
1699 | 1677 | |
|
1700 | 1678 | /* Zero the hashTable3, since the cdict never fills it */ |
|
1701 |
{ |
|
|
1679 | { int const h3log = cctx->blockState.matchState.hashLog3; | |
|
1680 | size_t const h3Size = h3log ? ((size_t)1 << h3log) : 0; | |
|
1702 | 1681 | assert(cdict->matchState.hashLog3 == 0); |
|
1703 | 1682 | memset(cctx->blockState.matchState.hashTable3, 0, h3Size * sizeof(U32)); |
|
1704 | 1683 | } |
|
1705 | 1684 | |
|
1685 | ZSTD_cwksp_mark_tables_clean(&cctx->workspace); | |
|
1686 | ||
|
1706 | 1687 | /* copy dictionary offsets */ |
|
1707 | 1688 | { ZSTD_matchState_t const* srcMatchState = &cdict->matchState; |
|
1708 | 1689 | ZSTD_matchState_t* dstMatchState = &cctx->blockState.matchState; |
@@ -1724,7 +1705,7 static size_t ZSTD_resetCCtx_byCopyingCD | |||
|
1724 | 1705 | * in-place. We decide here which strategy to use. */ |
|
1725 | 1706 | static size_t ZSTD_resetCCtx_usingCDict(ZSTD_CCtx* cctx, |
|
1726 | 1707 | const ZSTD_CDict* cdict, |
|
1727 | ZSTD_CCtx_params params, | |
|
1708 | const ZSTD_CCtx_params* params, | |
|
1728 | 1709 | U64 pledgedSrcSize, |
|
1729 | 1710 | ZSTD_buffered_policy_e zbuff) |
|
1730 | 1711 | { |
@@ -1734,10 +1715,10 static size_t ZSTD_resetCCtx_usingCDict( | |||
|
1734 | 1715 | |
|
1735 | 1716 | if (ZSTD_shouldAttachDict(cdict, params, pledgedSrcSize)) { |
|
1736 | 1717 | return ZSTD_resetCCtx_byAttachingCDict( |
|
1737 | cctx, cdict, params, pledgedSrcSize, zbuff); | |
|
1718 | cctx, cdict, *params, pledgedSrcSize, zbuff); | |
|
1738 | 1719 | } else { |
|
1739 | 1720 | return ZSTD_resetCCtx_byCopyingCDict( |
|
1740 | cctx, cdict, params, pledgedSrcSize, zbuff); | |
|
1721 | cctx, cdict, *params, pledgedSrcSize, zbuff); | |
|
1741 | 1722 | } |
|
1742 | 1723 | } |
|
1743 | 1724 | |
@@ -1763,7 +1744,7 static size_t ZSTD_copyCCtx_internal(ZST | |||
|
1763 | 1744 | params.cParams = srcCCtx->appliedParams.cParams; |
|
1764 | 1745 | params.fParams = fParams; |
|
1765 | 1746 | ZSTD_resetCCtx_internal(dstCCtx, params, pledgedSrcSize, |
|
1766 |
ZSTDcrp_ |
|
|
1747 | ZSTDcrp_leaveDirty, zbuff); | |
|
1767 | 1748 | assert(dstCCtx->appliedParams.cParams.windowLog == srcCCtx->appliedParams.cParams.windowLog); |
|
1768 | 1749 | assert(dstCCtx->appliedParams.cParams.strategy == srcCCtx->appliedParams.cParams.strategy); |
|
1769 | 1750 | assert(dstCCtx->appliedParams.cParams.hashLog == srcCCtx->appliedParams.cParams.hashLog); |
@@ -1771,16 +1752,27 static size_t ZSTD_copyCCtx_internal(ZST | |||
|
1771 | 1752 | assert(dstCCtx->blockState.matchState.hashLog3 == srcCCtx->blockState.matchState.hashLog3); |
|
1772 | 1753 | } |
|
1773 | 1754 | |
|
1755 | ZSTD_cwksp_mark_tables_dirty(&dstCCtx->workspace); | |
|
1756 | ||
|
1774 | 1757 | /* copy tables */ |
|
1775 | 1758 | { size_t const chainSize = (srcCCtx->appliedParams.cParams.strategy == ZSTD_fast) ? 0 : ((size_t)1 << srcCCtx->appliedParams.cParams.chainLog); |
|
1776 | 1759 | size_t const hSize = (size_t)1 << srcCCtx->appliedParams.cParams.hashLog; |
|
1777 |
|
|
|
1778 | size_t const tableSpace = (chainSize + hSize + h3Size) * sizeof(U32); | |
|
1779 | assert((U32*)dstCCtx->blockState.matchState.chainTable == (U32*)dstCCtx->blockState.matchState.hashTable + hSize); /* chainTable must follow hashTable */ | |
|
1780 | assert((U32*)dstCCtx->blockState.matchState.hashTable3 == (U32*)dstCCtx->blockState.matchState.chainTable + chainSize); | |
|
1781 | memcpy(dstCCtx->blockState.matchState.hashTable, srcCCtx->blockState.matchState.hashTable, tableSpace); /* presumes all tables follow each other */ | |
|
1760 | int const h3log = srcCCtx->blockState.matchState.hashLog3; | |
|
1761 | size_t const h3Size = h3log ? ((size_t)1 << h3log) : 0; | |
|
1762 | ||
|
1763 | memcpy(dstCCtx->blockState.matchState.hashTable, | |
|
1764 | srcCCtx->blockState.matchState.hashTable, | |
|
1765 | hSize * sizeof(U32)); | |
|
1766 | memcpy(dstCCtx->blockState.matchState.chainTable, | |
|
1767 | srcCCtx->blockState.matchState.chainTable, | |
|
1768 | chainSize * sizeof(U32)); | |
|
1769 | memcpy(dstCCtx->blockState.matchState.hashTable3, | |
|
1770 | srcCCtx->blockState.matchState.hashTable3, | |
|
1771 | h3Size * sizeof(U32)); | |
|
1782 | 1772 | } |
|
1783 | 1773 | |
|
1774 | ZSTD_cwksp_mark_tables_clean(&dstCCtx->workspace); | |
|
1775 | ||
|
1784 | 1776 | /* copy dictionary offsets */ |
|
1785 | 1777 | { |
|
1786 | 1778 | const ZSTD_matchState_t* srcMatchState = &srcCCtx->blockState.matchState; |
@@ -1831,6 +1823,20 ZSTD_reduceTable_internal (U32* const ta | |||
|
1831 | 1823 | int rowNb; |
|
1832 | 1824 | assert((size & (ZSTD_ROWSIZE-1)) == 0); /* multiple of ZSTD_ROWSIZE */ |
|
1833 | 1825 | assert(size < (1U<<31)); /* can be casted to int */ |
|
1826 | ||
|
1827 | #if defined (MEMORY_SANITIZER) && !defined (ZSTD_MSAN_DONT_POISON_WORKSPACE) | |
|
1828 | /* To validate that the table re-use logic is sound, and that we don't | |
|
1829 | * access table space that we haven't cleaned, we re-"poison" the table | |
|
1830 | * space every time we mark it dirty. | |
|
1831 | * | |
|
1832 | * This function however is intended to operate on those dirty tables and | |
|
1833 | * re-clean them. So when this function is used correctly, we can unpoison | |
|
1834 | * the memory it operated on. This introduces a blind spot though, since | |
|
1835 | * if we now try to operate on __actually__ poisoned memory, we will not | |
|
1836 | * detect that. */ | |
|
1837 | __msan_unpoison(table, size * sizeof(U32)); | |
|
1838 | #endif | |
|
1839 | ||
|
1834 | 1840 | for (rowNb=0 ; rowNb < nbRows ; rowNb++) { |
|
1835 | 1841 | int column; |
|
1836 | 1842 | for (column=0; column<ZSTD_ROWSIZE; column++) { |
@@ -1938,7 +1944,7 ZSTD_compressSequences_internal(seqStore | |||
|
1938 | 1944 | ZSTD_entropyCTables_t* nextEntropy, |
|
1939 | 1945 | const ZSTD_CCtx_params* cctxParams, |
|
1940 | 1946 | void* dst, size_t dstCapacity, |
|
1941 |
void* |
|
|
1947 | void* entropyWorkspace, size_t entropyWkspSize, | |
|
1942 | 1948 | const int bmi2) |
|
1943 | 1949 | { |
|
1944 | 1950 | const int longOffsets = cctxParams->cParams.windowLog > STREAM_ACCUMULATOR_MIN; |
@@ -1971,7 +1977,7 ZSTD_compressSequences_internal(seqStore | |||
|
1971 | 1977 | ZSTD_disableLiteralsCompression(cctxParams), |
|
1972 | 1978 | op, dstCapacity, |
|
1973 | 1979 | literals, litSize, |
|
1974 |
|
|
|
1980 | entropyWorkspace, entropyWkspSize, | |
|
1975 | 1981 | bmi2); |
|
1976 | 1982 | FORWARD_IF_ERROR(cSize); |
|
1977 | 1983 | assert(cSize <= dstCapacity); |
@@ -1981,12 +1987,17 ZSTD_compressSequences_internal(seqStore | |||
|
1981 | 1987 | /* Sequences Header */ |
|
1982 | 1988 | RETURN_ERROR_IF((oend-op) < 3 /*max nbSeq Size*/ + 1 /*seqHead*/, |
|
1983 | 1989 | dstSize_tooSmall); |
|
1984 |
if (nbSeq < |
|
|
1990 | if (nbSeq < 128) { | |
|
1985 | 1991 | *op++ = (BYTE)nbSeq; |
|
1986 | else if (nbSeq < LONGNBSEQ) | |
|
1987 |
op[0] = (BYTE)((nbSeq>>8) + 0x80) |
|
|
1988 | else | |
|
1989 | op[0]=0xFF, MEM_writeLE16(op+1, (U16)(nbSeq - LONGNBSEQ)), op+=3; | |
|
1992 | } else if (nbSeq < LONGNBSEQ) { | |
|
1993 | op[0] = (BYTE)((nbSeq>>8) + 0x80); | |
|
1994 | op[1] = (BYTE)nbSeq; | |
|
1995 | op+=2; | |
|
1996 | } else { | |
|
1997 | op[0]=0xFF; | |
|
1998 | MEM_writeLE16(op+1, (U16)(nbSeq - LONGNBSEQ)); | |
|
1999 | op+=3; | |
|
2000 | } | |
|
1990 | 2001 | assert(op <= oend); |
|
1991 | 2002 | if (nbSeq==0) { |
|
1992 | 2003 | /* Copy the old tables over as if we repeated them */ |
@@ -2002,7 +2013,7 ZSTD_compressSequences_internal(seqStore | |||
|
2002 | 2013 | ZSTD_seqToCodes(seqStorePtr); |
|
2003 | 2014 | /* build CTable for Literal Lengths */ |
|
2004 | 2015 | { unsigned max = MaxLL; |
|
2005 |
size_t const mostFrequent = HIST_countFast_wksp(count, &max, llCodeTable, nbSeq, |
|
|
2016 | size_t const mostFrequent = HIST_countFast_wksp(count, &max, llCodeTable, nbSeq, entropyWorkspace, entropyWkspSize); /* can't fail */ | |
|
2006 | 2017 | DEBUGLOG(5, "Building LL table"); |
|
2007 | 2018 | nextEntropy->fse.litlength_repeatMode = prevEntropy->fse.litlength_repeatMode; |
|
2008 | 2019 | LLtype = ZSTD_selectEncodingType(&nextEntropy->fse.litlength_repeatMode, |
@@ -2012,10 +2023,14 ZSTD_compressSequences_internal(seqStore | |||
|
2012 | 2023 | ZSTD_defaultAllowed, strategy); |
|
2013 | 2024 | assert(set_basic < set_compressed && set_rle < set_compressed); |
|
2014 | 2025 | assert(!(LLtype < set_compressed && nextEntropy->fse.litlength_repeatMode != FSE_repeat_none)); /* We don't copy tables */ |
|
2015 | { size_t const countSize = ZSTD_buildCTable(op, (size_t)(oend - op), CTable_LitLength, LLFSELog, (symbolEncodingType_e)LLtype, | |
|
2016 | count, max, llCodeTable, nbSeq, LL_defaultNorm, LL_defaultNormLog, MaxLL, | |
|
2017 | prevEntropy->fse.litlengthCTable, sizeof(prevEntropy->fse.litlengthCTable), | |
|
2018 | workspace, wkspSize); | |
|
2026 | { size_t const countSize = ZSTD_buildCTable( | |
|
2027 | op, (size_t)(oend - op), | |
|
2028 | CTable_LitLength, LLFSELog, (symbolEncodingType_e)LLtype, | |
|
2029 | count, max, llCodeTable, nbSeq, | |
|
2030 | LL_defaultNorm, LL_defaultNormLog, MaxLL, | |
|
2031 | prevEntropy->fse.litlengthCTable, | |
|
2032 | sizeof(prevEntropy->fse.litlengthCTable), | |
|
2033 | entropyWorkspace, entropyWkspSize); | |
|
2019 | 2034 | FORWARD_IF_ERROR(countSize); |
|
2020 | 2035 | if (LLtype == set_compressed) |
|
2021 | 2036 | lastNCount = op; |
@@ -2024,7 +2039,8 ZSTD_compressSequences_internal(seqStore | |||
|
2024 | 2039 | } } |
|
2025 | 2040 | /* build CTable for Offsets */ |
|
2026 | 2041 | { unsigned max = MaxOff; |
|
2027 |
size_t const mostFrequent = HIST_countFast_wksp( |
|
|
2042 | size_t const mostFrequent = HIST_countFast_wksp( | |
|
2043 | count, &max, ofCodeTable, nbSeq, entropyWorkspace, entropyWkspSize); /* can't fail */ | |
|
2028 | 2044 | /* We can only use the basic table if max <= DefaultMaxOff, otherwise the offsets are too large */ |
|
2029 | 2045 | ZSTD_defaultPolicy_e const defaultPolicy = (max <= DefaultMaxOff) ? ZSTD_defaultAllowed : ZSTD_defaultDisallowed; |
|
2030 | 2046 | DEBUGLOG(5, "Building OF table"); |
@@ -2035,10 +2051,14 ZSTD_compressSequences_internal(seqStore | |||
|
2035 | 2051 | OF_defaultNorm, OF_defaultNormLog, |
|
2036 | 2052 | defaultPolicy, strategy); |
|
2037 | 2053 | assert(!(Offtype < set_compressed && nextEntropy->fse.offcode_repeatMode != FSE_repeat_none)); /* We don't copy tables */ |
|
2038 | { size_t const countSize = ZSTD_buildCTable(op, (size_t)(oend - op), CTable_OffsetBits, OffFSELog, (symbolEncodingType_e)Offtype, | |
|
2039 | count, max, ofCodeTable, nbSeq, OF_defaultNorm, OF_defaultNormLog, DefaultMaxOff, | |
|
2040 | prevEntropy->fse.offcodeCTable, sizeof(prevEntropy->fse.offcodeCTable), | |
|
2041 | workspace, wkspSize); | |
|
2054 | { size_t const countSize = ZSTD_buildCTable( | |
|
2055 | op, (size_t)(oend - op), | |
|
2056 | CTable_OffsetBits, OffFSELog, (symbolEncodingType_e)Offtype, | |
|
2057 | count, max, ofCodeTable, nbSeq, | |
|
2058 | OF_defaultNorm, OF_defaultNormLog, DefaultMaxOff, | |
|
2059 | prevEntropy->fse.offcodeCTable, | |
|
2060 | sizeof(prevEntropy->fse.offcodeCTable), | |
|
2061 | entropyWorkspace, entropyWkspSize); | |
|
2042 | 2062 | FORWARD_IF_ERROR(countSize); |
|
2043 | 2063 | if (Offtype == set_compressed) |
|
2044 | 2064 | lastNCount = op; |
@@ -2047,7 +2067,8 ZSTD_compressSequences_internal(seqStore | |||
|
2047 | 2067 | } } |
|
2048 | 2068 | /* build CTable for MatchLengths */ |
|
2049 | 2069 | { unsigned max = MaxML; |
|
2050 |
size_t const mostFrequent = HIST_countFast_wksp( |
|
|
2070 | size_t const mostFrequent = HIST_countFast_wksp( | |
|
2071 | count, &max, mlCodeTable, nbSeq, entropyWorkspace, entropyWkspSize); /* can't fail */ | |
|
2051 | 2072 | DEBUGLOG(5, "Building ML table (remaining space : %i)", (int)(oend-op)); |
|
2052 | 2073 | nextEntropy->fse.matchlength_repeatMode = prevEntropy->fse.matchlength_repeatMode; |
|
2053 | 2074 | MLtype = ZSTD_selectEncodingType(&nextEntropy->fse.matchlength_repeatMode, |
@@ -2056,10 +2077,14 ZSTD_compressSequences_internal(seqStore | |||
|
2056 | 2077 | ML_defaultNorm, ML_defaultNormLog, |
|
2057 | 2078 | ZSTD_defaultAllowed, strategy); |
|
2058 | 2079 | assert(!(MLtype < set_compressed && nextEntropy->fse.matchlength_repeatMode != FSE_repeat_none)); /* We don't copy tables */ |
|
2059 | { size_t const countSize = ZSTD_buildCTable(op, (size_t)(oend - op), CTable_MatchLength, MLFSELog, (symbolEncodingType_e)MLtype, | |
|
2060 | count, max, mlCodeTable, nbSeq, ML_defaultNorm, ML_defaultNormLog, MaxML, | |
|
2061 | prevEntropy->fse.matchlengthCTable, sizeof(prevEntropy->fse.matchlengthCTable), | |
|
2062 | workspace, wkspSize); | |
|
2080 | { size_t const countSize = ZSTD_buildCTable( | |
|
2081 | op, (size_t)(oend - op), | |
|
2082 | CTable_MatchLength, MLFSELog, (symbolEncodingType_e)MLtype, | |
|
2083 | count, max, mlCodeTable, nbSeq, | |
|
2084 | ML_defaultNorm, ML_defaultNormLog, MaxML, | |
|
2085 | prevEntropy->fse.matchlengthCTable, | |
|
2086 | sizeof(prevEntropy->fse.matchlengthCTable), | |
|
2087 | entropyWorkspace, entropyWkspSize); | |
|
2063 | 2088 | FORWARD_IF_ERROR(countSize); |
|
2064 | 2089 | if (MLtype == set_compressed) |
|
2065 | 2090 | lastNCount = op; |
@@ -2107,13 +2132,13 ZSTD_compressSequences(seqStore_t* seqSt | |||
|
2107 | 2132 | const ZSTD_CCtx_params* cctxParams, |
|
2108 | 2133 | void* dst, size_t dstCapacity, |
|
2109 | 2134 | size_t srcSize, |
|
2110 |
void* |
|
|
2135 | void* entropyWorkspace, size_t entropyWkspSize, | |
|
2111 | 2136 | int bmi2) |
|
2112 | 2137 | { |
|
2113 | 2138 | size_t const cSize = ZSTD_compressSequences_internal( |
|
2114 | 2139 | seqStorePtr, prevEntropy, nextEntropy, cctxParams, |
|
2115 | 2140 | dst, dstCapacity, |
|
2116 |
|
|
|
2141 | entropyWorkspace, entropyWkspSize, bmi2); | |
|
2117 | 2142 | if (cSize == 0) return 0; |
|
2118 | 2143 | /* When srcSize <= dstCapacity, there is enough space to write a raw uncompressed block. |
|
2119 | 2144 | * Since we ran out of space, block must be not compressible, so fall back to raw uncompressed block. |
@@ -2264,11 +2289,99 static size_t ZSTD_buildSeqStore(ZSTD_CC | |||
|
2264 | 2289 | return ZSTDbss_compress; |
|
2265 | 2290 | } |
|
2266 | 2291 | |
|
2292 | static void ZSTD_copyBlockSequences(ZSTD_CCtx* zc) | |
|
2293 | { | |
|
2294 | const seqStore_t* seqStore = ZSTD_getSeqStore(zc); | |
|
2295 | const seqDef* seqs = seqStore->sequencesStart; | |
|
2296 | size_t seqsSize = seqStore->sequences - seqs; | |
|
2297 | ||
|
2298 | ZSTD_Sequence* outSeqs = &zc->seqCollector.seqStart[zc->seqCollector.seqIndex]; | |
|
2299 | size_t i; size_t position; int repIdx; | |
|
2300 | ||
|
2301 | assert(zc->seqCollector.seqIndex + 1 < zc->seqCollector.maxSequences); | |
|
2302 | for (i = 0, position = 0; i < seqsSize; ++i) { | |
|
2303 | outSeqs[i].offset = seqs[i].offset; | |
|
2304 | outSeqs[i].litLength = seqs[i].litLength; | |
|
2305 | outSeqs[i].matchLength = seqs[i].matchLength + MINMATCH; | |
|
2306 | ||
|
2307 | if (i == seqStore->longLengthPos) { | |
|
2308 | if (seqStore->longLengthID == 1) { | |
|
2309 | outSeqs[i].litLength += 0x10000; | |
|
2310 | } else if (seqStore->longLengthID == 2) { | |
|
2311 | outSeqs[i].matchLength += 0x10000; | |
|
2312 | } | |
|
2313 | } | |
|
2314 | ||
|
2315 | if (outSeqs[i].offset <= ZSTD_REP_NUM) { | |
|
2316 | outSeqs[i].rep = outSeqs[i].offset; | |
|
2317 | repIdx = (unsigned int)i - outSeqs[i].offset; | |
|
2318 | ||
|
2319 | if (outSeqs[i].litLength == 0) { | |
|
2320 | if (outSeqs[i].offset < 3) { | |
|
2321 | --repIdx; | |
|
2322 | } else { | |
|
2323 | repIdx = (unsigned int)i - 1; | |
|
2324 | } | |
|
2325 | ++outSeqs[i].rep; | |
|
2326 | } | |
|
2327 | assert(repIdx >= -3); | |
|
2328 | outSeqs[i].offset = repIdx >= 0 ? outSeqs[repIdx].offset : repStartValue[-repIdx - 1]; | |
|
2329 | if (outSeqs[i].rep == 4) { | |
|
2330 | --outSeqs[i].offset; | |
|
2331 | } | |
|
2332 | } else { | |
|
2333 | outSeqs[i].offset -= ZSTD_REP_NUM; | |
|
2334 | } | |
|
2335 | ||
|
2336 | position += outSeqs[i].litLength; | |
|
2337 | outSeqs[i].matchPos = (unsigned int)position; | |
|
2338 | position += outSeqs[i].matchLength; | |
|
2339 | } | |
|
2340 | zc->seqCollector.seqIndex += seqsSize; | |
|
2341 | } | |
|
2342 | ||
|
2343 | size_t ZSTD_getSequences(ZSTD_CCtx* zc, ZSTD_Sequence* outSeqs, | |
|
2344 | size_t outSeqsSize, const void* src, size_t srcSize) | |
|
2345 | { | |
|
2346 | const size_t dstCapacity = ZSTD_compressBound(srcSize); | |
|
2347 | void* dst = ZSTD_malloc(dstCapacity, ZSTD_defaultCMem); | |
|
2348 | SeqCollector seqCollector; | |
|
2349 | ||
|
2350 | RETURN_ERROR_IF(dst == NULL, memory_allocation); | |
|
2351 | ||
|
2352 | seqCollector.collectSequences = 1; | |
|
2353 | seqCollector.seqStart = outSeqs; | |
|
2354 | seqCollector.seqIndex = 0; | |
|
2355 | seqCollector.maxSequences = outSeqsSize; | |
|
2356 | zc->seqCollector = seqCollector; | |
|
2357 | ||
|
2358 | ZSTD_compress2(zc, dst, dstCapacity, src, srcSize); | |
|
2359 | ZSTD_free(dst, ZSTD_defaultCMem); | |
|
2360 | return zc->seqCollector.seqIndex; | |
|
2361 | } | |
|
2362 | ||
|
2363 | /* Returns true if the given block is a RLE block */ | |
|
2364 | static int ZSTD_isRLE(const BYTE *ip, size_t length) { | |
|
2365 | size_t i; | |
|
2366 | if (length < 2) return 1; | |
|
2367 | for (i = 1; i < length; ++i) { | |
|
2368 | if (ip[0] != ip[i]) return 0; | |
|
2369 | } | |
|
2370 | return 1; | |
|
2371 | } | |
|
2372 | ||
|
2267 | 2373 | static size_t ZSTD_compressBlock_internal(ZSTD_CCtx* zc, |
|
2268 | 2374 | void* dst, size_t dstCapacity, |
|
2269 | const void* src, size_t srcSize) | |
|
2375 | const void* src, size_t srcSize, U32 frame) | |
|
2270 | 2376 | { |
|
2377 | /* This the upper bound for the length of an rle block. | |
|
2378 | * This isn't the actual upper bound. Finding the real threshold | |
|
2379 | * needs further investigation. | |
|
2380 | */ | |
|
2381 | const U32 rleMaxLength = 25; | |
|
2271 | 2382 | size_t cSize; |
|
2383 | const BYTE* ip = (const BYTE*)src; | |
|
2384 | BYTE* op = (BYTE*)dst; | |
|
2272 | 2385 | DEBUGLOG(5, "ZSTD_compressBlock_internal (dstCapacity=%u, dictLimit=%u, nextToUpdate=%u)", |
|
2273 | 2386 | (unsigned)dstCapacity, (unsigned)zc->blockState.matchState.window.dictLimit, |
|
2274 | 2387 | (unsigned)zc->blockState.matchState.nextToUpdate); |
@@ -2278,6 +2391,11 static size_t ZSTD_compressBlock_interna | |||
|
2278 | 2391 | if (bss == ZSTDbss_noCompress) { cSize = 0; goto out; } |
|
2279 | 2392 | } |
|
2280 | 2393 | |
|
2394 | if (zc->seqCollector.collectSequences) { | |
|
2395 | ZSTD_copyBlockSequences(zc); | |
|
2396 | return 0; | |
|
2397 | } | |
|
2398 | ||
|
2281 | 2399 | /* encode sequences and literals */ |
|
2282 | 2400 | cSize = ZSTD_compressSequences(&zc->seqStore, |
|
2283 | 2401 | &zc->blockState.prevCBlock->entropy, &zc->blockState.nextCBlock->entropy, |
@@ -2287,8 +2405,21 static size_t ZSTD_compressBlock_interna | |||
|
2287 | 2405 | zc->entropyWorkspace, HUF_WORKSPACE_SIZE /* statically allocated in resetCCtx */, |
|
2288 | 2406 | zc->bmi2); |
|
2289 | 2407 | |
|
2408 | if (frame && | |
|
2409 | /* We don't want to emit our first block as a RLE even if it qualifies because | |
|
2410 | * doing so will cause the decoder (cli only) to throw a "should consume all input error." | |
|
2411 | * This is only an issue for zstd <= v1.4.3 | |
|
2412 | */ | |
|
2413 | !zc->isFirstBlock && | |
|
2414 | cSize < rleMaxLength && | |
|
2415 | ZSTD_isRLE(ip, srcSize)) | |
|
2416 | { | |
|
2417 | cSize = 1; | |
|
2418 | op[0] = ip[0]; | |
|
2419 | } | |
|
2420 | ||
|
2290 | 2421 | out: |
|
2291 |
if (!ZSTD_isError(cSize) && cSize |
|
|
2422 | if (!ZSTD_isError(cSize) && cSize > 1) { | |
|
2292 | 2423 | /* confirm repcodes and entropy tables when emitting a compressed block */ |
|
2293 | 2424 | ZSTD_compressedBlockState_t* const tmp = zc->blockState.prevCBlock; |
|
2294 | 2425 | zc->blockState.prevCBlock = zc->blockState.nextCBlock; |
@@ -2305,7 +2436,11 out: | |||
|
2305 | 2436 | } |
|
2306 | 2437 | |
|
2307 | 2438 | |
|
2308 |
static void ZSTD_overflowCorrectIfNeeded(ZSTD_matchState_t* ms, |
|
|
2439 | static void ZSTD_overflowCorrectIfNeeded(ZSTD_matchState_t* ms, | |
|
2440 | ZSTD_cwksp* ws, | |
|
2441 | ZSTD_CCtx_params const* params, | |
|
2442 | void const* ip, | |
|
2443 | void const* iend) | |
|
2309 | 2444 | { |
|
2310 | 2445 | if (ZSTD_window_needOverflowCorrection(ms->window, iend)) { |
|
2311 | 2446 | U32 const maxDist = (U32)1 << params->cParams.windowLog; |
@@ -2314,7 +2449,9 static void ZSTD_overflowCorrectIfNeeded | |||
|
2314 | 2449 | ZSTD_STATIC_ASSERT(ZSTD_CHAINLOG_MAX <= 30); |
|
2315 | 2450 | ZSTD_STATIC_ASSERT(ZSTD_WINDOWLOG_MAX_32 <= 30); |
|
2316 | 2451 | ZSTD_STATIC_ASSERT(ZSTD_WINDOWLOG_MAX <= 31); |
|
2452 | ZSTD_cwksp_mark_tables_dirty(ws); | |
|
2317 | 2453 | ZSTD_reduceIndex(ms, params, correction); |
|
2454 | ZSTD_cwksp_mark_tables_clean(ws); | |
|
2318 | 2455 | if (ms->nextToUpdate < correction) ms->nextToUpdate = 0; |
|
2319 | 2456 | else ms->nextToUpdate -= correction; |
|
2320 | 2457 | /* invalidate dictionaries on overflow correction */ |
@@ -2323,7 +2460,6 static void ZSTD_overflowCorrectIfNeeded | |||
|
2323 | 2460 | } |
|
2324 | 2461 | } |
|
2325 | 2462 | |
|
2326 | ||
|
2327 | 2463 | /*! ZSTD_compress_frameChunk() : |
|
2328 | 2464 | * Compress a chunk of data into one or multiple blocks. |
|
2329 | 2465 | * All blocks will be terminated, all input will be consumed. |
@@ -2357,7 +2493,8 static size_t ZSTD_compress_frameChunk ( | |||
|
2357 | 2493 | "not enough space to store compressed block"); |
|
2358 | 2494 | if (remaining < blockSize) blockSize = remaining; |
|
2359 | 2495 | |
|
2360 |
ZSTD_overflowCorrectIfNeeded( |
|
|
2496 | ZSTD_overflowCorrectIfNeeded( | |
|
2497 | ms, &cctx->workspace, &cctx->appliedParams, ip, ip + blockSize); | |
|
2361 | 2498 | ZSTD_checkDictValidity(&ms->window, ip + blockSize, maxDist, &ms->loadedDictEnd, &ms->dictMatchState); |
|
2362 | 2499 | |
|
2363 | 2500 | /* Ensure hash/chain table insertion resumes no sooner than lowlimit */ |
@@ -2365,15 +2502,16 static size_t ZSTD_compress_frameChunk ( | |||
|
2365 | 2502 | |
|
2366 | 2503 | { size_t cSize = ZSTD_compressBlock_internal(cctx, |
|
2367 | 2504 | op+ZSTD_blockHeaderSize, dstCapacity-ZSTD_blockHeaderSize, |
|
2368 | ip, blockSize); | |
|
2505 | ip, blockSize, 1 /* frame */); | |
|
2369 | 2506 | FORWARD_IF_ERROR(cSize); |
|
2370 | ||
|
2371 | 2507 | if (cSize == 0) { /* block is not compressible */ |
|
2372 | 2508 | cSize = ZSTD_noCompressBlock(op, dstCapacity, ip, blockSize, lastBlock); |
|
2373 | 2509 | FORWARD_IF_ERROR(cSize); |
|
2374 | 2510 | } else { |
|
2375 | U32 const cBlockHeader24 = lastBlock + (((U32)bt_compressed)<<1) + (U32)(cSize << 3); | |
|
2376 | MEM_writeLE24(op, cBlockHeader24); | |
|
2511 | const U32 cBlockHeader = cSize == 1 ? | |
|
2512 | lastBlock + (((U32)bt_rle)<<1) + (U32)(blockSize << 3) : | |
|
2513 | lastBlock + (((U32)bt_compressed)<<1) + (U32)(cSize << 3); | |
|
2514 | MEM_writeLE24(op, cBlockHeader); | |
|
2377 | 2515 | cSize += ZSTD_blockHeaderSize; |
|
2378 | 2516 | } |
|
2379 | 2517 | |
@@ -2383,6 +2521,7 static size_t ZSTD_compress_frameChunk ( | |||
|
2383 | 2521 | op += cSize; |
|
2384 | 2522 | assert(dstCapacity >= cSize); |
|
2385 | 2523 | dstCapacity -= cSize; |
|
2524 | cctx->isFirstBlock = 0; | |
|
2386 | 2525 | DEBUGLOG(5, "ZSTD_compress_frameChunk: adding a block of size %u", |
|
2387 | 2526 | (unsigned)cSize); |
|
2388 | 2527 | } } |
@@ -2393,25 +2532,25 static size_t ZSTD_compress_frameChunk ( | |||
|
2393 | 2532 | |
|
2394 | 2533 | |
|
2395 | 2534 | static size_t ZSTD_writeFrameHeader(void* dst, size_t dstCapacity, |
|
2396 | ZSTD_CCtx_params params, U64 pledgedSrcSize, U32 dictID) | |
|
2535 | const ZSTD_CCtx_params* params, U64 pledgedSrcSize, U32 dictID) | |
|
2397 | 2536 | { BYTE* const op = (BYTE*)dst; |
|
2398 | 2537 | U32 const dictIDSizeCodeLength = (dictID>0) + (dictID>=256) + (dictID>=65536); /* 0-3 */ |
|
2399 |
U32 const dictIDSizeCode = params |
|
|
2400 |
U32 const checksumFlag = params |
|
|
2401 |
U32 const windowSize = (U32)1 << params |
|
|
2402 |
U32 const singleSegment = params |
|
|
2403 |
BYTE const windowLogByte = (BYTE)((params |
|
|
2404 |
U32 const fcsCode = params |
|
|
2538 | U32 const dictIDSizeCode = params->fParams.noDictIDFlag ? 0 : dictIDSizeCodeLength; /* 0-3 */ | |
|
2539 | U32 const checksumFlag = params->fParams.checksumFlag>0; | |
|
2540 | U32 const windowSize = (U32)1 << params->cParams.windowLog; | |
|
2541 | U32 const singleSegment = params->fParams.contentSizeFlag && (windowSize >= pledgedSrcSize); | |
|
2542 | BYTE const windowLogByte = (BYTE)((params->cParams.windowLog - ZSTD_WINDOWLOG_ABSOLUTEMIN) << 3); | |
|
2543 | U32 const fcsCode = params->fParams.contentSizeFlag ? | |
|
2405 | 2544 | (pledgedSrcSize>=256) + (pledgedSrcSize>=65536+256) + (pledgedSrcSize>=0xFFFFFFFFU) : 0; /* 0-3 */ |
|
2406 | 2545 | BYTE const frameHeaderDescriptionByte = (BYTE)(dictIDSizeCode + (checksumFlag<<2) + (singleSegment<<5) + (fcsCode<<6) ); |
|
2407 | 2546 | size_t pos=0; |
|
2408 | 2547 | |
|
2409 |
assert(!(params |
|
|
2548 | assert(!(params->fParams.contentSizeFlag && pledgedSrcSize == ZSTD_CONTENTSIZE_UNKNOWN)); | |
|
2410 | 2549 | RETURN_ERROR_IF(dstCapacity < ZSTD_FRAMEHEADERSIZE_MAX, dstSize_tooSmall); |
|
2411 | 2550 | DEBUGLOG(4, "ZSTD_writeFrameHeader : dictIDFlag : %u ; dictID : %u ; dictIDSizeCode : %u", |
|
2412 |
!params |
|
|
2413 | ||
|
2414 |
if (params |
|
|
2551 | !params->fParams.noDictIDFlag, (unsigned)dictID, (unsigned)dictIDSizeCode); | |
|
2552 | ||
|
2553 | if (params->format == ZSTD_f_zstd1) { | |
|
2415 | 2554 | MEM_writeLE32(dst, ZSTD_MAGICNUMBER); |
|
2416 | 2555 | pos = 4; |
|
2417 | 2556 | } |
@@ -2477,7 +2616,7 static size_t ZSTD_compressContinue_inte | |||
|
2477 | 2616 | "missing init (ZSTD_compressBegin)"); |
|
2478 | 2617 | |
|
2479 | 2618 | if (frame && (cctx->stage==ZSTDcs_init)) { |
|
2480 | fhSize = ZSTD_writeFrameHeader(dst, dstCapacity, cctx->appliedParams, | |
|
2619 | fhSize = ZSTD_writeFrameHeader(dst, dstCapacity, &cctx->appliedParams, | |
|
2481 | 2620 | cctx->pledgedSrcSizePlusOne-1, cctx->dictID); |
|
2482 | 2621 | FORWARD_IF_ERROR(fhSize); |
|
2483 | 2622 | assert(fhSize <= dstCapacity); |
@@ -2497,13 +2636,15 static size_t ZSTD_compressContinue_inte | |||
|
2497 | 2636 | |
|
2498 | 2637 | if (!frame) { |
|
2499 | 2638 | /* overflow check and correction for block mode */ |
|
2500 | ZSTD_overflowCorrectIfNeeded(ms, &cctx->appliedParams, src, (BYTE const*)src + srcSize); | |
|
2639 | ZSTD_overflowCorrectIfNeeded( | |
|
2640 | ms, &cctx->workspace, &cctx->appliedParams, | |
|
2641 | src, (BYTE const*)src + srcSize); | |
|
2501 | 2642 | } |
|
2502 | 2643 | |
|
2503 | 2644 | DEBUGLOG(5, "ZSTD_compressContinue_internal (blockSize=%u)", (unsigned)cctx->blockSize); |
|
2504 | 2645 | { size_t const cSize = frame ? |
|
2505 | 2646 | ZSTD_compress_frameChunk (cctx, dst, dstCapacity, src, srcSize, lastFrameChunk) : |
|
2506 | ZSTD_compressBlock_internal (cctx, dst, dstCapacity, src, srcSize); | |
|
2647 | ZSTD_compressBlock_internal (cctx, dst, dstCapacity, src, srcSize, 0 /* frame */); | |
|
2507 | 2648 | FORWARD_IF_ERROR(cSize); |
|
2508 | 2649 | cctx->consumedSrcSize += srcSize; |
|
2509 | 2650 | cctx->producedCSize += (cSize + fhSize); |
@@ -2550,6 +2691,7 size_t ZSTD_compressBlock(ZSTD_CCtx* cct | |||
|
2550 | 2691 | * @return : 0, or an error code |
|
2551 | 2692 | */ |
|
2552 | 2693 | static size_t ZSTD_loadDictionaryContent(ZSTD_matchState_t* ms, |
|
2694 | ZSTD_cwksp* ws, | |
|
2553 | 2695 | ZSTD_CCtx_params const* params, |
|
2554 | 2696 | const void* src, size_t srcSize, |
|
2555 | 2697 | ZSTD_dictTableLoadMethod_e dtlm) |
@@ -2570,7 +2712,7 static size_t ZSTD_loadDictionaryContent | |||
|
2570 | 2712 | size_t const chunk = MIN(remaining, ZSTD_CHUNKSIZE_MAX); |
|
2571 | 2713 | const BYTE* const ichunk = ip + chunk; |
|
2572 | 2714 | |
|
2573 | ZSTD_overflowCorrectIfNeeded(ms, params, ip, ichunk); | |
|
2715 | ZSTD_overflowCorrectIfNeeded(ms, ws, params, ip, ichunk); | |
|
2574 | 2716 | |
|
2575 | 2717 | switch(params->cParams.strategy) |
|
2576 | 2718 | { |
@@ -2629,10 +2771,11 static size_t ZSTD_checkDictNCount(short | |||
|
2629 | 2771 | /*! ZSTD_loadZstdDictionary() : |
|
2630 | 2772 | * @return : dictID, or an error code |
|
2631 | 2773 | * assumptions : magic number supposed already checked |
|
2632 | * dictSize supposed > 8 | |
|
2774 | * dictSize supposed >= 8 | |
|
2633 | 2775 | */ |
|
2634 | 2776 | static size_t ZSTD_loadZstdDictionary(ZSTD_compressedBlockState_t* bs, |
|
2635 | 2777 | ZSTD_matchState_t* ms, |
|
2778 | ZSTD_cwksp* ws, | |
|
2636 | 2779 | ZSTD_CCtx_params const* params, |
|
2637 | 2780 | const void* dict, size_t dictSize, |
|
2638 | 2781 | ZSTD_dictTableLoadMethod_e dtlm, |
@@ -2645,7 +2788,7 static size_t ZSTD_loadZstdDictionary(ZS | |||
|
2645 | 2788 | size_t dictID; |
|
2646 | 2789 | |
|
2647 | 2790 | ZSTD_STATIC_ASSERT(HUF_WORKSPACE_SIZE >= (1<<MAX(MLFSELog,LLFSELog))); |
|
2648 | assert(dictSize > 8); | |
|
2791 | assert(dictSize >= 8); | |
|
2649 | 2792 | assert(MEM_readLE32(dictPtr) == ZSTD_MAGIC_DICTIONARY); |
|
2650 | 2793 | |
|
2651 | 2794 | dictPtr += 4; /* skip magic number */ |
@@ -2728,7 +2871,8 static size_t ZSTD_loadZstdDictionary(ZS | |||
|
2728 | 2871 | bs->entropy.fse.offcode_repeatMode = FSE_repeat_valid; |
|
2729 | 2872 | bs->entropy.fse.matchlength_repeatMode = FSE_repeat_valid; |
|
2730 | 2873 | bs->entropy.fse.litlength_repeatMode = FSE_repeat_valid; |
|
2731 |
FORWARD_IF_ERROR(ZSTD_loadDictionaryContent( |
|
|
2874 | FORWARD_IF_ERROR(ZSTD_loadDictionaryContent( | |
|
2875 | ms, ws, params, dictPtr, dictContentSize, dtlm)); | |
|
2732 | 2876 | return dictID; |
|
2733 | 2877 | } |
|
2734 | 2878 | } |
@@ -2738,6 +2882,7 static size_t ZSTD_loadZstdDictionary(ZS | |||
|
2738 | 2882 | static size_t |
|
2739 | 2883 | ZSTD_compress_insertDictionary(ZSTD_compressedBlockState_t* bs, |
|
2740 | 2884 | ZSTD_matchState_t* ms, |
|
2885 | ZSTD_cwksp* ws, | |
|
2741 | 2886 | const ZSTD_CCtx_params* params, |
|
2742 | 2887 | const void* dict, size_t dictSize, |
|
2743 | 2888 | ZSTD_dictContentType_e dictContentType, |
@@ -2745,27 +2890,35 ZSTD_compress_insertDictionary(ZSTD_comp | |||
|
2745 | 2890 | void* workspace) |
|
2746 | 2891 | { |
|
2747 | 2892 | DEBUGLOG(4, "ZSTD_compress_insertDictionary (dictSize=%u)", (U32)dictSize); |
|
2748 |
if ((dict==NULL) || (dictSize< |
|
|
2893 | if ((dict==NULL) || (dictSize<8)) { | |
|
2894 | RETURN_ERROR_IF(dictContentType == ZSTD_dct_fullDict, dictionary_wrong); | |
|
2895 | return 0; | |
|
2896 | } | |
|
2749 | 2897 | |
|
2750 | 2898 | ZSTD_reset_compressedBlockState(bs); |
|
2751 | 2899 | |
|
2752 | 2900 | /* dict restricted modes */ |
|
2753 | 2901 | if (dictContentType == ZSTD_dct_rawContent) |
|
2754 | return ZSTD_loadDictionaryContent(ms, params, dict, dictSize, dtlm); | |
|
2902 | return ZSTD_loadDictionaryContent(ms, ws, params, dict, dictSize, dtlm); | |
|
2755 | 2903 | |
|
2756 | 2904 | if (MEM_readLE32(dict) != ZSTD_MAGIC_DICTIONARY) { |
|
2757 | 2905 | if (dictContentType == ZSTD_dct_auto) { |
|
2758 | 2906 | DEBUGLOG(4, "raw content dictionary detected"); |
|
2759 |
return ZSTD_loadDictionaryContent( |
|
|
2907 | return ZSTD_loadDictionaryContent( | |
|
2908 | ms, ws, params, dict, dictSize, dtlm); | |
|
2760 | 2909 | } |
|
2761 | 2910 | RETURN_ERROR_IF(dictContentType == ZSTD_dct_fullDict, dictionary_wrong); |
|
2762 | 2911 | assert(0); /* impossible */ |
|
2763 | 2912 | } |
|
2764 | 2913 | |
|
2765 | 2914 | /* dict as full zstd dictionary */ |
|
2766 | return ZSTD_loadZstdDictionary(bs, ms, params, dict, dictSize, dtlm, workspace); | |
|
2915 | return ZSTD_loadZstdDictionary( | |
|
2916 | bs, ms, ws, params, dict, dictSize, dtlm, workspace); | |
|
2767 | 2917 | } |
|
2768 | 2918 | |
|
2919 | #define ZSTD_USE_CDICT_PARAMS_SRCSIZE_CUTOFF (128 KB) | |
|
2920 | #define ZSTD_USE_CDICT_PARAMS_DICTSIZE_MULTIPLIER (6) | |
|
2921 | ||
|
2769 | 2922 | /*! ZSTD_compressBegin_internal() : |
|
2770 | 2923 | * @return : 0, or an error code */ |
|
2771 | 2924 | static size_t ZSTD_compressBegin_internal(ZSTD_CCtx* cctx, |
@@ -2773,23 +2926,34 static size_t ZSTD_compressBegin_interna | |||
|
2773 | 2926 | ZSTD_dictContentType_e dictContentType, |
|
2774 | 2927 | ZSTD_dictTableLoadMethod_e dtlm, |
|
2775 | 2928 | const ZSTD_CDict* cdict, |
|
2776 | ZSTD_CCtx_params params, U64 pledgedSrcSize, | |
|
2929 | const ZSTD_CCtx_params* params, U64 pledgedSrcSize, | |
|
2777 | 2930 | ZSTD_buffered_policy_e zbuff) |
|
2778 | 2931 | { |
|
2779 |
DEBUGLOG(4, "ZSTD_compressBegin_internal: wlog=%u", params |
|
|
2932 | DEBUGLOG(4, "ZSTD_compressBegin_internal: wlog=%u", params->cParams.windowLog); | |
|
2780 | 2933 | /* params are supposed to be fully validated at this point */ |
|
2781 |
assert(!ZSTD_isError(ZSTD_checkCParams(params |
|
|
2934 | assert(!ZSTD_isError(ZSTD_checkCParams(params->cParams))); | |
|
2782 | 2935 | assert(!((dict) && (cdict))); /* either dict or cdict, not both */ |
|
2783 | ||
|
2784 |
|
|
|
2936 | if ( (cdict) | |
|
2937 | && (cdict->dictContentSize > 0) | |
|
2938 | && ( pledgedSrcSize < ZSTD_USE_CDICT_PARAMS_SRCSIZE_CUTOFF | |
|
2939 | || pledgedSrcSize < cdict->dictContentSize * ZSTD_USE_CDICT_PARAMS_DICTSIZE_MULTIPLIER | |
|
2940 | || pledgedSrcSize == ZSTD_CONTENTSIZE_UNKNOWN | |
|
2941 | || cdict->compressionLevel == 0) | |
|
2942 | && (params->attachDictPref != ZSTD_dictForceLoad) ) { | |
|
2785 | 2943 | return ZSTD_resetCCtx_usingCDict(cctx, cdict, params, pledgedSrcSize, zbuff); |
|
2786 | 2944 | } |
|
2787 | 2945 | |
|
2788 | FORWARD_IF_ERROR( ZSTD_resetCCtx_internal(cctx, params, pledgedSrcSize, | |
|
2789 |
ZSTDcrp_ |
|
|
2790 |
{ size_t const dictID = |
|
|
2791 | cctx->blockState.prevCBlock, &cctx->blockState.matchState, | |
|
2792 | ¶ms, dict, dictSize, dictContentType, dtlm, cctx->entropyWorkspace); | |
|
2946 | FORWARD_IF_ERROR( ZSTD_resetCCtx_internal(cctx, *params, pledgedSrcSize, | |
|
2947 | ZSTDcrp_makeClean, zbuff) ); | |
|
2948 | { size_t const dictID = cdict ? | |
|
2949 | ZSTD_compress_insertDictionary( | |
|
2950 | cctx->blockState.prevCBlock, &cctx->blockState.matchState, | |
|
2951 | &cctx->workspace, params, cdict->dictContent, cdict->dictContentSize, | |
|
2952 | dictContentType, dtlm, cctx->entropyWorkspace) | |
|
2953 | : ZSTD_compress_insertDictionary( | |
|
2954 | cctx->blockState.prevCBlock, &cctx->blockState.matchState, | |
|
2955 | &cctx->workspace, params, dict, dictSize, | |
|
2956 | dictContentType, dtlm, cctx->entropyWorkspace); | |
|
2793 | 2957 | FORWARD_IF_ERROR(dictID); |
|
2794 | 2958 | assert(dictID <= UINT_MAX); |
|
2795 | 2959 | cctx->dictID = (U32)dictID; |
@@ -2802,12 +2966,12 size_t ZSTD_compressBegin_advanced_inter | |||
|
2802 | 2966 | ZSTD_dictContentType_e dictContentType, |
|
2803 | 2967 | ZSTD_dictTableLoadMethod_e dtlm, |
|
2804 | 2968 | const ZSTD_CDict* cdict, |
|
2805 | ZSTD_CCtx_params params, | |
|
2969 | const ZSTD_CCtx_params* params, | |
|
2806 | 2970 | unsigned long long pledgedSrcSize) |
|
2807 | 2971 | { |
|
2808 |
DEBUGLOG(4, "ZSTD_compressBegin_advanced_internal: wlog=%u", params |
|
|
2972 | DEBUGLOG(4, "ZSTD_compressBegin_advanced_internal: wlog=%u", params->cParams.windowLog); | |
|
2809 | 2973 | /* compression parameters verification and optimization */ |
|
2810 |
FORWARD_IF_ERROR( ZSTD_checkCParams(params |
|
|
2974 | FORWARD_IF_ERROR( ZSTD_checkCParams(params->cParams) ); | |
|
2811 | 2975 | return ZSTD_compressBegin_internal(cctx, |
|
2812 | 2976 | dict, dictSize, dictContentType, dtlm, |
|
2813 | 2977 | cdict, |
@@ -2822,21 +2986,21 size_t ZSTD_compressBegin_advanced(ZSTD_ | |||
|
2822 | 2986 | ZSTD_parameters params, unsigned long long pledgedSrcSize) |
|
2823 | 2987 | { |
|
2824 | 2988 | ZSTD_CCtx_params const cctxParams = |
|
2825 | ZSTD_assignParamsToCCtxParams(cctx->requestedParams, params); | |
|
2989 | ZSTD_assignParamsToCCtxParams(&cctx->requestedParams, params); | |
|
2826 | 2990 | return ZSTD_compressBegin_advanced_internal(cctx, |
|
2827 | 2991 | dict, dictSize, ZSTD_dct_auto, ZSTD_dtlm_fast, |
|
2828 | 2992 | NULL /*cdict*/, |
|
2829 | cctxParams, pledgedSrcSize); | |
|
2993 | &cctxParams, pledgedSrcSize); | |
|
2830 | 2994 | } |
|
2831 | 2995 | |
|
2832 | 2996 | size_t ZSTD_compressBegin_usingDict(ZSTD_CCtx* cctx, const void* dict, size_t dictSize, int compressionLevel) |
|
2833 | 2997 | { |
|
2834 | 2998 | ZSTD_parameters const params = ZSTD_getParams(compressionLevel, ZSTD_CONTENTSIZE_UNKNOWN, dictSize); |
|
2835 | 2999 | ZSTD_CCtx_params const cctxParams = |
|
2836 | ZSTD_assignParamsToCCtxParams(cctx->requestedParams, params); | |
|
3000 | ZSTD_assignParamsToCCtxParams(&cctx->requestedParams, params); | |
|
2837 | 3001 | DEBUGLOG(4, "ZSTD_compressBegin_usingDict (dictSize=%u)", (unsigned)dictSize); |
|
2838 | 3002 | return ZSTD_compressBegin_internal(cctx, dict, dictSize, ZSTD_dct_auto, ZSTD_dtlm_fast, NULL, |
|
2839 | cctxParams, ZSTD_CONTENTSIZE_UNKNOWN, ZSTDb_not_buffered); | |
|
3003 | &cctxParams, ZSTD_CONTENTSIZE_UNKNOWN, ZSTDb_not_buffered); | |
|
2840 | 3004 | } |
|
2841 | 3005 | |
|
2842 | 3006 | size_t ZSTD_compressBegin(ZSTD_CCtx* cctx, int compressionLevel) |
@@ -2859,7 +3023,7 static size_t ZSTD_writeEpilogue(ZSTD_CC | |||
|
2859 | 3023 | |
|
2860 | 3024 | /* special case : empty frame */ |
|
2861 | 3025 | if (cctx->stage == ZSTDcs_init) { |
|
2862 | fhSize = ZSTD_writeFrameHeader(dst, dstCapacity, cctx->appliedParams, 0, 0); | |
|
3026 | fhSize = ZSTD_writeFrameHeader(dst, dstCapacity, &cctx->appliedParams, 0, 0); | |
|
2863 | 3027 | FORWARD_IF_ERROR(fhSize); |
|
2864 | 3028 | dstCapacity -= fhSize; |
|
2865 | 3029 | op += fhSize; |
@@ -2920,13 +3084,13 static size_t ZSTD_compress_internal (ZS | |||
|
2920 | 3084 | ZSTD_parameters params) |
|
2921 | 3085 | { |
|
2922 | 3086 | ZSTD_CCtx_params const cctxParams = |
|
2923 | ZSTD_assignParamsToCCtxParams(cctx->requestedParams, params); | |
|
3087 | ZSTD_assignParamsToCCtxParams(&cctx->requestedParams, params); | |
|
2924 | 3088 | DEBUGLOG(4, "ZSTD_compress_internal"); |
|
2925 | 3089 | return ZSTD_compress_advanced_internal(cctx, |
|
2926 | 3090 | dst, dstCapacity, |
|
2927 | 3091 | src, srcSize, |
|
2928 | 3092 | dict, dictSize, |
|
2929 | cctxParams); | |
|
3093 | &cctxParams); | |
|
2930 | 3094 | } |
|
2931 | 3095 | |
|
2932 | 3096 | size_t ZSTD_compress_advanced (ZSTD_CCtx* cctx, |
@@ -2950,7 +3114,7 size_t ZSTD_compress_advanced_internal( | |||
|
2950 | 3114 | void* dst, size_t dstCapacity, |
|
2951 | 3115 | const void* src, size_t srcSize, |
|
2952 | 3116 | const void* dict,size_t dictSize, |
|
2953 | ZSTD_CCtx_params params) | |
|
3117 | const ZSTD_CCtx_params* params) | |
|
2954 | 3118 | { |
|
2955 | 3119 | DEBUGLOG(4, "ZSTD_compress_advanced_internal (srcSize:%u)", (unsigned)srcSize); |
|
2956 | 3120 | FORWARD_IF_ERROR( ZSTD_compressBegin_internal(cctx, |
@@ -2966,9 +3130,9 size_t ZSTD_compress_usingDict(ZSTD_CCtx | |||
|
2966 | 3130 | int compressionLevel) |
|
2967 | 3131 | { |
|
2968 | 3132 | ZSTD_parameters const params = ZSTD_getParams(compressionLevel, srcSize + (!srcSize), dict ? dictSize : 0); |
|
2969 | ZSTD_CCtx_params cctxParams = ZSTD_assignParamsToCCtxParams(cctx->requestedParams, params); | |
|
3133 | ZSTD_CCtx_params cctxParams = ZSTD_assignParamsToCCtxParams(&cctx->requestedParams, params); | |
|
2970 | 3134 | assert(params.fParams.contentSizeFlag == 1); |
|
2971 | return ZSTD_compress_advanced_internal(cctx, dst, dstCapacity, src, srcSize, dict, dictSize, cctxParams); | |
|
3135 | return ZSTD_compress_advanced_internal(cctx, dst, dstCapacity, src, srcSize, dict, dictSize, &cctxParams); | |
|
2972 | 3136 | } |
|
2973 | 3137 | |
|
2974 | 3138 | size_t ZSTD_compressCCtx(ZSTD_CCtx* cctx, |
@@ -3003,8 +3167,11 size_t ZSTD_estimateCDictSize_advanced( | |||
|
3003 | 3167 | ZSTD_dictLoadMethod_e dictLoadMethod) |
|
3004 | 3168 | { |
|
3005 | 3169 | DEBUGLOG(5, "sizeof(ZSTD_CDict) : %u", (unsigned)sizeof(ZSTD_CDict)); |
|
3006 | return sizeof(ZSTD_CDict) + HUF_WORKSPACE_SIZE + ZSTD_sizeof_matchState(&cParams, /* forCCtx */ 0) | |
|
3007 | + (dictLoadMethod == ZSTD_dlm_byRef ? 0 : dictSize); | |
|
3170 | return ZSTD_cwksp_alloc_size(sizeof(ZSTD_CDict)) | |
|
3171 | + ZSTD_cwksp_alloc_size(HUF_WORKSPACE_SIZE) | |
|
3172 | + ZSTD_sizeof_matchState(&cParams, /* forCCtx */ 0) | |
|
3173 | + (dictLoadMethod == ZSTD_dlm_byRef ? 0 | |
|
3174 | : ZSTD_cwksp_alloc_size(ZSTD_cwksp_align(dictSize, sizeof(void *)))); | |
|
3008 | 3175 | } |
|
3009 | 3176 | |
|
3010 | 3177 | size_t ZSTD_estimateCDictSize(size_t dictSize, int compressionLevel) |
@@ -3017,7 +3184,9 size_t ZSTD_sizeof_CDict(const ZSTD_CDic | |||
|
3017 | 3184 | { |
|
3018 | 3185 | if (cdict==NULL) return 0; /* support sizeof on NULL */ |
|
3019 | 3186 | DEBUGLOG(5, "sizeof(*cdict) : %u", (unsigned)sizeof(*cdict)); |
|
3020 | return cdict->workspaceSize + (cdict->dictBuffer ? cdict->dictContentSize : 0) + sizeof(*cdict); | |
|
3187 | /* cdict may be in the workspace */ | |
|
3188 | return (cdict->workspace.workspace == cdict ? 0 : sizeof(*cdict)) | |
|
3189 | + ZSTD_cwksp_sizeof(&cdict->workspace); | |
|
3021 | 3190 | } |
|
3022 | 3191 | |
|
3023 | 3192 | static size_t ZSTD_initCDict_internal( |
@@ -3031,28 +3200,29 static size_t ZSTD_initCDict_internal( | |||
|
3031 | 3200 | assert(!ZSTD_checkCParams(cParams)); |
|
3032 | 3201 | cdict->matchState.cParams = cParams; |
|
3033 | 3202 | if ((dictLoadMethod == ZSTD_dlm_byRef) || (!dictBuffer) || (!dictSize)) { |
|
3034 | cdict->dictBuffer = NULL; | |
|
3035 | 3203 | cdict->dictContent = dictBuffer; |
|
3036 | 3204 | } else { |
|
3037 | void* const internalBuffer = ZSTD_malloc(dictSize, cdict->customMem); | |
|
3038 | cdict->dictBuffer = internalBuffer; | |
|
3205 | void *internalBuffer = ZSTD_cwksp_reserve_object(&cdict->workspace, ZSTD_cwksp_align(dictSize, sizeof(void*))); | |
|
3206 | RETURN_ERROR_IF(!internalBuffer, memory_allocation); | |
|
3039 | 3207 | cdict->dictContent = internalBuffer; |
|
3040 | RETURN_ERROR_IF(!internalBuffer, memory_allocation); | |
|
3041 | 3208 | memcpy(internalBuffer, dictBuffer, dictSize); |
|
3042 | 3209 | } |
|
3043 | 3210 | cdict->dictContentSize = dictSize; |
|
3044 | 3211 | |
|
3212 | cdict->entropyWorkspace = (U32*)ZSTD_cwksp_reserve_object(&cdict->workspace, HUF_WORKSPACE_SIZE); | |
|
3213 | ||
|
3214 | ||
|
3045 | 3215 | /* Reset the state to no dictionary */ |
|
3046 | 3216 | ZSTD_reset_compressedBlockState(&cdict->cBlockState); |
|
3047 | { void* const end = ZSTD_reset_matchState(&cdict->matchState, | |
|
3048 | (U32*)cdict->workspace + HUF_WORKSPACE_SIZE_U32, | |
|
3049 | &cParams, | |
|
3050 | ZSTDcrp_continue, ZSTD_resetTarget_CDict); | |
|
3051 | assert(end == (char*)cdict->workspace + cdict->workspaceSize); | |
|
3052 | (void)end; | |
|
3053 | } | |
|
3217 | FORWARD_IF_ERROR(ZSTD_reset_matchState( | |
|
3218 | &cdict->matchState, | |
|
3219 | &cdict->workspace, | |
|
3220 | &cParams, | |
|
3221 | ZSTDcrp_makeClean, | |
|
3222 | ZSTDirp_reset, | |
|
3223 | ZSTD_resetTarget_CDict)); | |
|
3054 | 3224 | /* (Maybe) load the dictionary |
|
3055 |
* Skips loading the dictionary if it is < |
|
|
3225 | * Skips loading the dictionary if it is < 8 bytes. | |
|
3056 | 3226 | */ |
|
3057 | 3227 | { ZSTD_CCtx_params params; |
|
3058 | 3228 | memset(¶ms, 0, sizeof(params)); |
@@ -3060,9 +3230,9 static size_t ZSTD_initCDict_internal( | |||
|
3060 | 3230 | params.fParams.contentSizeFlag = 1; |
|
3061 | 3231 | params.cParams = cParams; |
|
3062 | 3232 | { size_t const dictID = ZSTD_compress_insertDictionary( |
|
3063 |
&cdict->cBlockState, &cdict->matchState, & |
|
|
3064 | cdict->dictContent, cdict->dictContentSize, | |
|
3065 |
dictContentType, ZSTD_dtlm_full, cdict-> |
|
|
3233 | &cdict->cBlockState, &cdict->matchState, &cdict->workspace, | |
|
3234 | ¶ms, cdict->dictContent, cdict->dictContentSize, | |
|
3235 | dictContentType, ZSTD_dtlm_full, cdict->entropyWorkspace); | |
|
3066 | 3236 | FORWARD_IF_ERROR(dictID); |
|
3067 | 3237 | assert(dictID <= (size_t)(U32)-1); |
|
3068 | 3238 | cdict->dictID = (U32)dictID; |
@@ -3080,18 +3250,29 ZSTD_CDict* ZSTD_createCDict_advanced(co | |||
|
3080 | 3250 | DEBUGLOG(3, "ZSTD_createCDict_advanced, mode %u", (unsigned)dictContentType); |
|
3081 | 3251 | if (!customMem.customAlloc ^ !customMem.customFree) return NULL; |
|
3082 | 3252 | |
|
3083 | { ZSTD_CDict* const cdict = (ZSTD_CDict*)ZSTD_malloc(sizeof(ZSTD_CDict), customMem); | |
|
3084 | size_t const workspaceSize = HUF_WORKSPACE_SIZE + ZSTD_sizeof_matchState(&cParams, /* forCCtx */ 0); | |
|
3253 | { size_t const workspaceSize = | |
|
3254 | ZSTD_cwksp_alloc_size(sizeof(ZSTD_CDict)) + | |
|
3255 | ZSTD_cwksp_alloc_size(HUF_WORKSPACE_SIZE) + | |
|
3256 | ZSTD_sizeof_matchState(&cParams, /* forCCtx */ 0) + | |
|
3257 | (dictLoadMethod == ZSTD_dlm_byRef ? 0 | |
|
3258 | : ZSTD_cwksp_alloc_size(ZSTD_cwksp_align(dictSize, sizeof(void*)))); | |
|
3085 | 3259 | void* const workspace = ZSTD_malloc(workspaceSize, customMem); |
|
3086 | ||
|
3087 | if (!cdict || !workspace) { | |
|
3088 | ZSTD_free(cdict, customMem); | |
|
3260 | ZSTD_cwksp ws; | |
|
3261 | ZSTD_CDict* cdict; | |
|
3262 | ||
|
3263 | if (!workspace) { | |
|
3089 | 3264 | ZSTD_free(workspace, customMem); |
|
3090 | 3265 | return NULL; |
|
3091 | 3266 | } |
|
3267 | ||
|
3268 | ZSTD_cwksp_init(&ws, workspace, workspaceSize); | |
|
3269 | ||
|
3270 | cdict = (ZSTD_CDict*)ZSTD_cwksp_reserve_object(&ws, sizeof(ZSTD_CDict)); | |
|
3271 | assert(cdict != NULL); | |
|
3272 | ZSTD_cwksp_move(&cdict->workspace, &ws); | |
|
3092 | 3273 | cdict->customMem = customMem; |
|
3093 | cdict->workspace = workspace; | |
|
3094 | cdict->workspaceSize = workspaceSize; | |
|
3274 | cdict->compressionLevel = 0; /* signals advanced API usage */ | |
|
3275 | ||
|
3095 | 3276 | if (ZSTD_isError( ZSTD_initCDict_internal(cdict, |
|
3096 | 3277 | dictBuffer, dictSize, |
|
3097 | 3278 | dictLoadMethod, dictContentType, |
@@ -3107,9 +3288,12 ZSTD_CDict* ZSTD_createCDict_advanced(co | |||
|
3107 | 3288 | ZSTD_CDict* ZSTD_createCDict(const void* dict, size_t dictSize, int compressionLevel) |
|
3108 | 3289 | { |
|
3109 | 3290 | ZSTD_compressionParameters cParams = ZSTD_getCParams(compressionLevel, 0, dictSize); |
|
3110 |
|
|
|
3111 | ZSTD_dlm_byCopy, ZSTD_dct_auto, | |
|
3112 | cParams, ZSTD_defaultCMem); | |
|
3291 | ZSTD_CDict* cdict = ZSTD_createCDict_advanced(dict, dictSize, | |
|
3292 | ZSTD_dlm_byCopy, ZSTD_dct_auto, | |
|
3293 | cParams, ZSTD_defaultCMem); | |
|
3294 | if (cdict) | |
|
3295 | cdict->compressionLevel = compressionLevel == 0 ? ZSTD_CLEVEL_DEFAULT : compressionLevel; | |
|
3296 | return cdict; | |
|
3113 | 3297 | } |
|
3114 | 3298 | |
|
3115 | 3299 | ZSTD_CDict* ZSTD_createCDict_byReference(const void* dict, size_t dictSize, int compressionLevel) |
@@ -3124,9 +3308,11 size_t ZSTD_freeCDict(ZSTD_CDict* cdict) | |||
|
3124 | 3308 | { |
|
3125 | 3309 | if (cdict==NULL) return 0; /* support free on NULL */ |
|
3126 | 3310 | { ZSTD_customMem const cMem = cdict->customMem; |
|
3127 |
ZSTD_fr |
|
|
3128 |
ZSTD_free(cdict-> |
|
|
3129 | ZSTD_free(cdict, cMem); | |
|
3311 | int cdictInWorkspace = ZSTD_cwksp_owns_buffer(&cdict->workspace, cdict); | |
|
3312 | ZSTD_cwksp_free(&cdict->workspace, cMem); | |
|
3313 | if (!cdictInWorkspace) { | |
|
3314 | ZSTD_free(cdict, cMem); | |
|
3315 | } | |
|
3130 | 3316 | return 0; |
|
3131 | 3317 | } |
|
3132 | 3318 | } |
@@ -3152,28 +3338,30 const ZSTD_CDict* ZSTD_initStaticCDict( | |||
|
3152 | 3338 | ZSTD_compressionParameters cParams) |
|
3153 | 3339 | { |
|
3154 | 3340 | size_t const matchStateSize = ZSTD_sizeof_matchState(&cParams, /* forCCtx */ 0); |
|
3155 | size_t const neededSize = sizeof(ZSTD_CDict) + (dictLoadMethod == ZSTD_dlm_byRef ? 0 : dictSize) | |
|
3156 | + HUF_WORKSPACE_SIZE + matchStateSize; | |
|
3157 | ZSTD_CDict* const cdict = (ZSTD_CDict*) workspace; | |
|
3158 | void* ptr; | |
|
3341 | size_t const neededSize = ZSTD_cwksp_alloc_size(sizeof(ZSTD_CDict)) | |
|
3342 | + (dictLoadMethod == ZSTD_dlm_byRef ? 0 | |
|
3343 | : ZSTD_cwksp_alloc_size(ZSTD_cwksp_align(dictSize, sizeof(void*)))) | |
|
3344 | + ZSTD_cwksp_alloc_size(HUF_WORKSPACE_SIZE) | |
|
3345 | + matchStateSize; | |
|
3346 | ZSTD_CDict* cdict; | |
|
3347 | ||
|
3159 | 3348 | if ((size_t)workspace & 7) return NULL; /* 8-aligned */ |
|
3349 | ||
|
3350 | { | |
|
3351 | ZSTD_cwksp ws; | |
|
3352 | ZSTD_cwksp_init(&ws, workspace, workspaceSize); | |
|
3353 | cdict = (ZSTD_CDict*)ZSTD_cwksp_reserve_object(&ws, sizeof(ZSTD_CDict)); | |
|
3354 | if (cdict == NULL) return NULL; | |
|
3355 | ZSTD_cwksp_move(&cdict->workspace, &ws); | |
|
3356 | } | |
|
3357 | ||
|
3160 | 3358 | DEBUGLOG(4, "(workspaceSize < neededSize) : (%u < %u) => %u", |
|
3161 | 3359 | (unsigned)workspaceSize, (unsigned)neededSize, (unsigned)(workspaceSize < neededSize)); |
|
3162 | 3360 | if (workspaceSize < neededSize) return NULL; |
|
3163 | 3361 | |
|
3164 | if (dictLoadMethod == ZSTD_dlm_byCopy) { | |
|
3165 | memcpy(cdict+1, dict, dictSize); | |
|
3166 | dict = cdict+1; | |
|
3167 | ptr = (char*)workspace + sizeof(ZSTD_CDict) + dictSize; | |
|
3168 | } else { | |
|
3169 | ptr = cdict+1; | |
|
3170 | } | |
|
3171 | cdict->workspace = ptr; | |
|
3172 | cdict->workspaceSize = HUF_WORKSPACE_SIZE + matchStateSize; | |
|
3173 | ||
|
3174 | 3362 | if (ZSTD_isError( ZSTD_initCDict_internal(cdict, |
|
3175 | 3363 | dict, dictSize, |
|
3176 |
|
|
|
3364 | dictLoadMethod, dictContentType, | |
|
3177 | 3365 | cParams) )) |
|
3178 | 3366 | return NULL; |
|
3179 | 3367 | |
@@ -3195,7 +3383,15 size_t ZSTD_compressBegin_usingCDict_adv | |||
|
3195 | 3383 | DEBUGLOG(4, "ZSTD_compressBegin_usingCDict_advanced"); |
|
3196 | 3384 | RETURN_ERROR_IF(cdict==NULL, dictionary_wrong); |
|
3197 | 3385 | { ZSTD_CCtx_params params = cctx->requestedParams; |
|
3198 | params.cParams = ZSTD_getCParamsFromCDict(cdict); | |
|
3386 | params.cParams = ( pledgedSrcSize < ZSTD_USE_CDICT_PARAMS_SRCSIZE_CUTOFF | |
|
3387 | || pledgedSrcSize < cdict->dictContentSize * ZSTD_USE_CDICT_PARAMS_DICTSIZE_MULTIPLIER | |
|
3388 | || pledgedSrcSize == ZSTD_CONTENTSIZE_UNKNOWN | |
|
3389 | || cdict->compressionLevel == 0 ) | |
|
3390 | && (params.attachDictPref != ZSTD_dictForceLoad) ? | |
|
3391 | ZSTD_getCParamsFromCDict(cdict) | |
|
3392 | : ZSTD_getCParams(cdict->compressionLevel, | |
|
3393 | pledgedSrcSize, | |
|
3394 | cdict->dictContentSize); | |
|
3199 | 3395 | /* Increase window log to fit the entire dictionary and source if the |
|
3200 | 3396 | * source size is known. Limit the increase to 19, which is the |
|
3201 | 3397 | * window log for compression level 1 with the largest source size. |
@@ -3209,7 +3405,7 size_t ZSTD_compressBegin_usingCDict_adv | |||
|
3209 | 3405 | return ZSTD_compressBegin_internal(cctx, |
|
3210 | 3406 | NULL, 0, ZSTD_dct_auto, ZSTD_dtlm_fast, |
|
3211 | 3407 | cdict, |
|
3212 | params, pledgedSrcSize, | |
|
3408 | ¶ms, pledgedSrcSize, | |
|
3213 | 3409 | ZSTDb_not_buffered); |
|
3214 | 3410 | } |
|
3215 | 3411 | } |
@@ -3300,7 +3496,7 static size_t ZSTD_resetCStream_internal | |||
|
3300 | 3496 | FORWARD_IF_ERROR( ZSTD_compressBegin_internal(cctx, |
|
3301 | 3497 | dict, dictSize, dictContentType, ZSTD_dtlm_fast, |
|
3302 | 3498 | cdict, |
|
3303 | params, pledgedSrcSize, | |
|
3499 | ¶ms, pledgedSrcSize, | |
|
3304 | 3500 | ZSTDb_buffered) ); |
|
3305 | 3501 | |
|
3306 | 3502 | cctx->inToCompress = 0; |
@@ -3334,13 +3530,14 size_t ZSTD_resetCStream(ZSTD_CStream* z | |||
|
3334 | 3530 | * Assumption 2 : either dict, or cdict, is defined, not both */ |
|
3335 | 3531 | size_t ZSTD_initCStream_internal(ZSTD_CStream* zcs, |
|
3336 | 3532 | const void* dict, size_t dictSize, const ZSTD_CDict* cdict, |
|
3337 |
ZSTD_CCtx_params params, |
|
|
3533 | const ZSTD_CCtx_params* params, | |
|
3534 | unsigned long long pledgedSrcSize) | |
|
3338 | 3535 | { |
|
3339 | 3536 | DEBUGLOG(4, "ZSTD_initCStream_internal"); |
|
3340 | 3537 | FORWARD_IF_ERROR( ZSTD_CCtx_reset(zcs, ZSTD_reset_session_only) ); |
|
3341 | 3538 | FORWARD_IF_ERROR( ZSTD_CCtx_setPledgedSrcSize(zcs, pledgedSrcSize) ); |
|
3342 |
assert(!ZSTD_isError(ZSTD_checkCParams(params |
|
|
3343 | zcs->requestedParams = params; | |
|
3539 | assert(!ZSTD_isError(ZSTD_checkCParams(params->cParams))); | |
|
3540 | zcs->requestedParams = *params; | |
|
3344 | 3541 | assert(!((dict) && (cdict))); /* either dict or cdict, not both */ |
|
3345 | 3542 | if (dict) { |
|
3346 | 3543 | FORWARD_IF_ERROR( ZSTD_CCtx_loadDictionary(zcs, dict, dictSize) ); |
@@ -3379,7 +3576,7 size_t ZSTD_initCStream_usingCDict(ZSTD_ | |||
|
3379 | 3576 | /* ZSTD_initCStream_advanced() : |
|
3380 | 3577 | * pledgedSrcSize must be exact. |
|
3381 | 3578 | * if srcSize is not known at init time, use value ZSTD_CONTENTSIZE_UNKNOWN. |
|
3382 |
* dict is loaded with default parameters ZSTD_d |
|
|
3579 | * dict is loaded with default parameters ZSTD_dct_auto and ZSTD_dlm_byCopy. */ | |
|
3383 | 3580 | size_t ZSTD_initCStream_advanced(ZSTD_CStream* zcs, |
|
3384 | 3581 | const void* dict, size_t dictSize, |
|
3385 | 3582 | ZSTD_parameters params, unsigned long long pss) |
@@ -3393,7 +3590,7 size_t ZSTD_initCStream_advanced(ZSTD_CS | |||
|
3393 | 3590 | FORWARD_IF_ERROR( ZSTD_CCtx_reset(zcs, ZSTD_reset_session_only) ); |
|
3394 | 3591 | FORWARD_IF_ERROR( ZSTD_CCtx_setPledgedSrcSize(zcs, pledgedSrcSize) ); |
|
3395 | 3592 | FORWARD_IF_ERROR( ZSTD_checkCParams(params.cParams) ); |
|
3396 | zcs->requestedParams = ZSTD_assignParamsToCCtxParams(zcs->requestedParams, params); | |
|
3593 | zcs->requestedParams = ZSTD_assignParamsToCCtxParams(&zcs->requestedParams, params); | |
|
3397 | 3594 | FORWARD_IF_ERROR( ZSTD_CCtx_loadDictionary(zcs, dict, dictSize) ); |
|
3398 | 3595 | return 0; |
|
3399 | 3596 | } |
@@ -3643,7 +3840,7 size_t ZSTD_compressStream2( ZSTD_CCtx* | |||
|
3643 | 3840 | if (cctx->mtctx == NULL) { |
|
3644 | 3841 | DEBUGLOG(4, "ZSTD_compressStream2: creating new mtctx for nbWorkers=%u", |
|
3645 | 3842 | params.nbWorkers); |
|
3646 | cctx->mtctx = ZSTDMT_createCCtx_advanced(params.nbWorkers, cctx->customMem); | |
|
3843 | cctx->mtctx = ZSTDMT_createCCtx_advanced((U32)params.nbWorkers, cctx->customMem); | |
|
3647 | 3844 | RETURN_ERROR_IF(cctx->mtctx == NULL, memory_allocation); |
|
3648 | 3845 | } |
|
3649 | 3846 | /* mt compression */ |
@@ -3771,8 +3968,8 static const ZSTD_compressionParameters | |||
|
3771 | 3968 | { 19, 12, 13, 1, 6, 1, ZSTD_fast }, /* base for negative levels */ |
|
3772 | 3969 | { 19, 13, 14, 1, 7, 0, ZSTD_fast }, /* level 1 */ |
|
3773 | 3970 | { 20, 15, 16, 1, 6, 0, ZSTD_fast }, /* level 2 */ |
|
3774 |
{ 21, 16, 17, 1, 5, |
|
|
3775 |
{ 21, 18, 18, 1, 5, |
|
|
3971 | { 21, 16, 17, 1, 5, 0, ZSTD_dfast }, /* level 3 */ | |
|
3972 | { 21, 18, 18, 1, 5, 0, ZSTD_dfast }, /* level 4 */ | |
|
3776 | 3973 | { 21, 18, 19, 2, 5, 2, ZSTD_greedy }, /* level 5 */ |
|
3777 | 3974 | { 21, 19, 19, 3, 5, 4, ZSTD_greedy }, /* level 6 */ |
|
3778 | 3975 | { 21, 19, 19, 3, 5, 8, ZSTD_lazy }, /* level 7 */ |
@@ -3796,8 +3993,8 static const ZSTD_compressionParameters | |||
|
3796 | 3993 | /* W, C, H, S, L, T, strat */ |
|
3797 | 3994 | { 18, 12, 13, 1, 5, 1, ZSTD_fast }, /* base for negative levels */ |
|
3798 | 3995 | { 18, 13, 14, 1, 6, 0, ZSTD_fast }, /* level 1 */ |
|
3799 |
{ 18, 14, 14, 1, 5, |
|
|
3800 |
{ 18, 16, 16, 1, 4, |
|
|
3996 | { 18, 14, 14, 1, 5, 0, ZSTD_dfast }, /* level 2 */ | |
|
3997 | { 18, 16, 16, 1, 4, 0, ZSTD_dfast }, /* level 3 */ | |
|
3801 | 3998 | { 18, 16, 17, 2, 5, 2, ZSTD_greedy }, /* level 4.*/ |
|
3802 | 3999 | { 18, 18, 18, 3, 5, 2, ZSTD_greedy }, /* level 5.*/ |
|
3803 | 4000 | { 18, 18, 19, 3, 5, 4, ZSTD_lazy }, /* level 6.*/ |
@@ -3823,8 +4020,8 static const ZSTD_compressionParameters | |||
|
3823 | 4020 | { 17, 12, 12, 1, 5, 1, ZSTD_fast }, /* base for negative levels */ |
|
3824 | 4021 | { 17, 12, 13, 1, 6, 0, ZSTD_fast }, /* level 1 */ |
|
3825 | 4022 | { 17, 13, 15, 1, 5, 0, ZSTD_fast }, /* level 2 */ |
|
3826 |
{ 17, 15, 16, 2, 5, |
|
|
3827 |
{ 17, 17, 17, 2, 4, |
|
|
4023 | { 17, 15, 16, 2, 5, 0, ZSTD_dfast }, /* level 3 */ | |
|
4024 | { 17, 17, 17, 2, 4, 0, ZSTD_dfast }, /* level 4 */ | |
|
3828 | 4025 | { 17, 16, 17, 3, 4, 2, ZSTD_greedy }, /* level 5 */ |
|
3829 | 4026 | { 17, 17, 17, 3, 4, 4, ZSTD_lazy }, /* level 6 */ |
|
3830 | 4027 | { 17, 17, 17, 3, 4, 8, ZSTD_lazy2 }, /* level 7 */ |
@@ -3849,7 +4046,7 static const ZSTD_compressionParameters | |||
|
3849 | 4046 | { 14, 12, 13, 1, 5, 1, ZSTD_fast }, /* base for negative levels */ |
|
3850 | 4047 | { 14, 14, 15, 1, 5, 0, ZSTD_fast }, /* level 1 */ |
|
3851 | 4048 | { 14, 14, 15, 1, 4, 0, ZSTD_fast }, /* level 2 */ |
|
3852 |
{ 14, 14, 15, 2, 4, |
|
|
4049 | { 14, 14, 15, 2, 4, 0, ZSTD_dfast }, /* level 3 */ | |
|
3853 | 4050 | { 14, 14, 14, 4, 4, 2, ZSTD_greedy }, /* level 4 */ |
|
3854 | 4051 | { 14, 14, 14, 3, 4, 4, ZSTD_lazy }, /* level 5.*/ |
|
3855 | 4052 | { 14, 14, 14, 4, 4, 8, ZSTD_lazy2 }, /* level 6 */ |
@@ -19,6 +19,7 | |||
|
19 | 19 | * Dependencies |
|
20 | 20 | ***************************************/ |
|
21 | 21 | #include "zstd_internal.h" |
|
22 | #include "zstd_cwksp.h" | |
|
22 | 23 | #ifdef ZSTD_MULTITHREAD |
|
23 | 24 | # include "zstdmt_compress.h" |
|
24 | 25 | #endif |
@@ -192,6 +193,13 typedef struct { | |||
|
192 | 193 | size_t capacity; /* The capacity starting from `seq` pointer */ |
|
193 | 194 | } rawSeqStore_t; |
|
194 | 195 | |
|
196 | typedef struct { | |
|
197 | int collectSequences; | |
|
198 | ZSTD_Sequence* seqStart; | |
|
199 | size_t seqIndex; | |
|
200 | size_t maxSequences; | |
|
201 | } SeqCollector; | |
|
202 | ||
|
195 | 203 | struct ZSTD_CCtx_params_s { |
|
196 | 204 | ZSTD_format_e format; |
|
197 | 205 | ZSTD_compressionParameters cParams; |
@@ -203,6 +211,9 struct ZSTD_CCtx_params_s { | |||
|
203 | 211 | size_t targetCBlockSize; /* Tries to fit compressed block size to be around targetCBlockSize. |
|
204 | 212 | * No target when targetCBlockSize == 0. |
|
205 | 213 | * There is no guarantee on compressed block size */ |
|
214 | int srcSizeHint; /* User's best guess of source size. | |
|
215 | * Hint is not valid when srcSizeHint == 0. | |
|
216 | * There is no guarantee that hint is close to actual source size */ | |
|
206 | 217 | |
|
207 | 218 | ZSTD_dictAttachPref_e attachDictPref; |
|
208 | 219 | ZSTD_literalCompressionMode_e literalCompressionMode; |
@@ -228,9 +239,7 struct ZSTD_CCtx_s { | |||
|
228 | 239 | ZSTD_CCtx_params appliedParams; |
|
229 | 240 | U32 dictID; |
|
230 | 241 | |
|
231 | int workSpaceOversizedDuration; | |
|
232 | void* workSpace; | |
|
233 | size_t workSpaceSize; | |
|
242 | ZSTD_cwksp workspace; /* manages buffer for dynamic allocations */ | |
|
234 | 243 | size_t blockSize; |
|
235 | 244 | unsigned long long pledgedSrcSizePlusOne; /* this way, 0 (default) == unknown */ |
|
236 | 245 | unsigned long long consumedSrcSize; |
@@ -238,6 +247,8 struct ZSTD_CCtx_s { | |||
|
238 | 247 | XXH64_state_t xxhState; |
|
239 | 248 | ZSTD_customMem customMem; |
|
240 | 249 | size_t staticSize; |
|
250 | SeqCollector seqCollector; | |
|
251 | int isFirstBlock; | |
|
241 | 252 | |
|
242 | 253 | seqStore_t seqStore; /* sequences storage ptrs */ |
|
243 | 254 | ldmState_t ldmState; /* long distance matching state */ |
@@ -337,26 +348,57 MEM_STATIC size_t ZSTD_minGain(size_t sr | |||
|
337 | 348 | return (srcSize >> minlog) + 2; |
|
338 | 349 | } |
|
339 | 350 | |
|
351 | /*! ZSTD_safecopyLiterals() : | |
|
352 | * memcpy() function that won't read beyond more than WILDCOPY_OVERLENGTH bytes past ilimit_w. | |
|
353 | * Only called when the sequence ends past ilimit_w, so it only needs to be optimized for single | |
|
354 | * large copies. | |
|
355 | */ | |
|
356 | static void ZSTD_safecopyLiterals(BYTE* op, BYTE const* ip, BYTE const* const iend, BYTE const* ilimit_w) { | |
|
357 | assert(iend > ilimit_w); | |
|
358 | if (ip <= ilimit_w) { | |
|
359 | ZSTD_wildcopy(op, ip, ilimit_w - ip, ZSTD_no_overlap); | |
|
360 | op += ilimit_w - ip; | |
|
361 | ip = ilimit_w; | |
|
362 | } | |
|
363 | while (ip < iend) *op++ = *ip++; | |
|
364 | } | |
|
365 | ||
|
340 | 366 | /*! ZSTD_storeSeq() : |
|
341 |
* Store a sequence (lit |
|
|
342 |
* `off |
|
|
367 | * Store a sequence (litlen, litPtr, offCode and mlBase) into seqStore_t. | |
|
368 | * `offCode` : distance to match + ZSTD_REP_MOVE (values <= ZSTD_REP_MOVE are repCodes). | |
|
343 | 369 | * `mlBase` : matchLength - MINMATCH |
|
370 | * Allowed to overread literals up to litLimit. | |
|
344 | 371 | */ |
|
345 | MEM_STATIC void ZSTD_storeSeq(seqStore_t* seqStorePtr, size_t litLength, const void* literals, U32 offsetCode, size_t mlBase) | |
|
372 | HINT_INLINE UNUSED_ATTR | |
|
373 | void ZSTD_storeSeq(seqStore_t* seqStorePtr, size_t litLength, const BYTE* literals, const BYTE* litLimit, U32 offCode, size_t mlBase) | |
|
346 | 374 | { |
|
375 | BYTE const* const litLimit_w = litLimit - WILDCOPY_OVERLENGTH; | |
|
376 | BYTE const* const litEnd = literals + litLength; | |
|
347 | 377 | #if defined(DEBUGLEVEL) && (DEBUGLEVEL >= 6) |
|
348 | 378 | static const BYTE* g_start = NULL; |
|
349 | 379 | if (g_start==NULL) g_start = (const BYTE*)literals; /* note : index only works for compression within a single segment */ |
|
350 | 380 | { U32 const pos = (U32)((const BYTE*)literals - g_start); |
|
351 | 381 | DEBUGLOG(6, "Cpos%7u :%3u literals, match%4u bytes at offCode%7u", |
|
352 |
pos, (U32)litLength, (U32)mlBase+MINMATCH, (U32)off |
|
|
382 | pos, (U32)litLength, (U32)mlBase+MINMATCH, (U32)offCode); | |
|
353 | 383 | } |
|
354 | 384 | #endif |
|
355 | 385 | assert((size_t)(seqStorePtr->sequences - seqStorePtr->sequencesStart) < seqStorePtr->maxNbSeq); |
|
356 | 386 | /* copy Literals */ |
|
357 | 387 | assert(seqStorePtr->maxNbLit <= 128 KB); |
|
358 | 388 | assert(seqStorePtr->lit + litLength <= seqStorePtr->litStart + seqStorePtr->maxNbLit); |
|
359 | ZSTD_wildcopy(seqStorePtr->lit, literals, (ptrdiff_t)litLength, ZSTD_no_overlap); | |
|
389 | assert(literals + litLength <= litLimit); | |
|
390 | if (litEnd <= litLimit_w) { | |
|
391 | /* Common case we can use wildcopy. | |
|
392 | * First copy 16 bytes, because literals are likely short. | |
|
393 | */ | |
|
394 | assert(WILDCOPY_OVERLENGTH >= 16); | |
|
395 | ZSTD_copy16(seqStorePtr->lit, literals); | |
|
396 | if (litLength > 16) { | |
|
397 | ZSTD_wildcopy(seqStorePtr->lit+16, literals+16, (ptrdiff_t)litLength-16, ZSTD_no_overlap); | |
|
398 | } | |
|
399 | } else { | |
|
400 | ZSTD_safecopyLiterals(seqStorePtr->lit, literals, litEnd, litLimit_w); | |
|
401 | } | |
|
360 | 402 | seqStorePtr->lit += litLength; |
|
361 | 403 | |
|
362 | 404 | /* literal Length */ |
@@ -368,7 +410,7 MEM_STATIC void ZSTD_storeSeq(seqStore_t | |||
|
368 | 410 | seqStorePtr->sequences[0].litLength = (U16)litLength; |
|
369 | 411 | |
|
370 | 412 | /* match offset */ |
|
371 |
seqStorePtr->sequences[0].offset = off |
|
|
413 | seqStorePtr->sequences[0].offset = offCode + 1; | |
|
372 | 414 | |
|
373 | 415 | /* match Length */ |
|
374 | 416 | if (mlBase>0xFFFF) { |
@@ -910,7 +952,7 ZSTD_compressionParameters ZSTD_getCPara | |||
|
910 | 952 | size_t ZSTD_initCStream_internal(ZSTD_CStream* zcs, |
|
911 | 953 | const void* dict, size_t dictSize, |
|
912 | 954 | const ZSTD_CDict* cdict, |
|
913 |
ZSTD_CCtx_params |
|
|
955 | const ZSTD_CCtx_params* params, unsigned long long pledgedSrcSize); | |
|
914 | 956 | |
|
915 | 957 | void ZSTD_resetSeqStore(seqStore_t* ssPtr); |
|
916 | 958 | |
@@ -925,7 +967,7 size_t ZSTD_compressBegin_advanced_inter | |||
|
925 | 967 | ZSTD_dictContentType_e dictContentType, |
|
926 | 968 | ZSTD_dictTableLoadMethod_e dtlm, |
|
927 | 969 | const ZSTD_CDict* cdict, |
|
928 | ZSTD_CCtx_params params, | |
|
970 | const ZSTD_CCtx_params* params, | |
|
929 | 971 | unsigned long long pledgedSrcSize); |
|
930 | 972 | |
|
931 | 973 | /* ZSTD_compress_advanced_internal() : |
@@ -934,7 +976,7 size_t ZSTD_compress_advanced_internal(Z | |||
|
934 | 976 | void* dst, size_t dstCapacity, |
|
935 | 977 | const void* src, size_t srcSize, |
|
936 | 978 | const void* dict,size_t dictSize, |
|
937 | ZSTD_CCtx_params params); | |
|
979 | const ZSTD_CCtx_params* params); | |
|
938 | 980 | |
|
939 | 981 | |
|
940 | 982 | /* ZSTD_writeLastEmptyBlock() : |
@@ -70,7 +70,7 size_t ZSTD_compressLiterals (ZSTD_hufCT | |||
|
70 | 70 | ZSTD_strategy strategy, int disableLiteralCompression, |
|
71 | 71 | void* dst, size_t dstCapacity, |
|
72 | 72 | const void* src, size_t srcSize, |
|
73 |
void* |
|
|
73 | void* entropyWorkspace, size_t entropyWorkspaceSize, | |
|
74 | 74 | const int bmi2) |
|
75 | 75 | { |
|
76 | 76 | size_t const minGain = ZSTD_minGain(srcSize, strategy); |
@@ -99,10 +99,15 size_t ZSTD_compressLiterals (ZSTD_hufCT | |||
|
99 | 99 | { HUF_repeat repeat = prevHuf->repeatMode; |
|
100 | 100 | int const preferRepeat = strategy < ZSTD_lazy ? srcSize <= 1024 : 0; |
|
101 | 101 | if (repeat == HUF_repeat_valid && lhSize == 3) singleStream = 1; |
|
102 | cLitSize = singleStream ? HUF_compress1X_repeat(ostart+lhSize, dstCapacity-lhSize, src, srcSize, 255, 11, | |
|
103 | workspace, wkspSize, (HUF_CElt*)nextHuf->CTable, &repeat, preferRepeat, bmi2) | |
|
104 |
|
|
|
105 | workspace, wkspSize, (HUF_CElt*)nextHuf->CTable, &repeat, preferRepeat, bmi2); | |
|
102 | cLitSize = singleStream ? | |
|
103 | HUF_compress1X_repeat( | |
|
104 | ostart+lhSize, dstCapacity-lhSize, src, srcSize, | |
|
105 | 255, 11, entropyWorkspace, entropyWorkspaceSize, | |
|
106 | (HUF_CElt*)nextHuf->CTable, &repeat, preferRepeat, bmi2) : | |
|
107 | HUF_compress4X_repeat( | |
|
108 | ostart+lhSize, dstCapacity-lhSize, src, srcSize, | |
|
109 | 255, 11, entropyWorkspace, entropyWorkspaceSize, | |
|
110 | (HUF_CElt*)nextHuf->CTable, &repeat, preferRepeat, bmi2); | |
|
106 | 111 | if (repeat != HUF_repeat_none) { |
|
107 | 112 | /* reused the existing table */ |
|
108 | 113 | hType = set_repeat; |
@@ -23,7 +23,7 size_t ZSTD_compressLiterals (ZSTD_hufCT | |||
|
23 | 23 | ZSTD_strategy strategy, int disableLiteralCompression, |
|
24 | 24 | void* dst, size_t dstCapacity, |
|
25 | 25 | const void* src, size_t srcSize, |
|
26 |
void* |
|
|
26 | void* entropyWorkspace, size_t entropyWorkspaceSize, | |
|
27 | 27 | const int bmi2); |
|
28 | 28 | |
|
29 | 29 | #endif /* ZSTD_COMPRESS_LITERALS_H */ |
@@ -222,7 +222,7 ZSTD_buildCTable(void* dst, size_t dstCa | |||
|
222 | 222 | const BYTE* codeTable, size_t nbSeq, |
|
223 | 223 | const S16* defaultNorm, U32 defaultNormLog, U32 defaultMax, |
|
224 | 224 | const FSE_CTable* prevCTable, size_t prevCTableSize, |
|
225 |
void* |
|
|
225 | void* entropyWorkspace, size_t entropyWorkspaceSize) | |
|
226 | 226 | { |
|
227 | 227 | BYTE* op = (BYTE*)dst; |
|
228 | 228 | const BYTE* const oend = op + dstCapacity; |
@@ -238,7 +238,7 ZSTD_buildCTable(void* dst, size_t dstCa | |||
|
238 | 238 | memcpy(nextCTable, prevCTable, prevCTableSize); |
|
239 | 239 | return 0; |
|
240 | 240 | case set_basic: |
|
241 |
FORWARD_IF_ERROR(FSE_buildCTable_wksp(nextCTable, defaultNorm, defaultMax, defaultNormLog, |
|
|
241 | FORWARD_IF_ERROR(FSE_buildCTable_wksp(nextCTable, defaultNorm, defaultMax, defaultNormLog, entropyWorkspace, entropyWorkspaceSize)); /* note : could be pre-calculated */ | |
|
242 | 242 | return 0; |
|
243 | 243 | case set_compressed: { |
|
244 | 244 | S16 norm[MaxSeq + 1]; |
@@ -252,7 +252,7 ZSTD_buildCTable(void* dst, size_t dstCa | |||
|
252 | 252 | FORWARD_IF_ERROR(FSE_normalizeCount(norm, tableLog, count, nbSeq_1, max)); |
|
253 | 253 | { size_t const NCountSize = FSE_writeNCount(op, oend - op, norm, max, tableLog); /* overflow protected */ |
|
254 | 254 | FORWARD_IF_ERROR(NCountSize); |
|
255 |
FORWARD_IF_ERROR(FSE_buildCTable_wksp(nextCTable, norm, max, tableLog, |
|
|
255 | FORWARD_IF_ERROR(FSE_buildCTable_wksp(nextCTable, norm, max, tableLog, entropyWorkspace, entropyWorkspaceSize)); | |
|
256 | 256 | return NCountSize; |
|
257 | 257 | } |
|
258 | 258 | } |
@@ -35,7 +35,7 ZSTD_buildCTable(void* dst, size_t dstCa | |||
|
35 | 35 | const BYTE* codeTable, size_t nbSeq, |
|
36 | 36 | const S16* defaultNorm, U32 defaultNormLog, U32 defaultMax, |
|
37 | 37 | const FSE_CTable* prevCTable, size_t prevCTableSize, |
|
38 |
void* |
|
|
38 | void* entropyWorkspace, size_t entropyWorkspaceSize); | |
|
39 | 39 | |
|
40 | 40 | size_t ZSTD_encodeSequences( |
|
41 | 41 | void* dst, size_t dstCapacity, |
@@ -148,7 +148,7 size_t ZSTD_compressBlock_doubleFast_gen | |||
|
148 | 148 | const BYTE* repMatchEnd = repIndex < prefixLowestIndex ? dictEnd : iend; |
|
149 | 149 | mLength = ZSTD_count_2segments(ip+1+4, repMatch+4, iend, repMatchEnd, prefixLowest) + 4; |
|
150 | 150 | ip++; |
|
151 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, 0, mLength-MINMATCH); | |
|
151 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, 0, mLength-MINMATCH); | |
|
152 | 152 | goto _match_stored; |
|
153 | 153 | } |
|
154 | 154 | |
@@ -157,7 +157,7 size_t ZSTD_compressBlock_doubleFast_gen | |||
|
157 | 157 | && ((offset_1 > 0) & (MEM_read32(ip+1-offset_1) == MEM_read32(ip+1)))) { |
|
158 | 158 | mLength = ZSTD_count(ip+1+4, ip+1+4-offset_1, iend) + 4; |
|
159 | 159 | ip++; |
|
160 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, 0, mLength-MINMATCH); | |
|
160 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, 0, mLength-MINMATCH); | |
|
161 | 161 | goto _match_stored; |
|
162 | 162 | } |
|
163 | 163 | |
@@ -247,7 +247,7 size_t ZSTD_compressBlock_doubleFast_gen | |||
|
247 | 247 | offset_2 = offset_1; |
|
248 | 248 | offset_1 = offset; |
|
249 | 249 | |
|
250 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
250 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
251 | 251 | |
|
252 | 252 | _match_stored: |
|
253 | 253 | /* match found */ |
@@ -278,7 +278,7 size_t ZSTD_compressBlock_doubleFast_gen | |||
|
278 | 278 | const BYTE* const repEnd2 = repIndex2 < prefixLowestIndex ? dictEnd : iend; |
|
279 | 279 | size_t const repLength2 = ZSTD_count_2segments(ip+4, repMatch2+4, iend, repEnd2, prefixLowest) + 4; |
|
280 | 280 | U32 tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; /* swap offset_2 <=> offset_1 */ |
|
281 | ZSTD_storeSeq(seqStore, 0, anchor, 0, repLength2-MINMATCH); | |
|
281 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, repLength2-MINMATCH); | |
|
282 | 282 | hashSmall[ZSTD_hashPtr(ip, hBitsS, mls)] = current2; |
|
283 | 283 | hashLong[ZSTD_hashPtr(ip, hBitsL, 8)] = current2; |
|
284 | 284 | ip += repLength2; |
@@ -297,7 +297,7 size_t ZSTD_compressBlock_doubleFast_gen | |||
|
297 | 297 | U32 const tmpOff = offset_2; offset_2 = offset_1; offset_1 = tmpOff; /* swap offset_2 <=> offset_1 */ |
|
298 | 298 | hashSmall[ZSTD_hashPtr(ip, hBitsS, mls)] = (U32)(ip-base); |
|
299 | 299 | hashLong[ZSTD_hashPtr(ip, hBitsL, 8)] = (U32)(ip-base); |
|
300 | ZSTD_storeSeq(seqStore, 0, anchor, 0, rLength-MINMATCH); | |
|
300 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, rLength-MINMATCH); | |
|
301 | 301 | ip += rLength; |
|
302 | 302 | anchor = ip; |
|
303 | 303 | continue; /* faster when present ... (?) */ |
@@ -411,7 +411,7 static size_t ZSTD_compressBlock_doubleF | |||
|
411 | 411 | const BYTE* repMatchEnd = repIndex < prefixStartIndex ? dictEnd : iend; |
|
412 | 412 | mLength = ZSTD_count_2segments(ip+1+4, repMatch+4, iend, repMatchEnd, prefixStart) + 4; |
|
413 | 413 | ip++; |
|
414 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, 0, mLength-MINMATCH); | |
|
414 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, 0, mLength-MINMATCH); | |
|
415 | 415 | } else { |
|
416 | 416 | if ((matchLongIndex > dictStartIndex) && (MEM_read64(matchLong) == MEM_read64(ip))) { |
|
417 | 417 | const BYTE* const matchEnd = matchLongIndex < prefixStartIndex ? dictEnd : iend; |
@@ -422,7 +422,7 static size_t ZSTD_compressBlock_doubleF | |||
|
422 | 422 | while (((ip>anchor) & (matchLong>lowMatchPtr)) && (ip[-1] == matchLong[-1])) { ip--; matchLong--; mLength++; } /* catch up */ |
|
423 | 423 | offset_2 = offset_1; |
|
424 | 424 | offset_1 = offset; |
|
425 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
425 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
426 | 426 | |
|
427 | 427 | } else if ((matchIndex > dictStartIndex) && (MEM_read32(match) == MEM_read32(ip))) { |
|
428 | 428 | size_t const h3 = ZSTD_hashPtr(ip+1, hBitsL, 8); |
@@ -447,7 +447,7 static size_t ZSTD_compressBlock_doubleF | |||
|
447 | 447 | } |
|
448 | 448 | offset_2 = offset_1; |
|
449 | 449 | offset_1 = offset; |
|
450 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
450 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
451 | 451 | |
|
452 | 452 | } else { |
|
453 | 453 | ip += ((ip-anchor) >> kSearchStrength) + 1; |
@@ -479,7 +479,7 static size_t ZSTD_compressBlock_doubleF | |||
|
479 | 479 | const BYTE* const repEnd2 = repIndex2 < prefixStartIndex ? dictEnd : iend; |
|
480 | 480 | size_t const repLength2 = ZSTD_count_2segments(ip+4, repMatch2+4, iend, repEnd2, prefixStart) + 4; |
|
481 | 481 | U32 const tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; /* swap offset_2 <=> offset_1 */ |
|
482 | ZSTD_storeSeq(seqStore, 0, anchor, 0, repLength2-MINMATCH); | |
|
482 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, repLength2-MINMATCH); | |
|
483 | 483 | hashSmall[ZSTD_hashPtr(ip, hBitsS, mls)] = current2; |
|
484 | 484 | hashLong[ZSTD_hashPtr(ip, hBitsL, 8)] = current2; |
|
485 | 485 | ip += repLength2; |
@@ -8,7 +8,7 | |||
|
8 | 8 | * You may select, at your option, one of the above-listed licenses. |
|
9 | 9 | */ |
|
10 | 10 | |
|
11 | #include "zstd_compress_internal.h" | |
|
11 | #include "zstd_compress_internal.h" /* ZSTD_hashPtr, ZSTD_count, ZSTD_storeSeq */ | |
|
12 | 12 | #include "zstd_fast.h" |
|
13 | 13 | |
|
14 | 14 | |
@@ -43,8 +43,8 void ZSTD_fillHashTable(ZSTD_matchState_ | |||
|
43 | 43 | } |
|
44 | 44 | |
|
45 | 45 | |
|
46 | FORCE_INLINE_TEMPLATE | |
|
47 |
|
|
|
46 | FORCE_INLINE_TEMPLATE size_t | |
|
47 | ZSTD_compressBlock_fast_generic( | |
|
48 | 48 | ZSTD_matchState_t* ms, seqStore_t* seqStore, U32 rep[ZSTD_REP_NUM], |
|
49 | 49 | void const* src, size_t srcSize, |
|
50 | 50 | U32 const mls) |
@@ -74,8 +74,7 size_t ZSTD_compressBlock_fast_generic( | |||
|
74 | 74 | DEBUGLOG(5, "ZSTD_compressBlock_fast_generic"); |
|
75 | 75 | ip0 += (ip0 == prefixStart); |
|
76 | 76 | ip1 = ip0 + 1; |
|
77 | { | |
|
78 | U32 const maxRep = (U32)(ip0 - prefixStart); | |
|
77 | { U32 const maxRep = (U32)(ip0 - prefixStart); | |
|
79 | 78 | if (offset_2 > maxRep) offsetSaved = offset_2, offset_2 = 0; |
|
80 | 79 | if (offset_1 > maxRep) offsetSaved = offset_1, offset_1 = 0; |
|
81 | 80 | } |
@@ -118,8 +117,7 size_t ZSTD_compressBlock_fast_generic( | |||
|
118 | 117 | match0 = match1; |
|
119 | 118 | goto _offset; |
|
120 | 119 | } |
|
121 | { | |
|
122 | size_t const step = ((ip0-anchor) >> (kSearchStrength - 1)) + stepSize; | |
|
120 | { size_t const step = ((size_t)(ip0-anchor) >> (kSearchStrength - 1)) + stepSize; | |
|
123 | 121 | assert(step >= 2); |
|
124 | 122 | ip0 += step; |
|
125 | 123 | ip1 += step; |
@@ -138,7 +136,7 size_t ZSTD_compressBlock_fast_generic( | |||
|
138 | 136 | _match: /* Requires: ip0, match0, offcode */ |
|
139 | 137 | /* Count the forward length */ |
|
140 | 138 | mLength += ZSTD_count(ip0+mLength+4, match0+mLength+4, iend) + 4; |
|
141 | ZSTD_storeSeq(seqStore, ip0-anchor, anchor, offcode, mLength-MINMATCH); | |
|
139 | ZSTD_storeSeq(seqStore, (size_t)(ip0-anchor), anchor, iend, offcode, mLength-MINMATCH); | |
|
142 | 140 | /* match found */ |
|
143 | 141 | ip0 += mLength; |
|
144 | 142 | anchor = ip0; |
@@ -150,16 +148,15 size_t ZSTD_compressBlock_fast_generic( | |||
|
150 | 148 | hashTable[ZSTD_hashPtr(base+current0+2, hlog, mls)] = current0+2; /* here because current+2 could be > iend-8 */ |
|
151 | 149 | hashTable[ZSTD_hashPtr(ip0-2, hlog, mls)] = (U32)(ip0-2-base); |
|
152 | 150 | |
|
153 | while ( (ip0 <= ilimit) | |
|
154 |
&& ( (offset_2 |
|
|
155 | & (MEM_read32(ip0) == MEM_read32(ip0 - offset_2)) )) { | |
|
151 | while ( ((ip0 <= ilimit) & (offset_2>0)) /* offset_2==0 means offset_2 is invalidated */ | |
|
152 | && (MEM_read32(ip0) == MEM_read32(ip0 - offset_2)) ) { | |
|
156 | 153 | /* store sequence */ |
|
157 | 154 | size_t const rLength = ZSTD_count(ip0+4, ip0+4-offset_2, iend) + 4; |
|
158 | U32 const tmpOff = offset_2; offset_2 = offset_1; offset_1 = tmpOff; /* swap offset_2 <=> offset_1 */ | |
|
155 | { U32 const tmpOff = offset_2; offset_2 = offset_1; offset_1 = tmpOff; } /* swap offset_2 <=> offset_1 */ | |
|
159 | 156 | hashTable[ZSTD_hashPtr(ip0, hlog, mls)] = (U32)(ip0-base); |
|
160 | 157 | ip0 += rLength; |
|
161 | 158 | ip1 = ip0 + 1; |
|
162 | ZSTD_storeSeq(seqStore, 0, anchor, 0, rLength-MINMATCH); | |
|
159 | ZSTD_storeSeq(seqStore, 0 /*litLen*/, anchor, iend, 0 /*offCode*/, rLength-MINMATCH); | |
|
163 | 160 | anchor = ip0; |
|
164 | 161 | continue; /* faster when present (confirmed on gcc-8) ... (?) */ |
|
165 | 162 | } |
@@ -179,8 +176,7 size_t ZSTD_compressBlock_fast( | |||
|
179 | 176 | ZSTD_matchState_t* ms, seqStore_t* seqStore, U32 rep[ZSTD_REP_NUM], |
|
180 | 177 | void const* src, size_t srcSize) |
|
181 | 178 | { |
|
182 | ZSTD_compressionParameters const* cParams = &ms->cParams; | |
|
183 | U32 const mls = cParams->minMatch; | |
|
179 | U32 const mls = ms->cParams.minMatch; | |
|
184 | 180 | assert(ms->dictMatchState == NULL); |
|
185 | 181 | switch(mls) |
|
186 | 182 | { |
@@ -265,7 +261,7 size_t ZSTD_compressBlock_fast_dictMatch | |||
|
265 | 261 | const BYTE* const repMatchEnd = repIndex < prefixStartIndex ? dictEnd : iend; |
|
266 | 262 | mLength = ZSTD_count_2segments(ip+1+4, repMatch+4, iend, repMatchEnd, prefixStart) + 4; |
|
267 | 263 | ip++; |
|
268 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, 0, mLength-MINMATCH); | |
|
264 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, 0, mLength-MINMATCH); | |
|
269 | 265 | } else if ( (matchIndex <= prefixStartIndex) ) { |
|
270 | 266 | size_t const dictHash = ZSTD_hashPtr(ip, dictHLog, mls); |
|
271 | 267 | U32 const dictMatchIndex = dictHashTable[dictHash]; |
@@ -285,7 +281,7 size_t ZSTD_compressBlock_fast_dictMatch | |||
|
285 | 281 | } /* catch up */ |
|
286 | 282 | offset_2 = offset_1; |
|
287 | 283 | offset_1 = offset; |
|
288 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
284 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
289 | 285 | } |
|
290 | 286 | } else if (MEM_read32(match) != MEM_read32(ip)) { |
|
291 | 287 | /* it's not a match, and we're not going to check the dictionary */ |
@@ -300,7 +296,7 size_t ZSTD_compressBlock_fast_dictMatch | |||
|
300 | 296 | && (ip[-1] == match[-1])) { ip--; match--; mLength++; } /* catch up */ |
|
301 | 297 | offset_2 = offset_1; |
|
302 | 298 | offset_1 = offset; |
|
303 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
299 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
304 | 300 | } |
|
305 | 301 | |
|
306 | 302 | /* match found */ |
@@ -325,7 +321,7 size_t ZSTD_compressBlock_fast_dictMatch | |||
|
325 | 321 | const BYTE* const repEnd2 = repIndex2 < prefixStartIndex ? dictEnd : iend; |
|
326 | 322 | size_t const repLength2 = ZSTD_count_2segments(ip+4, repMatch2+4, iend, repEnd2, prefixStart) + 4; |
|
327 | 323 | U32 tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; /* swap offset_2 <=> offset_1 */ |
|
328 | ZSTD_storeSeq(seqStore, 0, anchor, 0, repLength2-MINMATCH); | |
|
324 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, repLength2-MINMATCH); | |
|
329 | 325 | hashTable[ZSTD_hashPtr(ip, hlog, mls)] = current2; |
|
330 | 326 | ip += repLength2; |
|
331 | 327 | anchor = ip; |
@@ -348,8 +344,7 size_t ZSTD_compressBlock_fast_dictMatch | |||
|
348 | 344 | ZSTD_matchState_t* ms, seqStore_t* seqStore, U32 rep[ZSTD_REP_NUM], |
|
349 | 345 | void const* src, size_t srcSize) |
|
350 | 346 | { |
|
351 | ZSTD_compressionParameters const* cParams = &ms->cParams; | |
|
352 | U32 const mls = cParams->minMatch; | |
|
347 | U32 const mls = ms->cParams.minMatch; | |
|
353 | 348 | assert(ms->dictMatchState != NULL); |
|
354 | 349 | switch(mls) |
|
355 | 350 | { |
@@ -408,16 +403,17 static size_t ZSTD_compressBlock_fast_ex | |||
|
408 | 403 | const U32 repIndex = current + 1 - offset_1; |
|
409 | 404 | const BYTE* const repBase = repIndex < prefixStartIndex ? dictBase : base; |
|
410 | 405 | const BYTE* const repMatch = repBase + repIndex; |
|
411 | size_t mLength; | |
|
412 | 406 | hashTable[h] = current; /* update hash table */ |
|
413 | 407 | assert(offset_1 <= current +1); /* check repIndex */ |
|
414 | 408 | |
|
415 | 409 | if ( (((U32)((prefixStartIndex-1) - repIndex) >= 3) /* intentional underflow */ & (repIndex > dictStartIndex)) |
|
416 | 410 | && (MEM_read32(repMatch) == MEM_read32(ip+1)) ) { |
|
417 | 411 | const BYTE* const repMatchEnd = repIndex < prefixStartIndex ? dictEnd : iend; |
|
418 |
|
|
|
412 | size_t const rLength = ZSTD_count_2segments(ip+1 +4, repMatch +4, iend, repMatchEnd, prefixStart) + 4; | |
|
419 | 413 | ip++; |
|
420 |
ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, 0, |
|
|
414 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, 0, rLength-MINMATCH); | |
|
415 | ip += rLength; | |
|
416 | anchor = ip; | |
|
421 | 417 | } else { |
|
422 | 418 | if ( (matchIndex < dictStartIndex) || |
|
423 | 419 | (MEM_read32(match) != MEM_read32(ip)) ) { |
@@ -427,19 +423,15 static size_t ZSTD_compressBlock_fast_ex | |||
|
427 | 423 | } |
|
428 | 424 | { const BYTE* const matchEnd = matchIndex < prefixStartIndex ? dictEnd : iend; |
|
429 | 425 | const BYTE* const lowMatchPtr = matchIndex < prefixStartIndex ? dictStart : prefixStart; |
|
430 | U32 offset; | |
|
431 | mLength = ZSTD_count_2segments(ip+4, match+4, iend, matchEnd, prefixStart) + 4; | |
|
426 | U32 const offset = current - matchIndex; | |
|
427 | size_t mLength = ZSTD_count_2segments(ip+4, match+4, iend, matchEnd, prefixStart) + 4; | |
|
432 | 428 | while (((ip>anchor) & (match>lowMatchPtr)) && (ip[-1] == match[-1])) { ip--; match--; mLength++; } /* catch up */ |
|
433 | offset = current - matchIndex; | |
|
434 | offset_2 = offset_1; | |
|
435 |
|
|
|
436 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
429 | offset_2 = offset_1; offset_1 = offset; /* update offset history */ | |
|
430 | ZSTD_storeSeq(seqStore, (size_t)(ip-anchor), anchor, iend, offset + ZSTD_REP_MOVE, mLength-MINMATCH); | |
|
431 | ip += mLength; | |
|
432 | anchor = ip; | |
|
437 | 433 | } } |
|
438 | 434 | |
|
439 | /* found a match : store it */ | |
|
440 | ip += mLength; | |
|
441 | anchor = ip; | |
|
442 | ||
|
443 | 435 | if (ip <= ilimit) { |
|
444 | 436 | /* Fill Table */ |
|
445 | 437 | hashTable[ZSTD_hashPtr(base+current+2, hlog, mls)] = current+2; |
@@ -448,13 +440,13 static size_t ZSTD_compressBlock_fast_ex | |||
|
448 | 440 | while (ip <= ilimit) { |
|
449 | 441 | U32 const current2 = (U32)(ip-base); |
|
450 | 442 | U32 const repIndex2 = current2 - offset_2; |
|
451 | const BYTE* repMatch2 = repIndex2 < prefixStartIndex ? dictBase + repIndex2 : base + repIndex2; | |
|
443 | const BYTE* const repMatch2 = repIndex2 < prefixStartIndex ? dictBase + repIndex2 : base + repIndex2; | |
|
452 | 444 | if ( (((U32)((prefixStartIndex-1) - repIndex2) >= 3) & (repIndex2 > dictStartIndex)) /* intentional overflow */ |
|
453 | 445 | && (MEM_read32(repMatch2) == MEM_read32(ip)) ) { |
|
454 | 446 | const BYTE* const repEnd2 = repIndex2 < prefixStartIndex ? dictEnd : iend; |
|
455 | 447 | size_t const repLength2 = ZSTD_count_2segments(ip+4, repMatch2+4, iend, repEnd2, prefixStart) + 4; |
|
456 | U32 const tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; /* swap offset_2 <=> offset_1 */ | |
|
457 | ZSTD_storeSeq(seqStore, 0, anchor, 0, repLength2-MINMATCH); | |
|
448 | { U32 const tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; } /* swap offset_2 <=> offset_1 */ | |
|
449 | ZSTD_storeSeq(seqStore, 0 /*litlen*/, anchor, iend, 0 /*offcode*/, repLength2-MINMATCH); | |
|
458 | 450 | hashTable[ZSTD_hashPtr(ip, hlog, mls)] = current2; |
|
459 | 451 | ip += repLength2; |
|
460 | 452 | anchor = ip; |
@@ -476,8 +468,7 size_t ZSTD_compressBlock_fast_extDict( | |||
|
476 | 468 | ZSTD_matchState_t* ms, seqStore_t* seqStore, U32 rep[ZSTD_REP_NUM], |
|
477 | 469 | void const* src, size_t srcSize) |
|
478 | 470 | { |
|
479 | ZSTD_compressionParameters const* cParams = &ms->cParams; | |
|
480 | U32 const mls = cParams->minMatch; | |
|
471 | U32 const mls = ms->cParams.minMatch; | |
|
481 | 472 | switch(mls) |
|
482 | 473 | { |
|
483 | 474 | default: /* includes case 3 */ |
@@ -810,7 +810,7 ZSTD_compressBlock_lazy_generic( | |||
|
810 | 810 | /* store sequence */ |
|
811 | 811 | _storeSequence: |
|
812 | 812 | { size_t const litLength = start - anchor; |
|
813 | ZSTD_storeSeq(seqStore, litLength, anchor, (U32)offset, matchLength-MINMATCH); | |
|
813 | ZSTD_storeSeq(seqStore, litLength, anchor, iend, (U32)offset, matchLength-MINMATCH); | |
|
814 | 814 | anchor = ip = start + matchLength; |
|
815 | 815 | } |
|
816 | 816 | |
@@ -828,7 +828,7 ZSTD_compressBlock_lazy_generic( | |||
|
828 | 828 | const BYTE* const repEnd2 = repIndex < prefixLowestIndex ? dictEnd : iend; |
|
829 | 829 | matchLength = ZSTD_count_2segments(ip+4, repMatch+4, iend, repEnd2, prefixLowest) + 4; |
|
830 | 830 | offset = offset_2; offset_2 = offset_1; offset_1 = (U32)offset; /* swap offset_2 <=> offset_1 */ |
|
831 | ZSTD_storeSeq(seqStore, 0, anchor, 0, matchLength-MINMATCH); | |
|
831 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, matchLength-MINMATCH); | |
|
832 | 832 | ip += matchLength; |
|
833 | 833 | anchor = ip; |
|
834 | 834 | continue; |
@@ -843,7 +843,7 ZSTD_compressBlock_lazy_generic( | |||
|
843 | 843 | /* store sequence */ |
|
844 | 844 | matchLength = ZSTD_count(ip+4, ip+4-offset_2, iend) + 4; |
|
845 | 845 | offset = offset_2; offset_2 = offset_1; offset_1 = (U32)offset; /* swap repcodes */ |
|
846 | ZSTD_storeSeq(seqStore, 0, anchor, 0, matchLength-MINMATCH); | |
|
846 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, matchLength-MINMATCH); | |
|
847 | 847 | ip += matchLength; |
|
848 | 848 | anchor = ip; |
|
849 | 849 | continue; /* faster when present ... (?) */ |
@@ -1051,7 +1051,7 size_t ZSTD_compressBlock_lazy_extDict_g | |||
|
1051 | 1051 | /* store sequence */ |
|
1052 | 1052 | _storeSequence: |
|
1053 | 1053 | { size_t const litLength = start - anchor; |
|
1054 | ZSTD_storeSeq(seqStore, litLength, anchor, (U32)offset, matchLength-MINMATCH); | |
|
1054 | ZSTD_storeSeq(seqStore, litLength, anchor, iend, (U32)offset, matchLength-MINMATCH); | |
|
1055 | 1055 | anchor = ip = start + matchLength; |
|
1056 | 1056 | } |
|
1057 | 1057 | |
@@ -1066,7 +1066,7 size_t ZSTD_compressBlock_lazy_extDict_g | |||
|
1066 | 1066 | const BYTE* const repEnd = repIndex < dictLimit ? dictEnd : iend; |
|
1067 | 1067 | matchLength = ZSTD_count_2segments(ip+4, repMatch+4, iend, repEnd, prefixStart) + 4; |
|
1068 | 1068 | offset = offset_2; offset_2 = offset_1; offset_1 = (U32)offset; /* swap offset history */ |
|
1069 | ZSTD_storeSeq(seqStore, 0, anchor, 0, matchLength-MINMATCH); | |
|
1069 | ZSTD_storeSeq(seqStore, 0, anchor, iend, 0, matchLength-MINMATCH); | |
|
1070 | 1070 | ip += matchLength; |
|
1071 | 1071 | anchor = ip; |
|
1072 | 1072 | continue; /* faster when present ... (?) */ |
@@ -49,9 +49,9 size_t ZSTD_ldm_getTableSize(ldmParams_t | |||
|
49 | 49 | { |
|
50 | 50 | size_t const ldmHSize = ((size_t)1) << params.hashLog; |
|
51 | 51 | size_t const ldmBucketSizeLog = MIN(params.bucketSizeLog, params.hashLog); |
|
52 | size_t const ldmBucketSize = | |
|
53 | ((size_t)1) << (params.hashLog - ldmBucketSizeLog); | |
|
54 | size_t const totalSize = ldmBucketSize + ldmHSize * sizeof(ldmEntry_t); | |
|
52 | size_t const ldmBucketSize = ((size_t)1) << (params.hashLog - ldmBucketSizeLog); | |
|
53 | size_t const totalSize = ZSTD_cwksp_alloc_size(ldmBucketSize) | |
|
54 | + ZSTD_cwksp_alloc_size(ldmHSize * sizeof(ldmEntry_t)); | |
|
55 | 55 | return params.enableLdm ? totalSize : 0; |
|
56 | 56 | } |
|
57 | 57 | |
@@ -583,7 +583,7 size_t ZSTD_ldm_blockCompress(rawSeqStor | |||
|
583 | 583 | rep[i] = rep[i-1]; |
|
584 | 584 | rep[0] = sequence.offset; |
|
585 | 585 | /* Store the sequence */ |
|
586 | ZSTD_storeSeq(seqStore, newLitLength, ip - newLitLength, | |
|
586 | ZSTD_storeSeq(seqStore, newLitLength, ip - newLitLength, iend, | |
|
587 | 587 | sequence.offset + ZSTD_REP_MOVE, |
|
588 | 588 | sequence.matchLength - MINMATCH); |
|
589 | 589 | ip += sequence.matchLength; |
@@ -1098,7 +1098,7 ZSTD_compressBlock_opt_generic(ZSTD_matc | |||
|
1098 | 1098 | |
|
1099 | 1099 | assert(anchor + llen <= iend); |
|
1100 | 1100 | ZSTD_updateStats(optStatePtr, llen, anchor, offCode, mlen); |
|
1101 | ZSTD_storeSeq(seqStore, llen, anchor, offCode, mlen-MINMATCH); | |
|
1101 | ZSTD_storeSeq(seqStore, llen, anchor, iend, offCode, mlen-MINMATCH); | |
|
1102 | 1102 | anchor += advance; |
|
1103 | 1103 | ip = anchor; |
|
1104 | 1104 | } } |
@@ -668,7 +668,7 static void ZSTDMT_compressionJob(void* | |||
|
668 | 668 | |
|
669 | 669 | /* init */ |
|
670 | 670 | if (job->cdict) { |
|
671 | size_t const initError = ZSTD_compressBegin_advanced_internal(cctx, NULL, 0, ZSTD_dct_auto, ZSTD_dtlm_fast, job->cdict, jobParams, job->fullFrameSize); | |
|
671 | size_t const initError = ZSTD_compressBegin_advanced_internal(cctx, NULL, 0, ZSTD_dct_auto, ZSTD_dtlm_fast, job->cdict, &jobParams, job->fullFrameSize); | |
|
672 | 672 | assert(job->firstJob); /* only allowed for first job */ |
|
673 | 673 | if (ZSTD_isError(initError)) JOB_ERROR(initError); |
|
674 | 674 | } else { /* srcStart points at reloaded section */ |
@@ -680,7 +680,7 static void ZSTDMT_compressionJob(void* | |||
|
680 | 680 | job->prefix.start, job->prefix.size, ZSTD_dct_rawContent, /* load dictionary in "content-only" mode (no header analysis) */ |
|
681 | 681 | ZSTD_dtlm_fast, |
|
682 | 682 | NULL, /*cdict*/ |
|
683 | jobParams, pledgedSrcSize); | |
|
683 | &jobParams, pledgedSrcSize); | |
|
684 | 684 | if (ZSTD_isError(initError)) JOB_ERROR(initError); |
|
685 | 685 | } } |
|
686 | 686 | |
@@ -927,12 +927,18 static void ZSTDMT_releaseAllJobResource | |||
|
927 | 927 | unsigned jobID; |
|
928 | 928 | DEBUGLOG(3, "ZSTDMT_releaseAllJobResources"); |
|
929 | 929 | for (jobID=0; jobID <= mtctx->jobIDMask; jobID++) { |
|
930 | /* Copy the mutex/cond out */ | |
|
931 | ZSTD_pthread_mutex_t const mutex = mtctx->jobs[jobID].job_mutex; | |
|
932 | ZSTD_pthread_cond_t const cond = mtctx->jobs[jobID].job_cond; | |
|
933 | ||
|
930 | 934 | DEBUGLOG(4, "job%02u: release dst address %08X", jobID, (U32)(size_t)mtctx->jobs[jobID].dstBuff.start); |
|
931 | 935 | ZSTDMT_releaseBuffer(mtctx->bufPool, mtctx->jobs[jobID].dstBuff); |
|
932 | mtctx->jobs[jobID].dstBuff = g_nullBuffer; | |
|
933 | mtctx->jobs[jobID].cSize = 0; | |
|
936 | ||
|
937 | /* Clear the job description, but keep the mutex/cond */ | |
|
938 | memset(&mtctx->jobs[jobID], 0, sizeof(mtctx->jobs[jobID])); | |
|
939 | mtctx->jobs[jobID].job_mutex = mutex; | |
|
940 | mtctx->jobs[jobID].job_cond = cond; | |
|
934 | 941 | } |
|
935 | memset(mtctx->jobs, 0, (mtctx->jobIDMask+1)*sizeof(ZSTDMT_jobDescription)); | |
|
936 | 942 | mtctx->inBuff.buffer = g_nullBuffer; |
|
937 | 943 | mtctx->inBuff.filled = 0; |
|
938 | 944 | mtctx->allJobsCompleted = 1; |
@@ -1028,9 +1034,9 size_t ZSTDMT_getMTCtxParameter(ZSTDMT_C | |||
|
1028 | 1034 | |
|
1029 | 1035 | /* Sets parameters relevant to the compression job, |
|
1030 | 1036 | * initializing others to default values. */ |
|
1031 |
static ZSTD_CCtx_params ZSTDMT_initJobCCtxParams(ZSTD_CCtx_params |
|
|
1037 | static ZSTD_CCtx_params ZSTDMT_initJobCCtxParams(const ZSTD_CCtx_params* params) | |
|
1032 | 1038 | { |
|
1033 | ZSTD_CCtx_params jobParams = params; | |
|
1039 | ZSTD_CCtx_params jobParams = *params; | |
|
1034 | 1040 | /* Clear parameters related to multithreading */ |
|
1035 | 1041 | jobParams.forceWindow = 0; |
|
1036 | 1042 | jobParams.nbWorkers = 0; |
@@ -1151,16 +1157,16 size_t ZSTDMT_toFlushNow(ZSTDMT_CCtx* mt | |||
|
1151 | 1157 | /* ===== Multi-threaded compression ===== */ |
|
1152 | 1158 | /* ------------------------------------------ */ |
|
1153 | 1159 | |
|
1154 |
static unsigned ZSTDMT_computeTargetJobLog(ZSTD_CCtx_params |
|
|
1160 | static unsigned ZSTDMT_computeTargetJobLog(const ZSTD_CCtx_params* params) | |
|
1155 | 1161 | { |
|
1156 | 1162 | unsigned jobLog; |
|
1157 |
if (params |
|
|
1163 | if (params->ldmParams.enableLdm) { | |
|
1158 | 1164 | /* In Long Range Mode, the windowLog is typically oversized. |
|
1159 | 1165 | * In which case, it's preferable to determine the jobSize |
|
1160 | 1166 | * based on chainLog instead. */ |
|
1161 |
jobLog = MAX(21, params |
|
|
1167 | jobLog = MAX(21, params->cParams.chainLog + 4); | |
|
1162 | 1168 | } else { |
|
1163 |
jobLog = MAX(20, params |
|
|
1169 | jobLog = MAX(20, params->cParams.windowLog + 2); | |
|
1164 | 1170 | } |
|
1165 | 1171 | return MIN(jobLog, (unsigned)ZSTDMT_JOBLOG_MAX); |
|
1166 | 1172 | } |
@@ -1193,27 +1199,27 static int ZSTDMT_overlapLog(int ovlog, | |||
|
1193 | 1199 | return ovlog; |
|
1194 | 1200 | } |
|
1195 | 1201 | |
|
1196 |
static size_t ZSTDMT_computeOverlapSize(ZSTD_CCtx_params |
|
|
1202 | static size_t ZSTDMT_computeOverlapSize(const ZSTD_CCtx_params* params) | |
|
1197 | 1203 | { |
|
1198 |
int const overlapRLog = 9 - ZSTDMT_overlapLog(params |
|
|
1199 |
int ovLog = (overlapRLog >= 8) ? 0 : (params |
|
|
1204 | int const overlapRLog = 9 - ZSTDMT_overlapLog(params->overlapLog, params->cParams.strategy); | |
|
1205 | int ovLog = (overlapRLog >= 8) ? 0 : (params->cParams.windowLog - overlapRLog); | |
|
1200 | 1206 | assert(0 <= overlapRLog && overlapRLog <= 8); |
|
1201 |
if (params |
|
|
1207 | if (params->ldmParams.enableLdm) { | |
|
1202 | 1208 | /* In Long Range Mode, the windowLog is typically oversized. |
|
1203 | 1209 | * In which case, it's preferable to determine the jobSize |
|
1204 | 1210 | * based on chainLog instead. |
|
1205 | 1211 | * Then, ovLog becomes a fraction of the jobSize, rather than windowSize */ |
|
1206 |
ovLog = MIN(params |
|
|
1212 | ovLog = MIN(params->cParams.windowLog, ZSTDMT_computeTargetJobLog(params) - 2) | |
|
1207 | 1213 | - overlapRLog; |
|
1208 | 1214 | } |
|
1209 | 1215 | assert(0 <= ovLog && ovLog <= ZSTD_WINDOWLOG_MAX); |
|
1210 |
DEBUGLOG(4, "overlapLog : %i", params |
|
|
1216 | DEBUGLOG(4, "overlapLog : %i", params->overlapLog); | |
|
1211 | 1217 | DEBUGLOG(4, "overlap size : %i", 1 << ovLog); |
|
1212 | 1218 | return (ovLog==0) ? 0 : (size_t)1 << ovLog; |
|
1213 | 1219 | } |
|
1214 | 1220 | |
|
1215 | 1221 | static unsigned |
|
1216 | ZSTDMT_computeNbJobs(ZSTD_CCtx_params params, size_t srcSize, unsigned nbWorkers) | |
|
1222 | ZSTDMT_computeNbJobs(const ZSTD_CCtx_params* params, size_t srcSize, unsigned nbWorkers) | |
|
1217 | 1223 | { |
|
1218 | 1224 | assert(nbWorkers>0); |
|
1219 | 1225 | { size_t const jobSizeTarget = (size_t)1 << ZSTDMT_computeTargetJobLog(params); |
@@ -1236,9 +1242,9 static size_t ZSTDMT_compress_advanced_i | |||
|
1236 | 1242 | const ZSTD_CDict* cdict, |
|
1237 | 1243 | ZSTD_CCtx_params params) |
|
1238 | 1244 | { |
|
1239 | ZSTD_CCtx_params const jobParams = ZSTDMT_initJobCCtxParams(params); | |
|
1240 | size_t const overlapSize = ZSTDMT_computeOverlapSize(params); | |
|
1241 | unsigned const nbJobs = ZSTDMT_computeNbJobs(params, srcSize, params.nbWorkers); | |
|
1245 | ZSTD_CCtx_params const jobParams = ZSTDMT_initJobCCtxParams(¶ms); | |
|
1246 | size_t const overlapSize = ZSTDMT_computeOverlapSize(¶ms); | |
|
1247 | unsigned const nbJobs = ZSTDMT_computeNbJobs(¶ms, srcSize, params.nbWorkers); | |
|
1242 | 1248 | size_t const proposedJobSize = (srcSize + (nbJobs-1)) / nbJobs; |
|
1243 | 1249 | size_t const avgJobSize = (((proposedJobSize-1) & 0x1FFFF) < 0x7FFF) ? proposedJobSize + 0xFFFF : proposedJobSize; /* avoid too small last block */ |
|
1244 | 1250 | const char* const srcStart = (const char*)src; |
@@ -1256,7 +1262,7 static size_t ZSTDMT_compress_advanced_i | |||
|
1256 | 1262 | ZSTD_CCtx* const cctx = mtctx->cctxPool->cctx[0]; |
|
1257 | 1263 | DEBUGLOG(4, "ZSTDMT_compress_advanced_internal: fallback to single-thread mode"); |
|
1258 | 1264 | if (cdict) return ZSTD_compress_usingCDict_advanced(cctx, dst, dstCapacity, src, srcSize, cdict, jobParams.fParams); |
|
1259 | return ZSTD_compress_advanced_internal(cctx, dst, dstCapacity, src, srcSize, NULL, 0, jobParams); | |
|
1265 | return ZSTD_compress_advanced_internal(cctx, dst, dstCapacity, src, srcSize, NULL, 0, &jobParams); | |
|
1260 | 1266 | } |
|
1261 | 1267 | |
|
1262 | 1268 | assert(avgJobSize >= 256 KB); /* condition for ZSTD_compressBound(A) + ZSTD_compressBound(B) <= ZSTD_compressBound(A+B), required to compress directly into Dst (no additional buffer) */ |
@@ -1404,12 +1410,12 size_t ZSTDMT_initCStream_internal( | |||
|
1404 | 1410 | |
|
1405 | 1411 | mtctx->singleBlockingThread = (pledgedSrcSize <= ZSTDMT_JOBSIZE_MIN); /* do not trigger multi-threading when srcSize is too small */ |
|
1406 | 1412 | if (mtctx->singleBlockingThread) { |
|
1407 | ZSTD_CCtx_params const singleThreadParams = ZSTDMT_initJobCCtxParams(params); | |
|
1413 | ZSTD_CCtx_params const singleThreadParams = ZSTDMT_initJobCCtxParams(¶ms); | |
|
1408 | 1414 | DEBUGLOG(5, "ZSTDMT_initCStream_internal: switch to single blocking thread mode"); |
|
1409 | 1415 | assert(singleThreadParams.nbWorkers == 0); |
|
1410 | 1416 | return ZSTD_initCStream_internal(mtctx->cctxPool->cctx[0], |
|
1411 | 1417 | dict, dictSize, cdict, |
|
1412 | singleThreadParams, pledgedSrcSize); | |
|
1418 | &singleThreadParams, pledgedSrcSize); | |
|
1413 | 1419 | } |
|
1414 | 1420 | |
|
1415 | 1421 | DEBUGLOG(4, "ZSTDMT_initCStream_internal: %u workers", params.nbWorkers); |
@@ -1435,11 +1441,11 size_t ZSTDMT_initCStream_internal( | |||
|
1435 | 1441 | mtctx->cdict = cdict; |
|
1436 | 1442 | } |
|
1437 | 1443 | |
|
1438 | mtctx->targetPrefixSize = ZSTDMT_computeOverlapSize(params); | |
|
1444 | mtctx->targetPrefixSize = ZSTDMT_computeOverlapSize(¶ms); | |
|
1439 | 1445 | DEBUGLOG(4, "overlapLog=%i => %u KB", params.overlapLog, (U32)(mtctx->targetPrefixSize>>10)); |
|
1440 | 1446 | mtctx->targetSectionSize = params.jobSize; |
|
1441 | 1447 | if (mtctx->targetSectionSize == 0) { |
|
1442 | mtctx->targetSectionSize = 1ULL << ZSTDMT_computeTargetJobLog(params); | |
|
1448 | mtctx->targetSectionSize = 1ULL << ZSTDMT_computeTargetJobLog(¶ms); | |
|
1443 | 1449 | } |
|
1444 | 1450 | assert(mtctx->targetSectionSize <= (size_t)ZSTDMT_JOBSIZE_MAX); |
|
1445 | 1451 |
@@ -61,7 +61,9 | |||
|
61 | 61 | * Error Management |
|
62 | 62 | ****************************************************************/ |
|
63 | 63 | #define HUF_isError ERR_isError |
|
64 | #ifndef CHECK_F | |
|
64 | 65 | #define CHECK_F(f) { size_t const err_ = (f); if (HUF_isError(err_)) return err_; } |
|
66 | #endif | |
|
65 | 67 | |
|
66 | 68 | |
|
67 | 69 | /* ************************************************************** |
@@ -88,10 +88,7 size_t ZSTD_estimateDCtxSize(void) { ret | |||
|
88 | 88 | |
|
89 | 89 | static size_t ZSTD_startingInputLength(ZSTD_format_e format) |
|
90 | 90 | { |
|
91 |
size_t const startingInputLength = (format |
|
|
92 | ZSTD_FRAMEHEADERSIZE_PREFIX - ZSTD_FRAMEIDSIZE : | |
|
93 | ZSTD_FRAMEHEADERSIZE_PREFIX; | |
|
94 | ZSTD_STATIC_ASSERT(ZSTD_FRAMEHEADERSIZE_PREFIX >= ZSTD_FRAMEIDSIZE); | |
|
91 | size_t const startingInputLength = ZSTD_FRAMEHEADERSIZE_PREFIX(format); | |
|
95 | 92 | /* only supports formats ZSTD_f_zstd1 and ZSTD_f_zstd1_magicless */ |
|
96 | 93 | assert( (format == ZSTD_f_zstd1) || (format == ZSTD_f_zstd1_magicless) ); |
|
97 | 94 | return startingInputLength; |
@@ -376,7 +373,7 unsigned long long ZSTD_findDecompressed | |||
|
376 | 373 | { |
|
377 | 374 | unsigned long long totalDstSize = 0; |
|
378 | 375 | |
|
379 | while (srcSize >= ZSTD_FRAMEHEADERSIZE_PREFIX) { | |
|
376 | while (srcSize >= ZSTD_startingInputLength(ZSTD_f_zstd1)) { | |
|
380 | 377 | U32 const magicNumber = MEM_readLE32(src); |
|
381 | 378 | |
|
382 | 379 | if ((magicNumber & ZSTD_MAGIC_SKIPPABLE_MASK) == ZSTD_MAGIC_SKIPPABLE_START) { |
@@ -629,11 +626,12 static size_t ZSTD_decompressFrame(ZSTD_ | |||
|
629 | 626 | |
|
630 | 627 | /* check */ |
|
631 | 628 | RETURN_ERROR_IF( |
|
632 | remainingSrcSize < ZSTD_FRAMEHEADERSIZE_MIN+ZSTD_blockHeaderSize, | |
|
629 | remainingSrcSize < ZSTD_FRAMEHEADERSIZE_MIN(dctx->format)+ZSTD_blockHeaderSize, | |
|
633 | 630 | srcSize_wrong); |
|
634 | 631 | |
|
635 | 632 | /* Frame Header */ |
|
636 |
{ size_t const frameHeaderSize = ZSTD_frameHeaderSize( |
|
|
633 | { size_t const frameHeaderSize = ZSTD_frameHeaderSize_internal( | |
|
634 | ip, ZSTD_FRAMEHEADERSIZE_PREFIX(dctx->format), dctx->format); | |
|
637 | 635 | if (ZSTD_isError(frameHeaderSize)) return frameHeaderSize; |
|
638 | 636 | RETURN_ERROR_IF(remainingSrcSize < frameHeaderSize+ZSTD_blockHeaderSize, |
|
639 | 637 | srcSize_wrong); |
@@ -714,7 +712,7 static size_t ZSTD_decompressMultiFrame( | |||
|
714 | 712 | dictSize = ZSTD_DDict_dictSize(ddict); |
|
715 | 713 | } |
|
716 | 714 | |
|
717 | while (srcSize >= ZSTD_FRAMEHEADERSIZE_PREFIX) { | |
|
715 | while (srcSize >= ZSTD_startingInputLength(dctx->format)) { | |
|
718 | 716 | |
|
719 | 717 | #if defined(ZSTD_LEGACY_SUPPORT) && (ZSTD_LEGACY_SUPPORT >= 1) |
|
720 | 718 | if (ZSTD_isLegacy(src, srcSize)) { |
@@ -1098,7 +1096,7 ZSTD_loadDEntropy(ZSTD_entropyDTables_t* | |||
|
1098 | 1096 | size_t const dictContentSize = (size_t)(dictEnd - (dictPtr+12)); |
|
1099 | 1097 | for (i=0; i<3; i++) { |
|
1100 | 1098 | U32 const rep = MEM_readLE32(dictPtr); dictPtr += 4; |
|
1101 |
RETURN_ERROR_IF(rep==0 || rep > |
|
|
1099 | RETURN_ERROR_IF(rep==0 || rep > dictContentSize, | |
|
1102 | 1100 | dictionary_corrupted); |
|
1103 | 1101 | entropy->rep[i] = rep; |
|
1104 | 1102 | } } |
@@ -1267,7 +1265,7 size_t ZSTD_DCtx_loadDictionary_advanced | |||
|
1267 | 1265 | { |
|
1268 | 1266 | RETURN_ERROR_IF(dctx->streamStage != zdss_init, stage_wrong); |
|
1269 | 1267 | ZSTD_clearDict(dctx); |
|
1270 |
if (dict && dictSize |
|
|
1268 | if (dict && dictSize != 0) { | |
|
1271 | 1269 | dctx->ddictLocal = ZSTD_createDDict_advanced(dict, dictSize, dictLoadMethod, dictContentType, dctx->customMem); |
|
1272 | 1270 | RETURN_ERROR_IF(dctx->ddictLocal == NULL, memory_allocation); |
|
1273 | 1271 | dctx->ddict = dctx->ddictLocal; |
@@ -1300,14 +1298,14 size_t ZSTD_DCtx_refPrefix(ZSTD_DCtx* dc | |||
|
1300 | 1298 | |
|
1301 | 1299 | |
|
1302 | 1300 | /* ZSTD_initDStream_usingDict() : |
|
1303 |
* return : expected size, aka ZSTD_ |
|
|
1301 | * return : expected size, aka ZSTD_startingInputLength(). | |
|
1304 | 1302 | * this function cannot fail */ |
|
1305 | 1303 | size_t ZSTD_initDStream_usingDict(ZSTD_DStream* zds, const void* dict, size_t dictSize) |
|
1306 | 1304 | { |
|
1307 | 1305 | DEBUGLOG(4, "ZSTD_initDStream_usingDict"); |
|
1308 | 1306 | FORWARD_IF_ERROR( ZSTD_DCtx_reset(zds, ZSTD_reset_session_only) ); |
|
1309 | 1307 | FORWARD_IF_ERROR( ZSTD_DCtx_loadDictionary(zds, dict, dictSize) ); |
|
1310 | return ZSTD_FRAMEHEADERSIZE_PREFIX; | |
|
1308 | return ZSTD_startingInputLength(zds->format); | |
|
1311 | 1309 | } |
|
1312 | 1310 | |
|
1313 | 1311 | /* note : this variant can't fail */ |
@@ -1324,16 +1322,16 size_t ZSTD_initDStream_usingDDict(ZSTD_ | |||
|
1324 | 1322 | { |
|
1325 | 1323 | FORWARD_IF_ERROR( ZSTD_DCtx_reset(dctx, ZSTD_reset_session_only) ); |
|
1326 | 1324 | FORWARD_IF_ERROR( ZSTD_DCtx_refDDict(dctx, ddict) ); |
|
1327 | return ZSTD_FRAMEHEADERSIZE_PREFIX; | |
|
1325 | return ZSTD_startingInputLength(dctx->format); | |
|
1328 | 1326 | } |
|
1329 | 1327 | |
|
1330 | 1328 | /* ZSTD_resetDStream() : |
|
1331 |
* return : expected size, aka ZSTD_ |
|
|
1329 | * return : expected size, aka ZSTD_startingInputLength(). | |
|
1332 | 1330 | * this function cannot fail */ |
|
1333 | 1331 | size_t ZSTD_resetDStream(ZSTD_DStream* dctx) |
|
1334 | 1332 | { |
|
1335 | 1333 | FORWARD_IF_ERROR(ZSTD_DCtx_reset(dctx, ZSTD_reset_session_only)); |
|
1336 | return ZSTD_FRAMEHEADERSIZE_PREFIX; | |
|
1334 | return ZSTD_startingInputLength(dctx->format); | |
|
1337 | 1335 | } |
|
1338 | 1336 | |
|
1339 | 1337 | |
@@ -1564,7 +1562,7 size_t ZSTD_decompressStream(ZSTD_DStrea | |||
|
1564 | 1562 | zds->lhSize += remainingInput; |
|
1565 | 1563 | } |
|
1566 | 1564 | input->pos = input->size; |
|
1567 | return (MAX(ZSTD_FRAMEHEADERSIZE_MIN, hSize) - zds->lhSize) + ZSTD_blockHeaderSize; /* remaining header bytes + next block header */ | |
|
1565 | return (MAX((size_t)ZSTD_FRAMEHEADERSIZE_MIN(zds->format), hSize) - zds->lhSize) + ZSTD_blockHeaderSize; /* remaining header bytes + next block header */ | |
|
1568 | 1566 | } |
|
1569 | 1567 | assert(ip != NULL); |
|
1570 | 1568 | memcpy(zds->headerBuffer + zds->lhSize, ip, toLoad); zds->lhSize = hSize; ip += toLoad; |
@@ -573,38 +573,118 typedef struct { | |||
|
573 | 573 | size_t pos; |
|
574 | 574 | } seqState_t; |
|
575 | 575 | |
|
576 | /*! ZSTD_overlapCopy8() : | |
|
577 | * Copies 8 bytes from ip to op and updates op and ip where ip <= op. | |
|
578 | * If the offset is < 8 then the offset is spread to at least 8 bytes. | |
|
579 | * | |
|
580 | * Precondition: *ip <= *op | |
|
581 | * Postcondition: *op - *op >= 8 | |
|
582 | */ | |
|
583 | static void ZSTD_overlapCopy8(BYTE** op, BYTE const** ip, size_t offset) { | |
|
584 | assert(*ip <= *op); | |
|
585 | if (offset < 8) { | |
|
586 | /* close range match, overlap */ | |
|
587 | static const U32 dec32table[] = { 0, 1, 2, 1, 4, 4, 4, 4 }; /* added */ | |
|
588 | static const int dec64table[] = { 8, 8, 8, 7, 8, 9,10,11 }; /* subtracted */ | |
|
589 | int const sub2 = dec64table[offset]; | |
|
590 | (*op)[0] = (*ip)[0]; | |
|
591 | (*op)[1] = (*ip)[1]; | |
|
592 | (*op)[2] = (*ip)[2]; | |
|
593 | (*op)[3] = (*ip)[3]; | |
|
594 | *ip += dec32table[offset]; | |
|
595 | ZSTD_copy4(*op+4, *ip); | |
|
596 | *ip -= sub2; | |
|
597 | } else { | |
|
598 | ZSTD_copy8(*op, *ip); | |
|
599 | } | |
|
600 | *ip += 8; | |
|
601 | *op += 8; | |
|
602 | assert(*op - *ip >= 8); | |
|
603 | } | |
|
576 | 604 | |
|
577 | /* ZSTD_execSequenceLast7(): | |
|
578 | * exceptional case : decompress a match starting within last 7 bytes of output buffer. | |
|
579 | * requires more careful checks, to ensure there is no overflow. | |
|
580 | * performance does not matter though. | |
|
581 | * note : this case is supposed to be never generated "naturally" by reference encoder, | |
|
582 | * since in most cases it needs at least 8 bytes to look for a match. | |
|
583 | * but it's allowed by the specification. */ | |
|
605 | /*! ZSTD_safecopy() : | |
|
606 | * Specialized version of memcpy() that is allowed to READ up to WILDCOPY_OVERLENGTH past the input buffer | |
|
607 | * and write up to 16 bytes past oend_w (op >= oend_w is allowed). | |
|
608 | * This function is only called in the uncommon case where the sequence is near the end of the block. It | |
|
609 | * should be fast for a single long sequence, but can be slow for several short sequences. | |
|
610 | * | |
|
611 | * @param ovtype controls the overlap detection | |
|
612 | * - ZSTD_no_overlap: The source and destination are guaranteed to be at least WILDCOPY_VECLEN bytes apart. | |
|
613 | * - ZSTD_overlap_src_before_dst: The src and dst may overlap and may be any distance apart. | |
|
614 | * The src buffer must be before the dst buffer. | |
|
615 | */ | |
|
616 | static void ZSTD_safecopy(BYTE* op, BYTE* const oend_w, BYTE const* ip, ptrdiff_t length, ZSTD_overlap_e ovtype) { | |
|
617 | ptrdiff_t const diff = op - ip; | |
|
618 | BYTE* const oend = op + length; | |
|
619 | ||
|
620 | assert((ovtype == ZSTD_no_overlap && (diff <= -8 || diff >= 8 || op >= oend_w)) || | |
|
621 | (ovtype == ZSTD_overlap_src_before_dst && diff >= 0)); | |
|
622 | ||
|
623 | if (length < 8) { | |
|
624 | /* Handle short lengths. */ | |
|
625 | while (op < oend) *op++ = *ip++; | |
|
626 | return; | |
|
627 | } | |
|
628 | if (ovtype == ZSTD_overlap_src_before_dst) { | |
|
629 | /* Copy 8 bytes and ensure the offset >= 8 when there can be overlap. */ | |
|
630 | assert(length >= 8); | |
|
631 | ZSTD_overlapCopy8(&op, &ip, diff); | |
|
632 | assert(op - ip >= 8); | |
|
633 | assert(op <= oend); | |
|
634 | } | |
|
635 | ||
|
636 | if (oend <= oend_w) { | |
|
637 | /* No risk of overwrite. */ | |
|
638 | ZSTD_wildcopy(op, ip, length, ovtype); | |
|
639 | return; | |
|
640 | } | |
|
641 | if (op <= oend_w) { | |
|
642 | /* Wildcopy until we get close to the end. */ | |
|
643 | assert(oend > oend_w); | |
|
644 | ZSTD_wildcopy(op, ip, oend_w - op, ovtype); | |
|
645 | ip += oend_w - op; | |
|
646 | op = oend_w; | |
|
647 | } | |
|
648 | /* Handle the leftovers. */ | |
|
649 | while (op < oend) *op++ = *ip++; | |
|
650 | } | |
|
651 | ||
|
652 | /* ZSTD_execSequenceEnd(): | |
|
653 | * This version handles cases that are near the end of the output buffer. It requires | |
|
654 | * more careful checks to make sure there is no overflow. By separating out these hard | |
|
655 | * and unlikely cases, we can speed up the common cases. | |
|
656 | * | |
|
657 | * NOTE: This function needs to be fast for a single long sequence, but doesn't need | |
|
658 | * to be optimized for many small sequences, since those fall into ZSTD_execSequence(). | |
|
659 | */ | |
|
584 | 660 | FORCE_NOINLINE |
|
585 |
size_t ZSTD_execSequence |
|
|
586 |
|
|
|
587 |
|
|
|
588 |
|
|
|
661 | size_t ZSTD_execSequenceEnd(BYTE* op, | |
|
662 | BYTE* const oend, seq_t sequence, | |
|
663 | const BYTE** litPtr, const BYTE* const litLimit, | |
|
664 | const BYTE* const prefixStart, const BYTE* const virtualStart, const BYTE* const dictEnd) | |
|
589 | 665 | { |
|
590 | 666 | BYTE* const oLitEnd = op + sequence.litLength; |
|
591 | 667 | size_t const sequenceLength = sequence.litLength + sequence.matchLength; |
|
592 | 668 | BYTE* const oMatchEnd = op + sequenceLength; /* risk : address space overflow (32-bits) */ |
|
593 | 669 | const BYTE* const iLitEnd = *litPtr + sequence.litLength; |
|
594 | 670 | const BYTE* match = oLitEnd - sequence.offset; |
|
671 | BYTE* const oend_w = oend - WILDCOPY_OVERLENGTH; | |
|
595 | 672 | |
|
596 | /* check */ | |
|
597 | RETURN_ERROR_IF(oMatchEnd>oend, dstSize_tooSmall, "last match must fit within dstBuffer"); | |
|
673 | /* bounds checks */ | |
|
674 | assert(oLitEnd < oMatchEnd); | |
|
675 | RETURN_ERROR_IF(oMatchEnd > oend, dstSize_tooSmall, "last match must fit within dstBuffer"); | |
|
598 | 676 | RETURN_ERROR_IF(iLitEnd > litLimit, corruption_detected, "try to read beyond literal buffer"); |
|
599 | 677 | |
|
600 | 678 | /* copy literals */ |
|
601 | while (op < oLitEnd) *op++ = *(*litPtr)++; | |
|
679 | ZSTD_safecopy(op, oend_w, *litPtr, sequence.litLength, ZSTD_no_overlap); | |
|
680 | op = oLitEnd; | |
|
681 | *litPtr = iLitEnd; | |
|
602 | 682 | |
|
603 | 683 | /* copy Match */ |
|
604 |
if (sequence.offset > (size_t)(oLitEnd - |
|
|
684 | if (sequence.offset > (size_t)(oLitEnd - prefixStart)) { | |
|
605 | 685 | /* offset beyond prefix */ |
|
606 |
RETURN_ERROR_IF(sequence.offset > (size_t)(oLitEnd - v |
|
|
607 |
match = dictEnd - ( |
|
|
686 | RETURN_ERROR_IF(sequence.offset > (size_t)(oLitEnd - virtualStart), corruption_detected); | |
|
687 | match = dictEnd - (prefixStart-match); | |
|
608 | 688 | if (match + sequence.matchLength <= dictEnd) { |
|
609 | 689 | memmove(oLitEnd, match, sequence.matchLength); |
|
610 | 690 | return sequenceLength; |
@@ -614,13 +694,12 size_t ZSTD_execSequenceLast7(BYTE* op, | |||
|
614 | 694 | memmove(oLitEnd, match, length1); |
|
615 | 695 | op = oLitEnd + length1; |
|
616 | 696 | sequence.matchLength -= length1; |
|
617 |
match = |
|
|
697 | match = prefixStart; | |
|
618 | 698 | } } |
|
619 | while (op < oMatchEnd) *op++ = *match++; | |
|
699 | ZSTD_safecopy(op, oend_w, match, sequence.matchLength, ZSTD_overlap_src_before_dst); | |
|
620 | 700 | return sequenceLength; |
|
621 | 701 | } |
|
622 | 702 | |
|
623 | ||
|
624 | 703 | HINT_INLINE |
|
625 | 704 | size_t ZSTD_execSequence(BYTE* op, |
|
626 | 705 | BYTE* const oend, seq_t sequence, |
@@ -634,20 +713,29 size_t ZSTD_execSequence(BYTE* op, | |||
|
634 | 713 | const BYTE* const iLitEnd = *litPtr + sequence.litLength; |
|
635 | 714 | const BYTE* match = oLitEnd - sequence.offset; |
|
636 | 715 | |
|
637 | /* check */ | |
|
638 | RETURN_ERROR_IF(oMatchEnd>oend, dstSize_tooSmall, "last match must start at a minimum distance of WILDCOPY_OVERLENGTH from oend"); | |
|
639 | RETURN_ERROR_IF(iLitEnd > litLimit, corruption_detected, "over-read beyond lit buffer"); | |
|
640 |
|
|
|
716 | /* Errors and uncommon cases handled here. */ | |
|
717 | assert(oLitEnd < oMatchEnd); | |
|
718 | if (iLitEnd > litLimit || oMatchEnd > oend_w) | |
|
719 | return ZSTD_execSequenceEnd(op, oend, sequence, litPtr, litLimit, prefixStart, virtualStart, dictEnd); | |
|
720 | ||
|
721 | /* Assumptions (everything else goes into ZSTD_execSequenceEnd()) */ | |
|
722 | assert(iLitEnd <= litLimit /* Literal length is in bounds */); | |
|
723 | assert(oLitEnd <= oend_w /* Can wildcopy literals */); | |
|
724 | assert(oMatchEnd <= oend_w /* Can wildcopy matches */); | |
|
641 | 725 | |
|
642 |
/* |
|
|
643 | if (sequence.litLength > 8) | |
|
644 | ZSTD_wildcopy_16min(op, (*litPtr), sequence.litLength, ZSTD_no_overlap); /* note : since oLitEnd <= oend-WILDCOPY_OVERLENGTH, no risk of overwrite beyond oend */ | |
|
645 | else | |
|
646 | ZSTD_copy8(op, *litPtr); | |
|
726 | /* Copy Literals: | |
|
727 | * Split out litLength <= 16 since it is nearly always true. +1.6% on gcc-9. | |
|
728 | * We likely don't need the full 32-byte wildcopy. | |
|
729 | */ | |
|
730 | assert(WILDCOPY_OVERLENGTH >= 16); | |
|
731 | ZSTD_copy16(op, (*litPtr)); | |
|
732 | if (sequence.litLength > 16) { | |
|
733 | ZSTD_wildcopy(op+16, (*litPtr)+16, sequence.litLength-16, ZSTD_no_overlap); | |
|
734 | } | |
|
647 | 735 | op = oLitEnd; |
|
648 | 736 | *litPtr = iLitEnd; /* update for next sequence */ |
|
649 | 737 | |
|
650 |
/* |
|
|
738 | /* Copy Match */ | |
|
651 | 739 | if (sequence.offset > (size_t)(oLitEnd - prefixStart)) { |
|
652 | 740 | /* offset beyond prefix -> go into extDict */ |
|
653 | 741 | RETURN_ERROR_IF(sequence.offset > (size_t)(oLitEnd - virtualStart), corruption_detected); |
@@ -662,123 +750,33 size_t ZSTD_execSequence(BYTE* op, | |||
|
662 | 750 | op = oLitEnd + length1; |
|
663 | 751 | sequence.matchLength -= length1; |
|
664 | 752 | match = prefixStart; |
|
665 | if (op > oend_w || sequence.matchLength < MINMATCH) { | |
|
666 | U32 i; | |
|
667 | for (i = 0; i < sequence.matchLength; ++i) op[i] = match[i]; | |
|
668 | return sequenceLength; | |
|
669 | } | |
|
670 | 753 | } } |
|
671 | /* Requirement: op <= oend_w && sequence.matchLength >= MINMATCH */ | |
|
672 | ||
|
673 | /* match within prefix */ | |
|
674 | if (sequence.offset < 8) { | |
|
675 | /* close range match, overlap */ | |
|
676 | static const U32 dec32table[] = { 0, 1, 2, 1, 4, 4, 4, 4 }; /* added */ | |
|
677 | static const int dec64table[] = { 8, 8, 8, 7, 8, 9,10,11 }; /* subtracted */ | |
|
678 | int const sub2 = dec64table[sequence.offset]; | |
|
679 | op[0] = match[0]; | |
|
680 | op[1] = match[1]; | |
|
681 | op[2] = match[2]; | |
|
682 | op[3] = match[3]; | |
|
683 | match += dec32table[sequence.offset]; | |
|
684 | ZSTD_copy4(op+4, match); | |
|
685 | match -= sub2; | |
|
686 | } else { | |
|
687 | ZSTD_copy8(op, match); | |
|
688 | } | |
|
689 | op += 8; match += 8; | |
|
690 | ||
|
691 | if (oMatchEnd > oend-(16-MINMATCH)) { | |
|
692 | if (op < oend_w) { | |
|
693 | ZSTD_wildcopy(op, match, oend_w - op, ZSTD_overlap_src_before_dst); | |
|
694 | match += oend_w - op; | |
|
695 | op = oend_w; | |
|
696 | } | |
|
697 | while (op < oMatchEnd) *op++ = *match++; | |
|
698 | } else { | |
|
699 | ZSTD_wildcopy(op, match, (ptrdiff_t)sequence.matchLength-8, ZSTD_overlap_src_before_dst); /* works even if matchLength < 8 */ | |
|
700 | } | |
|
701 | return sequenceLength; | |
|
702 | } | |
|
703 | ||
|
704 | ||
|
705 | HINT_INLINE | |
|
706 | size_t ZSTD_execSequenceLong(BYTE* op, | |
|
707 | BYTE* const oend, seq_t sequence, | |
|
708 | const BYTE** litPtr, const BYTE* const litLimit, | |
|
709 | const BYTE* const prefixStart, const BYTE* const dictStart, const BYTE* const dictEnd) | |
|
710 | { | |
|
711 | BYTE* const oLitEnd = op + sequence.litLength; | |
|
712 | size_t const sequenceLength = sequence.litLength + sequence.matchLength; | |
|
713 | BYTE* const oMatchEnd = op + sequenceLength; /* risk : address space overflow (32-bits) */ | |
|
714 | BYTE* const oend_w = oend - WILDCOPY_OVERLENGTH; | |
|
715 | const BYTE* const iLitEnd = *litPtr + sequence.litLength; | |
|
716 | const BYTE* match = sequence.match; | |
|
717 | ||
|
718 | /* check */ | |
|
719 | RETURN_ERROR_IF(oMatchEnd > oend, dstSize_tooSmall, "last match must start at a minimum distance of WILDCOPY_OVERLENGTH from oend"); | |
|
720 | RETURN_ERROR_IF(iLitEnd > litLimit, corruption_detected, "over-read beyond lit buffer"); | |
|
721 | if (oLitEnd > oend_w) return ZSTD_execSequenceLast7(op, oend, sequence, litPtr, litLimit, prefixStart, dictStart, dictEnd); | |
|
754 | /* Match within prefix of 1 or more bytes */ | |
|
755 | assert(op <= oMatchEnd); | |
|
756 | assert(oMatchEnd <= oend_w); | |
|
757 | assert(match >= prefixStart); | |
|
758 | assert(sequence.matchLength >= 1); | |
|
722 | 759 | |
|
723 | /* copy Literals */ | |
|
724 | if (sequence.litLength > 8) | |
|
725 | ZSTD_wildcopy_16min(op, *litPtr, sequence.litLength, ZSTD_no_overlap); /* note : since oLitEnd <= oend-WILDCOPY_OVERLENGTH, no risk of overwrite beyond oend */ | |
|
726 | else | |
|
727 | ZSTD_copy8(op, *litPtr); /* note : op <= oLitEnd <= oend_w == oend - 8 */ | |
|
728 | ||
|
729 | op = oLitEnd; | |
|
730 | *litPtr = iLitEnd; /* update for next sequence */ | |
|
760 | /* Nearly all offsets are >= WILDCOPY_VECLEN bytes, which means we can use wildcopy | |
|
761 | * without overlap checking. | |
|
762 | */ | |
|
763 | if (sequence.offset >= WILDCOPY_VECLEN) { | |
|
764 | /* We bet on a full wildcopy for matches, since we expect matches to be | |
|
765 | * longer than literals (in general). In silesia, ~10% of matches are longer | |
|
766 | * than 16 bytes. | |
|
767 | */ | |
|
768 | ZSTD_wildcopy(op, match, (ptrdiff_t)sequence.matchLength, ZSTD_no_overlap); | |
|
769 | return sequenceLength; | |
|
770 | } | |
|
771 | assert(sequence.offset < WILDCOPY_VECLEN); | |
|
731 | 772 | |
|
732 | /* copy Match */ | |
|
733 | if (sequence.offset > (size_t)(oLitEnd - prefixStart)) { | |
|
734 | /* offset beyond prefix */ | |
|
735 | RETURN_ERROR_IF(sequence.offset > (size_t)(oLitEnd - dictStart), corruption_detected); | |
|
736 | if (match + sequence.matchLength <= dictEnd) { | |
|
737 | memmove(oLitEnd, match, sequence.matchLength); | |
|
738 | return sequenceLength; | |
|
739 | } | |
|
740 | /* span extDict & currentPrefixSegment */ | |
|
741 | { size_t const length1 = dictEnd - match; | |
|
742 | memmove(oLitEnd, match, length1); | |
|
743 | op = oLitEnd + length1; | |
|
744 | sequence.matchLength -= length1; | |
|
745 | match = prefixStart; | |
|
746 | if (op > oend_w || sequence.matchLength < MINMATCH) { | |
|
747 | U32 i; | |
|
748 | for (i = 0; i < sequence.matchLength; ++i) op[i] = match[i]; | |
|
749 | return sequenceLength; | |
|
750 | } | |
|
751 | } } | |
|
752 | assert(op <= oend_w); | |
|
753 | assert(sequence.matchLength >= MINMATCH); | |
|
773 | /* Copy 8 bytes and spread the offset to be >= 8. */ | |
|
774 | ZSTD_overlapCopy8(&op, &match, sequence.offset); | |
|
754 | 775 | |
|
755 | /* match within prefix */ | |
|
756 |
if (sequence. |
|
|
757 | /* close range match, overlap */ | |
|
758 | static const U32 dec32table[] = { 0, 1, 2, 1, 4, 4, 4, 4 }; /* added */ | |
|
759 | static const int dec64table[] = { 8, 8, 8, 7, 8, 9,10,11 }; /* subtracted */ | |
|
760 | int const sub2 = dec64table[sequence.offset]; | |
|
761 | op[0] = match[0]; | |
|
762 | op[1] = match[1]; | |
|
763 | op[2] = match[2]; | |
|
764 | op[3] = match[3]; | |
|
765 | match += dec32table[sequence.offset]; | |
|
766 | ZSTD_copy4(op+4, match); | |
|
767 | match -= sub2; | |
|
768 | } else { | |
|
769 | ZSTD_copy8(op, match); | |
|
770 | } | |
|
771 | op += 8; match += 8; | |
|
772 | ||
|
773 | if (oMatchEnd > oend-(16-MINMATCH)) { | |
|
774 | if (op < oend_w) { | |
|
775 | ZSTD_wildcopy(op, match, oend_w - op, ZSTD_overlap_src_before_dst); | |
|
776 | match += oend_w - op; | |
|
777 | op = oend_w; | |
|
778 | } | |
|
779 | while (op < oMatchEnd) *op++ = *match++; | |
|
780 | } else { | |
|
781 | ZSTD_wildcopy(op, match, (ptrdiff_t)sequence.matchLength-8, ZSTD_overlap_src_before_dst); /* works even if matchLength < 8 */ | |
|
776 | /* If the match length is > 8 bytes, then continue with the wildcopy. */ | |
|
777 | if (sequence.matchLength > 8) { | |
|
778 | assert(op < oMatchEnd); | |
|
779 | ZSTD_wildcopy(op, match, (ptrdiff_t)sequence.matchLength-8, ZSTD_overlap_src_before_dst); | |
|
782 | 780 | } |
|
783 | 781 | return sequenceLength; |
|
784 | 782 | } |
@@ -1098,7 +1096,7 ZSTD_decompressSequencesLong_body( | |||
|
1098 | 1096 | /* decode and decompress */ |
|
1099 | 1097 | for ( ; (BIT_reloadDStream(&(seqState.DStream)) <= BIT_DStream_completed) && (seqNb<nbSeq) ; seqNb++) { |
|
1100 | 1098 | seq_t const sequence = ZSTD_decodeSequenceLong(&seqState, isLongOffset); |
|
1101 |
size_t const oneSeqSize = ZSTD_execSequence |
|
|
1099 | size_t const oneSeqSize = ZSTD_execSequence(op, oend, sequences[(seqNb-ADVANCED_SEQS) & STORED_SEQS_MASK], &litPtr, litEnd, prefixStart, dictStart, dictEnd); | |
|
1102 | 1100 | if (ZSTD_isError(oneSeqSize)) return oneSeqSize; |
|
1103 | 1101 | PREFETCH_L1(sequence.match); PREFETCH_L1(sequence.match + sequence.matchLength - 1); /* note : it's safe to invoke PREFETCH() on any memory address, including invalid ones */ |
|
1104 | 1102 | sequences[seqNb & STORED_SEQS_MASK] = sequence; |
@@ -1109,7 +1107,7 ZSTD_decompressSequencesLong_body( | |||
|
1109 | 1107 | /* finish queue */ |
|
1110 | 1108 | seqNb -= seqAdvance; |
|
1111 | 1109 | for ( ; seqNb<nbSeq ; seqNb++) { |
|
1112 |
size_t const oneSeqSize = ZSTD_execSequence |
|
|
1110 | size_t const oneSeqSize = ZSTD_execSequence(op, oend, sequences[seqNb&STORED_SEQS_MASK], &litPtr, litEnd, prefixStart, dictStart, dictEnd); | |
|
1113 | 1111 | if (ZSTD_isError(oneSeqSize)) return oneSeqSize; |
|
1114 | 1112 | op += oneSeqSize; |
|
1115 | 1113 | } |
@@ -36,16 +36,17 extern "C" { | |||
|
36 | 36 | *****************************************************************/ |
|
37 | 37 | /* Deprecation warnings */ |
|
38 | 38 | /* Should these warnings be a problem, |
|
39 |
|
|
|
40 |
|
|
|
41 |
|
|
|
42 |
|
|
|
39 | * it is generally possible to disable them, | |
|
40 | * typically with -Wno-deprecated-declarations for gcc | |
|
41 | * or _CRT_SECURE_NO_WARNINGS in Visual. | |
|
42 | * Otherwise, it's also possible to define ZBUFF_DISABLE_DEPRECATE_WARNINGS | |
|
43 | */ | |
|
43 | 44 | #ifdef ZBUFF_DISABLE_DEPRECATE_WARNINGS |
|
44 | 45 | # define ZBUFF_DEPRECATED(message) ZSTDLIB_API /* disable deprecation warnings */ |
|
45 | 46 | #else |
|
46 | 47 | # if defined (__cplusplus) && (__cplusplus >= 201402) /* C++14 or greater */ |
|
47 | 48 | # define ZBUFF_DEPRECATED(message) [[deprecated(message)]] ZSTDLIB_API |
|
48 |
# elif (defined( |
|
|
49 | # elif (defined(GNUC) && (GNUC > 4 || (GNUC == 4 && GNUC_MINOR >= 5))) || defined(__clang__) | |
|
49 | 50 | # define ZBUFF_DEPRECATED(message) ZSTDLIB_API __attribute__((deprecated(message))) |
|
50 | 51 | # elif defined(__GNUC__) && (__GNUC__ >= 3) |
|
51 | 52 | # define ZBUFF_DEPRECATED(message) ZSTDLIB_API __attribute__((deprecated)) |
@@ -638,8 +638,8 void COVER_warnOnSmallCorpus(size_t maxD | |||
|
638 | 638 | "compared to the source size %u! " |
|
639 | 639 | "size(source)/size(dictionary) = %f, but it should be >= " |
|
640 | 640 | "10! This may lead to a subpar dictionary! We recommend " |
|
641 |
"training on sources at least 10x, and |
|
|
642 | "size of the dictionary!\n", (U32)maxDictSize, | |
|
641 | "training on sources at least 10x, and preferably 100x " | |
|
642 | "the size of the dictionary! \n", (U32)maxDictSize, | |
|
643 | 643 | (U32)nbDmers, ratio); |
|
644 | 644 | } |
|
645 | 645 |
@@ -571,7 +571,7 static void ZDICT_fillNoise(void* buffer | |||
|
571 | 571 | unsigned const prime1 = 2654435761U; |
|
572 | 572 | unsigned const prime2 = 2246822519U; |
|
573 | 573 | unsigned acc = prime1; |
|
574 |
size_t p=0; |
|
|
574 | size_t p=0; | |
|
575 | 575 | for (p=0; p<length; p++) { |
|
576 | 576 | acc *= prime2; |
|
577 | 577 | ((unsigned char*)buffer)[p] = (unsigned char)(acc >> 21); |
@@ -15,6 +15,7 extern "C" { | |||
|
15 | 15 | #define ZSTD_H_235446 |
|
16 | 16 | |
|
17 | 17 | /* ====== Dependency ======*/ |
|
18 | #include <limits.h> /* INT_MAX */ | |
|
18 | 19 | #include <stddef.h> /* size_t */ |
|
19 | 20 | |
|
20 | 21 | |
@@ -71,7 +72,7 extern "C" { | |||
|
71 | 72 | /*------ Version ------*/ |
|
72 | 73 | #define ZSTD_VERSION_MAJOR 1 |
|
73 | 74 | #define ZSTD_VERSION_MINOR 4 |
|
74 |
#define ZSTD_VERSION_RELEASE |
|
|
75 | #define ZSTD_VERSION_RELEASE 4 | |
|
75 | 76 | |
|
76 | 77 | #define ZSTD_VERSION_NUMBER (ZSTD_VERSION_MAJOR *100*100 + ZSTD_VERSION_MINOR *100 + ZSTD_VERSION_RELEASE) |
|
77 | 78 | ZSTDLIB_API unsigned ZSTD_versionNumber(void); /**< to check runtime library version */ |
@@ -196,9 +197,13 ZSTDLIB_API ZSTD_CCtx* ZSTD_createCCtx(v | |||
|
196 | 197 | ZSTDLIB_API size_t ZSTD_freeCCtx(ZSTD_CCtx* cctx); |
|
197 | 198 | |
|
198 | 199 | /*! ZSTD_compressCCtx() : |
|
199 | * Same as ZSTD_compress(), using an explicit ZSTD_CCtx | |
|
200 | * The function will compress at requested compression level, | |
|
201 | * ignoring any other parameter */ | |
|
200 | * Same as ZSTD_compress(), using an explicit ZSTD_CCtx. | |
|
201 | * Important : in order to behave similarly to `ZSTD_compress()`, | |
|
202 | * this function compresses at requested compression level, | |
|
203 | * __ignoring any other parameter__ . | |
|
204 | * If any advanced parameter was set using the advanced API, | |
|
205 | * they will all be reset. Only `compressionLevel` remains. | |
|
206 | */ | |
|
202 | 207 | ZSTDLIB_API size_t ZSTD_compressCCtx(ZSTD_CCtx* cctx, |
|
203 | 208 | void* dst, size_t dstCapacity, |
|
204 | 209 | const void* src, size_t srcSize, |
@@ -233,7 +238,7 ZSTDLIB_API size_t ZSTD_decompressDCtx(Z | |||
|
233 | 238 | * using ZSTD_CCtx_set*() functions. |
|
234 | 239 | * Pushed parameters are sticky : they are valid for next compressed frame, and any subsequent frame. |
|
235 | 240 | * "sticky" parameters are applicable to `ZSTD_compress2()` and `ZSTD_compressStream*()` ! |
|
236 | * They do not apply to "simple" one-shot variants such as ZSTD_compressCCtx() | |
|
241 | * __They do not apply to "simple" one-shot variants such as ZSTD_compressCCtx()__ . | |
|
237 | 242 | * |
|
238 | 243 | * It's possible to reset all parameters to "default" using ZSTD_CCtx_reset(). |
|
239 | 244 | * |
@@ -261,18 +266,26 typedef enum { | |||
|
261 | 266 | |
|
262 | 267 | /* compression parameters |
|
263 | 268 | * Note: When compressing with a ZSTD_CDict these parameters are superseded |
|
264 |
* by the parameters used to construct the ZSTD_CDict. |
|
|
265 | * for more info (superseded-by-cdict). */ | |
|
266 |
ZSTD_c_compressionLevel=100, /* |
|
|
269 | * by the parameters used to construct the ZSTD_CDict. | |
|
270 | * See ZSTD_CCtx_refCDict() for more info (superseded-by-cdict). */ | |
|
271 | ZSTD_c_compressionLevel=100, /* Set compression parameters according to pre-defined cLevel table. | |
|
272 | * Note that exact compression parameters are dynamically determined, | |
|
273 | * depending on both compression level and srcSize (when known). | |
|
267 | 274 | * Default level is ZSTD_CLEVEL_DEFAULT==3. |
|
268 | 275 | * Special: value 0 means default, which is controlled by ZSTD_CLEVEL_DEFAULT. |
|
269 | 276 | * Note 1 : it's possible to pass a negative compression level. |
|
270 |
* Note 2 : setting a level sets all |
|
|
277 | * Note 2 : setting a level resets all other compression parameters to default */ | |
|
278 | /* Advanced compression parameters : | |
|
279 | * It's possible to pin down compression parameters to some specific values. | |
|
280 | * In which case, these values are no longer dynamically selected by the compressor */ | |
|
271 | 281 | ZSTD_c_windowLog=101, /* Maximum allowed back-reference distance, expressed as power of 2. |
|
282 | * This will set a memory budget for streaming decompression, | |
|
283 | * with larger values requiring more memory | |
|
284 | * and typically compressing more. | |
|
272 | 285 | * Must be clamped between ZSTD_WINDOWLOG_MIN and ZSTD_WINDOWLOG_MAX. |
|
273 | 286 | * Special: value 0 means "use default windowLog". |
|
274 | 287 | * Note: Using a windowLog greater than ZSTD_WINDOWLOG_LIMIT_DEFAULT |
|
275 |
* requires explicitly allowing such |
|
|
288 | * requires explicitly allowing such size at streaming decompression stage. */ | |
|
276 | 289 | ZSTD_c_hashLog=102, /* Size of the initial probe table, as a power of 2. |
|
277 | 290 | * Resulting memory usage is (1 << (hashLog+2)). |
|
278 | 291 | * Must be clamped between ZSTD_HASHLOG_MIN and ZSTD_HASHLOG_MAX. |
@@ -283,13 +296,13 typedef enum { | |||
|
283 | 296 | * Resulting memory usage is (1 << (chainLog+2)). |
|
284 | 297 | * Must be clamped between ZSTD_CHAINLOG_MIN and ZSTD_CHAINLOG_MAX. |
|
285 | 298 | * Larger tables result in better and slower compression. |
|
286 |
* This parameter is useless |
|
|
299 | * This parameter is useless for "fast" strategy. | |
|
287 | 300 | * It's still useful when using "dfast" strategy, |
|
288 | 301 | * in which case it defines a secondary probe table. |
|
289 | 302 | * Special: value 0 means "use default chainLog". */ |
|
290 | 303 | ZSTD_c_searchLog=104, /* Number of search attempts, as a power of 2. |
|
291 | 304 | * More attempts result in better and slower compression. |
|
292 |
* This parameter is useless |
|
|
305 | * This parameter is useless for "fast" and "dFast" strategies. | |
|
293 | 306 | * Special: value 0 means "use default searchLog". */ |
|
294 | 307 | ZSTD_c_minMatch=105, /* Minimum size of searched matches. |
|
295 | 308 | * Note that Zstandard can still find matches of smaller size, |
@@ -344,7 +357,7 typedef enum { | |||
|
344 | 357 | ZSTD_c_contentSizeFlag=200, /* Content size will be written into frame header _whenever known_ (default:1) |
|
345 | 358 | * Content size must be known at the beginning of compression. |
|
346 | 359 | * This is automatically the case when using ZSTD_compress2(), |
|
347 |
* For streaming |
|
|
360 | * For streaming scenarios, content size must be provided with ZSTD_CCtx_setPledgedSrcSize() */ | |
|
348 | 361 | ZSTD_c_checksumFlag=201, /* A 32-bits checksum of content is written at end of frame (default:0) */ |
|
349 | 362 | ZSTD_c_dictIDFlag=202, /* When applicable, dictionary's ID is written into frame header (default:1) */ |
|
350 | 363 | |
@@ -363,7 +376,7 typedef enum { | |||
|
363 | 376 | * Each compression job is completed in parallel, so this value can indirectly impact the nb of active threads. |
|
364 | 377 | * 0 means default, which is dynamically determined based on compression parameters. |
|
365 | 378 | * Job size must be a minimum of overlap size, or 1 MB, whichever is largest. |
|
366 | * The minimum size is automatically and transparently enforced */ | |
|
379 | * The minimum size is automatically and transparently enforced. */ | |
|
367 | 380 | ZSTD_c_overlapLog=402, /* Control the overlap size, as a fraction of window size. |
|
368 | 381 | * The overlap size is an amount of data reloaded from previous job at the beginning of a new job. |
|
369 | 382 | * It helps preserve compression ratio, while each job is compressed in parallel. |
@@ -386,6 +399,7 typedef enum { | |||
|
386 | 399 | * ZSTD_c_forceAttachDict |
|
387 | 400 | * ZSTD_c_literalCompressionMode |
|
388 | 401 | * ZSTD_c_targetCBlockSize |
|
402 | * ZSTD_c_srcSizeHint | |
|
389 | 403 | * Because they are not stable, it's necessary to define ZSTD_STATIC_LINKING_ONLY to access them. |
|
390 | 404 | * note : never ever use experimentalParam? names directly; |
|
391 | 405 | * also, the enums values themselves are unstable and can still change. |
@@ -396,6 +410,7 typedef enum { | |||
|
396 | 410 | ZSTD_c_experimentalParam4=1001, |
|
397 | 411 | ZSTD_c_experimentalParam5=1002, |
|
398 | 412 | ZSTD_c_experimentalParam6=1003, |
|
413 | ZSTD_c_experimentalParam7=1004 | |
|
399 | 414 | } ZSTD_cParameter; |
|
400 | 415 | |
|
401 | 416 | typedef struct { |
@@ -793,12 +808,17 ZSTDLIB_API size_t ZSTD_decompress_using | |||
|
793 | 808 | typedef struct ZSTD_CDict_s ZSTD_CDict; |
|
794 | 809 | |
|
795 | 810 | /*! ZSTD_createCDict() : |
|
796 |
* When compressing multiple messages |
|
|
797 | * ZSTD_createCDict() will create a digested dictionary, ready to start future compression operations without startup cost. | |
|
811 | * When compressing multiple messages or blocks using the same dictionary, | |
|
812 | * it's recommended to digest the dictionary only once, since it's a costly operation. | |
|
813 | * ZSTD_createCDict() will create a state from digesting a dictionary. | |
|
814 | * The resulting state can be used for future compression operations with very limited startup cost. | |
|
798 | 815 | * ZSTD_CDict can be created once and shared by multiple threads concurrently, since its usage is read-only. |
|
799 |
* |
|
|
800 |
* Consider experimental function `ZSTD_createCDict_byReference()` if you prefer to not duplicate |
|
|
801 |
* Note : A ZSTD_CDict can be created from an empty dictBuffer, |
|
|
816 | * @dictBuffer can be released after ZSTD_CDict creation, because its content is copied within CDict. | |
|
817 | * Note 1 : Consider experimental function `ZSTD_createCDict_byReference()` if you prefer to not duplicate @dictBuffer content. | |
|
818 | * Note 2 : A ZSTD_CDict can be created from an empty @dictBuffer, | |
|
819 | * in which case the only thing that it transports is the @compressionLevel. | |
|
820 | * This can be useful in a pipeline featuring ZSTD_compress_usingCDict() exclusively, | |
|
821 | * expecting a ZSTD_CDict parameter with any data, including those without a known dictionary. */ | |
|
802 | 822 | ZSTDLIB_API ZSTD_CDict* ZSTD_createCDict(const void* dictBuffer, size_t dictSize, |
|
803 | 823 | int compressionLevel); |
|
804 | 824 | |
@@ -925,7 +945,7 ZSTDLIB_API size_t ZSTD_CCtx_refCDict(ZS | |||
|
925 | 945 | * Note 3 : Referencing a prefix involves building tables, which are dependent on compression parameters. |
|
926 | 946 | * It's a CPU consuming operation, with non-negligible impact on latency. |
|
927 | 947 | * If there is a need to use the same prefix multiple times, consider loadDictionary instead. |
|
928 |
* Note 4 : By default, the prefix is interpreted as raw content (ZSTD_d |
|
|
948 | * Note 4 : By default, the prefix is interpreted as raw content (ZSTD_dct_rawContent). | |
|
929 | 949 | * Use experimental ZSTD_CCtx_refPrefix_advanced() to alter dictionary interpretation. */ |
|
930 | 950 | ZSTDLIB_API size_t ZSTD_CCtx_refPrefix(ZSTD_CCtx* cctx, |
|
931 | 951 | const void* prefix, size_t prefixSize); |
@@ -969,7 +989,7 ZSTDLIB_API size_t ZSTD_DCtx_refDDict(ZS | |||
|
969 | 989 | * Note 2 : Prefix buffer is referenced. It **must** outlive decompression. |
|
970 | 990 | * Prefix buffer must remain unmodified up to the end of frame, |
|
971 | 991 | * reached when ZSTD_decompressStream() returns 0. |
|
972 |
* Note 3 : By default, the prefix is treated as raw content (ZSTD_d |
|
|
992 | * Note 3 : By default, the prefix is treated as raw content (ZSTD_dct_rawContent). | |
|
973 | 993 | * Use ZSTD_CCtx_refPrefix_advanced() to alter dictMode (Experimental section) |
|
974 | 994 | * Note 4 : Referencing a raw content prefix has almost no cpu nor memory cost. |
|
975 | 995 | * A full dictionary is more costly, as it requires building tables. |
@@ -1014,8 +1034,8 ZSTDLIB_API size_t ZSTD_sizeof_DDict(con | |||
|
1014 | 1034 | * Some of them might be removed in the future (especially when redundant with existing stable functions) |
|
1015 | 1035 | * ***************************************************************************************/ |
|
1016 | 1036 | |
|
1017 | #define ZSTD_FRAMEHEADERSIZE_PREFIX 5 /* minimum input size required to query frame header size */ | |
|
1018 | #define ZSTD_FRAMEHEADERSIZE_MIN 6 | |
|
1037 | #define ZSTD_FRAMEHEADERSIZE_PREFIX(format) ((format) == ZSTD_f_zstd1 ? 5 : 1) /* minimum input size required to query frame header size */ | |
|
1038 | #define ZSTD_FRAMEHEADERSIZE_MIN(format) ((format) == ZSTD_f_zstd1 ? 6 : 2) | |
|
1019 | 1039 | #define ZSTD_FRAMEHEADERSIZE_MAX 18 /* can be useful for static allocation */ |
|
1020 | 1040 | #define ZSTD_SKIPPABLEHEADERSIZE 8 |
|
1021 | 1041 | |
@@ -1063,6 +1083,8 ZSTDLIB_API size_t ZSTD_sizeof_DDict(con | |||
|
1063 | 1083 | /* Advanced parameter bounds */ |
|
1064 | 1084 | #define ZSTD_TARGETCBLOCKSIZE_MIN 64 |
|
1065 | 1085 | #define ZSTD_TARGETCBLOCKSIZE_MAX ZSTD_BLOCKSIZE_MAX |
|
1086 | #define ZSTD_SRCSIZEHINT_MIN 0 | |
|
1087 | #define ZSTD_SRCSIZEHINT_MAX INT_MAX | |
|
1066 | 1088 | |
|
1067 | 1089 | /* internal */ |
|
1068 | 1090 | #define ZSTD_HASHLOG3_MAX 17 |
@@ -1073,6 +1095,24 ZSTDLIB_API size_t ZSTD_sizeof_DDict(con | |||
|
1073 | 1095 | typedef struct ZSTD_CCtx_params_s ZSTD_CCtx_params; |
|
1074 | 1096 | |
|
1075 | 1097 | typedef struct { |
|
1098 | unsigned int matchPos; /* Match pos in dst */ | |
|
1099 | /* If seqDef.offset > 3, then this is seqDef.offset - 3 | |
|
1100 | * If seqDef.offset < 3, then this is the corresponding repeat offset | |
|
1101 | * But if seqDef.offset < 3 and litLength == 0, this is the | |
|
1102 | * repeat offset before the corresponding repeat offset | |
|
1103 | * And if seqDef.offset == 3 and litLength == 0, this is the | |
|
1104 | * most recent repeat offset - 1 | |
|
1105 | */ | |
|
1106 | unsigned int offset; | |
|
1107 | unsigned int litLength; /* Literal length */ | |
|
1108 | unsigned int matchLength; /* Match length */ | |
|
1109 | /* 0 when seq not rep and seqDef.offset otherwise | |
|
1110 | * when litLength == 0 this will be <= 4, otherwise <= 3 like normal | |
|
1111 | */ | |
|
1112 | unsigned int rep; | |
|
1113 | } ZSTD_Sequence; | |
|
1114 | ||
|
1115 | typedef struct { | |
|
1076 | 1116 | unsigned windowLog; /**< largest match distance : larger == more compression, more memory needed during decompression */ |
|
1077 | 1117 | unsigned chainLog; /**< fully searched segment : larger == more compression, slower, more memory (useless for fast) */ |
|
1078 | 1118 | unsigned hashLog; /**< dispatch table : larger == faster, more memory */ |
@@ -1101,21 +1141,12 typedef enum { | |||
|
1101 | 1141 | |
|
1102 | 1142 | typedef enum { |
|
1103 | 1143 | ZSTD_dlm_byCopy = 0, /**< Copy dictionary content internally */ |
|
1104 |
ZSTD_dlm_byRef = 1 |
|
|
1144 | ZSTD_dlm_byRef = 1 /**< Reference dictionary content -- the dictionary buffer must outlive its users. */ | |
|
1105 | 1145 | } ZSTD_dictLoadMethod_e; |
|
1106 | 1146 | |
|
1107 | 1147 | typedef enum { |
|
1108 | /* Opened question : should we have a format ZSTD_f_auto ? | |
|
1109 | * Today, it would mean exactly the same as ZSTD_f_zstd1. | |
|
1110 | * But, in the future, should several formats become supported, | |
|
1111 | * on the compression side, it would mean "default format". | |
|
1112 | * On the decompression side, it would mean "automatic format detection", | |
|
1113 | * so that ZSTD_f_zstd1 would mean "accept *only* zstd frames". | |
|
1114 | * Since meaning is a little different, another option could be to define different enums for compression and decompression. | |
|
1115 | * This question could be kept for later, when there are actually multiple formats to support, | |
|
1116 | * but there is also the question of pinning enum values, and pinning value `0` is especially important */ | |
|
1117 | 1148 | ZSTD_f_zstd1 = 0, /* zstd frame format, specified in zstd_compression_format.md (default) */ |
|
1118 |
ZSTD_f_zstd1_magicless = 1 |
|
|
1149 | ZSTD_f_zstd1_magicless = 1 /* Variant of zstd frame format, without initial 4-bytes magic number. | |
|
1119 | 1150 | * Useful to save 4 bytes per generated frame. |
|
1120 | 1151 | * Decoder cannot recognise automatically this format, requiring this instruction. */ |
|
1121 | 1152 | } ZSTD_format_e; |
@@ -1126,7 +1157,7 typedef enum { | |||
|
1126 | 1157 | * to evolve and should be considered only in the context of extremely |
|
1127 | 1158 | * advanced performance tuning. |
|
1128 | 1159 | * |
|
1129 |
* Zstd currently supports the use of a CDict in t |
|
|
1160 | * Zstd currently supports the use of a CDict in three ways: | |
|
1130 | 1161 | * |
|
1131 | 1162 | * - The contents of the CDict can be copied into the working context. This |
|
1132 | 1163 | * means that the compression can search both the dictionary and input |
@@ -1142,6 +1173,12 typedef enum { | |||
|
1142 | 1173 | * working context's tables can be reused). For small inputs, this can be |
|
1143 | 1174 | * faster than copying the CDict's tables. |
|
1144 | 1175 | * |
|
1176 | * - The CDict's tables are not used at all, and instead we use the working | |
|
1177 | * context alone to reload the dictionary and use params based on the source | |
|
1178 | * size. See ZSTD_compress_insertDictionary() and ZSTD_compress_usingDict(). | |
|
1179 | * This method is effective when the dictionary sizes are very small relative | |
|
1180 | * to the input size, and the input size is fairly large to begin with. | |
|
1181 | * | |
|
1145 | 1182 | * Zstd has a simple internal heuristic that selects which strategy to use |
|
1146 | 1183 | * at the beginning of a compression. However, if experimentation shows that |
|
1147 | 1184 | * Zstd is making poor choices, it is possible to override that choice with |
@@ -1150,6 +1187,7 typedef enum { | |||
|
1150 | 1187 | ZSTD_dictDefaultAttach = 0, /* Use the default heuristic. */ |
|
1151 | 1188 | ZSTD_dictForceAttach = 1, /* Never copy the dictionary. */ |
|
1152 | 1189 | ZSTD_dictForceCopy = 2, /* Always copy the dictionary. */ |
|
1190 | ZSTD_dictForceLoad = 3 /* Always reload the dictionary */ | |
|
1153 | 1191 | } ZSTD_dictAttachPref_e; |
|
1154 | 1192 | |
|
1155 | 1193 | typedef enum { |
@@ -1158,7 +1196,7 typedef enum { | |||
|
1158 | 1196 | * levels will be compressed. */ |
|
1159 | 1197 | ZSTD_lcm_huffman = 1, /**< Always attempt Huffman compression. Uncompressed literals will still be |
|
1160 | 1198 | * emitted if Huffman compression is not profitable. */ |
|
1161 |
ZSTD_lcm_uncompressed = 2 |
|
|
1199 | ZSTD_lcm_uncompressed = 2 /**< Always emit uncompressed literals. */ | |
|
1162 | 1200 | } ZSTD_literalCompressionMode_e; |
|
1163 | 1201 | |
|
1164 | 1202 | |
@@ -1210,20 +1248,38 ZSTDLIB_API unsigned long long ZSTD_deco | |||
|
1210 | 1248 | * or an error code (if srcSize is too small) */ |
|
1211 | 1249 | ZSTDLIB_API size_t ZSTD_frameHeaderSize(const void* src, size_t srcSize); |
|
1212 | 1250 | |
|
1251 | /*! ZSTD_getSequences() : | |
|
1252 | * Extract sequences from the sequence store | |
|
1253 | * zc can be used to insert custom compression params. | |
|
1254 | * This function invokes ZSTD_compress2 | |
|
1255 | * @return : number of sequences extracted | |
|
1256 | */ | |
|
1257 | ZSTDLIB_API size_t ZSTD_getSequences(ZSTD_CCtx* zc, ZSTD_Sequence* outSeqs, | |
|
1258 | size_t outSeqsSize, const void* src, size_t srcSize); | |
|
1259 | ||
|
1213 | 1260 | |
|
1214 | 1261 | /*************************************** |
|
1215 | 1262 | * Memory management |
|
1216 | 1263 | ***************************************/ |
|
1217 | 1264 | |
|
1218 | 1265 | /*! ZSTD_estimate*() : |
|
1219 | * These functions make it possible to estimate memory usage | |
|
1220 |
* |
|
|
1221 | * ZSTD_estimateCCtxSize() will provide a budget large enough for any compression level up to selected one. | |
|
1222 | * It will also consider src size to be arbitrarily "large", which is worst case. | |
|
1223 | * If srcSize is known to always be small, ZSTD_estimateCCtxSize_usingCParams() can provide a tighter estimation. | |
|
1224 | * ZSTD_estimateCCtxSize_usingCParams() can be used in tandem with ZSTD_getCParams() to create cParams from compressionLevel. | |
|
1225 | * ZSTD_estimateCCtxSize_usingCCtxParams() can be used in tandem with ZSTD_CCtxParams_setParameter(). Only single-threaded compression is supported. This function will return an error code if ZSTD_c_nbWorkers is >= 1. | |
|
1226 | * Note : CCtx size estimation is only correct for single-threaded compression. */ | |
|
1266 | * These functions make it possible to estimate memory usage of a future | |
|
1267 | * {D,C}Ctx, before its creation. | |
|
1268 | * | |
|
1269 | * ZSTD_estimateCCtxSize() will provide a budget large enough for any | |
|
1270 | * compression level up to selected one. Unlike ZSTD_estimateCStreamSize*(), | |
|
1271 | * this estimate does not include space for a window buffer, so this estimate | |
|
1272 | * is guaranteed to be enough for single-shot compressions, but not streaming | |
|
1273 | * compressions. It will however assume the input may be arbitrarily large, | |
|
1274 | * which is the worst case. If srcSize is known to always be small, | |
|
1275 | * ZSTD_estimateCCtxSize_usingCParams() can provide a tighter estimation. | |
|
1276 | * ZSTD_estimateCCtxSize_usingCParams() can be used in tandem with | |
|
1277 | * ZSTD_getCParams() to create cParams from compressionLevel. | |
|
1278 | * ZSTD_estimateCCtxSize_usingCCtxParams() can be used in tandem with | |
|
1279 | * ZSTD_CCtxParams_setParameter(). | |
|
1280 | * | |
|
1281 | * Note: only single-threaded compression is supported. This function will | |
|
1282 | * return an error code if ZSTD_c_nbWorkers is >= 1. */ | |
|
1227 | 1283 | ZSTDLIB_API size_t ZSTD_estimateCCtxSize(int compressionLevel); |
|
1228 | 1284 | ZSTDLIB_API size_t ZSTD_estimateCCtxSize_usingCParams(ZSTD_compressionParameters cParams); |
|
1229 | 1285 | ZSTDLIB_API size_t ZSTD_estimateCCtxSize_usingCCtxParams(const ZSTD_CCtx_params* params); |
@@ -1334,7 +1390,8 ZSTDLIB_API ZSTD_DDict* ZSTD_createDDict | |||
|
1334 | 1390 | * Create a digested dictionary for compression |
|
1335 | 1391 | * Dictionary content is just referenced, not duplicated. |
|
1336 | 1392 | * As a consequence, `dictBuffer` **must** outlive CDict, |
|
1337 |
* and its content must remain unmodified throughout the lifetime of CDict. |
|
|
1393 | * and its content must remain unmodified throughout the lifetime of CDict. | |
|
1394 | * note: equivalent to ZSTD_createCDict_advanced(), with dictLoadMethod==ZSTD_dlm_byRef */ | |
|
1338 | 1395 | ZSTDLIB_API ZSTD_CDict* ZSTD_createCDict_byReference(const void* dictBuffer, size_t dictSize, int compressionLevel); |
|
1339 | 1396 | |
|
1340 | 1397 | /*! ZSTD_getCParams() : |
@@ -1361,7 +1418,9 ZSTDLIB_API size_t ZSTD_checkCParams(ZST | |||
|
1361 | 1418 | ZSTDLIB_API ZSTD_compressionParameters ZSTD_adjustCParams(ZSTD_compressionParameters cPar, unsigned long long srcSize, size_t dictSize); |
|
1362 | 1419 | |
|
1363 | 1420 | /*! ZSTD_compress_advanced() : |
|
1364 | * Same as ZSTD_compress_usingDict(), with fine-tune control over compression parameters (by structure) */ | |
|
1421 | * Note : this function is now DEPRECATED. | |
|
1422 | * It can be replaced by ZSTD_compress2(), in combination with ZSTD_CCtx_setParameter() and other parameter setters. | |
|
1423 | * This prototype will be marked as deprecated and generate compilation warning on reaching v1.5.x */ | |
|
1365 | 1424 | ZSTDLIB_API size_t ZSTD_compress_advanced(ZSTD_CCtx* cctx, |
|
1366 | 1425 | void* dst, size_t dstCapacity, |
|
1367 | 1426 | const void* src, size_t srcSize, |
@@ -1369,7 +1428,9 ZSTDLIB_API size_t ZSTD_compress_advance | |||
|
1369 | 1428 | ZSTD_parameters params); |
|
1370 | 1429 | |
|
1371 | 1430 | /*! ZSTD_compress_usingCDict_advanced() : |
|
1372 | * Same as ZSTD_compress_usingCDict(), with fine-tune control over frame parameters */ | |
|
1431 | * Note : this function is now REDUNDANT. | |
|
1432 | * It can be replaced by ZSTD_compress2(), in combination with ZSTD_CCtx_loadDictionary() and other parameter setters. | |
|
1433 | * This prototype will be marked as deprecated and generate compilation warning in some future version */ | |
|
1373 | 1434 | ZSTDLIB_API size_t ZSTD_compress_usingCDict_advanced(ZSTD_CCtx* cctx, |
|
1374 | 1435 | void* dst, size_t dstCapacity, |
|
1375 | 1436 | const void* src, size_t srcSize, |
@@ -1441,6 +1502,12 ZSTDLIB_API size_t ZSTD_CCtx_refPrefix_a | |||
|
1441 | 1502 | * There is no guarantee on compressed block size (default:0) */ |
|
1442 | 1503 | #define ZSTD_c_targetCBlockSize ZSTD_c_experimentalParam6 |
|
1443 | 1504 | |
|
1505 | /* User's best guess of source size. | |
|
1506 | * Hint is not valid when srcSizeHint == 0. | |
|
1507 | * There is no guarantee that hint is close to actual source size, | |
|
1508 | * but compression ratio may regress significantly if guess considerably underestimates */ | |
|
1509 | #define ZSTD_c_srcSizeHint ZSTD_c_experimentalParam7 | |
|
1510 | ||
|
1444 | 1511 | /*! ZSTD_CCtx_getParameter() : |
|
1445 | 1512 | * Get the requested compression parameter value, selected by enum ZSTD_cParameter, |
|
1446 | 1513 | * and store it into int* value. |
@@ -1613,8 +1680,13 ZSTDLIB_API size_t ZSTD_decompressStream | |||
|
1613 | 1680 | * pledgedSrcSize must be correct. If it is not known at init time, use |
|
1614 | 1681 | * ZSTD_CONTENTSIZE_UNKNOWN. Note that, for compatibility with older programs, |
|
1615 | 1682 | * "0" also disables frame content size field. It may be enabled in the future. |
|
1683 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1616 | 1684 | */ |
|
1617 | ZSTDLIB_API size_t ZSTD_initCStream_srcSize(ZSTD_CStream* zcs, int compressionLevel, unsigned long long pledgedSrcSize); | |
|
1685 | ZSTDLIB_API size_t | |
|
1686 | ZSTD_initCStream_srcSize(ZSTD_CStream* zcs, | |
|
1687 | int compressionLevel, | |
|
1688 | unsigned long long pledgedSrcSize); | |
|
1689 | ||
|
1618 | 1690 | /**! ZSTD_initCStream_usingDict() : |
|
1619 | 1691 | * This function is deprecated, and is equivalent to: |
|
1620 | 1692 | * ZSTD_CCtx_reset(zcs, ZSTD_reset_session_only); |
@@ -1623,42 +1695,66 ZSTDLIB_API size_t ZSTD_initCStream_srcS | |||
|
1623 | 1695 | * |
|
1624 | 1696 | * Creates of an internal CDict (incompatible with static CCtx), except if |
|
1625 | 1697 | * dict == NULL or dictSize < 8, in which case no dict is used. |
|
1626 |
* Note: dict is loaded with ZSTD_d |
|
|
1698 | * Note: dict is loaded with ZSTD_dct_auto (treated as a full zstd dictionary if | |
|
1627 | 1699 | * it begins with ZSTD_MAGIC_DICTIONARY, else as raw content) and ZSTD_dlm_byCopy. |
|
1700 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1628 | 1701 | */ |
|
1629 | ZSTDLIB_API size_t ZSTD_initCStream_usingDict(ZSTD_CStream* zcs, const void* dict, size_t dictSize, int compressionLevel); | |
|
1702 | ZSTDLIB_API size_t | |
|
1703 | ZSTD_initCStream_usingDict(ZSTD_CStream* zcs, | |
|
1704 | const void* dict, size_t dictSize, | |
|
1705 | int compressionLevel); | |
|
1706 | ||
|
1630 | 1707 | /**! ZSTD_initCStream_advanced() : |
|
1631 | 1708 | * This function is deprecated, and is approximately equivalent to: |
|
1632 | 1709 | * ZSTD_CCtx_reset(zcs, ZSTD_reset_session_only); |
|
1633 |
* |
|
|
1710 | * // Pseudocode: Set each zstd parameter and leave the rest as-is. | |
|
1711 | * for ((param, value) : params) { | |
|
1712 | * ZSTD_CCtx_setParameter(zcs, param, value); | |
|
1713 | * } | |
|
1634 | 1714 | * ZSTD_CCtx_setPledgedSrcSize(zcs, pledgedSrcSize); |
|
1635 | 1715 | * ZSTD_CCtx_loadDictionary(zcs, dict, dictSize); |
|
1636 | 1716 | * |
|
1637 | * pledgedSrcSize must be correct. If srcSize is not known at init time, use | |
|
1638 | * value ZSTD_CONTENTSIZE_UNKNOWN. dict is loaded with ZSTD_dm_auto and ZSTD_dlm_byCopy. | |
|
1717 | * dict is loaded with ZSTD_dct_auto and ZSTD_dlm_byCopy. | |
|
1718 | * pledgedSrcSize must be correct. | |
|
1719 | * If srcSize is not known at init time, use value ZSTD_CONTENTSIZE_UNKNOWN. | |
|
1720 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1639 | 1721 | */ |
|
1640 | ZSTDLIB_API size_t ZSTD_initCStream_advanced(ZSTD_CStream* zcs, const void* dict, size_t dictSize, | |
|
1641 | ZSTD_parameters params, unsigned long long pledgedSrcSize); | |
|
1722 | ZSTDLIB_API size_t | |
|
1723 | ZSTD_initCStream_advanced(ZSTD_CStream* zcs, | |
|
1724 | const void* dict, size_t dictSize, | |
|
1725 | ZSTD_parameters params, | |
|
1726 | unsigned long long pledgedSrcSize); | |
|
1727 | ||
|
1642 | 1728 | /**! ZSTD_initCStream_usingCDict() : |
|
1643 | 1729 | * This function is deprecated, and equivalent to: |
|
1644 | 1730 | * ZSTD_CCtx_reset(zcs, ZSTD_reset_session_only); |
|
1645 | 1731 | * ZSTD_CCtx_refCDict(zcs, cdict); |
|
1646 | 1732 | * |
|
1647 | 1733 | * note : cdict will just be referenced, and must outlive compression session |
|
1734 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1648 | 1735 | */ |
|
1649 | 1736 | ZSTDLIB_API size_t ZSTD_initCStream_usingCDict(ZSTD_CStream* zcs, const ZSTD_CDict* cdict); |
|
1737 | ||
|
1650 | 1738 | /**! ZSTD_initCStream_usingCDict_advanced() : |
|
1651 |
* This function is |
|
|
1739 | * This function is DEPRECATED, and is approximately equivalent to: | |
|
1652 | 1740 | * ZSTD_CCtx_reset(zcs, ZSTD_reset_session_only); |
|
1653 |
* |
|
|
1741 | * // Pseudocode: Set each zstd frame parameter and leave the rest as-is. | |
|
1742 | * for ((fParam, value) : fParams) { | |
|
1743 | * ZSTD_CCtx_setParameter(zcs, fParam, value); | |
|
1744 | * } | |
|
1654 | 1745 | * ZSTD_CCtx_setPledgedSrcSize(zcs, pledgedSrcSize); |
|
1655 | 1746 | * ZSTD_CCtx_refCDict(zcs, cdict); |
|
1656 | 1747 | * |
|
1657 | 1748 | * same as ZSTD_initCStream_usingCDict(), with control over frame parameters. |
|
1658 | 1749 | * pledgedSrcSize must be correct. If srcSize is not known at init time, use |
|
1659 | 1750 | * value ZSTD_CONTENTSIZE_UNKNOWN. |
|
1751 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1660 | 1752 | */ |
|
1661 | ZSTDLIB_API size_t ZSTD_initCStream_usingCDict_advanced(ZSTD_CStream* zcs, const ZSTD_CDict* cdict, ZSTD_frameParameters fParams, unsigned long long pledgedSrcSize); | |
|
1753 | ZSTDLIB_API size_t | |
|
1754 | ZSTD_initCStream_usingCDict_advanced(ZSTD_CStream* zcs, | |
|
1755 | const ZSTD_CDict* cdict, | |
|
1756 | ZSTD_frameParameters fParams, | |
|
1757 | unsigned long long pledgedSrcSize); | |
|
1662 | 1758 | |
|
1663 | 1759 | /*! ZSTD_resetCStream() : |
|
1664 | 1760 | * This function is deprecated, and is equivalent to: |
@@ -1673,6 +1769,7 ZSTDLIB_API size_t ZSTD_initCStream_usin | |||
|
1673 | 1769 | * For the time being, pledgedSrcSize==0 is interpreted as "srcSize unknown" for compatibility with older programs, |
|
1674 | 1770 | * but it will change to mean "empty" in future version, so use macro ZSTD_CONTENTSIZE_UNKNOWN instead. |
|
1675 | 1771 | * @return : 0, or an error code (which can be tested using ZSTD_isError()) |
|
1772 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1676 | 1773 | */ |
|
1677 | 1774 | ZSTDLIB_API size_t ZSTD_resetCStream(ZSTD_CStream* zcs, unsigned long long pledgedSrcSize); |
|
1678 | 1775 | |
@@ -1718,8 +1815,10 ZSTDLIB_API size_t ZSTD_toFlushNow(ZSTD_ | |||
|
1718 | 1815 | * ZSTD_DCtx_loadDictionary(zds, dict, dictSize); |
|
1719 | 1816 | * |
|
1720 | 1817 | * note: no dictionary will be used if dict == NULL or dictSize < 8 |
|
1818 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1721 | 1819 | */ |
|
1722 | 1820 | ZSTDLIB_API size_t ZSTD_initDStream_usingDict(ZSTD_DStream* zds, const void* dict, size_t dictSize); |
|
1821 | ||
|
1723 | 1822 | /** |
|
1724 | 1823 | * This function is deprecated, and is equivalent to: |
|
1725 | 1824 | * |
@@ -1727,14 +1826,17 ZSTDLIB_API size_t ZSTD_initDStream_usin | |||
|
1727 | 1826 | * ZSTD_DCtx_refDDict(zds, ddict); |
|
1728 | 1827 | * |
|
1729 | 1828 | * note : ddict is referenced, it must outlive decompression session |
|
1829 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1730 | 1830 | */ |
|
1731 | 1831 | ZSTDLIB_API size_t ZSTD_initDStream_usingDDict(ZSTD_DStream* zds, const ZSTD_DDict* ddict); |
|
1832 | ||
|
1732 | 1833 | /** |
|
1733 | 1834 | * This function is deprecated, and is equivalent to: |
|
1734 | 1835 | * |
|
1735 | 1836 | * ZSTD_DCtx_reset(zds, ZSTD_reset_session_only); |
|
1736 | 1837 | * |
|
1737 | 1838 | * re-use decompression parameters from previous init; saves dictionary loading |
|
1839 | * Note : this prototype will be marked as deprecated and generate compilation warnings on reaching v1.5.x | |
|
1738 | 1840 | */ |
|
1739 | 1841 | ZSTDLIB_API size_t ZSTD_resetDStream(ZSTD_DStream* zds); |
|
1740 | 1842 | |
@@ -1908,7 +2010,7 ZSTDLIB_API ZSTD_nextInputType_e ZSTD_ne | |||
|
1908 | 2010 | |
|
1909 | 2011 | /*! |
|
1910 | 2012 | Block functions produce and decode raw zstd blocks, without frame metadata. |
|
1911 |
Frame metadata cost is typically ~1 |
|
|
2013 | Frame metadata cost is typically ~12 bytes, which can be non-negligible for very small blocks (< 100 bytes). | |
|
1912 | 2014 | But users will have to take in charge needed metadata to regenerate data, such as compressed and content sizes. |
|
1913 | 2015 | |
|
1914 | 2016 | A few rules to respect : |
General Comments 0
You need to be logged in to leave comments.
Login now