@@ -1,2 +1,5 b'' | |||
|
1 | graft c-ext | |
|
1 | 2 | graft zstd |
|
2 | 3 | include make_cffi.py |
|
4 | include setup_zstd.py | |
|
5 | include zstd.c |
@@ -1,63 +1,90 b'' | |||
|
1 | 1 | Version History |
|
2 | 2 | =============== |
|
3 | 3 | |
|
4 | 0.6.0 (released 2017-01-14) | |
|
5 | --------------------------- | |
|
6 | ||
|
7 | * Support for legacy zstd protocols (build-time opt-in feature). | |
|
8 | * Automation improvements: testing against Python 3.6, the latest | |
|
9 | versions of Tox, and more deterministic AppVeyor behavior. | |
|
10 | * CFFI "parser" improved to use a compiler preprocessor instead of rewriting | |
|
11 | source code manually. | |
|
12 | * Vendored version of zstd updated to 1.1.2. | |
|
13 | * Documentation improvements. | |
|
14 | * Introduce a bench.py script for performing (crude) benchmarks. | |
|
15 | * ZSTD_CCtx instances are now reused across multiple compress() operations. | |
|
16 | * ZstdCompressor.write_to() now has a flush() method. | |
|
17 | * ZstdCompressor.compressobj()'s flush() method now accepts an argument to | |
|
18 | flush a block (as opposed to ending the stream). | |
|
19 | * Disallow compress(b'') when writing content sizes by default (issue #11). | |
|
20 | ||
|
21 | 0.5.2 (released 2016-11-12) | |
|
22 | --------------------------- | |
|
23 | ||
|
24 | * More packaging fixes for the source distribution. | |
|
25 | ||
|
26 | 0.5.1 (released 2016-11-12) | |
|
27 | --------------------------- | |
|
28 | ||
|
29 | * setup_zstd.py is included in the source distribution. | |
|
30 | ||
|
4 | 31 | 0.5.0 (released 2016-11-10) |
|
5 | 32 | --------------------------- |
|
6 | 33 | |
|
7 | 34 | * Vendored version of zstd updated to 1.1.1. |
|
8 | 35 | * Continuous integration for Python 3.6 and 3.7 |
|
9 | 36 | * Continuous integration for Conda |
|
10 | 37 | * Added compression and decompression APIs providing similar interfaces |
|
11 | 38 | to the standard library ``zlib`` and ``bz2`` modules. This allows |
|
12 | 39 | coding to a common interface. |
|
13 | 40 | * ``zstd.__version__`` is now defined. |
|
14 | 41 | * ``read_from()`` on various APIs now accepts objects implementing the buffer |
|
15 | 42 | protocol. |
|
16 | 43 | * ``read_from()`` has gained a ``skip_bytes`` argument. This allows callers |
|
17 | 44 | to pass in an existing buffer with a header without having to create a |
|
18 | 45 | slice or a new object. |
|
19 | 46 | * Implemented ``ZstdCompressionDict.as_bytes()``. |
|
20 | 47 | * Python's memory allocator is now used instead of ``malloc()``. |
|
21 | 48 | * Low-level zstd data structures are reused in more instances, cutting down |
|
22 | 49 | on overhead for certain operations. |
|
23 | 50 | * ``distutils`` boilerplate for obtaining an ``Extension`` instance |
|
24 | 51 | has now been refactored into a standalone ``setup_zstd.py`` file. This |
|
25 | 52 | allows other projects with ``setup.py`` files to reuse the |
|
26 | 53 | ``distutils`` code for this project without copying code. |
|
27 | 54 | * The monolithic ``zstd.c`` file has been split into a header file defining |
|
28 | 55 | types and separate ``.c`` source files for the implementation. |
|
29 | 56 | |
|
30 | 57 | History of the Project |
|
31 | 58 | ====================== |
|
32 | 59 | |
|
33 | 60 | 2016-08-31 - Zstandard 1.0.0 is released and Gregory starts hacking on a |
|
34 | 61 | Python extension for use by the Mercurial project. A very hacky prototype |
|
35 | 62 | is sent to the mercurial-devel list for RFC. |
|
36 | 63 | |
|
37 | 64 | 2016-09-03 - Most functionality from Zstandard C API implemented. Source |
|
38 | 65 | code published on https://github.com/indygreg/python-zstandard. Travis-CI |
|
39 | 66 | automation configured. 0.0.1 release on PyPI. |
|
40 | 67 | |
|
41 | 68 | 2016-09-05 - After the API was rounded out a bit and support for Python |
|
42 | 69 | 2.6 and 2.7 was added, version 0.1 was released to PyPI. |
|
43 | 70 | |
|
44 | 71 | 2016-09-05 - After the compressor and decompressor APIs were changed, 0.2 |
|
45 | 72 | was released to PyPI. |
|
46 | 73 | |
|
47 | 74 | 2016-09-10 - 0.3 is released with a bunch of new features. ZstdCompressor |
|
48 | 75 | now accepts arguments controlling frame parameters. The source size can now |
|
49 | 76 | be declared when performing streaming compression. ZstdDecompressor.decompress() |
|
50 | 77 | is implemented. Compression dictionaries are now cached when using the simple |
|
51 | 78 | compression and decompression APIs. Memory size APIs added. |
|
52 | 79 | ZstdCompressor.read_from() and ZstdDecompressor.read_from() have been |
|
53 | 80 | implemented. This rounds out the major compression/decompression APIs planned |
|
54 | 81 | by the author. |
|
55 | 82 | |
|
56 | 83 | 2016-10-02 - 0.3.3 is released with a bug fix for read_from not fully |
|
57 | 84 | decoding a zstd frame (issue #2). |
|
58 | 85 | |
|
59 | 86 | 2016-10-02 - 0.4.0 is released with zstd 1.1.0, support for custom read and |
|
60 | 87 | write buffer sizes, and a few bug fixes involving failure to read/write |
|
61 | 88 | all data when buffer sizes were too small to hold remaining data. |
|
62 | 89 | |
|
63 | 90 | 2016-11-10 - 0.5.0 is released with zstd 1.1.1 and other enhancements. |
@@ -1,776 +1,829 b'' | |||
|
1 | 1 | ================ |
|
2 | 2 | python-zstandard |
|
3 | 3 | ================ |
|
4 | 4 | |
|
5 | This project provides | |
|
6 | `Zstandard <http://www.zstd.net>`_ compression library. | |
|
5 | This project provides Python bindings for interfacing with the | |
|
6 | `Zstandard <http://www.zstd.net>`_ compression library. A C extension | |
|
7 | and CFFI interface are provided. | |
|
7 | 8 | |
|
8 | 9 | The primary goal of the extension is to provide a Pythonic interface to |
|
9 | 10 | the underlying C API. This means exposing most of the features and flexibility |
|
10 | 11 | of the C API without sacrificing the usability or safety that Python provides. |
|
11 | 12 | |
|
13 | The canonical home for this project is | |
|
14 | https://github.com/indygreg/python-zstandard. | |
|
15 | ||
|
12 | 16 | | |ci-status| |win-ci-status| |
|
13 | 17 | |
|
14 | 18 | State of Project |
|
15 | 19 | ================ |
|
16 | 20 | |
|
17 | 21 | The project is officially in beta state. The author is reasonably satisfied |
|
18 | 22 | with the current API and that functionality works as advertised. There |
|
19 | 23 | may be some backwards incompatible changes before 1.0, though the author |
|
20 | 24 | does not intend to make any major changes to the Python API. |
|
21 | 25 | |
|
22 | 26 | There is continuous integration for Python versions 2.6, 2.7, and 3.3+ |
|
23 | 27 | on Linux x86_64 and Windows x86 and x86_64. The author is reasonably |
|
24 | 28 | confident the extension is stable and works as advertised on these |
|
25 | 29 | platforms. |
|
26 | 30 | |
|
27 | 31 | Expected Changes |
|
28 | 32 | ---------------- |
|
29 | 33 | |
|
30 | 34 | The author is reasonably confident in the current state of what's |
|
31 | 35 | implemented on the ``ZstdCompressor`` and ``ZstdDecompressor`` types. |
|
32 | 36 | Those APIs likely won't change significantly. Some low-level behavior |
|
33 | 37 | (such as naming and types expected by arguments) may change. |
|
34 | 38 | |
|
35 | 39 | There will likely be arguments added to control the input and output |
|
36 | 40 | buffer sizes (currently, certain operations read and write in chunk |
|
37 | 41 | sizes using zstd's preferred defaults). |
|
38 | 42 | |
|
39 | 43 | There should be an API that accepts an object that conforms to the buffer |
|
40 | 44 | interface and returns an iterator over compressed or decompressed output. |
|
41 | 45 | |
|
42 | 46 | The author is on the fence as to whether to support the extremely |
|
43 | 47 | low level compression and decompression APIs. It could be useful to |
|
44 | 48 | support compression without the framing headers. But the author doesn't |
|
45 | 49 | believe it is a high priority at this time. |
|
46 | 50 | |
|
47 | 51 | The CFFI bindings are half-baked and need to be finished. |
|
48 | 52 | |
|
49 | 53 | Requirements |
|
50 | 54 | ============ |
|
51 | 55 | |
|
52 | 56 | This extension is designed to run with Python 2.6, 2.7, 3.3, 3.4, and 3.5 |
|
53 | 57 | on common platforms (Linux, Windows, and OS X). Only x86_64 is currently |
|
54 | 58 | well-tested as an architecture. |
|
55 | 59 | |
|
56 | 60 | Installing |
|
57 | 61 | ========== |
|
58 | 62 | |
|
59 | 63 | This package is uploaded to PyPI at https://pypi.python.org/pypi/zstandard. |
|
60 | 64 | So, to install this package:: |
|
61 | 65 | |
|
62 | 66 | $ pip install zstandard |
|
63 | 67 | |
|
64 | 68 | Binary wheels are made available for some platforms. If you need to |
|
65 | 69 | install from a source distribution, all you should need is a working C |
|
66 | 70 | compiler and the Python development headers/libraries. On many Linux |
|
67 | 71 | distributions, you can install a ``python-dev`` or ``python-devel`` |
|
68 | 72 | package to provide these dependencies. |
|
69 | 73 | |
|
70 | 74 | Packages are also uploaded to Anaconda Cloud at |
|
71 | 75 | https://anaconda.org/indygreg/zstandard. See that URL for how to install |
|
72 | 76 | this package with ``conda``. |
|
73 | 77 | |
|
74 | 78 | Performance |
|
75 | 79 | =========== |
|
76 | 80 | |
|
77 | 81 | Very crude and non-scientific benchmarking (most benchmarks fall in this |
|
78 | 82 | category because proper benchmarking is hard) shows that the Python bindings |
|
79 | 83 | perform within 10% of the native C implementation. |
|
80 | 84 | |
|
81 | 85 | The following table compares the performance of compressing and decompressing |
|
82 | 86 | a 1.1 GB tar file comprised of the files in a Firefox source checkout. Values |
|
83 | 87 | obtained with the ``zstd`` program are on the left. The remaining columns detail |
|
84 | 88 | performance of various compression APIs in the Python bindings. |
|
85 | 89 | |
|
86 | 90 | +-------+-----------------+-----------------+-----------------+---------------+ |
|
87 | 91 | | Level | Native | Simple | Stream In | Stream Out | |
|
88 | 92 | | | Comp / Decomp | Comp / Decomp | Comp / Decomp | Comp | |
|
89 | 93 | +=======+=================+=================+=================+===============+ |
|
90 | 94 | | 1 | 490 / 1338 MB/s | 458 / 1266 MB/s | 407 / 1156 MB/s | 405 MB/s | |
|
91 | 95 | +-------+-----------------+-----------------+-----------------+---------------+ |
|
92 | 96 | | 2 | 412 / 1288 MB/s | 381 / 1203 MB/s | 345 / 1128 MB/s | 349 MB/s | |
|
93 | 97 | +-------+-----------------+-----------------+-----------------+---------------+ |
|
94 | 98 | | 3 | 342 / 1312 MB/s | 319 / 1182 MB/s | 285 / 1165 MB/s | 287 MB/s | |
|
95 | 99 | +-------+-----------------+-----------------+-----------------+---------------+ |
|
96 | 100 | | 11 | 64 / 1506 MB/s | 66 / 1436 MB/s | 56 / 1342 MB/s | 57 MB/s | |
|
97 | 101 | +-------+-----------------+-----------------+-----------------+---------------+ |
|
98 | 102 | |
|
99 | 103 | Again, these are very unscientific. But they show that Python is capable of |
|
100 | 104 | compressing at several hundred MB/s and decompressing at over 1 GB/s. |
|
101 | 105 | |
|
102 | 106 | Comparison to Other Python Bindings |
|
103 | 107 | =================================== |
|
104 | 108 | |
|
105 | 109 | https://pypi.python.org/pypi/zstd is an alternative Python binding to |
|
106 | 110 | Zstandard. At the time this was written, the latest release of that |
|
107 | 111 | package (1.0.0.2) had the following significant differences from this package: |
|
108 | 112 | |
|
109 | 113 | * It only exposes the simple API for compression and decompression operations. |
|
110 | 114 | This extension exposes the streaming API, dictionary training, and more. |
|
111 | 115 | * It adds a custom framing header to compressed data and there is no way to |
|
112 | 116 | disable it. This means that data produced with that module cannot be used by |
|
113 | 117 | other Zstandard implementations. |
|
114 | 118 | |
|
115 | 119 | Bundling of Zstandard Source Code |
|
116 | 120 | ================================= |
|
117 | 121 | |
|
118 | 122 | The source repository for this project contains a vendored copy of the |
|
119 | 123 | Zstandard source code. This is done for a few reasons. |
|
120 | 124 | |
|
121 | 125 | First, Zstandard is relatively new and not yet widely available as a system |
|
122 | 126 | package. Providing a copy of the source code enables the Python C extension |
|
123 | 127 | to be compiled without requiring the user to obtain the Zstandard source code |
|
124 | 128 | separately. |
|
125 | 129 | |
|
126 | 130 | Second, Zstandard has both a stable *public* API and an *experimental* API. |
|
127 | 131 | The *experimental* API is actually quite useful (contains functionality for |
|
128 | 132 | training dictionaries for example), so it is something we wish to expose to |
|
129 | 133 | Python. However, the *experimental* API is only available via static linking. |
|
130 | 134 | Furthermore, the *experimental* API can change at any time. So, control over |
|
131 | 135 | the exact version of the Zstandard library linked against is important to |
|
132 | 136 | ensure known behavior. |
|
133 | 137 | |
|
134 | 138 | Instructions for Building and Testing |
|
135 | 139 | ===================================== |
|
136 | 140 | |
|
137 | 141 | Once you have the source code, the extension can be built via setup.py:: |
|
138 | 142 | |
|
139 | 143 | $ python setup.py build_ext |
|
140 | 144 | |
|
141 | 145 | We recommend testing with ``nose``:: |
|
142 | 146 | |
|
143 | 147 | $ nosetests |
|
144 | 148 | |
|
145 | 149 | A Tox configuration is present to test against multiple Python versions:: |
|
146 | 150 | |
|
147 | 151 | $ tox |
|
148 | 152 | |
|
149 | 153 | Tests use the ``hypothesis`` Python package to perform fuzzing. If you |
|
150 | 154 | don't have it, those tests won't run. |
|
151 | 155 | |
|
152 | 156 | There is also an experimental CFFI module. You need the ``cffi`` Python |
|
153 | 157 | package installed to build and test that. |
|
154 | 158 | |
|
155 | 159 | To create a virtualenv with all development dependencies, do something |
|
156 | 160 | like the following:: |
|
157 | 161 | |
|
158 | 162 | # Python 2 |
|
159 | 163 | $ virtualenv venv |
|
160 | 164 | |
|
161 | 165 | # Python 3 |
|
162 | 166 | $ python3 -m venv venv |
|
163 | 167 | |
|
164 | 168 | $ source venv/bin/activate |
|
165 | 169 | $ pip install cffi hypothesis nose tox |
|
166 | 170 | |
|
167 | 171 | API |
|
168 | 172 | === |
|
169 | 173 | |
|
170 | 174 | The compiled C extension provides a ``zstd`` Python module. This module |
|
171 | 175 | exposes the following interfaces. |
|
172 | 176 | |
|
173 | 177 | ZstdCompressor |
|
174 | 178 | -------------- |
|
175 | 179 | |
|
176 | 180 | The ``ZstdCompressor`` class provides an interface for performing |
|
177 | 181 | compression operations. |
|
178 | 182 | |
|
179 | 183 | Each instance is associated with parameters that control compression |
|
180 | 184 | behavior. These come from the following named arguments (all optional): |
|
181 | 185 | |
|
182 | 186 | level |
|
183 | 187 | Integer compression level. Valid values are between 1 and 22. |
|
184 | 188 | dict_data |
|
185 | 189 | Compression dictionary to use. |
|
186 | 190 | |
|
187 | 191 | Note: When using dictionary data and ``compress()`` is called multiple |
|
188 | 192 | times, the ``CompressionParameters`` derived from an integer compression |
|
189 | 193 | ``level`` and the first compressed data's size will be reused for all |
|
190 | 194 | subsequent operations. This may not be desirable if source data size |
|
191 | 195 | varies significantly. |
|
192 | 196 | compression_params |
|
193 | 197 | A ``CompressionParameters`` instance (overrides the ``level`` value). |
|
194 | 198 | write_checksum |
|
195 | 199 | Whether a 4 byte checksum should be written with the compressed data. |
|
196 | 200 | Defaults to False. If True, the decompressor can verify that decompressed |
|
197 | 201 | data matches the original input data. |
|
198 | 202 | write_content_size |
|
199 | 203 | Whether the size of the uncompressed data will be written into the |
|
200 | 204 | header of compressed data. Defaults to False. The data will only be |
|
201 | 205 | written if the compressor knows the size of the input data. This is |
|
202 | 206 | likely not true for streaming compression. |
|
203 | 207 | write_dict_id |
|
204 | 208 | Whether to write the dictionary ID into the compressed data. |
|
205 | 209 | Defaults to True. The dictionary ID is only written if a dictionary |
|
206 | 210 | is being used. |
|
207 | 211 | |
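For illustration, a compressor combining several of these arguments might be
constructed as follows (the values shown are illustrative, not
recommendations)::

    import zstd

    cctx = zstd.ZstdCompressor(level=10,
                               write_checksum=True,
                               write_content_size=True)
    compressed = cctx.compress(b'data to compress')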
|
212 | Unless specified otherwise, assume that no two methods of ``ZstdCompressor`` | |
|
213 | instances can be called from multiple Python threads simultaneously. In other | |
|
214 | words, assume instances are not thread safe unless stated otherwise. | |
|
215 | ||
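If an instance must be shared across threads, one possible approach (an
assumption of this note, not an API guarantee) is to serialize access with a
lock::

    import threading
    import zstd

    lock = threading.Lock()
    cctx = zstd.ZstdCompressor()

    def compress_chunk(data):
        # Instances are assumed not thread safe, so guard every call.
        with lock:
            return cctx.compress(data)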
|
208 | 216 | Simple API |
|
209 | 217 | ^^^^^^^^^^ |
|
210 | 218 | |
|
211 | 219 | ``compress(data)`` compresses and returns data as a one-shot operation.:: |
|
212 | 220 | |
|
213 | cctx = zstd.ZsdCompressor() | |
|
221 | cctx = zstd.ZstdCompressor() | |
|
214 | 222 | compressed = cctx.compress(b'data to compress') |
|
215 | 223 | |
|
224 | Unless ``compression_params`` or ``dict_data`` are passed to the | |
|
225 | ``ZstdCompressor``, each invocation of ``compress()`` will calculate the | |
|
226 | optimal compression parameters for the configured compression ``level`` and | |
|
227 | input data size (some parameters are fine-tuned for small input sizes). | |
|
228 | ||
|
229 | If a compression dictionary is being used, the compression parameters | |
|
230 | determined from the first input's size will be reused for subsequent | |
|
231 | operations. | |
|
232 | ||
|
233 | There is currently a deficiency in zstd's C APIs that makes it difficult | |
|
234 | to round trip empty inputs when ``write_content_size=True``. Attempting | |
|
235 | this will raise a ``ValueError`` unless ``allow_empty=True`` is passed | |
|
236 | to ``compress()``. | |
|
237 | ||
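A minimal sketch of the empty-input behavior described above::

    import zstd

    cctx = zstd.ZstdCompressor(write_content_size=True)
    # cctx.compress(b'') would raise ValueError here.
    compressed = cctx.compress(b'', allow_empty=True)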
|
216 | 238 | Streaming Input API |
|
217 | 239 | ^^^^^^^^^^^^^^^^^^^ |
|
218 | 240 | |
|
219 | 241 | ``write_to(fh)`` (which behaves as a context manager) allows you to *stream* |
|
220 | 242 | data into a compressor.:: |
|
221 | 243 | |
|
222 | 244 | cctx = zstd.ZstdCompressor(level=10) |
|
223 | 245 | with cctx.write_to(fh) as compressor: |
|
224 | 246 | compressor.write(b'chunk 0') |
|
225 | 247 | compressor.write(b'chunk 1') |
|
226 | 248 | ... |
|
227 | 249 | |
|
228 | 250 | The argument to ``write_to()`` must have a ``write(data)`` method. As |
|
229 | compressed data is available, ``write()`` will be called with the com | |
|
251 | compressed data is available, ``write()`` will be called with the compressed | |
|
230 | 252 | data as its argument. Many common Python types implement ``write()``, including |
|
231 | 253 | open file handles and ``io.BytesIO``. |
|
232 | 254 | |
|
233 | 255 | ``write_to()`` returns an object representing a streaming compressor instance. |
|
234 | 256 | It **must** be used as a context manager. That object's ``write(data)`` method |
|
235 | 257 | is used to feed data into the compressor. |
|
236 | 258 | |
|
259 | A ``flush()`` method can be called to evict whatever data remains within the | |
|
260 | compressor's internal state into the output object. This may result in 0 or | |
|
261 | more ``write()`` calls to the output object. | |
|
262 | ||
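A sketch of interleaving ``write()`` and ``flush()`` calls (``io.BytesIO`` is
used as the destination to keep the sketch self-contained)::

    import io
    import zstd

    fh = io.BytesIO()
    cctx = zstd.ZstdCompressor()
    with cctx.write_to(fh) as compressor:
        compressor.write(b'chunk 0')
        # Evict buffered data into ``fh`` before writing more input.
        compressor.flush()
        compressor.write(b'chunk 1')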
|
237 | 263 | If the size of the data being fed to this streaming compressor is known, |
|
238 | 264 | you can declare it before compression begins:: |
|
239 | 265 | |
|
240 | 266 | cctx = zstd.ZstdCompressor() |
|
241 | 267 | with cctx.write_to(fh, size=data_len) as compressor: |
|
242 | 268 | compressor.write(chunk0) |
|
243 | 269 | compressor.write(chunk1) |
|
244 | 270 | ... |
|
245 | 271 | |
|
246 | 272 | Declaring the size of the source data allows compression parameters to |
|
247 | 273 | be tuned. And if ``write_content_size`` is used, it also results in the |
|
248 | 274 | content size being written into the frame header of the output data. |
|
249 | 275 | |
|
250 | 276 | The size of the chunks passed to the destination's ``write()`` can be specified:: |
|
251 | 277 | |
|
252 | 278 | cctx = zstd.ZstdCompressor() |
|
253 | 279 | with cctx.write_to(fh, write_size=32768) as compressor: |
|
254 | 280 | ... |
|
255 | 281 | |
|
256 | 282 | To see how much memory is being used by the streaming compressor:: |
|
257 | 283 | |
|
258 | 284 | cctx = zstd.ZstdCompressor() |
|
259 | 285 | with cctx.write_to(fh) as compressor: |
|
260 | 286 | ... |
|
261 | 287 | byte_size = compressor.memory_size() |
|
262 | 288 | |
|
263 | 289 | Streaming Output API |
|
264 | 290 | ^^^^^^^^^^^^^^^^^^^^ |
|
265 | 291 | |
|
266 | 292 | ``read_from(reader)`` provides a mechanism to stream data out of a compressor |
|
267 | 293 | as an iterator of data chunks.:: |
|
268 | 294 | |
|
269 | 295 | cctx = zstd.ZstdCompressor() |
|
270 | 296 | for chunk in cctx.read_from(fh): |
|
271 | 297 | # Do something with emitted data. |
|
272 | 298 | |
|
273 | 299 | ``read_from()`` accepts an object that has a ``read(size)`` method or conforms |
|
274 | 300 | to the buffer protocol. (``bytes`` and ``memoryview`` are 2 common types that |
|
275 | 301 | provide the buffer protocol.) |
|
276 | 302 | |
|
277 | 303 | Uncompressed data is fetched from the source either by calling ``read(size)`` |
|
278 | 304 | or by fetching a slice of data from the object directly (in the case where |
|
279 | 305 | the buffer protocol is being used). The returned iterator consists of chunks |
|
280 | 306 | of compressed data. |
|
281 | 307 | |
|
308 | If reading from the source via ``read()``, ``read()`` will be called until | |
|
309 | it raises or returns an empty bytes (``b''``). It is perfectly valid for | |
|
310 | the source to deliver fewer bytes than were requested by ``read(size)``. | |
|
311 | ||
|
282 | 312 | Like ``write_to()``, ``read_from()`` also accepts a ``size`` argument |
|
283 | 313 | declaring the size of the input stream:: |
|
284 | 314 | |
|
285 | 315 | cctx = zstd.ZstdCompressor() |
|
286 | 316 | for chunk in cctx.read_from(fh, size=some_int): |
|
287 | 317 | pass |
|
288 | 318 | |
|
289 | 319 | You can also control the size that data is ``read()`` from the source and |
|
290 | 320 | the ideal size of output chunks:: |
|
291 | 321 | |
|
292 | 322 | cctx = zstd.ZstdCompressor() |
|
293 | 323 | for chunk in cctx.read_from(fh, read_size=16384, write_size=8192): |
|
294 | 324 | pass |
|
295 | 325 | |
|
326 | Unlike ``write_to()``, ``read_from()`` does not give direct control over the | |
|
327 | sizes of chunks fed into the compressor. Instead, chunk sizes will be whatever | |
|
328 | the object being read from delivers. These will often be of a uniform size. | |
|
329 | ||
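Because ``read_from()`` accepts objects conforming to the buffer protocol, an
in-memory ``bytes`` value can be compressed directly (a minimal sketch)::

    import zstd

    cctx = zstd.ZstdCompressor()
    compressed = b''.join(cctx.read_from(b'data to compress'))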
|
296 | 330 | Stream Copying API |
|
297 | 331 | ^^^^^^^^^^^^^^^^^^ |
|
298 | 332 | |
|
299 | 333 | ``copy_stream(ifh, ofh)`` can be used to copy data between 2 streams while |
|
300 | 334 | compressing it.:: |
|
301 | 335 | |
|
302 | 336 | cctx = zstd.ZstdCompressor() |
|
303 | 337 | cctx.copy_stream(ifh, ofh) |
|
304 | 338 | |
|
305 | 339 | For example, say you wish to compress a file:: |
|
306 | 340 | |
|
307 | 341 | cctx = zstd.ZstdCompressor() |
|
308 | 342 | with open(input_path, 'rb') as ifh, open(output_path, 'wb') as ofh: |
|
309 | 343 | cctx.copy_stream(ifh, ofh) |
|
310 | 344 | |
|
311 | 345 | It is also possible to declare the size of the source stream:: |
|
312 | 346 | |
|
313 | 347 | cctx = zstd.ZstdCompressor() |
|
314 | 348 | cctx.copy_stream(ifh, ofh, size=len_of_input) |
|
315 | 349 | |
|
316 | 350 | You can also specify how large the chunks ``read()`` from and ``write()`` |
|
317 | 351 | to the streams should be:: |
|
318 | 352 | |
|
319 | 353 | cctx = zstd.ZstdCompressor() |
|
320 | 354 | cctx.copy_stream(ifh, ofh, read_size=32768, write_size=16384) |
|
321 | 355 | |
|
322 | 356 | The stream copier returns a 2-tuple of bytes read and written:: |
|
323 | 357 | |
|
324 | 358 | cctx = zstd.ZstdCompressor() |
|
325 | 359 | read_count, write_count = cctx.copy_stream(ifh, ofh) |
|
326 | 360 | |
|
327 | 361 | Compressor API |
|
328 | 362 | ^^^^^^^^^^^^^^ |
|
329 | 363 | |
|
330 | 364 | ``compressobj()`` returns an object that exposes ``compress(data)`` and |
|
331 | 365 | ``flush()`` methods. Each returns compressed data or an empty bytes. |
|
332 | 366 | |
|
333 | 367 | The purpose of ``compressobj()`` is to provide an API-compatible interface |
|
334 | 368 | with ``zlib.compressobj`` and ``bz2.BZ2Compressor``. This allows callers to |
|
335 | 369 | swap in different compressor objects while using the same API. |
|
336 | 370 | |
|
337 | Once ``flush()`` is called, the compressor will no longer accept new data | |
|
338 | to ``compress()``. ``flush()`` **must** be called to end the compression | |
|
339 | context. If not called, the returned data may be incomplete. | |
|
371 | ``flush()`` accepts an optional argument indicating how to end the stream. | |
|
372 | ``zstd.COMPRESSOBJ_FLUSH_FINISH`` (the default) ends the compression stream. | |
|
373 | Once this type of flush is performed, ``compress()`` and ``flush()`` can | |
|
374 | no longer be called. This type of flush **must** be called to end the | |
|
375 | compression context. If not called, returned data may be incomplete. | |
|
376 | ||
|
377 | A ``zstd.COMPRESSOBJ_FLUSH_BLOCK`` argument to ``flush()`` will flush a | |
|
378 | zstd block. Flushes of this type can be performed multiple times. The next | |
|
379 | call to ``compress()`` will begin a new zstd block. | |
|
340 | 380 | |
|
341 | 381 | Here is how this API should be used:: |
|
342 | 382 | |
|
343 | 383 | cctx = zstd.ZstdCompressor() |
|
344 | 384 | cobj = cctx.compressobj() |
|
345 | 385 | data = cobj.compress(b'raw input 0') |
|
346 | 386 | data = cobj.compress(b'raw input 1') |
|
347 | 387 | data = cobj.flush() |
|
348 | 388 | |
|
389 | Or to flush blocks:: | |
|
390 | ||
|
391 | cctx = zstd.ZstdCompressor() | |
|
392 | cobj = cctx.compressobj() | |
|
393 | data = cobj.compress(b'chunk in first block') | |
|
394 | data = cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK) | |
|
395 | data = cobj.compress(b'chunk in second block') | |
|
396 | data = cobj.flush() | |
|
397 | ||
|
349 | 398 | For best performance results, keep input chunks under 256KB. This avoids |
|
350 | 399 | extra allocations for a large output object. |
|
351 | 400 | |
|
352 | 401 | It is possible to declare the input size of the data that will be fed into |
|
353 | 402 | the compressor:: |
|
354 | 403 | |
|
355 | 404 | cctx = zstd.ZstdCompressor() |
|
356 | 405 | cobj = cctx.compressobj(size=6) |
|
357 | 406 | data = cobj.compress(b'foobar') |
|
358 | 407 | data = cobj.flush() |
|
359 | 408 | |
|
360 | 409 | ZstdDecompressor |
|
361 | 410 | ---------------- |
|
362 | 411 | |
|
363 | 412 | The ``ZstdDecompressor`` class provides an interface for performing |
|
364 | 413 | decompression. |
|
365 | 414 | |
|
366 | 415 | Each instance is associated with parameters that control decompression. These |
|
367 | 416 | come from the following named arguments (all optional): |
|
368 | 417 | |
|
369 | 418 | dict_data |
|
370 | 419 | Compression dictionary to use. |
|
371 | 420 | |
|
372 | 421 | The interface of this class is very similar to ``ZstdCompressor`` (by design). |
|
373 | 422 | |
|
423 | Unless specified otherwise, assume that no two methods of ``ZstdDecompressor`` | |
|
424 | instances can be called from multiple Python threads simultaneously. In other | |
|
425 | words, assume instances are not thread safe unless stated otherwise. | |
|
426 | ||
|
374 | 427 | Simple API |
|
375 | 428 | ^^^^^^^^^^ |
|
376 | 429 | |
|
377 | 430 | ``decompress(data)`` can be used to decompress an entire compressed zstd |
|
378 | 431 | frame in a single operation.:: |
|
379 | 432 | |
|
380 | 433 | dctx = zstd.ZstdDecompressor() |
|
381 | 434 | decompressed = dctx.decompress(data) |
|
382 | 435 | |
|
383 | 436 | By default, ``decompress(data)`` will only work on data written with the content |
|
384 | 437 | size encoded in its header. This can be achieved by creating a |
|
385 | 438 | ``ZstdCompressor`` with ``write_content_size=True``. If compressed data without |
|
386 | 439 | an embedded content size is seen, ``zstd.ZstdError`` will be raised. |
|
387 | 440 | |
|
388 | 441 | If the compressed data doesn't have its content size embedded within it, |
|
389 | 442 | decompression can be attempted by specifying the ``max_output_size`` |
|
390 | 443 | argument.:: |
|
391 | 444 | |
|
392 | 445 | dctx = zstd.ZstdDecompressor() |
|
393 | 446 | uncompressed = dctx.decompress(data, max_output_size=1048576) |
|
394 | 447 | |
|
395 | 448 | Ideally, ``max_output_size`` will be identical to the decompressed output |
|
396 | 449 | size. |
|
397 | 450 | |
|
398 | 451 | If ``max_output_size`` is too small to hold the decompressed data, |
|
399 | 452 | ``zstd.ZstdError`` will be raised. |
|
400 | 453 | |
|
401 | 454 | If ``max_output_size`` is larger than the decompressed data, the allocated |
|
402 | 455 | output buffer will be resized to only use the space required. |
|
403 | 456 | |
|
404 | 457 | Please note that an allocation of the requested ``max_output_size`` will be |
|
405 | 458 | performed every time the method is called. Setting to a very large value could |
|
406 | 459 | result in a lot of work for the memory allocator and may result in |
|
407 | 460 | ``MemoryError`` being raised if the allocation fails. |
|
408 | 461 | |
|
409 | 462 | If the exact size of decompressed data is unknown, it is **strongly** |
|
410 | 463 | recommended to use a streaming API. |
|
411 | 464 | |
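For example, a minimal sketch using the streaming output API described below
(``input.zst`` is a hypothetical file holding a zstd frame)::

    import zstd

    dctx = zstd.ZstdDecompressor()
    with open('input.zst', 'rb') as fh:
        for chunk in dctx.read_from(fh):
            pass  # Process each decompressed chunk here.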
|
412 | 465 | Streaming Input API |
|
413 | 466 | ^^^^^^^^^^^^^^^^^^^ |
|
414 | 467 | |
|
415 | 468 | ``write_to(fh)`` can be used to incrementally send compressed data to a |
|
416 | 469 | decompressor.:: |
|
417 | 470 | |
|
418 | 471 | dctx = zstd.ZstdDecompressor() |
|
419 | 472 | with dctx.write_to(fh) as decompressor: |
|
420 | 473 | decompressor.write(compressed_data) |
|
421 | 474 | |
|
422 | 475 | This behaves similarly to ``zstd.ZstdCompressor``: compressed data is written to |
|
423 | 476 | the decompressor by calling ``write(data)`` and decompressed output is written |
|
424 | 477 | to the output object by calling its ``write(data)`` method. |
|
425 | 478 | |
|
426 | 479 | The size of the chunks passed to the destination's ``write()`` can be specified:: |
|
427 | 480 | |
|
428 | 481 | dctx = zstd.ZstdDecompressor() |
|
429 | 482 | with dctx.write_to(fh, write_size=16384) as decompressor: |
|
430 | 483 | pass |
|
431 | 484 | |
|
432 | 485 | You can see how much memory is being used by the decompressor:: |
|
433 | 486 | |
|
434 | 487 | dctx = zstd.ZstdDecompressor() |
|
435 | 488 | with dctx.write_to(fh) as decompressor: |
|
436 | 489 | byte_size = decompressor.memory_size() |
|
437 | 490 | |
|
438 | 491 | Streaming Output API |
|
439 | 492 | ^^^^^^^^^^^^^^^^^^^^ |
|
440 | 493 | |
|
441 | 494 | ``read_from(fh)`` provides a mechanism to stream decompressed data out of a |
|
442 | 495 | compressed source as an iterator of data chunks.:: |
|
443 | 496 | |
|
444 | 497 | dctx = zstd.ZstdDecompressor() |
|
445 | 498 | for chunk in dctx.read_from(fh): |
|
446 | 499 | # Do something with original data. |
|
447 | 500 | |
|
448 | 501 | ``read_from()`` accepts either a) an object with a ``read(size)`` method that |
|
449 | 502 | will return compressed bytes or b) an object conforming to the buffer protocol that |
|
450 | 503 | can expose its data as a contiguous range of bytes. The ``bytes`` and |
|
451 | 504 | ``memoryview`` types expose this buffer protocol. |
|
452 | 505 | |
|
453 | 506 | ``read_from()`` returns an iterator whose elements are chunks of the |
|
454 | 507 | decompressed data. |
|
455 | 508 | |
|
456 | 509 | The size of requested ``read()`` from the source can be specified:: |
|
457 | 510 | |
|
458 | 511 | dctx = zstd.ZstdDecompressor() |
|
459 | 512 | for chunk in dctx.read_from(fh, read_size=16384): |
|
460 | 513 | pass |
|
461 | 514 | |
|
462 | 515 | It is also possible to skip leading bytes in the input data:: |
|
463 | 516 | |
|
464 | 517 | dctx = zstd.ZstdDecompressor() |
|
465 | 518 | for chunk in dctx.read_from(fh, skip_bytes=1): |
|
466 | 519 | pass |
|
467 | 520 | |
|
468 | 521 | Skipping leading bytes is useful if the source data contains extra |
|
469 | 522 | *header* data but you want to avoid the overhead of making a buffer copy |
|
470 | 523 | or allocating a new ``memoryview`` object in order to decompress the data. |
|
471 | 524 | |
|
472 | 525 | Similarly to ``ZstdCompressor.read_from()``, the consumer of the iterator |
|
473 | 526 | controls when data is decompressed. If the iterator isn't consumed, |
|
474 | 527 | decompression is put on hold. |
|
475 | 528 | |
|
476 | 529 | When ``read_from()`` is passed an object conforming to the buffer protocol, |
|
477 | 530 | the behavior may seem similar to what occurs when the simple decompression |
|
478 | 531 | API is used. However, this API works when the decompressed size is unknown. |
|
479 | 532 | Furthermore, if feeding large inputs, the decompressor will work in chunks |
|
480 | 533 | instead of performing a single operation. |
|
481 | 534 | |
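A minimal sketch of this mode, round tripping an in-memory buffer::

    import zstd

    compressed = zstd.ZstdCompressor().compress(b'data to compress')

    dctx = zstd.ZstdDecompressor()
    decompressed = b''.join(dctx.read_from(compressed))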
|
482 | 535 | Stream Copying API |
|
483 | 536 | ^^^^^^^^^^^^^^^^^^ |
|
484 | 537 | |
|
485 | 538 | ``copy_stream(ifh, ofh)`` can be used to copy data across 2 streams while |
|
486 | 539 | performing decompression.:: |
|
487 | 540 | |
|
488 | 541 | dctx = zstd.ZstdDecompressor() |
|
489 | 542 | dctx.copy_stream(ifh, ofh) |
|
490 | 543 | |
|
491 | 544 | e.g. to decompress a file to another file:: |
|
492 | 545 | |
|
493 | 546 | dctx = zstd.ZstdDecompressor() |
|
494 | 547 | with open(input_path, 'rb') as ifh, open(output_path, 'wb') as ofh: |
|
495 | 548 | dctx.copy_stream(ifh, ofh) |
|
496 | 549 | |
|
497 | 550 | The size of chunks being ``read()`` and ``write()`` from and to the streams |
|
498 | 551 | can be specified:: |
|
499 | 552 | |
|
500 | 553 | dctx = zstd.ZstdDecompressor() |
|
501 | 554 | dctx.copy_stream(ifh, ofh, read_size=8192, write_size=16384) |
|
502 | 555 | |
|
503 | 556 | Decompressor API |
|
504 | 557 | ^^^^^^^^^^^^^^^^ |
|
505 | 558 | |
|
506 | 559 | ``decompressobj()`` returns an object that exposes a ``decompress(data)`` |
|
507 | 560 | method. Compressed data chunks are fed into ``decompress(data)`` and |
|
508 | 561 | uncompressed output (or an empty bytes) is returned. Output from subsequent |
|
509 | 562 | calls needs to be concatenated to reassemble the full decompressed byte |
|
510 | 563 | sequence. |
|
511 | 564 | |
|
512 | 565 | The purpose of ``decompressobj()`` is to provide an API-compatible interface |
|
513 | 566 | with ``zlib.decompressobj`` and ``bz2.BZ2Decompressor``. This allows callers |
|
514 | 567 | to swap in different decompressor objects while using the same API. |
|
515 | 568 | |
|
516 | 569 | Each object is single use: once an input frame is decoded, ``decompress()`` |
|
517 | 570 | can no longer be called. |
|
518 | 571 | |
|
519 | 572 | Here is how this API should be used:: |
|
520 | 573 | |
|
521 | 574 | dctx = zstd.ZstdDecompressor() |
|
522 | 575 | dobj = dctx.decompressobj() |
|
523 | 576 | data = dobj.decompress(compressed_chunk_0) |
|
524 | 577 | data = dobj.decompress(compressed_chunk_1) |
|
525 | 578 | |
|
526 | 579 | Choosing an API |
|
527 | 580 | --------------- |
|
528 | 581 | |
|
529 | 582 | Various forms of compression and decompression APIs are provided because each |
|
530 | 583 | are suitable for different use cases. |
|
531 | 584 | |
|
532 | 585 | The simple/one-shot APIs are useful for small data, when the decompressed |
|
533 | 586 | data size is known (either recorded in the zstd frame header via |
|
534 | 587 | ``write_content_size`` or known via an out-of-band mechanism, such as a file |
|
535 | 588 | size). |
|
536 | 589 | |
|
537 | 590 | A limitation of the simple APIs is that input or output data must fit in memory. |
|
538 | 591 | And unless using advanced tricks with Python *buffer objects*, both input and |
|
539 | 592 | output must fit in memory simultaneously. |
|
540 | 593 | |
|
541 | 594 | Another limitation is that compression or decompression is performed as a single |
|
542 | 595 | operation. So if you feed large input, it could take a long time for the |
|
543 | 596 | function to return. |
|
544 | 597 | |
|
545 | 598 | The streaming APIs do not have the limitations of the simple API. The cost |
|
546 | 599 | is that they are more complex to use than a single function call. |
|
547 | 600 | |
|
548 | 601 | The streaming APIs put the caller in control of compression and decompression |
|
549 | 602 | behavior by allowing them to directly control either the input or output side |
|
550 | 603 | of the operation. |
|
551 | 604 | |
|
552 | 605 | With the streaming input APIs, the caller feeds data into the compressor or |
|
553 | 606 | decompressor as they see fit. Output data will only be written after the caller |
|
554 | 607 | has explicitly written data. |
|
555 | 608 | |
|
556 | 609 | With the streaming output APIs, the caller consumes output from the compressor |
|
557 | 610 | or decompressor as they see fit. The compressor or decompressor will only |
|
558 | 611 | consume data from the source when the caller is ready to receive it. |
|
559 | 612 | |
|
560 | 613 | One end of the streaming APIs involves a file-like object that must |
|
561 | 614 | ``write()`` output data or ``read()`` input data. Depending on what the |
|
562 | 615 | backing storage for these objects is, those operations may not complete quickly. |
|
563 | 616 | For example, when streaming compressed data to a file, the ``write()`` into |
|
564 | 617 | a streaming compressor could result in a ``write()`` to the filesystem, which |
|
565 | 618 | may take a long time to finish due to slow I/O on the filesystem. So, there |
|
566 | 619 | may be overhead in streaming APIs beyond the compression and decompression |
|
567 | 620 | operations. |
|
568 | 621 | |
|
569 | 622 | Dictionary Creation and Management |
|
570 | 623 | ---------------------------------- |
|
571 | 624 | |
|
572 | 625 | Zstandard allows *dictionaries* to be used when compressing and |
|
573 | 626 | decompressing data. The idea is that if you are compressing a lot of similar |
|
574 | 627 | data, you can precompute common properties of that data (such as recurring |
|
575 | 628 | byte sequences) to achieve better compression ratios. |
|
576 | 629 | |
|
577 | 630 | In Python, compression dictionaries are represented as the |
|
578 | 631 | ``ZstdCompressionDict`` type. |
|
579 | 632 | |
|
580 | 633 | Instances can be constructed from bytes:: |
|
581 | 634 | |
|
582 | 635 | dict_data = zstd.ZstdCompressionDict(data) |
|
583 | 636 | |
|
584 | 637 | More interestingly, instances can be created by *training* on sample data:: |
|
585 | 638 | |
|
586 | 639 | dict_data = zstd.train_dictionary(size, samples) |
|
587 | 640 | |
|
588 | 641 | This takes a list of bytes instances and creates and returns a |
|
589 | 642 | ``ZstdCompressionDict``. |
|
590 | 643 | |
|
591 | 644 | You can see how many bytes are in the dictionary by calling ``len()``:: |
|
592 | 645 | |
|
593 | 646 | dict_data = zstd.train_dictionary(size, samples) |
|
594 | 647 | dict_size = len(dict_data) # will not be larger than ``size`` |
|
595 | 648 | |
|
596 | 649 | Once you have a dictionary, you can pass it to the objects performing |
|
597 | 650 | compression and decompression:: |
|
598 | 651 | |
|
599 | 652 | dict_data = zstd.train_dictionary(16384, samples) |
|
600 | 653 | |
|
601 | 654 | cctx = zstd.ZstdCompressor(dict_data=dict_data) |
|
602 | 655 | for source_data in input_data: |
|
603 | 656 | compressed = cctx.compress(source_data) |
|
604 | 657 | # Do something with compressed data. |
|
605 | 658 | |
|
606 | 659 | dctx = zstd.ZstdDecompressor(dict_data=dict_data) |
|
607 | 660 | for compressed_data in input_data: |
|
608 | 661 | buffer = io.BytesIO() |
|
609 | 662 | with dctx.write_to(buffer) as decompressor: |
|
610 | 663 | decompressor.write(compressed_data) |
|
611 | 664 | # Do something with raw data in ``buffer``. |
|
612 | 665 | |
|
613 | 666 | Dictionaries have unique integer IDs. You can retrieve this ID via:: |
|
614 | 667 | |
|
615 | 668 | dict_id = zstd.dictionary_id(dict_data) |
|
616 | 669 | |
|
617 | 670 | You can obtain the raw data in the dict (useful for persisting and constructing |
|
618 | 671 | a ``ZstdCompressionDict`` later) via ``as_bytes()``:: |
|
619 | 672 | |
|
620 | 673 | dict_data = zstd.train_dictionary(size, samples) |
|
621 | 674 | raw_data = dict_data.as_bytes() |
|
622 | 675 | |
|
623 | 676 | Explicit Compression Parameters |
|
624 | 677 | ------------------------------- |
|
625 | 678 | |
|
626 | 679 | Zstandard's integer compression levels along with the input size and dictionary |
|
627 | 680 | size are converted into a data structure defining multiple parameters to tune |
|
628 | 681 | behavior of the compression algorithm. It is possible to define this |
|
629 | 682 | data structure explicitly to have lower-level control over compression behavior. |
|
630 | 683 | |
|
631 | 684 | The ``zstd.CompressionParameters`` type represents this data structure. |
|
632 | 685 | You can see how Zstandard converts compression levels to this data structure |
|
633 | 686 | by calling ``zstd.get_compression_parameters()``. e.g.:: |
|
634 | 687 | |
|
635 | 688 | params = zstd.get_compression_parameters(5) |
|
636 | 689 | |
|
637 | 690 | This function also accepts the uncompressed data size and dictionary size |
|
638 | 691 | to adjust parameters:: |
|
639 | 692 | |
|
640 | 693 | params = zstd.get_compression_parameters(3, source_size=len(data), dict_size=len(dict_data)) |
|
641 | 694 | |
|
642 | 695 | You can also construct compression parameters from their low-level components:: |
|
643 | 696 | |
|
644 | 697 | params = zstd.CompressionParameters(20, 6, 12, 5, 4, 10, zstd.STRATEGY_FAST) |
|
645 | 698 | |
|
646 | 699 | You can then configure a compressor to use the custom parameters:: |
|
647 | 700 | |
|
648 | 701 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
649 | 702 | |
|
650 | 703 | The members of the ``CompressionParameters`` tuple are as follows:: |
|
651 | 704 | |
|
652 | 705 | * 0 - Window log |
|
653 | 706 | * 1 - Chain log |
|
654 | 707 | * 2 - Hash log |
|
655 | 708 | * 3 - Search log |
|
656 | 709 | * 4 - Search length |
|
657 | 710 | * 5 - Target length |
|
658 | 711 | * 6 - Strategy (one of the ``zstd.STRATEGY_`` constants) |
|
659 | 712 | |
|
660 | 713 | You'll need to read the Zstandard documentation for what these parameters |
|
661 | 714 | do. |
|
662 | 715 | |
|
663 | 716 | Misc Functionality |
|
664 | 717 | ------------------ |
|
665 | 718 | |
|
666 | 719 | estimate_compression_context_size(CompressionParameters) |
|
667 | 720 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
|
668 | 721 | |
|
669 | 722 | Given a ``CompressionParameters`` struct, estimate the memory size required |
|
670 | 723 | to perform compression. |
|
671 | 724 | |
|
672 | 725 | estimate_decompression_context_size() |
|
673 | 726 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
|
674 | 727 | |
|
675 | 728 | Estimate the memory size requirements for a decompressor instance. |
|
676 | 729 | |
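A sketch combining both estimates (the compression level is illustrative)::

    import zstd

    params = zstd.get_compression_parameters(3)
    compress_size = zstd.estimate_compression_context_size(params)
    decompress_size = zstd.estimate_decompression_context_size()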
|
677 | 730 | Constants |
|
678 | 731 | --------- |
|
679 | 732 | |
|
680 | 733 | The following module constants/attributes are exposed: |
|
681 | 734 | |
|
682 | 735 | ZSTD_VERSION |
|
683 | 736 | This module attribute exposes a 3-tuple of the Zstandard version. e.g. |
|
684 | 737 | ``(1, 0, 0)`` |
|
685 | 738 | MAX_COMPRESSION_LEVEL |
|
686 | 739 | Integer max compression level accepted by compression functions |
|
687 | 740 | COMPRESSION_RECOMMENDED_INPUT_SIZE |
|
688 | 741 | Recommended chunk size to feed to compressor functions |
|
689 | 742 | COMPRESSION_RECOMMENDED_OUTPUT_SIZE |
|
690 | 743 | Recommended chunk size for compression output |
|
691 | 744 | DECOMPRESSION_RECOMMENDED_INPUT_SIZE |
|
692 | 745 | Recommended chunk size to feed into decompressor functions |
|
693 | 746 | DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE |
|
694 | 747 | Recommended chunk size for decompression output |
|
695 | 748 | |
|
696 | 749 | FRAME_HEADER |
|
697 | 750 | Bytes containing the header of a Zstandard frame |
|
698 | 751 | MAGIC_NUMBER |
|
699 | 752 | Frame header as an integer |
|
700 | 753 | |
|
701 | 754 | WINDOWLOG_MIN |
|
702 | 755 | Minimum value for compression parameter |
|
703 | 756 | WINDOWLOG_MAX |
|
704 | 757 | Maximum value for compression parameter |
|
705 | 758 | CHAINLOG_MIN |
|
706 | 759 | Minimum value for compression parameter |
|
707 | 760 | CHAINLOG_MAX |
|
708 | 761 | Maximum value for compression parameter |
|
709 | 762 | HASHLOG_MIN |
|
710 | 763 | Minimum value for compression parameter |
|
711 | 764 | HASHLOG_MAX |
|
712 | 765 | Maximum value for compression parameter |
|
713 | 766 | SEARCHLOG_MIN |
|
714 | 767 | Minimum value for compression parameter |
|
715 | 768 | SEARCHLOG_MAX |
|
716 | 769 | Maximum value for compression parameter |
|
717 | 770 | SEARCHLENGTH_MIN |
|
718 | 771 | Minimum value for compression parameter |
|
719 | 772 | SEARCHLENGTH_MAX |
|
720 | 773 | Maximum value for compression parameter |
|
721 | 774 | TARGETLENGTH_MIN |
|
722 | 775 | Minimum value for compression parameter |
|
723 | 776 | TARGETLENGTH_MAX |
|
724 | 777 | Maximum value for compression parameter |
|
725 | 778 | STRATEGY_FAST |
|
726 | 779 | Compression strategy |
|
727 | 780 | STRATEGY_DFAST |
|
728 | 781 | Compression strategy |
|
729 | 782 | STRATEGY_GREEDY |
|
730 | 783 | Compression strategy |
|
731 | 784 | STRATEGY_LAZY |
|
732 | 785 | Compression strategy |
|
733 | 786 | STRATEGY_LAZY2 |
|
734 | 787 | Compression strategy |
|
735 | 788 | STRATEGY_BTLAZY2 |
|
736 | 789 | Compression strategy |
|
737 | 790 | STRATEGY_BTOPT |
|
738 | 791 | Compression strategy |
|
739 | 792 | |
|
740 | 793 | Note on Zstandard's *Experimental* API |
|
741 | 794 | ====================================== |
|
742 | 795 | |
|
743 | 796 | Many of the Zstandard APIs used by this module are marked as *experimental* |
|
744 | 797 | within the Zstandard project. This includes a large number of useful |
|
745 | 798 | features, such as compression and frame parameters and parts of dictionary |
|
746 | 799 | compression. |
|
747 | 800 | |
|
748 | 801 | It is unclear how Zstandard's C API will evolve over time, especially with |
|
749 | 802 | regards to this *experimental* functionality. We will try to maintain |
|
750 | 803 | backwards compatibility at the Python API level. However, we cannot |
|
751 | 804 | guarantee this for things not under our control. |
|
752 | 805 | |
|
753 | 806 | Since a copy of the Zstandard source code is distributed with this |
|
754 | 807 | module and since we compile against it, the behavior of a specific |
|
755 | 808 | version of this module should be constant for all of time. So if you |
|
756 | 809 | pin the version of this module used in your projects (which is a Python |
|
757 | 810 | best practice), you should be insulated from unwanted future changes. |
|
758 | 811 | |
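For example, a ``requirements.txt`` entry pinning an exact version (the
version shown is illustrative)::

    zstandard==0.6.0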
|
759 | 812 | Donate |
|
760 | 813 | ====== |
|
761 | 814 | |
|
762 | 815 | A lot of time has been invested into this project by the author. |
|
763 | 816 | |
|
764 | 817 | If you find this project useful and would like to thank the author for |
|
765 | 818 | their work, consider donating some money. Any amount is appreciated. |
|
766 | 819 | |
|
767 | 820 | .. image:: https://www.paypalobjects.com/en_US/i/btn/btn_donate_LG.gif |
|
768 | 821 | :target: https://www.paypal.com/cgi-bin/webscr?cmd=_donations&business=gregory%2eszorc%40gmail%2ecom&lc=US&item_name=python%2dzstandard¤cy_code=USD&bn=PP%2dDonationsBF%3abtn_donate_LG%2egif%3aNonHosted |
|
769 | 822 | :alt: Donate via PayPal |
|
770 | 823 | |
|
771 | 824 | .. |ci-status| image:: https://travis-ci.org/indygreg/python-zstandard.svg?branch=master |
|
772 | 825 | :target: https://travis-ci.org/indygreg/python-zstandard |
|
773 | 826 | |
|
774 | 827 | .. |win-ci-status| image:: https://ci.appveyor.com/api/projects/status/github/indygreg/python-zstandard?svg=true |
|
775 | 828 | :target: https://ci.appveyor.com/project/indygreg/python-zstandard |
|
776 | 829 | :alt: Windows build status |
@@ -1,247 +1,247 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #include "python-zstandard.h" |
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | 13 | ZstdCompressionDict* train_dictionary(PyObject* self, PyObject* args, PyObject* kwargs) { |
|
14 | 14 | static char *kwlist[] = { "dict_size", "samples", "parameters", NULL }; |
|
15 | 15 | size_t capacity; |
|
16 | 16 | PyObject* samples; |
|
17 | 17 | Py_ssize_t samplesLen; |
|
18 | 18 | PyObject* parameters = NULL; |
|
19 | 19 | ZDICT_params_t zparams; |
|
20 | 20 | Py_ssize_t sampleIndex; |
|
21 | 21 | Py_ssize_t sampleSize; |
|
22 | 22 | PyObject* sampleItem; |
|
23 | 23 | size_t zresult; |
|
24 | 24 | void* sampleBuffer; |
|
25 | 25 | void* sampleOffset; |
|
26 | 26 | size_t samplesSize = 0; |
|
27 | 27 | size_t* sampleSizes; |
|
28 | 28 | void* dict; |
|
29 | 29 | ZstdCompressionDict* result; |
|
30 | 30 | |
|
31 | 31 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "nO!|O!", kwlist, |
|
32 | 32 | &capacity, |
|
33 | 33 | &PyList_Type, &samples, |
|
34 | 34 | (PyObject*)&DictParametersType, ¶meters)) { |
|
35 | 35 | return NULL; |
|
36 | 36 | } |
|
37 | 37 | |
|
38 | 38 | /* Validate parameters first since it is easiest. */ |
|
39 | 39 | zparams.selectivityLevel = 0; |
|
40 | 40 | zparams.compressionLevel = 0; |
|
41 | 41 | zparams.notificationLevel = 0; |
|
42 | 42 | zparams.dictID = 0; |
|
43 | 43 | zparams.reserved[0] = 0; |
|
44 | 44 | zparams.reserved[1] = 0; |
|
45 | 45 | |
|
46 | 46 | if (parameters) { |
|
47 | 47 | /* TODO validate data ranges */ |
|
48 | 48 | zparams.selectivityLevel = PyLong_AsUnsignedLong(PyTuple_GetItem(parameters, 0)); |
|
49 | 49 | zparams.compressionLevel = PyLong_AsLong(PyTuple_GetItem(parameters, 1)); |
|
50 | 50 | zparams.notificationLevel = PyLong_AsUnsignedLong(PyTuple_GetItem(parameters, 2)); |
|
51 | 51 | zparams.dictID = PyLong_AsUnsignedLong(PyTuple_GetItem(parameters, 3)); |
|
52 | 52 | } |
|
53 | 53 | |
|
54 | 54 | /* Figure out the size of the raw samples */ |
|
55 | 55 | samplesLen = PyList_Size(samples); |
|
56 | 56 | for (sampleIndex = 0; sampleIndex < samplesLen; sampleIndex++) { |
|
57 | 57 | sampleItem = PyList_GetItem(samples, sampleIndex); |
|
58 | 58 | if (!PyBytes_Check(sampleItem)) { |
|
59 | 59 | PyErr_SetString(PyExc_ValueError, "samples must be bytes"); |
|
60 | 60 | /* TODO probably need to perform DECREF here */ |
|
61 | 61 | return NULL; |
|
62 | 62 | } |
|
63 | 63 | samplesSize += PyBytes_GET_SIZE(sampleItem); |
|
64 | 64 | } |
|
65 | 65 | |
|
66 | 66 | /* Now that we know the total size of the raw samples, we can allocate |
|
67 | 67 | a buffer for the raw data */ |
|
68 | sampleBuffer = malloc(samplesSize); | |
|
68 | sampleBuffer = PyMem_Malloc(samplesSize); | |
|
69 | 69 | if (!sampleBuffer) { |
|
70 | 70 | PyErr_NoMemory(); |
|
71 | 71 | return NULL; |
|
72 | 72 | } |
|
73 | sampleSizes = malloc(samplesLen * sizeof(size_t)); | |
|
73 | sampleSizes = PyMem_Malloc(samplesLen * sizeof(size_t)); | |
|
74 | 74 | if (!sampleSizes) { |
|
75 | free(sampleBuffer); | |
|
75 | PyMem_Free(sampleBuffer); | |
|
76 | 76 | PyErr_NoMemory(); |
|
77 | 77 | return NULL; |
|
78 | 78 | } |
|
79 | 79 | |
|
80 | 80 | sampleOffset = sampleBuffer; |
|
81 | 81 | /* Now iterate again and assemble the samples in the buffer */ |
|
82 | 82 | for (sampleIndex = 0; sampleIndex < samplesLen; sampleIndex++) { |
|
83 | 83 | sampleItem = PyList_GetItem(samples, sampleIndex); |
|
84 | 84 | sampleSize = PyBytes_GET_SIZE(sampleItem); |
|
85 | 85 | sampleSizes[sampleIndex] = sampleSize; |
|
86 | 86 | memcpy(sampleOffset, PyBytes_AS_STRING(sampleItem), sampleSize); |
|
87 | 87 | sampleOffset = (char*)sampleOffset + sampleSize; |
|
88 | 88 | } |
|
89 | 89 | |
|
90 | dict = malloc(capacity); | |
|
90 | dict = PyMem_Malloc(capacity); | |
|
91 | 91 | if (!dict) { |
|
92 | free(sampleSizes); | |
|
93 | free(sampleBuffer); | |
|
92 | PyMem_Free(sampleSizes); | |
|
93 | PyMem_Free(sampleBuffer); | |
|
94 | 94 | PyErr_NoMemory(); |
|
95 | 95 | return NULL; |
|
96 | 96 | } |
|
97 | 97 | |
|
98 | 98 | zresult = ZDICT_trainFromBuffer_advanced(dict, capacity, |
|
99 | 99 | sampleBuffer, sampleSizes, (unsigned int)samplesLen, |
|
100 | 100 | zparams); |
|
101 | 101 | if (ZDICT_isError(zresult)) { |
|
102 | 102 | PyErr_Format(ZstdError, "Cannot train dict: %s", ZDICT_getErrorName(zresult)); |
|
103 | free(dict); | |
|
104 | free(sampleSizes); | |
|
105 | free(sampleBuffer); | |
|
103 | PyMem_Free(dict); | |
|
104 | PyMem_Free(sampleSizes); | |
|
105 | PyMem_Free(sampleBuffer); | |
|
106 | 106 | return NULL; |
|
107 | 107 | } |
|
108 | 108 | |
|
109 | 109 | result = PyObject_New(ZstdCompressionDict, &ZstdCompressionDictType); |
|
110 | 110 | if (!result) { |
|
111 | 111 | return NULL; |
|
112 | 112 | } |
|
113 | 113 | |
|
114 | 114 | result->dictData = dict; |
|
115 | 115 | result->dictSize = zresult; |
|
116 | 116 | return result; |
|
117 | 117 | } |
|
118 | 118 | |
|
119 | 119 | |
|
120 | 120 | PyDoc_STRVAR(ZstdCompressionDict__doc__, |
|
121 | 121 | "ZstdCompressionDict(data) - Represents a computed compression dictionary\n" |
|
122 | 122 | "\n" |
|
123 | 123 | "This type holds the results of a computed Zstandard compression dictionary.\n" |
|
124 | 124 | "Instances are obtained by calling ``train_dictionary()`` or by passing bytes\n" |
|
125 | 125 | "obtained from another source into the constructor.\n" |
|
126 | 126 | ); |
|
127 | 127 | |
|
128 | 128 | static int ZstdCompressionDict_init(ZstdCompressionDict* self, PyObject* args) { |
|
129 | 129 | const char* source; |
|
130 | 130 | Py_ssize_t sourceSize; |
|
131 | 131 | |
|
132 | 132 | self->dictData = NULL; |
|
133 | 133 | self->dictSize = 0; |
|
134 | 134 | |
|
135 | 135 | #if PY_MAJOR_VERSION >= 3 |
|
136 | 136 | if (!PyArg_ParseTuple(args, "y#", &source, &sourceSize)) { |
|
137 | 137 | #else |
|
138 | 138 | if (!PyArg_ParseTuple(args, "s#", &source, &sourceSize)) { |
|
139 | 139 | #endif |
|
140 | 140 | return -1; |
|
141 | 141 | } |
|
142 | 142 | |
|
143 | self->dictData = malloc(sourceSize); | |
|
143 | self->dictData = PyMem_Malloc(sourceSize); | |
|
144 | 144 | if (!self->dictData) { |
|
145 | 145 | PyErr_NoMemory(); |
|
146 | 146 | return -1; |
|
147 | 147 | } |
|
148 | 148 | |
|
149 | 149 | memcpy(self->dictData, source, sourceSize); |
|
150 | 150 | self->dictSize = sourceSize; |
|
151 | 151 | |
|
152 | 152 | return 0; |
|
153 | 153 | } |
|
154 | 154 | |
|
155 | 155 | static void ZstdCompressionDict_dealloc(ZstdCompressionDict* self) { |
|
156 | 156 | if (self->dictData) { |
|
157 | free(self->dictData); | |
|
157 | PyMem_Free(self->dictData); | |
|
158 | 158 | self->dictData = NULL; |
|
159 | 159 | } |
|
160 | 160 | |
|
161 | 161 | PyObject_Del(self); |
|
162 | 162 | } |
|
163 | 163 | |
|
164 | 164 | static PyObject* ZstdCompressionDict_dict_id(ZstdCompressionDict* self) { |
|
165 | 165 | unsigned dictID = ZDICT_getDictID(self->dictData, self->dictSize); |
|
166 | 166 | |
|
167 | 167 | return PyLong_FromLong(dictID); |
|
168 | 168 | } |
|
169 | 169 | |
|
170 | 170 | static PyObject* ZstdCompressionDict_as_bytes(ZstdCompressionDict* self) { |
|
171 | 171 | return PyBytes_FromStringAndSize(self->dictData, self->dictSize); |
|
172 | 172 | } |
|
173 | 173 | |
|
174 | 174 | static PyMethodDef ZstdCompressionDict_methods[] = { |
|
175 | 175 | { "dict_id", (PyCFunction)ZstdCompressionDict_dict_id, METH_NOARGS, |
|
176 | 176 | PyDoc_STR("dict_id() -- obtain the numeric dictionary ID") }, |
|
177 | 177 | { "as_bytes", (PyCFunction)ZstdCompressionDict_as_bytes, METH_NOARGS, |
|
178 | 178 | PyDoc_STR("as_bytes() -- obtain the raw bytes constituting the dictionary data") }, |
|
179 | 179 | { NULL, NULL } |
|
180 | 180 | }; |
|
181 | 181 | |
|
182 | 182 | static Py_ssize_t ZstdCompressionDict_length(ZstdCompressionDict* self) { |
|
183 | 183 | return self->dictSize; |
|
184 | 184 | } |
|
185 | 185 | |
|
186 | 186 | static PySequenceMethods ZstdCompressionDict_sq = { |
|
187 | 187 | (lenfunc)ZstdCompressionDict_length, /* sq_length */ |
|
188 | 188 | 0, /* sq_concat */ |
|
189 | 189 | 0, /* sq_repeat */ |
|
190 | 190 | 0, /* sq_item */ |
|
191 | 191 | 0, /* sq_ass_item */ |
|
192 | 192 | 0, /* sq_contains */ |
|
193 | 193 | 0, /* sq_inplace_concat */ |
|
194 | 194 | 0 /* sq_inplace_repeat */ |
|
195 | 195 | }; |
|
196 | 196 | |
|
197 | 197 | PyTypeObject ZstdCompressionDictType = { |
|
198 | 198 | PyVarObject_HEAD_INIT(NULL, 0) |
|
199 | 199 | "zstd.ZstdCompressionDict", /* tp_name */ |
|
200 | 200 | sizeof(ZstdCompressionDict), /* tp_basicsize */ |
|
201 | 201 | 0, /* tp_itemsize */ |
|
202 | 202 | (destructor)ZstdCompressionDict_dealloc, /* tp_dealloc */ |
|
203 | 203 | 0, /* tp_print */ |
|
204 | 204 | 0, /* tp_getattr */ |
|
205 | 205 | 0, /* tp_setattr */ |
|
206 | 206 | 0, /* tp_compare */ |
|
207 | 207 | 0, /* tp_repr */ |
|
208 | 208 | 0, /* tp_as_number */ |
|
209 | 209 | &ZstdCompressionDict_sq, /* tp_as_sequence */ |
|
210 | 210 | 0, /* tp_as_mapping */ |
|
211 | 211 | 0, /* tp_hash */ |
|
212 | 212 | 0, /* tp_call */ |
|
213 | 213 | 0, /* tp_str */ |
|
214 | 214 | 0, /* tp_getattro */ |
|
215 | 215 | 0, /* tp_setattro */ |
|
216 | 216 | 0, /* tp_as_buffer */ |
|
217 | 217 | Py_TPFLAGS_DEFAULT | Py_TPFLAGS_BASETYPE, /* tp_flags */ |
|
218 | 218 | ZstdCompressionDict__doc__, /* tp_doc */ |
|
219 | 219 | 0, /* tp_traverse */ |
|
220 | 220 | 0, /* tp_clear */ |
|
221 | 221 | 0, /* tp_richcompare */ |
|
222 | 222 | 0, /* tp_weaklistoffset */ |
|
223 | 223 | 0, /* tp_iter */ |
|
224 | 224 | 0, /* tp_iternext */ |
|
225 | 225 | ZstdCompressionDict_methods, /* tp_methods */ |
|
226 | 226 | 0, /* tp_members */ |
|
227 | 227 | 0, /* tp_getset */ |
|
228 | 228 | 0, /* tp_base */ |
|
229 | 229 | 0, /* tp_dict */ |
|
230 | 230 | 0, /* tp_descr_get */ |
|
231 | 231 | 0, /* tp_descr_set */ |
|
232 | 232 | 0, /* tp_dictoffset */ |
|
233 | 233 | (initproc)ZstdCompressionDict_init, /* tp_init */ |
|
234 | 234 | 0, /* tp_alloc */ |
|
235 | 235 | PyType_GenericNew, /* tp_new */ |
|
236 | 236 | }; |
|
237 | 237 | |
|
238 | 238 | void compressiondict_module_init(PyObject* mod) { |
|
239 | 239 | Py_TYPE(&ZstdCompressionDictType) = &PyType_Type; |
|
240 | 240 | if (PyType_Ready(&ZstdCompressionDictType) < 0) { |
|
241 | 241 | return; |
|
242 | 242 | } |
|
243 | 243 | |
|
244 | 244 | Py_INCREF((PyObject*)&ZstdCompressionDictType); |
|
245 | 245 | PyModule_AddObject(mod, "ZstdCompressionDict", |
|
246 | 246 | (PyObject*)&ZstdCompressionDictType); |
|
247 | 247 | } |
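
The ``ZstdCompressionDict`` type above is driven from Python roughly as follows. This is a minimal sketch, not part of the changeset: it assumes the module imports as ``zstd`` and uses the ``train_dictionary()`` entry point referenced in the docstring (the samples and the 16384-byte target size are made up for illustration)::

    import zstd

    # Hypothetical training samples; real dictionaries want many varied inputs.
    samples = [b'foo bar baz' * 8, b'foo baz bar' * 8, b'baz bar foo' * 8]

    dict_data = zstd.train_dictionary(16384, samples)

    print(dict_data.dict_id())    # dict_id() wraps ZDICT_getDictID()
    raw = dict_data.as_bytes()    # as_bytes() copies dictData into a bytes object
    print(len(dict_data))         # sq_length exposes dictSize via len()

    # The constructor accepts raw bytes and memcpy()s them (see tp_init above).
    restored = zstd.ZstdCompressionDict(raw)
    assert restored.dict_id() == dict_data.dict_id()
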
@@ -1,235 +1,288 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #include "python-zstandard.h" |
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | 13 | PyDoc_STRVAR(ZstdCompresssionWriter__doc__, |
|
14 | 14 | """A context manager used for writing compressed output to a writer.\n" |
|
15 | 15 | ); |
|
16 | 16 | |
|
17 | 17 | static void ZstdCompressionWriter_dealloc(ZstdCompressionWriter* self) { |
|
18 | 18 | Py_XDECREF(self->compressor); |
|
19 | 19 | Py_XDECREF(self->writer); |
|
20 | 20 | |
|
21 | 21 | if (self->cstream) { |
|
22 | 22 | ZSTD_freeCStream(self->cstream); |
|
23 | 23 | self->cstream = NULL; |
|
24 | 24 | } |
|
25 | 25 | |
|
26 | 26 | PyObject_Del(self); |
|
27 | 27 | } |
|
28 | 28 | |
|
29 | 29 | static PyObject* ZstdCompressionWriter_enter(ZstdCompressionWriter* self) { |
|
30 | 30 | if (self->entered) { |
|
31 | 31 | PyErr_SetString(ZstdError, "cannot __enter__ multiple times"); |
|
32 | 32 | return NULL; |
|
33 | 33 | } |
|
34 | 34 | |
|
35 | 35 | self->cstream = CStream_from_ZstdCompressor(self->compressor, self->sourceSize); |
|
36 | 36 | if (!self->cstream) { |
|
37 | 37 | return NULL; |
|
38 | 38 | } |
|
39 | 39 | |
|
40 | 40 | self->entered = 1; |
|
41 | 41 | |
|
42 | 42 | Py_INCREF(self); |
|
43 | 43 | return (PyObject*)self; |
|
44 | 44 | } |
|
45 | 45 | |
|
46 | 46 | static PyObject* ZstdCompressionWriter_exit(ZstdCompressionWriter* self, PyObject* args) { |
|
47 | 47 | PyObject* exc_type; |
|
48 | 48 | PyObject* exc_value; |
|
49 | 49 | PyObject* exc_tb; |
|
50 | 50 | size_t zresult; |
|
51 | 51 | |
|
52 | 52 | ZSTD_outBuffer output; |
|
53 | 53 | PyObject* res; |
|
54 | 54 | |
|
55 | 55 | if (!PyArg_ParseTuple(args, "OOO", &exc_type, &exc_value, &exc_tb)) { |
|
56 | 56 | return NULL; |
|
57 | 57 | } |
|
58 | 58 | |
|
59 | 59 | self->entered = 0; |
|
60 | 60 | |
|
61 | 61 | if (self->cstream && exc_type == Py_None && exc_value == Py_None && |
|
62 | 62 | exc_tb == Py_None) { |
|
63 | 63 | |
|
64 | output.dst = malloc(self->outSize); | |
|
64 | output.dst = PyMem_Malloc(self->outSize); | |
|
65 | 65 | if (!output.dst) { |
|
66 | 66 | return PyErr_NoMemory(); |
|
67 | 67 | } |
|
68 | 68 | output.size = self->outSize; |
|
69 | 69 | output.pos = 0; |
|
70 | 70 | |
|
71 | 71 | while (1) { |
|
72 | 72 | zresult = ZSTD_endStream(self->cstream, &output); |
|
73 | 73 | if (ZSTD_isError(zresult)) { |
|
74 | 74 | PyErr_Format(ZstdError, "error ending compression stream: %s", |
|
75 | 75 | ZSTD_getErrorName(zresult)); |
|
76 | free(output.dst); | 

76 | PyMem_Free(output.dst); | 
|
77 | 77 | return NULL; |
|
78 | 78 | } |
|
79 | 79 | |
|
80 | 80 | if (output.pos) { |
|
81 | 81 | #if PY_MAJOR_VERSION >= 3 |
|
82 | 82 | res = PyObject_CallMethod(self->writer, "write", "y#", |
|
83 | 83 | #else |
|
84 | 84 | res = PyObject_CallMethod(self->writer, "write", "s#", |
|
85 | 85 | #endif |
|
86 | 86 | output.dst, output.pos); |
|
87 | 87 | Py_XDECREF(res); |
|
88 | 88 | } |
|
89 | 89 | |
|
90 | 90 | if (!zresult) { |
|
91 | 91 | break; |
|
92 | 92 | } |
|
93 | 93 | |
|
94 | 94 | output.pos = 0; |
|
95 | 95 | } |
|
96 | 96 | |
|
97 | free(output.dst); | 

97 | PyMem_Free(output.dst); | 
|
98 | 98 | ZSTD_freeCStream(self->cstream); |
|
99 | 99 | self->cstream = NULL; |
|
100 | 100 | } |
|
101 | 101 | |
|
102 | 102 | Py_RETURN_FALSE; |
|
103 | 103 | } |
|
104 | 104 | |
|
105 | 105 | static PyObject* ZstdCompressionWriter_memory_size(ZstdCompressionWriter* self) { |
|
106 | 106 | if (!self->cstream) { |
|
107 | 107 | PyErr_SetString(ZstdError, "cannot determine size of an inactive compressor; " |
|
108 | 108 | "call when a context manager is active"); |
|
109 | 109 | return NULL; |
|
110 | 110 | } |
|
111 | 111 | |
|
112 | 112 | return PyLong_FromSize_t(ZSTD_sizeof_CStream(self->cstream)); |
|
113 | 113 | } |
|
114 | 114 | |
|
115 | 115 | static PyObject* ZstdCompressionWriter_write(ZstdCompressionWriter* self, PyObject* args) { |
|
116 | 116 | const char* source; |
|
117 | 117 | Py_ssize_t sourceSize; |
|
118 | 118 | size_t zresult; |
|
119 | 119 | ZSTD_inBuffer input; |
|
120 | 120 | ZSTD_outBuffer output; |
|
121 | 121 | PyObject* res; |
|
122 | 122 | |
|
123 | 123 | #if PY_MAJOR_VERSION >= 3 |
|
124 | 124 | if (!PyArg_ParseTuple(args, "y#", &source, &sourceSize)) { |
|
125 | 125 | #else |
|
126 | 126 | if (!PyArg_ParseTuple(args, "s#", &source, &sourceSize)) { |
|
127 | 127 | #endif |
|
128 | 128 | return NULL; |
|
129 | 129 | } |
|
130 | 130 | |
|
131 | 131 | if (!self->entered) { |
|
132 | 132 | PyErr_SetString(ZstdError, "compress must be called from an active context manager"); |
|
133 | 133 | return NULL; |
|
134 | 134 | } |
|
135 | 135 | |
|
136 | output.dst = malloc(self->outSize); | |
|
136 | output.dst = PyMem_Malloc(self->outSize); | |
|
137 | 137 | if (!output.dst) { |
|
138 | 138 | return PyErr_NoMemory(); |
|
139 | 139 | } |
|
140 | 140 | output.size = self->outSize; |
|
141 | 141 | output.pos = 0; |
|
142 | 142 | |
|
143 | 143 | input.src = source; |
|
144 | 144 | input.size = sourceSize; |
|
145 | 145 | input.pos = 0; |
|
146 | 146 | |
|
147 | 147 | while ((ssize_t)input.pos < sourceSize) { |
|
148 | 148 | Py_BEGIN_ALLOW_THREADS |
|
149 | 149 | zresult = ZSTD_compressStream(self->cstream, &output, &input); |
|
150 | 150 | Py_END_ALLOW_THREADS |
|
151 | 151 | |
|
152 | 152 | if (ZSTD_isError(zresult)) { |
|
153 | free(output.dst); | 

153 | PyMem_Free(output.dst); | 
|
154 | 154 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); |
|
155 | 155 | return NULL; |
|
156 | 156 | } |
|
157 | 157 | |
|
158 | 158 | /* Copy data from output buffer to writer. */ |
|
159 | 159 | if (output.pos) { |
|
160 | 160 | #if PY_MAJOR_VERSION >= 3 |
|
161 | 161 | res = PyObject_CallMethod(self->writer, "write", "y#", |
|
162 | 162 | #else |
|
163 | 163 | res = PyObject_CallMethod(self->writer, "write", "s#", |
|
164 | 164 | #endif |
|
165 | 165 | output.dst, output.pos); |
|
166 | 166 | Py_XDECREF(res); |
|
167 | 167 | } |
|
168 | 168 | output.pos = 0; |
|
169 | 169 | } |
|
170 | 170 | |
|
171 | free(output.dst); | 

171 | PyMem_Free(output.dst); | 
|
172 | 172 | |
|
173 | 173 | /* TODO return bytes written */ |
|
174 | 174 | Py_RETURN_NONE; |
|
175 | } | |
|
176 | ||
|
177 | static PyObject* ZstdCompressionWriter_flush(ZstdCompressionWriter* self, PyObject* args) { | |
|
178 | size_t zresult; | |
|
179 | ZSTD_outBuffer output; | |
|
180 | PyObject* res; | |
|
181 | ||
|
182 | if (!self->entered) { | |
|
183 | PyErr_SetString(ZstdError, "flush must be called from an active context manager"); | |
|
184 | return NULL; | |
|
175 | 185 | } |
|
176 | 186 | |
|
187 | output.dst = PyMem_Malloc(self->outSize); | |
|
188 | if (!output.dst) { | |
|
189 | return PyErr_NoMemory(); | |
|
190 | } | |
|
191 | output.size = self->outSize; | |
|
192 | output.pos = 0; | |
|
193 | ||
|
194 | while (1) { | |
|
195 | Py_BEGIN_ALLOW_THREADS | |
|
196 | zresult = ZSTD_flushStream(self->cstream, &output); | |
|
197 | Py_END_ALLOW_THREADS | |
|
198 | ||
|
199 | if (ZSTD_isError(zresult)) { | |
|
200 | PyMem_Free(output.dst); | |
|
201 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); | |
|
202 | return NULL; | |
|
203 | } | |
|
204 | ||
|
205 | if (!output.pos) { | |
|
206 | break; | |
|
207 | } | |
|
208 | ||
|
209 | /* Copy data from output buffer to writer. */ | |
|
210 | if (output.pos) { | |
|
211 | #if PY_MAJOR_VERSION >= 3 | |
|
212 | res = PyObject_CallMethod(self->writer, "write", "y#", | |
|
213 | #else | |
|
214 | res = PyObject_CallMethod(self->writer, "write", "s#", | |
|
215 | #endif | |
|
216 | output.dst, output.pos); | |
|
217 | Py_XDECREF(res); | |
|
218 | } | |
|
219 | output.pos = 0; | |
|
220 | } | |
|
221 | ||
|
222 | PyMem_Free(output.dst); | |
|
223 | ||
|
224 | /* TODO return bytes written */ | |
|
225 | Py_RETURN_NONE; | |
|
226 | } | |
|
227 | ||
|
177 | 228 | static PyMethodDef ZstdCompressionWriter_methods[] = { |
|
178 | 229 | { "__enter__", (PyCFunction)ZstdCompressionWriter_enter, METH_NOARGS, |
|
179 | 230 | PyDoc_STR("Enter a compression context.") }, |
|
180 | 231 | { "__exit__", (PyCFunction)ZstdCompressionWriter_exit, METH_VARARGS, |
|
181 | 232 | PyDoc_STR("Exit a compression context.") }, |
|
182 | 233 | { "memory_size", (PyCFunction)ZstdCompressionWriter_memory_size, METH_NOARGS, |
|
183 | 234 | PyDoc_STR("Obtain the memory size of the underlying compressor") }, |
|
184 | 235 | { "write", (PyCFunction)ZstdCompressionWriter_write, METH_VARARGS, |
|
185 | 236 | PyDoc_STR("Compress data") }, |
|
237 | { "flush", (PyCFunction)ZstdCompressionWriter_flush, METH_NOARGS, | |
|
238 | PyDoc_STR("Flush data and finish a zstd frame") }, | |
|
186 | 239 | { NULL, NULL } |
|
187 | 240 | }; |
|
188 | 241 | |
|
189 | 242 | PyTypeObject ZstdCompressionWriterType = { |
|
190 | 243 | PyVarObject_HEAD_INIT(NULL, 0) |
|
191 | 244 | "zstd.ZstdCompressionWriter", /* tp_name */ |
|
192 | 245 | sizeof(ZstdCompressionWriter), /* tp_basicsize */ |
|
193 | 246 | 0, /* tp_itemsize */ |
|
194 | 247 | (destructor)ZstdCompressionWriter_dealloc, /* tp_dealloc */ |
|
195 | 248 | 0, /* tp_print */ |
|
196 | 249 | 0, /* tp_getattr */ |
|
197 | 250 | 0, /* tp_setattr */ |
|
198 | 251 | 0, /* tp_compare */ |
|
199 | 252 | 0, /* tp_repr */ |
|
200 | 253 | 0, /* tp_as_number */ |
|
201 | 254 | 0, /* tp_as_sequence */ |
|
202 | 255 | 0, /* tp_as_mapping */ |
|
203 | 256 | 0, /* tp_hash */ |
|
204 | 257 | 0, /* tp_call */ |
|
205 | 258 | 0, /* tp_str */ |
|
206 | 259 | 0, /* tp_getattro */ |
|
207 | 260 | 0, /* tp_setattro */ |
|
208 | 261 | 0, /* tp_as_buffer */ |
|
209 | 262 | Py_TPFLAGS_DEFAULT | Py_TPFLAGS_BASETYPE, /* tp_flags */ |
|
210 | 263 | ZstdCompresssionWriter__doc__, /* tp_doc */ |
|
211 | 264 | 0, /* tp_traverse */ |
|
212 | 265 | 0, /* tp_clear */ |
|
213 | 266 | 0, /* tp_richcompare */ |
|
214 | 267 | 0, /* tp_weaklistoffset */ |
|
215 | 268 | 0, /* tp_iter */ |
|
216 | 269 | 0, /* tp_iternext */ |
|
217 | 270 | ZstdCompressionWriter_methods, /* tp_methods */ |
|
218 | 271 | 0, /* tp_members */ |
|
219 | 272 | 0, /* tp_getset */ |
|
220 | 273 | 0, /* tp_base */ |
|
221 | 274 | 0, /* tp_dict */ |
|
222 | 275 | 0, /* tp_descr_get */ |
|
223 | 276 | 0, /* tp_descr_set */ |
|
224 | 277 | 0, /* tp_dictoffset */ |
|
225 | 278 | 0, /* tp_init */ |
|
226 | 279 | 0, /* tp_alloc */ |
|
227 | 280 | PyType_GenericNew, /* tp_new */ |
|
228 | 281 | }; |
|
229 | 282 | |
|
230 | 283 | void compressionwriter_module_init(PyObject* mod) { |
|
231 | 284 | Py_TYPE(&ZstdCompressionWriterType) = &PyType_Type; |
|
232 | 285 | if (PyType_Ready(&ZstdCompressionWriterType) < 0) { |
|
233 | 286 | return; |
|
234 | 287 | } |
|
235 | 288 | } |
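
For context, the context-manager protocol this file implements maps onto Python usage roughly as sketched below (the output path is hypothetical; method names come from ``ZstdCompressionWriter_methods`` above)::

    import zstd

    cctx = zstd.ZstdCompressor(level=10)
    with open('/tmp/output.zst', 'wb') as fh:        # hypothetical destination
        with cctx.write_to(fh) as compressor:
            compressor.write(b'chunk 0')
            compressor.write(b'chunk 1')
            compressor.flush()               # ZSTD_flushStream(); frame stays open
            print(compressor.memory_size())  # valid only while the manager is active
            compressor.write(b'chunk 2')
    # __exit__ runs ZSTD_endStream() and writes the frame epilogue to fh
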
@@ -1,205 +1,250 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #include "python-zstandard.h" |
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | 13 | PyDoc_STRVAR(ZstdCompressionObj__doc__, |
|
14 | 14 | "Perform compression using a standard library compatible API.\n" |
|
15 | 15 | ); |
|
16 | 16 | |
|
17 | 17 | static void ZstdCompressionObj_dealloc(ZstdCompressionObj* self) { |
|
18 | 18 | PyMem_Free(self->output.dst); |
|
19 | 19 | self->output.dst = NULL; |
|
20 | 20 | |
|
21 | 21 | if (self->cstream) { |
|
22 | 22 | ZSTD_freeCStream(self->cstream); |
|
23 | 23 | self->cstream = NULL; |
|
24 | 24 | } |
|
25 | 25 | |
|
26 | 26 | Py_XDECREF(self->compressor); |
|
27 | 27 | |
|
28 | 28 | PyObject_Del(self); |
|
29 | 29 | } |
|
30 | 30 | |
|
31 | 31 | static PyObject* ZstdCompressionObj_compress(ZstdCompressionObj* self, PyObject* args) { |
|
32 | 32 | const char* source; |
|
33 | 33 | Py_ssize_t sourceSize; |
|
34 | 34 | ZSTD_inBuffer input; |
|
35 | 35 | size_t zresult; |
|
36 | 36 | PyObject* result = NULL; |
|
37 | 37 | Py_ssize_t resultSize = 0; |
|
38 | 38 | |
|
39 | if (self->flushed) { | 

40 | PyErr_SetString(ZstdError, "cannot call compress() after flush()"); | 

39 | if (self->finished) { | 

40 | PyErr_SetString(ZstdError, "cannot call compress() after compressor finished"); | 
|
41 | 41 | return NULL; |
|
42 | 42 | } |
|
43 | 43 | |
|
44 | 44 | #if PY_MAJOR_VERSION >= 3 |
|
45 | 45 | if (!PyArg_ParseTuple(args, "y#", &source, &sourceSize)) { |
|
46 | 46 | #else |
|
47 | 47 | if (!PyArg_ParseTuple(args, "s#", &source, &sourceSize)) { |
|
48 | 48 | #endif |
|
49 | 49 | return NULL; |
|
50 | 50 | } |
|
51 | 51 | |
|
52 | 52 | input.src = source; |
|
53 | 53 | input.size = sourceSize; |
|
54 | 54 | input.pos = 0; |
|
55 | 55 | |
|
56 | 56 | while ((ssize_t)input.pos < sourceSize) { |
|
57 | 57 | Py_BEGIN_ALLOW_THREADS |
|
58 | 58 | zresult = ZSTD_compressStream(self->cstream, &self->output, &input); |
|
59 | 59 | Py_END_ALLOW_THREADS |
|
60 | 60 | |
|
61 | 61 | if (ZSTD_isError(zresult)) { |
|
62 | 62 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); |
|
63 | 63 | return NULL; |
|
64 | 64 | } |
|
65 | 65 | |
|
66 | 66 | if (self->output.pos) { |
|
67 | 67 | if (result) { |
|
68 | 68 | resultSize = PyBytes_GET_SIZE(result); |
|
69 | 69 | if (-1 == _PyBytes_Resize(&result, resultSize + self->output.pos)) { |
|
70 | 70 | return NULL; |
|
71 | 71 | } |
|
72 | 72 | |
|
73 | 73 | memcpy(PyBytes_AS_STRING(result) + resultSize, |
|
74 | 74 | self->output.dst, self->output.pos); |
|
75 | 75 | } |
|
76 | 76 | else { |
|
77 | 77 | result = PyBytes_FromStringAndSize(self->output.dst, self->output.pos); |
|
78 | 78 | if (!result) { |
|
79 | 79 | return NULL; |
|
80 | 80 | } |
|
81 | 81 | } |
|
82 | 82 | |
|
83 | 83 | self->output.pos = 0; |
|
84 | 84 | } |
|
85 | 85 | } |
|
86 | 86 | |
|
87 | 87 | if (result) { |
|
88 | 88 | return result; |
|
89 | 89 | } |
|
90 | 90 | else { |
|
91 | 91 | return PyBytes_FromString(""); |
|
92 | 92 | } |
|
93 | 93 | } |
|
94 | 94 | |
|
95 | static PyObject* ZstdCompressionObj_flush(ZstdCompressionObj* self) { | |
|
95 | static PyObject* ZstdCompressionObj_flush(ZstdCompressionObj* self, PyObject* args) { | |
|
96 | int flushMode = compressorobj_flush_finish; | |
|
96 | 97 | size_t zresult; |
|
97 | 98 | PyObject* result = NULL; |
|
98 | 99 | Py_ssize_t resultSize = 0; |
|
99 | 100 | |
|
100 | if (self->flushed) { | |
|
101 | PyErr_SetString(ZstdError, "flush() already called"); | |
|
101 | if (!PyArg_ParseTuple(args, "|i", &flushMode)) { | |
|
102 | return NULL; | |
|
103 | } | |
|
104 | ||
|
105 | if (flushMode != compressorobj_flush_finish && flushMode != compressorobj_flush_block) { | |
|
106 | PyErr_SetString(PyExc_ValueError, "flush mode not recognized"); | |
|
107 | return NULL; | |
|
108 | } | |
|
109 | ||
|
110 | if (self->finished) { | |
|
111 | PyErr_SetString(ZstdError, "compressor object already finished"); | |
|
102 | 112 | return NULL; |
|
103 | 113 | } |
|
104 | 114 | |
|
105 | self->flushed = 1; | |
|
115 | assert(self->output.pos == 0); | |
|
116 | ||
|
117 | if (flushMode == compressorobj_flush_block) { | |
|
118 | /* The output buffer is of size ZSTD_CStreamOutSize(), which is | |
|
119 | guaranteed to hold a full block. */ | |
|
120 | Py_BEGIN_ALLOW_THREADS | |
|
121 | zresult = ZSTD_flushStream(self->cstream, &self->output); | |
|
122 | Py_END_ALLOW_THREADS | |
|
123 | ||
|
124 | if (ZSTD_isError(zresult)) { | |
|
125 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); | |
|
126 | return NULL; | |
|
127 | } | |
|
128 | ||
|
129 | /* Output buffer is guaranteed to hold full block. */ | |
|
130 | assert(zresult == 0); | |
|
131 | ||
|
132 | if (self->output.pos) { | |
|
133 | result = PyBytes_FromStringAndSize(self->output.dst, self->output.pos); | |
|
134 | if (!result) { | |
|
135 | return NULL; | |
|
136 | } | |
|
137 | } | |
|
138 | ||
|
139 | self->output.pos = 0; | |
|
140 | ||
|
141 | if (result) { | |
|
142 | return result; | |
|
143 | } | |
|
144 | else { | |
|
145 | return PyBytes_FromString(""); | |
|
146 | } | |
|
147 | } | |
|
148 | ||
|
149 | assert(flushMode == compressorobj_flush_finish); | |
|
150 | self->finished = 1; | |
|
106 | 151 | |
|
107 | 152 | while (1) { |
|
108 | 153 | zresult = ZSTD_endStream(self->cstream, &self->output); |
|
109 | 154 | if (ZSTD_isError(zresult)) { |
|
110 | 155 | PyErr_Format(ZstdError, "error ending compression stream: %s", |
|
111 | 156 | ZSTD_getErrorName(zresult)); |
|
112 | 157 | return NULL; |
|
113 | 158 | } |
|
114 | 159 | |
|
115 | 160 | if (self->output.pos) { |
|
116 | 161 | if (result) { |
|
117 | 162 | resultSize = PyBytes_GET_SIZE(result); |
|
118 | 163 | if (-1 == _PyBytes_Resize(&result, resultSize + self->output.pos)) { |
|
119 | 164 | return NULL; |
|
120 | 165 | } |
|
121 | 166 | |
|
122 | 167 | memcpy(PyBytes_AS_STRING(result) + resultSize, |
|
123 | 168 | self->output.dst, self->output.pos); |
|
124 | 169 | } |
|
125 | 170 | else { |
|
126 | 171 | result = PyBytes_FromStringAndSize(self->output.dst, self->output.pos); |
|
127 | 172 | if (!result) { |
|
128 | 173 | return NULL; |
|
129 | 174 | } |
|
130 | 175 | } |
|
131 | 176 | |
|
132 | 177 | self->output.pos = 0; |
|
133 | 178 | } |
|
134 | 179 | |
|
135 | 180 | if (!zresult) { |
|
136 | 181 | break; |
|
137 | 182 | } |
|
138 | 183 | } |
|
139 | 184 | |
|
140 | 185 | ZSTD_freeCStream(self->cstream); |
|
141 | 186 | self->cstream = NULL; |
|
142 | 187 | |
|
143 | 188 | if (result) { |
|
144 | 189 | return result; |
|
145 | 190 | } |
|
146 | 191 | else { |
|
147 | 192 | return PyBytes_FromString(""); |
|
148 | 193 | } |
|
149 | 194 | } |
|
150 | 195 | |
|
151 | 196 | static PyMethodDef ZstdCompressionObj_methods[] = { |
|
152 | 197 | { "compress", (PyCFunction)ZstdCompressionObj_compress, METH_VARARGS, |
|
153 | 198 | PyDoc_STR("compress data") }, |
|
154 | { "flush", (PyCFunction)ZstdCompressionObj_flush, METH_NOARGS, | 

199 | { "flush", (PyCFunction)ZstdCompressionObj_flush, METH_VARARGS, | 
|
155 | 200 | PyDoc_STR("finish compression operation") }, |
|
156 | 201 | { NULL, NULL } |
|
157 | 202 | }; |
|
158 | 203 | |
|
159 | 204 | PyTypeObject ZstdCompressionObjType = { |
|
160 | 205 | PyVarObject_HEAD_INIT(NULL, 0) |
|
161 | 206 | "zstd.ZstdCompressionObj", /* tp_name */ |
|
162 | 207 | sizeof(ZstdCompressionObj), /* tp_basicsize */ |
|
163 | 208 | 0, /* tp_itemsize */ |
|
164 | 209 | (destructor)ZstdCompressionObj_dealloc, /* tp_dealloc */ |
|
165 | 210 | 0, /* tp_print */ |
|
166 | 211 | 0, /* tp_getattr */ |
|
167 | 212 | 0, /* tp_setattr */ |
|
168 | 213 | 0, /* tp_compare */ |
|
169 | 214 | 0, /* tp_repr */ |
|
170 | 215 | 0, /* tp_as_number */ |
|
171 | 216 | 0, /* tp_as_sequence */ |
|
172 | 217 | 0, /* tp_as_mapping */ |
|
173 | 218 | 0, /* tp_hash */ |
|
174 | 219 | 0, /* tp_call */ |
|
175 | 220 | 0, /* tp_str */ |
|
176 | 221 | 0, /* tp_getattro */ |
|
177 | 222 | 0, /* tp_setattro */ |
|
178 | 223 | 0, /* tp_as_buffer */ |
|
179 | 224 | Py_TPFLAGS_DEFAULT | Py_TPFLAGS_BASETYPE, /* tp_flags */ |
|
180 | 225 | ZstdCompressionObj__doc__, /* tp_doc */ |
|
181 | 226 | 0, /* tp_traverse */ |
|
182 | 227 | 0, /* tp_clear */ |
|
183 | 228 | 0, /* tp_richcompare */ |
|
184 | 229 | 0, /* tp_weaklistoffset */ |
|
185 | 230 | 0, /* tp_iter */ |
|
186 | 231 | 0, /* tp_iternext */ |
|
187 | 232 | ZstdCompressionObj_methods, /* tp_methods */ |
|
188 | 233 | 0, /* tp_members */ |
|
189 | 234 | 0, /* tp_getset */ |
|
190 | 235 | 0, /* tp_base */ |
|
191 | 236 | 0, /* tp_dict */ |
|
192 | 237 | 0, /* tp_descr_get */ |
|
193 | 238 | 0, /* tp_descr_set */ |
|
194 | 239 | 0, /* tp_dictoffset */ |
|
195 | 240 | 0, /* tp_init */ |
|
196 | 241 | 0, /* tp_alloc */ |
|
197 | 242 | PyType_GenericNew, /* tp_new */ |
|
198 | 243 | }; |
|
199 | 244 | |
|
200 | 245 | void compressobj_module_init(PyObject* module) { |
|
201 | 246 | Py_TYPE(&ZstdCompressionObjType) = &PyType_Type; |
|
202 | 247 | if (PyType_Ready(&ZstdCompressionObjType) < 0) { |
|
203 | 248 | return; |
|
204 | 249 | } |
|
205 | 250 | } |
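
The flush-mode handling added above corresponds to the following Python-level usage. A sketch only; the two mode constants are registered in the constants hunk further below::

    import zstd

    cctx = zstd.ZstdCompressor()
    cobj = cctx.compressobj()

    chunks = [cobj.compress(b'message 1')]
    # COMPRESSOBJ_FLUSH_BLOCK maps to ZSTD_flushStream(): it completes a block
    # so everything fed in so far is decodable, but leaves the frame open.
    chunks.append(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK))
    chunks.append(cobj.compress(b'message 2'))
    # The default COMPRESSOBJ_FLUSH_FINISH mode runs ZSTD_endStream(); calling
    # compress() afterwards raises zstd.ZstdError.
    chunks.append(cobj.flush())
    compressed = b''.join(chunks)
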
@@ -1,757 +1,788 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #include "python-zstandard.h" |
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | int populate_cdict(ZstdCompressor* compressor, void* dictData, size_t dictSize, ZSTD_parameters* zparams) { | |
|
14 | ZSTD_customMem zmem; | |
|
15 | assert(!compressor->cdict); | |
|
16 | Py_BEGIN_ALLOW_THREADS | |
|
17 | memset(&zmem, 0, sizeof(zmem)); | |
|
18 | compressor->cdict = ZSTD_createCDict_advanced(compressor->dict->dictData, | |
|
19 | compressor->dict->dictSize, *zparams, zmem); | |
|
20 | Py_END_ALLOW_THREADS | |
|
21 | ||
|
22 | if (!compressor->cdict) { | |
|
23 | PyErr_SetString(ZstdError, "could not create compression dictionary"); | |
|
24 | return 1; | |
|
25 | } | |
|
26 | ||
|
27 | return 0; | |
|
28 | } | |
|
29 | ||
|
13 | 30 | /** |
|
14 | 31 | * Initialize a zstd CStream from a ZstdCompressor instance. |
|
15 | 32 | * |
|
16 | 33 | * Returns a ZSTD_CStream on success or NULL on failure. If NULL, a Python |
|
17 | 34 | * exception will be set. |
|
18 | 35 | */ |
|
19 | 36 | ZSTD_CStream* CStream_from_ZstdCompressor(ZstdCompressor* compressor, Py_ssize_t sourceSize) { |
|
20 | 37 | ZSTD_CStream* cstream; |
|
21 | 38 | ZSTD_parameters zparams; |
|
22 | 39 | void* dictData = NULL; |
|
23 | 40 | size_t dictSize = 0; |
|
24 | 41 | size_t zresult; |
|
25 | 42 | |
|
26 | 43 | cstream = ZSTD_createCStream(); |
|
27 | 44 | if (!cstream) { |
|
28 | 45 | PyErr_SetString(ZstdError, "cannot create CStream"); |
|
29 | 46 | return NULL; |
|
30 | 47 | } |
|
31 | 48 | |
|
32 | 49 | if (compressor->dict) { |
|
33 | 50 | dictData = compressor->dict->dictData; |
|
34 | 51 | dictSize = compressor->dict->dictSize; |
|
35 | 52 | } |
|
36 | 53 | |
|
37 | 54 | memset(&zparams, 0, sizeof(zparams)); |
|
38 | 55 | if (compressor->cparams) { |
|
39 | 56 | ztopy_compression_parameters(compressor->cparams, &zparams.cParams); |
|
40 | 57 | /* Do NOT call ZSTD_adjustCParams() here because the compression params |
|
41 | 58 | come from the user. */ |
|
42 | 59 | } |
|
43 | 60 | else { |
|
44 | 61 | zparams.cParams = ZSTD_getCParams(compressor->compressionLevel, sourceSize, dictSize); |
|
45 | 62 | } |
|
46 | 63 | |
|
47 | 64 | zparams.fParams = compressor->fparams; |
|
48 | 65 | |
|
49 | 66 | zresult = ZSTD_initCStream_advanced(cstream, dictData, dictSize, zparams, sourceSize); |
|
50 | 67 | |
|
51 | 68 | if (ZSTD_isError(zresult)) { |
|
52 | 69 | ZSTD_freeCStream(cstream); |
|
53 | 70 | PyErr_Format(ZstdError, "cannot init CStream: %s", ZSTD_getErrorName(zresult)); |
|
54 | 71 | return NULL; |
|
55 | 72 | } |
|
56 | 73 | |
|
57 | 74 | return cstream; |
|
58 | 75 | } |
|
59 | 76 | |
|
60 | ||
|
61 | 77 | PyDoc_STRVAR(ZstdCompressor__doc__, |
|
62 | 78 | "ZstdCompressor(level=None, dict_data=None, compression_params=None)\n" |
|
63 | 79 | "\n" |
|
64 | 80 | "Create an object used to perform Zstandard compression.\n" |
|
65 | 81 | "\n" |
|
66 | 82 | "An instance can compress data various ways. Instances can be used multiple\n" |
|
67 | 83 | "times. Each compression operation will use the compression parameters\n" |
|
68 | 84 | "defined at construction time.\n" |
|
69 | 85 | "\n" |
|
70 | 86 | "Compression can be configured via the following names arguments:\n" |
|
71 | 87 | "\n" |
|
72 | 88 | "level\n" |
|
73 | 89 | " Integer compression level.\n" |
|
74 | 90 | "dict_data\n" |
|
75 | 91 | " A ``ZstdCompressionDict`` to be used to compress with dictionary data.\n" |
|
76 | 92 | "compression_params\n" |
|
77 | 93 | " A ``CompressionParameters`` instance defining low-level compression" |
|
78 | 94 | " parameters. If defined, this will overwrite the ``level`` argument.\n" |
|
79 | 95 | "write_checksum\n" |
|
80 | 96 | " If True, a 4 byte content checksum will be written with the compressed\n" |
|
81 | 97 | " data, allowing the decompressor to perform content verification.\n" |
|
82 | 98 | "write_content_size\n" |
|
83 | 99 | " If True, the decompressed content size will be included in the header of\n" |
|
84 | 100 | " the compressed data. This data will only be written if the compressor\n" |
|
85 | 101 | " knows the size of the input data.\n" |
|
86 | 102 | "write_dict_id\n" |
|
87 | 103 | " Determines whether the dictionary ID will be written into the compressed\n" |
|
88 | 104 | " data. Defaults to True. Only adds content to the compressed data if\n" |
|
89 | 105 | " a dictionary is being used.\n" |
|
90 | 106 | ); |
|
91 | 107 | |
|
92 | 108 | static int ZstdCompressor_init(ZstdCompressor* self, PyObject* args, PyObject* kwargs) { |
|
93 | 109 | static char* kwlist[] = { |
|
94 | 110 | "level", |
|
95 | 111 | "dict_data", |
|
96 | 112 | "compression_params", |
|
97 | 113 | "write_checksum", |
|
98 | 114 | "write_content_size", |
|
99 | 115 | "write_dict_id", |
|
100 | 116 | NULL |
|
101 | 117 | }; |
|
102 | 118 | |
|
103 | 119 | int level = 3; |
|
104 | 120 | ZstdCompressionDict* dict = NULL; |
|
105 | 121 | CompressionParametersObject* params = NULL; |
|
106 | 122 | PyObject* writeChecksum = NULL; |
|
107 | 123 | PyObject* writeContentSize = NULL; |
|
108 | 124 | PyObject* writeDictID = NULL; |
|
109 | 125 | |
|
126 | self->cctx = NULL; | |
|
110 | 127 | self->dict = NULL; |
|
111 | 128 | self->cparams = NULL; |
|
112 | 129 | self->cdict = NULL; |
|
113 | 130 | |
|
114 | 131 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|iO!O!OOO", kwlist, |
|
115 | 132 | &level, &ZstdCompressionDictType, &dict, |
|
116 | 133 | &CompressionParametersType, ¶ms, |
|
117 | 134 | &writeChecksum, &writeContentSize, &writeDictID)) { |
|
118 | 135 | return -1; |
|
119 | 136 | } |
|
120 | 137 | |
|
121 | 138 | if (level < 1) { |
|
122 | 139 | PyErr_SetString(PyExc_ValueError, "level must be greater than 0"); |
|
123 | 140 | return -1; |
|
124 | 141 | } |
|
125 | 142 | |
|
126 | 143 | if (level > ZSTD_maxCLevel()) { |
|
127 | 144 | PyErr_Format(PyExc_ValueError, "level must be less than %d", |
|
128 | 145 | ZSTD_maxCLevel() + 1); |
|
129 | 146 | return -1; |
|
130 | 147 | } |
|
131 | 148 | |
|
149 | /* We create a ZSTD_CCtx for reuse among multiple operations to reduce the | |
|
150 | overhead of each compression operation. */ | |
|
151 | self->cctx = ZSTD_createCCtx(); | |
|
152 | if (!self->cctx) { | |
|
153 | PyErr_NoMemory(); | |
|
154 | return -1; | |
|
155 | } | |
|
156 | ||
|
132 | 157 | self->compressionLevel = level; |
|
133 | 158 | |
|
134 | 159 | if (dict) { |
|
135 | 160 | self->dict = dict; |
|
136 | 161 | Py_INCREF(dict); |
|
137 | 162 | } |
|
138 | 163 | |
|
139 | 164 | if (params) { |
|
140 | 165 | self->cparams = params; |
|
141 | 166 | Py_INCREF(params); |
|
142 | 167 | } |
|
143 | 168 | |
|
144 | 169 | memset(&self->fparams, 0, sizeof(self->fparams)); |
|
145 | 170 | |
|
146 | 171 | if (writeChecksum && PyObject_IsTrue(writeChecksum)) { |
|
147 | 172 | self->fparams.checksumFlag = 1; |
|
148 | 173 | } |
|
149 | 174 | if (writeContentSize && PyObject_IsTrue(writeContentSize)) { |
|
150 | 175 | self->fparams.contentSizeFlag = 1; |
|
151 | 176 | } |
|
152 | 177 | if (writeDictID && PyObject_Not(writeDictID)) { |
|
153 | 178 | self->fparams.noDictIDFlag = 1; |
|
154 | 179 | } |
|
155 | 180 | |
|
156 | 181 | return 0; |
|
157 | 182 | } |
|
158 | 183 | |
|
159 | 184 | static void ZstdCompressor_dealloc(ZstdCompressor* self) { |
|
160 | 185 | Py_XDECREF(self->cparams); |
|
161 | 186 | Py_XDECREF(self->dict); |
|
162 | 187 | |
|
163 | 188 | if (self->cdict) { |
|
164 | 189 | ZSTD_freeCDict(self->cdict); |
|
165 | 190 | self->cdict = NULL; |
|
166 | 191 | } |
|
167 | 192 | |
|
193 | if (self->cctx) { | |
|
194 | ZSTD_freeCCtx(self->cctx); | |
|
195 | self->cctx = NULL; | |
|
196 | } | |
|
197 | ||
|
168 | 198 | PyObject_Del(self); |
|
169 | 199 | } |
|
170 | 200 | |
|
171 | 201 | PyDoc_STRVAR(ZstdCompressor_copy_stream__doc__, |
|
172 | 202 | "copy_stream(ifh, ofh[, size=0, read_size=default, write_size=default])\n" |
|
173 | 203 | "compress data between streams\n" |
|
174 | 204 | "\n" |
|
175 | 205 | "Data will be read from ``ifh``, compressed, and written to ``ofh``.\n" |
|
176 | 206 | "``ifh`` must have a ``read(size)`` method. ``ofh`` must have a ``write(data)``\n" |
|
177 | 207 | "method.\n" |
|
178 | 208 | "\n" |
|
179 | 209 | "An optional ``size`` argument specifies the size of the source stream.\n" |
|
180 | 210 | "If defined, compression parameters will be tuned based on the size.\n" |
|
181 | 211 | "\n" |
|
182 | 212 | "Optional arguments ``read_size`` and ``write_size`` define the chunk sizes\n" |
|
183 | 213 | "of ``read()`` and ``write()`` operations, respectively. By default, they use\n" |
|
184 | 214 | "the default compression stream input and output sizes, respectively.\n" |
|
185 | 215 | ); |
|
186 | 216 | |
|
187 | 217 | static PyObject* ZstdCompressor_copy_stream(ZstdCompressor* self, PyObject* args, PyObject* kwargs) { |
|
188 | 218 | static char* kwlist[] = { |
|
189 | 219 | "ifh", |
|
190 | 220 | "ofh", |
|
191 | 221 | "size", |
|
192 | 222 | "read_size", |
|
193 | 223 | "write_size", |
|
194 | 224 | NULL |
|
195 | 225 | }; |
|
196 | 226 | |
|
197 | 227 | PyObject* source; |
|
198 | 228 | PyObject* dest; |
|
199 | 229 | Py_ssize_t sourceSize = 0; |
|
200 | 230 | size_t inSize = ZSTD_CStreamInSize(); |
|
201 | 231 | size_t outSize = ZSTD_CStreamOutSize(); |
|
202 | 232 | ZSTD_CStream* cstream; |
|
203 | 233 | ZSTD_inBuffer input; |
|
204 | 234 | ZSTD_outBuffer output; |
|
205 | 235 | Py_ssize_t totalRead = 0; |
|
206 | 236 | Py_ssize_t totalWrite = 0; |
|
207 | 237 | char* readBuffer; |
|
208 | 238 | Py_ssize_t readSize; |
|
209 | 239 | PyObject* readResult; |
|
210 | 240 | PyObject* res = NULL; |
|
211 | 241 | size_t zresult; |
|
212 | 242 | PyObject* writeResult; |
|
213 | 243 | PyObject* totalReadPy; |
|
214 | 244 | PyObject* totalWritePy; |
|
215 | 245 | |
|
216 | 246 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "OO|nkk", kwlist, &source, &dest, &sourceSize, |
|
217 | 247 | &inSize, &outSize)) { |
|
218 | 248 | return NULL; |
|
219 | 249 | } |
|
220 | 250 | |
|
221 | 251 | if (!PyObject_HasAttrString(source, "read")) { |
|
222 | 252 | PyErr_SetString(PyExc_ValueError, "first argument must have a read() method"); |
|
223 | 253 | return NULL; |
|
224 | 254 | } |
|
225 | 255 | |
|
226 | 256 | if (!PyObject_HasAttrString(dest, "write")) { |
|
227 | 257 | PyErr_SetString(PyExc_ValueError, "second argument must have a write() method"); |
|
228 | 258 | return NULL; |
|
229 | 259 | } |
|
230 | 260 | |
|
231 | 261 | cstream = CStream_from_ZstdCompressor(self, sourceSize); |
|
232 | 262 | if (!cstream) { |
|
233 | 263 | res = NULL; |
|
234 | 264 | goto finally; |
|
235 | 265 | } |
|
236 | 266 | |
|
237 | 267 | output.dst = PyMem_Malloc(outSize); |
|
238 | 268 | if (!output.dst) { |
|
239 | 269 | PyErr_NoMemory(); |
|
240 | 270 | res = NULL; |
|
241 | 271 | goto finally; |
|
242 | 272 | } |
|
243 | 273 | output.size = outSize; |
|
244 | 274 | output.pos = 0; |
|
245 | 275 | |
|
246 | 276 | while (1) { |
|
247 | 277 | /* Try to read from source stream. */ |
|
248 | 278 | readResult = PyObject_CallMethod(source, "read", "n", inSize); |
|
249 | 279 | if (!readResult) { |
|
250 | 280 | PyErr_SetString(ZstdError, "could not read() from source"); |
|
251 | 281 | goto finally; |
|
252 | 282 | } |
|
253 | 283 | |
|
254 | 284 | PyBytes_AsStringAndSize(readResult, &readBuffer, &readSize); |
|
255 | 285 | |
|
256 | 286 | /* If no data was read, we're at EOF. */ |
|
257 | 287 | if (0 == readSize) { |
|
258 | 288 | break; |
|
259 | 289 | } |
|
260 | 290 | |
|
261 | 291 | totalRead += readSize; |
|
262 | 292 | |
|
263 | 293 | /* Send data to compressor */ |
|
264 | 294 | input.src = readBuffer; |
|
265 | 295 | input.size = readSize; |
|
266 | 296 | input.pos = 0; |
|
267 | 297 | |
|
268 | 298 | while (input.pos < input.size) { |
|
269 | 299 | Py_BEGIN_ALLOW_THREADS |
|
270 | 300 | zresult = ZSTD_compressStream(cstream, &output, &input); |
|
271 | 301 | Py_END_ALLOW_THREADS |
|
272 | 302 | |
|
273 | 303 | if (ZSTD_isError(zresult)) { |
|
274 | 304 | res = NULL; |
|
275 | 305 | PyErr_Format(ZstdError, "zstd compress error: %s", ZSTD_getErrorName(zresult)); |
|
276 | 306 | goto finally; |
|
277 | 307 | } |
|
278 | 308 | |
|
279 | 309 | if (output.pos) { |
|
280 | 310 | #if PY_MAJOR_VERSION >= 3 |
|
281 | 311 | writeResult = PyObject_CallMethod(dest, "write", "y#", |
|
282 | 312 | #else |
|
283 | 313 | writeResult = PyObject_CallMethod(dest, "write", "s#", |
|
284 | 314 | #endif |
|
285 | 315 | output.dst, output.pos); |
|
286 | 316 | Py_XDECREF(writeResult); |
|
287 | 317 | totalWrite += output.pos; |
|
288 | 318 | output.pos = 0; |
|
289 | 319 | } |
|
290 | 320 | } |
|
291 | 321 | } |
|
292 | 322 | |
|
293 | 323 | /* We've finished reading. Now flush the compressor stream. */ |
|
294 | 324 | while (1) { |
|
295 | 325 | zresult = ZSTD_endStream(cstream, &output); |
|
296 | 326 | if (ZSTD_isError(zresult)) { |
|
297 | 327 | PyErr_Format(ZstdError, "error ending compression stream: %s", |
|
298 | 328 | ZSTD_getErrorName(zresult)); |
|
299 | 329 | res = NULL; |
|
300 | 330 | goto finally; |
|
301 | 331 | } |
|
302 | 332 | |
|
303 | 333 | if (output.pos) { |
|
304 | 334 | #if PY_MAJOR_VERSION >= 3 |
|
305 | 335 | writeResult = PyObject_CallMethod(dest, "write", "y#", |
|
306 | 336 | #else |
|
307 | 337 | writeResult = PyObject_CallMethod(dest, "write", "s#", |
|
308 | 338 | #endif |
|
309 | 339 | output.dst, output.pos); |
|
310 | 340 | totalWrite += output.pos; |
|
311 | 341 | Py_XDECREF(writeResult); |
|
312 | 342 | output.pos = 0; |
|
313 | 343 | } |
|
314 | 344 | |
|
315 | 345 | if (!zresult) { |
|
316 | 346 | break; |
|
317 | 347 | } |
|
318 | 348 | } |
|
319 | 349 | |
|
320 | 350 | ZSTD_freeCStream(cstream); |
|
321 | 351 | cstream = NULL; |
|
322 | 352 | |
|
323 | 353 | totalReadPy = PyLong_FromSsize_t(totalRead); |
|
324 | 354 | totalWritePy = PyLong_FromSsize_t(totalWrite); |
|
325 | 355 | res = PyTuple_Pack(2, totalReadPy, totalWritePy); |
|
326 | 356 | Py_DecRef(totalReadPy); |
|
327 | 357 | Py_DecRef(totalWritePy); |
|
328 | 358 | |
|
329 | 359 | finally: |
|
330 | 360 | if (output.dst) { |
|
331 | 361 | PyMem_Free(output.dst); |
|
332 | 362 | } |
|
333 | 363 | |
|
334 | 364 | if (cstream) { |
|
335 | 365 | ZSTD_freeCStream(cstream); |
|
336 | 366 | } |
|
337 | 367 | |
|
338 | 368 | return res; |
|
339 | 369 | } |
|
340 | 370 | |
|
341 | 371 | PyDoc_STRVAR(ZstdCompressor_compress__doc__, |
|
342 | "compress(data)\n" | |
|
372 | "compress(data, allow_empty=False)\n" | |
|
343 | 373 | "\n" |
|
344 | 374 | "Compress data in a single operation.\n" |
|
345 | 375 | "\n" |
|
346 | 376 | "This is the simplest mechanism to perform compression: simply pass in a\n" |
|
347 | 377 | "value and get a compressed value back. It is almost the most prone to abuse.\n" |
|
348 | 378 | "The input and output values must fit in memory, so passing in very large\n" |
|
349 | 379 | "values can result in excessive memory usage. For this reason, one of the\n" |
|
350 | 380 | "streaming based APIs is preferred for larger values.\n" |
|
351 | 381 | ); |
|
352 | 382 | |
|
353 | static PyObject* ZstdCompressor_compress(ZstdCompressor* self, PyObject* args) { | |
|
383 | static PyObject* ZstdCompressor_compress(ZstdCompressor* self, PyObject* args, PyObject* kwargs) { | |
|
384 | static char* kwlist[] = { | |
|
385 | "data", | |
|
386 | "allow_empty", | |
|
387 | NULL | |
|
388 | }; | |
|
389 | ||
|
354 | 390 | const char* source; |
|
355 | 391 | Py_ssize_t sourceSize; |
|
392 | PyObject* allowEmpty = NULL; | |
|
356 | 393 | size_t destSize; |
|
357 | ZSTD_CCtx* cctx; | |
|
358 | 394 | PyObject* output; |
|
359 | 395 | char* dest; |
|
360 | 396 | void* dictData = NULL; |
|
361 | 397 | size_t dictSize = 0; |
|
362 | 398 | size_t zresult; |
|
363 | 399 | ZSTD_parameters zparams; |
|
364 | ZSTD_customMem zmem; | |
|
365 | 400 | |
|
366 | 401 | #if PY_MAJOR_VERSION >= 3 |
|
367 | if (!PyArg_ParseTuple(args, "y#", &source, &sourceSize)) { | |
|
402 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "y#|O", | |
|
368 | 403 | #else |
|
369 | if (!PyArg_ParseTuple(args, "s#", &source, &sourceSize)) { | |
|
404 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "s#|O", | |
|
370 | 405 | #endif |
|
406 | kwlist, &source, &sourceSize, &allowEmpty)) { | |
|
407 | return NULL; | |
|
408 | } | |
|
409 | ||
|
410 | /* A limitation in the zstd C API prevents the decompression side from | 

411 | distinguishing between a content size of 0 and an unknown content size. This | 

412 | can make round tripping via Python difficult. Until this is fixed, require an | 

413 | explicit flag to fire the footgun. | 
|
414 | https://github.com/indygreg/python-zstandard/issues/11 */ | |
|
415 | if (0 == sourceSize && self->fparams.contentSizeFlag | |
|
416 | && (!allowEmpty || PyObject_Not(allowEmpty))) { | |
|
417 | PyErr_SetString(PyExc_ValueError, "cannot write empty inputs when writing content sizes"); | |
|
371 | 418 | return NULL; |
|
372 | 419 | } |
|
373 | 420 | |
|
374 | 421 | destSize = ZSTD_compressBound(sourceSize); |
|
375 | 422 | output = PyBytes_FromStringAndSize(NULL, destSize); |
|
376 | 423 | if (!output) { |
|
377 | 424 | return NULL; |
|
378 | 425 | } |
|
379 | 426 | |
|
380 | 427 | dest = PyBytes_AsString(output); |
|
381 | 428 | |
|
382 | cctx = ZSTD_createCCtx(); | |
|
383 | if (!cctx) { | |
|
384 | Py_DECREF(output); | |
|
385 | PyErr_SetString(ZstdError, "could not create CCtx"); | |
|
386 | return NULL; | |
|
387 | } | |
|
388 | ||
|
389 | 429 | if (self->dict) { |
|
390 | 430 | dictData = self->dict->dictData; |
|
391 | 431 | dictSize = self->dict->dictSize; |
|
392 | 432 | } |
|
393 | 433 | |
|
394 | 434 | memset(&zparams, 0, sizeof(zparams)); |
|
395 | 435 | if (!self->cparams) { |
|
396 | 436 | zparams.cParams = ZSTD_getCParams(self->compressionLevel, sourceSize, dictSize); |
|
397 | 437 | } |
|
398 | 438 | else { |
|
399 | 439 | ztopy_compression_parameters(self->cparams, &zparams.cParams); |
|
400 | 440 | /* Do NOT call ZSTD_adjustCParams() here because the compression params |
|
401 | 441 | come from the user. */ |
|
402 | 442 | } |
|
403 | 443 | |
|
404 | 444 | zparams.fParams = self->fparams; |
|
405 | 445 | |
|
406 | 446 | /* The raw dict data has to be processed before it can be used. Since this |
|
407 | 447 | adds overhead - especially if multiple dictionary compression operations |
|
408 | 448 | are performed on the same ZstdCompressor instance - we create a |
|
409 | ZSTD_CDict once and reuse it for all operations. */ | 

449 | ZSTD_CDict once and reuse it for all operations. | 
|
410 | 450 |
|
|
411 | /* TODO the zparams (which can be derived from the source data size) used | |
|
412 | on first invocation are effectively reused for subsequent operations. This | |
|
413 | may not be appropriate if input sizes vary significantly and could affect | |
|
414 | chosen compression parameters. | |
|
415 | https://github.com/facebook/zstd/issues/358 tracks this issue. */ | |
|
451 | Note: the compression parameters used for the first invocation (possibly | |
|
452 | derived from the source size) will be reused on all subsequent invocations. | |
|
453 | https://github.com/facebook/zstd/issues/358 contains more info. We could | |
|
454 | potentially add an argument somewhere to control this behavior. | |
|
455 | */ | |
|
416 | 456 | if (dictData && !self->cdict) { |
|
417 | Py_BEGIN_ALLOW_THREADS | |
|
418 | memset(&zmem, 0, sizeof(zmem)); | |
|
419 | self->cdict = ZSTD_createCDict_advanced(dictData, dictSize, zparams, zmem); | |
|
420 | Py_END_ALLOW_THREADS | |
|
421 | ||
|
422 | if (!self->cdict) { | |
|
457 | if (populate_cdict(self, dictData, dictSize, &zparams)) { | |
|
423 | 458 | Py_DECREF(output); |
|
424 | ZSTD_freeCCtx(cctx); | |
|
425 | PyErr_SetString(ZstdError, "could not create compression dictionary"); | |
|
426 | 459 | return NULL; |
|
427 | 460 | } |
|
428 | 461 | } |
|
429 | 462 | |
|
430 | 463 | Py_BEGIN_ALLOW_THREADS |
|
431 | 464 | /* By avoiding ZSTD_compress(), we don't necessarily write out content |
|
432 | 465 | size. This means the argument to ZstdCompressor to control frame |
|
433 | 466 | parameters is honored. */ |
|
434 | 467 | if (self->cdict) { |
|
435 | zresult = ZSTD_compress_usingCDict(cctx, dest, destSize, | |
|
468 | zresult = ZSTD_compress_usingCDict(self->cctx, dest, destSize, | |
|
436 | 469 | source, sourceSize, self->cdict); |
|
437 | 470 | } |
|
438 | 471 | else { |
|
439 | zresult = ZSTD_compress_advanced(cctx, dest, destSize, | |
|
472 | zresult = ZSTD_compress_advanced(self->cctx, dest, destSize, | |
|
440 | 473 | source, sourceSize, dictData, dictSize, zparams); |
|
441 | 474 | } |
|
442 | 475 | Py_END_ALLOW_THREADS |
|
443 | 476 | |
|
444 | ZSTD_freeCCtx(cctx); | |
|
445 | ||
|
446 | 477 | if (ZSTD_isError(zresult)) { |
|
447 | 478 | PyErr_Format(ZstdError, "cannot compress: %s", ZSTD_getErrorName(zresult)); |
|
448 | 479 | Py_CLEAR(output); |
|
449 | 480 | return NULL; |
|
450 | 481 | } |
|
451 | 482 | else { |
|
452 | 483 | Py_SIZE(output) = zresult; |
|
453 | 484 | } |
|
454 | 485 | |
|
455 | 486 | return output; |
|
456 | 487 | } |
|
457 | 488 | |
|
458 | 489 | PyDoc_STRVAR(ZstdCompressionObj__doc__, |
|
459 | 490 | "compressobj()\n" |
|
460 | 491 | "\n" |
|
461 | 492 | "Return an object exposing ``compress(data)`` and ``flush()`` methods.\n" |
|
462 | 493 | "\n" |
|
463 | 494 | "The returned object exposes an API similar to ``zlib.compressobj`` and\n" |
|
464 | 495 | "``bz2.BZ2Compressor`` so that callers can swap in the zstd compressor\n" |
|
465 | 496 | "without changing how compression is performed.\n" |
|
466 | 497 | ); |
|
467 | 498 | |
|
468 | 499 | static ZstdCompressionObj* ZstdCompressor_compressobj(ZstdCompressor* self, PyObject* args, PyObject* kwargs) { |
|
469 | 500 | static char* kwlist[] = { |
|
470 | 501 | "size", |
|
471 | 502 | NULL |
|
472 | 503 | }; |
|
473 | 504 | |
|
474 | 505 | Py_ssize_t inSize = 0; |
|
475 | 506 | size_t outSize = ZSTD_CStreamOutSize(); |
|
476 | 507 | ZstdCompressionObj* result = PyObject_New(ZstdCompressionObj, &ZstdCompressionObjType); |
|
477 | 508 | if (!result) { |
|
478 | 509 | return NULL; |
|
479 | 510 | } |
|
480 | 511 | |
|
481 | 512 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "|n", kwlist, &inSize)) { |
|
482 | 513 | return NULL; |
|
483 | 514 | } |
|
484 | 515 | |
|
485 | 516 | result->cstream = CStream_from_ZstdCompressor(self, inSize); |
|
486 | 517 | if (!result->cstream) { |
|
487 | 518 | Py_DECREF(result); |
|
488 | 519 | return NULL; |
|
489 | 520 | } |
|
490 | 521 | |
|
491 | 522 | result->output.dst = PyMem_Malloc(outSize); |
|
492 | 523 | if (!result->output.dst) { |
|
493 | 524 | PyErr_NoMemory(); |
|
494 | 525 | Py_DECREF(result); |
|
495 | 526 | return NULL; |
|
496 | 527 | } |
|
497 | 528 | result->output.size = outSize; |
|
498 | 529 | result->output.pos = 0; |
|
499 | 530 | |
|
500 | 531 | result->compressor = self; |
|
501 | 532 | Py_INCREF(result->compressor); |
|
502 | 533 | |
|
503 | result->flushed = 0; | 

534 | result->finished = 0; | 
|
504 | 535 | |
|
505 | 536 | return result; |
|
506 | 537 | } |
|
507 | 538 | |
|
508 | 539 | PyDoc_STRVAR(ZstdCompressor_read_from__doc__, |
|
509 | 540 | "read_from(reader, [size=0, read_size=default, write_size=default])\n" |
|
510 | 541 | "Read uncompress data from a reader and return an iterator\n" |
|
511 | 542 | "\n" |
|
512 | 543 | "Returns an iterator of compressed data produced from reading from ``reader``.\n" |
|
513 | 544 | "\n" |
|
514 | 545 | "Uncompressed data will be obtained from ``reader`` by calling the\n" |
|
515 | 546 | "``read(size)`` method of it. The source data will be streamed into a\n" |
|
516 | 547 | "compressor. As compressed data is available, it will be exposed to the\n" |
|
517 | 548 | "iterator.\n" |
|
518 | 549 | "\n" |
|
519 | 550 | "Data is read from the source in chunks of ``read_size``. Compressed chunks\n" |
|
520 | 551 | "are at most ``write_size`` bytes. Both values default to the zstd input and\n" |
|
521 | 552 | "and output defaults, respectively.\n" |
|
522 | 553 | "\n" |
|
523 | 554 | "The caller is partially in control of how fast data is fed into the\n" |
|
524 | 555 | "compressor by how it consumes the returned iterator. The compressor will\n" |
|
525 | 556 | "not consume from the reader unless the caller consumes from the iterator.\n" |
|
526 | 557 | ); |
|
527 | 558 | |
|
528 | 559 | static ZstdCompressorIterator* ZstdCompressor_read_from(ZstdCompressor* self, PyObject* args, PyObject* kwargs) { |
|
529 | 560 | static char* kwlist[] = { |
|
530 | 561 | "reader", |
|
531 | 562 | "size", |
|
532 | 563 | "read_size", |
|
533 | 564 | "write_size", |
|
534 | 565 | NULL |
|
535 | 566 | }; |
|
536 | 567 | |
|
537 | 568 | PyObject* reader; |
|
538 | 569 | Py_ssize_t sourceSize = 0; |
|
539 | 570 | size_t inSize = ZSTD_CStreamInSize(); |
|
540 | 571 | size_t outSize = ZSTD_CStreamOutSize(); |
|
541 | 572 | ZstdCompressorIterator* result; |
|
542 | 573 | |
|
543 | 574 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|nkk", kwlist, &reader, &sourceSize, |
|
544 | 575 | &inSize, &outSize)) { |
|
545 | 576 | return NULL; |
|
546 | 577 | } |
|
547 | 578 | |
|
548 | 579 | result = PyObject_New(ZstdCompressorIterator, &ZstdCompressorIteratorType); |
|
549 | 580 | if (!result) { |
|
550 | 581 | return NULL; |
|
551 | 582 | } |
|
552 | 583 | |
|
553 | 584 | result->compressor = NULL; |
|
554 | 585 | result->reader = NULL; |
|
555 | 586 | result->buffer = NULL; |
|
556 | 587 | result->cstream = NULL; |
|
557 | 588 | result->input.src = NULL; |
|
558 | 589 | result->output.dst = NULL; |
|
559 | 590 | result->readResult = NULL; |
|
560 | 591 | |
|
561 | 592 | if (PyObject_HasAttrString(reader, "read")) { |
|
562 | 593 | result->reader = reader; |
|
563 | 594 | Py_INCREF(result->reader); |
|
564 | 595 | } |
|
565 | 596 | else if (1 == PyObject_CheckBuffer(reader)) { |
|
566 | 597 | result->buffer = PyMem_Malloc(sizeof(Py_buffer)); |
|
567 | 598 | if (!result->buffer) { |
|
568 | 599 | goto except; |
|
569 | 600 | } |
|
570 | 601 | |
|
571 | 602 | memset(result->buffer, 0, sizeof(Py_buffer)); |
|
572 | 603 | |
|
573 | 604 | if (0 != PyObject_GetBuffer(reader, result->buffer, PyBUF_CONTIG_RO)) { |
|
574 | 605 | goto except; |
|
575 | 606 | } |
|
576 | 607 | |
|
577 | 608 | result->bufferOffset = 0; |
|
578 | 609 | sourceSize = result->buffer->len; |
|
579 | 610 | } |
|
580 | 611 | else { |
|
581 | 612 | PyErr_SetString(PyExc_ValueError, |
|
582 | 613 | "must pass an object with a read() method or conforms to buffer protocol"); |
|
583 | 614 | goto except; |
|
584 | 615 | } |
|
585 | 616 | |
|
586 | 617 | result->compressor = self; |
|
587 | 618 | Py_INCREF(result->compressor); |
|
588 | 619 | |
|
589 | 620 | result->sourceSize = sourceSize; |
|
590 | 621 | result->cstream = CStream_from_ZstdCompressor(self, sourceSize); |
|
591 | 622 | if (!result->cstream) { |
|
592 | 623 | goto except; |
|
593 | 624 | } |
|
594 | 625 | |
|
595 | 626 | result->inSize = inSize; |
|
596 | 627 | result->outSize = outSize; |
|
597 | 628 | |
|
598 | 629 | result->output.dst = PyMem_Malloc(outSize); |
|
599 | 630 | if (!result->output.dst) { |
|
600 | 631 | PyErr_NoMemory(); |
|
601 | 632 | goto except; |
|
602 | 633 | } |
|
603 | 634 | result->output.size = outSize; |
|
604 | 635 | result->output.pos = 0; |
|
605 | 636 | |
|
606 | 637 | result->input.src = NULL; |
|
607 | 638 | result->input.size = 0; |
|
608 | 639 | result->input.pos = 0; |
|
609 | 640 | |
|
610 | 641 | result->finishedInput = 0; |
|
611 | 642 | result->finishedOutput = 0; |
|
612 | 643 | |
|
613 | 644 | goto finally; |
|
614 | 645 | |
|
615 | 646 | except: |
|
616 | 647 | if (result->cstream) { |
|
617 | 648 | ZSTD_freeCStream(result->cstream); |
|
618 | 649 | result->cstream = NULL; |
|
619 | 650 | } |
|
620 | 651 | |
|
621 | 652 | Py_DecRef((PyObject*)result->compressor); |
|
622 | 653 | Py_DecRef(result->reader); |
|
623 | 654 | |
|
624 | 655 | Py_DECREF(result); |
|
625 | 656 | result = NULL; |
|
626 | 657 | |
|
627 | 658 | finally: |
|
628 | 659 | return result; |
|
629 | 660 | } |
|
630 | 661 | |
|
631 | 662 | PyDoc_STRVAR(ZstdCompressor_write_to___doc__, |
|
632 | 663 | "Create a context manager to write compressed data to an object.\n" |
|
633 | 664 | "\n" |
|
634 | 665 | "The passed object must have a ``write()`` method.\n" |
|
635 | 666 | "\n" |
|
636 | 667 | "The caller feeds input data to the object by calling ``compress(data)``.\n" |
|
637 | 668 | "Compressed data is written to the argument given to this function.\n" |
|
638 | 669 | "\n" |
|
639 | 670 | "The function takes an optional ``size`` argument indicating the total size\n" |
|
640 | 671 | "of the eventual input. If specified, the size will influence compression\n" |
|
641 | 672 | "parameter tuning and could result in the size being written into the\n" |
|
642 | 673 | "header of the compressed data.\n" |
|
643 | 674 | "\n" |
|
644 | 675 | "An optional ``write_size`` argument is also accepted. It defines the maximum\n" |
|
645 | 676 | "byte size of chunks fed to ``write()``. By default, it uses the zstd default\n" |
|
646 | 677 | "for a compressor output stream.\n" |
|
647 | 678 | ); |
|
648 | 679 | |
|
649 | 680 | static ZstdCompressionWriter* ZstdCompressor_write_to(ZstdCompressor* self, PyObject* args, PyObject* kwargs) { |
|
650 | 681 | static char* kwlist[] = { |
|
651 | 682 | "writer", |
|
652 | 683 | "size", |
|
653 | 684 | "write_size", |
|
654 | 685 | NULL |
|
655 | 686 | }; |
|
656 | 687 | |
|
657 | 688 | PyObject* writer; |
|
658 | 689 | ZstdCompressionWriter* result; |
|
659 | 690 | Py_ssize_t sourceSize = 0; |
|
660 | 691 | size_t outSize = ZSTD_CStreamOutSize(); |
|
661 | 692 | |
|
662 | 693 | if (!PyArg_ParseTupleAndKeywords(args, kwargs, "O|nk", kwlist, &writer, &sourceSize, |
|
663 | 694 | &outSize)) { |
|
664 | 695 | return NULL; |
|
665 | 696 | } |
|
666 | 697 | |
|
667 | 698 | if (!PyObject_HasAttrString(writer, "write")) { |
|
668 | 699 | PyErr_SetString(PyExc_ValueError, "must pass an object with a write() method"); |
|
669 | 700 | return NULL; |
|
670 | 701 | } |
|
671 | 702 | |
|
672 | 703 | result = PyObject_New(ZstdCompressionWriter, &ZstdCompressionWriterType); |
|
673 | 704 | if (!result) { |
|
674 | 705 | return NULL; |
|
675 | 706 | } |
|
676 | 707 | |
|
677 | 708 | result->compressor = self; |
|
678 | 709 | Py_INCREF(result->compressor); |
|
679 | 710 | |
|
680 | 711 | result->writer = writer; |
|
681 | 712 | Py_INCREF(result->writer); |
|
682 | 713 | |
|
683 | 714 | result->sourceSize = sourceSize; |
|
684 | 715 | |
|
685 | 716 | result->outSize = outSize; |
|
686 | 717 | |
|
687 | 718 | result->entered = 0; |
|
688 | 719 | result->cstream = NULL; |
|
689 | 720 | |
|
690 | 721 | return result; |
|
691 | 722 | } |
|
692 | 723 | |
|
693 | 724 | static PyMethodDef ZstdCompressor_methods[] = { |
|
694 | { "compress", (PyCFunction)ZstdCompressor_compress, METH_VARARGS, | 

695 | ZstdCompressor_compress__doc__ }, | 
|
725 | { "compress", (PyCFunction)ZstdCompressor_compress, | |
|
726 | METH_VARARGS | METH_KEYWORDS, ZstdCompressor_compress__doc__ }, | |
|
696 | 727 | { "compressobj", (PyCFunction)ZstdCompressor_compressobj, |
|
697 | 728 | METH_VARARGS | METH_KEYWORDS, ZstdCompressionObj__doc__ }, |
|
698 | 729 | { "copy_stream", (PyCFunction)ZstdCompressor_copy_stream, |
|
699 | 730 | METH_VARARGS | METH_KEYWORDS, ZstdCompressor_copy_stream__doc__ }, |
|
700 | 731 | { "read_from", (PyCFunction)ZstdCompressor_read_from, |
|
701 | 732 | METH_VARARGS | METH_KEYWORDS, ZstdCompressor_read_from__doc__ }, |
|
702 | 733 | { "write_to", (PyCFunction)ZstdCompressor_write_to, |
|
703 | 734 | METH_VARARGS | METH_KEYWORDS, ZstdCompressor_write_to___doc__ }, |
|
704 | 735 | { NULL, NULL } |
|
705 | 736 | }; |
|
706 | 737 | |
|
707 | 738 | PyTypeObject ZstdCompressorType = { |
|
708 | 739 | PyVarObject_HEAD_INIT(NULL, 0) |
|
709 | 740 | "zstd.ZstdCompressor", /* tp_name */ |
|
710 | 741 | sizeof(ZstdCompressor), /* tp_basicsize */ |
|
711 | 742 | 0, /* tp_itemsize */ |
|
712 | 743 | (destructor)ZstdCompressor_dealloc, /* tp_dealloc */ |
|
713 | 744 | 0, /* tp_print */ |
|
714 | 745 | 0, /* tp_getattr */ |
|
715 | 746 | 0, /* tp_setattr */ |
|
716 | 747 | 0, /* tp_compare */ |
|
717 | 748 | 0, /* tp_repr */ |
|
718 | 749 | 0, /* tp_as_number */ |
|
719 | 750 | 0, /* tp_as_sequence */ |
|
720 | 751 | 0, /* tp_as_mapping */ |
|
721 | 752 | 0, /* tp_hash */ |
|
722 | 753 | 0, /* tp_call */ |
|
723 | 754 | 0, /* tp_str */ |
|
724 | 755 | 0, /* tp_getattro */ |
|
725 | 756 | 0, /* tp_setattro */ |
|
726 | 757 | 0, /* tp_as_buffer */ |
|
727 | 758 | Py_TPFLAGS_DEFAULT | Py_TPFLAGS_BASETYPE, /* tp_flags */ |
|
728 | 759 | ZstdCompressor__doc__, /* tp_doc */ |
|
729 | 760 | 0, /* tp_traverse */ |
|
730 | 761 | 0, /* tp_clear */ |
|
731 | 762 | 0, /* tp_richcompare */ |
|
732 | 763 | 0, /* tp_weaklistoffset */ |
|
733 | 764 | 0, /* tp_iter */ |
|
734 | 765 | 0, /* tp_iternext */ |
|
735 | 766 | ZstdCompressor_methods, /* tp_methods */ |
|
736 | 767 | 0, /* tp_members */ |
|
737 | 768 | 0, /* tp_getset */ |
|
738 | 769 | 0, /* tp_base */ |
|
739 | 770 | 0, /* tp_dict */ |
|
740 | 771 | 0, /* tp_descr_get */ |
|
741 | 772 | 0, /* tp_descr_set */ |
|
742 | 773 | 0, /* tp_dictoffset */ |
|
743 | 774 | (initproc)ZstdCompressor_init, /* tp_init */ |
|
744 | 775 | 0, /* tp_alloc */ |
|
745 | 776 | PyType_GenericNew, /* tp_new */ |
|
746 | 777 | }; |
|
747 | 778 | |
|
748 | 779 | void compressor_module_init(PyObject* mod) { |
|
749 | 780 | Py_TYPE(&ZstdCompressorType) = &PyType_Type; |
|
750 | 781 | if (PyType_Ready(&ZstdCompressorType) < 0) { |
|
751 | 782 | return; |
|
752 | 783 | } |
|
753 | 784 | |
|
754 | 785 | Py_INCREF((PyObject*)&ZstdCompressorType); |
|
755 | 786 | PyModule_AddObject(mod, "ZstdCompressor", |
|
756 | 787 | (PyObject*)&ZstdCompressorType); |
|
757 | 788 | } |
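
Taken together, the ``ZstdCompressor`` changes above behave roughly like this from Python. An illustrative sketch using in-memory streams, not a definitive reference::

    import io
    import zstd

    cctx = zstd.ZstdCompressor(level=3, write_content_size=True)

    # Empty input combined with content sizes now requires explicit opt-in
    # (issue #11); without allow_empty=True this raises ValueError.
    frame = cctx.compress(b'', allow_empty=True)

    # copy_stream() returns a (bytes_read, bytes_written) tuple.
    source = io.BytesIO(b'data to compress' * 1024)
    dest = io.BytesIO()
    read_count, write_count = cctx.copy_stream(source, dest)

    # read_from() yields compressed chunks as they are produced; the ZSTD_CCtx
    # held on the compressor is reused across one-shot compress() calls.
    source.seek(0)
    for chunk in cctx.read_from(source):
        pass  # e.g. send each chunk over a socket
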
@@ -1,84 +1,87 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #include "python-zstandard.h" |
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | 13 | static char frame_header[] = { |
|
14 | 14 | '\x28', |
|
15 | 15 | '\xb5', |
|
16 | 16 | '\x2f', |
|
17 | 17 | '\xfd', |
|
18 | 18 | }; |
|
19 | 19 | |
|
20 | 20 | void constants_module_init(PyObject* mod) { |
|
21 | 21 | PyObject* version; |
|
22 | 22 | PyObject* zstdVersion; |
|
23 | 23 | PyObject* frameHeader; |
|
24 | 24 | |
|
25 | 25 | #if PY_MAJOR_VERSION >= 3 |
|
26 | 26 | version = PyUnicode_FromString(PYTHON_ZSTANDARD_VERSION); |
|
27 | 27 | #else |
|
28 | 28 | version = PyString_FromString(PYTHON_ZSTANDARD_VERSION); |
|
29 | 29 | #endif |
|
30 | 30 | Py_INCREF(version); |
|
31 | 31 | PyModule_AddObject(mod, "__version__", version); |
|
32 | 32 | |
|
33 | 33 | ZstdError = PyErr_NewException("zstd.ZstdError", NULL, NULL); |
|
34 | 34 | PyModule_AddObject(mod, "ZstdError", ZstdError); |
|
35 | 35 | |
|
36 | PyModule_AddIntConstant(mod, "COMPRESSOBJ_FLUSH_FINISH", compressorobj_flush_finish); | |
|
37 | PyModule_AddIntConstant(mod, "COMPRESSOBJ_FLUSH_BLOCK", compressorobj_flush_block); | |
|
38 | ||
|
36 | 39 | /* For now, the version is a simple tuple instead of a dedicated type. */ |
|
37 | 40 | zstdVersion = PyTuple_New(3); |
|
38 | 41 | PyTuple_SetItem(zstdVersion, 0, PyLong_FromLong(ZSTD_VERSION_MAJOR)); |
|
39 | 42 | PyTuple_SetItem(zstdVersion, 1, PyLong_FromLong(ZSTD_VERSION_MINOR)); |
|
40 | 43 | PyTuple_SetItem(zstdVersion, 2, PyLong_FromLong(ZSTD_VERSION_RELEASE)); |
|
41 | 44 | Py_IncRef(zstdVersion); |
|
42 | 45 | PyModule_AddObject(mod, "ZSTD_VERSION", zstdVersion); |
|
43 | 46 | |
|
44 | 47 | frameHeader = PyBytes_FromStringAndSize(frame_header, sizeof(frame_header)); |
|
45 | 48 | if (frameHeader) { |
|
46 | 49 | PyModule_AddObject(mod, "FRAME_HEADER", frameHeader); |
|
47 | 50 | } |
|
48 | 51 | else { |
|
49 | 52 | PyErr_Format(PyExc_ValueError, "could not create frame header object"); |
|
50 | 53 | } |
|
51 | 54 | |
|
52 | 55 | PyModule_AddIntConstant(mod, "MAX_COMPRESSION_LEVEL", ZSTD_maxCLevel()); |
|
53 | 56 | PyModule_AddIntConstant(mod, "COMPRESSION_RECOMMENDED_INPUT_SIZE", |
|
54 | 57 | (long)ZSTD_CStreamInSize()); |
|
55 | 58 | PyModule_AddIntConstant(mod, "COMPRESSION_RECOMMENDED_OUTPUT_SIZE", |
|
56 | 59 | (long)ZSTD_CStreamOutSize()); |
|
57 | 60 | PyModule_AddIntConstant(mod, "DECOMPRESSION_RECOMMENDED_INPUT_SIZE", |
|
58 | 61 | (long)ZSTD_DStreamInSize()); |
|
59 | 62 | PyModule_AddIntConstant(mod, "DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE", |
|
60 | 63 | (long)ZSTD_DStreamOutSize()); |
|
61 | 64 | |
|
62 | 65 | PyModule_AddIntConstant(mod, "MAGIC_NUMBER", ZSTD_MAGICNUMBER); |
|
63 | 66 | PyModule_AddIntConstant(mod, "WINDOWLOG_MIN", ZSTD_WINDOWLOG_MIN); |
|
64 | 67 | PyModule_AddIntConstant(mod, "WINDOWLOG_MAX", ZSTD_WINDOWLOG_MAX); |
|
65 | 68 | PyModule_AddIntConstant(mod, "CHAINLOG_MIN", ZSTD_CHAINLOG_MIN); |
|
66 | 69 | PyModule_AddIntConstant(mod, "CHAINLOG_MAX", ZSTD_CHAINLOG_MAX); |
|
67 | 70 | PyModule_AddIntConstant(mod, "HASHLOG_MIN", ZSTD_HASHLOG_MIN); |
|
68 | 71 | PyModule_AddIntConstant(mod, "HASHLOG_MAX", ZSTD_HASHLOG_MAX); |
|
69 | 72 | PyModule_AddIntConstant(mod, "HASHLOG3_MAX", ZSTD_HASHLOG3_MAX); |
|
70 | 73 | PyModule_AddIntConstant(mod, "SEARCHLOG_MIN", ZSTD_SEARCHLOG_MIN); |
|
71 | 74 | PyModule_AddIntConstant(mod, "SEARCHLOG_MAX", ZSTD_SEARCHLOG_MAX); |
|
72 | 75 | PyModule_AddIntConstant(mod, "SEARCHLENGTH_MIN", ZSTD_SEARCHLENGTH_MIN); |
|
73 | 76 | PyModule_AddIntConstant(mod, "SEARCHLENGTH_MAX", ZSTD_SEARCHLENGTH_MAX); |
|
74 | 77 | PyModule_AddIntConstant(mod, "TARGETLENGTH_MIN", ZSTD_TARGETLENGTH_MIN); |
|
75 | 78 | PyModule_AddIntConstant(mod, "TARGETLENGTH_MAX", ZSTD_TARGETLENGTH_MAX); |
|
76 | 79 | |
|
77 | 80 | PyModule_AddIntConstant(mod, "STRATEGY_FAST", ZSTD_fast); |
|
78 | 81 | PyModule_AddIntConstant(mod, "STRATEGY_DFAST", ZSTD_dfast); |
|
79 | 82 | PyModule_AddIntConstant(mod, "STRATEGY_GREEDY", ZSTD_greedy); |
|
80 | 83 | PyModule_AddIntConstant(mod, "STRATEGY_LAZY", ZSTD_lazy); |
|
81 | 84 | PyModule_AddIntConstant(mod, "STRATEGY_LAZY2", ZSTD_lazy2); |
|
82 | 85 | PyModule_AddIntConstant(mod, "STRATEGY_BTLAZY2", ZSTD_btlazy2); |
|
83 | 86 | PyModule_AddIntConstant(mod, "STRATEGY_BTOPT", ZSTD_btopt); |
|
84 | 87 | } |
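
The two COMPRESSOBJ_FLUSH_* constants registered above back the new
argument to ``flush()`` on compression objects. A short usage sketch
matching the tests later in this diff:

    import zstd

    cctx = zstd.ZstdCompressor(level=1)
    cobj = cctx.compressobj()
    cobj.compress(b'foo')
    # Emit a complete zstd block without ending the frame.
    chunk = cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK)
    cobj.compress(b'bar')
    # The default mode, COMPRESSOBJ_FLUSH_FINISH, ends the frame.
    trailer = cobj.flush()
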
@@ -1,187 +1,187 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #include "python-zstandard.h" |
|
10 | 10 | |
|
11 | 11 | extern PyObject* ZstdError; |
|
12 | 12 | |
|
13 | 13 | PyDoc_STRVAR(ZstdDecompressionWriter__doc, |
|
14 | 14 | """A context manager used for writing decompressed output.\n" |
|
15 | 15 | ); |
|
16 | 16 | |
|
17 | 17 | static void ZstdDecompressionWriter_dealloc(ZstdDecompressionWriter* self) { |
|
18 | 18 | Py_XDECREF(self->decompressor); |
|
19 | 19 | Py_XDECREF(self->writer); |
|
20 | 20 | |
|
21 | 21 | if (self->dstream) { |
|
22 | 22 | ZSTD_freeDStream(self->dstream); |
|
23 | 23 | self->dstream = NULL; |
|
24 | 24 | } |
|
25 | 25 | |
|
26 | 26 | PyObject_Del(self); |
|
27 | 27 | } |
|
28 | 28 | |
|
29 | 29 | static PyObject* ZstdDecompressionWriter_enter(ZstdDecompressionWriter* self) { |
|
30 | 30 | if (self->entered) { |
|
31 | 31 | PyErr_SetString(ZstdError, "cannot __enter__ multiple times"); |
|
32 | 32 | return NULL; |
|
33 | 33 | } |
|
34 | 34 | |
|
35 | 35 | self->dstream = DStream_from_ZstdDecompressor(self->decompressor); |
|
36 | 36 | if (!self->dstream) { |
|
37 | 37 | return NULL; |
|
38 | 38 | } |
|
39 | 39 | |
|
40 | 40 | self->entered = 1; |
|
41 | 41 | |
|
42 | 42 | Py_INCREF(self); |
|
43 | 43 | return (PyObject*)self; |
|
44 | 44 | } |
|
45 | 45 | |
|
46 | 46 | static PyObject* ZstdDecompressionWriter_exit(ZstdDecompressionWriter* self, PyObject* args) { |
|
47 | 47 | self->entered = 0; |
|
48 | 48 | |
|
49 | 49 | if (self->dstream) { |
|
50 | 50 | ZSTD_freeDStream(self->dstream); |
|
51 | 51 | self->dstream = NULL; |
|
52 | 52 | } |
|
53 | 53 | |
|
54 | 54 | Py_RETURN_FALSE; |
|
55 | 55 | } |
|
56 | 56 | |
|
57 | 57 | static PyObject* ZstdDecompressionWriter_memory_size(ZstdDecompressionWriter* self) { |
|
58 | 58 | if (!self->dstream) { |
|
59 | 59 | PyErr_SetString(ZstdError, "cannot determine size of inactive decompressor; " |
|
60 | 60 | "call when context manager is active"); |
|
61 | 61 | return NULL; |
|
62 | 62 | } |
|
63 | 63 | |
|
64 | 64 | return PyLong_FromSize_t(ZSTD_sizeof_DStream(self->dstream)); |
|
65 | 65 | } |
|
66 | 66 | |
|
67 | 67 | static PyObject* ZstdDecompressionWriter_write(ZstdDecompressionWriter* self, PyObject* args) { |
|
68 | 68 | const char* source; |
|
69 | 69 | Py_ssize_t sourceSize; |
|
70 | 70 | size_t zresult = 0; |
|
71 | 71 | ZSTD_inBuffer input; |
|
72 | 72 | ZSTD_outBuffer output; |
|
73 | 73 | PyObject* res; |
|
74 | 74 | |
|
75 | 75 | #if PY_MAJOR_VERSION >= 3 |
|
76 | 76 | if (!PyArg_ParseTuple(args, "y#", &source, &sourceSize)) { |
|
77 | 77 | #else |
|
78 | 78 | if (!PyArg_ParseTuple(args, "s#", &source, &sourceSize)) { |
|
79 | 79 | #endif |
|
80 | 80 | return NULL; |
|
81 | 81 | } |
|
82 | 82 | |
|
83 | 83 | if (!self->entered) { |
|
84 | 84 | PyErr_SetString(ZstdError, "write must be called from an active context manager"); |
|
85 | 85 | return NULL; |
|
86 | 86 | } |
|
87 | 87 | |
|
88 | output.dst = malloc(self->outSize); | |
|
88 | output.dst = PyMem_Malloc(self->outSize); | |
|
89 | 89 | if (!output.dst) { |
|
90 | 90 | return PyErr_NoMemory(); |
|
91 | 91 | } |
|
92 | 92 | output.size = self->outSize; |
|
93 | 93 | output.pos = 0; |
|
94 | 94 | |
|
95 | 95 | input.src = source; |
|
96 | 96 | input.size = sourceSize; |
|
97 | 97 | input.pos = 0; |
|
98 | 98 | |
|
99 | 99 | while ((ssize_t)input.pos < sourceSize) { |
|
100 | 100 | Py_BEGIN_ALLOW_THREADS |
|
101 | 101 | zresult = ZSTD_decompressStream(self->dstream, &output, &input); |
|
102 | 102 | Py_END_ALLOW_THREADS |
|
103 | 103 | |
|
104 | 104 | if (ZSTD_isError(zresult)) { |
|
105 | free(output.dst); |

105 | PyMem_Free(output.dst); | |
|
106 | 106 | PyErr_Format(ZstdError, "zstd decompress error: %s", |
|
107 | 107 | ZSTD_getErrorName(zresult)); |
|
108 | 108 | return NULL; |
|
109 | 109 | } |
|
110 | 110 | |
|
111 | 111 | if (output.pos) { |
|
112 | 112 | #if PY_MAJOR_VERSION >= 3 |
|
113 | 113 | res = PyObject_CallMethod(self->writer, "write", "y#", |
|
114 | 114 | #else |
|
115 | 115 | res = PyObject_CallMethod(self->writer, "write", "s#", |
|
116 | 116 | #endif |
|
117 | 117 | output.dst, output.pos); |
|
118 | 118 | Py_XDECREF(res); |
|
119 | 119 | output.pos = 0; |
|
120 | 120 | } |
|
121 | 121 | } |
|
122 | 122 | |
|
123 | free(output.dst); |

123 | PyMem_Free(output.dst); | |
|
124 | 124 | |
|
125 | 125 | /* TODO return bytes written */ |
|
126 | 126 | Py_RETURN_NONE; |
|
127 | 127 | } |
|
128 | 128 | |
|
129 | 129 | static PyMethodDef ZstdDecompressionWriter_methods[] = { |
|
130 | 130 | { "__enter__", (PyCFunction)ZstdDecompressionWriter_enter, METH_NOARGS, |
|
131 | 131 | PyDoc_STR("Enter a decompression context.") }, |
|
132 | 132 | { "__exit__", (PyCFunction)ZstdDecompressionWriter_exit, METH_VARARGS, |
|
133 | 133 | PyDoc_STR("Exit a decompression context.") }, |
|
134 | 134 | { "memory_size", (PyCFunction)ZstdDecompressionWriter_memory_size, METH_NOARGS, |
|
135 | 135 | PyDoc_STR("Obtain the memory size in bytes of the underlying decompressor.") }, |
|
136 | 136 | { "write", (PyCFunction)ZstdDecompressionWriter_write, METH_VARARGS, |
|
137 | 137 | PyDoc_STR("Compress data") }, |
|
138 | 138 | { NULL, NULL } |
|
139 | 139 | }; |
|
140 | 140 | |
|
141 | 141 | PyTypeObject ZstdDecompressionWriterType = { |
|
142 | 142 | PyVarObject_HEAD_INIT(NULL, 0) |
|
143 | 143 | "zstd.ZstdDecompressionWriter", /* tp_name */ |
|
144 | 144 | sizeof(ZstdDecompressionWriter),/* tp_basicsize */ |
|
145 | 145 | 0, /* tp_itemsize */ |
|
146 | 146 | (destructor)ZstdDecompressionWriter_dealloc, /* tp_dealloc */ |
|
147 | 147 | 0, /* tp_print */ |
|
148 | 148 | 0, /* tp_getattr */ |
|
149 | 149 | 0, /* tp_setattr */ |
|
150 | 150 | 0, /* tp_compare */ |
|
151 | 151 | 0, /* tp_repr */ |
|
152 | 152 | 0, /* tp_as_number */ |
|
153 | 153 | 0, /* tp_as_sequence */ |
|
154 | 154 | 0, /* tp_as_mapping */ |
|
155 | 155 | 0, /* tp_hash */ |
|
156 | 156 | 0, /* tp_call */ |
|
157 | 157 | 0, /* tp_str */ |
|
158 | 158 | 0, /* tp_getattro */ |
|
159 | 159 | 0, /* tp_setattro */ |
|
160 | 160 | 0, /* tp_as_buffer */ |
|
161 | 161 | Py_TPFLAGS_DEFAULT | Py_TPFLAGS_BASETYPE, /* tp_flags */ |
|
162 | 162 | ZstdDecompressionWriter__doc, /* tp_doc */ |
|
163 | 163 | 0, /* tp_traverse */ |
|
164 | 164 | 0, /* tp_clear */ |
|
165 | 165 | 0, /* tp_richcompare */ |
|
166 | 166 | 0, /* tp_weaklistoffset */ |
|
167 | 167 | 0, /* tp_iter */ |
|
168 | 168 | 0, /* tp_iternext */ |
|
169 | 169 | ZstdDecompressionWriter_methods,/* tp_methods */ |
|
170 | 170 | 0, /* tp_members */ |
|
171 | 171 | 0, /* tp_getset */ |
|
172 | 172 | 0, /* tp_base */ |
|
173 | 173 | 0, /* tp_dict */ |
|
174 | 174 | 0, /* tp_descr_get */ |
|
175 | 175 | 0, /* tp_descr_set */ |
|
176 | 176 | 0, /* tp_dictoffset */ |
|
177 | 177 | 0, /* tp_init */ |
|
178 | 178 | 0, /* tp_alloc */ |
|
179 | 179 | PyType_GenericNew, /* tp_new */ |
|
180 | 180 | }; |
|
181 | 181 | |
|
182 | 182 | void decompressionwriter_module_init(PyObject* mod) { |
|
183 | 183 | Py_TYPE(&ZstdDecompressionWriterType) = &PyType_Type; |
|
184 | 184 | if (PyType_Ready(&ZstdDecompressionWriterType) < 0) { |
|
185 | 185 | return; |
|
186 | 186 | } |
|
187 | 187 | } |
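
This type is what ``ZstdDecompressor.write_to()`` returns. A small,
self-contained sketch of the context manager protocol implemented above:

    import io
    import zstd

    compressed_frame = zstd.ZstdCompressor().compress(b'data' * 1024)

    dctx = zstd.ZstdDecompressor()
    dest = io.BytesIO()
    with dctx.write_to(dest) as decompressor:
        decompressor.write(compressed_frame)
        # memory_size() is only valid while the context manager is active.
        size = decompressor.memory_size()
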
@@ -1,172 +1,178 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | #define PY_SSIZE_T_CLEAN |
|
10 | 10 | #include <Python.h> |
|
11 | 11 | |
|
12 | 12 | #define ZSTD_STATIC_LINKING_ONLY |
|
13 | 13 | #define ZDICT_STATIC_LINKING_ONLY |
|
14 | 14 | #include "mem.h" |
|
15 | 15 | #include "zstd.h" |
|
16 | 16 | #include "zdict.h" |
|
17 | 17 | |
|
18 | #define PYTHON_ZSTANDARD_VERSION "0.5.2" |

18 | #define PYTHON_ZSTANDARD_VERSION "0.6.0" | |
|
19 | ||
|
20 | typedef enum { | |
|
21 | compressorobj_flush_finish, | |
|
22 | compressorobj_flush_block, | |
|
23 | } CompressorObj_Flush; | |
|
19 | 24 | |
|
20 | 25 | typedef struct { |
|
21 | 26 | PyObject_HEAD |
|
22 | 27 | unsigned windowLog; |
|
23 | 28 | unsigned chainLog; |
|
24 | 29 | unsigned hashLog; |
|
25 | 30 | unsigned searchLog; |
|
26 | 31 | unsigned searchLength; |
|
27 | 32 | unsigned targetLength; |
|
28 | 33 | ZSTD_strategy strategy; |
|
29 | 34 | } CompressionParametersObject; |
|
30 | 35 | |
|
31 | 36 | extern PyTypeObject CompressionParametersType; |
|
32 | 37 | |
|
33 | 38 | typedef struct { |
|
34 | 39 | PyObject_HEAD |
|
35 | 40 | unsigned selectivityLevel; |
|
36 | 41 | int compressionLevel; |
|
37 | 42 | unsigned notificationLevel; |
|
38 | 43 | unsigned dictID; |
|
39 | 44 | } DictParametersObject; |
|
40 | 45 | |
|
41 | 46 | extern PyTypeObject DictParametersType; |
|
42 | 47 | |
|
43 | 48 | typedef struct { |
|
44 | 49 | PyObject_HEAD |
|
45 | 50 | |
|
46 | 51 | void* dictData; |
|
47 | 52 | size_t dictSize; |
|
48 | 53 | } ZstdCompressionDict; |
|
49 | 54 | |
|
50 | 55 | extern PyTypeObject ZstdCompressionDictType; |
|
51 | 56 | |
|
52 | 57 | typedef struct { |
|
53 | 58 | PyObject_HEAD |
|
54 | 59 | |
|
55 | 60 | int compressionLevel; |
|
56 | 61 | ZstdCompressionDict* dict; |
|
62 | ZSTD_CCtx* cctx; | |
|
57 | 63 | ZSTD_CDict* cdict; |
|
58 | 64 | CompressionParametersObject* cparams; |
|
59 | 65 | ZSTD_frameParameters fparams; |
|
60 | 66 | } ZstdCompressor; |
|
61 | 67 | |
|
62 | 68 | extern PyTypeObject ZstdCompressorType; |
|
63 | 69 | |
|
64 | 70 | typedef struct { |
|
65 | 71 | PyObject_HEAD |
|
66 | 72 | |
|
67 | 73 | ZstdCompressor* compressor; |
|
68 | 74 | ZSTD_CStream* cstream; |
|
69 | 75 | ZSTD_outBuffer output; |
|
70 | int flushed; |

76 | int finished; | |
|
71 | 77 | } ZstdCompressionObj; |
|
72 | 78 | |
|
73 | 79 | extern PyTypeObject ZstdCompressionObjType; |
|
74 | 80 | |
|
75 | 81 | typedef struct { |
|
76 | 82 | PyObject_HEAD |
|
77 | 83 | |
|
78 | 84 | ZstdCompressor* compressor; |
|
79 | 85 | PyObject* writer; |
|
80 | 86 | Py_ssize_t sourceSize; |
|
81 | 87 | size_t outSize; |
|
82 | 88 | ZSTD_CStream* cstream; |
|
83 | 89 | int entered; |
|
84 | 90 | } ZstdCompressionWriter; |
|
85 | 91 | |
|
86 | 92 | extern PyTypeObject ZstdCompressionWriterType; |
|
87 | 93 | |
|
88 | 94 | typedef struct { |
|
89 | 95 | PyObject_HEAD |
|
90 | 96 | |
|
91 | 97 | ZstdCompressor* compressor; |
|
92 | 98 | PyObject* reader; |
|
93 | 99 | Py_buffer* buffer; |
|
94 | 100 | Py_ssize_t bufferOffset; |
|
95 | 101 | Py_ssize_t sourceSize; |
|
96 | 102 | size_t inSize; |
|
97 | 103 | size_t outSize; |
|
98 | 104 | |
|
99 | 105 | ZSTD_CStream* cstream; |
|
100 | 106 | ZSTD_inBuffer input; |
|
101 | 107 | ZSTD_outBuffer output; |
|
102 | 108 | int finishedOutput; |
|
103 | 109 | int finishedInput; |
|
104 | 110 | PyObject* readResult; |
|
105 | 111 | } ZstdCompressorIterator; |
|
106 | 112 | |
|
107 | 113 | extern PyTypeObject ZstdCompressorIteratorType; |
|
108 | 114 | |
|
109 | 115 | typedef struct { |
|
110 | 116 | PyObject_HEAD |
|
111 | 117 | |
|
112 | 118 | ZSTD_DCtx* refdctx; |
|
113 | 119 | |
|
114 | 120 | ZstdCompressionDict* dict; |
|
115 | 121 | ZSTD_DDict* ddict; |
|
116 | 122 | } ZstdDecompressor; |
|
117 | 123 | |
|
118 | 124 | extern PyTypeObject ZstdDecompressorType; |
|
119 | 125 | |
|
120 | 126 | typedef struct { |
|
121 | 127 | PyObject_HEAD |
|
122 | 128 | |
|
123 | 129 | ZstdDecompressor* decompressor; |
|
124 | 130 | ZSTD_DStream* dstream; |
|
125 | 131 | int finished; |
|
126 | 132 | } ZstdDecompressionObj; |
|
127 | 133 | |
|
128 | 134 | extern PyTypeObject ZstdDecompressionObjType; |
|
129 | 135 | |
|
130 | 136 | typedef struct { |
|
131 | 137 | PyObject_HEAD |
|
132 | 138 | |
|
133 | 139 | ZstdDecompressor* decompressor; |
|
134 | 140 | PyObject* writer; |
|
135 | 141 | size_t outSize; |
|
136 | 142 | ZSTD_DStream* dstream; |
|
137 | 143 | int entered; |
|
138 | 144 | } ZstdDecompressionWriter; |
|
139 | 145 | |
|
140 | 146 | extern PyTypeObject ZstdDecompressionWriterType; |
|
141 | 147 | |
|
142 | 148 | typedef struct { |
|
143 | 149 | PyObject_HEAD |
|
144 | 150 | |
|
145 | 151 | ZstdDecompressor* decompressor; |
|
146 | 152 | PyObject* reader; |
|
147 | 153 | Py_buffer* buffer; |
|
148 | 154 | Py_ssize_t bufferOffset; |
|
149 | 155 | size_t inSize; |
|
150 | 156 | size_t outSize; |
|
151 | 157 | size_t skipBytes; |
|
152 | 158 | ZSTD_DStream* dstream; |
|
153 | 159 | ZSTD_inBuffer input; |
|
154 | 160 | ZSTD_outBuffer output; |
|
155 | 161 | Py_ssize_t readCount; |
|
156 | 162 | int finishedInput; |
|
157 | 163 | int finishedOutput; |
|
158 | 164 | } ZstdDecompressorIterator; |
|
159 | 165 | |
|
160 | 166 | extern PyTypeObject ZstdDecompressorIteratorType; |
|
161 | 167 | |
|
162 | 168 | typedef struct { |
|
163 | 169 | int errored; |
|
164 | 170 | PyObject* chunk; |
|
165 | 171 | } DecompressorIteratorResult; |
|
166 | 172 | |
|
167 | 173 | void ztopy_compression_parameters(CompressionParametersObject* params, ZSTD_compressionParameters* zparams); |
|
168 | 174 | CompressionParametersObject* get_compression_parameters(PyObject* self, PyObject* args); |
|
169 | 175 | PyObject* estimate_compression_context_size(PyObject* self, PyObject* args); |
|
170 | 176 | ZSTD_CStream* CStream_from_ZstdCompressor(ZstdCompressor* compressor, Py_ssize_t sourceSize); |
|
171 | 177 | ZSTD_DStream* DStream_from_ZstdDecompressor(ZstdDecompressor* decompressor); |
|
172 | 178 | ZstdCompressionDict* train_dictionary(PyObject* self, PyObject* args, PyObject* kwargs); |
@@ -1,110 +1,108 b'' | |||
|
1 | 1 | # Copyright (c) 2016-present, Gregory Szorc |
|
2 | 2 | # All rights reserved. |
|
3 | 3 | # |
|
4 | 4 | # This software may be modified and distributed under the terms |
|
5 | 5 | # of the BSD license. See the LICENSE file for details. |
|
6 | 6 | |
|
7 | 7 | from __future__ import absolute_import |
|
8 | 8 | |
|
9 | 9 | import cffi |
|
10 | import distutils.ccompiler | |
|
10 | 11 | import os |
|
12 | import subprocess | |
|
13 | import tempfile | |
|
11 | 14 | |
|
12 | 15 | |
|
13 | 16 | HERE = os.path.abspath(os.path.dirname(__file__)) |
|
14 | 17 | |
|
15 | 18 | SOURCES = ['zstd/%s' % p for p in ( |
|
16 | 19 | 'common/entropy_common.c', |
|
17 | 20 | 'common/error_private.c', |
|
18 | 21 | 'common/fse_decompress.c', |
|
19 | 22 | 'common/xxhash.c', |
|
20 | 23 | 'common/zstd_common.c', |
|
21 | 24 | 'compress/fse_compress.c', |
|
22 | 25 | 'compress/huf_compress.c', |
|
23 | 'compress/zbuff_compress.c', | |
|
24 | 26 | 'compress/zstd_compress.c', |
|
25 | 27 | 'decompress/huf_decompress.c', |
|
26 | 'decompress/zbuff_decompress.c', | |
|
27 | 28 | 'decompress/zstd_decompress.c', |
|
28 | 29 | 'dictBuilder/divsufsort.c', |
|
29 | 30 | 'dictBuilder/zdict.c', |
|
30 | 31 | )] |
|
31 | 32 | |
|
32 | 33 | INCLUDE_DIRS = [os.path.join(HERE, d) for d in ( |
|
33 | 34 | 'zstd', |
|
34 | 35 | 'zstd/common', |
|
35 | 36 | 'zstd/compress', |
|
36 | 37 | 'zstd/decompress', |
|
37 | 38 | 'zstd/dictBuilder', |
|
38 | 39 | )] |
|
39 | 40 | |
|
41 | # cffi can't parse some of the primitives in zstd.h. So we invoke the | |
|
42 | # preprocessor and feed its output into cffi. | |
|
43 | compiler = distutils.ccompiler.new_compiler() | |
|
44 | ||
|
45 | # Needed for MSVC. | |
|
46 | if hasattr(compiler, 'initialize'): | |
|
47 | compiler.initialize() | |
|
48 | ||
|
49 | # Distutils doesn't set compiler.preprocessor, so invoke the preprocessor | |
|
50 | # manually. | |
|
51 | if compiler.compiler_type == 'unix': | |
|
52 | args = list(compiler.executables['compiler']) | |
|
53 | args.extend([ | |
|
54 | '-E', | |
|
55 | '-DZSTD_STATIC_LINKING_ONLY', | |
|
56 | ]) | |
|
57 | elif compiler.compiler_type == 'msvc': | |
|
58 | args = [compiler.cc] | |
|
59 | args.extend([ | |
|
60 | '/EP', | |
|
61 | '/DZSTD_STATIC_LINKING_ONLY', | |
|
62 | ]) | |
|
63 | else: | |
|
64 | raise Exception('unsupported compiler type: %s' % compiler.compiler_type) | |
|
65 | ||
|
66 | # zstd.h includes <stddef.h>, which is also included by cffi's boilerplate. | |
|
67 | # This can lead to duplicate declarations. So we strip this include from the | |
|
68 | # preprocessor invocation. | |
|
69 | ||
|
40 | 70 | with open(os.path.join(HERE, 'zstd', 'zstd.h'), 'rb') as fh: |
|
41 | zstd_h = fh.read() | |
|
71 | lines = [l for l in fh if not l.startswith(b'#include <stddef.h>')] | |
|
72 | ||
|
73 | fd, input_file = tempfile.mkstemp(suffix='.h') | |
|
74 | os.write(fd, b''.join(lines)) | |
|
75 | os.close(fd) | |
|
76 | ||
|
77 | args.append(input_file) | |
|
78 | ||
|
79 | try: | |
|
80 | process = subprocess.Popen(args, stdout=subprocess.PIPE) | |
|
81 | output = process.communicate()[0] | |
|
82 | ret = process.poll() | |
|
83 | if ret: | |
|
84 | raise Exception('preprocessor exited with error') | |
|
85 | finally: | |
|
86 | os.unlink(input_file) | |
|
87 | ||
|
88 | def normalize_output(): | |
|
89 | lines = [] | |
|
90 | for line in output.splitlines(): | |
|
91 | # CFFI's parser doesn't like __attribute__ on UNIX compilers. | |
|
92 | if line.startswith(b'__attribute__ ((visibility ("default"))) '): | |
|
93 | line = line[len(b'__attribute__ ((visibility ("default"))) '):] | |
|
94 | ||
|
95 | lines.append(line) | |
|
96 | ||
|
97 | return b'\n'.join(lines) | |
|
42 | 98 | |
|
43 | 99 | ffi = cffi.FFI() |
|
44 | 100 | ffi.set_source('_zstd_cffi', ''' |
|
45 | /* needed for typedefs like U32 references in zstd.h */ | |
|
46 | #include "mem.h" | |
|
47 | 101 | #define ZSTD_STATIC_LINKING_ONLY |
|
48 | 102 | #include "zstd.h" |
|
49 | ''', | |
|
50 | sources=SOURCES, include_dirs=INCLUDE_DIRS) | |
|
51 | ||
|
52 | # Rather than define the API definitions from zstd.h inline, munge the | |
|
53 | # source in a way that cdef() will accept. | |
|
54 | lines = zstd_h.splitlines() | |
|
55 | lines = [l.rstrip() for l in lines if l.strip()] | |
|
56 | ||
|
57 | # Strip preprocessor directives - they aren't important for our needs. | |
|
58 | lines = [l for l in lines | |
|
59 | if not l.startswith((b'#if', b'#else', b'#endif', b'#include'))] | |
|
60 | ||
|
61 | # Remove extern C block | |
|
62 | lines = [l for l in lines if l not in (b'extern "C" {', b'}')] | |
|
63 | ||
|
64 | # The version #defines don't parse and aren't necessary. Strip them. | |
|
65 | lines = [l for l in lines if not l.startswith(( | |
|
66 | b'#define ZSTD_H_235446', | |
|
67 | b'#define ZSTD_LIB_VERSION', | |
|
68 | b'#define ZSTD_QUOTE', | |
|
69 | b'#define ZSTD_EXPAND_AND_QUOTE', | |
|
70 | b'#define ZSTD_VERSION_STRING', | |
|
71 | b'#define ZSTD_VERSION_NUMBER'))] | |
|
103 | ''', sources=SOURCES, include_dirs=INCLUDE_DIRS) | |
|
72 | 104 | |
|
73 | # The C parser also doesn't like some constant defines referencing | |
|
74 | # other constants. | |
|
75 | # TODO we pick the 64-bit constants here. We should assert somewhere | |
|
76 | # we're compiling for 64-bit. | |
|
77 | def fix_constants(l): | |
|
78 | if l.startswith(b'#define ZSTD_WINDOWLOG_MAX '): | |
|
79 | return b'#define ZSTD_WINDOWLOG_MAX 27' | |
|
80 | elif l.startswith(b'#define ZSTD_CHAINLOG_MAX '): | |
|
81 | return b'#define ZSTD_CHAINLOG_MAX 28' | |
|
82 | elif l.startswith(b'#define ZSTD_HASHLOG_MAX '): | |
|
83 | return b'#define ZSTD_HASHLOG_MAX 27' | |
|
84 | elif l.startswith(b'#define ZSTD_CHAINLOG_MAX '): | |
|
85 | return b'#define ZSTD_CHAINLOG_MAX 28' | |
|
86 | elif l.startswith(b'#define ZSTD_CHAINLOG_MIN '): | |
|
87 | return b'#define ZSTD_CHAINLOG_MIN 6' | |
|
88 | elif l.startswith(b'#define ZSTD_SEARCHLOG_MAX '): | |
|
89 | return b'#define ZSTD_SEARCHLOG_MAX 26' | |
|
90 | elif l.startswith(b'#define ZSTD_BLOCKSIZE_ABSOLUTEMAX '): | |
|
91 | return b'#define ZSTD_BLOCKSIZE_ABSOLUTEMAX 131072' | |
|
92 | else: | |
|
93 | return l | |
|
94 | lines = map(fix_constants, lines) | |
|
95 | ||
|
96 | # ZSTDLIB_API isn't handled correctly. Strip it. | |
|
97 | lines = [l for l in lines if not l.startswith(b'# define ZSTDLIB_API')] | |
|
98 | def strip_api(l): | |
|
99 | if l.startswith(b'ZSTDLIB_API '): | |
|
100 | return l[len(b'ZSTDLIB_API '):] | |
|
101 | else: | |
|
102 | return l | |
|
103 | lines = map(strip_api, lines) | |
|
104 | ||
|
105 | source = b'\n'.join(lines) | |
|
106 | ffi.cdef(source.decode('latin1')) | |
|
107 | ||
|
105 | ffi.cdef(normalize_output().decode('latin1')) | |
|
108 | 106 | |
|
109 | 107 | if __name__ == '__main__': |
|
110 | 108 | ffi.compile() |
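
The core idea above is to let a real C preprocessor expand ``zstd.h``
rather than rewriting the header by hand. A stripped-down sketch of that
approach (illustrative only; the actual script also handles MSVC, writes
a temporary header with ``#include <stddef.h>`` removed, and normalizes
the output as shown above):

    import subprocess
    import cffi

    # -P suppresses line markers, which cffi's parser cannot digest.
    output = subprocess.check_output(
        ['cc', '-E', '-P', '-DZSTD_STATIC_LINKING_ONLY', 'zstd/zstd.h'])

    ffi = cffi.FFI()
    ffi.cdef(output.decode('latin1'))
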
@@ -1,62 +1,69 b'' | |||
|
1 | 1 | #!/usr/bin/env python |
|
2 | 2 | # Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | # All rights reserved. |
|
4 | 4 | # |
|
5 | 5 | # This software may be modified and distributed under the terms |
|
6 | 6 | # of the BSD license. See the LICENSE file for details. |
|
7 | 7 | |
|
8 | import sys | |
|
8 | 9 | from setuptools import setup |
|
9 | 10 | |
|
10 | 11 | try: |
|
11 | 12 | import cffi |
|
12 | 13 | except ImportError: |
|
13 | 14 | cffi = None |
|
14 | 15 | |
|
15 | 16 | import setup_zstd |
|
16 | 17 | |
|
18 | SUPPORT_LEGACY = False | |
|
19 | ||
|
20 | if "--legacy" in sys.argv: | |
|
21 | SUPPORT_LEGACY = True | |
|
22 | sys.argv.remove("--legacy") | |
|
23 | ||
|
17 | 24 | # Code for obtaining the Extension instance is in its own module to |
|
18 | 25 | # facilitate reuse in other projects. |
|
19 | extensions = [setup_zstd.get_c_extension()] | |
|
26 | extensions = [setup_zstd.get_c_extension(SUPPORT_LEGACY, 'zstd')] | |
|
20 | 27 | |
|
21 | 28 | if cffi: |
|
22 | 29 | import make_cffi |
|
23 | 30 | extensions.append(make_cffi.ffi.distutils_extension()) |
|
24 | 31 | |
|
25 | 32 | version = None |
|
26 | 33 | |
|
27 | 34 | with open('c-ext/python-zstandard.h', 'r') as fh: |
|
28 | 35 | for line in fh: |
|
29 | 36 | if not line.startswith('#define PYTHON_ZSTANDARD_VERSION'): |
|
30 | 37 | continue |
|
31 | 38 | |
|
32 | 39 | version = line.split()[2][1:-1] |
|
33 | 40 | break |
|
34 | 41 | |
|
35 | 42 | if not version: |
|
36 | 43 | raise Exception('could not resolve package version; ' |
|
37 | 44 | 'this should never happen') |
|
38 | 45 | |
|
39 | 46 | setup( |
|
40 | 47 | name='zstandard', |
|
41 | 48 | version=version, |
|
42 | 49 | description='Zstandard bindings for Python', |
|
43 | 50 | long_description=open('README.rst', 'r').read(), |
|
44 | 51 | url='https://github.com/indygreg/python-zstandard', |
|
45 | 52 | author='Gregory Szorc', |
|
46 | 53 | author_email='gregory.szorc@gmail.com', |
|
47 | 54 | license='BSD', |
|
48 | 55 | classifiers=[ |
|
49 | 56 | 'Development Status :: 4 - Beta', |
|
50 | 57 | 'Intended Audience :: Developers', |
|
51 | 58 | 'License :: OSI Approved :: BSD License', |
|
52 | 59 | 'Programming Language :: C', |
|
53 | 60 | 'Programming Language :: Python :: 2.6', |
|
54 | 61 | 'Programming Language :: Python :: 2.7', |
|
55 | 62 | 'Programming Language :: Python :: 3.3', |
|
56 | 63 | 'Programming Language :: Python :: 3.4', |
|
57 | 64 | 'Programming Language :: Python :: 3.5', |
|
58 | 65 | ], |
|
59 | 66 | keywords='zstandard zstd compression', |
|
60 | 67 | ext_modules=extensions, |
|
61 | 68 | test_suite='tests', |
|
62 | 69 | ) |
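
Because the flag is plucked out of ``sys.argv`` before setuptools parses
arguments, legacy support is requested directly on the command line,
e.g. ``python setup.py --legacy install``.
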
@@ -1,64 +1,91 b'' | |||
|
1 | 1 | # Copyright (c) 2016-present, Gregory Szorc |
|
2 | 2 | # All rights reserved. |
|
3 | 3 | # |
|
4 | 4 | # This software may be modified and distributed under the terms |
|
5 | 5 | # of the BSD license. See the LICENSE file for details. |
|
6 | 6 | |
|
7 | 7 | import os |
|
8 | 8 | from distutils.extension import Extension |
|
9 | 9 | |
|
10 | 10 | |
|
11 | 11 | zstd_sources = ['zstd/%s' % p for p in ( |
|
12 | 12 | 'common/entropy_common.c', |
|
13 | 13 | 'common/error_private.c', |
|
14 | 14 | 'common/fse_decompress.c', |
|
15 | 15 | 'common/xxhash.c', |
|
16 | 16 | 'common/zstd_common.c', |
|
17 | 17 | 'compress/fse_compress.c', |
|
18 | 18 | 'compress/huf_compress.c', |
|
19 | 'compress/zbuff_compress.c', | |
|
20 | 19 | 'compress/zstd_compress.c', |
|
21 | 20 | 'decompress/huf_decompress.c', |
|
22 | 'decompress/zbuff_decompress.c', | |
|
23 | 21 | 'decompress/zstd_decompress.c', |
|
24 | 22 | 'dictBuilder/divsufsort.c', |
|
25 | 23 | 'dictBuilder/zdict.c', |
|
26 | 24 | )] |
|
27 | 25 | |
|
26 | zstd_sources_legacy = ['zstd/%s' % p for p in ( | |
|
27 | 'deprecated/zbuff_compress.c', | |
|
28 | 'deprecated/zbuff_decompress.c', | |
|
29 | 'legacy/zstd_v01.c', | |
|
30 | 'legacy/zstd_v02.c', | |
|
31 | 'legacy/zstd_v03.c', | |
|
32 | 'legacy/zstd_v04.c', | |
|
33 | 'legacy/zstd_v05.c', | |
|
34 | 'legacy/zstd_v06.c', | |
|
35 | 'legacy/zstd_v07.c' | |
|
36 | )] | |
|
28 | 37 | |
|
29 | 38 | zstd_includes = [ |
|
30 | 39 | 'c-ext', |
|
31 | 40 | 'zstd', |
|
32 | 41 | 'zstd/common', |
|
33 | 42 | 'zstd/compress', |
|
34 | 43 | 'zstd/decompress', |
|
35 | 44 | 'zstd/dictBuilder', |
|
36 | 45 | ] |
|
37 | 46 | |
|
47 | zstd_includes_legacy = [ | |
|
48 | 'zstd/deprecated', | |
|
49 | 'zstd/legacy', | |
|
50 | ] | |
|
51 | ||
|
38 | 52 | ext_sources = [ |
|
39 | 53 | 'zstd.c', |
|
40 | 54 | 'c-ext/compressiondict.c', |
|
41 | 55 | 'c-ext/compressobj.c', |
|
42 | 56 | 'c-ext/compressor.c', |
|
43 | 57 | 'c-ext/compressoriterator.c', |
|
44 | 58 | 'c-ext/compressionparams.c', |
|
45 | 59 | 'c-ext/compressionwriter.c', |
|
46 | 60 | 'c-ext/constants.c', |
|
47 | 61 | 'c-ext/decompressobj.c', |
|
48 | 62 | 'c-ext/decompressor.c', |
|
49 | 63 | 'c-ext/decompressoriterator.c', |
|
50 | 64 | 'c-ext/decompressionwriter.c', |
|
51 | 65 | 'c-ext/dictparams.c', |
|
52 | 66 | ] |
|
53 | 67 | |
|
68 | zstd_depends = [ | |
|
69 | 'c-ext/python-zstandard.h', | |
|
70 | ] | |
|
54 | 71 | |
|
55 | def get_c_extension(name='zstd'): | |
|
72 | ||
|
73 | def get_c_extension(support_legacy=False, name='zstd'): | |
|
56 | 74 | """Obtain a distutils.extension.Extension for the C extension.""" |
|
57 | 75 | root = os.path.abspath(os.path.dirname(__file__)) |
|
58 | 76 | |
|
59 | 77 | sources = [os.path.join(root, p) for p in zstd_sources + ext_sources] |
|
78 | if support_legacy: | |
|
79 | sources.extend([os.path.join(root, p) for p in zstd_sources_legacy]) | |
|
80 | ||
|
60 | 81 | include_dirs = [os.path.join(root, d) for d in zstd_includes] |
|
82 | if support_legacy: | |
|
83 | include_dirs.extend([os.path.join(root, d) for d in zstd_includes_legacy]) | |
|
84 | ||
|
85 | depends = [os.path.join(root, p) for p in zstd_depends] | |
|
61 | 86 | |
|
62 | 87 | # TODO compile with optimizations. |
|
63 | 88 | return Extension(name, sources, |
|
64 | include_dirs=include_dirs) |

89 | include_dirs=include_dirs, | |
|
90 | depends=depends, | |
|
91 | extra_compile_args=["-DZSTD_LEGACY_SUPPORT=1"] if support_legacy else []) |
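
A sketch of the reuse this module is designed for: a downstream project
that vendors these files alongside its own ``setup.py`` can build the
extension under its own module name (the project name below is
hypothetical):

    # Hypothetical downstream setup.py, with setup_zstd.py, zstd/ and
    # c-ext/ vendored next to it.
    from setuptools import setup
    import setup_zstd

    setup(
        name='myproject',
        ext_modules=[setup_zstd.get_c_extension(support_legacy=True,
                                                name='myproject.zstd')],
    )
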
@@ -1,465 +1,536 b'' | |||
|
1 | 1 | import hashlib |
|
2 | 2 | import io |
|
3 | 3 | import struct |
|
4 | 4 | import sys |
|
5 | 5 | |
|
6 | 6 | try: |
|
7 | 7 | import unittest2 as unittest |
|
8 | 8 | except ImportError: |
|
9 | 9 | import unittest |
|
10 | 10 | |
|
11 | 11 | import zstd |
|
12 | 12 | |
|
13 | 13 | from .common import OpCountingBytesIO |
|
14 | 14 | |
|
15 | 15 | |
|
16 | 16 | if sys.version_info[0] >= 3: |
|
17 | 17 | next = lambda it: it.__next__() |
|
18 | 18 | else: |
|
19 | 19 | next = lambda it: it.next() |
|
20 | 20 | |
|
21 | 21 | |
|
22 | 22 | class TestCompressor(unittest.TestCase): |
|
23 | 23 | def test_level_bounds(self): |
|
24 | 24 | with self.assertRaises(ValueError): |
|
25 | 25 | zstd.ZstdCompressor(level=0) |
|
26 | 26 | |
|
27 | 27 | with self.assertRaises(ValueError): |
|
28 | 28 | zstd.ZstdCompressor(level=23) |
|
29 | 29 | |
|
30 | 30 | |
|
31 | 31 | class TestCompressor_compress(unittest.TestCase): |
|
32 | 32 | def test_compress_empty(self): |
|
33 | 33 | cctx = zstd.ZstdCompressor(level=1) |
|
34 | 34 | cctx.compress(b'') |
|
35 | 35 | |
|
36 | 36 | cctx = zstd.ZstdCompressor(level=22) |
|
37 | 37 | cctx.compress(b'') |
|
38 | 38 | |
|
39 | 39 | def test_compress_empty(self): |
|
40 | 40 | cctx = zstd.ZstdCompressor(level=1) |
|
41 | 41 | self.assertEqual(cctx.compress(b''), |
|
42 | 42 | b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') |
|
43 | 43 | |
|
44 | # TODO should be temporary until https://github.com/facebook/zstd/issues/506 | |
|
45 | # is fixed. | |
|
46 | cctx = zstd.ZstdCompressor(write_content_size=True) | |
|
47 | with self.assertRaises(ValueError): | |
|
48 | cctx.compress(b'') | |
|
49 | ||
|
50 | cctx.compress(b'', allow_empty=True) | |
|
51 | ||
|
44 | 52 | def test_compress_large(self): |
|
45 | 53 | chunks = [] |
|
46 | 54 | for i in range(255): |
|
47 | 55 | chunks.append(struct.Struct('>B').pack(i) * 16384) |
|
48 | 56 | |
|
49 | 57 | cctx = zstd.ZstdCompressor(level=3) |
|
50 | 58 | result = cctx.compress(b''.join(chunks)) |
|
51 | 59 | self.assertEqual(len(result), 999) |
|
52 | 60 | self.assertEqual(result[0:4], b'\x28\xb5\x2f\xfd') |
|
53 | 61 | |
|
54 | 62 | def test_write_checksum(self): |
|
55 | 63 | cctx = zstd.ZstdCompressor(level=1) |
|
56 | 64 | no_checksum = cctx.compress(b'foobar') |
|
57 | 65 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
58 | 66 | with_checksum = cctx.compress(b'foobar') |
|
59 | 67 | |
|
60 | 68 | self.assertEqual(len(with_checksum), len(no_checksum) + 4) |
|
61 | 69 | |
|
62 | 70 | def test_write_content_size(self): |
|
63 | 71 | cctx = zstd.ZstdCompressor(level=1) |
|
64 | 72 | no_size = cctx.compress(b'foobar' * 256) |
|
65 | 73 | cctx = zstd.ZstdCompressor(level=1, write_content_size=True) |
|
66 | 74 | with_size = cctx.compress(b'foobar' * 256) |
|
67 | 75 | |
|
68 | 76 | self.assertEqual(len(with_size), len(no_size) + 1) |
|
69 | 77 | |
|
70 | 78 | def test_no_dict_id(self): |
|
71 | 79 | samples = [] |
|
72 | 80 | for i in range(128): |
|
73 | 81 | samples.append(b'foo' * 64) |
|
74 | 82 | samples.append(b'bar' * 64) |
|
75 | 83 | samples.append(b'foobar' * 64) |
|
76 | 84 | |
|
77 | 85 | d = zstd.train_dictionary(1024, samples) |
|
78 | 86 | |
|
79 | 87 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
80 | 88 | with_dict_id = cctx.compress(b'foobarfoobar') |
|
81 | 89 | |
|
82 | 90 | cctx = zstd.ZstdCompressor(level=1, dict_data=d, write_dict_id=False) |
|
83 | 91 | no_dict_id = cctx.compress(b'foobarfoobar') |
|
84 | 92 | |
|
85 | 93 | self.assertEqual(len(with_dict_id), len(no_dict_id) + 4) |
|
86 | 94 | |
|
87 | 95 | def test_compress_dict_multiple(self): |
|
88 | 96 | samples = [] |
|
89 | 97 | for i in range(128): |
|
90 | 98 | samples.append(b'foo' * 64) |
|
91 | 99 | samples.append(b'bar' * 64) |
|
92 | 100 | samples.append(b'foobar' * 64) |
|
93 | 101 | |
|
94 | 102 | d = zstd.train_dictionary(8192, samples) |
|
95 | 103 | |
|
96 | 104 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
97 | 105 | |
|
98 | 106 | for i in range(32): |
|
99 | 107 | cctx.compress(b'foo bar foobar foo bar foobar') |
|
100 | 108 | |
|
101 | 109 | |
|
102 | 110 | class TestCompressor_compressobj(unittest.TestCase): |
|
103 | 111 | def test_compressobj_empty(self): |
|
104 | 112 | cctx = zstd.ZstdCompressor(level=1) |
|
105 | 113 | cobj = cctx.compressobj() |
|
106 | 114 | self.assertEqual(cobj.compress(b''), b'') |
|
107 | 115 | self.assertEqual(cobj.flush(), |
|
108 | 116 | b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') |
|
109 | 117 | |
|
110 | 118 | def test_compressobj_large(self): |
|
111 | 119 | chunks = [] |
|
112 | 120 | for i in range(255): |
|
113 | 121 | chunks.append(struct.Struct('>B').pack(i) * 16384) |
|
114 | 122 | |
|
115 | 123 | cctx = zstd.ZstdCompressor(level=3) |
|
116 | 124 | cobj = cctx.compressobj() |
|
117 | 125 | |
|
118 | 126 | result = cobj.compress(b''.join(chunks)) + cobj.flush() |
|
119 | 127 | self.assertEqual(len(result), 999) |
|
120 | 128 | self.assertEqual(result[0:4], b'\x28\xb5\x2f\xfd') |
|
121 | 129 | |
|
122 | 130 | def test_write_checksum(self): |
|
123 | 131 | cctx = zstd.ZstdCompressor(level=1) |
|
124 | 132 | cobj = cctx.compressobj() |
|
125 | 133 | no_checksum = cobj.compress(b'foobar') + cobj.flush() |
|
126 | 134 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
127 | 135 | cobj = cctx.compressobj() |
|
128 | 136 | with_checksum = cobj.compress(b'foobar') + cobj.flush() |
|
129 | 137 | |
|
130 | 138 | self.assertEqual(len(with_checksum), len(no_checksum) + 4) |
|
131 | 139 | |
|
132 | 140 | def test_write_content_size(self): |
|
133 | 141 | cctx = zstd.ZstdCompressor(level=1) |
|
134 | 142 | cobj = cctx.compressobj(size=len(b'foobar' * 256)) |
|
135 | 143 | no_size = cobj.compress(b'foobar' * 256) + cobj.flush() |
|
136 | 144 | cctx = zstd.ZstdCompressor(level=1, write_content_size=True) |
|
137 | 145 | cobj = cctx.compressobj(size=len(b'foobar' * 256)) |
|
138 | 146 | with_size = cobj.compress(b'foobar' * 256) + cobj.flush() |
|
139 | 147 | |
|
140 | 148 | self.assertEqual(len(with_size), len(no_size) + 1) |
|
141 | 149 | |
|
142 | def test_compress_after_flush(self): |

150 | def test_compress_after_finished(self): | |
|
143 | 151 | cctx = zstd.ZstdCompressor() |
|
144 | 152 | cobj = cctx.compressobj() |
|
145 | 153 | |
|
146 | 154 | cobj.compress(b'foo') |
|
147 | 155 | cobj.flush() |
|
148 | 156 | |
|
149 | with self.assertRaisesRegexp(zstd.ZstdError, 'cannot call compress\(\) after flush'): |

157 | with self.assertRaisesRegexp(zstd.ZstdError, 'cannot call compress\(\) after compressor'): | |
|
150 | 158 | cobj.compress(b'foo') |
|
151 | 159 | |
|
152 | with self.assertRaisesRegexp(zstd.ZstdError, 'flush\(\) already called'): |

160 | with self.assertRaisesRegexp(zstd.ZstdError, 'compressor object already finished'): | |
|
153 | 161 | cobj.flush() |
|
154 | 162 | |
|
163 | def test_flush_block_repeated(self): | |
|
164 | cctx = zstd.ZstdCompressor(level=1) | |
|
165 | cobj = cctx.compressobj() | |
|
166 | ||
|
167 | self.assertEqual(cobj.compress(b'foo'), b'') | |
|
168 | self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), | |
|
169 | b'\x28\xb5\x2f\xfd\x00\x48\x18\x00\x00foo') | |
|
170 | self.assertEqual(cobj.compress(b'bar'), b'') | |
|
171 | # 3 byte header plus content. | |
|
172 | self.assertEqual(cobj.flush(), b'\x19\x00\x00bar') | |
|
173 | ||
|
174 | def test_flush_empty_block(self): | |
|
175 | cctx = zstd.ZstdCompressor(write_checksum=True) | |
|
176 | cobj = cctx.compressobj() | |
|
177 | ||
|
178 | cobj.compress(b'foobar') | |
|
179 | cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK) | |
|
180 | # No-op if no block is active (this is internal to zstd). | |
|
181 | self.assertEqual(cobj.flush(zstd.COMPRESSOBJ_FLUSH_BLOCK), b'') | |
|
182 | ||
|
183 | trailing = cobj.flush() | |
|
184 | # 3 bytes block header + 4 bytes frame checksum | |
|
185 | self.assertEqual(len(trailing), 7) | |
|
186 | header = trailing[0:3] | |
|
187 | self.assertEqual(header, b'\x01\x00\x00') | |
|
188 | ||
|
155 | 189 | |
|
156 | 190 | class TestCompressor_copy_stream(unittest.TestCase): |
|
157 | 191 | def test_no_read(self): |
|
158 | 192 | source = object() |
|
159 | 193 | dest = io.BytesIO() |
|
160 | 194 | |
|
161 | 195 | cctx = zstd.ZstdCompressor() |
|
162 | 196 | with self.assertRaises(ValueError): |
|
163 | 197 | cctx.copy_stream(source, dest) |
|
164 | 198 | |
|
165 | 199 | def test_no_write(self): |
|
166 | 200 | source = io.BytesIO() |
|
167 | 201 | dest = object() |
|
168 | 202 | |
|
169 | 203 | cctx = zstd.ZstdCompressor() |
|
170 | 204 | with self.assertRaises(ValueError): |
|
171 | 205 | cctx.copy_stream(source, dest) |
|
172 | 206 | |
|
173 | 207 | def test_empty(self): |
|
174 | 208 | source = io.BytesIO() |
|
175 | 209 | dest = io.BytesIO() |
|
176 | 210 | |
|
177 | 211 | cctx = zstd.ZstdCompressor(level=1) |
|
178 | 212 | r, w = cctx.copy_stream(source, dest) |
|
179 | 213 | self.assertEqual(int(r), 0) |
|
180 | 214 | self.assertEqual(w, 9) |
|
181 | 215 | |
|
182 | 216 | self.assertEqual(dest.getvalue(), |
|
183 | 217 | b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') |
|
184 | 218 | |
|
185 | 219 | def test_large_data(self): |
|
186 | 220 | source = io.BytesIO() |
|
187 | 221 | for i in range(255): |
|
188 | 222 | source.write(struct.Struct('>B').pack(i) * 16384) |
|
189 | 223 | source.seek(0) |
|
190 | 224 | |
|
191 | 225 | dest = io.BytesIO() |
|
192 | 226 | cctx = zstd.ZstdCompressor() |
|
193 | 227 | r, w = cctx.copy_stream(source, dest) |
|
194 | 228 | |
|
195 | 229 | self.assertEqual(r, 255 * 16384) |
|
196 | 230 | self.assertEqual(w, 999) |
|
197 | 231 | |
|
198 | 232 | def test_write_checksum(self): |
|
199 | 233 | source = io.BytesIO(b'foobar') |
|
200 | 234 | no_checksum = io.BytesIO() |
|
201 | 235 | |
|
202 | 236 | cctx = zstd.ZstdCompressor(level=1) |
|
203 | 237 | cctx.copy_stream(source, no_checksum) |
|
204 | 238 | |
|
205 | 239 | source.seek(0) |
|
206 | 240 | with_checksum = io.BytesIO() |
|
207 | 241 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
208 | 242 | cctx.copy_stream(source, with_checksum) |
|
209 | 243 | |
|
210 | 244 | self.assertEqual(len(with_checksum.getvalue()), |
|
211 | 245 | len(no_checksum.getvalue()) + 4) |
|
212 | 246 | |
|
213 | 247 | def test_write_content_size(self): |
|
214 | 248 | source = io.BytesIO(b'foobar' * 256) |
|
215 | 249 | no_size = io.BytesIO() |
|
216 | 250 | |
|
217 | 251 | cctx = zstd.ZstdCompressor(level=1) |
|
218 | 252 | cctx.copy_stream(source, no_size) |
|
219 | 253 | |
|
220 | 254 | source.seek(0) |
|
221 | 255 | with_size = io.BytesIO() |
|
222 | 256 | cctx = zstd.ZstdCompressor(level=1, write_content_size=True) |
|
223 | 257 | cctx.copy_stream(source, with_size) |
|
224 | 258 | |
|
225 | 259 | # Source content size is unknown, so no content size written. |
|
226 | 260 | self.assertEqual(len(with_size.getvalue()), |
|
227 | 261 | len(no_size.getvalue())) |
|
228 | 262 | |
|
229 | 263 | source.seek(0) |
|
230 | 264 | with_size = io.BytesIO() |
|
231 | 265 | cctx.copy_stream(source, with_size, size=len(source.getvalue())) |
|
232 | 266 | |
|
233 | 267 | # We specified source size, so content size header is present. |
|
234 | 268 | self.assertEqual(len(with_size.getvalue()), |
|
235 | 269 | len(no_size.getvalue()) + 1) |
|
236 | 270 | |
|
237 | 271 | def test_read_write_size(self): |
|
238 | 272 | source = OpCountingBytesIO(b'foobarfoobar') |
|
239 | 273 | dest = OpCountingBytesIO() |
|
240 | 274 | cctx = zstd.ZstdCompressor() |
|
241 | 275 | r, w = cctx.copy_stream(source, dest, read_size=1, write_size=1) |
|
242 | 276 | |
|
243 | 277 | self.assertEqual(r, len(source.getvalue())) |
|
244 | 278 | self.assertEqual(w, 21) |
|
245 | 279 | self.assertEqual(source._read_count, len(source.getvalue()) + 1) |
|
246 | 280 | self.assertEqual(dest._write_count, len(dest.getvalue())) |
|
247 | 281 | |
|
248 | 282 | |
|
249 | 283 | def compress(data, level): |
|
250 | 284 | buffer = io.BytesIO() |
|
251 | 285 | cctx = zstd.ZstdCompressor(level=level) |
|
252 | 286 | with cctx.write_to(buffer) as compressor: |
|
253 | 287 | compressor.write(data) |
|
254 | 288 | return buffer.getvalue() |
|
255 | 289 | |
|
256 | 290 | |
|
257 | 291 | class TestCompressor_write_to(unittest.TestCase): |
|
258 | 292 | def test_empty(self): |
|
259 | 293 | self.assertEqual(compress(b'', 1), |
|
260 | 294 | b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') |
|
261 | 295 | |
|
262 | 296 | def test_multiple_compress(self): |
|
263 | 297 | buffer = io.BytesIO() |
|
264 | 298 | cctx = zstd.ZstdCompressor(level=5) |
|
265 | 299 | with cctx.write_to(buffer) as compressor: |
|
266 | 300 | compressor.write(b'foo') |
|
267 | 301 | compressor.write(b'bar') |
|
268 | 302 | compressor.write(b'x' * 8192) |
|
269 | 303 | |
|
270 | 304 | result = buffer.getvalue() |
|
271 | 305 | self.assertEqual(result, |
|
272 | 306 | b'\x28\xb5\x2f\xfd\x00\x50\x75\x00\x00\x38\x66\x6f' |
|
273 | 307 | b'\x6f\x62\x61\x72\x78\x01\x00\xfc\xdf\x03\x23') |
|
274 | 308 | |
|
275 | 309 | def test_dictionary(self): |
|
276 | 310 | samples = [] |
|
277 | 311 | for i in range(128): |
|
278 | 312 | samples.append(b'foo' * 64) |
|
279 | 313 | samples.append(b'bar' * 64) |
|
280 | 314 | samples.append(b'foobar' * 64) |
|
281 | 315 | |
|
282 | 316 | d = zstd.train_dictionary(8192, samples) |
|
283 | 317 | |
|
284 | 318 | buffer = io.BytesIO() |
|
285 | 319 | cctx = zstd.ZstdCompressor(level=9, dict_data=d) |
|
286 | 320 | with cctx.write_to(buffer) as compressor: |
|
287 | 321 | compressor.write(b'foo') |
|
288 | 322 | compressor.write(b'bar') |
|
289 | 323 | compressor.write(b'foo' * 16384) |
|
290 | 324 | |
|
291 | 325 | compressed = buffer.getvalue() |
|
292 | 326 | h = hashlib.sha1(compressed).hexdigest() |
|
293 | 327 | self.assertEqual(h, '1c5bcd25181bcd8c1a73ea8773323e0056129f92') |
|
294 | 328 | |
|
295 | 329 | def test_compression_params(self): |
|
296 | 330 | params = zstd.CompressionParameters(20, 6, 12, 5, 4, 10, zstd.STRATEGY_FAST) |
|
297 | 331 | |
|
298 | 332 | buffer = io.BytesIO() |
|
299 | 333 | cctx = zstd.ZstdCompressor(compression_params=params) |
|
300 | 334 | with cctx.write_to(buffer) as compressor: |
|
301 | 335 | compressor.write(b'foo') |
|
302 | 336 | compressor.write(b'bar') |
|
303 | 337 | compressor.write(b'foobar' * 16384) |
|
304 | 338 | |
|
305 | 339 | compressed = buffer.getvalue() |
|
306 | 340 | h = hashlib.sha1(compressed).hexdigest() |
|
307 | 341 | self.assertEqual(h, '1ae31f270ed7de14235221a604b31ecd517ebd99') |
|
308 | 342 | |
|
309 | 343 | def test_write_checksum(self): |
|
310 | 344 | no_checksum = io.BytesIO() |
|
311 | 345 | cctx = zstd.ZstdCompressor(level=1) |
|
312 | 346 | with cctx.write_to(no_checksum) as compressor: |
|
313 | 347 | compressor.write(b'foobar') |
|
314 | 348 | |
|
315 | 349 | with_checksum = io.BytesIO() |
|
316 | 350 | cctx = zstd.ZstdCompressor(level=1, write_checksum=True) |
|
317 | 351 | with cctx.write_to(with_checksum) as compressor: |
|
318 | 352 | compressor.write(b'foobar') |
|
319 | 353 | |
|
320 | 354 | self.assertEqual(len(with_checksum.getvalue()), |
|
321 | 355 | len(no_checksum.getvalue()) + 4) |
|
322 | 356 | |
|
323 | 357 | def test_write_content_size(self): |
|
324 | 358 | no_size = io.BytesIO() |
|
325 | 359 | cctx = zstd.ZstdCompressor(level=1) |
|
326 | 360 | with cctx.write_to(no_size) as compressor: |
|
327 | 361 | compressor.write(b'foobar' * 256) |
|
328 | 362 | |
|
329 | 363 | with_size = io.BytesIO() |
|
330 | 364 | cctx = zstd.ZstdCompressor(level=1, write_content_size=True) |
|
331 | 365 | with cctx.write_to(with_size) as compressor: |
|
332 | 366 | compressor.write(b'foobar' * 256) |
|
333 | 367 | |
|
334 | 368 | # Source size is not known in streaming mode, so header not |
|
335 | 369 | # written. |
|
336 | 370 | self.assertEqual(len(with_size.getvalue()), |
|
337 | 371 | len(no_size.getvalue())) |
|
338 | 372 | |
|
339 | 373 | # Declaring size will write the header. |
|
340 | 374 | with_size = io.BytesIO() |
|
341 | 375 | with cctx.write_to(with_size, size=len(b'foobar' * 256)) as compressor: |
|
342 | 376 | compressor.write(b'foobar' * 256) |
|
343 | 377 | |
|
344 | 378 | self.assertEqual(len(with_size.getvalue()), |
|
345 | 379 | len(no_size.getvalue()) + 1) |
|
346 | 380 | |
|
347 | 381 | def test_no_dict_id(self): |
|
348 | 382 | samples = [] |
|
349 | 383 | for i in range(128): |
|
350 | 384 | samples.append(b'foo' * 64) |
|
351 | 385 | samples.append(b'bar' * 64) |
|
352 | 386 | samples.append(b'foobar' * 64) |
|
353 | 387 | |
|
354 | 388 | d = zstd.train_dictionary(1024, samples) |
|
355 | 389 | |
|
356 | 390 | with_dict_id = io.BytesIO() |
|
357 | 391 | cctx = zstd.ZstdCompressor(level=1, dict_data=d) |
|
358 | 392 | with cctx.write_to(with_dict_id) as compressor: |
|
359 | 393 | compressor.write(b'foobarfoobar') |
|
360 | 394 | |
|
361 | 395 | cctx = zstd.ZstdCompressor(level=1, dict_data=d, write_dict_id=False) |
|
362 | 396 | no_dict_id = io.BytesIO() |
|
363 | 397 | with cctx.write_to(no_dict_id) as compressor: |
|
364 | 398 | compressor.write(b'foobarfoobar') |
|
365 | 399 | |
|
366 | 400 | self.assertEqual(len(with_dict_id.getvalue()), |
|
367 | 401 | len(no_dict_id.getvalue()) + 4) |
|
368 | 402 | |
|
369 | 403 | def test_memory_size(self): |
|
370 | 404 | cctx = zstd.ZstdCompressor(level=3) |
|
371 | 405 | buffer = io.BytesIO() |
|
372 | 406 | with cctx.write_to(buffer) as compressor: |
|
373 | 407 | size = compressor.memory_size() |
|
374 | 408 | |
|
375 | 409 | self.assertGreater(size, 100000) |
|
376 | 410 | |
|
377 | 411 | def test_write_size(self): |
|
378 | 412 | cctx = zstd.ZstdCompressor(level=3) |
|
379 | 413 | dest = OpCountingBytesIO() |
|
380 | 414 | with cctx.write_to(dest, write_size=1) as compressor: |
|
381 | 415 | compressor.write(b'foo') |
|
382 | 416 | compressor.write(b'bar') |
|
383 | 417 | compressor.write(b'foobar') |
|
384 | 418 | |
|
385 | 419 | self.assertEqual(len(dest.getvalue()), dest._write_count) |
|
386 | 420 | |
|
421 | def test_flush_repeated(self): | |
|
422 | cctx = zstd.ZstdCompressor(level=3) | |
|
423 | dest = OpCountingBytesIO() | |
|
424 | with cctx.write_to(dest) as compressor: | |
|
425 | compressor.write(b'foo') | |
|
426 | self.assertEqual(dest._write_count, 0) | |
|
427 | compressor.flush() | |
|
428 | self.assertEqual(dest._write_count, 1) | |
|
429 | compressor.write(b'bar') | |
|
430 | self.assertEqual(dest._write_count, 1) | |
|
431 | compressor.flush() | |
|
432 | self.assertEqual(dest._write_count, 2) | |
|
433 | compressor.write(b'baz') | |
|
434 | ||
|
435 | self.assertEqual(dest._write_count, 3) | |
|
436 | ||
|
437 | def test_flush_empty_block(self): | |
|
438 | cctx = zstd.ZstdCompressor(level=3, write_checksum=True) | |
|
439 | dest = OpCountingBytesIO() | |
|
440 | with cctx.write_to(dest) as compressor: | |
|
441 | compressor.write(b'foobar' * 8192) | |
|
442 | count = dest._write_count | |
|
443 | offset = dest.tell() | |
|
444 | compressor.flush() | |
|
445 | self.assertGreater(dest._write_count, count) | |
|
446 | self.assertGreater(dest.tell(), offset) | |
|
447 | offset = dest.tell() | |
|
448 | # Ending the write here should cause an empty block to be written | |
|
449 | # to denote end of frame. | |
|
450 | ||
|
451 | trailing = dest.getvalue()[offset:] | |
|
452 | # 3 bytes block header + 4 bytes frame checksum | |
|
453 | self.assertEqual(len(trailing), 7) | |
|
454 | ||
|
455 | header = trailing[0:3] | |
|
456 | self.assertEqual(header, b'\x01\x00\x00') | |
|
457 | ||
|
387 | 458 | |
|
388 | 459 | class TestCompressor_read_from(unittest.TestCase): |
|
389 | 460 | def test_type_validation(self): |
|
390 | 461 | cctx = zstd.ZstdCompressor() |
|
391 | 462 | |
|
392 | 463 | # Object with read() works. |
|
393 | 464 | cctx.read_from(io.BytesIO()) |
|
394 | 465 | |
|
395 | 466 | # Buffer protocol works. |
|
396 | 467 | cctx.read_from(b'foobar') |
|
397 | 468 | |
|
398 | 469 | with self.assertRaisesRegexp(ValueError, 'must pass an object with a read'): |
|
399 | 470 | cctx.read_from(True) |
|
400 | 471 | |
|
401 | 472 | def test_read_empty(self): |
|
402 | 473 | cctx = zstd.ZstdCompressor(level=1) |
|
403 | 474 | |
|
404 | 475 | source = io.BytesIO() |
|
405 | 476 | it = cctx.read_from(source) |
|
406 | 477 | chunks = list(it) |
|
407 | 478 | self.assertEqual(len(chunks), 1) |
|
408 | 479 | compressed = b''.join(chunks) |
|
409 | 480 | self.assertEqual(compressed, b'\x28\xb5\x2f\xfd\x00\x48\x01\x00\x00') |
|
410 | 481 | |
|
411 | 482 | # And again with the buffer protocol. |
|
412 | 483 | it = cctx.read_from(b'') |
|
413 | 484 | chunks = list(it) |
|
414 | 485 | self.assertEqual(len(chunks), 1) |
|
415 | 486 | compressed2 = b''.join(chunks) |
|
416 | 487 | self.assertEqual(compressed2, compressed) |
|
417 | 488 | |
|
418 | 489 | def test_read_large(self): |
|
419 | 490 | cctx = zstd.ZstdCompressor(level=1) |
|
420 | 491 | |
|
421 | 492 | source = io.BytesIO() |
|
422 | 493 | source.write(b'f' * zstd.COMPRESSION_RECOMMENDED_INPUT_SIZE) |
|
423 | 494 | source.write(b'o') |
|
424 | 495 | source.seek(0) |
|
425 | 496 | |
|
426 | 497 | # Creating an iterator should not perform any compression until |
|
427 | 498 | # first read. |
|
428 | 499 | it = cctx.read_from(source, size=len(source.getvalue())) |
|
429 | 500 | self.assertEqual(source.tell(), 0) |
|
430 | 501 | |
|
431 | 502 | # We should have exactly 2 output chunks. |
|
432 | 503 | chunks = [] |
|
433 | 504 | chunk = next(it) |
|
434 | 505 | self.assertIsNotNone(chunk) |
|
435 | 506 | self.assertEqual(source.tell(), zstd.COMPRESSION_RECOMMENDED_INPUT_SIZE) |
|
436 | 507 | chunks.append(chunk) |
|
437 | 508 | chunk = next(it) |
|
438 | 509 | self.assertIsNotNone(chunk) |
|
439 | 510 | chunks.append(chunk) |
|
440 | 511 | |
|
441 | 512 | self.assertEqual(source.tell(), len(source.getvalue())) |
|
442 | 513 | |
|
443 | 514 | with self.assertRaises(StopIteration): |
|
444 | 515 | next(it) |
|
445 | 516 | |
|
446 | 517 | # And again for good measure. |
|
447 | 518 | with self.assertRaises(StopIteration): |
|
448 | 519 | next(it) |
|
449 | 520 | |
|
450 | 521 | # We should get the same output as the one-shot compression mechanism. |
|
451 | 522 | self.assertEqual(b''.join(chunks), cctx.compress(source.getvalue())) |
|
452 | 523 | |
|
453 | 524 | # Now check the buffer protocol. |
|
454 | 525 | it = cctx.read_from(source.getvalue()) |
|
455 | 526 | chunks = list(it) |
|
456 | 527 | self.assertEqual(len(chunks), 2) |
|
457 | 528 | self.assertEqual(b''.join(chunks), cctx.compress(source.getvalue())) |
|
458 | 529 | |
|
459 | 530 | def test_read_write_size(self): |
|
460 | 531 | source = OpCountingBytesIO(b'foobarfoobar') |
|
461 | 532 | cctx = zstd.ZstdCompressor(level=3) |
|
462 | 533 | for chunk in cctx.read_from(source, read_size=1, write_size=1): |
|
463 | 534 | self.assertEqual(len(chunk), 1) |
|
464 | 535 | |
|
465 | 536 | self.assertEqual(source._read_count, len(source.getvalue()) + 1) |
@@ -1,48 +1,48 b'' | |||
|
1 | 1 | from __future__ import unicode_literals |
|
2 | 2 | |
|
3 | 3 | try: |
|
4 | 4 | import unittest2 as unittest |
|
5 | 5 | except ImportError: |
|
6 | 6 | import unittest |
|
7 | 7 | |
|
8 | 8 | import zstd |
|
9 | 9 | |
|
10 | 10 | class TestModuleAttributes(unittest.TestCase): |
|
11 | 11 | def test_version(self): |
|
12 | self.assertEqual(zstd.ZSTD_VERSION, (1, 1, 1)) |

12 | self.assertEqual(zstd.ZSTD_VERSION, (1, 1, 2)) | |
|
13 | 13 | |
|
14 | 14 | def test_constants(self): |
|
15 | 15 | self.assertEqual(zstd.MAX_COMPRESSION_LEVEL, 22) |
|
16 | 16 | self.assertEqual(zstd.FRAME_HEADER, b'\x28\xb5\x2f\xfd') |
|
17 | 17 | |
|
18 | 18 | def test_hasattr(self): |
|
19 | 19 | attrs = ( |
|
20 | 20 | 'COMPRESSION_RECOMMENDED_INPUT_SIZE', |
|
21 | 21 | 'COMPRESSION_RECOMMENDED_OUTPUT_SIZE', |
|
22 | 22 | 'DECOMPRESSION_RECOMMENDED_INPUT_SIZE', |
|
23 | 23 | 'DECOMPRESSION_RECOMMENDED_OUTPUT_SIZE', |
|
24 | 24 | 'MAGIC_NUMBER', |
|
25 | 25 | 'WINDOWLOG_MIN', |
|
26 | 26 | 'WINDOWLOG_MAX', |
|
27 | 27 | 'CHAINLOG_MIN', |
|
28 | 28 | 'CHAINLOG_MAX', |
|
29 | 29 | 'HASHLOG_MIN', |
|
30 | 30 | 'HASHLOG_MAX', |
|
31 | 31 | 'HASHLOG3_MAX', |
|
32 | 32 | 'SEARCHLOG_MIN', |
|
33 | 33 | 'SEARCHLOG_MAX', |
|
34 | 34 | 'SEARCHLENGTH_MIN', |
|
35 | 35 | 'SEARCHLENGTH_MAX', |
|
36 | 36 | 'TARGETLENGTH_MIN', |
|
37 | 37 | 'TARGETLENGTH_MAX', |
|
38 | 38 | 'STRATEGY_FAST', |
|
39 | 39 | 'STRATEGY_DFAST', |
|
40 | 40 | 'STRATEGY_GREEDY', |
|
41 | 41 | 'STRATEGY_LAZY', |
|
42 | 42 | 'STRATEGY_LAZY2', |
|
43 | 43 | 'STRATEGY_BTLAZY2', |
|
44 | 44 | 'STRATEGY_BTOPT', |
|
45 | 45 | ) |
|
46 | 46 | |
|
47 | 47 | for a in attrs: |
|
48 | 48 | self.assertTrue(hasattr(zstd, a)) |
@@ -1,112 +1,136 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Gregory Szorc |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This software may be modified and distributed under the terms |
|
6 | 6 | * of the BSD license. See the LICENSE file for details. |
|
7 | 7 | */ |
|
8 | 8 | |
|
9 | 9 | /* A Python C extension for Zstandard. */ |
|
10 | 10 | |
|
11 | 11 | #include "python-zstandard.h" |
|
12 | 12 | |
|
13 | 13 | PyObject *ZstdError; |
|
14 | 14 | |
|
15 | 15 | PyDoc_STRVAR(estimate_compression_context_size__doc__, |
|
16 | 16 | "estimate_compression_context_size(compression_parameters)\n" |
|
17 | 17 | "\n" |
|
18 | 18 | "Give the amount of memory allocated for a compression context given a\n" |
|
19 | 19 | "CompressionParameters instance"); |
|
20 | 20 | |
|
21 | 21 | PyDoc_STRVAR(estimate_decompression_context_size__doc__, |
|
22 | 22 | "estimate_decompression_context_size()\n" |
|
23 | 23 | "\n" |
|
24 | 24 | "Estimate the amount of memory allocated to a decompression context.\n" |
|
25 | 25 | ); |
|
26 | 26 | |
|
27 | 27 | static PyObject* estimate_decompression_context_size(PyObject* self) { |
|
28 | 28 | return PyLong_FromSize_t(ZSTD_estimateDCtxSize()); |
|
29 | 29 | } |
|
30 | 30 | |
|
31 | 31 | PyDoc_STRVAR(get_compression_parameters__doc__, |
|
32 | 32 | "get_compression_parameters(compression_level[, source_size[, dict_size]])\n" |
|
33 | 33 | "\n" |
|
34 | 34 | "Obtains a ``CompressionParameters`` instance from a compression level and\n" |
|
35 | 35 | "optional input size and dictionary size"); |
|
36 | 36 | |
|
37 | 37 | PyDoc_STRVAR(train_dictionary__doc__, |
|
38 | 38 | "train_dictionary(dict_size, samples)\n" |
|
39 | 39 | "\n" |
|
40 | 40 | "Train a dictionary from sample data.\n" |
|
41 | 41 | "\n" |
|
42 | 42 | "A compression dictionary of size ``dict_size`` will be created from the\n" |
|
43 | 43 | "iterable of samples provided by ``samples``.\n" |
|
44 | 44 | "\n" |
|
45 | 45 | "The raw dictionary content will be returned\n"); |
|
46 | 46 | |
|
47 | 47 | static char zstd_doc[] = "Interface to zstandard"; |
|
48 | 48 | |
|
49 | 49 | static PyMethodDef zstd_methods[] = { |
|
50 | 50 | { "estimate_compression_context_size", (PyCFunction)estimate_compression_context_size, |
|
51 | 51 | METH_VARARGS, estimate_compression_context_size__doc__ }, |
|
52 | 52 | { "estimate_decompression_context_size", (PyCFunction)estimate_decompression_context_size, |
|
53 | 53 | METH_NOARGS, estimate_decompression_context_size__doc__ }, |
|
54 | 54 | { "get_compression_parameters", (PyCFunction)get_compression_parameters, |
|
55 | 55 | METH_VARARGS, get_compression_parameters__doc__ }, |
|
56 | 56 | { "train_dictionary", (PyCFunction)train_dictionary, |
|
57 | 57 | METH_VARARGS | METH_KEYWORDS, train_dictionary__doc__ }, |
|
58 | 58 | { NULL, NULL } |
|
59 | 59 | }; |
|
60 | 60 | |
|
61 | 61 | void compressobj_module_init(PyObject* mod); |
|
62 | 62 | void compressor_module_init(PyObject* mod); |
|
63 | 63 | void compressionparams_module_init(PyObject* mod); |
|
64 | 64 | void constants_module_init(PyObject* mod); |
|
65 | 65 | void dictparams_module_init(PyObject* mod); |
|
66 | 66 | void compressiondict_module_init(PyObject* mod); |
|
67 | 67 | void compressionwriter_module_init(PyObject* mod); |
|
68 | 68 | void compressoriterator_module_init(PyObject* mod); |
|
69 | 69 | void decompressor_module_init(PyObject* mod); |
|
70 | 70 | void decompressobj_module_init(PyObject* mod); |
|
71 | 71 | void decompressionwriter_module_init(PyObject* mod); |
|
72 | 72 | void decompressoriterator_module_init(PyObject* mod); |
|
73 | 73 | |
|
74 | 74 | void zstd_module_init(PyObject* m) { |
|
75 | /* python-zstandard relies on unstable zstd C API features. This means | |
|
76 | that changes in zstd may break expectations in python-zstandard. | |
|
77 | ||
|
78 | python-zstandard is distributed with a copy of the zstd sources. | |
|
79 | python-zstandard is only guaranteed to work with the bundled version | |
|
80 | of zstd. | |
|
81 | ||
|
82 | However, downstream redistributors or packagers may unbundle zstd | |
|
83 | from python-zstandard. This can result in a mismatch between zstd | |
|
84 | versions and API semantics. This essentially "voids the warranty" | |
|
85 | of python-zstandard and may cause undefined behavior. | |
|
86 | ||
|
87 | We detect this mismatch here and refuse to load the module if this | |
|
88 | scenario is detected. | |
|
89 | */ | |
|
90 | if (ZSTD_VERSION_NUMBER != 10102 || ZSTD_versionNumber() != 10102) { | |
|
91 | PyErr_SetString(PyExc_ImportError, "zstd C API mismatch; Python bindings not compiled against expected zstd version"); | |
|
92 | return; | |
|
93 | } | |
|
94 | ||
|
75 | 95 | compressionparams_module_init(m); |
|
76 | 96 | dictparams_module_init(m); |
|
77 | 97 | compressiondict_module_init(m); |
|
78 | 98 | compressobj_module_init(m); |
|
79 | 99 | compressor_module_init(m); |
|
80 | 100 | compressionwriter_module_init(m); |
|
81 | 101 | compressoriterator_module_init(m); |
|
82 | 102 | constants_module_init(m); |
|
83 | 103 | decompressor_module_init(m); |
|
84 | 104 | decompressobj_module_init(m); |
|
85 | 105 | decompressionwriter_module_init(m); |
|
86 | 106 | decompressoriterator_module_init(m); |
|
87 | 107 | } |
|
88 | 108 | |
|
89 | 109 | #if PY_MAJOR_VERSION >= 3 |
|
90 | 110 | static struct PyModuleDef zstd_module = { |
|
91 | 111 | PyModuleDef_HEAD_INIT, |
|
92 | 112 | "zstd", |
|
93 | 113 | zstd_doc, |
|
94 | 114 | -1, |
|
95 | 115 | zstd_methods |
|
96 | 116 | }; |
|
97 | 117 | |
|
98 | 118 | PyMODINIT_FUNC PyInit_zstd(void) { |
|
99 | 119 | PyObject *m = PyModule_Create(&zstd_module); |
|
100 | 120 | if (m) { |
|
101 | 121 | zstd_module_init(m); |
|
122 | if (PyErr_Occurred()) { | |
|
123 | Py_DECREF(m); | |
|
124 | m = NULL; | |
|
125 | } | |
|
102 | 126 | } |
|
103 | 127 | return m; |
|
104 | 128 | } |
|
105 | 129 | #else |
|
106 | 130 | PyMODINIT_FUNC initzstd(void) { |
|
107 | 131 | PyObject *m = Py_InitModule3("zstd", zstd_methods, zstd_doc); |
|
108 | 132 | if (m) { |
|
109 | 133 | zstd_module_init(m); |
|
110 | 134 | } |
|
111 | 135 | } |
|
112 | 136 | #endif |
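The guard added above compares ZSTD_VERSION_NUMBER, baked in from the bundled zstd.h at compile time, against ZSTD_versionNumber(), reported by whatever zstd library is actually linked at run time; 10102 encodes release 1.1.2 as major*10000 + minor*100 + release. A minimal standalone sketch of the same check, assuming only that a zstd.h is on the include path:

    /* version_check.c : refuse to proceed when header and library disagree */
    #include <stdio.h>
    #include <zstd.h>

    int main(void) {
        unsigned const compiled = ZSTD_VERSION_NUMBER;  /* version of zstd.h at build time */
        unsigned const linked   = ZSTD_versionNumber(); /* version of the library at run time */
        if (compiled != linked) {
            fprintf(stderr, "zstd version mismatch: compiled %u, linked %u\n",
                    compiled, linked);
            return 1;
        }
        return 0;
    }

The module init above applies the stricter form of this (both values must equal the expected 10102), so a repackaged extension cannot silently load against an unbundled, differently versioned zstd.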
@@ -1,414 +1,414 b'' | |||
|
1 | 1 | /* ****************************************************************** |
|
2 | 2 | bitstream |
|
3 | 3 | Part of FSE library |
|
4 | 4 | header file (to include) |
|
5 | 5 | Copyright (C) 2013-2016, Yann Collet. |
|
6 | 6 | |
|
7 | 7 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
8 | 8 | |
|
9 | 9 | Redistribution and use in source and binary forms, with or without |
|
10 | 10 | modification, are permitted provided that the following conditions are |
|
11 | 11 | met: |
|
12 | 12 | |
|
13 | 13 | * Redistributions of source code must retain the above copyright |
|
14 | 14 | notice, this list of conditions and the following disclaimer. |
|
15 | 15 | * Redistributions in binary form must reproduce the above |
|
16 | 16 | copyright notice, this list of conditions and the following disclaimer |
|
17 | 17 | in the documentation and/or other materials provided with the |
|
18 | 18 | distribution. |
|
19 | 19 | |
|
20 | 20 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
21 | 21 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
22 | 22 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
23 | 23 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
24 | 24 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
25 | 25 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
26 | 26 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
27 | 27 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
28 | 28 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
29 | 29 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
30 | 30 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
31 | 31 | |
|
32 | 32 | You can contact the author at : |
|
33 | 33 | - Source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
34 | 34 | ****************************************************************** */ |
|
35 | 35 | #ifndef BITSTREAM_H_MODULE |
|
36 | 36 | #define BITSTREAM_H_MODULE |
|
37 | 37 | |
|
38 | 38 | #if defined (__cplusplus) |
|
39 | 39 | extern "C" { |
|
40 | 40 | #endif |
|
41 | 41 | |
|
42 | 42 | |
|
43 | 43 | /* |
|
44 | 44 | * This API consists of small unitary functions, which must be inlined for best performance. |
|
45 | 45 | * Since link-time-optimization is not available for all compilers, |
|
46 | 46 | * these functions are defined into a .h to be included. |
|
47 | 47 | */ |
|
48 | 48 | |
|
49 | 49 | /*-**************************************** |
|
50 | 50 | * Dependencies |
|
51 | 51 | ******************************************/ |
|
52 | 52 | #include "mem.h" /* unaligned access routines */ |
|
53 | 53 | #include "error_private.h" /* error codes and messages */ |
|
54 | 54 | |
|
55 | 55 | |
|
56 | 56 | /*========================================= |
|
57 | 57 | * Target specific |
|
58 | 58 | =========================================*/ |
|
59 | 59 | #if defined(__BMI__) && defined(__GNUC__) |
|
60 | 60 | # include <immintrin.h> /* support for bextr (experimental) */ |
|
61 | 61 | #endif |
|
62 | 62 | |
|
63 | 63 | |
|
64 | 64 | /*-****************************************** |
|
65 | 65 | * bitStream encoding API (write forward) |
|
66 | 66 | ********************************************/ |
|
67 | 67 | /* bitStream can mix input from multiple sources. |
|
68 | 68 | * A critical property of these streams is that they encode and decode in **reverse** direction. |
|
69 | 69 | * So the first bit sequence you add will be the last to be read, like a LIFO stack. |
|
70 | 70 | */ |
|
71 | 71 | typedef struct |
|
72 | 72 | { |
|
73 | 73 | size_t bitContainer; |
|
74 | 74 | int bitPos; |
|
75 | 75 | char* startPtr; |
|
76 | 76 | char* ptr; |
|
77 | 77 | char* endPtr; |
|
78 | 78 | } BIT_CStream_t; |
|
79 | 79 | |
|
80 | 80 | MEM_STATIC size_t BIT_initCStream(BIT_CStream_t* bitC, void* dstBuffer, size_t dstCapacity); |
|
81 | 81 | MEM_STATIC void BIT_addBits(BIT_CStream_t* bitC, size_t value, unsigned nbBits); |
|
82 | 82 | MEM_STATIC void BIT_flushBits(BIT_CStream_t* bitC); |
|
83 | 83 | MEM_STATIC size_t BIT_closeCStream(BIT_CStream_t* bitC); |
|
84 | 84 | |
|
85 | 85 | /* Start with initCStream, providing the size of buffer to write into. |
|
86 | 86 | * bitStream will never write outside of this buffer. |
|
87 | 87 | * `dstCapacity` must be >= sizeof(bitD->bitContainer), otherwise @return will be an error code. |
|
88 | 88 | * |
|
89 | 89 | * bits are first added to a local register. |
|
90 | 90 | * Local register is size_t, hence 64-bits on 64-bits systems, or 32-bits on 32-bits systems. |
|
91 | 91 | * Writing data into memory is an explicit operation, performed by the flushBits function. |
|
92 | 92 | * Hence keep track how many bits are potentially stored into local register to avoid register overflow. |
|
93 | 93 | * After a flushBits, a maximum of 7 bits might still be stored into local register. |
|
94 | 94 | * |
|
95 | 95 | * Avoid storing elements of more than 24 bits if you want compatibility with 32-bits bitstream readers. |
|
96 | 96 | * |
|
97 | 97 | * Last operation is to close the bitStream. |
|
98 | 98 | * The function returns the final size of CStream in bytes. |
|
99 | 99 | * If data couldn't fit into `dstBuffer`, it will return a 0 ( == not storable) |
|
100 | 100 | */ |
|
101 | 101 | |
|
102 | 102 | |
|
103 | 103 | /*-******************************************** |
|
104 | 104 | * bitStream decoding API (read backward) |
|
105 | 105 | **********************************************/ |
|
106 | 106 | typedef struct |
|
107 | 107 | { |
|
108 | 108 | size_t bitContainer; |
|
109 | 109 | unsigned bitsConsumed; |
|
110 | 110 | const char* ptr; |
|
111 | 111 | const char* start; |
|
112 | 112 | } BIT_DStream_t; |
|
113 | 113 | |
|
114 | 114 | typedef enum { BIT_DStream_unfinished = 0, |
|
115 | 115 | BIT_DStream_endOfBuffer = 1, |
|
116 | 116 | BIT_DStream_completed = 2, |
|
117 | 117 | BIT_DStream_overflow = 3 } BIT_DStream_status; /* result of BIT_reloadDStream() */ |
|
118 | 118 | /* 1,2,4,8 would be better for bitmap combinations, but slows down performance a bit ... :( */ |
|
119 | 119 | |
|
120 | 120 | MEM_STATIC size_t BIT_initDStream(BIT_DStream_t* bitD, const void* srcBuffer, size_t srcSize); |
|
121 | 121 | MEM_STATIC size_t BIT_readBits(BIT_DStream_t* bitD, unsigned nbBits); |
|
122 | 122 | MEM_STATIC BIT_DStream_status BIT_reloadDStream(BIT_DStream_t* bitD); |
|
123 | 123 | MEM_STATIC unsigned BIT_endOfDStream(const BIT_DStream_t* bitD); |
|
124 | 124 | |
|
125 | 125 | |
|
126 | 126 | /* Start by invoking BIT_initDStream(). |
|
127 | 127 | * A chunk of the bitStream is then stored into a local register. |
|
128 | 128 | * Local register size is 64-bits on 64-bits systems, 32-bits on 32-bits systems (size_t). |
|
129 | 129 | * You can then retrieve bitFields stored into the local register, **in reverse order**. |
|
130 | 130 | * Local register is explicitly reloaded from memory by the BIT_reloadDStream() method. |
|
131 | 131 | * A reload guarantees a minimum of ((8*sizeof(bitD->bitContainer))-7) bits when its result is BIT_DStream_unfinished. |
|
132 | 132 | * Otherwise, it can be less than that, so proceed accordingly. |
|
133 | 133 | * Checking if DStream has reached its end can be performed with BIT_endOfDStream(). |
|
134 | 134 | */ |
|
135 | 135 | |
|
136 | 136 | |
|
137 | 137 | /*-**************************************** |
|
138 | 138 | * unsafe API |
|
139 | 139 | ******************************************/ |
|
140 | 140 | MEM_STATIC void BIT_addBitsFast(BIT_CStream_t* bitC, size_t value, unsigned nbBits); |
|
141 | 141 | /* faster, but works only if value is "clean", meaning all high bits above nbBits are 0 */ |
|
142 | 142 | |
|
143 | 143 | MEM_STATIC void BIT_flushBitsFast(BIT_CStream_t* bitC); |
|
144 | 144 | /* unsafe version; does not check buffer overflow */ |
|
145 | 145 | |
|
146 | 146 | MEM_STATIC size_t BIT_readBitsFast(BIT_DStream_t* bitD, unsigned nbBits); |
|
147 | 147 | /* faster, but works only if nbBits >= 1 */ |
|
148 | 148 | |
|
149 | 149 | |
|
150 | 150 | |
|
151 | 151 | /*-************************************************************** |
|
152 | 152 | * Internal functions |
|
153 | 153 | ****************************************************************/ |
|
154 | 154 | MEM_STATIC unsigned BIT_highbit32 (register U32 val) |
|
155 | 155 | { |
|
156 | 156 | # if defined(_MSC_VER) /* Visual */ |
|
157 | 157 | unsigned long r=0; |
|
158 | 158 | _BitScanReverse ( &r, val ); |
|
159 | 159 | return (unsigned) r; |
|
160 | 160 | # elif defined(__GNUC__) && (__GNUC__ >= 3) /* Use GCC Intrinsic */ |
|
161 | 161 | return 31 - __builtin_clz (val); |
|
162 | 162 | # else /* Software version */ |
|
163 | 163 | static const unsigned DeBruijnClz[32] = { 0, 9, 1, 10, 13, 21, 2, 29, 11, 14, 16, 18, 22, 25, 3, 30, 8, 12, 20, 28, 15, 17, 24, 7, 19, 27, 23, 6, 26, 5, 4, 31 }; |
|
164 | 164 | U32 v = val; |
|
165 | 165 | v |= v >> 1; |
|
166 | 166 | v |= v >> 2; |
|
167 | 167 | v |= v >> 4; |
|
168 | 168 | v |= v >> 8; |
|
169 | 169 | v |= v >> 16; |
|
170 | 170 | return DeBruijnClz[ (U32) (v * 0x07C4ACDDU) >> 27]; |
|
171 | 171 | # endif |
|
172 | 172 | } |
|
173 | 173 | |
|
174 | 174 | /*===== Local Constants =====*/ |
|
175 | 175 | static const unsigned BIT_mask[] = { 0, 1, 3, 7, 0xF, 0x1F, 0x3F, 0x7F, 0xFF, 0x1FF, 0x3FF, 0x7FF, 0xFFF, 0x1FFF, 0x3FFF, 0x7FFF, 0xFFFF, 0x1FFFF, 0x3FFFF, 0x7FFFF, 0xFFFFF, 0x1FFFFF, 0x3FFFFF, 0x7FFFFF, 0xFFFFFF, 0x1FFFFFF, 0x3FFFFFF }; /* up to 26 bits */ |
|
176 | 176 | |
|
177 | 177 | |
|
178 | 178 | /*-************************************************************** |
|
179 | 179 | * bitStream encoding |
|
180 | 180 | ****************************************************************/ |
|
181 | 181 | /*! BIT_initCStream() : |
|
182 | 182 | * `dstCapacity` must be > sizeof(void*) |
|
183 | 183 | * @return : 0 if success, |
|
184 | 184 | otherwise an error code (can be tested using ERR_isError() ) */ |
|
185 | 185 | MEM_STATIC size_t BIT_initCStream(BIT_CStream_t* bitC, void* startPtr, size_t dstCapacity) |
|
186 | 186 | { |
|
187 | 187 | bitC->bitContainer = 0; |
|
188 | 188 | bitC->bitPos = 0; |
|
189 | 189 | bitC->startPtr = (char*)startPtr; |
|
190 | 190 | bitC->ptr = bitC->startPtr; |
|
191 | 191 | bitC->endPtr = bitC->startPtr + dstCapacity - sizeof(bitC->ptr); |
|
192 | 192 | if (dstCapacity <= sizeof(bitC->ptr)) return ERROR(dstSize_tooSmall); |
|
193 | 193 | return 0; |
|
194 | 194 | } |
|
195 | 195 | |
|
196 | 196 | /*! BIT_addBits() : |
|
197 | 197 | can add up to 26 bits into `bitC`. |
|
198 | 198 | Does not check for register overflow ! */ |
|
199 | 199 | MEM_STATIC void BIT_addBits(BIT_CStream_t* bitC, size_t value, unsigned nbBits) |
|
200 | 200 | { |
|
201 | 201 | bitC->bitContainer |= (value & BIT_mask[nbBits]) << bitC->bitPos; |
|
202 | 202 | bitC->bitPos += nbBits; |
|
203 | 203 | } |
|
204 | 204 | |
|
205 | 205 | /*! BIT_addBitsFast() : |
|
206 | 206 | * works only if `value` is _clean_, meaning all high bits above nbBits are 0 */ |
|
207 | 207 | MEM_STATIC void BIT_addBitsFast(BIT_CStream_t* bitC, size_t value, unsigned nbBits) |
|
208 | 208 | { |
|
209 | 209 | bitC->bitContainer |= value << bitC->bitPos; |
|
210 | 210 | bitC->bitPos += nbBits; |
|
211 | 211 | } |
|
212 | 212 | |
|
213 | 213 | /*! BIT_flushBitsFast() : |
|
214 | 214 | * unsafe version; does not check buffer overflow */ |
|
215 | 215 | MEM_STATIC void BIT_flushBitsFast(BIT_CStream_t* bitC) |
|
216 | 216 | { |
|
217 | 217 | size_t const nbBytes = bitC->bitPos >> 3; |
|
218 | 218 | MEM_writeLEST(bitC->ptr, bitC->bitContainer); |
|
219 | 219 | bitC->ptr += nbBytes; |
|
220 | 220 | bitC->bitPos &= 7; |
|
221 | 221 | bitC->bitContainer >>= nbBytes*8; /* if bitPos >= sizeof(bitContainer)*8 --> undefined behavior */ |
|
222 | 222 | } |
|
223 | 223 | |
|
224 | 224 | /*! BIT_flushBits() : |
|
225 | 225 | * safe version; check for buffer overflow, and prevents it. |
|
226 | 226 | * note : does not signal buffer overflow. This will be revealed later on using BIT_closeCStream() */ |
|
227 | 227 | MEM_STATIC void BIT_flushBits(BIT_CStream_t* bitC) |
|
228 | 228 | { |
|
229 | 229 | size_t const nbBytes = bitC->bitPos >> 3; |
|
230 | 230 | MEM_writeLEST(bitC->ptr, bitC->bitContainer); |
|
231 | 231 | bitC->ptr += nbBytes; |
|
232 | 232 | if (bitC->ptr > bitC->endPtr) bitC->ptr = bitC->endPtr; |
|
233 | 233 | bitC->bitPos &= 7; |
|
234 | 234 | bitC->bitContainer >>= nbBytes*8; /* if bitPos >= sizeof(bitContainer)*8 --> undefined behavior */ |
|
235 | 235 | } |
|
236 | 236 | |
|
237 | 237 | /*! BIT_closeCStream() : |
|
238 | 238 | * @return : size of CStream, in bytes, |
|
239 | 239 | or 0 if it could not fit into dstBuffer */ |
|
240 | 240 | MEM_STATIC size_t BIT_closeCStream(BIT_CStream_t* bitC) |
|
241 | 241 | { |
|
242 | 242 | BIT_addBitsFast(bitC, 1, 1); /* endMark */ |
|
243 | 243 | BIT_flushBits(bitC); |
|
244 | 244 | |
|
245 | 245 | if (bitC->ptr >= bitC->endPtr) return 0; /* doesn't fit within authorized budget : cancel */ |
|
246 | 246 | |
|
247 | 247 | return (bitC->ptr - bitC->startPtr) + (bitC->bitPos > 0); |
|
248 | 248 | } |
|
249 | 249 | |
|
250 | 250 | |
|
251 | 251 | /*-******************************************************** |
|
252 | 252 | * bitStream decoding |
|
253 | 253 | **********************************************************/ |
|
254 | 254 | /*! BIT_initDStream() : |
|
255 | 255 | * Initialize a BIT_DStream_t. |
|
256 | 256 | * `bitD` : a pointer to an already allocated BIT_DStream_t structure. |
|
257 | 257 | * `srcSize` must be the *exact* size of the bitStream, in bytes. |
|
258 | 258 | * @return : size of stream (== srcSize) or an errorCode if a problem is detected |
|
259 | 259 | */ |
|
260 | 260 | MEM_STATIC size_t BIT_initDStream(BIT_DStream_t* bitD, const void* srcBuffer, size_t srcSize) |
|
261 | 261 | { |
|
262 | 262 | if (srcSize < 1) { memset(bitD, 0, sizeof(*bitD)); return ERROR(srcSize_wrong); } |
|
263 | 263 | |
|
264 | 264 | if (srcSize >= sizeof(bitD->bitContainer)) { /* normal case */ |
|
265 | 265 | bitD->start = (const char*)srcBuffer; |
|
266 | 266 | bitD->ptr = (const char*)srcBuffer + srcSize - sizeof(bitD->bitContainer); |
|
267 | 267 | bitD->bitContainer = MEM_readLEST(bitD->ptr); |
|
268 | 268 | { BYTE const lastByte = ((const BYTE*)srcBuffer)[srcSize-1]; |
|
269 | bitD->bitsConsumed = lastByte ? 8 - BIT_highbit32(lastByte) : 0; | |
|
269 | bitD->bitsConsumed = lastByte ? 8 - BIT_highbit32(lastByte) : 0; /* ensures bitsConsumed is always set */ | |
|
270 | 270 | if (lastByte == 0) return ERROR(GENERIC); /* endMark not present */ } |
|
271 | 271 | } else { |
|
272 | 272 | bitD->start = (const char*)srcBuffer; |
|
273 | 273 | bitD->ptr = bitD->start; |
|
274 | 274 | bitD->bitContainer = *(const BYTE*)(bitD->start); |
|
275 | 275 | switch(srcSize) |
|
276 | 276 | { |
|
277 | 277 | case 7: bitD->bitContainer += (size_t)(((const BYTE*)(srcBuffer))[6]) << (sizeof(bitD->bitContainer)*8 - 16); |
|
278 | 278 | case 6: bitD->bitContainer += (size_t)(((const BYTE*)(srcBuffer))[5]) << (sizeof(bitD->bitContainer)*8 - 24); |
|
279 | 279 | case 5: bitD->bitContainer += (size_t)(((const BYTE*)(srcBuffer))[4]) << (sizeof(bitD->bitContainer)*8 - 32); |
|
280 | 280 | case 4: bitD->bitContainer += (size_t)(((const BYTE*)(srcBuffer))[3]) << 24; |
|
281 | 281 | case 3: bitD->bitContainer += (size_t)(((const BYTE*)(srcBuffer))[2]) << 16; |
|
282 | 282 | case 2: bitD->bitContainer += (size_t)(((const BYTE*)(srcBuffer))[1]) << 8; |
|
283 | 283 | default:; |
|
284 | 284 | } |
|
285 | 285 | { BYTE const lastByte = ((const BYTE*)srcBuffer)[srcSize-1]; |
|
286 | 286 | bitD->bitsConsumed = lastByte ? 8 - BIT_highbit32(lastByte) : 0; |
|
287 | 287 | if (lastByte == 0) return ERROR(GENERIC); /* endMark not present */ } |
|
288 | 288 | bitD->bitsConsumed += (U32)(sizeof(bitD->bitContainer) - srcSize)*8; |
|
289 | 289 | } |
|
290 | 290 | |
|
291 | 291 | return srcSize; |
|
292 | 292 | } |
|
293 | 293 | |
|
294 | 294 | MEM_STATIC size_t BIT_getUpperBits(size_t bitContainer, U32 const start) |
|
295 | 295 | { |
|
296 | 296 | return bitContainer >> start; |
|
297 | 297 | } |
|
298 | 298 | |
|
299 | 299 | MEM_STATIC size_t BIT_getMiddleBits(size_t bitContainer, U32 const start, U32 const nbBits) |
|
300 | 300 | { |
|
301 | #if defined(__BMI__) && defined(__GNUC__) /* experimental */ | |
|
301 | #if defined(__BMI__) && defined(__GNUC__) && __GNUC__*1000+__GNUC_MINOR__ >= 4008 /* experimental */ | |
|
302 | 302 | # if defined(__x86_64__) |
|
303 | 303 | if (sizeof(bitContainer)==8) |
|
304 | 304 | return _bextr_u64(bitContainer, start, nbBits); |
|
305 | 305 | else |
|
306 | 306 | # endif |
|
307 | 307 | return _bextr_u32(bitContainer, start, nbBits); |
|
308 | 308 | #else |
|
309 | 309 | return (bitContainer >> start) & BIT_mask[nbBits]; |
|
310 | 310 | #endif |
|
311 | 311 | } |
|
312 | 312 | |
|
313 | 313 | MEM_STATIC size_t BIT_getLowerBits(size_t bitContainer, U32 const nbBits) |
|
314 | 314 | { |
|
315 | 315 | return bitContainer & BIT_mask[nbBits]; |
|
316 | 316 | } |
|
317 | 317 | |
|
318 | 318 | /*! BIT_lookBits() : |
|
319 | 319 | * Provides next n bits from local register. |
|
320 | 320 | * local register is not modified. |
|
321 | 321 | * On 32-bits, maxNbBits==24. |
|
322 | 322 | * On 64-bits, maxNbBits==56. |
|
323 | 323 | * @return : value extracted |
|
324 | 324 | */ |
|
325 | 325 | MEM_STATIC size_t BIT_lookBits(const BIT_DStream_t* bitD, U32 nbBits) |
|
326 | 326 | { |
|
327 | 327 | #if defined(__BMI__) && defined(__GNUC__) /* experimental; fails if bitD->bitsConsumed + nbBits > sizeof(bitD->bitContainer)*8 */ |
|
328 | 328 | return BIT_getMiddleBits(bitD->bitContainer, (sizeof(bitD->bitContainer)*8) - bitD->bitsConsumed - nbBits, nbBits); |
|
329 | 329 | #else |
|
330 | 330 | U32 const bitMask = sizeof(bitD->bitContainer)*8 - 1; |
|
331 | 331 | return ((bitD->bitContainer << (bitD->bitsConsumed & bitMask)) >> 1) >> ((bitMask-nbBits) & bitMask); |
|
332 | 332 | #endif |
|
333 | 333 | } |
|
334 | 334 | |
|
335 | 335 | /*! BIT_lookBitsFast() : |
|
336 | 336 | * unsafe version; works only if nbBits >= 1 */ |
|
337 | 337 | MEM_STATIC size_t BIT_lookBitsFast(const BIT_DStream_t* bitD, U32 nbBits) |
|
338 | 338 | { |
|
339 | 339 | U32 const bitMask = sizeof(bitD->bitContainer)*8 - 1; |
|
340 | 340 | return (bitD->bitContainer << (bitD->bitsConsumed & bitMask)) >> (((bitMask+1)-nbBits) & bitMask); |
|
341 | 341 | } |
|
342 | 342 | |
|
343 | 343 | MEM_STATIC void BIT_skipBits(BIT_DStream_t* bitD, U32 nbBits) |
|
344 | 344 | { |
|
345 | 345 | bitD->bitsConsumed += nbBits; |
|
346 | 346 | } |
|
347 | 347 | |
|
348 | 348 | /*! BIT_readBits() : |
|
349 | 349 | * Read (consume) next n bits from local register and update. |
|
350 | 350 | * Pay attention to not read more than nbBits contained into local register. |
|
351 | 351 | * @return : extracted value. |
|
352 | 352 | */ |
|
353 | 353 | MEM_STATIC size_t BIT_readBits(BIT_DStream_t* bitD, U32 nbBits) |
|
354 | 354 | { |
|
355 | 355 | size_t const value = BIT_lookBits(bitD, nbBits); |
|
356 | 356 | BIT_skipBits(bitD, nbBits); |
|
357 | 357 | return value; |
|
358 | 358 | } |
|
359 | 359 | |
|
360 | 360 | /*! BIT_readBitsFast() : |
|
361 | 361 | * unsafe version; works only if nbBits >= 1 */ |
|
362 | 362 | MEM_STATIC size_t BIT_readBitsFast(BIT_DStream_t* bitD, U32 nbBits) |
|
363 | 363 | { |
|
364 | 364 | size_t const value = BIT_lookBitsFast(bitD, nbBits); |
|
365 | 365 | BIT_skipBits(bitD, nbBits); |
|
366 | 366 | return value; |
|
367 | 367 | } |
|
368 | 368 | |
|
369 | 369 | /*! BIT_reloadDStream() : |
|
370 | * Refill `BIT_DStream_t` from src buffer previously defined (see BIT_initDStream() ). |
|
|
370 | * Refill `bitD` from buffer previously set in BIT_initDStream() . | |
|
371 | 371 | * This function is safe, it guarantees it will not read beyond src buffer. |
|
372 | 372 | * @return : status of `BIT_DStream_t` internal register. |
|
373 | if status == unfinished, internal register is filled with >= (sizeof(bitD->bitContainer)*8 - 7) bits */ | |
|
373 | if status == BIT_DStream_unfinished, internal register is filled with >= (sizeof(bitD->bitContainer)*8 - 7) bits */ | |
|
374 | 374 | MEM_STATIC BIT_DStream_status BIT_reloadDStream(BIT_DStream_t* bitD) |
|
375 | 375 | { |
|
376 | 376 | if (bitD->bitsConsumed > (sizeof(bitD->bitContainer)*8)) /* should not happen => corruption detected */ |
|
377 | 377 | return BIT_DStream_overflow; |
|
378 | 378 | |
|
379 | 379 | if (bitD->ptr >= bitD->start + sizeof(bitD->bitContainer)) { |
|
380 | 380 | bitD->ptr -= bitD->bitsConsumed >> 3; |
|
381 | 381 | bitD->bitsConsumed &= 7; |
|
382 | 382 | bitD->bitContainer = MEM_readLEST(bitD->ptr); |
|
383 | 383 | return BIT_DStream_unfinished; |
|
384 | 384 | } |
|
385 | 385 | if (bitD->ptr == bitD->start) { |
|
386 | 386 | if (bitD->bitsConsumed < sizeof(bitD->bitContainer)*8) return BIT_DStream_endOfBuffer; |
|
387 | 387 | return BIT_DStream_completed; |
|
388 | 388 | } |
|
389 | 389 | { U32 nbBytes = bitD->bitsConsumed >> 3; |
|
390 | 390 | BIT_DStream_status result = BIT_DStream_unfinished; |
|
391 | 391 | if (bitD->ptr - nbBytes < bitD->start) { |
|
392 | 392 | nbBytes = (U32)(bitD->ptr - bitD->start); /* ptr > start */ |
|
393 | 393 | result = BIT_DStream_endOfBuffer; |
|
394 | 394 | } |
|
395 | 395 | bitD->ptr -= nbBytes; |
|
396 | 396 | bitD->bitsConsumed -= nbBytes*8; |
|
397 | 397 | bitD->bitContainer = MEM_readLEST(bitD->ptr); /* reminder : srcSize > sizeof(bitD) */ |
|
398 | 398 | return result; |
|
399 | 399 | } |
|
400 | 400 | } |
|
401 | 401 | |
|
402 | 402 | /*! BIT_endOfDStream() : |
|
403 | 403 | * @return Tells if DStream has exactly reached its end (all bits consumed). |
|
404 | 404 | */ |
|
405 | 405 | MEM_STATIC unsigned BIT_endOfDStream(const BIT_DStream_t* DStream) |
|
406 | 406 | { |
|
407 | 407 | return ((DStream->ptr == DStream->start) && (DStream->bitsConsumed == sizeof(DStream->bitContainer)*8)); |
|
408 | 408 | } |
|
409 | 409 | |
|
410 | 410 | #if defined (__cplusplus) |
|
411 | 411 | } |
|
412 | 412 | #endif |
|
413 | 413 | |
|
414 | 414 | #endif /* BITSTREAM_H_MODULE */ |
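The comments above describe the one property of this header that is easy to misuse: streams are written forward but read in reverse, so values come back in LIFO order. A round-trip sketch, assuming this internal header and its dependencies mem.h and error_private.h are on the include path (none of this is public zstd API):

    #include <stdio.h>
    #include "bitstream.h"

    int main(void) {
        char buf[16];              /* dstCapacity must exceed sizeof(bitC->ptr) */
        BIT_CStream_t cs;
        BIT_DStream_t ds;

        BIT_initCStream(&cs, buf, sizeof(buf));
        BIT_addBits(&cs, 5, 3);    /* added first */
        BIT_addBits(&cs, 9, 4);    /* added second */
        BIT_flushBits(&cs);
        {   size_t const streamSize = BIT_closeCStream(&cs); /* 0 == did not fit */
            BIT_initDStream(&ds, buf, streamSize);
            printf("%u\n", (unsigned)BIT_readBits(&ds, 4));  /* 9 : last in, first out */
            printf("%u\n", (unsigned)BIT_readBits(&ds, 3));  /* 5 */
        }
        return 0;
    }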
@@ -1,225 +1,227 b'' | |||
|
1 | 1 | /* |
|
2 | 2 | Common functions of New Generation Entropy library |
|
3 | 3 | Copyright (C) 2016, Yann Collet. |
|
4 | 4 | |
|
5 | 5 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
6 | 6 | |
|
7 | 7 | Redistribution and use in source and binary forms, with or without |
|
8 | 8 | modification, are permitted provided that the following conditions are |
|
9 | 9 | met: |
|
10 | 10 | |
|
11 | 11 | * Redistributions of source code must retain the above copyright |
|
12 | 12 | notice, this list of conditions and the following disclaimer. |
|
13 | 13 | * Redistributions in binary form must reproduce the above |
|
14 | 14 | copyright notice, this list of conditions and the following disclaimer |
|
15 | 15 | in the documentation and/or other materials provided with the |
|
16 | 16 | distribution. |
|
17 | 17 | |
|
18 | 18 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
19 | 19 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
20 | 20 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
21 | 21 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
22 | 22 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
23 | 23 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
24 | 24 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
25 | 25 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
26 | 26 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
27 | 27 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
28 | 28 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
29 | 29 | |
|
30 | 30 | You can contact the author at : |
|
31 | 31 | - FSE+HUF source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
32 | 32 | - Public forum : https://groups.google.com/forum/#!forum/lz4c |
|
33 | 33 | *************************************************************************** */ |
|
34 | 34 | |
|
35 | 35 | /* ************************************* |
|
36 | 36 | * Dependencies |
|
37 | 37 | ***************************************/ |
|
38 | 38 | #include "mem.h" |
|
39 | 39 | #include "error_private.h" /* ERR_*, ERROR */ |
|
40 | 40 | #define FSE_STATIC_LINKING_ONLY /* FSE_MIN_TABLELOG */ |
|
41 | 41 | #include "fse.h" |
|
42 | 42 | #define HUF_STATIC_LINKING_ONLY /* HUF_TABLELOG_ABSOLUTEMAX */ |
|
43 | 43 | #include "huf.h" |
|
44 | 44 | |
|
45 | 45 | |
|
46 | 46 | /*-**************************************** |
|
47 | 47 | * FSE Error Management |
|
48 | 48 | ******************************************/ |
|
49 | 49 | unsigned FSE_isError(size_t code) { return ERR_isError(code); } |
|
50 | 50 | |
|
51 | 51 | const char* FSE_getErrorName(size_t code) { return ERR_getErrorName(code); } |
|
52 | 52 | |
|
53 | 53 | |
|
54 | 54 | /* ************************************************************** |
|
55 | 55 | * HUF Error Management |
|
56 | 56 | ****************************************************************/ |
|
57 | 57 | unsigned HUF_isError(size_t code) { return ERR_isError(code); } |
|
58 | 58 | |
|
59 | 59 | const char* HUF_getErrorName(size_t code) { return ERR_getErrorName(code); } |
|
60 | 60 | |
|
61 | 61 | |
|
62 | 62 | /*-************************************************************** |
|
63 | 63 | * FSE NCount encoding-decoding |
|
64 | 64 | ****************************************************************/ |
|
65 | 65 | static short FSE_abs(short a) { return (short)(a<0 ? -a : a); } |
|
66 | 66 | |
|
67 | 67 | size_t FSE_readNCount (short* normalizedCounter, unsigned* maxSVPtr, unsigned* tableLogPtr, |
|
68 | 68 | const void* headerBuffer, size_t hbSize) |
|
69 | 69 | { |
|
70 | 70 | const BYTE* const istart = (const BYTE*) headerBuffer; |
|
71 | 71 | const BYTE* const iend = istart + hbSize; |
|
72 | 72 | const BYTE* ip = istart; |
|
73 | 73 | int nbBits; |
|
74 | 74 | int remaining; |
|
75 | 75 | int threshold; |
|
76 | 76 | U32 bitStream; |
|
77 | 77 | int bitCount; |
|
78 | 78 | unsigned charnum = 0; |
|
79 | 79 | int previous0 = 0; |
|
80 | 80 | |
|
81 | 81 | if (hbSize < 4) return ERROR(srcSize_wrong); |
|
82 | 82 | bitStream = MEM_readLE32(ip); |
|
83 | 83 | nbBits = (bitStream & 0xF) + FSE_MIN_TABLELOG; /* extract tableLog */ |
|
84 | 84 | if (nbBits > FSE_TABLELOG_ABSOLUTE_MAX) return ERROR(tableLog_tooLarge); |
|
85 | 85 | bitStream >>= 4; |
|
86 | 86 | bitCount = 4; |
|
87 | 87 | *tableLogPtr = nbBits; |
|
88 | 88 | remaining = (1<<nbBits)+1; |
|
89 | 89 | threshold = 1<<nbBits; |
|
90 | 90 | nbBits++; |
|
91 | 91 | |
|
92 | 92 | while ((remaining>1) & (charnum<=*maxSVPtr)) { |
|
93 | 93 | if (previous0) { |
|
94 | 94 | unsigned n0 = charnum; |
|
95 | 95 | while ((bitStream & 0xFFFF) == 0xFFFF) { |
|
96 | 96 | n0 += 24; |
|
97 | 97 | if (ip < iend-5) { |
|
98 | 98 | ip += 2; |
|
99 | 99 | bitStream = MEM_readLE32(ip) >> bitCount; |
|
100 | 100 | } else { |
|
101 | 101 | bitStream >>= 16; |
|
102 | 102 | bitCount += 16; |
|
103 | 103 | } } |
|
104 | 104 | while ((bitStream & 3) == 3) { |
|
105 | 105 | n0 += 3; |
|
106 | 106 | bitStream >>= 2; |
|
107 | 107 | bitCount += 2; |
|
108 | 108 | } |
|
109 | 109 | n0 += bitStream & 3; |
|
110 | 110 | bitCount += 2; |
|
111 | 111 | if (n0 > *maxSVPtr) return ERROR(maxSymbolValue_tooSmall); |
|
112 | 112 | while (charnum < n0) normalizedCounter[charnum++] = 0; |
|
113 | 113 | if ((ip <= iend-7) || (ip + (bitCount>>3) <= iend-4)) { |
|
114 | 114 | ip += bitCount>>3; |
|
115 | 115 | bitCount &= 7; |
|
116 | 116 | bitStream = MEM_readLE32(ip) >> bitCount; |
|
117 | 117 | } else { |
|
118 | 118 | bitStream >>= 2; |
|
119 | 119 | } } |
|
120 | 120 | { short const max = (short)((2*threshold-1)-remaining); |
|
121 | 121 | short count; |
|
122 | 122 | |
|
123 | 123 | if ((bitStream & (threshold-1)) < (U32)max) { |
|
124 | 124 | count = (short)(bitStream & (threshold-1)); |
|
125 | 125 | bitCount += nbBits-1; |
|
126 | 126 | } else { |
|
127 | 127 | count = (short)(bitStream & (2*threshold-1)); |
|
128 | 128 | if (count >= threshold) count -= max; |
|
129 | 129 | bitCount += nbBits; |
|
130 | 130 | } |
|
131 | 131 | |
|
132 | 132 | count--; /* extra accuracy */ |
|
133 | 133 | remaining -= FSE_abs(count); |
|
134 | 134 | normalizedCounter[charnum++] = count; |
|
135 | 135 | previous0 = !count; |
|
136 | 136 | while (remaining < threshold) { |
|
137 | 137 | nbBits--; |
|
138 | 138 | threshold >>= 1; |
|
139 | 139 | } |
|
140 | 140 | |
|
141 | 141 | if ((ip <= iend-7) || (ip + (bitCount>>3) <= iend-4)) { |
|
142 | 142 | ip += bitCount>>3; |
|
143 | 143 | bitCount &= 7; |
|
144 | 144 | } else { |
|
145 | 145 | bitCount -= (int)(8 * (iend - 4 - ip)); |
|
146 | 146 | ip = iend - 4; |
|
147 | 147 | } |
|
148 | 148 | bitStream = MEM_readLE32(ip) >> (bitCount & 31); |
|
149 | 149 | } } /* while ((remaining>1) & (charnum<=*maxSVPtr)) */ |
|
150 | 150 | if (remaining != 1) return ERROR(corruption_detected); |
|
151 | 151 | if (bitCount > 32) return ERROR(corruption_detected); |
|
152 | 152 | *maxSVPtr = charnum-1; |
|
153 | 153 | |
|
154 | 154 | ip += (bitCount+7)>>3; |
|
155 | 155 | return ip-istart; |
|
156 | 156 | } |
|
157 | 157 | |
|
158 | 158 | |
|
159 | 159 | /*! HUF_readStats() : |
|
160 | 160 | Read compact Huffman tree, saved by HUF_writeCTable(). |
|
161 | 161 | `huffWeight` is destination buffer. |
|
162 | `rankStats` is assumed to be a table of at least HUF_TABLELOG_MAX U32. | |
|
162 | 163 | @return : size read from `src` , or an error Code . |
|
163 | 164 | Note : Needed by HUF_readCTable() and HUF_readDTableX?() . |
|
164 | 165 | */ |
|
165 | 166 | size_t HUF_readStats(BYTE* huffWeight, size_t hwSize, U32* rankStats, |
|
166 | 167 | U32* nbSymbolsPtr, U32* tableLogPtr, |
|
167 | 168 | const void* src, size_t srcSize) |
|
168 | 169 | { |
|
169 | 170 | U32 weightTotal; |
|
170 | 171 | const BYTE* ip = (const BYTE*) src; |
|
171 | 172 | size_t iSize; |
|
172 | 173 | size_t oSize; |
|
173 | 174 | |
|
174 | 175 | if (!srcSize) return ERROR(srcSize_wrong); |
|
175 | 176 | iSize = ip[0]; |
|
176 | 177 | /* memset(huffWeight, 0, hwSize); *//* is not necessary, even though some analyzer complain ... */ |
|
177 | 178 | |
|
178 | 179 | if (iSize >= 128) { /* special header */ |
|
179 | 180 | oSize = iSize - 127; |
|
180 | 181 | iSize = ((oSize+1)/2); |
|
181 | 182 | if (iSize+1 > srcSize) return ERROR(srcSize_wrong); |
|
182 | 183 | if (oSize >= hwSize) return ERROR(corruption_detected); |
|
183 | 184 | ip += 1; |
|
184 | 185 | { U32 n; |
|
185 | 186 | for (n=0; n<oSize; n+=2) { |
|
186 | 187 | huffWeight[n] = ip[n/2] >> 4; |
|
187 | 188 | huffWeight[n+1] = ip[n/2] & 15; |
|
188 | 189 | } } } |
|
189 | 190 | else { /* header compressed with FSE (normal case) */ |
|
191 | FSE_DTable fseWorkspace[FSE_DTABLE_SIZE_U32(6)]; /* 6 is max possible tableLog for HUF header (maybe even 5, to be tested) */ | |
|
190 | 192 | if (iSize+1 > srcSize) return ERROR(srcSize_wrong); |
|
191 | oSize = FSE_decompress(huffWeight, hwSize-1, ip+1, iSize); /* max (hwSize-1) values decoded, as last one is implied */ | |
|
193 | oSize = FSE_decompress_wksp(huffWeight, hwSize-1, ip+1, iSize, fseWorkspace, 6); /* max (hwSize-1) values decoded, as last one is implied */ | |
|
192 | 194 | if (FSE_isError(oSize)) return oSize; |
|
193 | 195 | } |
|
194 | 196 | |
|
195 | 197 | /* collect weight stats */ |
|
196 | memset(rankStats, 0, (HUF_TABLELOG_ABSOLUTEMAX + 1) * sizeof(U32)); |
|
|
198 | memset(rankStats, 0, (HUF_TABLELOG_MAX + 1) * sizeof(U32)); | |
|
197 | 199 | weightTotal = 0; |
|
198 | 200 | { U32 n; for (n=0; n<oSize; n++) { |
|
199 | if (huffWeight[n] >= HUF_TABLELOG_ABSOLUTEMAX) return ERROR(corruption_detected); |
|
|
201 | if (huffWeight[n] >= HUF_TABLELOG_MAX) return ERROR(corruption_detected); | |
|
200 | 202 | rankStats[huffWeight[n]]++; |
|
201 | 203 | weightTotal += (1 << huffWeight[n]) >> 1; |
|
202 | 204 | } } |
|
203 | 205 | if (weightTotal == 0) return ERROR(corruption_detected); |
|
204 | 206 | |
|
205 | 207 | /* get last non-null symbol weight (implied, total must be 2^n) */ |
|
206 | 208 | { U32 const tableLog = BIT_highbit32(weightTotal) + 1; |
|
207 | if (tableLog > HUF_TABLELOG_ABSOLUTEMAX) return ERROR(corruption_detected); |
|
|
209 | if (tableLog > HUF_TABLELOG_MAX) return ERROR(corruption_detected); | |
|
208 | 210 | *tableLogPtr = tableLog; |
|
209 | 211 | /* determine last weight */ |
|
210 | 212 | { U32 const total = 1 << tableLog; |
|
211 | 213 | U32 const rest = total - weightTotal; |
|
212 | 214 | U32 const verif = 1 << BIT_highbit32(rest); |
|
213 | 215 | U32 const lastWeight = BIT_highbit32(rest) + 1; |
|
214 | 216 | if (verif != rest) return ERROR(corruption_detected); /* last value must be a clean power of 2 */ |
|
215 | 217 | huffWeight[oSize] = (BYTE)lastWeight; |
|
216 | 218 | rankStats[lastWeight]++; |
|
217 | 219 | } } |
|
218 | 220 | |
|
219 | 221 | /* check tree construction validity */ |
|
220 | 222 | if ((rankStats[1] < 2) || (rankStats[1] & 1)) return ERROR(corruption_detected); /* by construction : at least 2 elts of rank 1, must be even */ |
|
221 | 223 | |
|
222 | 224 | /* results */ |
|
223 | 225 | *nbSymbolsPtr = (U32)(oSize+1); |
|
224 | 226 | return iSize+1; |
|
225 | 227 | } |
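FSE_readNCount() above is the decoder-side entry point of this file: it rebuilds the normalized counter table that FSE_writeNCount() saved. A usage sketch following the sizing rules its comments state; hdr and hdrSize are hypothetical inputs standing in for a real NCount header:

    #include "fse.h"

    /* Returns bytes consumed from hdr, or an FSE error code. */
    static size_t read_ncount_sketch(const void* hdr, size_t hdrSize)
    {
        short norm[256];        /* maxSymbolValue+1 cells; 256 covers the worst case */
        unsigned maxSV = 255;   /* in: largest acceptable symbol; out: actual max */
        unsigned tableLog = 0;
        size_t const r = FSE_readNCount(norm, &maxSV, &tableLog, hdr, hdrSize);
        if (FSE_isError(r)) return r;  /* e.g. srcSize_wrong when hdrSize < 4 */
        /* norm[0..maxSV] now holds the normalized counts; pass them to FSE_buildDTable() */
        return r;
    }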
@@ -1,634 +1,668 b'' | |||
|
1 | 1 | /* ****************************************************************** |
|
2 | 2 | FSE : Finite State Entropy codec |
|
3 | 3 | Public Prototypes declaration |
|
4 | 4 | Copyright (C) 2013-2016, Yann Collet. |
|
5 | 5 | |
|
6 | 6 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
7 | 7 | |
|
8 | 8 | Redistribution and use in source and binary forms, with or without |
|
9 | 9 | modification, are permitted provided that the following conditions are |
|
10 | 10 | met: |
|
11 | 11 | |
|
12 | 12 | * Redistributions of source code must retain the above copyright |
|
13 | 13 | notice, this list of conditions and the following disclaimer. |
|
14 | 14 | * Redistributions in binary form must reproduce the above |
|
15 | 15 | copyright notice, this list of conditions and the following disclaimer |
|
16 | 16 | in the documentation and/or other materials provided with the |
|
17 | 17 | distribution. |
|
18 | 18 | |
|
19 | 19 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
20 | 20 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
21 | 21 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
22 | 22 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
23 | 23 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
24 | 24 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
25 | 25 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
26 | 26 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
27 | 27 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
28 | 28 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
29 | 29 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
30 | 30 | |
|
31 | 31 | You can contact the author at : |
|
32 | 32 | - Source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
33 | 33 | ****************************************************************** */ |
|
34 | 34 | #ifndef FSE_H |
|
35 | 35 | #define FSE_H |
|
36 | 36 | |
|
37 | 37 | #if defined (__cplusplus) |
|
38 | 38 | extern "C" { |
|
39 | 39 | #endif |
|
40 | 40 | |
|
41 | 41 | |
|
42 | 42 | /*-***************************************** |
|
43 | 43 | * Dependencies |
|
44 | 44 | ******************************************/ |
|
45 | 45 | #include <stddef.h> /* size_t, ptrdiff_t */ |
|
46 | 46 | |
|
47 | 47 | |
|
48 | 48 | /*-**************************************** |
|
49 | 49 | * FSE simple functions |
|
50 | 50 | ******************************************/ |
|
51 | 51 | /*! FSE_compress() : |
|
52 | 52 | Compress content of buffer 'src', of size 'srcSize', into destination buffer 'dst'. |
|
53 | 53 | 'dst' buffer must be already allocated. Compression runs faster if dstCapacity >= FSE_compressBound(srcSize). |
|
54 | 54 | @return : size of compressed data (<= dstCapacity). |
|
55 | 55 | Special values : if return == 0, srcData is not compressible => Nothing is stored within dst !!! |
|
56 | 56 | if return == 1, srcData is a single byte symbol * srcSize times. Use RLE compression instead. |
|
57 | 57 | if FSE_isError(return), compression failed (more details using FSE_getErrorName()) |
|
58 | 58 | */ |
|
59 | 59 | size_t FSE_compress(void* dst, size_t dstCapacity, |
|
60 | 60 | const void* src, size_t srcSize); |
|
61 | 61 | |
|
62 | 62 | /*! FSE_decompress(): |
|
63 | 63 | Decompress FSE data from buffer 'cSrc', of size 'cSrcSize', |
|
64 | 64 | into already allocated destination buffer 'dst', of size 'dstCapacity'. |
|
65 | 65 | @return : size of regenerated data (<= maxDstSize), |
|
66 | 66 | or an error code, which can be tested using FSE_isError() . |
|
67 | 67 | |
|
68 | 68 | ** Important ** : FSE_decompress() does not decompress non-compressible nor RLE data !!! |
|
69 | 69 | Why ? : making this distinction requires a header. |
|
70 | 70 | Header management is intentionally delegated to the user layer, which can better manage special cases. |
|
71 | 71 | */ |
|
72 | 72 | size_t FSE_decompress(void* dst, size_t dstCapacity, |
|
73 | 73 | const void* cSrc, size_t cSrcSize); |
|
74 | 74 | |
|
75 | 75 | |
|
76 | 76 | /*-***************************************** |
|
77 | 77 | * Tool functions |
|
78 | 78 | ******************************************/ |
|
79 | 79 | size_t FSE_compressBound(size_t size); /* maximum compressed size */ |
|
80 | 80 | |
|
81 | 81 | /* Error Management */ |
|
82 | 82 | unsigned FSE_isError(size_t code); /* tells if a return value is an error code */ |
|
83 | 83 | const char* FSE_getErrorName(size_t code); /* provides error code string (useful for debugging) */ |
|
84 | 84 | |
|
85 | 85 | |
|
86 | 86 | /*-***************************************** |
|
87 | 87 | * FSE advanced functions |
|
88 | 88 | ******************************************/ |
|
89 | 89 | /*! FSE_compress2() : |
|
90 | 90 | Same as FSE_compress(), but allows the selection of 'maxSymbolValue' and 'tableLog' |
|
91 | 91 | Both parameters can be defined as '0' to mean : use default value |
|
92 | 92 | @return : size of compressed data |
|
93 | 93 | Special values : if return == 0, srcData is not compressible => Nothing is stored within cSrc !!! |
|
94 | 94 | if return == 1, srcData is a single byte symbol * srcSize times. Use RLE compression. |
|
95 | 95 | if FSE_isError(return), it's an error code. |
|
96 | 96 | */ |
|
97 | 97 | size_t FSE_compress2 (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog); |
|
98 | 98 | |
|
99 | 99 | |
|
100 | 100 | /*-***************************************** |
|
101 | 101 | * FSE detailed API |
|
102 | 102 | ******************************************/ |
|
103 | 103 | /*! |
|
104 | 104 | FSE_compress() does the following: |
|
105 | 105 | 1. count symbol occurrence from source[] into table count[] |
|
106 | 106 | 2. normalize counters so that sum(count[]) == Power_of_2 (2^tableLog) |
|
107 | 107 | 3. save normalized counters to memory buffer using writeNCount() |
|
108 | 108 | 4. build encoding table 'CTable' from normalized counters |
|
109 | 109 | 5. encode the data stream using encoding table 'CTable' |
|
110 | 110 | |
|
111 | 111 | FSE_decompress() does the following: |
|
112 | 112 | 1. read normalized counters with readNCount() |
|
113 | 113 | 2. build decoding table 'DTable' from normalized counters |
|
114 | 114 | 3. decode the data stream using decoding table 'DTable' |
|
115 | 115 | |
|
116 | 116 | The following API allows targeting specific sub-functions for advanced tasks. |
|
117 | 117 | For example, it's possible to compress several blocks using the same 'CTable', |
|
118 | 118 | or to save and provide normalized distribution using external method. |
|
119 | 119 | */ |
|
120 | 120 | |
|
121 | 121 | /* *** COMPRESSION *** */ |
|
122 | 122 | |
|
123 | 123 | /*! FSE_count(): |
|
124 | 124 | Provides the precise count of each byte within a table 'count'. |
|
125 | 125 | 'count' is a table of unsigned int, of minimum size (*maxSymbolValuePtr+1). |
|
126 | 126 | *maxSymbolValuePtr will be updated if detected smaller than initial value. |
|
127 | 127 | @return : the count of the most frequent symbol (which is not identified). |
|
128 | 128 | if return == srcSize, there is only one symbol. |
|
129 | 129 | Can also return an error code, which can be tested with FSE_isError(). */ |
|
130 | 130 | size_t FSE_count(unsigned* count, unsigned* maxSymbolValuePtr, const void* src, size_t srcSize); |
|
131 | 131 | |
|
132 | 132 | /*! FSE_optimalTableLog(): |
|
133 | 133 | dynamically downsize 'tableLog' when conditions are met. |
|
134 | 134 | It saves CPU time, by using smaller tables, while preserving or even improving compression ratio. |
|
135 | 135 | @return : recommended tableLog (necessarily <= 'maxTableLog') */ |
|
136 | 136 | unsigned FSE_optimalTableLog(unsigned maxTableLog, size_t srcSize, unsigned maxSymbolValue); |
|
137 | 137 | |
|
138 | 138 | /*! FSE_normalizeCount(): |
|
139 | 139 | normalize counts so that sum(count[]) == Power_of_2 (2^tableLog) |
|
140 | 140 | 'normalizedCounter' is a table of short, of minimum size (maxSymbolValue+1). |
|
141 | 141 | @return : tableLog, |
|
142 | 142 | or an errorCode, which can be tested using FSE_isError() */ |
|
143 | 143 | size_t FSE_normalizeCount(short* normalizedCounter, unsigned tableLog, const unsigned* count, size_t srcSize, unsigned maxSymbolValue); |
|
144 | 144 | |
|
145 | 145 | /*! FSE_NCountWriteBound(): |
|
146 | 146 | Provides the maximum possible size of an FSE normalized table, given 'maxSymbolValue' and 'tableLog'. |
|
147 | 147 | Typically useful for allocation purpose. */ |
|
148 | 148 | size_t FSE_NCountWriteBound(unsigned maxSymbolValue, unsigned tableLog); |
|
149 | 149 | |
|
150 | 150 | /*! FSE_writeNCount(): |
|
151 | 151 | Compactly save 'normalizedCounter' into 'buffer'. |
|
152 | 152 | @return : size of the compressed table, |
|
153 | 153 | or an errorCode, which can be tested using FSE_isError(). */ |
|
154 | 154 | size_t FSE_writeNCount (void* buffer, size_t bufferSize, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog); |
|
155 | 155 | |
|
156 | 156 | |
|
157 | 157 | /*! Constructor and Destructor of FSE_CTable. |
|
158 | 158 | Note that FSE_CTable size depends on 'tableLog' and 'maxSymbolValue' */ |
|
159 | 159 | typedef unsigned FSE_CTable; /* don't allocate that. It's only meant to be more restrictive than void* */ |
|
160 | 160 | FSE_CTable* FSE_createCTable (unsigned tableLog, unsigned maxSymbolValue); |
|
161 | 161 | void FSE_freeCTable (FSE_CTable* ct); |
|
162 | 162 | |
|
163 | 163 | /*! FSE_buildCTable(): |
|
164 | 164 | Builds `ct`, which must be already allocated, using FSE_createCTable(). |
|
165 | 165 | @return : 0, or an errorCode, which can be tested using FSE_isError() */ |
|
166 | 166 | size_t FSE_buildCTable(FSE_CTable* ct, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog); |
|
167 | 167 | |
|
168 | 168 | /*! FSE_compress_usingCTable(): |
|
169 | 169 | Compress `src` using `ct` into `dst` which must be already allocated. |
|
170 | 170 | @return : size of compressed data (<= `dstCapacity`), |
|
171 | 171 | or 0 if compressed data could not fit into `dst`, |
|
172 | 172 | or an errorCode, which can be tested using FSE_isError() */ |
|
173 | 173 | size_t FSE_compress_usingCTable (void* dst, size_t dstCapacity, const void* src, size_t srcSize, const FSE_CTable* ct); |
|
174 | 174 | |
|
175 | 175 | /*! |
|
176 | 176 | Tutorial : |
|
177 | 177 | ---------- |
|
178 | 178 | The first step is to count all symbols. FSE_count() does this job very fast. |
|
179 | 179 | Result will be saved into 'count', a table of unsigned int, which must be already allocated, and have 'maxSymbolValuePtr[0]+1' cells. |
|
180 | 180 | 'src' is a table of bytes of size 'srcSize'. All values within 'src' MUST be <= maxSymbolValuePtr[0] |
|
181 | 181 | maxSymbolValuePtr[0] will be updated, with its real value (necessarily <= original value) |
|
182 | 182 | FSE_count() will return the number of occurrence of the most frequent symbol. |
|
183 | 183 | This can be used to know if there is a single symbol within 'src', and to quickly evaluate its compressibility. |
|
184 | 184 | If there is an error, the function will return an ErrorCode (which can be tested using FSE_isError()). |
|
185 | 185 | |
|
186 | 186 | The next step is to normalize the frequencies. |
|
187 | 187 | FSE_normalizeCount() will ensure that sum of frequencies is == 2 ^'tableLog'. |
|
188 | 188 | It also guarantees a minimum of 1 to any Symbol with frequency >= 1. |
|
189 | 189 | You can use 'tableLog'==0 to mean "use default tableLog value". |
|
190 | 190 | If you are unsure of which tableLog value to use, you can ask FSE_optimalTableLog(), |
|
191 | 191 | which will provide the optimal valid tableLog given sourceSize, maxSymbolValue, and a user-defined maximum (0 means "default"). |
|
192 | 192 | |
|
193 | 193 | The result of FSE_normalizeCount() will be saved into a table, |
|
194 | 194 | called 'normalizedCounter', which is a table of signed short. |
|
195 | 195 | 'normalizedCounter' must be already allocated, and have at least 'maxSymbolValue+1' cells. |
|
196 | 196 | The return value is tableLog if everything proceeded as expected. |
|
197 | 197 | It is 0 if there is a single symbol within distribution. |
|
198 | 198 | If there is an error (ex: invalid tableLog value), the function will return an ErrorCode (which can be tested using FSE_isError()). |
|
199 | 199 | |
|
200 | 200 | 'normalizedCounter' can be saved in a compact manner to a memory area using FSE_writeNCount(). |
|
201 | 201 | 'buffer' must be already allocated. |
|
202 | 202 | For guaranteed success, buffer size must be at least FSE_headerBound(). |
|
203 | 203 | The result of the function is the number of bytes written into 'buffer'. |
|
204 | 204 | If there is an error, the function will return an ErrorCode (which can be tested using FSE_isError(); ex : buffer size too small). |
|
205 | 205 | |
|
206 | 206 | 'normalizedCounter' can then be used to create the compression table 'CTable'. |
|
207 | 207 | The space required by 'CTable' must be already allocated, using FSE_createCTable(). |
|
208 | 208 | You can then use FSE_buildCTable() to fill 'CTable'. |
|
209 | 209 | If there is an error, both functions will return an ErrorCode (which can be tested using FSE_isError()). |
|
210 | 210 | |
|
211 | 211 | 'CTable' can then be used to compress 'src', with FSE_compress_usingCTable(). |
|
212 | 212 | Similar to FSE_count(), the convention is that 'src' is assumed to be a table of char of size 'srcSize' |
|
213 | 213 | The function returns the size of compressed data (without header), necessarily <= `dstCapacity`. |
|
214 | 214 | If it returns '0', compressed data could not fit into 'dst'. |
|
215 | 215 | If there is an error, the function will return an ErrorCode (which can be tested using FSE_isError()). |
|
216 | 216 | */ |
|
217 | 217 | |
|
218 | 218 | |
|
219 | 219 | /* *** DECOMPRESSION *** */ |
|
220 | 220 | |
|
221 | 221 | /*! FSE_readNCount(): |
|
222 | 222 | Read compactly saved 'normalizedCounter' from 'rBuffer'. |
|
223 | 223 | @return : size read from 'rBuffer', |
|
224 | 224 | or an errorCode, which can be tested using FSE_isError(). |
|
225 | 225 | maxSymbolValuePtr[0] and tableLogPtr[0] will also be updated with their respective values */ |
|
226 | 226 | size_t FSE_readNCount (short* normalizedCounter, unsigned* maxSymbolValuePtr, unsigned* tableLogPtr, const void* rBuffer, size_t rBuffSize); |
|
227 | 227 | |
|
228 | 228 | /*! Constructor and Destructor of FSE_DTable. |
|
229 | 229 | Note that its size depends on 'tableLog' */ |
|
230 | 230 | typedef unsigned FSE_DTable; /* don't allocate that. It's just a way to be more restrictive than void* */ |
|
231 | 231 | FSE_DTable* FSE_createDTable(unsigned tableLog); |
|
232 | 232 | void FSE_freeDTable(FSE_DTable* dt); |
|
233 | 233 | |
|
234 | 234 | /*! FSE_buildDTable(): |
|
235 | 235 | Builds 'dt', which must be already allocated, using FSE_createDTable(). |
|
236 | 236 | return : 0, or an errorCode, which can be tested using FSE_isError() */ |
|
237 | 237 | size_t FSE_buildDTable (FSE_DTable* dt, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog); |
|
238 | 238 | |
|
239 | 239 | /*! FSE_decompress_usingDTable(): |
|
240 | 240 | Decompress compressed source `cSrc` of size `cSrcSize` using `dt` |
|
241 | 241 | into `dst` which must be already allocated. |
|
242 | 242 | @return : size of regenerated data (necessarily <= `dstCapacity`), |
|
243 | 243 | or an errorCode, which can be tested using FSE_isError() */ |
|
244 | 244 | size_t FSE_decompress_usingDTable(void* dst, size_t dstCapacity, const void* cSrc, size_t cSrcSize, const FSE_DTable* dt); |
|
245 | 245 | |
|
246 | 246 | /*! |
|
247 | 247 | Tutorial : |
|
248 | 248 | ---------- |
|
249 | 249 | (Note : these functions only decompress FSE-compressed blocks. |
|
250 | 250 | If block is uncompressed, use memcpy() instead |
|
251 | 251 | If block is a single repeated byte, use memset() instead ) |
|
252 | 252 | |
|
253 | 253 | The first step is to obtain the normalized frequencies of symbols. |
|
254 | 254 | This can be performed by FSE_readNCount() if it was saved using FSE_writeNCount(). |
|
255 | 255 | 'normalizedCounter' must be already allocated, and have at least 'maxSymbolValuePtr[0]+1' cells of signed short. |
|
256 | 256 | In practice, that means it's necessary to know 'maxSymbolValue' beforehand, |
|
257 | 257 | or size the table to handle worst case situations (typically 256). |
|
258 | 258 | FSE_readNCount() will provide 'tableLog' and 'maxSymbolValue'. |
|
259 | 259 | The result of FSE_readNCount() is the number of bytes read from 'rBuffer'. |
|
260 | 260 | Note that 'rBufferSize' must be at least 4 bytes, even if useful information is less than that. |
|
261 | 261 | If there is an error, the function will return an error code, which can be tested using FSE_isError(). |
|
262 | 262 | |
|
263 | 263 | The next step is to build the decompression tables 'FSE_DTable' from 'normalizedCounter'. |
|
264 | 264 | This is performed by the function FSE_buildDTable(). |
|
265 | 265 | The space required by 'FSE_DTable' must be already allocated using FSE_createDTable(). |
|
266 | 266 | If there is an error, the function will return an error code, which can be tested using FSE_isError(). |
|
267 | 267 | |
|
268 | 268 | `FSE_DTable` can then be used to decompress `cSrc`, with FSE_decompress_usingDTable(). |
|
269 | 269 | `cSrcSize` must be strictly correct, otherwise decompression will fail. |
|
270 | 270 | FSE_decompress_usingDTable() result will tell how many bytes were regenerated (<=`dstCapacity`). |
|
271 | 271 | If there is an error, the function will return an error code, which can be tested using FSE_isError(). (ex: dst buffer too small) |
|
272 | 272 | */ |
|
273 | 273 | |
|
274 | 274 | |
|
275 | 275 | #ifdef FSE_STATIC_LINKING_ONLY |
|
276 | 276 | |
|
277 | 277 | /* *** Dependency *** */ |
|
278 | 278 | #include "bitstream.h" |
|
279 | 279 | |
|
280 | 280 | |
|
281 | 281 | /* ***************************************** |
|
282 | 282 | * Static allocation |
|
283 | 283 | *******************************************/ |
|
284 | 284 | /* FSE buffer bounds */ |
|
285 | 285 | #define FSE_NCOUNTBOUND 512 |
|
286 | 286 | #define FSE_BLOCKBOUND(size) (size + (size>>7)) |
|
287 | 287 | #define FSE_COMPRESSBOUND(size) (FSE_NCOUNTBOUND + FSE_BLOCKBOUND(size)) /* Macro version, useful for static allocation */ |
|
288 | 288 | |
|
289 | /* It is possible to statically allocate FSE CTable/DTable as a table of unsigned using below macros */ |

289 | /* It is possible to statically allocate FSE CTable/DTable as a table of FSE_CTable/FSE_DTable using below macros */ | |
|
290 | 290 | #define FSE_CTABLE_SIZE_U32(maxTableLog, maxSymbolValue) (1 + (1<<(maxTableLog-1)) + ((maxSymbolValue+1)*2)) |
|
291 | 291 | #define FSE_DTABLE_SIZE_U32(maxTableLog) (1 + (1<<maxTableLog)) |
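
For instance, tables sized for tableLog == 12 (an arbitrary example value) can live in static storage, with no FSE_createDTable()/malloc() call:

FSE_CTable ct[FSE_CTABLE_SIZE_U32(12, 255)];   /* compression table, maxSymbolValue == 255 */
FSE_DTable dt[FSE_DTABLE_SIZE_U32(12)];        /* decompression table */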
|
292 | 292 | |
|
293 | 293 | |
|
294 | 294 | /* ***************************************** |
|
295 | 295 | * FSE advanced API |
|
296 | 296 | *******************************************/ |
|
297 | /* FSE_count_wksp() : | |
|
298 | * Same as FSE_count(), but using an externally provided scratch buffer. | |
|
299 | * `workSpace` size must be table of >= `1024` unsigned | |
|
300 | */ | |
|
301 | size_t FSE_count_wksp(unsigned* count, unsigned* maxSymbolValuePtr, | |
|
302 | const void* source, size_t sourceSize, unsigned* workSpace); | |
|
303 | ||
|
304 | /** FSE_countFast() : | |
|
305 | * same as FSE_count(), but blindly trusts that all byte values within src are <= *maxSymbolValuePtr | |
|
306 | */ | |
|
297 | 307 | size_t FSE_countFast(unsigned* count, unsigned* maxSymbolValuePtr, const void* src, size_t srcSize); |
|
298 | /**< same as FSE_count(), but blindly trusts that all byte values within src are <= *maxSymbolValuePtr */ | |
|
308 | ||
|
309 | /* FSE_countFast_wksp() : | |
|
310 | * Same as FSE_countFast(), but using an externally provided scratch buffer. | |
|
311 | * `workSpace` must be a table of minimum `1024` unsigned | |
|
312 | */ | |
|
313 | size_t FSE_countFast_wksp(unsigned* count, unsigned* maxSymbolValuePtr, const void* src, size_t srcSize, unsigned* workSpace); | |
|
314 | ||
|
315 | /*! FSE_count_simple | |
|
316 | * Same as FSE_countFast(), but does not use any additional memory (not even on stack). | |
|
317 | * This function is unsafe, and will segfault if any value within `src` is `> *maxSymbolValuePtr` (presuming it's also the size of `count`). | |
|
318 | */ | |
|
319 | size_t FSE_count_simple(unsigned* count, unsigned* maxSymbolValuePtr, const void* src, size_t srcSize); | |
|
320 | ||
|
321 | ||
|
299 | 322 | |
|
300 | 323 | unsigned FSE_optimalTableLog_internal(unsigned maxTableLog, size_t srcSize, unsigned maxSymbolValue, unsigned minus); |
|
301 | 324 | /**< same as FSE_optimalTableLog(), which used `minus==2` */ |
|
302 | 325 | |
|
326 | /* FSE_compress_wksp() : | |
|
327 | * Same as FSE_compress2(), but using an externally allocated scratch buffer (`workSpace`). | |
|
328 | * FSE_WKSP_SIZE_U32() provides the minimum size required for `workSpace` as a table of FSE_CTable. | |
|
329 | */ | |
|
330 | #define FSE_WKSP_SIZE_U32(maxTableLog, maxSymbolValue) ( FSE_CTABLE_SIZE_U32(maxTableLog, maxSymbolValue) + (1<<((maxTableLog>2)?(maxTableLog-2):0)) ) | |
|
331 | size_t FSE_compress_wksp (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog, void* workSpace, size_t wkspSize); | |
|
332 | ||
|
303 | 333 | size_t FSE_buildCTable_raw (FSE_CTable* ct, unsigned nbBits); |
|
304 | /**< build a fake FSE_CTable, designed |

334 | /**< build a fake FSE_CTable, designed for a flat distribution, where each symbol uses nbBits */ | |
|
305 | 335 | |
|
306 | 336 | size_t FSE_buildCTable_rle (FSE_CTable* ct, unsigned char symbolValue); |
|
307 | 337 | /**< build a fake FSE_CTable, designed to compress always the same symbolValue */ |
|
308 | 338 | |
|
339 | /* FSE_buildCTable_wksp() : | |
|
340 | * Same as FSE_buildCTable(), but using an externally allocated scratch buffer (`workSpace`). | |
|
341 | * `wkspSize` must be >= `(1<<tableLog)`. | |
|
342 | */ | |
|
343 | size_t FSE_buildCTable_wksp(FSE_CTable* ct, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog, void* workSpace, size_t wkspSize); | |
|
344 | ||
|
309 | 345 | size_t FSE_buildDTable_raw (FSE_DTable* dt, unsigned nbBits); |
|
310 | /**< build a fake FSE_DTable, designed to read a |

346 | /**< build a fake FSE_DTable, designed to read a flat distribution where each symbol uses nbBits */ | |
|
311 | 347 | |
|
312 | 348 | size_t FSE_buildDTable_rle (FSE_DTable* dt, unsigned char symbolValue); |
|
313 | 349 | /**< build a fake FSE_DTable, designed to always generate the same symbolValue */ |
|
314 | 350 | |
|
351 | size_t FSE_decompress_wksp(void* dst, size_t dstCapacity, const void* cSrc, size_t cSrcSize, FSE_DTable* workSpace, unsigned maxLog); | |
|
352 | /**< same as FSE_decompress(), using an externally allocated `workSpace` produced with `FSE_DTABLE_SIZE_U32(maxLog)` */ | |
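
This lets a caller decode without touching the heap; a sketch (dst, dstCapacity, cSrc and cSrcSize are assumed in scope, and 12 is an arbitrary upper bound on tableLog):

{   FSE_DTable wksp[FSE_DTABLE_SIZE_U32(12)];   /* room for any tableLog <= 12 */
    size_t const r = FSE_decompress_wksp(dst, dstCapacity, cSrc, cSrcSize, wksp, 12);
    if (FSE_isError(r)) { /* e.g. tableLog_tooLarge if the frame needs a bigger table */ }
}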
|
353 | ||
|
315 | 354 | |
|
316 | 355 | /* ***************************************** |
|
317 | 356 | * FSE symbol compression API |
|
318 | 357 | *******************************************/ |
|
319 | 358 | /*! |
|
320 | 359 | This API consists of small unitary functions, which highly benefit from being inlined. |
|
321 | You will want to enable link-time-optimization to ensure these functions are properly inlined in your binary. | |
|
322 | Visual seems to do it automatically. | |
|
323 | For gcc or clang, you'll need to add -flto flag at compilation and linking stages. | |
|
324 | If none of these solutions is applicable, include "fse.c" directly. | |
|
360 | Hence their bodies are included in the next section. |
|
325 | 361 | */ |
|
326 | typedef struct | |
|
327 | { | |
|
362 | typedef struct { | |
|
328 | 363 | ptrdiff_t value; |
|
329 | 364 | const void* stateTable; |
|
330 | 365 | const void* symbolTT; |
|
331 | 366 | unsigned stateLog; |
|
332 | 367 | } FSE_CState_t; |
|
333 | 368 | |
|
334 | 369 | static void FSE_initCState(FSE_CState_t* CStatePtr, const FSE_CTable* ct); |
|
335 | 370 | |
|
336 | 371 | static void FSE_encodeSymbol(BIT_CStream_t* bitC, FSE_CState_t* CStatePtr, unsigned symbol); |
|
337 | 372 | |
|
338 | 373 | static void FSE_flushCState(BIT_CStream_t* bitC, const FSE_CState_t* CStatePtr); |
|
339 | 374 | |
|
340 | 375 | /**< |
|
341 | 376 | These functions are inner components of FSE_compress_usingCTable(). |
|
342 | 377 | They allow the creation of custom streams, mixing multiple tables and bit sources. |
|
343 | 378 | |
|
344 | 379 | A key property to keep in mind is that encoding and decoding are done **in reverse direction**. |
|
345 | 380 | So the first symbol you will encode is the last you will decode, like a LIFO stack. |
|
346 | 381 | |
|
347 | 382 | You will need a few variables to track your CStream. They are : |
|
348 | 383 | |
|
349 | 384 | FSE_CTable ct; // Provided by FSE_buildCTable() |
|
350 | 385 | BIT_CStream_t bitStream; // bitStream tracking structure |
|
351 | 386 | FSE_CState_t state; // State tracking structure (can have several) |
|
352 | 387 | |
|
353 | 388 | |
|
354 | 389 | The first thing to do is to init bitStream and state. |
|
355 | 390 | size_t errorCode = BIT_initCStream(&bitStream, dstBuffer, maxDstSize); |
|
356 | 391 | FSE_initCState(&state, ct); |
|
357 | 392 | |
|
358 | 393 | Note that BIT_initCStream() can produce an error code, so its result should be tested, using FSE_isError(); |
|
359 | 394 | You can then encode your input data, byte after byte. |
|
360 | 395 | FSE_encodeSymbol() outputs a maximum of 'tableLog' bits at a time. |
|
361 | 396 | Remember decoding will be done in reverse direction. |
|
362 | 397 | FSE_encodeByte(&bitStream, &state, symbol); |
|
363 | 398 | |
|
364 | 399 | At any time, you can also add any bit sequence. |
|
365 | 400 | Note : maximum allowed nbBits is 25, for compatibility with 32-bits decoders |
|
366 | 401 | BIT_addBits(&bitStream, bitField, nbBits); |
|
367 | 402 | |
|
368 | 403 | The above methods don't commit data to memory, they just store it into local register, for speed. |
|
369 | 404 | Local register size is 64-bits on 64-bits systems, 32-bits on 32-bits systems (size_t). |
|
370 | 405 | Writing data to memory is a manual operation, performed by the flushBits function. |
|
371 | 406 | BIT_flushBits(&bitStream); |
|
372 | 407 | |
|
373 | 408 | Your last FSE encoding operation shall be to flush your last state value(s). |
|
374 | 409 | FSE_flushState(&bitStream, &state); |
|
375 | 410 | |
|
376 | 411 | Finally, you must close the bitStream. |
|
377 | 412 | The function returns the size of CStream in bytes. |
|
378 | 413 | If data couldn't fit into dstBuffer, it will return a 0 ( == not compressible) |
|
379 | 414 | If there is an error, it returns an errorCode (which can be tested using FSE_isError()). |
|
380 | 415 | size_t size = BIT_closeCStream(&bitStream); |
|
381 | 416 | */ |
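
Condensed into one function, the fragments above give a single-state encoder skeleton. This is a sketch, not the library's implementation: FSE_compress_usingCTable() additionally interleaves two states and flushes less often.

static size_t fse_encode_block(void* dst, size_t dstCapacity,
                               const unsigned char* src, size_t srcSize,
                               const FSE_CTable* ct)
{
    BIT_CStream_t bitStream;
    FSE_CState_t  state;
    size_t n = srcSize;

    {   size_t const initError = BIT_initCStream(&bitStream, dst, dstCapacity);
        if (FSE_isError(initError)) return initError;
    }
    FSE_initCState(&state, ct);

    while (n > 0) {                       /* encode backwards : decoding reads forward */
        n--;
        FSE_encodeSymbol(&bitStream, &state, src[n]);
        BIT_flushBits(&bitStream);        /* commit the local register to memory */
    }

    FSE_flushCState(&bitStream, &state);  /* last step : flush the final state value */
    return BIT_closeCStream(&bitStream);  /* 0 == not compressible */
}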
|
382 | 417 | |
|
383 | 418 | |
|
384 | 419 | /* ***************************************** |
|
385 | 420 | * FSE symbol decompression API |
|
386 | 421 | *******************************************/ |
|
387 | typedef struct | |
|
388 | { | |
|
422 | typedef struct { | |
|
389 | 423 | size_t state; |
|
390 | 424 | const void* table; /* precise table may vary, depending on U16 */ |
|
391 | 425 | } FSE_DState_t; |
|
392 | 426 | |
|
393 | 427 | |
|
394 | 428 | static void FSE_initDState(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD, const FSE_DTable* dt); |
|
395 | 429 | |
|
396 | 430 | static unsigned char FSE_decodeSymbol(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD); |
|
397 | 431 | |
|
398 | 432 | static unsigned FSE_endOfDState(const FSE_DState_t* DStatePtr); |
|
399 | 433 | |
|
400 | 434 | /**< |
|
401 | 435 | Let's now decompose FSE_decompress_usingDTable() into its unitary components. |
|
402 | 436 | You will decode FSE-encoded symbols from the bitStream, |
|
403 | 437 | and also any other bitFields you put in, **in reverse order**. |
|
404 | 438 | |
|
405 | 439 | You will need a few variables to track your bitStream. They are : |
|
406 | 440 | |
|
407 | 441 | BIT_DStream_t DStream; // Stream context |
|
408 | 442 | FSE_DState_t DState; // State context. Multiple ones are possible |
|
409 | 443 | FSE_DTable* DTablePtr; // Decoding table, provided by FSE_buildDTable() |
|
410 | 444 | |
|
411 | 445 | The first thing to do is to init the bitStream. |
|
412 | 446 | errorCode = BIT_initDStream(&DStream, srcBuffer, srcSize); |
|
413 | 447 | |
|
414 | 448 | You should then retrieve your initial state(s) |
|
415 | 449 | (in reverse flushing order if you have several ones) : |
|
416 | 450 | errorCode = FSE_initDState(&DState, &DStream, DTablePtr); |
|
417 | 451 | |
|
418 | 452 | You can then decode your data, symbol after symbol. |
|
419 | 453 | For information the maximum number of bits read by FSE_decodeSymbol() is 'tableLog'. |
|
420 | 454 | Keep in mind that symbols are decoded in reverse order, like a LIFO stack (last in, first out). |
|
421 | 455 | unsigned char symbol = FSE_decodeSymbol(&DState, &DStream); |
|
422 | 456 | |
|
423 | 457 | You can retrieve any bitfield you eventually stored into the bitStream (in reverse order) |
|
424 | 458 | Note : maximum allowed nbBits is 25, for 32-bits compatibility |
|
425 | 459 | size_t bitField = BIT_readBits(&DStream, nbBits); |
|
426 | 460 | |
|
427 | 461 | All above operations only read from local register (which size depends on size_t). |
|
428 | 462 | Refueling the register from memory is manually performed by the reload method. |
|
429 | 463 | endSignal = FSE_reloadDStream(&DStream); |
|
430 | 464 | |
|
431 | 465 | BIT_reloadDStream() result tells if there is still some more data to read from DStream. |
|
432 | 466 | BIT_DStream_unfinished : there is still some data left into the DStream. |
|
433 | 467 | BIT_DStream_endOfBuffer : Dstream reached end of buffer. Its container may no longer be completely filled. |
|
434 | 468 | BIT_DStream_completed : Dstream reached its exact end, corresponding in general to decompression completed. |
|
435 | 469 | BIT_DStream_tooFar : Dstream went too far. Decompression result is corrupted. |
|
436 | 470 | |
|
437 | 471 | When reaching end of buffer (BIT_DStream_endOfBuffer), progress slowly, notably if you decode multiple symbols per loop, |
|
438 | 472 | to properly detect the exact end of stream. |
|
439 | 473 | After each decoded symbol, check if DStream is fully consumed using this simple test : |
|
440 | 474 | BIT_reloadDStream(&DStream) >= BIT_DStream_completed |
|
441 | 475 | |
|
442 | 476 | When it's done, verify decompression is fully completed, by checking both DStream and the relevant states. |
|
443 | 477 | Checking if DStream has reached its end is performed by : |
|
444 | 478 | BIT_endOfDStream(&DStream); |
|
445 | 479 | Check also the states. There might be some symbols left there, if some high probability ones (>50%) are possible. |
|
446 | 480 | FSE_endOfDState(&DState); |
|
447 | 481 | */ |
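
As a condensed counterpart, a single-state decode loop built from these primitives might look like the sketch below; it regenerates exactly dstSize symbols (matching the encoder sketch above), then verifies both termination conditions:

static size_t fse_decode_block(unsigned char* dst, size_t dstSize,
                               const void* cSrc, size_t cSrcSize,
                               const FSE_DTable* dt)
{
    BIT_DStream_t bitD;
    FSE_DState_t  state;
    size_t i;

    {   size_t const initError = BIT_initDStream(&bitD, cSrc, cSrcSize);
        if (FSE_isError(initError)) return initError;
    }
    FSE_initDState(&state, &bitD, dt);

    for (i = 0; i < dstSize; i++) {
        dst[i] = FSE_decodeSymbol(&state, &bitD);   /* symbols come back in forward order */
        BIT_reloadDStream(&bitD);                   /* refill the local register */
    }

    /* both the bitstream and the state must be exactly consumed */
    if (!BIT_endOfDStream(&bitD) || !FSE_endOfDState(&state))
        return (size_t)-1;   /* corruption; library code would return an ERROR() code here */
    return dstSize;
}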
|
448 | 482 | |
|
449 | 483 | |
|
450 | 484 | /* ***************************************** |
|
451 | 485 | * FSE unsafe API |
|
452 | 486 | *******************************************/ |
|
453 | 487 | static unsigned char FSE_decodeSymbolFast(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD); |
|
454 | 488 | /* faster, but works only if nbBits is always >= 1 (otherwise, result will be corrupted) */ |
|
455 | 489 | |
|
456 | 490 | |
|
457 | 491 | /* ***************************************** |
|
458 | 492 | * Implementation of inlined functions |
|
459 | 493 | *******************************************/ |
|
460 | 494 | typedef struct { |
|
461 | 495 | int deltaFindState; |
|
462 | 496 | U32 deltaNbBits; |
|
463 | 497 | } FSE_symbolCompressionTransform; /* total 8 bytes */ |
|
464 | 498 | |
|
465 | 499 | MEM_STATIC void FSE_initCState(FSE_CState_t* statePtr, const FSE_CTable* ct) |
|
466 | 500 | { |
|
467 | 501 | const void* ptr = ct; |
|
468 | 502 | const U16* u16ptr = (const U16*) ptr; |
|
469 | 503 | const U32 tableLog = MEM_read16(ptr); |
|
470 | 504 | statePtr->value = (ptrdiff_t)1<<tableLog; |
|
471 | 505 | statePtr->stateTable = u16ptr+2; |
|
472 | 506 | statePtr->symbolTT = ((const U32*)ct + 1 + (tableLog ? (1<<(tableLog-1)) : 1)); |
|
473 | 507 | statePtr->stateLog = tableLog; |
|
474 | 508 | } |
|
475 | 509 | |
|
476 | 510 | |
|
477 | 511 | /*! FSE_initCState2() : |
|
478 | 512 | * Same as FSE_initCState(), but the first symbol to include (which will be the last to be read) |
|
479 | 513 | * uses the smallest state value possible, saving the cost of this symbol */ |
|
480 | 514 | MEM_STATIC void FSE_initCState2(FSE_CState_t* statePtr, const FSE_CTable* ct, U32 symbol) |
|
481 | 515 | { |
|
482 | 516 | FSE_initCState(statePtr, ct); |
|
483 | 517 | { const FSE_symbolCompressionTransform symbolTT = ((const FSE_symbolCompressionTransform*)(statePtr->symbolTT))[symbol]; |
|
484 | 518 | const U16* stateTable = (const U16*)(statePtr->stateTable); |
|
485 | 519 | U32 nbBitsOut = (U32)((symbolTT.deltaNbBits + (1<<15)) >> 16); |
|
486 | 520 | statePtr->value = (nbBitsOut << 16) - symbolTT.deltaNbBits; |
|
487 | 521 | statePtr->value = stateTable[(statePtr->value >> nbBitsOut) + symbolTT.deltaFindState]; |
|
488 | 522 | } |
|
489 | 523 | } |
|
490 | 524 | |
|
491 | 525 | MEM_STATIC void FSE_encodeSymbol(BIT_CStream_t* bitC, FSE_CState_t* statePtr, U32 symbol) |
|
492 | 526 | { |
|
493 | 527 | const FSE_symbolCompressionTransform symbolTT = ((const FSE_symbolCompressionTransform*)(statePtr->symbolTT))[symbol]; |
|
494 | 528 | const U16* const stateTable = (const U16*)(statePtr->stateTable); |
|
495 | 529 | U32 nbBitsOut = (U32)((statePtr->value + symbolTT.deltaNbBits) >> 16); |
|
496 | 530 | BIT_addBits(bitC, statePtr->value, nbBitsOut); |
|
497 | 531 | statePtr->value = stateTable[ (statePtr->value >> nbBitsOut) + symbolTT.deltaFindState]; |
|
498 | 532 | } |
|
499 | 533 | |
|
500 | 534 | MEM_STATIC void FSE_flushCState(BIT_CStream_t* bitC, const FSE_CState_t* statePtr) |
|
501 | 535 | { |
|
502 | 536 | BIT_addBits(bitC, statePtr->value, statePtr->stateLog); |
|
503 | 537 | BIT_flushBits(bitC); |
|
504 | 538 | } |
|
505 | 539 | |
|
506 | 540 | |
|
507 | 541 | /* ====== Decompression ====== */ |
|
508 | 542 | |
|
509 | 543 | typedef struct { |
|
510 | 544 | U16 tableLog; |
|
511 | 545 | U16 fastMode; |
|
512 | 546 | } FSE_DTableHeader; /* sizeof U32 */ |
|
513 | 547 | |
|
514 | 548 | typedef struct |
|
515 | 549 | { |
|
516 | 550 | unsigned short newState; |
|
517 | 551 | unsigned char symbol; |
|
518 | 552 | unsigned char nbBits; |
|
519 | 553 | } FSE_decode_t; /* size == U32 */ |
|
520 | 554 | |
|
521 | 555 | MEM_STATIC void FSE_initDState(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD, const FSE_DTable* dt) |
|
522 | 556 | { |
|
523 | 557 | const void* ptr = dt; |
|
524 | 558 | const FSE_DTableHeader* const DTableH = (const FSE_DTableHeader*)ptr; |
|
525 | 559 | DStatePtr->state = BIT_readBits(bitD, DTableH->tableLog); |
|
526 | 560 | BIT_reloadDStream(bitD); |
|
527 | 561 | DStatePtr->table = dt + 1; |
|
528 | 562 | } |
|
529 | 563 | |
|
530 | 564 | MEM_STATIC BYTE FSE_peekSymbol(const FSE_DState_t* DStatePtr) |
|
531 | 565 | { |
|
532 | 566 | FSE_decode_t const DInfo = ((const FSE_decode_t*)(DStatePtr->table))[DStatePtr->state]; |
|
533 | 567 | return DInfo.symbol; |
|
534 | 568 | } |
|
535 | 569 | |
|
536 | 570 | MEM_STATIC void FSE_updateState(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD) |
|
537 | 571 | { |
|
538 | 572 | FSE_decode_t const DInfo = ((const FSE_decode_t*)(DStatePtr->table))[DStatePtr->state]; |
|
539 | 573 | U32 const nbBits = DInfo.nbBits; |
|
540 | 574 | size_t const lowBits = BIT_readBits(bitD, nbBits); |
|
541 | 575 | DStatePtr->state = DInfo.newState + lowBits; |
|
542 | 576 | } |
|
543 | 577 | |
|
544 | 578 | MEM_STATIC BYTE FSE_decodeSymbol(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD) |
|
545 | 579 | { |
|
546 | 580 | FSE_decode_t const DInfo = ((const FSE_decode_t*)(DStatePtr->table))[DStatePtr->state]; |
|
547 | 581 | U32 const nbBits = DInfo.nbBits; |
|
548 | 582 | BYTE const symbol = DInfo.symbol; |
|
549 | 583 | size_t const lowBits = BIT_readBits(bitD, nbBits); |
|
550 | 584 | |
|
551 | 585 | DStatePtr->state = DInfo.newState + lowBits; |
|
552 | 586 | return symbol; |
|
553 | 587 | } |
|
554 | 588 | |
|
555 | 589 | /*! FSE_decodeSymbolFast() : |
|
556 | 590 | unsafe, only works if no symbol has a probability > 50% */ |
|
557 | 591 | MEM_STATIC BYTE FSE_decodeSymbolFast(FSE_DState_t* DStatePtr, BIT_DStream_t* bitD) |
|
558 | 592 | { |
|
559 | 593 | FSE_decode_t const DInfo = ((const FSE_decode_t*)(DStatePtr->table))[DStatePtr->state]; |
|
560 | 594 | U32 const nbBits = DInfo.nbBits; |
|
561 | 595 | BYTE const symbol = DInfo.symbol; |
|
562 | 596 | size_t const lowBits = BIT_readBitsFast(bitD, nbBits); |
|
563 | 597 | |
|
564 | 598 | DStatePtr->state = DInfo.newState + lowBits; |
|
565 | 599 | return symbol; |
|
566 | 600 | } |
|
567 | 601 | |
|
568 | 602 | MEM_STATIC unsigned FSE_endOfDState(const FSE_DState_t* DStatePtr) |
|
569 | 603 | { |
|
570 | 604 | return DStatePtr->state == 0; |
|
571 | 605 | } |
|
572 | 606 | |
|
573 | 607 | |
|
574 | 608 | |
|
575 | 609 | #ifndef FSE_COMMONDEFS_ONLY |
|
576 | 610 | |
|
577 | 611 | /* ************************************************************** |
|
578 | 612 | * Tuning parameters |
|
579 | 613 | ****************************************************************/ |
|
580 | 614 | /*!MEMORY_USAGE : |
|
581 | 615 | * Memory usage formula : N->2^N Bytes (examples : 10 -> 1KB; 12 -> 4KB ; 16 -> 64KB; 20 -> 1MB; etc.) |
|
582 | 616 | * Increasing memory usage improves compression ratio |
|
583 | 617 | * Reduced memory usage can improve speed, due to cache effect |
|
584 | 618 | * Recommended max value is 14, for 16KB, which nicely fits into Intel x86 L1 cache */ |
|
585 | 619 | #ifndef FSE_MAX_MEMORY_USAGE |
|
586 | 620 | # define FSE_MAX_MEMORY_USAGE 14 |
|
587 | 621 | #endif |
|
588 | 622 | #ifndef FSE_DEFAULT_MEMORY_USAGE |
|
589 | 623 | # define FSE_DEFAULT_MEMORY_USAGE 13 |
|
590 | 624 | #endif |
|
591 | 625 | |
|
592 | 626 | /*!FSE_MAX_SYMBOL_VALUE : |
|
593 | 627 | * Maximum symbol value authorized. |
|
594 | 628 | * Required for proper stack allocation */ |
|
595 | 629 | #ifndef FSE_MAX_SYMBOL_VALUE |
|
596 | 630 | # define FSE_MAX_SYMBOL_VALUE 255 |
|
597 | 631 | #endif |
|
598 | 632 | |
|
599 | 633 | /* ************************************************************** |
|
600 | 634 | * template functions type & suffix |
|
601 | 635 | ****************************************************************/ |
|
602 | 636 | #define FSE_FUNCTION_TYPE BYTE |
|
603 | 637 | #define FSE_FUNCTION_EXTENSION |
|
604 | 638 | #define FSE_DECODE_TYPE FSE_decode_t |
|
605 | 639 | |
|
606 | 640 | |
|
607 | 641 | #endif /* !FSE_COMMONDEFS_ONLY */ |
|
608 | 642 | |
|
609 | 643 | |
|
610 | 644 | /* *************************************************************** |
|
611 | 645 | * Constants |
|
612 | 646 | *****************************************************************/ |
|
613 | 647 | #define FSE_MAX_TABLELOG (FSE_MAX_MEMORY_USAGE-2) |
|
614 | 648 | #define FSE_MAX_TABLESIZE (1U<<FSE_MAX_TABLELOG) |
|
615 | 649 | #define FSE_MAXTABLESIZE_MASK (FSE_MAX_TABLESIZE-1) |
|
616 | 650 | #define FSE_DEFAULT_TABLELOG (FSE_DEFAULT_MEMORY_USAGE-2) |
|
617 | 651 | #define FSE_MIN_TABLELOG 5 |
|
618 | 652 | |
|
619 | 653 | #define FSE_TABLELOG_ABSOLUTE_MAX 15 |
|
620 | 654 | #if FSE_MAX_TABLELOG > FSE_TABLELOG_ABSOLUTE_MAX |
|
621 | 655 | # error "FSE_MAX_TABLELOG > FSE_TABLELOG_ABSOLUTE_MAX is not supported" |
|
622 | 656 | #endif |
|
623 | 657 | |
|
624 | 658 | #define FSE_TABLESTEP(tableSize) ((tableSize>>1) + (tableSize>>3) + 3) |
|
625 | 659 | |
|
626 | 660 | |
|
627 | 661 | #endif /* FSE_STATIC_LINKING_ONLY */ |
|
628 | 662 | |
|
629 | 663 | |
|
630 | 664 | #if defined (__cplusplus) |
|
631 | 665 | } |
|
632 | 666 | #endif |
|
633 | 667 | |
|
634 | 668 | #endif /* FSE_H */ |
@@ -1,329 +1,329 b'' | |||
|
1 | 1 | /* ****************************************************************** |
|
2 | 2 | FSE : Finite State Entropy decoder |
|
3 | 3 | Copyright (C) 2013-2015, Yann Collet. |
|
4 | 4 | |
|
5 | 5 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
6 | 6 | |
|
7 | 7 | Redistribution and use in source and binary forms, with or without |
|
8 | 8 | modification, are permitted provided that the following conditions are |
|
9 | 9 | met: |
|
10 | 10 | |
|
11 | 11 | * Redistributions of source code must retain the above copyright |
|
12 | 12 | notice, this list of conditions and the following disclaimer. |
|
13 | 13 | * Redistributions in binary form must reproduce the above |
|
14 | 14 | copyright notice, this list of conditions and the following disclaimer |
|
15 | 15 | in the documentation and/or other materials provided with the |
|
16 | 16 | distribution. |
|
17 | 17 | |
|
18 | 18 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
19 | 19 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
20 | 20 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
21 | 21 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
22 | 22 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
23 | 23 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
24 | 24 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
25 | 25 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
26 | 26 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
27 | 27 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
28 | 28 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
29 | 29 | |
|
30 | 30 | You can contact the author at : |
|
31 | 31 | - FSE source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
32 | 32 | - Public forum : https://groups.google.com/forum/#!forum/lz4c |
|
33 | 33 | ****************************************************************** */ |
|
34 | 34 | |
|
35 | 35 | |
|
36 | 36 | /* ************************************************************** |
|
37 | 37 | * Compiler specifics |
|
38 | 38 | ****************************************************************/ |
|
39 | 39 | #ifdef _MSC_VER /* Visual Studio */ |
|
40 | 40 | # define FORCE_INLINE static __forceinline |
|
41 | 41 | # include <intrin.h> /* For Visual 2005 */ |
|
42 | 42 | # pragma warning(disable : 4127) /* disable: C4127: conditional expression is constant */ |
|
43 | 43 | # pragma warning(disable : 4214) /* disable: C4214: non-int bitfields */ |
|
44 | 44 | #else |
|
45 | 45 | # if defined (__cplusplus) || defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L /* C99 */ |
|
46 | 46 | # ifdef __GNUC__ |
|
47 | 47 | # define FORCE_INLINE static inline __attribute__((always_inline)) |
|
48 | 48 | # else |
|
49 | 49 | # define FORCE_INLINE static inline |
|
50 | 50 | # endif |
|
51 | 51 | # else |
|
52 | 52 | # define FORCE_INLINE static |
|
53 | 53 | # endif /* __STDC_VERSION__ */ |
|
54 | 54 | #endif |
|
55 | 55 | |
|
56 | 56 | |
|
57 | 57 | /* ************************************************************** |
|
58 | 58 | * Includes |
|
59 | 59 | ****************************************************************/ |
|
60 | 60 | #include <stdlib.h> /* malloc, free, qsort */ |
|
61 | 61 | #include <string.h> /* memcpy, memset */ |
|
62 | 62 | #include <stdio.h> /* printf (debug) */ |
|
63 | 63 | #include "bitstream.h" |
|
64 | 64 | #define FSE_STATIC_LINKING_ONLY |
|
65 | 65 | #include "fse.h" |
|
66 | 66 | |
|
67 | 67 | |
|
68 | 68 | /* ************************************************************** |
|
69 | 69 | * Error Management |
|
70 | 70 | ****************************************************************/ |
|
71 | 71 | #define FSE_isError ERR_isError |
|
72 | 72 | #define FSE_STATIC_ASSERT(c) { enum { FSE_static_assert = 1/(int)(!!(c)) }; } /* use only *after* variable declarations */ |
|
73 | 73 | |
|
74 | 74 | /* check and forward error code */ |
|
75 | 75 | #define CHECK_F(f) { size_t const e = f; if (FSE_isError(e)) return e; } |
|
76 | 76 | |
|
77 | 77 | |
|
78 | 78 | /* ************************************************************** |
|
79 | * Complex types | |
|
80 | ****************************************************************/ | |
|
81 | typedef U32 DTable_max_t[FSE_DTABLE_SIZE_U32(FSE_MAX_TABLELOG)]; | |
|
82 | ||
|
83 | ||
|
84 | /* ************************************************************** | |
|
85 | 79 | * Templates |
|
86 | 80 | ****************************************************************/ |
|
87 | 81 | /* |
|
88 | 82 | designed to be included |
|
89 | 83 | for type-specific functions (template emulation in C) |
|
90 | 84 | Objective is to write these functions only once, for improved maintenance |
|
91 | 85 | */ |
|
92 | 86 | |
|
93 | 87 | /* safety checks */ |
|
94 | 88 | #ifndef FSE_FUNCTION_EXTENSION |
|
95 | 89 | # error "FSE_FUNCTION_EXTENSION must be defined" |
|
96 | 90 | #endif |
|
97 | 91 | #ifndef FSE_FUNCTION_TYPE |
|
98 | 92 | # error "FSE_FUNCTION_TYPE must be defined" |
|
99 | 93 | #endif |
|
100 | 94 | |
|
101 | 95 | /* Function names */ |
|
102 | 96 | #define FSE_CAT(X,Y) X##Y |
|
103 | 97 | #define FSE_FUNCTION_NAME(X,Y) FSE_CAT(X,Y) |
|
104 | 98 | #define FSE_TYPE_NAME(X,Y) FSE_CAT(X,Y) |
|
105 | 99 | |
|
106 | 100 | |
|
107 | 101 | /* Function templates */ |
|
108 | 102 | FSE_DTable* FSE_createDTable (unsigned tableLog) |
|
109 | 103 | { |
|
110 | 104 | if (tableLog > FSE_TABLELOG_ABSOLUTE_MAX) tableLog = FSE_TABLELOG_ABSOLUTE_MAX; |
|
111 | 105 | return (FSE_DTable*)malloc( FSE_DTABLE_SIZE_U32(tableLog) * sizeof (U32) ); |
|
112 | 106 | } |
|
113 | 107 | |
|
114 | 108 | void FSE_freeDTable (FSE_DTable* dt) |
|
115 | 109 | { |
|
116 | 110 | free(dt); |
|
117 | 111 | } |
|
118 | 112 | |
|
119 | 113 | size_t FSE_buildDTable(FSE_DTable* dt, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog) |
|
120 | 114 | { |
|
121 | 115 | void* const tdPtr = dt+1; /* because *dt is unsigned, 32-bits aligned on 32-bits */ |
|
122 | 116 | FSE_DECODE_TYPE* const tableDecode = (FSE_DECODE_TYPE*) (tdPtr); |
|
123 | 117 | U16 symbolNext[FSE_MAX_SYMBOL_VALUE+1]; |
|
124 | 118 | |
|
125 | 119 | U32 const maxSV1 = maxSymbolValue + 1; |
|
126 | 120 | U32 const tableSize = 1 << tableLog; |
|
127 | 121 | U32 highThreshold = tableSize-1; |
|
128 | 122 | |
|
129 | 123 | /* Sanity Checks */ |
|
130 | 124 | if (maxSymbolValue > FSE_MAX_SYMBOL_VALUE) return ERROR(maxSymbolValue_tooLarge); |
|
131 | 125 | if (tableLog > FSE_MAX_TABLELOG) return ERROR(tableLog_tooLarge); |
|
132 | 126 | |
|
133 | 127 | /* Init, lay down lowprob symbols */ |
|
134 | 128 | { FSE_DTableHeader DTableH; |
|
135 | 129 | DTableH.tableLog = (U16)tableLog; |
|
136 | 130 | DTableH.fastMode = 1; |
|
137 | 131 | { S16 const largeLimit= (S16)(1 << (tableLog-1)); |
|
138 | 132 | U32 s; |
|
139 | 133 | for (s=0; s<maxSV1; s++) { |
|
140 | 134 | if (normalizedCounter[s]==-1) { |
|
141 | 135 | tableDecode[highThreshold--].symbol = (FSE_FUNCTION_TYPE)s; |
|
142 | 136 | symbolNext[s] = 1; |
|
143 | 137 | } else { |
|
144 | 138 | if (normalizedCounter[s] >= largeLimit) DTableH.fastMode=0; |
|
145 | 139 | symbolNext[s] = normalizedCounter[s]; |
|
146 | 140 | } } } |
|
147 | 141 | memcpy(dt, &DTableH, sizeof(DTableH)); |
|
148 | 142 | } |
|
149 | 143 | |
|
150 | 144 | /* Spread symbols */ |
|
151 | 145 | { U32 const tableMask = tableSize-1; |
|
152 | 146 | U32 const step = FSE_TABLESTEP(tableSize); |
|
153 | 147 | U32 s, position = 0; |
|
154 | 148 | for (s=0; s<maxSV1; s++) { |
|
155 | 149 | int i; |
|
156 | 150 | for (i=0; i<normalizedCounter[s]; i++) { |
|
157 | 151 | tableDecode[position].symbol = (FSE_FUNCTION_TYPE)s; |
|
158 | 152 | position = (position + step) & tableMask; |
|
159 | 153 | while (position > highThreshold) position = (position + step) & tableMask; /* lowprob area */ |
|
160 | 154 | } } |
|
161 | 155 | if (position!=0) return ERROR(GENERIC); /* position must reach all cells once, otherwise normalizedCounter is incorrect */ |
|
162 | 156 | } |
|
163 | 157 | |
|
164 | 158 | /* Build Decoding table */ |
|
165 | 159 | { U32 u; |
|
166 | 160 | for (u=0; u<tableSize; u++) { |
|
167 | 161 | FSE_FUNCTION_TYPE const symbol = (FSE_FUNCTION_TYPE)(tableDecode[u].symbol); |
|
168 | 162 | U16 nextState = symbolNext[symbol]++; |
|
169 | 163 | tableDecode[u].nbBits = (BYTE) (tableLog - BIT_highbit32 ((U32)nextState) ); |
|
170 | 164 | tableDecode[u].newState = (U16) ( (nextState << tableDecode[u].nbBits) - tableSize); |
|
171 | 165 | } } |
|
172 | 166 | |
|
173 | 167 | return 0; |
|
174 | 168 | } |
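
A worked example of the spreading step: with tableLog == 5, tableSize == 32 and FSE_TABLESTEP(32) == (32>>1) + (32>>3) + 3 == 23. For every tableLog >= FSE_MIN_TABLELOG both shifted terms are even, so the step is odd and therefore coprime with the power-of-two table size; stepping by it modulo 32 visits all 32 cells exactly once before returning to 0, which is why a final position != 0 signals an inconsistent normalizedCounter and the function returns ERROR(GENERIC).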
|
175 | 169 | |
|
176 | 170 | |
|
177 | 171 | #ifndef FSE_COMMONDEFS_ONLY |
|
178 | 172 | |
|
179 | 173 | /*-******************************************************* |
|
180 | 174 | * Decompression (Byte symbols) |
|
181 | 175 | *********************************************************/ |
|
182 | 176 | size_t FSE_buildDTable_rle (FSE_DTable* dt, BYTE symbolValue) |
|
183 | 177 | { |
|
184 | 178 | void* ptr = dt; |
|
185 | 179 | FSE_DTableHeader* const DTableH = (FSE_DTableHeader*)ptr; |
|
186 | 180 | void* dPtr = dt + 1; |
|
187 | 181 | FSE_decode_t* const cell = (FSE_decode_t*)dPtr; |
|
188 | 182 | |
|
189 | 183 | DTableH->tableLog = 0; |
|
190 | 184 | DTableH->fastMode = 0; |
|
191 | 185 | |
|
192 | 186 | cell->newState = 0; |
|
193 | 187 | cell->symbol = symbolValue; |
|
194 | 188 | cell->nbBits = 0; |
|
195 | 189 | |
|
196 | 190 | return 0; |
|
197 | 191 | } |
|
198 | 192 | |
|
199 | 193 | |
|
200 | 194 | size_t FSE_buildDTable_raw (FSE_DTable* dt, unsigned nbBits) |
|
201 | 195 | { |
|
202 | 196 | void* ptr = dt; |
|
203 | 197 | FSE_DTableHeader* const DTableH = (FSE_DTableHeader*)ptr; |
|
204 | 198 | void* dPtr = dt + 1; |
|
205 | 199 | FSE_decode_t* const dinfo = (FSE_decode_t*)dPtr; |
|
206 | 200 | const unsigned tableSize = 1 << nbBits; |
|
207 | 201 | const unsigned tableMask = tableSize - 1; |
|
208 | 202 | const unsigned maxSV1 = tableMask+1; |
|
209 | 203 | unsigned s; |
|
210 | 204 | |
|
211 | 205 | /* Sanity checks */ |
|
212 | 206 | if (nbBits < 1) return ERROR(GENERIC); /* min size */ |
|
213 | 207 | |
|
214 | 208 | /* Build Decoding Table */ |
|
215 | 209 | DTableH->tableLog = (U16)nbBits; |
|
216 | 210 | DTableH->fastMode = 1; |
|
217 | 211 | for (s=0; s<maxSV1; s++) { |
|
218 | 212 | dinfo[s].newState = 0; |
|
219 | 213 | dinfo[s].symbol = (BYTE)s; |
|
220 | 214 | dinfo[s].nbBits = (BYTE)nbBits; |
|
221 | 215 | } |
|
222 | 216 | |
|
223 | 217 | return 0; |
|
224 | 218 | } |
|
225 | 219 | |
|
226 | 220 | FORCE_INLINE size_t FSE_decompress_usingDTable_generic( |
|
227 | 221 | void* dst, size_t maxDstSize, |
|
228 | 222 | const void* cSrc, size_t cSrcSize, |
|
229 | 223 | const FSE_DTable* dt, const unsigned fast) |
|
230 | 224 | { |
|
231 | 225 | BYTE* const ostart = (BYTE*) dst; |
|
232 | 226 | BYTE* op = ostart; |
|
233 | 227 | BYTE* const omax = op + maxDstSize; |
|
234 | 228 | BYTE* const olimit = omax-3; |
|
235 | 229 | |
|
236 | 230 | BIT_DStream_t bitD; |
|
237 | 231 | FSE_DState_t state1; |
|
238 | 232 | FSE_DState_t state2; |
|
239 | 233 | |
|
240 | 234 | /* Init */ |
|
241 | 235 | CHECK_F(BIT_initDStream(&bitD, cSrc, cSrcSize)); |
|
242 | 236 | |
|
243 | 237 | FSE_initDState(&state1, &bitD, dt); |
|
244 | 238 | FSE_initDState(&state2, &bitD, dt); |
|
245 | 239 | |
|
246 | 240 | #define FSE_GETSYMBOL(statePtr) fast ? FSE_decodeSymbolFast(statePtr, &bitD) : FSE_decodeSymbol(statePtr, &bitD) |
|
247 | 241 | |
|
248 | 242 | /* 4 symbols per loop */ |
|
249 | 243 | for ( ; (BIT_reloadDStream(&bitD)==BIT_DStream_unfinished) & (op<olimit) ; op+=4) { |
|
250 | 244 | op[0] = FSE_GETSYMBOL(&state1); |
|
251 | 245 | |
|
252 | 246 | if (FSE_MAX_TABLELOG*2+7 > sizeof(bitD.bitContainer)*8) /* This test must be static */ |
|
253 | 247 | BIT_reloadDStream(&bitD); |
|
254 | 248 | |
|
255 | 249 | op[1] = FSE_GETSYMBOL(&state2); |
|
256 | 250 | |
|
257 | 251 | if (FSE_MAX_TABLELOG*4+7 > sizeof(bitD.bitContainer)*8) /* This test must be static */ |
|
258 | 252 | { if (BIT_reloadDStream(&bitD) > BIT_DStream_unfinished) { op+=2; break; } } |
|
259 | 253 | |
|
260 | 254 | op[2] = FSE_GETSYMBOL(&state1); |
|
261 | 255 | |
|
262 | 256 | if (FSE_MAX_TABLELOG*2+7 > sizeof(bitD.bitContainer)*8) /* This test must be static */ |
|
263 | 257 | BIT_reloadDStream(&bitD); |
|
264 | 258 | |
|
265 | 259 | op[3] = FSE_GETSYMBOL(&state2); |
|
266 | 260 | } |
|
267 | 261 | |
|
268 | 262 | /* tail */ |
|
269 | 263 | /* note : BIT_reloadDStream(&bitD) >= FSE_DStream_partiallyFilled; Ends at exactly BIT_DStream_completed */ |
|
270 | 264 | while (1) { |
|
271 | 265 | if (op>(omax-2)) return ERROR(dstSize_tooSmall); |
|
272 | 266 | *op++ = FSE_GETSYMBOL(&state1); |
|
273 | 267 | if (BIT_reloadDStream(&bitD)==BIT_DStream_overflow) { |
|
274 | 268 | *op++ = FSE_GETSYMBOL(&state2); |
|
275 | 269 | break; |
|
276 | 270 | } |
|
277 | 271 | |
|
278 | 272 | if (op>(omax-2)) return ERROR(dstSize_tooSmall); |
|
279 | 273 | *op++ = FSE_GETSYMBOL(&state2); |
|
280 | 274 | if (BIT_reloadDStream(&bitD)==BIT_DStream_overflow) { |
|
281 | 275 | *op++ = FSE_GETSYMBOL(&state1); |
|
282 | 276 | break; |
|
283 | 277 | } } |
|
284 | 278 | |
|
285 | 279 | return op-ostart; |
|
286 | 280 | } |
|
287 | 281 | |
|
288 | 282 | |
|
289 | 283 | size_t FSE_decompress_usingDTable(void* dst, size_t originalSize, |
|
290 | 284 | const void* cSrc, size_t cSrcSize, |
|
291 | 285 | const FSE_DTable* dt) |
|
292 | 286 | { |
|
293 | 287 | const void* ptr = dt; |
|
294 | 288 | const FSE_DTableHeader* DTableH = (const FSE_DTableHeader*)ptr; |
|
295 | 289 | const U32 fastMode = DTableH->fastMode; |
|
296 | 290 | |
|
297 | 291 | /* select fast mode (static) */ |
|
298 | 292 | if (fastMode) return FSE_decompress_usingDTable_generic(dst, originalSize, cSrc, cSrcSize, dt, 1); |
|
299 | 293 | return FSE_decompress_usingDTable_generic(dst, originalSize, cSrc, cSrcSize, dt, 0); |
|
300 | 294 | } |
|
301 | 295 | |
|
302 | 296 | |
|
303 | size_t FSE_decompress(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize) |

297 | size_t FSE_decompress_wksp(void* dst, size_t dstCapacity, const void* cSrc, size_t cSrcSize, FSE_DTable* workSpace, unsigned maxLog) | |
|
304 | 298 | { |
|
305 | 299 | const BYTE* const istart = (const BYTE*)cSrc; |
|
306 | 300 | const BYTE* ip = istart; |
|
307 | 301 | short counting[FSE_MAX_SYMBOL_VALUE+1]; |
|
308 | DTable_max_t dt; /* Static analyzer seems unable to understand this table will be properly initialized later */ | |
|
309 | 302 | unsigned tableLog; |
|
310 | 303 | unsigned maxSymbolValue = FSE_MAX_SYMBOL_VALUE; |
|
311 | 304 | |
|
312 | if (cSrcSize<2) return ERROR(srcSize_wrong); /* too small input size */ | |
|
305 | /* normal FSE decoding mode */ | |
|
306 | size_t const NCountLength = FSE_readNCount (counting, &maxSymbolValue, &tableLog, istart, cSrcSize); | |
|
307 | if (FSE_isError(NCountLength)) return NCountLength; | |
|
308 | //if (NCountLength >= cSrcSize) return ERROR(srcSize_wrong); /* too small input size; supposed to be already checked in NCountLength, only remaining case : NCountLength==cSrcSize */ | |
|
309 | if (tableLog > maxLog) return ERROR(tableLog_tooLarge); | |
|
310 | ip += NCountLength; | |
|
311 | cSrcSize -= NCountLength; | |
|
312 | ||
|
313 | CHECK_F( FSE_buildDTable (workSpace, counting, maxSymbolValue, tableLog) ); | |
|
313 | 314 | |
|
314 | /* normal FSE decoding mode */ | |
|
315 | { size_t const NCountLength = FSE_readNCount (counting, &maxSymbolValue, &tableLog, istart, cSrcSize); | |
|
316 | if (FSE_isError(NCountLength)) return NCountLength; | |
|
317 | if (NCountLength >= cSrcSize) return ERROR(srcSize_wrong); /* too small input size */ | |
|
318 | ip += NCountLength; | |
|
319 | cSrcSize -= NCountLength; | |
|
320 | } | |
|
315 | return FSE_decompress_usingDTable (dst, dstCapacity, ip, cSrcSize, workSpace); /* always return, even if it is an error code */ | |
|
316 | } | |
|
317 | ||
|
321 | 318 | |
|
322 | CHECK_F( FSE_buildDTable (dt, counting, maxSymbolValue, tableLog) ); | |
|
319 | typedef FSE_DTable DTable_max_t[FSE_DTABLE_SIZE_U32(FSE_MAX_TABLELOG)]; | |
|
323 | 320 | |
|
324 | return FSE_decompress_usingDTable (dst, maxDstSize, ip, cSrcSize, dt); /* always return, even if it is an error code */ | |
|
321 | size_t FSE_decompress(void* dst, size_t dstCapacity, const void* cSrc, size_t cSrcSize) | |
|
322 | { | |
|
323 | DTable_max_t dt; /* Static analyzer seems unable to understand this table will be properly initialized later */ | |
|
324 | return FSE_decompress_wksp(dst, dstCapacity, cSrc, cSrcSize, dt, FSE_MAX_TABLELOG); | |
|
325 | 325 | } |
|
326 | 326 | |
|
327 | 327 | |
|
328 | 328 | |
|
329 | 329 | #endif /* FSE_COMMONDEFS_ONLY */ |
@@ -1,228 +1,238 b'' | |||
|
1 | 1 | /* ****************************************************************** |
|
2 | 2 | Huffman coder, part of New Generation Entropy library |
|
3 | 3 | header file |
|
4 | 4 | Copyright (C) 2013-2016, Yann Collet. |
|
5 | 5 | |
|
6 | 6 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
7 | 7 | |
|
8 | 8 | Redistribution and use in source and binary forms, with or without |
|
9 | 9 | modification, are permitted provided that the following conditions are |
|
10 | 10 | met: |
|
11 | 11 | |
|
12 | 12 | * Redistributions of source code must retain the above copyright |
|
13 | 13 | notice, this list of conditions and the following disclaimer. |
|
14 | 14 | * Redistributions in binary form must reproduce the above |
|
15 | 15 | copyright notice, this list of conditions and the following disclaimer |
|
16 | 16 | in the documentation and/or other materials provided with the |
|
17 | 17 | distribution. |
|
18 | 18 | |
|
19 | 19 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
20 | 20 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
21 | 21 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
22 | 22 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
23 | 23 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
24 | 24 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
25 | 25 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
26 | 26 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
27 | 27 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
28 | 28 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
29 | 29 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
30 | 30 | |
|
31 | 31 | You can contact the author at : |
|
32 | 32 | - Source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
33 | 33 | ****************************************************************** */ |
|
34 | 34 | #ifndef HUF_H_298734234 |
|
35 | 35 | #define HUF_H_298734234 |
|
36 | 36 | |
|
37 | 37 | #if defined (__cplusplus) |
|
38 | 38 | extern "C" { |
|
39 | 39 | #endif |
|
40 | 40 | |
|
41 | 41 | |
|
42 | 42 | /* *** Dependencies *** */ |
|
43 | 43 | #include <stddef.h> /* size_t */ |
|
44 | 44 | |
|
45 | 45 | |
|
46 | 46 | /* *** simple functions *** */ |
|
47 | 47 | /** |
|
48 | 48 | HUF_compress() : |
|
49 | 49 | Compress content from buffer 'src', of size 'srcSize', into buffer 'dst'. |
|
50 | 50 | 'dst' buffer must be already allocated. |
|
51 | 51 | Compression runs faster if `dstCapacity` >= HUF_compressBound(srcSize). |
|
52 | 52 | `srcSize` must be <= `HUF_BLOCKSIZE_MAX` == 128 KB. |
|
53 | 53 | @return : size of compressed data (<= `dstCapacity`). |
|
54 | 54 | Special values : if return == 0, srcData is not compressible => Nothing is stored within dst !!! |
|
55 | 55 | if return == 1, srcData is a single repeated byte symbol (RLE compression). |
|
56 | 56 | if HUF_isError(return), compression failed (more details using HUF_getErrorName()) |
|
57 | 57 | */ |
|
58 | 58 | size_t HUF_compress(void* dst, size_t dstCapacity, |
|
59 | 59 | const void* src, size_t srcSize); |
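
The three outcome classes mean callers must branch before trusting dst; a sketch (my_store_raw and my_store_rle are hypothetical fallbacks):

size_t const cSize = HUF_compress(dst, dstCapacity, src, srcSize);
if (HUF_isError(cSize)) return cSize;                 /* hard failure */
if (cSize == 0) return my_store_raw(src, srcSize);    /* not compressible : dst holds nothing */
if (cSize == 1) return my_store_rle(((const unsigned char*)src)[0], srcSize);  /* single repeated byte */
/* otherwise dst holds cSize bytes of Huffman-compressed data */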
|
60 | 60 | |
|
61 | 61 | /** |
|
62 | 62 | HUF_decompress() : |
|
63 | 63 | Decompress HUF data from buffer 'cSrc', of size 'cSrcSize', |
|
64 | 64 | into already allocated buffer 'dst', of minimum size 'dstSize'. |
|
65 | `dstSize` : **must** be the ***exact*** size of original (uncompressed) data. |

65 | `originalSize` : **must** be the ***exact*** size of original (uncompressed) data. | |
|
66 | 66 | Note : in contrast with FSE, HUF_decompress can regenerate |
|
67 | 67 | RLE (cSrcSize==1) and uncompressed (cSrcSize==dstSize) data, |
|
68 | 68 | because it knows size to regenerate. |
|
69 | @return : size of regenerated data (== dstSize), |

69 | @return : size of regenerated data (== originalSize), | |
|
70 | 70 | or an error code, which can be tested using HUF_isError() |
|
71 | 71 | */ |
|
72 | size_t HUF_decompress(void* dst, size_t dstSize, |

72 | size_t HUF_decompress(void* dst, size_t originalSize, | |
|
73 | 73 | const void* cSrc, size_t cSrcSize); |
|
74 | 74 | |
|
75 | 75 | |
|
76 | /* **************************************** | |
|
77 | * Tool functions | |
|
78 | ******************************************/ | |
|
79 | #define HUF_BLOCKSIZE_MAX (128 * 1024) | |
|
76 | /* *** Tool functions *** */ | |
|
77 | #define HUF_BLOCKSIZE_MAX (128 * 1024) /**< maximum input size for a single block compressed with HUF_compress */ | |
|
80 | 78 | size_t HUF_compressBound(size_t size); /**< maximum compressed size (worst case) */ |
|
81 | 79 | |
|
82 | 80 | /* Error Management */ |
|
83 | 81 | unsigned HUF_isError(size_t code); /**< tells if a return value is an error code */ |
|
84 | 82 | const char* HUF_getErrorName(size_t code); /**< provides error code string (useful for debugging) */ |
|
85 | 83 | |
|
86 | 84 | |
|
87 | /* *** Advanced function *** */ | |
|
85 | /* *** Advanced function *** */ | |
|
88 | 86 | |
|
89 | 87 | /** HUF_compress2() : |
|
90 | * Same as HUF_compress(), but offers direct control over `maxSymbolValue` and `tableLog` */ |

88 | * Same as HUF_compress(), but offers direct control over `maxSymbolValue` and `tableLog` . | |
|
89 | * `tableLog` must be `<= HUF_TABLELOG_MAX` . */ | |
|
91 | 90 | size_t HUF_compress2 (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog); |
|
92 | 91 | |
|
92 | /** HUF_compress4X_wksp() : | |
|
93 | * Same as HUF_compress2(), but uses externally allocated `workSpace`, which must be a table of >= 1024 unsigned */ | |
|
94 | size_t HUF_compress4X_wksp (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog, void* workSpace, size_t wkspSize); /**< `workSpace` must be a table of at least 1024 unsigned */ | |
|
95 | ||
|
96 | ||
|
93 | 97 | |
|
94 | 98 | #ifdef HUF_STATIC_LINKING_ONLY |
|
95 | 99 | |
|
96 | 100 | /* *** Dependencies *** */ |
|
97 | 101 | #include "mem.h" /* U32 */ |
|
98 | 102 | |
|
99 | 103 | |
|
100 | 104 | /* *** Constants *** */ |
|
101 | #define HUF_TABLELOG_ABSOLUTEMAX 16 /* absolute limit of HUF_MAX_TABLELOG. Beyond that value, code does not work */ |

105 | #define HUF_TABLELOG_ABSOLUTEMAX 15 /* absolute limit of HUF_MAX_TABLELOG. Beyond that value, code does not work */ | |
|
102 | 106 | #define HUF_TABLELOG_MAX 12 /* max configured tableLog (for static allocation); can be modified up to HUF_ABSOLUTEMAX_TABLELOG */ |
|
103 | 107 | #define HUF_TABLELOG_DEFAULT 11 /* tableLog by default, when not specified */ |
|
104 | 108 | #define HUF_SYMBOLVALUE_MAX 255 |
|
105 | 109 | #if (HUF_TABLELOG_MAX > HUF_TABLELOG_ABSOLUTEMAX) |
|
106 | 110 | # error "HUF_TABLELOG_MAX is too large !" |
|
107 | 111 | #endif |
|
108 | 112 | |
|
109 | 113 | |
|
110 | 114 | /* **************************************** |
|
111 | 115 | * Static allocation |
|
112 | 116 | ******************************************/ |
|
113 | 117 | /* HUF buffer bounds */ |
|
114 | 118 | #define HUF_CTABLEBOUND 129 |
|
115 | 119 | #define HUF_BLOCKBOUND(size) (size + (size>>8) + 8) /* only true if incompressible pre-filtered with fast heuristic */ |
|
116 | 120 | #define HUF_COMPRESSBOUND(size) (HUF_CTABLEBOUND + HUF_BLOCKBOUND(size)) /* Macro version, useful for static allocation */ |
|
117 | 121 | |
|
118 | 122 | /* static allocation of HUF's Compression Table */ |
|
119 | 123 | #define HUF_CREATE_STATIC_CTABLE(name, maxSymbolValue) \ |
|
120 | 124 | U32 name##hb[maxSymbolValue+1]; \ |
|
121 | 125 | void* name##hv = &(name##hb); \ |
|
122 | 126 | HUF_CElt* name = (HUF_CElt*)(name##hv) /* no final ; */ |
|
123 | 127 | |
|
124 | 128 | /* static allocation of HUF's DTable */ |
|
125 | 129 | typedef U32 HUF_DTable; |
|
126 | 130 | #define HUF_DTABLE_SIZE(maxTableLog) (1 + (1<<(maxTableLog))) |
|
127 | 131 | #define HUF_CREATE_STATIC_DTABLEX2(DTable, maxTableLog) \ |
|
128 | HUF_DTable DTable[HUF_DTABLE_SIZE((maxTableLog)-1)] = { ((U32)((maxTableLog)-1)*0x1000001) } | |
|
132 | HUF_DTable DTable[HUF_DTABLE_SIZE((maxTableLog)-1)] = { ((U32)((maxTableLog)-1) * 0x01000001) } | |
|
129 | 133 | #define HUF_CREATE_STATIC_DTABLEX4(DTable, maxTableLog) \ |
|
130 | HUF_DTable DTable[HUF_DTABLE_SIZE(maxTableLog)] = { ((U32)(maxTableLog)*0x1000001) } | |
|
134 | HUF_DTable DTable[HUF_DTABLE_SIZE(maxTableLog)] = { ((U32)(maxTableLog) * 0x01000001) } | |
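
For example, a decoder that only ever uses the single-symbol tables can keep its DTable out of the heap entirely. A sketch, assuming cSrc begins with a table serialized by HUF_writeCTable() and that dst/dstSize/cSrc/cSrcSize are in scope:

HUF_CREATE_STATIC_DTABLEX2(hufTable, HUF_TABLELOG_MAX);   /* no allocation */
size_t const hSize = HUF_readDTableX2(hufTable, cSrc, cSrcSize);
size_t const dSize = HUF_isError(hSize) ? hSize
                   : HUF_decompress4X2_usingDTable(dst, dstSize, (const char*)cSrc + hSize,
                                                   cSrcSize - hSize, hufTable);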
|
131 | 135 | |
|
132 | 136 | |
|
133 | 137 | /* **************************************** |
|
134 | 138 | * Advanced decompression functions |
|
135 | 139 | ******************************************/ |
|
136 | 140 | size_t HUF_decompress4X2 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< single-symbol decoder */ |
|
137 | 141 | size_t HUF_decompress4X4 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< double-symbols decoder */ |
|
138 | 142 | |
|
139 | 143 | size_t HUF_decompress4X_DCtx (HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< decodes RLE and uncompressed */ |
|
140 | 144 | size_t HUF_decompress4X_hufOnly(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< considers RLE and uncompressed as errors */ |
|
141 | 145 | size_t HUF_decompress4X2_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< single-symbol decoder */ |
|
142 | 146 | size_t HUF_decompress4X4_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< double-symbols decoder */ |
|
143 | 147 | |
|
144 | size_t HUF_decompress1X_DCtx (HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); | |
|
145 | size_t HUF_decompress1X2_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< single-symbol decoder */ | |
|
146 | size_t HUF_decompress1X4_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< double-symbols decoder */ | |
|
147 | ||
|
148 | 148 | |
|
149 | 149 | /* **************************************** |
|
150 | 150 | * HUF detailed API |
|
151 | 151 | ******************************************/ |
|
152 | 152 | /*! |
|
153 | 153 | HUF_compress() does the following: |
|
154 | 154 | 1. count symbol occurrence from source[] into table count[] using FSE_count() |
|
155 | 155 | 2. (optional) refine tableLog using HUF_optimalTableLog() |
|
156 | 156 | 3. build Huffman table from count using HUF_buildCTable() |
|
157 | 157 | 4. save Huffman table to memory buffer using HUF_writeCTable() |
|
158 | 158 | 5. encode the data stream using HUF_compress4X_usingCTable() |
|
159 | 159 | |
|
160 | 160 | The following API allows targeting specific sub-functions for advanced tasks. |
|
161 | 161 | For example, it's possible to compress several blocks using the same 'CTable', |
|
162 | 162 | or to save and regenerate 'CTable' using external methods. |
|
163 | 163 | */ |
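
Chained together with the declarations below, the five steps look roughly like this sketch (the real HUF_compress() additionally handles the not-compressible and RLE outcomes described earlier; huf_compress_sketch is a hypothetical name):

#define HUF_STATIC_LINKING_ONLY   /* for HUF_CREATE_STATIC_CTABLE and HUF_SYMBOLVALUE_MAX */
#include "huf.h"
#include "fse.h"                  /* FSE_count() */

size_t huf_compress_sketch(void* dst, size_t dstCapacity, const void* src, size_t srcSize)
{
    unsigned count[HUF_SYMBOLVALUE_MAX+1];
    unsigned maxSymbolValue = HUF_SYMBOLVALUE_MAX;
    HUF_CREATE_STATIC_CTABLE(ct, HUF_SYMBOLVALUE_MAX);

    /* 1. histogram the input */
    size_t const largest = FSE_count(count, &maxSymbolValue, src, srcSize);
    if (HUF_isError(largest)) return largest;

    /* 2. pick a table log, 3. build the Huffman tree */
    {   unsigned huffLog = HUF_optimalTableLog(HUF_TABLELOG_DEFAULT, srcSize, maxSymbolValue);
        size_t const maxBits = HUF_buildCTable(ct, count, maxSymbolValue, huffLog);
        if (HUF_isError(maxBits)) return maxBits;
        huffLog = (unsigned)maxBits;   /* buildCTable may settle on a smaller depth */

        /* 4. serialize the table, 5. encode the payload right after it */
        {   size_t const hSize = HUF_writeCTable(dst, dstCapacity, ct, maxSymbolValue, huffLog);
            if (HUF_isError(hSize)) return hSize;
            {   size_t const cSize = HUF_compress4X_usingCTable((char*)dst + hSize,
                                            dstCapacity - hSize, src, srcSize, ct);
                return (HUF_isError(cSize) || cSize == 0) ? cSize : hSize + cSize;
            }
        }
    }
}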
|
164 | 164 | /* FSE_count() : find it within "fse.h" */ |
|
165 | 165 | unsigned HUF_optimalTableLog(unsigned maxTableLog, size_t srcSize, unsigned maxSymbolValue); |
|
166 | 166 | typedef struct HUF_CElt_s HUF_CElt; /* incomplete type */ |
|
167 | 167 | size_t HUF_buildCTable (HUF_CElt* CTable, const unsigned* count, unsigned maxSymbolValue, unsigned maxNbBits); |
|
168 | 168 | size_t HUF_writeCTable (void* dst, size_t maxDstSize, const HUF_CElt* CTable, unsigned maxSymbolValue, unsigned huffLog); |
|
169 | 169 | size_t HUF_compress4X_usingCTable(void* dst, size_t dstSize, const void* src, size_t srcSize, const HUF_CElt* CTable); |
|
170 | 170 | |
|
171 | 171 | |
|
172 | /** HUF_buildCTable_wksp() : | |
|
173 | * Same as HUF_buildCTable(), but using externally allocated scratch buffer. | |
|
174 | * `workSpace` must be aligned on 4-bytes boundaries, and be at least as large as a table of 1024 unsigned. | |
|
175 | */ | |
|
176 | size_t HUF_buildCTable_wksp (HUF_CElt* tree, const U32* count, U32 maxSymbolValue, U32 maxNbBits, void* workSpace, size_t wkspSize); | |
|
177 | ||
|
172 | 178 | /*! HUF_readStats() : |
|
173 | 179 | Read compact Huffman tree, saved by HUF_writeCTable(). |
|
174 | 180 | `huffWeight` is destination buffer. |
|
175 | 181 | @return : size read from `src` , or an error Code . |
|
176 | 182 | Note : Needed by HUF_readCTable() and HUF_readDTableXn() . */ |
|
177 | 183 | size_t HUF_readStats(BYTE* huffWeight, size_t hwSize, U32* rankStats, |
|
178 | 184 | U32* nbSymbolsPtr, U32* tableLogPtr, |
|
179 | 185 | const void* src, size_t srcSize); |
|
180 | 186 | |
|
181 | 187 | /** HUF_readCTable() : |
|
182 | 188 | * Loading a CTable saved with HUF_writeCTable() */ |
|
183 | 189 | size_t HUF_readCTable (HUF_CElt* CTable, unsigned maxSymbolValue, const void* src, size_t srcSize); |
|
184 | 190 | |
|
185 | 191 | |
|
186 | 192 | /* |
|
187 | 193 | HUF_decompress() does the following: |
|
188 | 194 | 1. select the decompression algorithm (X2, X4) based on pre-computed heuristics |
|
189 | 195 | 2. build Huffman table from save, using HUF_readDTableXn() |
|
190 | 196 | 3. decode 1 or 4 segments in parallel using HUF_decompressSXn_usingDTable |
|
191 | 197 | */ |
|
192 | 198 | |
|
193 | 199 | /** HUF_selectDecoder() : |
|
194 | 200 | * Tells which decoder is likely to decode faster, |
|
195 | 201 | * based on a set of pre-determined metrics. |
|
196 | 202 | * @return : 0==HUF_decompress4X2, 1==HUF_decompress4X4 . |
|
197 | 203 | * Assumption : 0 < cSrcSize < dstSize <= 128 KB */ |
|
198 | 204 | U32 HUF_selectDecoder (size_t dstSize, size_t cSrcSize); |
|
199 | 205 | |
|
200 | 206 | size_t HUF_readDTableX2 (HUF_DTable* DTable, const void* src, size_t srcSize); |
|
201 | 207 | size_t HUF_readDTableX4 (HUF_DTable* DTable, const void* src, size_t srcSize); |
|
202 | 208 | |
|
203 | 209 | size_t HUF_decompress4X_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); |
|
204 | 210 | size_t HUF_decompress4X2_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); |
|
205 | 211 | size_t HUF_decompress4X4_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); |
|
206 | 212 | |
|
207 | 213 | |
|
208 | 214 | /* single stream variants */ |
|
209 | 215 | |
|
210 | 216 | size_t HUF_compress1X (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog); |
|
217 | size_t HUF_compress1X_wksp (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog, void* workSpace, size_t wkspSize); /**< `workSpace` must be a table of at least 1024 unsigned */ | |
|
211 | 218 | size_t HUF_compress1X_usingCTable(void* dst, size_t dstSize, const void* src, size_t srcSize, const HUF_CElt* CTable); |
|
212 | 219 | |
|
213 | 220 | size_t HUF_decompress1X2 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /* single-symbol decoder */ |
|
214 | 221 | size_t HUF_decompress1X4 (void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /* double-symbol decoder */ |
|
215 | 222 | |
|
216 | size_t HUF_decompress1X_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); |

223 | size_t HUF_decompress1X_DCtx (HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); | |
|
224 | size_t HUF_decompress1X2_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< single-symbol decoder */ | |
|
225 | size_t HUF_decompress1X4_DCtx(HUF_DTable* dctx, void* dst, size_t dstSize, const void* cSrc, size_t cSrcSize); /**< double-symbols decoder */ | |
|
226 | ||
|
227 | size_t HUF_decompress1X_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); /**< automatic selection of single or double symbol decoder, based on DTable */ |
|
217 | 228 | size_t HUF_decompress1X2_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); |
|
218 | 229 | size_t HUF_decompress1X4_usingDTable(void* dst, size_t maxDstSize, const void* cSrc, size_t cSrcSize, const HUF_DTable* DTable); |
|
219 | 230 | |
|
220 | ||
|
221 | 231 | #endif /* HUF_STATIC_LINKING_ONLY */ |
|
222 | 232 | |
|
223 | 233 | |
|
224 | 234 | #if defined (__cplusplus) |
|
225 | 235 | } |
|
226 | 236 | #endif |
|
227 | 237 | |
|
228 | 238 | #endif /* HUF_H_298734234 */ |
@@ -1,370 +1,372 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Yann Collet, Facebook, Inc. |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This source code is licensed under the BSD-style license found in the |
|
6 | 6 | * LICENSE file in the root directory of this source tree. An additional grant |
|
7 | 7 | * of patent rights can be found in the PATENTS file in the same directory. |
|
8 | 8 | */ |
|
9 | 9 | |
|
10 | 10 | #ifndef MEM_H_MODULE |
|
11 | 11 | #define MEM_H_MODULE |
|
12 | 12 | |
|
13 | 13 | #if defined (__cplusplus) |
|
14 | 14 | extern "C" { |
|
15 | 15 | #endif |
|
16 | 16 | |
|
17 | 17 | /*-**************************************** |
|
18 | 18 | * Dependencies |
|
19 | 19 | ******************************************/ |
|
20 | 20 | #include <stddef.h> /* size_t, ptrdiff_t */ |
|
21 | 21 | #include <string.h> /* memcpy */ |
|
22 | 22 | |
|
23 | 23 | |
|
24 | 24 | /*-**************************************** |
|
25 | 25 | * Compiler specifics |
|
26 | 26 | ******************************************/ |
|
27 | 27 | #if defined(_MSC_VER) /* Visual Studio */ |
|
28 | 28 | # include <stdlib.h> /* _byteswap_ulong */ |
|
29 | 29 | # include <intrin.h> /* _byteswap_* */ |
|
30 | 30 | #endif |
|
31 | 31 | #if defined(__GNUC__) |
|
32 | 32 | # define MEM_STATIC static __inline __attribute__((unused)) |
|
33 | 33 | #elif defined (__cplusplus) || (defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 199901L) /* C99 */) |
|
34 | 34 | # define MEM_STATIC static inline |
|
35 | 35 | #elif defined(_MSC_VER) |
|
36 | 36 | # define MEM_STATIC static __inline |
|
37 | 37 | #else |
|
38 | 38 | # define MEM_STATIC static /* this version may generate warnings for unused static functions; disable the relevant warning */ |
|
39 | 39 | #endif |
|
40 | 40 | |
|
41 | 41 | /* code only tested on 32 and 64 bits systems */ |
|
42 | 42 | #define MEM_STATIC_ASSERT(c) { enum { XXH_static_assert = 1/(int)(!!(c)) }; } |
|
43 | 43 | MEM_STATIC void MEM_check(void) { MEM_STATIC_ASSERT((sizeof(size_t)==4) || (sizeof(size_t)==8)); } |
|
44 | 44 | |
|
45 | 45 | |
|
46 | 46 | /*-************************************************************** |
|
47 | 47 | * Basic Types |
|
48 | 48 | *****************************************************************/ |
|
49 | 49 | #if !defined (__VMS) && (defined (__cplusplus) || (defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 199901L) /* C99 */) ) |
|
50 | 50 | # include <stdint.h> |
|
51 | 51 | typedef uint8_t BYTE; |
|
52 | 52 | typedef uint16_t U16; |
|
53 | 53 | typedef int16_t S16; |
|
54 | 54 | typedef uint32_t U32; |
|
55 | 55 | typedef int32_t S32; |
|
56 | 56 | typedef uint64_t U64; |
|
57 | 57 | typedef int64_t S64; |
|
58 | typedef intptr_t iPtrDiff; | |
|
58 | 59 | #else |
|
59 | typedef unsigned char       BYTE;
|
60 | typedef unsigned char BYTE; | |
|
60 | 61 | typedef unsigned short U16; |
|
61 | 62 | typedef signed short S16; |
|
62 | 63 | typedef unsigned int U32; |
|
63 | 64 | typedef signed int S32; |
|
64 | 65 | typedef unsigned long long U64; |
|
65 | 66 | typedef signed long long S64; |
|
67 | typedef ptrdiff_t iPtrDiff; | |
|
66 | 68 | #endif |
|
67 | 69 | |
|
68 | 70 | |
|
69 | 71 | /*-************************************************************** |
|
70 | 72 | * Memory I/O |
|
71 | 73 | *****************************************************************/ |
|
72 | 74 | /* MEM_FORCE_MEMORY_ACCESS : |
|
73 | 75 | * By default, access to unaligned memory is controlled by `memcpy()`, which is safe and portable. |
|
74 | 76 | * Unfortunately, on some target/compiler combinations, the generated assembly is sub-optimal. |
|
75 | 77 | * The switch below allows selecting a different access method for improved performance.
|
76 | 78 | * Method 0 (default) : use `memcpy()`. Safe and portable. |
|
77 | 79 | * Method 1 : `__packed` statement. It depends on compiler extension (ie, not portable). |
|
78 | 80 | * This method is safe if your compiler supports it, and *generally* as fast or faster than `memcpy`. |
|
79 | 81 | * Method 2 : direct access. This method is portable but violates the C standard.
|
80 | 82 | * It can generate buggy code on targets depending on alignment. |
|
81 | 83 | * In some circumstances, it's the only known way to get the most performance (ie GCC + ARMv6) |
|
82 | 84 | * See http://fastcompression.blogspot.fr/2015/08/accessing-unaligned-memory.html for details. |
|
83 | 85 | * Prefer these methods in priority order (0 > 1 > 2) |
|
84 | 86 | */ |
|
85 | 87 | #ifndef MEM_FORCE_MEMORY_ACCESS /* can be defined externally, on command line for example */ |
|
86 | 88 | # if defined(__GNUC__) && ( defined(__ARM_ARCH_6__) || defined(__ARM_ARCH_6J__) || defined(__ARM_ARCH_6K__) || defined(__ARM_ARCH_6Z__) || defined(__ARM_ARCH_6ZK__) || defined(__ARM_ARCH_6T2__) ) |
|
87 | 89 | # define MEM_FORCE_MEMORY_ACCESS 2 |
|
88 | 90 | # elif defined(__INTEL_COMPILER) /*|| defined(_MSC_VER)*/ || \ |
|
89 | 91 | (defined(__GNUC__) && ( defined(__ARM_ARCH_7__) || defined(__ARM_ARCH_7A__) || defined(__ARM_ARCH_7R__) || defined(__ARM_ARCH_7M__) || defined(__ARM_ARCH_7S__) )) |
|
90 | 92 | # define MEM_FORCE_MEMORY_ACCESS 1 |
|
91 | 93 | # endif |
|
92 | 94 | #endif |
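Per the comment above, the access method can also be forced externally; a minimal sketch (method 0 chosen purely for illustration):

    /* Define before including mem.h (or pass -DMEM_FORCE_MEMORY_ACCESS=0 on the
     * compiler command line) to force the portable memcpy()-based method 0. */
    #define MEM_FORCE_MEMORY_ACCESS 0
    #include "mem.h"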
|
93 | 95 | |
|
94 | 96 | MEM_STATIC unsigned MEM_32bits(void) { return sizeof(size_t)==4; } |
|
95 | 97 | MEM_STATIC unsigned MEM_64bits(void) { return sizeof(size_t)==8; } |
|
96 | 98 | |
|
97 | 99 | MEM_STATIC unsigned MEM_isLittleEndian(void) |
|
98 | 100 | { |
|
99 | 101 | const union { U32 u; BYTE c[4]; } one = { 1 }; /* don't use static : performance detrimental */ |
|
100 | 102 | return one.c[0]; |
|
101 | 103 | } |
|
102 | 104 | |
|
103 | 105 | #if defined(MEM_FORCE_MEMORY_ACCESS) && (MEM_FORCE_MEMORY_ACCESS==2) |
|
104 | 106 | |
|
105 | 107 | /* violates C standard, by lying on structure alignment. |
|
106 | 108 | Only use if no other choice to achieve best performance on target platform */ |
|
107 | 109 | MEM_STATIC U16 MEM_read16(const void* memPtr) { return *(const U16*) memPtr; } |
|
108 | 110 | MEM_STATIC U32 MEM_read32(const void* memPtr) { return *(const U32*) memPtr; } |
|
109 | 111 | MEM_STATIC U64 MEM_read64(const void* memPtr) { return *(const U64*) memPtr; } |
|
110 | 112 | MEM_STATIC U64 MEM_readST(const void* memPtr) { return *(const size_t*) memPtr; } |
|
111 | 113 | |
|
112 | 114 | MEM_STATIC void MEM_write16(void* memPtr, U16 value) { *(U16*)memPtr = value; } |
|
113 | 115 | MEM_STATIC void MEM_write32(void* memPtr, U32 value) { *(U32*)memPtr = value; } |
|
114 | 116 | MEM_STATIC void MEM_write64(void* memPtr, U64 value) { *(U64*)memPtr = value; } |
|
115 | 117 | |
|
116 | 118 | #elif defined(MEM_FORCE_MEMORY_ACCESS) && (MEM_FORCE_MEMORY_ACCESS==1) |
|
117 | 119 | |
|
118 | 120 | /* __pack instructions are safer, but compiler specific, hence potentially problematic for some compilers */ |
|
119 | 121 | /* currently only defined for gcc and icc */ |
|
120 | 122 | #if defined(_MSC_VER) || (defined(__INTEL_COMPILER) && defined(WIN32)) |
|
121 | 123 | __pragma( pack(push, 1) ) |
|
122 | 124 | typedef union { U16 u16; U32 u32; U64 u64; size_t st; } unalign; |
|
123 | 125 | __pragma( pack(pop) ) |
|
124 | 126 | #else |
|
125 | 127 | typedef union { U16 u16; U32 u32; U64 u64; size_t st; } __attribute__((packed)) unalign; |
|
126 | 128 | #endif |
|
127 | 129 | |
|
128 | 130 | MEM_STATIC U16 MEM_read16(const void* ptr) { return ((const unalign*)ptr)->u16; } |
|
129 | 131 | MEM_STATIC U32 MEM_read32(const void* ptr) { return ((const unalign*)ptr)->u32; } |
|
130 | 132 | MEM_STATIC U64 MEM_read64(const void* ptr) { return ((const unalign*)ptr)->u64; } |
|
131 | 133 | MEM_STATIC U64 MEM_readST(const void* ptr) { return ((const unalign*)ptr)->st; } |
|
132 | 134 | |
|
133 | 135 | MEM_STATIC void MEM_write16(void* memPtr, U16 value) { ((unalign*)memPtr)->u16 = value; } |
|
134 | 136 | MEM_STATIC void MEM_write32(void* memPtr, U32 value) { ((unalign*)memPtr)->u32 = value; } |
|
135 | 137 | MEM_STATIC void MEM_write64(void* memPtr, U64 value) { ((unalign*)memPtr)->u64 = value; } |
|
136 | 138 | |
|
137 | 139 | #else |
|
138 | 140 | |
|
139 | 141 | /* default method, safe and standard. |
|
140 | 142 | can sometimes prove slower */ |
|
141 | 143 | |
|
142 | 144 | MEM_STATIC U16 MEM_read16(const void* memPtr) |
|
143 | 145 | { |
|
144 | 146 | U16 val; memcpy(&val, memPtr, sizeof(val)); return val; |
|
145 | 147 | } |
|
146 | 148 | |
|
147 | 149 | MEM_STATIC U32 MEM_read32(const void* memPtr) |
|
148 | 150 | { |
|
149 | 151 | U32 val; memcpy(&val, memPtr, sizeof(val)); return val; |
|
150 | 152 | } |
|
151 | 153 | |
|
152 | 154 | MEM_STATIC U64 MEM_read64(const void* memPtr) |
|
153 | 155 | { |
|
154 | 156 | U64 val; memcpy(&val, memPtr, sizeof(val)); return val; |
|
155 | 157 | } |
|
156 | 158 | |
|
157 | 159 | MEM_STATIC size_t MEM_readST(const void* memPtr) |
|
158 | 160 | { |
|
159 | 161 | size_t val; memcpy(&val, memPtr, sizeof(val)); return val; |
|
160 | 162 | } |
|
161 | 163 | |
|
162 | 164 | MEM_STATIC void MEM_write16(void* memPtr, U16 value) |
|
163 | 165 | { |
|
164 | 166 | memcpy(memPtr, &value, sizeof(value)); |
|
165 | 167 | } |
|
166 | 168 | |
|
167 | 169 | MEM_STATIC void MEM_write32(void* memPtr, U32 value) |
|
168 | 170 | { |
|
169 | 171 | memcpy(memPtr, &value, sizeof(value)); |
|
170 | 172 | } |
|
171 | 173 | |
|
172 | 174 | MEM_STATIC void MEM_write64(void* memPtr, U64 value) |
|
173 | 175 | { |
|
174 | 176 | memcpy(memPtr, &value, sizeof(value)); |
|
175 | 177 | } |
|
176 | 178 | |
|
177 | 179 | #endif /* MEM_FORCE_MEMORY_ACCESS */ |
|
178 | 180 | |
|
179 | 181 | MEM_STATIC U32 MEM_swap32(U32 in) |
|
180 | 182 | { |
|
181 | 183 | #if defined(_MSC_VER) /* Visual Studio */ |
|
182 | 184 | return _byteswap_ulong(in); |
|
183 | 185 | #elif defined (__GNUC__) |
|
184 | 186 | return __builtin_bswap32(in); |
|
185 | 187 | #else |
|
186 | 188 | return ((in << 24) & 0xff000000 ) | |
|
187 | 189 | ((in << 8) & 0x00ff0000 ) | |
|
188 | 190 | ((in >> 8) & 0x0000ff00 ) | |
|
189 | 191 | ((in >> 24) & 0x000000ff ); |
|
190 | 192 | #endif |
|
191 | 193 | } |
|
192 | 194 | |
|
193 | 195 | MEM_STATIC U64 MEM_swap64(U64 in) |
|
194 | 196 | { |
|
195 | 197 | #if defined(_MSC_VER) /* Visual Studio */ |
|
196 | 198 | return _byteswap_uint64(in); |
|
197 | 199 | #elif defined (__GNUC__) |
|
198 | 200 | return __builtin_bswap64(in); |
|
199 | 201 | #else |
|
200 | 202 | return ((in << 56) & 0xff00000000000000ULL) | |
|
201 | 203 | ((in << 40) & 0x00ff000000000000ULL) | |
|
202 | 204 | ((in << 24) & 0x0000ff0000000000ULL) | |
|
203 | 205 | ((in << 8) & 0x000000ff00000000ULL) | |
|
204 | 206 | ((in >> 8) & 0x00000000ff000000ULL) | |
|
205 | 207 | ((in >> 24) & 0x0000000000ff0000ULL) | |
|
206 | 208 | ((in >> 40) & 0x000000000000ff00ULL) | |
|
207 | 209 | ((in >> 56) & 0x00000000000000ffULL); |
|
208 | 210 | #endif |
|
209 | 211 | } |
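A quick sanity check of the shift-and-mask fallbacks above; the test function is illustrative, not part of the library:

    #include <assert.h>

    static void swap_selftest(void)
    {
        /* every byte must land in the mirrored position */
        assert(MEM_swap32(0x01020304U) == 0x04030201U);
        assert(MEM_swap64(0x0102030405060708ULL) == 0x0807060504030201ULL);
    }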
|
210 | 212 | |
|
211 | 213 | MEM_STATIC size_t MEM_swapST(size_t in) |
|
212 | 214 | { |
|
213 | 215 | if (MEM_32bits()) |
|
214 | 216 | return (size_t)MEM_swap32((U32)in); |
|
215 | 217 | else |
|
216 | 218 | return (size_t)MEM_swap64((U64)in); |
|
217 | 219 | } |
|
218 | 220 | |
|
219 | 221 | /*=== Little endian r/w ===*/ |
|
220 | 222 | |
|
221 | 223 | MEM_STATIC U16 MEM_readLE16(const void* memPtr) |
|
222 | 224 | { |
|
223 | 225 | if (MEM_isLittleEndian()) |
|
224 | 226 | return MEM_read16(memPtr); |
|
225 | 227 | else { |
|
226 | 228 | const BYTE* p = (const BYTE*)memPtr; |
|
227 | 229 | return (U16)(p[0] + (p[1]<<8)); |
|
228 | 230 | } |
|
229 | 231 | } |
|
230 | 232 | |
|
231 | 233 | MEM_STATIC void MEM_writeLE16(void* memPtr, U16 val) |
|
232 | 234 | { |
|
233 | 235 | if (MEM_isLittleEndian()) { |
|
234 | 236 | MEM_write16(memPtr, val); |
|
235 | 237 | } else { |
|
236 | 238 | BYTE* p = (BYTE*)memPtr; |
|
237 | 239 | p[0] = (BYTE)val; |
|
238 | 240 | p[1] = (BYTE)(val>>8); |
|
239 | 241 | } |
|
240 | 242 | } |
|
241 | 243 | |
|
242 | 244 | MEM_STATIC U32 MEM_readLE24(const void* memPtr) |
|
243 | 245 | { |
|
244 | 246 | return MEM_readLE16(memPtr) + (((const BYTE*)memPtr)[2] << 16); |
|
245 | 247 | } |
|
246 | 248 | |
|
247 | 249 | MEM_STATIC void MEM_writeLE24(void* memPtr, U32 val) |
|
248 | 250 | { |
|
249 | 251 | MEM_writeLE16(memPtr, (U16)val); |
|
250 | 252 | ((BYTE*)memPtr)[2] = (BYTE)(val>>16); |
|
251 | 253 | } |
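The 24-bit accessors compose a 16-bit little-endian access with an explicit third byte; a round-trip sketch (illustrative):

    #include <assert.h>

    static void le24_roundtrip(void)
    {
        BYTE buf[3];
        MEM_writeLE24(buf, 0xABCDEFU);           /* stores { 0xEF, 0xCD, 0xAB } on any host */
        assert(MEM_readLE24(buf) == 0xABCDEFU);  /* byte order is fixed, not host-dependent */
    }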
|
252 | 254 | |
|
253 | 255 | MEM_STATIC U32 MEM_readLE32(const void* memPtr) |
|
254 | 256 | { |
|
255 | 257 | if (MEM_isLittleEndian()) |
|
256 | 258 | return MEM_read32(memPtr); |
|
257 | 259 | else |
|
258 | 260 | return MEM_swap32(MEM_read32(memPtr)); |
|
259 | 261 | } |
|
260 | 262 | |
|
261 | 263 | MEM_STATIC void MEM_writeLE32(void* memPtr, U32 val32) |
|
262 | 264 | { |
|
263 | 265 | if (MEM_isLittleEndian()) |
|
264 | 266 | MEM_write32(memPtr, val32); |
|
265 | 267 | else |
|
266 | 268 | MEM_write32(memPtr, MEM_swap32(val32)); |
|
267 | 269 | } |
|
268 | 270 | |
|
269 | 271 | MEM_STATIC U64 MEM_readLE64(const void* memPtr) |
|
270 | 272 | { |
|
271 | 273 | if (MEM_isLittleEndian()) |
|
272 | 274 | return MEM_read64(memPtr); |
|
273 | 275 | else |
|
274 | 276 | return MEM_swap64(MEM_read64(memPtr)); |
|
275 | 277 | } |
|
276 | 278 | |
|
277 | 279 | MEM_STATIC void MEM_writeLE64(void* memPtr, U64 val64) |
|
278 | 280 | { |
|
279 | 281 | if (MEM_isLittleEndian()) |
|
280 | 282 | MEM_write64(memPtr, val64); |
|
281 | 283 | else |
|
282 | 284 | MEM_write64(memPtr, MEM_swap64(val64)); |
|
283 | 285 | } |
|
284 | 286 | |
|
285 | 287 | MEM_STATIC size_t MEM_readLEST(const void* memPtr) |
|
286 | 288 | { |
|
287 | 289 | if (MEM_32bits()) |
|
288 | 290 | return (size_t)MEM_readLE32(memPtr); |
|
289 | 291 | else |
|
290 | 292 | return (size_t)MEM_readLE64(memPtr); |
|
291 | 293 | } |
|
292 | 294 | |
|
293 | 295 | MEM_STATIC void MEM_writeLEST(void* memPtr, size_t val) |
|
294 | 296 | { |
|
295 | 297 | if (MEM_32bits()) |
|
296 | 298 | MEM_writeLE32(memPtr, (U32)val); |
|
297 | 299 | else |
|
298 | 300 | MEM_writeLE64(memPtr, (U64)val); |
|
299 | 301 | } |
|
300 | 302 | |
|
301 | 303 | /*=== Big endian r/w ===*/ |
|
302 | 304 | |
|
303 | 305 | MEM_STATIC U32 MEM_readBE32(const void* memPtr) |
|
304 | 306 | { |
|
305 | 307 | if (MEM_isLittleEndian()) |
|
306 | 308 | return MEM_swap32(MEM_read32(memPtr)); |
|
307 | 309 | else |
|
308 | 310 | return MEM_read32(memPtr); |
|
309 | 311 | } |
|
310 | 312 | |
|
311 | 313 | MEM_STATIC void MEM_writeBE32(void* memPtr, U32 val32) |
|
312 | 314 | { |
|
313 | 315 | if (MEM_isLittleEndian()) |
|
314 | 316 | MEM_write32(memPtr, MEM_swap32(val32)); |
|
315 | 317 | else |
|
316 | 318 | MEM_write32(memPtr, val32); |
|
317 | 319 | } |
|
318 | 320 | |
|
319 | 321 | MEM_STATIC U64 MEM_readBE64(const void* memPtr) |
|
320 | 322 | { |
|
321 | 323 | if (MEM_isLittleEndian()) |
|
322 | 324 | return MEM_swap64(MEM_read64(memPtr)); |
|
323 | 325 | else |
|
324 | 326 | return MEM_read64(memPtr); |
|
325 | 327 | } |
|
326 | 328 | |
|
327 | 329 | MEM_STATIC void MEM_writeBE64(void* memPtr, U64 val64) |
|
328 | 330 | { |
|
329 | 331 | if (MEM_isLittleEndian()) |
|
330 | 332 | MEM_write64(memPtr, MEM_swap64(val64)); |
|
331 | 333 | else |
|
332 | 334 | MEM_write64(memPtr, val64); |
|
333 | 335 | } |
|
334 | 336 | |
|
335 | 337 | MEM_STATIC size_t MEM_readBEST(const void* memPtr) |
|
336 | 338 | { |
|
337 | 339 | if (MEM_32bits()) |
|
338 | 340 | return (size_t)MEM_readBE32(memPtr); |
|
339 | 341 | else |
|
340 | 342 | return (size_t)MEM_readBE64(memPtr); |
|
341 | 343 | } |
|
342 | 344 | |
|
343 | 345 | MEM_STATIC void MEM_writeBEST(void* memPtr, size_t val) |
|
344 | 346 | { |
|
345 | 347 | if (MEM_32bits()) |
|
346 | 348 | MEM_writeBE32(memPtr, (U32)val); |
|
347 | 349 | else |
|
348 | 350 | MEM_writeBE64(memPtr, (U64)val); |
|
349 | 351 | } |
|
350 | 352 | |
|
351 | 353 | |
|
352 | 354 | /* function safe only for comparisons */ |
|
353 | 355 | MEM_STATIC U32 MEM_readMINMATCH(const void* memPtr, U32 length) |
|
354 | 356 | { |
|
355 | 357 | switch (length) |
|
356 | 358 | { |
|
357 | 359 | default : |
|
358 | 360 | case 4 : return MEM_read32(memPtr); |
|
359 | 361 | case 3 : if (MEM_isLittleEndian()) |
|
360 | 362 | return MEM_read32(memPtr)<<8; |
|
361 | 363 | else |
|
362 | 364 | return MEM_read32(memPtr)>>8; |
|
363 | 365 | } |
|
364 | 366 | } |
|
365 | 367 | |
|
366 | 368 | #if defined (__cplusplus) |
|
367 | 369 | } |
|
368 | 370 | #endif |
|
369 | 371 | |
|
370 | 372 | #endif /* MEM_H_MODULE */ |
@@ -1,83 +1,77 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Yann Collet, Facebook, Inc. |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This source code is licensed under the BSD-style license found in the |
|
6 | 6 | * LICENSE file in the root directory of this source tree. An additional grant |
|
7 | 7 | * of patent rights can be found in the PATENTS file in the same directory. |
|
8 | 8 | */ |
|
9 | 9 | |
|
10 | 10 | |
|
11 | 11 | |
|
12 | 12 | /*-************************************* |
|
13 | 13 | * Dependencies |
|
14 | 14 | ***************************************/ |
|
15 | 15 | #include <stdlib.h> /* malloc */ |
|
16 | 16 | #include "error_private.h" |
|
17 | 17 | #define ZSTD_STATIC_LINKING_ONLY |
|
18 | 18 | #include "zstd.h" /* declaration of ZSTD_isError, ZSTD_getErrorName, ZSTD_getErrorCode, ZSTD_getErrorString, ZSTD_versionNumber */ |
|
19 | #include "zbuff.h" /* declaration of ZBUFF_isError, ZBUFF_getErrorName */ | |
|
20 | 19 | |
|
21 | 20 | |
|
22 | 21 | /*-**************************************** |
|
23 | 22 | * Version |
|
24 | 23 | ******************************************/ |
|
25 | 24 | unsigned ZSTD_versionNumber (void) { return ZSTD_VERSION_NUMBER; } |
|
26 | 25 | |
|
27 | 26 | |
|
28 | 27 | /*-**************************************** |
|
29 | 28 | * ZSTD Error Management |
|
30 | 29 | ******************************************/ |
|
31 | 30 | /*! ZSTD_isError() : |
|
32 | 31 | * tells if a return value is an error code */ |
|
33 | 32 | unsigned ZSTD_isError(size_t code) { return ERR_isError(code); } |
|
34 | 33 | |
|
35 | 34 | /*! ZSTD_getErrorName() : |
|
36 | 35 | * provides error code string from function result (useful for debugging) */ |
|
37 | 36 | const char* ZSTD_getErrorName(size_t code) { return ERR_getErrorName(code); } |
|
38 | 37 | |
|
39 | 38 | /*! ZSTD_getError() : |
|
40 | 39 | * convert a `size_t` function result into a proper ZSTD_errorCode enum */ |
|
41 | 40 | ZSTD_ErrorCode ZSTD_getErrorCode(size_t code) { return ERR_getErrorCode(code); } |
|
42 | 41 | |
|
43 | 42 | /*! ZSTD_getErrorString() : |
|
44 | 43 | * provides error code string from enum */ |
|
45 | 44 | const char* ZSTD_getErrorString(ZSTD_ErrorCode code) { return ERR_getErrorName(code); } |
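These getters support zstd's convention of packing error codes into the size_t results of its functions; a hedged usage sketch (the helper name is illustrative):

    #include <stdio.h>
    #include "zstd.h"

    /* Illustrative: any size_t returned by a zstd call can be screened this way. */
    static int zstd_check(size_t result)
    {
        if (ZSTD_isError(result)) {
            fprintf(stderr, "zstd: %s\n", ZSTD_getErrorName(result));
            return -1;
        }
        return 0;
    }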
|
46 | 45 | |
|
47 | ||
|
48 | /* ************************************************************** | |
|
49 | * ZBUFF Error Management | |
|
50 | ****************************************************************/ | |
|
46 | /* --- ZBUFF Error Management (deprecated) --- */ | |
|
51 | 47 | unsigned ZBUFF_isError(size_t errorCode) { return ERR_isError(errorCode); } |
|
52 | ||
|
53 | 48 | const char* ZBUFF_getErrorName(size_t errorCode) { return ERR_getErrorName(errorCode); } |
|
54 | 49 | |
|
55 | 50 | |
|
56 | ||
|
57 | 51 | /*=************************************************************** |
|
58 | 52 | * Custom allocator |
|
59 | 53 | ****************************************************************/ |
|
60 | 54 | /* default uses stdlib */ |
|
61 | 55 | void* ZSTD_defaultAllocFunction(void* opaque, size_t size) |
|
62 | 56 | { |
|
63 | 57 | void* address = malloc(size); |
|
64 | 58 | (void)opaque; |
|
65 | 59 | return address; |
|
66 | 60 | } |
|
67 | 61 | |
|
68 | 62 | void ZSTD_defaultFreeFunction(void* opaque, void* address) |
|
69 | 63 | { |
|
70 | 64 | (void)opaque; |
|
71 | 65 | free(address); |
|
72 | 66 | } |
|
73 | 67 | |
|
74 | 68 | void* ZSTD_malloc(size_t size, ZSTD_customMem customMem) |
|
75 | 69 | { |
|
76 | 70 | return customMem.customAlloc(customMem.opaque, size); |
|
77 | 71 | } |
|
78 | 72 | |
|
79 | 73 | void ZSTD_free(void* ptr, ZSTD_customMem customMem) |
|
80 | 74 | { |
|
81 | 75 | if (ptr!=NULL) |
|
82 | 76 | customMem.customFree(customMem.opaque, ptr); |
|
83 | 77 | } |
@@ -1,267 +1,270 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Yann Collet, Facebook, Inc. |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This source code is licensed under the BSD-style license found in the |
|
6 | 6 | * LICENSE file in the root directory of this source tree. An additional grant |
|
7 | 7 | * of patent rights can be found in the PATENTS file in the same directory. |
|
8 | 8 | */ |
|
9 | 9 | |
|
10 | 10 | #ifndef ZSTD_CCOMMON_H_MODULE |
|
11 | 11 | #define ZSTD_CCOMMON_H_MODULE |
|
12 | 12 | |
|
13 | 13 | /*-******************************************************* |
|
14 | 14 | * Compiler specifics |
|
15 | 15 | *********************************************************/ |
|
16 | 16 | #ifdef _MSC_VER /* Visual Studio */ |
|
17 | 17 | # define FORCE_INLINE static __forceinline |
|
18 | 18 | # include <intrin.h> /* For Visual 2005 */ |
|
19 | 19 | # pragma warning(disable : 4127) /* disable: C4127: conditional expression is constant */ |
|
20 | 20 | # pragma warning(disable : 4324) /* disable: C4324: padded structure */ |
|
21 | 21 | # pragma warning(disable : 4100) /* disable: C4100: unreferenced formal parameter */ |
|
22 | 22 | #else |
|
23 | 23 | # if defined (__cplusplus) || defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L /* C99 */ |
|
24 | 24 | # ifdef __GNUC__ |
|
25 | 25 | # define FORCE_INLINE static inline __attribute__((always_inline)) |
|
26 | 26 | # else |
|
27 | 27 | # define FORCE_INLINE static inline |
|
28 | 28 | # endif |
|
29 | 29 | # else |
|
30 | 30 | # define FORCE_INLINE static |
|
31 | 31 | # endif /* __STDC_VERSION__ */ |
|
32 | 32 | #endif |
|
33 | 33 | |
|
34 | 34 | #ifdef _MSC_VER |
|
35 | 35 | # define FORCE_NOINLINE static __declspec(noinline) |
|
36 | 36 | #else |
|
37 | 37 | # ifdef __GNUC__ |
|
38 | 38 | # define FORCE_NOINLINE static __attribute__((__noinline__)) |
|
39 | 39 | # else |
|
40 | 40 | # define FORCE_NOINLINE static |
|
41 | 41 | # endif |
|
42 | 42 | #endif |
|
43 | 43 | |
|
44 | 44 | |
|
45 | 45 | /*-************************************* |
|
46 | 46 | * Dependencies |
|
47 | 47 | ***************************************/ |
|
48 | 48 | #include "mem.h" |
|
49 | 49 | #include "error_private.h" |
|
50 | 50 | #define ZSTD_STATIC_LINKING_ONLY |
|
51 | 51 | #include "zstd.h" |
|
52 | 52 | |
|
53 | 53 | |
|
54 | 54 | /*-************************************* |
|
55 | 55 | * shared macros |
|
56 | 56 | ***************************************/ |
|
57 | 57 | #define MIN(a,b) ((a)<(b) ? (a) : (b)) |
|
58 | 58 | #define MAX(a,b) ((a)>(b) ? (a) : (b)) |
|
59 | 59 | #define CHECK_F(f) { size_t const errcod = f; if (ERR_isError(errcod)) return errcod; } /* check and Forward error code */ |
|
60 | 60 | #define CHECK_E(f, e) { size_t const errcod = f; if (ERR_isError(errcod)) return ERROR(e); } /* check and send Error code */ |
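A sketch of how CHECK_F propagates failures up the call chain; both helpers are hypothetical and stand in for any function returning a size_t error code:

    /* hypothetical helpers, assumed to return a size_t error code on failure */
    size_t write_frame_header(void* dst, size_t dstCapacity);
    size_t write_block_header(void* dst, size_t dstCapacity);

    static size_t write_headers(void* dst, size_t dstCapacity)
    {
        CHECK_F(write_frame_header(dst, dstCapacity));   /* forwards the error code */
        CHECK_F(write_block_header(dst, dstCapacity));
        return 0;
    }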
|
61 | 61 | |
|
62 | 62 | |
|
63 | 63 | /*-************************************* |
|
64 | 64 | * Common constants |
|
65 | 65 | ***************************************/ |
|
66 | 66 | #define ZSTD_OPT_NUM (1<<12) |
|
67 | 67 | #define ZSTD_DICT_MAGIC 0xEC30A437 /* v0.7+ */ |
|
68 | 68 | |
|
69 | 69 | #define ZSTD_REP_NUM 3 /* number of repcodes */ |
|
70 | 70 | #define ZSTD_REP_CHECK (ZSTD_REP_NUM) /* number of repcodes to check by the optimal parser */ |
|
71 | 71 | #define ZSTD_REP_MOVE (ZSTD_REP_NUM-1) |
|
72 | 72 | #define ZSTD_REP_MOVE_OPT (ZSTD_REP_NUM) |
|
73 | 73 | static const U32 repStartValue[ZSTD_REP_NUM] = { 1, 4, 8 }; |
|
74 | 74 | |
|
75 | 75 | #define KB *(1 <<10) |
|
76 | 76 | #define MB *(1 <<20) |
|
77 | 77 | #define GB *(1U<<30) |
|
78 | 78 | |
|
79 | 79 | #define BIT7 128 |
|
80 | 80 | #define BIT6 64 |
|
81 | 81 | #define BIT5 32 |
|
82 | 82 | #define BIT4 16 |
|
83 | 83 | #define BIT1 2 |
|
84 | 84 | #define BIT0 1 |
|
85 | 85 | |
|
86 | 86 | #define ZSTD_WINDOWLOG_ABSOLUTEMIN 10 |
|
87 | 87 | static const size_t ZSTD_fcs_fieldSize[4] = { 0, 2, 4, 8 }; |
|
88 | 88 | static const size_t ZSTD_did_fieldSize[4] = { 0, 1, 2, 4 }; |
|
89 | 89 | |
|
90 | 90 | #define ZSTD_BLOCKHEADERSIZE 3 /* C standard doesn't allow `static const` variable to be init using another `static const` variable */ |
|
91 | 91 | static const size_t ZSTD_blockHeaderSize = ZSTD_BLOCKHEADERSIZE; |
|
92 | 92 | typedef enum { bt_raw, bt_rle, bt_compressed, bt_reserved } blockType_e; |
|
93 | 93 | |
|
94 | 94 | #define MIN_SEQUENCES_SIZE 1 /* nbSeq==0 */ |
|
95 | 95 | #define MIN_CBLOCK_SIZE (1 /*litCSize*/ + 1 /* RLE or RAW */ + MIN_SEQUENCES_SIZE /* nbSeq==0 */) /* for a non-null block */ |
|
96 | 96 | |
|
97 | 97 | #define HufLog 12 |
|
98 | 98 | typedef enum { set_basic, set_rle, set_compressed, set_repeat } symbolEncodingType_e; |
|
99 | 99 | |
|
100 | 100 | #define LONGNBSEQ 0x7F00 |
|
101 | 101 | |
|
102 | 102 | #define MINMATCH 3 |
|
103 | 103 | #define EQUAL_READ32 4 |
|
104 | 104 | |
|
105 | 105 | #define Litbits 8 |
|
106 | 106 | #define MaxLit ((1<<Litbits) - 1) |
|
107 | 107 | #define MaxML 52 |
|
108 | 108 | #define MaxLL 35 |
|
109 | 109 | #define MaxOff 28 |
|
110 | 110 | #define MaxSeq MAX(MaxLL, MaxML) /* Assumption : MaxOff < MaxLL,MaxML */ |
|
111 | 111 | #define MLFSELog 9 |
|
112 | 112 | #define LLFSELog 9 |
|
113 | 113 | #define OffFSELog 8 |
|
114 | 114 | |
|
115 | 115 | static const U32 LL_bits[MaxLL+1] = { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, |
|
116 | 116 | 1, 1, 1, 1, 2, 2, 3, 3, 4, 6, 7, 8, 9,10,11,12, |
|
117 | 117 | 13,14,15,16 }; |
|
118 | 118 | static const S16 LL_defaultNorm[MaxLL+1] = { 4, 3, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 1, 1, 1, |
|
119 | 119 | 2, 2, 2, 2, 2, 2, 2, 2, 2, 3, 2, 1, 1, 1, 1, 1, |
|
120 | 120 | -1,-1,-1,-1 }; |
|
121 | 121 | #define LL_DEFAULTNORMLOG 6 /* for static allocation */ |
|
122 | 122 | static const U32 LL_defaultNormLog = LL_DEFAULTNORMLOG; |
|
123 | 123 | |
|
124 | 124 | static const U32 ML_bits[MaxML+1] = { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, |
|
125 | 125 | 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, |
|
126 | 126 | 1, 1, 1, 1, 2, 2, 3, 3, 4, 4, 5, 7, 8, 9,10,11, |
|
127 | 127 | 12,13,14,15,16 }; |
|
128 | 128 | static const S16 ML_defaultNorm[MaxML+1] = { 1, 4, 3, 2, 2, 2, 2, 2, 2, 1, 1, 1, 1, 1, 1, 1, |
|
129 | 129 | 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, |
|
130 | 130 | 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,-1,-1, |
|
131 | 131 | -1,-1,-1,-1,-1 }; |
|
132 | 132 | #define ML_DEFAULTNORMLOG 6 /* for static allocation */ |
|
133 | 133 | static const U32 ML_defaultNormLog = ML_DEFAULTNORMLOG; |
|
134 | 134 | |
|
135 | 135 | static const S16 OF_defaultNorm[MaxOff+1] = { 1, 1, 1, 1, 1, 1, 2, 2, 2, 1, 1, 1, 1, 1, 1, 1, |
|
136 | 136 | 1, 1, 1, 1, 1, 1, 1, 1,-1,-1,-1,-1,-1 }; |
|
137 | 137 | #define OF_DEFAULTNORMLOG 5 /* for static allocation */ |
|
138 | 138 | static const U32 OF_defaultNormLog = OF_DEFAULTNORMLOG; |
|
139 | 139 | |
|
140 | 140 | |
|
141 | 141 | /*-******************************************* |
|
142 | 142 | * Shared functions to include for inlining |
|
143 | 143 | *********************************************/ |
|
144 | 144 | static void ZSTD_copy8(void* dst, const void* src) { memcpy(dst, src, 8); } |
|
145 | 145 | #define COPY8(d,s) { ZSTD_copy8(d,s); d+=8; s+=8; } |
|
146 | 146 | |
|
147 | 147 | /*! ZSTD_wildcopy() : |
|
148 | 148 | * custom version of memcpy(), can copy up to 7 bytes too many (8 bytes if length==0) */ |
|
149 | 149 | #define WILDCOPY_OVERLENGTH 8 |
|
150 | MEM_STATIC void ZSTD_wildcopy(void* dst, const void* src, size_t length)
|
150 | MEM_STATIC void ZSTD_wildcopy(void* dst, const void* src, ptrdiff_t length) | |
|
151 | 151 | { |
|
152 | 152 | const BYTE* ip = (const BYTE*)src; |
|
153 | 153 | BYTE* op = (BYTE*)dst; |
|
154 | 154 | BYTE* const oend = op + length; |
|
155 | 155 | do |
|
156 | 156 | COPY8(op, ip) |
|
157 | 157 | while (op < oend); |
|
158 | 158 | } |
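Because the loop above copies whole 8-byte chunks, the caller must leave slack past the destination end; a sketch of that contract (function name illustrative):

    /* dst must have at least length + WILDCOPY_OVERLENGTH writable bytes;
     * the copy may write up to 7 bytes past dst + length (8 if length==0). */
    static void copy_literals(BYTE* dst, const BYTE* src, size_t length)
    {
        ZSTD_wildcopy(dst, src, (ptrdiff_t)length);
    }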
|
159 | 159 | |
|
160 | 160 | MEM_STATIC void ZSTD_wildcopy_e(void* dst, const void* src, void* dstEnd) /* should be faster for decoding, but strangely, not verified on all platform */ |
|
161 | 161 | { |
|
162 | 162 | const BYTE* ip = (const BYTE*)src; |
|
163 | 163 | BYTE* op = (BYTE*)dst; |
|
164 | 164 | BYTE* const oend = (BYTE*)dstEnd; |
|
165 | 165 | do |
|
166 | 166 | COPY8(op, ip) |
|
167 | 167 | while (op < oend); |
|
168 | 168 | } |
|
169 | 169 | |
|
170 | 170 | |
|
171 | 171 | /*-******************************************* |
|
172 | 172 | * Private interfaces |
|
173 | 173 | *********************************************/ |
|
174 | 174 | typedef struct ZSTD_stats_s ZSTD_stats_t; |
|
175 | 175 | |
|
176 | 176 | typedef struct { |
|
177 | 177 | U32 off; |
|
178 | 178 | U32 len; |
|
179 | 179 | } ZSTD_match_t; |
|
180 | 180 | |
|
181 | 181 | typedef struct { |
|
182 | 182 | U32 price; |
|
183 | 183 | U32 off; |
|
184 | 184 | U32 mlen; |
|
185 | 185 | U32 litlen; |
|
186 | 186 | U32 rep[ZSTD_REP_NUM]; |
|
187 | 187 | } ZSTD_optimal_t; |
|
188 | 188 | |
|
189 | 189 | |
|
190 | 190 | typedef struct seqDef_s { |
|
191 | 191 | U32 offset; |
|
192 | 192 | U16 litLength; |
|
193 | 193 | U16 matchLength; |
|
194 | 194 | } seqDef; |
|
195 | 195 | |
|
196 | 196 | |
|
197 | 197 | typedef struct { |
|
198 | 198 | seqDef* sequencesStart; |
|
199 | 199 | seqDef* sequences; |
|
200 | 200 | BYTE* litStart; |
|
201 | 201 | BYTE* lit; |
|
202 | 202 | BYTE* llCode; |
|
203 | 203 | BYTE* mlCode; |
|
204 | 204 | BYTE* ofCode; |
|
205 | 205 | U32 longLengthID; /* 0 == no longLength; 1 == Lit.longLength; 2 == Match.longLength; */ |
|
206 | 206 | U32 longLengthPos; |
|
207 | 207 | /* opt */ |
|
208 | 208 | ZSTD_optimal_t* priceTable; |
|
209 | 209 | ZSTD_match_t* matchTable; |
|
210 | 210 | U32* matchLengthFreq; |
|
211 | 211 | U32* litLengthFreq; |
|
212 | 212 | U32* litFreq; |
|
213 | 213 | U32* offCodeFreq; |
|
214 | 214 | U32 matchLengthSum; |
|
215 | 215 | U32 matchSum; |
|
216 | 216 | U32 litLengthSum; |
|
217 | 217 | U32 litSum; |
|
218 | 218 | U32 offCodeSum; |
|
219 | 219 | U32 log2matchLengthSum; |
|
220 | 220 | U32 log2matchSum; |
|
221 | 221 | U32 log2litLengthSum; |
|
222 | 222 | U32 log2litSum; |
|
223 | 223 | U32 log2offCodeSum; |
|
224 | 224 | U32 factor; |
|
225 | U32 staticPrices; | |
|
225 | 226 | U32 cachedPrice; |
|
226 | 227 | U32 cachedLitLength; |
|
227 | 228 | const BYTE* cachedLiterals; |
|
228 | 229 | } seqStore_t; |
|
229 | 230 | |
|
230 | 231 | const seqStore_t* ZSTD_getSeqStore(const ZSTD_CCtx* ctx); |
|
231 | 232 | void ZSTD_seqToCodes(const seqStore_t* seqStorePtr); |
|
232 | 233 | int ZSTD_isSkipFrame(ZSTD_DCtx* dctx); |
|
233 | 234 | |
|
234 | 235 | /* custom memory allocation functions */ |
|
235 | 236 | void* ZSTD_defaultAllocFunction(void* opaque, size_t size); |
|
236 | 237 | void ZSTD_defaultFreeFunction(void* opaque, void* address); |
|
238 | #ifndef ZSTD_DLL_IMPORT | |
|
237 | 239 | static const ZSTD_customMem defaultCustomMem = { ZSTD_defaultAllocFunction, ZSTD_defaultFreeFunction, NULL }; |
|
240 | #endif | |
|
238 | 241 | void* ZSTD_malloc(size_t size, ZSTD_customMem customMem); |
|
239 | 242 | void ZSTD_free(void* ptr, ZSTD_customMem customMem); |
|
240 | 243 | |
|
241 | 244 | |
|
242 | 245 | /*====== common function ======*/ |
|
243 | 246 | |
|
244 | 247 | MEM_STATIC U32 ZSTD_highbit32(U32 val) |
|
245 | 248 | { |
|
246 | 249 | # if defined(_MSC_VER) /* Visual */ |
|
247 | 250 | unsigned long r=0; |
|
248 | 251 | _BitScanReverse(&r, val); |
|
249 | 252 | return (unsigned)r; |
|
250 | 253 | # elif defined(__GNUC__) && (__GNUC__ >= 3) /* GCC Intrinsic */ |
|
251 | 254 | return 31 - __builtin_clz(val); |
|
252 | 255 | # else /* Software version */ |
|
253 | 256 | static const int DeBruijnClz[32] = { 0, 9, 1, 10, 13, 21, 2, 29, 11, 14, 16, 18, 22, 25, 3, 30, 8, 12, 20, 28, 15, 17, 24, 7, 19, 27, 23, 6, 26, 5, 4, 31 }; |
|
254 | 257 | U32 v = val; |
|
255 | 258 | int r; |
|
256 | 259 | v |= v >> 1; |
|
257 | 260 | v |= v >> 2; |
|
258 | 261 | v |= v >> 4; |
|
259 | 262 | v |= v >> 8; |
|
260 | 263 | v |= v >> 16; |
|
261 | 264 | r = DeBruijnClz[(U32)(v * 0x07C4ACDDU) >> 27]; |
|
262 | 265 | return r; |
|
263 | 266 | # endif |
|
264 | 267 | } |
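ZSTD_highbit32 returns the index of the highest set bit, i.e. floor(log2(val)); a few check values (illustrative; val == 0 is undefined for all three branches):

    #include <assert.h>

    static void highbit_selftest(void)
    {
        assert(ZSTD_highbit32(1U) == 0);            /* floor(log2(1))  = 0 */
        assert(ZSTD_highbit32(32U) == 5);           /* floor(log2(32)) = 5 */
        assert(ZSTD_highbit32(0x80000000U) == 31);  /* top bit set         */
    }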
|
265 | 268 | |
|
266 | 269 | |
|
267 | 270 | #endif /* ZSTD_CCOMMON_H_MODULE */ |
@@ -1,810 +1,850 b'' | |||
|
1 | 1 | /* ****************************************************************** |
|
2 | 2 | FSE : Finite State Entropy encoder |
|
3 | 3 | Copyright (C) 2013-2015, Yann Collet. |
|
4 | 4 | |
|
5 | 5 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
6 | 6 | |
|
7 | 7 | Redistribution and use in source and binary forms, with or without |
|
8 | 8 | modification, are permitted provided that the following conditions are |
|
9 | 9 | met: |
|
10 | 10 | |
|
11 | 11 | * Redistributions of source code must retain the above copyright |
|
12 | 12 | notice, this list of conditions and the following disclaimer. |
|
13 | 13 | * Redistributions in binary form must reproduce the above |
|
14 | 14 | copyright notice, this list of conditions and the following disclaimer |
|
15 | 15 | in the documentation and/or other materials provided with the |
|
16 | 16 | distribution. |
|
17 | 17 | |
|
18 | 18 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
19 | 19 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
20 | 20 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
21 | 21 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
22 | 22 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
23 | 23 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
24 | 24 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
25 | 25 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
26 | 26 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
27 | 27 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
28 | 28 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
29 | 29 | |
|
30 | 30 | You can contact the author at : |
|
31 | 31 | - FSE source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
32 | 32 | - Public forum : https://groups.google.com/forum/#!forum/lz4c |
|
33 | 33 | ****************************************************************** */ |
|
34 | 34 | |
|
35 | 35 | /* ************************************************************** |
|
36 | 36 | * Compiler specifics |
|
37 | 37 | ****************************************************************/ |
|
38 | 38 | #ifdef _MSC_VER /* Visual Studio */ |
|
39 | 39 | # define FORCE_INLINE static __forceinline |
|
40 | 40 | # include <intrin.h> /* For Visual 2005 */ |
|
41 | 41 | # pragma warning(disable : 4127) /* disable: C4127: conditional expression is constant */ |
|
42 | 42 | # pragma warning(disable : 4214) /* disable: C4214: non-int bitfields */ |
|
43 | 43 | #else |
|
44 | 44 | # if defined (__cplusplus) || defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L /* C99 */ |
|
45 | 45 | # ifdef __GNUC__ |
|
46 | 46 | # define FORCE_INLINE static inline __attribute__((always_inline)) |
|
47 | 47 | # else |
|
48 | 48 | # define FORCE_INLINE static inline |
|
49 | 49 | # endif |
|
50 | 50 | # else |
|
51 | 51 | # define FORCE_INLINE static |
|
52 | 52 | # endif /* __STDC_VERSION__ */ |
|
53 | 53 | #endif |
|
54 | 54 | |
|
55 | 55 | |
|
56 | 56 | /* ************************************************************** |
|
57 | 57 | * Includes |
|
58 | 58 | ****************************************************************/ |
|
59 | 59 | #include <stdlib.h> /* malloc, free, qsort */ |
|
60 | 60 | #include <string.h> /* memcpy, memset */ |
|
61 | 61 | #include <stdio.h> /* printf (debug) */ |
|
62 | 62 | #include "bitstream.h" |
|
63 | 63 | #define FSE_STATIC_LINKING_ONLY |
|
64 | 64 | #include "fse.h" |
|
65 | 65 | |
|
66 | 66 | |
|
67 | 67 | /* ************************************************************** |
|
68 | 68 | * Error Management |
|
69 | 69 | ****************************************************************/ |
|
70 | 70 | #define FSE_STATIC_ASSERT(c) { enum { FSE_static_assert = 1/(int)(!!(c)) }; } /* use only *after* variable declarations */ |
|
71 | 71 | |
|
72 | 72 | |
|
73 | 73 | /* ************************************************************** |
|
74 | * Complex types | |
|
75 | ****************************************************************/ | |
|
76 | typedef U32 CTable_max_t[FSE_CTABLE_SIZE_U32(FSE_MAX_TABLELOG, FSE_MAX_SYMBOL_VALUE)]; | |
|
77 | ||
|
78 | ||
|
79 | /* ************************************************************** | |
|
80 | 74 | * Templates |
|
81 | 75 | ****************************************************************/ |
|
82 | 76 | /* |
|
83 | 77 | designed to be included |
|
84 | 78 | for type-specific functions (template emulation in C) |
|
85 | 79 | Objective is to write these functions only once, for improved maintenance |
|
86 | 80 | */ |
|
87 | 81 | |
|
88 | 82 | /* safety checks */ |
|
89 | 83 | #ifndef FSE_FUNCTION_EXTENSION |
|
90 | 84 | # error "FSE_FUNCTION_EXTENSION must be defined" |
|
91 | 85 | #endif |
|
92 | 86 | #ifndef FSE_FUNCTION_TYPE |
|
93 | 87 | # error "FSE_FUNCTION_TYPE must be defined" |
|
94 | 88 | #endif |
|
95 | 89 | |
|
96 | 90 | /* Function names */ |
|
97 | 91 | #define FSE_CAT(X,Y) X##Y |
|
98 | 92 | #define FSE_FUNCTION_NAME(X,Y) FSE_CAT(X,Y) |
|
99 | 93 | #define FSE_TYPE_NAME(X,Y) FSE_CAT(X,Y) |
|
100 | 94 | |
|
101 | 95 | |
|
102 | 96 | /* Function templates */ |
|
103 | size_t FSE_buildCTable(FSE_CTable* ct, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog) | |
|
97 | ||
|
98 | /* FSE_buildCTable_wksp() : | |
|
99 | * Same as FSE_buildCTable(), but using an externally allocated scratch buffer (`workSpace`). | |
|
100 | * wkspSize should be sized to handle the worst case, which is `(1<<max_tableLog) * sizeof(FSE_FUNCTION_TYPE)`
|
101 | * workSpace must also be properly aligned with FSE_FUNCTION_TYPE requirements | |
|
102 | */ | |
|
103 | size_t FSE_buildCTable_wksp(FSE_CTable* ct, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog, void* workSpace, size_t wkspSize) | |
|
104 | 104 | { |
|
105 | 105 | U32 const tableSize = 1 << tableLog; |
|
106 | 106 | U32 const tableMask = tableSize - 1; |
|
107 | 107 | void* const ptr = ct; |
|
108 | 108 | U16* const tableU16 = ( (U16*) ptr) + 2; |
|
109 | 109 | void* const FSCT = ((U32*)ptr) + 1 /* header */ + (tableLog ? tableSize>>1 : 1) ; |
|
110 | 110 | FSE_symbolCompressionTransform* const symbolTT = (FSE_symbolCompressionTransform*) (FSCT); |
|
111 | 111 | U32 const step = FSE_TABLESTEP(tableSize); |
|
112 | 112 | U32 cumul[FSE_MAX_SYMBOL_VALUE+2]; |
|
113 | 113 | |
|
114 | FSE_FUNCTION_TYPE tableSymbol[FSE_MAX_TABLESIZE]; /* memset() is not necessary, even if static analyzer complain about it */ | |
|
114 | FSE_FUNCTION_TYPE* const tableSymbol = (FSE_FUNCTION_TYPE*)workSpace; | |
|
115 | 115 | U32 highThreshold = tableSize-1; |
|
116 | 116 | |
|
117 | 117 | /* CTable header */ |
|
118 | if (((size_t)1 << tableLog) * sizeof(FSE_FUNCTION_TYPE) > wkspSize) return ERROR(tableLog_tooLarge); | |
|
118 | 119 | tableU16[-2] = (U16) tableLog; |
|
119 | 120 | tableU16[-1] = (U16) maxSymbolValue; |
|
120 | 121 | |
|
121 | 122 | /* For explanations on how to distribute symbol values over the table : |
|
122 | 123 | * http://fastcompression.blogspot.fr/2014/02/fse-distributing-symbol-values.html */ |
|
123 | 124 | |
|
124 | 125 | /* symbol start positions */ |
|
125 | 126 | { U32 u; |
|
126 | 127 | cumul[0] = 0; |
|
127 | 128 | for (u=1; u<=maxSymbolValue+1; u++) { |
|
128 | 129 | if (normalizedCounter[u-1]==-1) { /* Low proba symbol */ |
|
129 | 130 | cumul[u] = cumul[u-1] + 1; |
|
130 | 131 | tableSymbol[highThreshold--] = (FSE_FUNCTION_TYPE)(u-1); |
|
131 | 132 | } else { |
|
132 | 133 | cumul[u] = cumul[u-1] + normalizedCounter[u-1]; |
|
133 | 134 | } } |
|
134 | 135 | cumul[maxSymbolValue+1] = tableSize+1; |
|
135 | 136 | } |
|
136 | 137 | |
|
137 | 138 | /* Spread symbols */ |
|
138 | 139 | { U32 position = 0; |
|
139 | 140 | U32 symbol; |
|
140 | 141 | for (symbol=0; symbol<=maxSymbolValue; symbol++) { |
|
141 | 142 | int nbOccurences; |
|
142 | 143 | for (nbOccurences=0; nbOccurences<normalizedCounter[symbol]; nbOccurences++) { |
|
143 | 144 | tableSymbol[position] = (FSE_FUNCTION_TYPE)symbol; |
|
144 | 145 | position = (position + step) & tableMask; |
|
145 | 146 | while (position > highThreshold) position = (position + step) & tableMask; /* Low proba area */ |
|
146 | 147 | } } |
|
147 | 148 | |
|
148 | 149 | if (position!=0) return ERROR(GENERIC); /* Must have gone through all positions */ |
|
149 | 150 | } |
|
150 | 151 | |
|
151 | 152 | /* Build table */ |
|
152 | 153 | { U32 u; for (u=0; u<tableSize; u++) { |
|
153 | 154 | FSE_FUNCTION_TYPE s = tableSymbol[u]; /* note : static analyzer may not understand tableSymbol is properly initialized */ |
|
154 | 155 | tableU16[cumul[s]++] = (U16) (tableSize+u); /* TableU16 : sorted by symbol order; gives next state value */ |
|
155 | 156 | } } |
|
156 | 157 | |
|
157 | 158 | /* Build Symbol Transformation Table */ |
|
158 | 159 | { unsigned total = 0; |
|
159 | 160 | unsigned s; |
|
160 | 161 | for (s=0; s<=maxSymbolValue; s++) { |
|
161 | 162 | switch (normalizedCounter[s]) |
|
162 | 163 | { |
|
163 | 164 | case 0: break; |
|
164 | 165 | |
|
165 | 166 | case -1: |
|
166 | 167 | case 1: |
|
167 | 168 | symbolTT[s].deltaNbBits = (tableLog << 16) - (1<<tableLog); |
|
168 | 169 | symbolTT[s].deltaFindState = total - 1; |
|
169 | 170 | total ++; |
|
170 | 171 | break; |
|
171 | 172 | default : |
|
172 | 173 | { |
|
173 | 174 | U32 const maxBitsOut = tableLog - BIT_highbit32 (normalizedCounter[s]-1); |
|
174 | 175 | U32 const minStatePlus = normalizedCounter[s] << maxBitsOut; |
|
175 | 176 | symbolTT[s].deltaNbBits = (maxBitsOut << 16) - minStatePlus; |
|
176 | 177 | symbolTT[s].deltaFindState = total - normalizedCounter[s]; |
|
177 | 178 | total += normalizedCounter[s]; |
|
178 | 179 | } } } } |
|
179 | 180 | |
|
180 | 181 | return 0; |
|
181 | 182 | } |
|
182 | 183 | |
|
183 | 184 | |
|
185 | size_t FSE_buildCTable(FSE_CTable* ct, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog) | |
|
186 | { | |
|
187 | FSE_FUNCTION_TYPE tableSymbol[FSE_MAX_TABLESIZE]; /* memset() is not necessary, even if static analyzer complain about it */ | |
|
188 | return FSE_buildCTable_wksp(ct, normalizedCounter, maxSymbolValue, tableLog, tableSymbol, sizeof(tableSymbol)); | |
|
189 | } | |
|
190 | ||
|
191 | ||
|
184 | 192 | |
|
185 | 193 | #ifndef FSE_COMMONDEFS_ONLY |
|
186 | 194 | |
|
187 | 195 | /*-************************************************************** |
|
188 | 196 | * FSE NCount encoding-decoding |
|
189 | 197 | ****************************************************************/ |
|
190 | 198 | size_t FSE_NCountWriteBound(unsigned maxSymbolValue, unsigned tableLog) |
|
191 | 199 | { |
|
192 | size_t maxHeaderSize = (((maxSymbolValue+1) * tableLog) >> 3) + 3; | |
|
200 | size_t const maxHeaderSize = (((maxSymbolValue+1) * tableLog) >> 3) + 3; | |
|
193 | 201 | return maxSymbolValue ? maxHeaderSize : FSE_NCOUNTBOUND; /* maxSymbolValue==0 ? use default */ |
|
194 | 202 | } |
|
195 | 203 | |
|
196 | 204 | static short FSE_abs(short a) { return (short)(a<0 ? -a : a); } |
|
197 | 205 | |
|
198 | 206 | static size_t FSE_writeNCount_generic (void* header, size_t headerBufferSize, |
|
199 | 207 | const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog, |
|
200 | 208 | unsigned writeIsSafe) |
|
201 | 209 | { |
|
202 | 210 | BYTE* const ostart = (BYTE*) header; |
|
203 | 211 | BYTE* out = ostart; |
|
204 | 212 | BYTE* const oend = ostart + headerBufferSize; |
|
205 | 213 | int nbBits; |
|
206 | 214 | const int tableSize = 1 << tableLog; |
|
207 | 215 | int remaining; |
|
208 | 216 | int threshold; |
|
209 | 217 | U32 bitStream; |
|
210 | 218 | int bitCount; |
|
211 | 219 | unsigned charnum = 0; |
|
212 | 220 | int previous0 = 0; |
|
213 | 221 | |
|
214 | 222 | bitStream = 0; |
|
215 | 223 | bitCount = 0; |
|
216 | 224 | /* Table Size */ |
|
217 | 225 | bitStream += (tableLog-FSE_MIN_TABLELOG) << bitCount; |
|
218 | 226 | bitCount += 4; |
|
219 | 227 | |
|
220 | 228 | /* Init */ |
|
221 | 229 | remaining = tableSize+1; /* +1 for extra accuracy */ |
|
222 | 230 | threshold = tableSize; |
|
223 | 231 | nbBits = tableLog+1; |
|
224 | 232 | |
|
225 | 233 | while (remaining>1) { /* stops at 1 */ |
|
226 | 234 | if (previous0) { |
|
227 | 235 | unsigned start = charnum; |
|
228 | 236 | while (!normalizedCounter[charnum]) charnum++; |
|
229 | 237 | while (charnum >= start+24) { |
|
230 | 238 | start+=24; |
|
231 | 239 | bitStream += 0xFFFFU << bitCount; |
|
232 | 240 | if ((!writeIsSafe) && (out > oend-2)) return ERROR(dstSize_tooSmall); /* Buffer overflow */ |
|
233 | 241 | out[0] = (BYTE) bitStream; |
|
234 | 242 | out[1] = (BYTE)(bitStream>>8); |
|
235 | 243 | out+=2; |
|
236 | 244 | bitStream>>=16; |
|
237 | 245 | } |
|
238 | 246 | while (charnum >= start+3) { |
|
239 | 247 | start+=3; |
|
240 | 248 | bitStream += 3 << bitCount; |
|
241 | 249 | bitCount += 2; |
|
242 | 250 | } |
|
243 | 251 | bitStream += (charnum-start) << bitCount; |
|
244 | 252 | bitCount += 2; |
|
245 | 253 | if (bitCount>16) { |
|
246 | 254 | if ((!writeIsSafe) && (out > oend - 2)) return ERROR(dstSize_tooSmall); /* Buffer overflow */ |
|
247 | 255 | out[0] = (BYTE)bitStream; |
|
248 | 256 | out[1] = (BYTE)(bitStream>>8); |
|
249 | 257 | out += 2; |
|
250 | 258 | bitStream >>= 16; |
|
251 | 259 | bitCount -= 16; |
|
252 | 260 | } } |
|
253 | 261 | { short count = normalizedCounter[charnum++]; |
|
254 | 262 | const short max = (short)((2*threshold-1)-remaining); |
|
255 | 263 | remaining -= FSE_abs(count); |
|
256 | 264 | if (remaining<1) return ERROR(GENERIC); |
|
257 | 265 | count++; /* +1 for extra accuracy */ |
|
258 | 266 | if (count>=threshold) count += max; /* [0..max[ [max..threshold[ (...) [threshold+max 2*threshold[ */ |
|
259 | 267 | bitStream += count << bitCount; |
|
260 | 268 | bitCount += nbBits; |
|
261 | 269 | bitCount -= (count<max); |
|
262 | 270 | previous0 = (count==1); |
|
263 | 271 | while (remaining<threshold) nbBits--, threshold>>=1; |
|
264 | 272 | } |
|
265 | 273 | if (bitCount>16) { |
|
266 | 274 | if ((!writeIsSafe) && (out > oend - 2)) return ERROR(dstSize_tooSmall); /* Buffer overflow */ |
|
267 | 275 | out[0] = (BYTE)bitStream; |
|
268 | 276 | out[1] = (BYTE)(bitStream>>8); |
|
269 | 277 | out += 2; |
|
270 | 278 | bitStream >>= 16; |
|
271 | 279 | bitCount -= 16; |
|
272 | 280 | } } |
|
273 | 281 | |
|
274 | 282 | /* flush remaining bitStream */ |
|
275 | 283 | if ((!writeIsSafe) && (out > oend - 2)) return ERROR(dstSize_tooSmall); /* Buffer overflow */ |
|
276 | 284 | out[0] = (BYTE)bitStream; |
|
277 | 285 | out[1] = (BYTE)(bitStream>>8); |
|
278 | 286 | out+= (bitCount+7) /8; |
|
279 | 287 | |
|
280 | 288 | if (charnum > maxSymbolValue + 1) return ERROR(GENERIC); |
|
281 | 289 | |
|
282 | 290 | return (out-ostart); |
|
283 | 291 | } |
|
284 | 292 | |
|
285 | 293 | |
|
286 | 294 | size_t FSE_writeNCount (void* buffer, size_t bufferSize, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog) |
|
287 | 295 | { |
|
288 | 296 | if (tableLog > FSE_MAX_TABLELOG) return ERROR(GENERIC); /* Unsupported */ |
|
289 | 297 | if (tableLog < FSE_MIN_TABLELOG) return ERROR(GENERIC); /* Unsupported */ |
|
290 | 298 | |
|
291 | 299 | if (bufferSize < FSE_NCountWriteBound(maxSymbolValue, tableLog)) |
|
292 | 300 | return FSE_writeNCount_generic(buffer, bufferSize, normalizedCounter, maxSymbolValue, tableLog, 0); |
|
293 | 301 | |
|
294 | 302 | return FSE_writeNCount_generic(buffer, bufferSize, normalizedCounter, maxSymbolValue, tableLog, 1); |
|
295 | 303 | } |
|
296 | 304 | |
|
297 | 305 | |
|
298 | 306 | |
|
299 | 307 | /*-************************************************************** |
|
300 | 308 | * Counting histogram |
|
301 | 309 | ****************************************************************/ |
|
302 | 310 | /*! FSE_count_simple |
|
303 | This function counts byte values within `src`,
|
304 | and store the histogram into table `count`. | |
|
305 | It doesn't use any additional memory.
|
|
|
311 | This function counts byte values within `src`, and stores the histogram into table `count`. |
|
312 | It doesn't use any additional memory. | |
|
313 | But this function is unsafe : it doesn't check that all values within `src` can fit into `count`. | |
|
306 | 314 | For this reason, prefer using a table `count` with 256 elements. |
|
307 | 315 | @return : count of most numerous element |
|
308 | 316 | */ |
|
309 | size_t FSE_count_simple(unsigned* count, unsigned* maxSymbolValuePtr,
|
|
310 | const void* src, size_t srcSize)
|
|
317 | size_t FSE_count_simple(unsigned* count, unsigned* maxSymbolValuePtr, | |
|
318 | const void* src, size_t srcSize) | |
|
311 | 319 | { |
|
312 | 320 | const BYTE* ip = (const BYTE*)src; |
|
313 | 321 | const BYTE* const end = ip + srcSize; |
|
314 | 322 | unsigned maxSymbolValue = *maxSymbolValuePtr; |
|
315 | 323 | unsigned max=0; |
|
316 | 324 | |
|
317 | ||
|
318 | 325 | memset(count, 0, (maxSymbolValue+1)*sizeof(*count)); |
|
319 | 326 | if (srcSize==0) { *maxSymbolValuePtr = 0; return 0; } |
|
320 | 327 | |
|
321 | 328 | while (ip<end) count[*ip++]++; |
|
322 | 329 | |
|
323 | 330 | while (!count[maxSymbolValue]) maxSymbolValue--; |
|
324 | 331 | *maxSymbolValuePtr = maxSymbolValue; |
|
325 | 332 | |
|
326 | 333 | { U32 s; for (s=0; s<=maxSymbolValue; s++) if (count[s] > max) max = count[s]; } |
|
327 | 334 | |
|
328 | 335 | return (size_t)max; |
|
329 | 336 | } |
|
330 | 337 | |
|
331 | 338 | |
|
332 | static size_t FSE_count_parallel(unsigned* count, unsigned* maxSymbolValuePtr, | |
|
339 | /* FSE_count_parallel_wksp() : | |
|
340 | * Same as FSE_count_parallel(), but using an externally provided scratch buffer. | |
|
341 | * `workSpace` size must be a minimum of `1024 * sizeof(unsigned)` */
|
342 | static size_t FSE_count_parallel_wksp( | |
|
343 | unsigned* count, unsigned* maxSymbolValuePtr, | |
|
333 | 344 | const void* source, size_t sourceSize, |
|
334 | unsigned checkMax) | |
|
345 | unsigned checkMax, unsigned* const workSpace) | |
|
335 | 346 | { |
|
336 | 347 | const BYTE* ip = (const BYTE*)source; |
|
337 | 348 | const BYTE* const iend = ip+sourceSize; |
|
338 | 349 | unsigned maxSymbolValue = *maxSymbolValuePtr; |
|
339 | 350 | unsigned max=0; |
|
340 | ||
|
351 | U32* const Counting1 = workSpace; | |
|
352 | U32* const Counting2 = Counting1 + 256; | |
|
353 | U32* const Counting3 = Counting2 + 256; | |
|
354 | U32* const Counting4 = Counting3 + 256; | |
|
341 | 355 | |
|
342 | U32 Counting1[256] = { 0 }; | |
|
343 | U32 Counting2[256] = { 0 }; | |
|
344 | U32 Counting3[256] = { 0 }; | |
|
345 | U32 Counting4[256] = { 0 }; | |
|
356 | memset(Counting1, 0, 4*256*sizeof(unsigned)); | |
|
346 | 357 | |
|
347 | 358 | /* safety checks */ |
|
348 | 359 | if (!sourceSize) { |
|
349 | 360 | memset(count, 0, maxSymbolValue + 1); |
|
350 | 361 | *maxSymbolValuePtr = 0; |
|
351 | 362 | return 0; |
|
352 | 363 | } |
|
353 | 364 | if (!maxSymbolValue) maxSymbolValue = 255; /* 0 == default */ |
|
354 | 365 | |
|
355 | 366 | /* by stripes of 16 bytes */ |
|
356 | 367 | { U32 cached = MEM_read32(ip); ip += 4; |
|
357 | 368 | while (ip < iend-15) { |
|
358 | 369 | U32 c = cached; cached = MEM_read32(ip); ip += 4; |
|
359 | 370 | Counting1[(BYTE) c ]++; |
|
360 | 371 | Counting2[(BYTE)(c>>8) ]++; |
|
361 | 372 | Counting3[(BYTE)(c>>16)]++; |
|
362 | 373 | Counting4[ c>>24 ]++; |
|
363 | 374 | c = cached; cached = MEM_read32(ip); ip += 4; |
|
364 | 375 | Counting1[(BYTE) c ]++; |
|
365 | 376 | Counting2[(BYTE)(c>>8) ]++; |
|
366 | 377 | Counting3[(BYTE)(c>>16)]++; |
|
367 | 378 | Counting4[ c>>24 ]++; |
|
368 | 379 | c = cached; cached = MEM_read32(ip); ip += 4; |
|
369 | 380 | Counting1[(BYTE) c ]++; |
|
370 | 381 | Counting2[(BYTE)(c>>8) ]++; |
|
371 | 382 | Counting3[(BYTE)(c>>16)]++; |
|
372 | 383 | Counting4[ c>>24 ]++; |
|
373 | 384 | c = cached; cached = MEM_read32(ip); ip += 4; |
|
374 | 385 | Counting1[(BYTE) c ]++; |
|
375 | 386 | Counting2[(BYTE)(c>>8) ]++; |
|
376 | 387 | Counting3[(BYTE)(c>>16)]++; |
|
377 | 388 | Counting4[ c>>24 ]++; |
|
378 | 389 | } |
|
379 | 390 | ip-=4; |
|
380 | 391 | } |
|
381 | 392 | |
|
382 | 393 | /* finish last symbols */ |
|
383 | 394 | while (ip<iend) Counting1[*ip++]++; |
|
384 | 395 | |
|
385 | 396 | if (checkMax) { /* verify stats will fit into destination table */ |
|
386 | 397 | U32 s; for (s=255; s>maxSymbolValue; s--) { |
|
387 | 398 | Counting1[s] += Counting2[s] + Counting3[s] + Counting4[s]; |
|
388 | 399 | if (Counting1[s]) return ERROR(maxSymbolValue_tooSmall); |
|
389 | 400 | } } |
|
390 | 401 | |
|
391 | { U32 s; for (s=0; s<=maxSymbolValue; s++) { | |
|
392 | count[s] = Counting1[s] + Counting2[s] + Counting3[s] + Counting4[s]; | |
|
393 | if (count[s] > max) max = count[s]; | |
|
394 | }} | |
|
402 | { U32 s; for (s=0; s<=maxSymbolValue; s++) { | |
|
403 | count[s] = Counting1[s] + Counting2[s] + Counting3[s] + Counting4[s]; | |
|
404 | if (count[s] > max) max = count[s]; | |
|
405 | } } | |
|
395 | 406 | |
|
396 | 407 | while (!count[maxSymbolValue]) maxSymbolValue--; |
|
397 | 408 | *maxSymbolValuePtr = maxSymbolValue; |
|
398 | 409 | return (size_t)max; |
|
399 | 410 | } |
|
400 | 411 | |
|
412 | /* FSE_countFast_wksp() : | |
|
413 | * Same as FSE_countFast(), but using an externally provided scratch buffer. | |
|
414 | * `workSpace` must be a table of at least 1024 unsigned */
|
415 | size_t FSE_countFast_wksp(unsigned* count, unsigned* maxSymbolValuePtr, | |
|
416 | const void* source, size_t sourceSize, unsigned* workSpace) | |
|
417 | { | |
|
418 | if (sourceSize < 1500) return FSE_count_simple(count, maxSymbolValuePtr, source, sourceSize); | |
|
419 | return FSE_count_parallel_wksp(count, maxSymbolValuePtr, source, sourceSize, 0, workSpace); | |
|
420 | } | |
|
421 | ||
|
401 | 422 | /* fast variant (unsafe : won't check if src contains values beyond count[] limit) */ |
|
402 | 423 | size_t FSE_countFast(unsigned* count, unsigned* maxSymbolValuePtr, |
|
403 | 424 | const void* source, size_t sourceSize) |
|
404 | 425 | { |
|
405 | if (sourceSize < 1500) return FSE_count_simple(count, maxSymbolValuePtr, source, sourceSize); | |
|
406 | return FSE_count_parallel(count, maxSymbolValuePtr, source, sourceSize, 0);
|
426 | unsigned tmpCounters[1024]; | |
|
427 | return FSE_countFast_wksp(count, maxSymbolValuePtr, source, sourceSize, tmpCounters); | |
|
428 | } | |
|
429 | ||
|
430 | /* FSE_count_wksp() : | |
|
431 | * Same as FSE_count(), but using an externally provided scratch buffer. | |
|
432 | * `workSpace` must be a table of at least 1024 unsigned */
|
433 | size_t FSE_count_wksp(unsigned* count, unsigned* maxSymbolValuePtr, | |
|
434 | const void* source, size_t sourceSize, unsigned* workSpace) | |
|
435 | { | |
|
436 | if (*maxSymbolValuePtr < 255) | |
|
437 | return FSE_count_parallel_wksp(count, maxSymbolValuePtr, source, sourceSize, 1, workSpace); | |
|
438 | *maxSymbolValuePtr = 255; | |
|
439 | return FSE_countFast_wksp(count, maxSymbolValuePtr, source, sourceSize, workSpace); | |
|
407 | 440 | } |
|
408 | 441 | |
|
409 | 442 | size_t FSE_count(unsigned* count, unsigned* maxSymbolValuePtr, |
|
410 | const void* source, size_t sourceSize)
|
443 | const void* src, size_t srcSize) | |
|
411 | 444 | { |
|
412 | if (*maxSymbolValuePtr <255) | |
|
413 | return FSE_count_parallel(count, maxSymbolValuePtr, source, sourceSize, 1);
|
|
|
414 | *maxSymbolValuePtr = 255; | |
|
415 | return FSE_countFast(count, maxSymbolValuePtr, source, sourceSize); | |
|
445 | unsigned tmpCounters[1024]; | |
|
446 | return FSE_count_wksp(count, maxSymbolValuePtr, src, srcSize, tmpCounters); | |
|
416 | 447 | } |
|
417 | 448 | |
|
418 | 449 | |
|
419 | 450 | |
|
420 | 451 | /*-************************************************************** |
|
421 | 452 | * FSE Compression Code |
|
422 | 453 | ****************************************************************/ |
|
423 | 454 | /*! FSE_sizeof_CTable() : |
|
424 | 455 | FSE_CTable is a variable size structure which contains : |
|
425 | 456 | `U16 tableLog;` |
|
426 | 457 | `U16 maxSymbolValue;` |
|
427 | 458 | `U16 nextStateNumber[1 << tableLog];` // This size is variable |
|
428 | 459 | `FSE_symbolCompressionTransform symbolTT[maxSymbolValue+1];` // This size is variable |
|
429 | 460 | Allocation is manual (C standard does not support variable-size structures). |
|
430 | 461 | */ |
|
431 | ||
|
432 | 462 | size_t FSE_sizeof_CTable (unsigned maxSymbolValue, unsigned tableLog) |
|
433 | 463 | { |
|
434 | size_t size; | |
|
435 | FSE_STATIC_ASSERT((size_t)FSE_CTABLE_SIZE_U32(FSE_MAX_TABLELOG, FSE_MAX_SYMBOL_VALUE)*4 >= sizeof(CTable_max_t)); /* A compilation error here means FSE_CTABLE_SIZE_U32 is not large enough */ | |
|
436 | if (tableLog > FSE_MAX_TABLELOG) return ERROR(GENERIC); | |
|
437 | size = FSE_CTABLE_SIZE_U32 (tableLog, maxSymbolValue) * sizeof(U32); | |
|
438 | return size; | |
|
464 | if (tableLog > FSE_MAX_TABLELOG) return ERROR(tableLog_tooLarge); | |
|
465 | return FSE_CTABLE_SIZE_U32 (tableLog, maxSymbolValue) * sizeof(U32); | |
|
439 | 466 | } |
|
440 | 467 | |
|
441 | 468 | FSE_CTable* FSE_createCTable (unsigned maxSymbolValue, unsigned tableLog) |
|
442 | 469 | { |
|
443 | 470 | size_t size; |
|
444 | 471 | if (tableLog > FSE_TABLELOG_ABSOLUTE_MAX) tableLog = FSE_TABLELOG_ABSOLUTE_MAX; |
|
445 | 472 | size = FSE_CTABLE_SIZE_U32 (tableLog, maxSymbolValue) * sizeof(U32); |
|
446 | 473 | return (FSE_CTable*)malloc(size); |
|
447 | 474 | } |
|
448 | 475 | |
|
449 | 476 | void FSE_freeCTable (FSE_CTable* ct) { free(ct); } |
|
450 | 477 | |
|
451 | 478 | /* provides the minimum logSize to safely represent a distribution */ |
|
452 | 479 | static unsigned FSE_minTableLog(size_t srcSize, unsigned maxSymbolValue) |
|
453 | 480 | { |
|
454 | 481 | U32 minBitsSrc = BIT_highbit32((U32)(srcSize - 1)) + 1; |
|
455 | 482 | U32 minBitsSymbols = BIT_highbit32(maxSymbolValue) + 2; |
|
456 | 483 | U32 minBits = minBitsSrc < minBitsSymbols ? minBitsSrc : minBitsSymbols; |
|
457 | 484 | return minBits; |
|
458 | 485 | } |
|
459 | 486 | |
|
460 | 487 | unsigned FSE_optimalTableLog_internal(unsigned maxTableLog, size_t srcSize, unsigned maxSymbolValue, unsigned minus) |
|
461 | 488 | { |
|
462 | 489 | U32 maxBitsSrc = BIT_highbit32((U32)(srcSize - 1)) - minus; |
|
463 | 490 | U32 tableLog = maxTableLog; |
|
464 | 491 | U32 minBits = FSE_minTableLog(srcSize, maxSymbolValue); |
|
465 | 492 | if (tableLog==0) tableLog = FSE_DEFAULT_TABLELOG; |
|
466 | 493 | if (maxBitsSrc < tableLog) tableLog = maxBitsSrc; /* Accuracy can be reduced */ |
|
467 | 494 | if (minBits > tableLog) tableLog = minBits; /* Need a minimum to safely represent all symbol values */ |
|
468 | 495 | if (tableLog < FSE_MIN_TABLELOG) tableLog = FSE_MIN_TABLELOG; |
|
469 | 496 | if (tableLog > FSE_MAX_TABLELOG) tableLog = FSE_MAX_TABLELOG; |
|
470 | 497 | return tableLog; |
|
471 | 498 | } |
|
472 | 499 | |
|
473 | 500 | unsigned FSE_optimalTableLog(unsigned maxTableLog, size_t srcSize, unsigned maxSymbolValue) |
|
474 | 501 | { |
|
475 | 502 | return FSE_optimalTableLog_internal(maxTableLog, srcSize, maxSymbolValue, 2); |
|
476 | 503 | } |
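
A worked pass through FSE_optimalTableLog() makes the clamping order concrete; the numbers below assume FSE_DEFAULT_TABLELOG == 11:

/* FSE_optimalTableLog(0, 4096, 255), i.e. minus == 2 :
 *   maxBitsSrc     = BIT_highbit32(4095) - 2 = 11 - 2 = 9
 *   minBitsSrc     = BIT_highbit32(4095) + 1 = 12
 *   minBitsSymbols = BIT_highbit32(255)  + 2 = 9
 *   minBits        = min(12, 9) = 9
 * tableLog : 0 -> default (11) -> reduced to maxBitsSrc (9) ;
 * minBits is already satisfied and 9 lies within
 * [FSE_MIN_TABLELOG, FSE_MAX_TABLELOG], so the function returns 9. */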
|
477 | 504 | |
|
478 | 505 | |
|
479 | 506 | /* Secondary normalization method. |
|
480 | 507 | To be used when primary method fails. */ |
|
481 | 508 | |
|
482 | 509 | static size_t FSE_normalizeM2(short* norm, U32 tableLog, const unsigned* count, size_t total, U32 maxSymbolValue) |
|
483 | 510 | { |
|
484 | 511 | U32 s; |
|
485 | 512 | U32 distributed = 0; |
|
486 | 513 | U32 ToDistribute; |
|
487 | 514 | |
|
488 | 515 | /* Init */ |
|
489 | U32 lowThreshold = (U32)(total >> tableLog); | |
|
516 | U32 const lowThreshold = (U32)(total >> tableLog); | |
|
490 | 517 | U32 lowOne = (U32)((total * 3) >> (tableLog + 1)); |
|
491 | 518 | |
|
492 | 519 | for (s=0; s<=maxSymbolValue; s++) { |
|
493 | 520 | if (count[s] == 0) { |
|
494 | 521 | norm[s]=0; |
|
495 | 522 | continue; |
|
496 | 523 | } |
|
497 | 524 | if (count[s] <= lowThreshold) { |
|
498 | 525 | norm[s] = -1; |
|
499 | 526 | distributed++; |
|
500 | 527 | total -= count[s]; |
|
501 | 528 | continue; |
|
502 | 529 | } |
|
503 | 530 | if (count[s] <= lowOne) { |
|
504 | 531 | norm[s] = 1; |
|
505 | 532 | distributed++; |
|
506 | 533 | total -= count[s]; |
|
507 | 534 | continue; |
|
508 | 535 | } |
|
509 | 536 | norm[s]=-2; |
|
510 | 537 | } |
|
511 | 538 | ToDistribute = (1 << tableLog) - distributed; |
|
512 | 539 | |
|
513 | 540 | if ((total / ToDistribute) > lowOne) { |
|
514 | 541 | /* risk of rounding to zero */ |
|
515 | 542 | lowOne = (U32)((total * 3) / (ToDistribute * 2)); |
|
516 | 543 | for (s=0; s<=maxSymbolValue; s++) { |
|
517 | 544 | if ((norm[s] == -2) && (count[s] <= lowOne)) { |
|
518 | 545 | norm[s] = 1; |
|
519 | 546 | distributed++; |
|
520 | 547 | total -= count[s]; |
|
521 | 548 | continue; |
|
522 | 549 | } } |
|
523 | 550 | ToDistribute = (1 << tableLog) - distributed; |
|
524 | 551 | } |
|
525 | 552 | |
|
526 | 553 | if (distributed == maxSymbolValue+1) { |
|
527 | 554 | /* all values are pretty poor; |
|
528 | 555 | probably incompressible data (should have already been detected); |
|
529 | 556 | find max, then give all remaining points to max */ |
|
530 | 557 | U32 maxV = 0, maxC = 0; |
|
531 | 558 | for (s=0; s<=maxSymbolValue; s++) |
|
532 | 559 | if (count[s] > maxC) maxV=s, maxC=count[s]; |
|
533 | 560 | norm[maxV] += (short)ToDistribute; |
|
534 | 561 | return 0; |
|
535 | 562 | } |
|
536 | 563 | |
|
537 | { | |
|
538 | U64 const vStepLog = 62 - tableLog; | |
|
564 | { U64 const vStepLog = 62 - tableLog; | |
|
539 | 565 | U64 const mid = (1ULL << (vStepLog-1)) - 1; |
|
540 | 566 | U64 const rStep = ((((U64)1<<vStepLog) * ToDistribute) + mid) / total; /* scale on remaining */ |
|
541 | 567 | U64 tmpTotal = mid; |
|
542 | 568 | for (s=0; s<=maxSymbolValue; s++) { |
|
543 | 569 | if (norm[s]==-2) { |
|
544 | U64 end = tmpTotal + (count[s] * rStep); | |
|
545 | U32 sStart = (U32)(tmpTotal >> vStepLog); | |
|
546 | U32 sEnd = (U32)(end >> vStepLog); | |
|
547 | U32 weight = sEnd - sStart; | |
|
570 | U64 const end = tmpTotal + (count[s] * rStep); | |
|
571 | U32 const sStart = (U32)(tmpTotal >> vStepLog); | |
|
572 | U32 const sEnd = (U32)(end >> vStepLog); | |
|
573 | U32 const weight = sEnd - sStart; | |
|
548 | 574 | if (weight < 1) |
|
549 | 575 | return ERROR(GENERIC); |
|
550 | 576 | norm[s] = (short)weight; |
|
551 | 577 | tmpTotal = end; |
|
552 | 578 | } } } |
|
553 | 579 | |
|
554 | 580 | return 0; |
|
555 | 581 | } |
|
556 | 582 | |
|
557 | 583 | |
|
558 | 584 | size_t FSE_normalizeCount (short* normalizedCounter, unsigned tableLog, |
|
559 | 585 | const unsigned* count, size_t total, |
|
560 | 586 | unsigned maxSymbolValue) |
|
561 | 587 | { |
|
562 | 588 | /* Sanity checks */ |
|
563 | 589 | if (tableLog==0) tableLog = FSE_DEFAULT_TABLELOG; |
|
564 | 590 | if (tableLog < FSE_MIN_TABLELOG) return ERROR(GENERIC); /* Unsupported size */ |
|
565 | 591 | if (tableLog > FSE_MAX_TABLELOG) return ERROR(tableLog_tooLarge); /* Unsupported size */ |
|
566 | 592 | if (tableLog < FSE_minTableLog(total, maxSymbolValue)) return ERROR(GENERIC); /* Too small tableLog, compression potentially impossible */ |
|
567 | 593 | |
|
568 | 594 | { U32 const rtbTable[] = { 0, 473195, 504333, 520860, 550000, 700000, 750000, 830000 }; |
|
569 | ||
|
570 | 595 | U64 const scale = 62 - tableLog; |
|
571 | 596 | U64 const step = ((U64)1<<62) / total; /* <== here, one division ! */ |
|
572 | 597 | U64 const vStep = 1ULL<<(scale-20); |
|
573 | 598 | int stillToDistribute = 1<<tableLog; |
|
574 | 599 | unsigned s; |
|
575 | 600 | unsigned largest=0; |
|
576 | 601 | short largestP=0; |
|
577 | 602 | U32 lowThreshold = (U32)(total >> tableLog); |
|
578 | 603 | |
|
579 | 604 | for (s=0; s<=maxSymbolValue; s++) { |
|
580 | 605 | if (count[s] == total) return 0; /* rle special case */ |
|
581 | 606 | if (count[s] == 0) { normalizedCounter[s]=0; continue; } |
|
582 | 607 | if (count[s] <= lowThreshold) { |
|
583 | 608 | normalizedCounter[s] = -1; |
|
584 | 609 | stillToDistribute--; |
|
585 | 610 | } else { |
|
586 | 611 | short proba = (short)((count[s]*step) >> scale); |
|
587 | 612 | if (proba<8) { |
|
588 | 613 | U64 restToBeat = vStep * rtbTable[proba]; |
|
589 | 614 | proba += (count[s]*step) - ((U64)proba<<scale) > restToBeat; |
|
590 | 615 | } |
|
591 | 616 | if (proba > largestP) largestP=proba, largest=s; |
|
592 | 617 | normalizedCounter[s] = proba; |
|
593 | 618 | stillToDistribute -= proba; |
|
594 | 619 | } } |
|
595 | 620 | if (-stillToDistribute >= (normalizedCounter[largest] >> 1)) { |
|
596 | 621 | /* corner case, need another normalization method */ |
|
597 | size_t errorCode = FSE_normalizeM2(normalizedCounter, tableLog, count, total, maxSymbolValue); | |
|
622 | size_t const errorCode = FSE_normalizeM2(normalizedCounter, tableLog, count, total, maxSymbolValue); | |
|
598 | 623 | if (FSE_isError(errorCode)) return errorCode; |
|
599 | 624 | } |
|
600 | 625 | else normalizedCounter[largest] += (short)stillToDistribute; |
|
601 | 626 | } |
|
602 | 627 | |
|
603 | 628 | #if 0 |
|
604 | 629 | { /* Print Table (debug) */ |
|
605 | 630 | U32 s; |
|
606 | 631 | U32 nTotal = 0; |
|
607 | 632 | for (s=0; s<=maxSymbolValue; s++) |
|
608 | 633 | printf("%3i: %4i \n", s, normalizedCounter[s]); |
|
609 | 634 | for (s=0; s<=maxSymbolValue; s++) |
|
610 | 635 | nTotal += abs(normalizedCounter[s]); |
|
611 | 636 | if (nTotal != (1U<<tableLog)) |
|
612 | 637 | printf("Warning !!! Total == %u != %u !!!", nTotal, 1U<<tableLog); |
|
613 | 638 | getchar(); |
|
614 | 639 | } |
|
615 | 640 | #endif |
|
616 | 641 | |
|
617 | 642 | return tableLog; |
|
618 | 643 | } |
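
FSE_normalizeCount() sits between counting and table construction: it returns the tableLog actually used, 0 for the single-symbol (rle) special case, or an error code. A sketch of the usual call pattern (the function name is invented):

#define FSE_STATIC_LINKING_ONLY
#include "fse.h"

static size_t normalize_block(short norm[FSE_MAX_SYMBOL_VALUE+1],
                              const unsigned* count, size_t srcSize,
                              unsigned maxSymbolValue)
{
    unsigned const tableLog = FSE_optimalTableLog(0, srcSize, maxSymbolValue);
    size_t const r = FSE_normalizeCount(norm, tableLog, count, srcSize, maxSymbolValue);
    /* on success r == tableLog and the |norm[s]| sum to exactly 1<<tableLog ;
     * r == 0 means one symbol held the entire input (rle) */
    return r;
}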
|
619 | 644 | |
|
620 | 645 | |
|
621 | 646 | /* fake FSE_CTable, for raw (uncompressed) input */ |
|
622 | 647 | size_t FSE_buildCTable_raw (FSE_CTable* ct, unsigned nbBits) |
|
623 | 648 | { |
|
624 | 649 | const unsigned tableSize = 1 << nbBits; |
|
625 | 650 | const unsigned tableMask = tableSize - 1; |
|
626 | 651 | const unsigned maxSymbolValue = tableMask; |
|
627 | 652 | void* const ptr = ct; |
|
628 | 653 | U16* const tableU16 = ( (U16*) ptr) + 2; |
|
629 | 654 | void* const FSCT = ((U32*)ptr) + 1 /* header */ + (tableSize>>1); /* assumption : tableLog >= 1 */ |
|
630 | 655 | FSE_symbolCompressionTransform* const symbolTT = (FSE_symbolCompressionTransform*) (FSCT); |
|
631 | 656 | unsigned s; |
|
632 | 657 | |
|
633 | 658 | /* Sanity checks */ |
|
634 | 659 | if (nbBits < 1) return ERROR(GENERIC); /* min size */ |
|
635 | 660 | |
|
636 | 661 | /* header */ |
|
637 | 662 | tableU16[-2] = (U16) nbBits; |
|
638 | 663 | tableU16[-1] = (U16) maxSymbolValue; |
|
639 | 664 | |
|
640 | 665 | /* Build table */ |
|
641 | 666 | for (s=0; s<tableSize; s++) |
|
642 | 667 | tableU16[s] = (U16)(tableSize + s); |
|
643 | 668 | |
|
644 | 669 | /* Build Symbol Transformation Table */ |
|
645 | 670 | { const U32 deltaNbBits = (nbBits << 16) - (1 << nbBits); |
|
646 | ||
|
647 | 671 | for (s=0; s<=maxSymbolValue; s++) { |
|
648 | 672 | symbolTT[s].deltaNbBits = deltaNbBits; |
|
649 | 673 | symbolTT[s].deltaFindState = s-1; |
|
650 | 674 | } } |
|
651 | 675 | |
|
652 | ||
|
653 | 676 | return 0; |
|
654 | 677 | } |
|
655 | 678 | |
|
656 | /* fake FSE_CTable, for rle (100% always same symbol) input */ | |

679 | /* fake FSE_CTable, for rle input (always same symbol) */ | |
|
657 | 680 | size_t FSE_buildCTable_rle (FSE_CTable* ct, BYTE symbolValue) |
|
658 | 681 | { |
|
659 | 682 | void* ptr = ct; |
|
660 | 683 | U16* tableU16 = ( (U16*) ptr) + 2; |
|
661 | 684 | void* FSCTptr = (U32*)ptr + 2; |
|
662 | 685 | FSE_symbolCompressionTransform* symbolTT = (FSE_symbolCompressionTransform*) FSCTptr; |
|
663 | 686 | |
|
664 | 687 | /* header */ |
|
665 | 688 | tableU16[-2] = (U16) 0; |
|
666 | 689 | tableU16[-1] = (U16) symbolValue; |
|
667 | 690 | |
|
668 | 691 | /* Build table */ |
|
669 | 692 | tableU16[0] = 0; |
|
670 | 693 | tableU16[1] = 0; /* just in case */ |
|
671 | 694 | |
|
672 | 695 | /* Build Symbol Transformation Table */ |
|
673 | 696 | symbolTT[symbolValue].deltaNbBits = 0; |
|
674 | 697 | symbolTT[symbolValue].deltaFindState = 0; |
|
675 | 698 | |
|
676 | 699 | return 0; |
|
677 | 700 | } |
|
678 | 701 | |
|
679 | 702 | |
|
680 | 703 | static size_t FSE_compress_usingCTable_generic (void* dst, size_t dstSize, |
|
681 | 704 | const void* src, size_t srcSize, |
|
682 | 705 | const FSE_CTable* ct, const unsigned fast) |
|
683 | 706 | { |
|
684 | 707 | const BYTE* const istart = (const BYTE*) src; |
|
685 | 708 | const BYTE* const iend = istart + srcSize; |
|
686 | 709 | const BYTE* ip=iend; |
|
687 | 710 | |
|
688 | ||
|
689 | 711 | BIT_CStream_t bitC; |
|
690 | 712 | FSE_CState_t CState1, CState2; |
|
691 | 713 | |
|
692 | 714 | /* init */ |
|
693 | 715 | if (srcSize <= 2) return 0; |
|
694 | { size_t const errorCode = BIT_initCStream(&bitC, dst, dstSize); | |

695 | if (FSE_isError(errorCode)) return 0; } | |
|
716 | { size_t const initError = BIT_initCStream(&bitC, dst, dstSize); | |
|
717 | if (FSE_isError(initError)) return 0; /* not enough space available to write a bitstream */ } | |
|
696 | 718 | |
|
697 | 719 | #define FSE_FLUSHBITS(s) (fast ? BIT_flushBitsFast(s) : BIT_flushBits(s)) |
|
698 | 720 | |
|
699 | 721 | if (srcSize & 1) { |
|
700 | 722 | FSE_initCState2(&CState1, ct, *--ip); |
|
701 | 723 | FSE_initCState2(&CState2, ct, *--ip); |
|
702 | 724 | FSE_encodeSymbol(&bitC, &CState1, *--ip); |
|
703 | 725 | FSE_FLUSHBITS(&bitC); |
|
704 | 726 | } else { |
|
705 | 727 | FSE_initCState2(&CState2, ct, *--ip); |
|
706 | 728 | FSE_initCState2(&CState1, ct, *--ip); |
|
707 | 729 | } |
|
708 | 730 | |
|
709 | 731 | /* join to mod 4 */ |
|
710 | 732 | srcSize -= 2; |
|
711 | 733 | if ((sizeof(bitC.bitContainer)*8 > FSE_MAX_TABLELOG*4+7 ) && (srcSize & 2)) { /* test bit 2 */ |
|
712 | 734 | FSE_encodeSymbol(&bitC, &CState2, *--ip); |
|
713 | 735 | FSE_encodeSymbol(&bitC, &CState1, *--ip); |
|
714 | 736 | FSE_FLUSHBITS(&bitC); |
|
715 | 737 | } |
|
716 | 738 | |
|
717 | 739 | /* 2 or 4 encoding per loop */ |
|
718 | for ( ; ip>istart ; ) { | |

740 | while ( ip>istart ) { | |
|
719 | 741 | |
|
720 | 742 | FSE_encodeSymbol(&bitC, &CState2, *--ip); |
|
721 | 743 | |
|
722 | 744 | if (sizeof(bitC.bitContainer)*8 < FSE_MAX_TABLELOG*2+7 ) /* this test must be static */ |
|
723 | 745 | FSE_FLUSHBITS(&bitC); |
|
724 | 746 | |
|
725 | 747 | FSE_encodeSymbol(&bitC, &CState1, *--ip); |
|
726 | 748 | |
|
727 | 749 | if (sizeof(bitC.bitContainer)*8 > FSE_MAX_TABLELOG*4+7 ) { /* this test must be static */ |
|
728 | 750 | FSE_encodeSymbol(&bitC, &CState2, *--ip); |
|
729 | 751 | FSE_encodeSymbol(&bitC, &CState1, *--ip); |
|
730 | 752 | } |
|
731 | 753 | |
|
732 | 754 | FSE_FLUSHBITS(&bitC); |
|
733 | 755 | } |
|
734 | 756 | |
|
735 | 757 | FSE_flushCState(&bitC, &CState2); |
|
736 | 758 | FSE_flushCState(&bitC, &CState1); |
|
737 | 759 | return BIT_closeCStream(&bitC); |
|
738 | 760 | } |
|
739 | 761 | |
|
740 | 762 | size_t FSE_compress_usingCTable (void* dst, size_t dstSize, |
|
741 | 763 | const void* src, size_t srcSize, |
|
742 | 764 | const FSE_CTable* ct) |
|
743 | 765 | { |
|
744 | const unsigned fast = (dstSize >= FSE_BLOCKBOUND(srcSize)); | |

766 | unsigned const fast = (dstSize >= FSE_BLOCKBOUND(srcSize)); | |
|
745 | 767 | |
|
746 | 768 | if (fast) |
|
747 | 769 | return FSE_compress_usingCTable_generic(dst, dstSize, src, srcSize, ct, 1); |
|
748 | 770 | else |
|
749 | 771 | return FSE_compress_usingCTable_generic(dst, dstSize, src, srcSize, ct, 0); |
|
750 | 772 | } |
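
The `fast` parameter is a pure speed/safety trade: once the destination is large enough that the bitstream provably fits, every flush may skip its end-of-buffer test. Hedged numbers, assuming fse.h's usual definition of the bound:

/* Assuming #define FSE_BLOCKBOUND(size) ((size) + ((size)>>7)) :
 * srcSize == 4096  =>  4096 + 32 = 4128 bytes of headroom required.
 * With dstSize >= that bound, the fast==1 instantiation is chosen and
 * BIT_flushBitsFast() omits the bounds check BIT_flushBits() performs. */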
|
751 | 773 | |
|
752 | 774 | |
|
753 | 775 | size_t FSE_compressBound(size_t size) { return FSE_COMPRESSBOUND(size); } |
|
754 | 776 | |
|
755 | size_t FSE_compress2 (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog) | |
|
777 | #define CHECK_V_F(e, f) size_t const e = f; if (ERR_isError(e)) return f | |
|
778 | #define CHECK_F(f) { CHECK_V_F(_var_err__, f); } | |
|
779 | ||
|
780 | /* FSE_compress_wksp() : | |
|
781 | * Same as FSE_compress2(), but using an externally allocated scratch buffer (`workSpace`). | |
|
782 | * `wkspSize` size must be `(1<<tableLog)`. | |
|
783 | */ | |
|
784 | size_t FSE_compress_wksp (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog, void* workSpace, size_t wkspSize) | |
|
756 | 785 | { |
|
757 | const BYTE* const istart = (const BYTE*) src; | |
|
758 | const BYTE* ip = istart; | |
|
759 | ||
|
760 | 786 | BYTE* const ostart = (BYTE*) dst; |
|
761 | 787 | BYTE* op = ostart; |
|
762 | 788 | BYTE* const oend = ostart + dstSize; |
|
763 | 789 | |
|
764 | 790 | U32 count[FSE_MAX_SYMBOL_VALUE+1]; |
|
765 | 791 | S16 norm[FSE_MAX_SYMBOL_VALUE+1]; |
|
766 | CTable_max_t ct; | |
|
767 | size_t errorCode; | |
|
792 | FSE_CTable* CTable = (FSE_CTable*)workSpace; | |
|
793 | size_t const CTableSize = FSE_CTABLE_SIZE_U32(tableLog, maxSymbolValue); | |
|
794 | void* scratchBuffer = (void*)(CTable + CTableSize); | |
|
795 | size_t const scratchBufferSize = wkspSize - (CTableSize * sizeof(FSE_CTable)); | |
|
768 | 796 | |
|
769 | 797 | /* init conditions */ |
|
770 | if (srcSize <= 1) return 0; /* Uncompressible */ | |
|
798 | if (wkspSize < FSE_WKSP_SIZE_U32(tableLog, maxSymbolValue)) return ERROR(tableLog_tooLarge); | |
|
799 | if (srcSize <= 1) return 0; /* Not compressible */ | |
|
771 | 800 | if (!maxSymbolValue) maxSymbolValue = FSE_MAX_SYMBOL_VALUE; |
|
772 | 801 | if (!tableLog) tableLog = FSE_DEFAULT_TABLELOG; |
|
773 | 802 | |
|
774 | 803 | /* Scan input and build symbol stats */ |
|
775 | errorCode = FSE_count (count, &maxSymbolValue, ip, srcSize); | |

776 | if (FSE_isError(errorCode)) return errorCode; | |
|
777 | if (errorCode == srcSize) return 1; | |
|
778 | if (errorCode == 1) return 0; /* each symbol only present once */ | |
|
779 | if (errorCode < (srcSize >> 7)) return 0; /* Heuristic : not compressible enough */ | |
|
804 | { CHECK_V_F(maxCount, FSE_count(count, &maxSymbolValue, src, srcSize) ); | |
|
805 | if (maxCount == srcSize) return 1; /* only a single symbol in src : rle */ | |
|
806 | if (maxCount == 1) return 0; /* each symbol present maximum once => not compressible */ | |
|
807 | if (maxCount < (srcSize >> 7)) return 0; /* Heuristic : not compressible enough */ | |
|
808 | } | |
|
780 | 809 | |
|
781 | 810 | tableLog = FSE_optimalTableLog(tableLog, srcSize, maxSymbolValue); |
|
782 | errorCode = FSE_normalizeCount (norm, tableLog, count, srcSize, maxSymbolValue); | |

783 | if (FSE_isError(errorCode)) return errorCode; | |
|
811 | CHECK_F( FSE_normalizeCount(norm, tableLog, count, srcSize, maxSymbolValue) ); | |
|
784 | 812 | |
|
785 | 813 | /* Write table description header */ |
|
786 | errorCode = FSE_writeNCount (op, oend-op, norm, maxSymbolValue, tableLog); | |

787 | if (FSE_isError(errorCode)) return errorCode; | |
|
788 | op += errorCode; | |
|
814 | { CHECK_V_F(nc_err, FSE_writeNCount(op, oend-op, norm, maxSymbolValue, tableLog) ); | |
|
815 | op += nc_err; | |
|
816 | } | |
|
789 | 817 | |
|
790 | 818 | /* Compress */ |
|
791 | errorCode = FSE_buildCTable (ct, norm, maxSymbolValue, tableLog); | |

792 | if (FSE_isError(errorCode)) return errorCode; | |
|
793 | errorCode = FSE_compress_usingCTable(op, oend - op, ip, srcSize, ct); | |
|
794 | if (errorCode == 0) return 0; /* not enough space for compressed data */ | |
|
795 | op += errorCode; | |
|
819 | CHECK_F( FSE_buildCTable_wksp(CTable, norm, maxSymbolValue, tableLog, scratchBuffer, scratchBufferSize) ); | |
|
820 | { CHECK_V_F(cSize, FSE_compress_usingCTable(op, oend - op, src, srcSize, CTable) ); | |
|
821 | if (cSize == 0) return 0; /* not enough space for compressed data */ | |
|
822 | op += cSize; | |
|
823 | } | |
|
796 | 824 | |
|
797 | 825 | /* check compressibility */ |
|
798 | if ( (size_t)(op-ostart) >= srcSize-1 ) | |
|
799 | return 0; | |
|
826 | if ( (size_t)(op-ostart) >= srcSize-1 ) return 0; | |
|
800 | 827 | |
|
801 | 828 | return op-ostart; |
|
802 | 829 | } |
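
FSE_compress2() below shows the canonical workspace sizing; the same entry point can also be driven directly. A sketch with an invented wrapper name (the array is on the order of 10 KB, so tight-stack callers may prefer a static or heap allocation):

#define FSE_STATIC_LINKING_ONLY
#include "fse.h"

static size_t compress_with_wksp(void* dst, size_t dstCapacity,
                                 const void* src, size_t srcSize)
{
    unsigned wksp[FSE_WKSP_SIZE_U32(FSE_MAX_TABLELOG, FSE_MAX_SYMBOL_VALUE)];
    return FSE_compress_wksp(dst, dstCapacity, src, srcSize,
                             FSE_MAX_SYMBOL_VALUE, FSE_DEFAULT_TABLELOG,
                             wksp, sizeof(wksp));
}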
|
803 | 830 | |
|
804 | size_t FSE_compress (void* dst, size_t dstSize, const void* src, size_t srcSize) | |
|
831 | typedef struct { | |
|
832 | FSE_CTable CTable_max[FSE_CTABLE_SIZE_U32(FSE_MAX_TABLELOG, FSE_MAX_SYMBOL_VALUE)]; | |
|
833 | BYTE scratchBuffer[1 << FSE_MAX_TABLELOG]; | |
|
834 | } fseWkspMax_t; | |
|
835 | ||
|
836 | size_t FSE_compress2 (void* dst, size_t dstCapacity, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog) | |
|
805 | 837 | { |
|
806 | return FSE_compress2(dst, dstSize, src, (U32)srcSize, FSE_MAX_SYMBOL_VALUE, FSE_DEFAULT_TABLELOG); | |
|
838 | fseWkspMax_t scratchBuffer; | |
|
839 | FSE_STATIC_ASSERT(sizeof(scratchBuffer) >= FSE_WKSP_SIZE_U32(FSE_MAX_TABLELOG, FSE_MAX_SYMBOL_VALUE)); /* compilation failures here means scratchBuffer is not large enough */ | |
|
840 | if (tableLog > FSE_MAX_TABLELOG) return ERROR(tableLog_tooLarge); | |
|
841 | return FSE_compress_wksp(dst, dstCapacity, src, srcSize, maxSymbolValue, tableLog, &scratchBuffer, sizeof(scratchBuffer)); | |
|
842 | } | |
|
843 | ||
|
844 | size_t FSE_compress (void* dst, size_t dstCapacity, const void* src, size_t srcSize) | |
|
845 | { | |
|
846 | return FSE_compress2(dst, dstCapacity, src, srcSize, FSE_MAX_SYMBOL_VALUE, FSE_DEFAULT_TABLELOG); | |
|
807 | 847 | } |
|
808 | 848 | |
|
809 | 849 | |
|
810 | 850 | #endif /* FSE_COMMONDEFS_ONLY */ |
@@ -1,533 +1,609 b'' | |||
|
1 | 1 | /* ****************************************************************** |
|
2 | 2 | Huffman encoder, part of New Generation Entropy library |
|
3 | 3 | Copyright (C) 2013-2016, Yann Collet. |
|
4 | 4 | |
|
5 | 5 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
|
6 | 6 | |
|
7 | 7 | Redistribution and use in source and binary forms, with or without |
|
8 | 8 | modification, are permitted provided that the following conditions are |
|
9 | 9 | met: |
|
10 | 10 | |
|
11 | 11 | * Redistributions of source code must retain the above copyright |
|
12 | 12 | notice, this list of conditions and the following disclaimer. |
|
13 | 13 | * Redistributions in binary form must reproduce the above |
|
14 | 14 | copyright notice, this list of conditions and the following disclaimer |
|
15 | 15 | in the documentation and/or other materials provided with the |
|
16 | 16 | distribution. |
|
17 | 17 | |
|
18 | 18 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
|
19 | 19 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
|
20 | 20 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
|
21 | 21 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
|
22 | 22 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
|
23 | 23 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
|
24 | 24 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
|
25 | 25 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
|
26 | 26 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
|
27 | 27 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
|
28 | 28 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
|
29 | 29 | |
|
30 | 30 | You can contact the author at : |
|
31 | 31 | - FSE+HUF source repository : https://github.com/Cyan4973/FiniteStateEntropy |
|
32 | 32 | - Public forum : https://groups.google.com/forum/#!forum/lz4c |
|
33 | 33 | ****************************************************************** */ |
|
34 | 34 | |
|
35 | 35 | /* ************************************************************** |
|
36 | 36 | * Compiler specifics |
|
37 | 37 | ****************************************************************/ |
|
38 | 38 | #ifdef _MSC_VER /* Visual Studio */ |
|
39 | 39 | # pragma warning(disable : 4127) /* disable: C4127: conditional expression is constant */ |
|
40 | 40 | #endif |
|
41 | 41 | |
|
42 | 42 | |
|
43 | 43 | /* ************************************************************** |
|
44 | 44 | * Includes |
|
45 | 45 | ****************************************************************/ |
|
46 | 46 | #include <string.h> /* memcpy, memset */ |
|
47 | 47 | #include <stdio.h> /* printf (debug) */ |
|
48 | 48 | #include "bitstream.h" |
|
49 | 49 | #define FSE_STATIC_LINKING_ONLY /* FSE_optimalTableLog_internal */ |
|
50 | 50 | #include "fse.h" /* header compression */ |
|
51 | 51 | #define HUF_STATIC_LINKING_ONLY |
|
52 | 52 | #include "huf.h" |
|
53 | 53 | |
|
54 | 54 | |
|
55 | 55 | /* ************************************************************** |
|
56 | 56 | * Error Management |
|
57 | 57 | ****************************************************************/ |
|
58 | 58 | #define HUF_STATIC_ASSERT(c) { enum { HUF_static_assert = 1/(int)(!!(c)) }; } /* use only *after* variable declarations */ |
|
59 | #define CHECK_V_F(e, f) size_t const e = f; if (ERR_isError(e)) return f | |
|
60 | #define CHECK_F(f) { CHECK_V_F(_var_err__, f); } | |
|
59 | 61 | |
|
60 | 62 | |
|
61 | 63 | /* ************************************************************** |
|
62 | 64 | * Utils |
|
63 | 65 | ****************************************************************/ |
|
64 | 66 | unsigned HUF_optimalTableLog(unsigned maxTableLog, size_t srcSize, unsigned maxSymbolValue) |
|
65 | 67 | { |
|
66 | 68 | return FSE_optimalTableLog_internal(maxTableLog, srcSize, maxSymbolValue, 1); |
|
67 | 69 | } |
|
68 | 70 | |
|
69 | 71 | |
|
70 | 72 | /* ******************************************************* |
|
71 | 73 | * HUF : Huffman block compression |
|
72 | 74 | *********************************************************/ |
|
75 | /* HUF_compressWeights() : | |
|
76 | * Same as FSE_compress(), but dedicated to huff0's weights compression. | |
|
77 | * The use case needs much less stack memory. | |
|
78 | * Note : all elements within weightTable are supposed to be <= HUF_TABLELOG_MAX. | |
|
79 | */ | |
|
80 | #define MAX_FSE_TABLELOG_FOR_HUFF_HEADER 6 | |
|
81 | size_t HUF_compressWeights (void* dst, size_t dstSize, const void* weightTable, size_t wtSize) | |
|
82 | { | |
|
83 | BYTE* const ostart = (BYTE*) dst; | |
|
84 | BYTE* op = ostart; | |
|
85 | BYTE* const oend = ostart + dstSize; | |
|
86 | ||
|
87 | U32 maxSymbolValue = HUF_TABLELOG_MAX; | |
|
88 | U32 tableLog = MAX_FSE_TABLELOG_FOR_HUFF_HEADER; | |
|
89 | ||
|
90 | FSE_CTable CTable[FSE_CTABLE_SIZE_U32(MAX_FSE_TABLELOG_FOR_HUFF_HEADER, HUF_TABLELOG_MAX)]; | |
|
91 | BYTE scratchBuffer[1<<MAX_FSE_TABLELOG_FOR_HUFF_HEADER]; | |
|
92 | ||
|
93 | U32 count[HUF_TABLELOG_MAX+1]; | |
|
94 | S16 norm[HUF_TABLELOG_MAX+1]; | |
|
95 | ||
|
96 | /* init conditions */ | |
|
97 | if (wtSize <= 1) return 0; /* Not compressible */ | |
|
98 | ||
|
99 | /* Scan input and build symbol stats */ | |
|
100 | { CHECK_V_F(maxCount, FSE_count_simple(count, &maxSymbolValue, weightTable, wtSize) ); | |
|
101 | if (maxCount == wtSize) return 1; /* only a single symbol in src : rle */ | |
|
102 | if (maxCount == 1) return 0; /* each symbol present maximum once => not compressible */ | |
|
103 | } | |
|
104 | ||
|
105 | tableLog = FSE_optimalTableLog(tableLog, wtSize, maxSymbolValue); | |
|
106 | CHECK_F( FSE_normalizeCount(norm, tableLog, count, wtSize, maxSymbolValue) ); | |
|
107 | ||
|
108 | /* Write table description header */ | |
|
109 | { CHECK_V_F(hSize, FSE_writeNCount(op, oend-op, norm, maxSymbolValue, tableLog) ); | |
|
110 | op += hSize; | |
|
111 | } | |
|
112 | ||
|
113 | /* Compress */ | |
|
114 | CHECK_F( FSE_buildCTable_wksp(CTable, norm, maxSymbolValue, tableLog, scratchBuffer, sizeof(scratchBuffer)) ); | |
|
115 | { CHECK_V_F(cSize, FSE_compress_usingCTable(op, oend - op, weightTable, wtSize, CTable) ); | |
|
116 | if (cSize == 0) return 0; /* not enough space for compressed data */ | |
|
117 | op += cSize; | |
|
118 | } | |
|
119 | ||
|
120 | return op-ostart; | |
|
121 | } | |
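
The cap at MAX_FSE_TABLELOG_FOR_HUFF_HEADER exists because weights never exceed HUF_TABLELOG_MAX, so a tiny FSE table suffices; that is where the stack saving comes from. Hedged arithmetic on fse.h's usual CTable-size macro:

/* Assuming FSE_CTABLE_SIZE_U32(tl, msv) == 1 + (1<<((tl)-1)) + (((msv)+1)*2) :
 * with tl == 6 and msv == HUF_TABLELOG_MAX == 12,
 *   CTable        : (1 + 32 + 26) = 59 U32  ->  236 bytes
 *   scratchBuffer : 1 << 6        =  64 bytes
 * versus roughly 14 KB for the general FSE_compress2() stack frame. */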
|
122 | ||
|
123 | ||
|
73 | 124 | struct HUF_CElt_s { |
|
74 | 125 | U16 val; |
|
75 | 126 | BYTE nbBits; |
|
76 | 127 | }; /* typedef'd to HUF_CElt within "huf.h" */ |
|
77 | 128 | |
|
78 | typedef struct nodeElt_s { | |
|
79 | U32 count; | |
|
80 | U16 parent; | |
|
81 | BYTE byte; | |
|
82 | BYTE nbBits; | |
|
83 | } nodeElt; | |
|
84 | ||
|
85 | 129 | /*! HUF_writeCTable() : |
|
86 | 130 | `CTable` : huffman tree to save, using huf representation. |
|
87 | 131 | @return : size of saved CTable */ |
|
88 | 132 | size_t HUF_writeCTable (void* dst, size_t maxDstSize, |
|
89 | 133 | const HUF_CElt* CTable, U32 maxSymbolValue, U32 huffLog) |
|
90 | 134 | { |
|
91 | BYTE bitsToWeight[HUF_TABLELOG_MAX + 1]; | |
|
135 | BYTE bitsToWeight[HUF_TABLELOG_MAX + 1]; /* precomputed conversion table */ | |
|
92 | 136 | BYTE huffWeight[HUF_SYMBOLVALUE_MAX]; |
|
93 | 137 | BYTE* op = (BYTE*)dst; |
|
94 | 138 | U32 n; |
|
95 | 139 | |
|
96 | 140 | /* check conditions */ |
|
97 | if (maxSymbolValue > HUF_SYMBOLVALUE_MAX) return ERROR(GENERIC); | |

141 | if (maxSymbolValue > HUF_SYMBOLVALUE_MAX) return ERROR(maxSymbolValue_tooLarge); | |
|
98 | 142 | |
|
99 | 143 | /* convert to weight */ |
|
100 | 144 | bitsToWeight[0] = 0; |
|
101 | 145 | for (n=1; n<huffLog+1; n++) |
|
102 | 146 | bitsToWeight[n] = (BYTE)(huffLog + 1 - n); |
|
103 | 147 | for (n=0; n<maxSymbolValue; n++) |
|
104 | 148 | huffWeight[n] = bitsToWeight[CTable[n].nbBits]; |
|
105 | 149 | |
|
106 | { size_t const size = FSE_compress(op+1, maxDstSize-1, huffWeight, maxSymbolValue); | |
|
107 | if (FSE_isError(size)) return size; | |
|
108 | if ((size>1) & (size < maxSymbolValue/2)) { /* FSE compressed */ | |

109 | op[0] = (BYTE)size; | |

110 | return size+1; | |

111 | } | |
|
112 | } | |
|
150 | /* attempt weights compression by FSE */ | |
|
151 | { CHECK_V_F(hSize, HUF_compressWeights(op+1, maxDstSize-1, huffWeight, maxSymbolValue) ); | |
|
152 | if ((hSize>1) & (hSize < maxSymbolValue/2)) { /* FSE compressed */ | |
|
153 | op[0] = (BYTE)hSize; | |
|
154 | return hSize+1; | |
|
155 | } } | |
|
113 | 156 | |
|
114 | /* raw values */ | |
|
115 | if (maxSymbolValue > (256-128)) return ERROR(GENERIC); /* should not happen */ | |
|
157 | /* write raw values as 4-bits (max : 15) */ | |
|
158 | if (maxSymbolValue > (256-128)) return ERROR(GENERIC); /* should not happen : likely means source cannot be compressed */ | |
|
116 | 159 | if (((maxSymbolValue+1)/2) + 1 > maxDstSize) return ERROR(dstSize_tooSmall); /* not enough space within dst buffer */ |
|
117 | 160 | op[0] = (BYTE)(128 /*special case*/ + (maxSymbolValue-1)); |
|
118 | huffWeight[maxSymbolValue] = 0; /* to be sure it doesn't cause issue in final combination */ | |
|
161 | huffWeight[maxSymbolValue] = 0; /* to be sure it doesn't cause msan issue in final combination */ | |
|
119 | 162 | for (n=0; n<maxSymbolValue; n+=2) |
|
120 | 163 | op[(n/2)+1] = (BYTE)((huffWeight[n] << 4) + huffWeight[n+1]); |
|
121 | 164 | return ((maxSymbolValue+1)/2) + 1; |
|
122 | ||
|
123 | 165 | } |
|
124 | 166 | |
|
125 | 167 | |
|
126 | 168 | size_t HUF_readCTable (HUF_CElt* CTable, U32 maxSymbolValue, const void* src, size_t srcSize) |
|
127 | 169 | { |
|
128 | BYTE huffWeight[HUF_SYMBOLVALUE_MAX + 1]; | |
|
170 | BYTE huffWeight[HUF_SYMBOLVALUE_MAX + 1]; /* init not required, even though some static analyzer may complain */ | |
|
129 | 171 | U32 rankVal[HUF_TABLELOG_ABSOLUTEMAX + 1]; /* large enough for values from 0 to 16 */ |
|
130 | 172 | U32 tableLog = 0; |
|
131 | size_t readSize; | |
|
132 | 173 | U32 nbSymbols = 0; |
|
133 | /*memset(huffWeight, 0, sizeof(huffWeight));*/ /* is not necessary, even though some analyzer complain ... */ | |
|
134 | 174 | |
|
135 | 175 | /* get symbol weights */ |
|
136 | readSize = HUF_readStats(huffWeight, HUF_SYMBOLVALUE_MAX+1, rankVal, &nbSymbols, &tableLog, src, srcSize); | |

137 | if (HUF_isError(readSize)) return readSize; | |
|
176 | CHECK_V_F(readSize, HUF_readStats(huffWeight, HUF_SYMBOLVALUE_MAX+1, rankVal, &nbSymbols, &tableLog, src, srcSize)); | |
|
138 | 177 | |
|
139 | 178 | /* check result */ |
|
140 | 179 | if (tableLog > HUF_TABLELOG_MAX) return ERROR(tableLog_tooLarge); |
|
141 | 180 | if (nbSymbols > maxSymbolValue+1) return ERROR(maxSymbolValue_tooSmall); |
|
142 | 181 | |
|
143 | 182 | /* Prepare base value per rank */ |
|
144 | 183 | { U32 n, nextRankStart = 0; |
|
145 | 184 | for (n=1; n<=tableLog; n++) { |
|
146 | 185 | U32 current = nextRankStart; |
|
147 | 186 | nextRankStart += (rankVal[n] << (n-1)); |
|
148 | 187 | rankVal[n] = current; |
|
149 | 188 | } } |
|
150 | 189 | |
|
151 | 190 | /* fill nbBits */ |
|
152 | 191 | { U32 n; for (n=0; n<nbSymbols; n++) { |
|
153 | 192 | const U32 w = huffWeight[n]; |
|
154 | 193 | CTable[n].nbBits = (BYTE)(tableLog + 1 - w); |
|
155 | 194 | } } |
|
156 | 195 | |
|
157 | 196 | /* fill val */ |
|
158 | 197 | { U16 nbPerRank[HUF_TABLELOG_MAX+2] = {0}; /* support w=0=>n=tableLog+1 */ |
|
159 | 198 | U16 valPerRank[HUF_TABLELOG_MAX+2] = {0}; |
|
160 | 199 | { U32 n; for (n=0; n<nbSymbols; n++) nbPerRank[CTable[n].nbBits]++; } |
|
161 | 200 | /* determine stating value per rank */ |
|
162 | 201 | valPerRank[tableLog+1] = 0; /* for w==0 */ |
|
163 | 202 | { U16 min = 0; |
|
164 | 203 | U32 n; for (n=tableLog; n>0; n--) { /* start at n=tablelog <-> w=1 */ |
|
165 | 204 | valPerRank[n] = min; /* get starting value within each rank */ |
|
166 | 205 | min += nbPerRank[n]; |
|
167 | 206 | min >>= 1; |
|
168 | 207 | } } |
|
169 | 208 | /* assign value within rank, symbol order */ |
|
170 | 209 | { U32 n; for (n=0; n<=maxSymbolValue; n++) CTable[n].val = valPerRank[CTable[n].nbBits]++; } |
|
171 | 210 | } |
|
172 | 211 | |
|
173 | 212 | return readSize; |
|
174 | 213 | } |
|
175 | 214 | |
|
176 | 215 | |
|
216 | typedef struct nodeElt_s { | |
|
217 | U32 count; | |
|
218 | U16 parent; | |
|
219 | BYTE byte; | |
|
220 | BYTE nbBits; | |
|
221 | } nodeElt; | |
|
222 | ||
|
177 | 223 | static U32 HUF_setMaxHeight(nodeElt* huffNode, U32 lastNonNull, U32 maxNbBits) |
|
178 | 224 | { |
|
179 | 225 | const U32 largestBits = huffNode[lastNonNull].nbBits; |
|
180 | 226 | if (largestBits <= maxNbBits) return largestBits; /* early exit : no elt > maxNbBits */ |
|
181 | 227 | |
|
182 | 228 | /* there are several too large elements (at least >= 2) */ |
|
183 | 229 | { int totalCost = 0; |
|
184 | 230 | const U32 baseCost = 1 << (largestBits - maxNbBits); |
|
185 | 231 | U32 n = lastNonNull; |
|
186 | 232 | |
|
187 | 233 | while (huffNode[n].nbBits > maxNbBits) { |
|
188 | 234 | totalCost += baseCost - (1 << (largestBits - huffNode[n].nbBits)); |
|
189 | 235 | huffNode[n].nbBits = (BYTE)maxNbBits; |
|
190 | 236 | n --; |
|
191 | 237 | } /* n stops at huffNode[n].nbBits <= maxNbBits */ |
|
192 | 238 | while (huffNode[n].nbBits == maxNbBits) n--; /* n end at index of smallest symbol using < maxNbBits */ |
|
193 | 239 | |
|
194 | 240 | /* renorm totalCost */ |
|
195 | 241 | totalCost >>= (largestBits - maxNbBits); /* note : totalCost is necessarily a multiple of baseCost */ |
|
196 | 242 | |
|
197 | 243 | /* repay normalized cost */ |
|
198 | 244 | { U32 const noSymbol = 0xF0F0F0F0; |
|
199 | 245 | U32 rankLast[HUF_TABLELOG_MAX+2]; |
|
200 | 246 | int pos; |
|
201 | 247 | |
|
202 | 248 | /* Get pos of last (smallest) symbol per rank */ |
|
203 | 249 | memset(rankLast, 0xF0, sizeof(rankLast)); |
|
204 | 250 | { U32 currentNbBits = maxNbBits; |
|
205 | 251 | for (pos=n ; pos >= 0; pos--) { |
|
206 | 252 | if (huffNode[pos].nbBits >= currentNbBits) continue; |
|
207 | 253 | currentNbBits = huffNode[pos].nbBits; /* < maxNbBits */ |
|
208 | 254 | rankLast[maxNbBits-currentNbBits] = pos; |
|
209 | 255 | } } |
|
210 | 256 | |
|
211 | 257 | while (totalCost > 0) { |
|
212 | 258 | U32 nBitsToDecrease = BIT_highbit32(totalCost) + 1; |
|
213 | 259 | for ( ; nBitsToDecrease > 1; nBitsToDecrease--) { |
|
214 | 260 | U32 highPos = rankLast[nBitsToDecrease]; |
|
215 | 261 | U32 lowPos = rankLast[nBitsToDecrease-1]; |
|
216 | 262 | if (highPos == noSymbol) continue; |
|
217 | 263 | if (lowPos == noSymbol) break; |
|
218 | 264 | { U32 const highTotal = huffNode[highPos].count; |
|
219 | 265 | U32 const lowTotal = 2 * huffNode[lowPos].count; |
|
220 | 266 | if (highTotal <= lowTotal) break; |
|
221 | 267 | } } |
|
222 | 268 | /* only triggered when no more rank 1 symbol left => find closest one (note : there is necessarily at least one !) */ |
|
223 | 269 | while ((nBitsToDecrease<=HUF_TABLELOG_MAX) && (rankLast[nBitsToDecrease] == noSymbol)) /* HUF_MAX_TABLELOG test just to please gcc 5+; but it should not be necessary */ |
|
224 | 270 | nBitsToDecrease ++; |
|
225 | 271 | totalCost -= 1 << (nBitsToDecrease-1); |
|
226 | 272 | if (rankLast[nBitsToDecrease-1] == noSymbol) |
|
227 | 273 | rankLast[nBitsToDecrease-1] = rankLast[nBitsToDecrease]; /* this rank is no longer empty */ |
|
228 | 274 | huffNode[rankLast[nBitsToDecrease]].nbBits ++; |
|
229 | 275 | if (rankLast[nBitsToDecrease] == 0) /* special case, reached largest symbol */ |
|
230 | 276 | rankLast[nBitsToDecrease] = noSymbol; |
|
231 | 277 | else { |
|
232 | 278 | rankLast[nBitsToDecrease]--; |
|
233 | 279 | if (huffNode[rankLast[nBitsToDecrease]].nbBits != maxNbBits-nBitsToDecrease) |
|
234 | 280 | rankLast[nBitsToDecrease] = noSymbol; /* this rank is now empty */ |
|
235 | 281 | } } /* while (totalCost > 0) */ |
|
236 | 282 | |
|
237 | 283 | while (totalCost < 0) { /* Sometimes, cost correction overshoot */ |
|
238 | 284 | if (rankLast[1] == noSymbol) { /* special case : no rank 1 symbol (using maxNbBits-1); let's create one from largest rank 0 (using maxNbBits) */ |
|
239 | 285 | while (huffNode[n].nbBits == maxNbBits) n--; |
|
240 | 286 | huffNode[n+1].nbBits--; |
|
241 | 287 | rankLast[1] = n+1; |
|
242 | 288 | totalCost++; |
|
243 | 289 | continue; |
|
244 | 290 | } |
|
245 | 291 | huffNode[ rankLast[1] + 1 ].nbBits--; |
|
246 | 292 | rankLast[1]++; |
|
247 | 293 | totalCost ++; |
|
248 | 294 | } } } /* there are several too large elements (at least >= 2) */ |
|
249 | 295 | |
|
250 | 296 | return maxNbBits; |
|
251 | 297 | } |
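
One reading of the cost accounting, in units of 2^-maxNbBits leaf slots (an interpretation of the code above, not text from it): a symbol at depth d occupies 2^(maxNbBits-d) slots, so truncating it from d > maxNbBits to maxNbBits over-fills the tree; after the renormalizing shift, totalCost is exactly that overflow, and demoting a symbol from depth maxNbBits-n to maxNbBits-n+1 releases 2^(n-1) slots, matching `totalCost -= 1 << (nBitsToDecrease-1)`. A worked instance:

/* largestBits == 13, maxNbBits == 11, symbols at depths {13, 13, 12} :
 *   baseCost = 1 << (13-11) = 4
 *   pre-renorm totalCost = (4-1) + (4-1) + (4-2) = 8
 *   renormalized : 8 >> 2 = 2 slots to repay,
 * settled by demoting one rank-2 symbol (n == 2 frees 2 slots)
 * or two rank-1 symbols (n == 1 frees 1 slot each). */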
|
252 | 298 | |
|
253 | 299 | |
|
254 | 300 | typedef struct { |
|
255 | 301 | U32 base; |
|
256 | 302 | U32 current; |
|
257 | 303 | } rankPos; |
|
258 | 304 | |
|
259 | 305 | static void HUF_sort(nodeElt* huffNode, const U32* count, U32 maxSymbolValue) |
|
260 | 306 | { |
|
261 | 307 | rankPos rank[32]; |
|
262 | 308 | U32 n; |
|
263 | 309 | |
|
264 | 310 | memset(rank, 0, sizeof(rank)); |
|
265 | 311 | for (n=0; n<=maxSymbolValue; n++) { |
|
266 | 312 | U32 r = BIT_highbit32(count[n] + 1); |
|
267 | 313 | rank[r].base ++; |
|
268 | 314 | } |
|
269 | 315 | for (n=30; n>0; n--) rank[n-1].base += rank[n].base; |
|
270 | 316 | for (n=0; n<32; n++) rank[n].current = rank[n].base; |
|
271 | 317 | for (n=0; n<=maxSymbolValue; n++) { |
|
272 | 318 | U32 const c = count[n]; |
|
273 | 319 | U32 const r = BIT_highbit32(c+1) + 1; |
|
274 | 320 | U32 pos = rank[r].current++; |
|
275 | 321 | while ((pos > rank[r].base) && (c > huffNode[pos-1].count)) huffNode[pos]=huffNode[pos-1], pos--; |
|
276 | 322 | huffNode[pos].count = c; |
|
277 | 323 | huffNode[pos].byte = (BYTE)n; |
|
278 | 324 | } |
|
279 | 325 | } |
|
280 | 326 | |
|
281 | 327 | |
|
328 | /** HUF_buildCTable_wksp() : | |
|
329 | * Same as HUF_buildCTable(), but using externally allocated scratch buffer. | |
|
330 | * `workSpace` must be aligned on 4-bytes boundaries, and be at least as large as a table of 1024 unsigned. | |
|
331 | */ | |
|
282 | 332 | #define STARTNODE (HUF_SYMBOLVALUE_MAX+1) |
|
283 | size_t HUF_buildCTable (HUF_CElt* tree, const U32* count, U32 maxSymbolValue, U32 maxNbBits) | |
|
333 | typedef nodeElt huffNodeTable[2*HUF_SYMBOLVALUE_MAX+1 +1]; | |
|
334 | size_t HUF_buildCTable_wksp (HUF_CElt* tree, const U32* count, U32 maxSymbolValue, U32 maxNbBits, void* workSpace, size_t wkspSize) | |
|
284 | 335 | { |
|
285 | nodeElt huffNode0[2*HUF_SYMBOLVALUE_MAX+1 +1]; | |
|
286 | nodeElt* huffNode = huffNode0+1; | |

336 | nodeElt* const huffNode0 = (nodeElt*)workSpace; | |
|
337 | nodeElt* const huffNode = huffNode0+1; | |
|
287 | 338 | U32 n, nonNullRank; |
|
288 | 339 | int lowS, lowN; |
|
289 | 340 | U16 nodeNb = STARTNODE; |
|
290 | 341 | U32 nodeRoot; |
|
291 | 342 | |
|
292 | 343 | /* safety checks */ |
|
344 | if (wkspSize < sizeof(huffNodeTable)) return ERROR(GENERIC); /* workSpace is not large enough */ | |
|
293 | 345 | if (maxNbBits == 0) maxNbBits = HUF_TABLELOG_DEFAULT; |
|
294 | 346 | if (maxSymbolValue > HUF_SYMBOLVALUE_MAX) return ERROR(GENERIC); |
|
295 | memset(huffNode0, 0, sizeof(huffNode0)); | |

347 | memset(huffNode0, 0, sizeof(huffNodeTable)); | |
|
296 | 348 | |
|
297 | 349 | /* sort, decreasing order */ |
|
298 | 350 | HUF_sort(huffNode, count, maxSymbolValue); |
|
299 | 351 | |
|
300 | 352 | /* init for parents */ |
|
301 | 353 | nonNullRank = maxSymbolValue; |
|
302 | 354 | while(huffNode[nonNullRank].count == 0) nonNullRank--; |
|
303 | 355 | lowS = nonNullRank; nodeRoot = nodeNb + lowS - 1; lowN = nodeNb; |
|
304 | 356 | huffNode[nodeNb].count = huffNode[lowS].count + huffNode[lowS-1].count; |
|
305 | 357 | huffNode[lowS].parent = huffNode[lowS-1].parent = nodeNb; |
|
306 | 358 | nodeNb++; lowS-=2; |
|
307 | 359 | for (n=nodeNb; n<=nodeRoot; n++) huffNode[n].count = (U32)(1U<<30); |
|
308 | huffNode0[0].count = (U32)(1U<<31); | |
|
360 | huffNode0[0].count = (U32)(1U<<31); /* fake entry, strong barrier */ | |
|
309 | 361 | |
|
310 | 362 | /* create parents */ |
|
311 | 363 | while (nodeNb <= nodeRoot) { |
|
312 | 364 | U32 n1 = (huffNode[lowS].count < huffNode[lowN].count) ? lowS-- : lowN++; |
|
313 | 365 | U32 n2 = (huffNode[lowS].count < huffNode[lowN].count) ? lowS-- : lowN++; |
|
314 | 366 | huffNode[nodeNb].count = huffNode[n1].count + huffNode[n2].count; |
|
315 | 367 | huffNode[n1].parent = huffNode[n2].parent = nodeNb; |
|
316 | 368 | nodeNb++; |
|
317 | 369 | } |
|
318 | 370 | |
|
319 | 371 | /* distribute weights (unlimited tree height) */ |
|
320 | 372 | huffNode[nodeRoot].nbBits = 0; |
|
321 | 373 | for (n=nodeRoot-1; n>=STARTNODE; n--) |
|
322 | 374 | huffNode[n].nbBits = huffNode[ huffNode[n].parent ].nbBits + 1; |
|
323 | 375 | for (n=0; n<=nonNullRank; n++) |
|
324 | 376 | huffNode[n].nbBits = huffNode[ huffNode[n].parent ].nbBits + 1; |
|
325 | 377 | |
|
326 | 378 | /* enforce maxTableLog */ |
|
327 | 379 | maxNbBits = HUF_setMaxHeight(huffNode, nonNullRank, maxNbBits); |
|
328 | 380 | |
|
329 | 381 | /* fill result into tree (val, nbBits) */ |
|
330 | 382 | { U16 nbPerRank[HUF_TABLELOG_MAX+1] = {0}; |
|
331 | 383 | U16 valPerRank[HUF_TABLELOG_MAX+1] = {0}; |
|
332 | 384 | if (maxNbBits > HUF_TABLELOG_MAX) return ERROR(GENERIC); /* check fit into table */ |
|
333 | 385 | for (n=0; n<=nonNullRank; n++) |
|
334 | 386 | nbPerRank[huffNode[n].nbBits]++; |
|
335 | 387 | /* determine stating value per rank */ |
|
336 | 388 | { U16 min = 0; |
|
337 | 389 | for (n=maxNbBits; n>0; n--) { |
|
338 | 390 | valPerRank[n] = min; /* get starting value within each rank */ |
|
339 | 391 | min += nbPerRank[n]; |
|
340 | 392 | min >>= 1; |
|
341 | 393 | } } |
|
342 | 394 | for (n=0; n<=maxSymbolValue; n++) |
|
343 | 395 | tree[huffNode[n].byte].nbBits = huffNode[n].nbBits; /* push nbBits per symbol, symbol order */ |
|
344 | 396 | for (n=0; n<=maxSymbolValue; n++) |
|
345 | 397 | tree[n].val = valPerRank[tree[n].nbBits]++; /* assign value within rank, symbol order */ |
|
346 | 398 | } |
|
347 | 399 | |
|
348 | 400 | return maxNbBits; |
|
349 | 401 | } |
|
350 | 402 | |
|
403 | /** HUF_buildCTable() : | |
|
404 | * Note : count is used before tree is written, so they can safely overlap | |
|
405 | */ | |
|
406 | size_t HUF_buildCTable (HUF_CElt* tree, const U32* count, U32 maxSymbolValue, U32 maxNbBits) | |
|
407 | { | |
|
408 | huffNodeTable nodeTable; | |
|
409 | return HUF_buildCTable_wksp(tree, count, maxSymbolValue, maxNbBits, nodeTable, sizeof(nodeTable)); | |
|
410 | } | |
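
An illustrative caller of the new workspace entry point, honoring the 1024-unsigned contract stated above (the function name is invented; HUF_buildCTable_wksp is assumed visible through HUF_STATIC_LINKING_ONLY):

#define HUF_STATIC_LINKING_ONLY
#include "huf.h"

static size_t build_table(HUF_CElt* tree, const unsigned* count, unsigned maxSymbolValue)
{
    unsigned workSpace[1024];      /* 4-byte aligned, 4096 bytes */
    unsigned const maxNbBits = 0;  /* 0 selects HUF_TABLELOG_DEFAULT */
    /* returns the tree depth actually used, <= HUF_TABLELOG_MAX */
    return HUF_buildCTable_wksp(tree, count, maxSymbolValue, maxNbBits,
                                workSpace, sizeof(workSpace));
}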
|
411 | ||
|
351 | 412 | static void HUF_encodeSymbol(BIT_CStream_t* bitCPtr, U32 symbol, const HUF_CElt* CTable) |
|
352 | 413 | { |
|
353 | 414 | BIT_addBitsFast(bitCPtr, CTable[symbol].val, CTable[symbol].nbBits); |
|
354 | 415 | } |
|
355 | 416 | |
|
356 | 417 | size_t HUF_compressBound(size_t size) { return HUF_COMPRESSBOUND(size); } |
|
357 | 418 | |
|
358 | 419 | #define HUF_FLUSHBITS(s) (fast ? BIT_flushBitsFast(s) : BIT_flushBits(s)) |
|
359 | 420 | |
|
360 | 421 | #define HUF_FLUSHBITS_1(stream) \ |
|
361 | 422 | if (sizeof((stream)->bitContainer)*8 < HUF_TABLELOG_MAX*2+7) HUF_FLUSHBITS(stream) |
|
362 | 423 | |
|
363 | 424 | #define HUF_FLUSHBITS_2(stream) \ |
|
364 | 425 | if (sizeof((stream)->bitContainer)*8 < HUF_TABLELOG_MAX*4+7) HUF_FLUSHBITS(stream) |
|
365 | 426 | |
|
366 | 427 | size_t HUF_compress1X_usingCTable(void* dst, size_t dstSize, const void* src, size_t srcSize, const HUF_CElt* CTable) |
|
367 | 428 | { |
|
368 | 429 | const BYTE* ip = (const BYTE*) src; |
|
369 | 430 | BYTE* const ostart = (BYTE*)dst; |
|
370 | 431 | BYTE* const oend = ostart + dstSize; |
|
371 | 432 | BYTE* op = ostart; |
|
372 | 433 | size_t n; |
|
373 | 434 | const unsigned fast = (dstSize >= HUF_BLOCKBOUND(srcSize)); |
|
374 | 435 | BIT_CStream_t bitC; |
|
375 | 436 | |
|
376 | 437 | /* init */ |
|
377 | 438 | if (dstSize < 8) return 0; /* not enough space to compress */ |
|
378 | { size_t const errorCode = BIT_initCStream(&bitC, op, oend-op); | |

379 | if (HUF_isError(errorCode)) return 0; } | |

439 | { size_t const initErr = BIT_initCStream(&bitC, op, oend-op); | |
|
440 | if (HUF_isError(initErr)) return 0; } | |
|
380 | 441 | |
|
381 | 442 | n = srcSize & ~3; /* join to mod 4 */ |
|
382 | 443 | switch (srcSize & 3) |
|
383 | 444 | { |
|
384 | 445 | case 3 : HUF_encodeSymbol(&bitC, ip[n+ 2], CTable); |
|
385 | 446 | HUF_FLUSHBITS_2(&bitC); |
|
386 | 447 | case 2 : HUF_encodeSymbol(&bitC, ip[n+ 1], CTable); |
|
387 | 448 | HUF_FLUSHBITS_1(&bitC); |
|
388 | 449 | case 1 : HUF_encodeSymbol(&bitC, ip[n+ 0], CTable); |
|
389 | 450 | HUF_FLUSHBITS(&bitC); |
|
390 | 451 | case 0 : |
|
391 | 452 | default: ; |
|
392 | 453 | } |
|
393 | 454 | |
|
394 | 455 | for (; n>0; n-=4) { /* note : n&3==0 at this stage */ |
|
395 | 456 | HUF_encodeSymbol(&bitC, ip[n- 1], CTable); |
|
396 | 457 | HUF_FLUSHBITS_1(&bitC); |
|
397 | 458 | HUF_encodeSymbol(&bitC, ip[n- 2], CTable); |
|
398 | 459 | HUF_FLUSHBITS_2(&bitC); |
|
399 | 460 | HUF_encodeSymbol(&bitC, ip[n- 3], CTable); |
|
400 | 461 | HUF_FLUSHBITS_1(&bitC); |
|
401 | 462 | HUF_encodeSymbol(&bitC, ip[n- 4], CTable); |
|
402 | 463 | HUF_FLUSHBITS(&bitC); |
|
403 | 464 | } |
|
404 | 465 | |
|
405 | 466 | return BIT_closeCStream(&bitC); |
|
406 | 467 | } |
|
407 | 468 | |
|
408 | 469 | |
|
409 | 470 | size_t HUF_compress4X_usingCTable(void* dst, size_t dstSize, const void* src, size_t srcSize, const HUF_CElt* CTable) |
|
410 | 471 | { |
|
411 | 472 | size_t const segmentSize = (srcSize+3)/4; /* first 3 segments */ |
|
412 | 473 | const BYTE* ip = (const BYTE*) src; |
|
413 | 474 | const BYTE* const iend = ip + srcSize; |
|
414 | 475 | BYTE* const ostart = (BYTE*) dst; |
|
415 | 476 | BYTE* const oend = ostart + dstSize; |
|
416 | 477 | BYTE* op = ostart; |
|
417 | 478 | |
|
418 | 479 | if (dstSize < 6 + 1 + 1 + 1 + 8) return 0; /* minimum space to compress successfully */ |
|
419 | 480 | if (srcSize < 12) return 0; /* no saving possible : too small input */ |
|
420 | 481 | op += 6; /* jumpTable */ |
|
421 | 482 | |
|
422 | { size_t const cSize = HUF_compress1X_usingCTable(op, oend-op, ip, segmentSize, CTable); | |

423 | if (HUF_isError(cSize)) return cSize; | |
|
483 | { CHECK_V_F(cSize, HUF_compress1X_usingCTable(op, oend-op, ip, segmentSize, CTable) ); | |
|
424 | 484 | if (cSize==0) return 0; |
|
425 | 485 | MEM_writeLE16(ostart, (U16)cSize); |
|
426 | 486 | op += cSize; |
|
427 | 487 | } |
|
428 | 488 | |
|
429 | 489 | ip += segmentSize; |
|
430 | { size_t const cSize = HUF_compress1X_usingCTable(op, oend-op, ip, segmentSize, CTable); | |

431 | if (HUF_isError(cSize)) return cSize; | |
|
490 | { CHECK_V_F(cSize, HUF_compress1X_usingCTable(op, oend-op, ip, segmentSize, CTable) ); | |
|
432 | 491 | if (cSize==0) return 0; |
|
433 | 492 | MEM_writeLE16(ostart+2, (U16)cSize); |
|
434 | 493 | op += cSize; |
|
435 | 494 | } |
|
436 | 495 | |
|
437 | 496 | ip += segmentSize; |
|
438 | { size_t const cSize = HUF_compress1X_usingCTable(op, oend-op, ip, segmentSize, CTable); | |

439 | if (HUF_isError(cSize)) return cSize; | |
|
497 | { CHECK_V_F(cSize, HUF_compress1X_usingCTable(op, oend-op, ip, segmentSize, CTable) ); | |
|
440 | 498 | if (cSize==0) return 0; |
|
441 | 499 | MEM_writeLE16(ostart+4, (U16)cSize); |
|
442 | 500 | op += cSize; |
|
443 | 501 | } |
|
444 | 502 | |
|
445 | 503 | ip += segmentSize; |
|
446 | { size_t const cSize = HUF_compress1X_usingCTable(op, oend-op, ip, iend-ip, CTable); | |

447 | if (HUF_isError(cSize)) return cSize; | |
|
504 | { CHECK_V_F(cSize, HUF_compress1X_usingCTable(op, oend-op, ip, iend-ip, CTable) ); | |
|
448 | 505 | if (cSize==0) return 0; |
|
449 | 506 | op += cSize; |
|
450 | 507 | } |
|
451 | 508 | |
|
452 | 509 | return op-ostart; |
|
453 | 510 | } |
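
The 6 bytes skipped at function entry form the jump table of the 4-stream layout. A sketch of the frame this function emits:

/*  +---------+---------+---------+----------+----------+----------+----------+
 *  | LE16 s1 | LE16 s2 | LE16 s3 | stream 1 | stream 2 | stream 3 | stream 4 |
 *  +---------+---------+---------+----------+----------+----------+----------+
 * Streams 1-3 each encode segmentSize = (srcSize+3)/4 input bytes, stream 4
 * the remainder ; its compressed size is implied by the block's total. */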
|
454 | 511 | |
|
455 | 512 | |
|
513 | /* `workSpace` must a table of at least 1024 unsigned */ | |
|
456 | 514 | static size_t HUF_compress_internal ( |
|
457 | 515 | void* dst, size_t dstSize, |
|
458 | 516 | const void* src, size_t srcSize, |
|
459 | 517 | unsigned maxSymbolValue, unsigned huffLog, |
|
460 | unsigned singleStream) | |

518 | unsigned singleStream, | |
|
519 | void* workSpace, size_t wkspSize) | |
|
461 | 520 | { |
|
462 | 521 | BYTE* const ostart = (BYTE*)dst; |
|
463 | 522 | BYTE* const oend = ostart + dstSize; |
|
464 | 523 | BYTE* op = ostart; |
|
465 | 524 | |
|
466 | U32 count[HUF_SYMBOLVALUE_MAX+1]; | |
|
467 | HUF_CElt CTable[HUF_SYMBOLVALUE_MAX+1]; | |

525 | union { | |
|
526 | U32 count[HUF_SYMBOLVALUE_MAX+1]; | |
|
527 | HUF_CElt CTable[HUF_SYMBOLVALUE_MAX+1]; | |
|
528 | } table; /* `count` can overlap with `CTable`; saves 1 KB */ | |
|
468 | 529 | |
|
469 | 530 | /* checks & inits */ |
|
531 | if (wkspSize < sizeof(huffNodeTable)) return ERROR(GENERIC); | |
|
470 | 532 | if (!srcSize) return 0; /* Uncompressed (note : 1 means rle, so first byte must be correct) */ |
|
471 | 533 | if (!dstSize) return 0; /* cannot fit within dst budget */ |
|
472 | 534 | if (srcSize > HUF_BLOCKSIZE_MAX) return ERROR(srcSize_wrong); /* current block size limit */ |
|
473 | 535 | if (huffLog > HUF_TABLELOG_MAX) return ERROR(tableLog_tooLarge); |
|
474 | 536 | if (!maxSymbolValue) maxSymbolValue = HUF_SYMBOLVALUE_MAX; |
|
475 | 537 | if (!huffLog) huffLog = HUF_TABLELOG_DEFAULT; |
|
476 | 538 | |
|
477 | 539 | /* Scan input and build symbol stats */ |
|
478 | { size_t const largest = FSE_count (count, &maxSymbolValue, (const BYTE*)src, srcSize); | |

479 | if (HUF_isError(largest)) return largest; | |
|
540 | { CHECK_V_F(largest, FSE_count_wksp (table.count, &maxSymbolValue, (const BYTE*)src, srcSize, (U32*)workSpace) ); | |
|
480 | 541 | if (largest == srcSize) { *ostart = ((const BYTE*)src)[0]; return 1; } /* single symbol, rle */ |
|
481 | 542 | if (largest <= (srcSize >> 7)+1) return 0; /* Fast heuristic : not compressible enough */ |
|
482 | 543 | } |
|
483 | 544 | |
|
484 | 545 | /* Build Huffman Tree */ |
|
485 | 546 | huffLog = HUF_optimalTableLog(huffLog, srcSize, maxSymbolValue); |
|
486 | { size_t const maxBits = HUF_buildCTable (CTable, count, maxSymbolValue, huffLog); | |

487 | if (HUF_isError(maxBits)) return maxBits; | |
|
547 | { CHECK_V_F(maxBits, HUF_buildCTable_wksp (table.CTable, table.count, maxSymbolValue, huffLog, workSpace, wkspSize) ); | |
|
488 | 548 | huffLog = (U32)maxBits; |
|
489 | 549 | } |
|
490 | 550 | |
|
491 | 551 | /* Write table description header */ |
|
492 | { size_t const hSize = HUF_writeCTable (op, dstSize, CTable, maxSymbolValue, huffLog); | |

493 | if (HUF_isError(hSize)) return hSize; | |
|
552 | { CHECK_V_F(hSize, HUF_writeCTable (op, dstSize, table.CTable, maxSymbolValue, huffLog) ); | |
|
494 | 553 | if (hSize + 12 >= srcSize) return 0; /* not useful to try compression */ |
|
495 | 554 | op += hSize; |
|
496 | 555 | } |
|
497 | 556 | |
|
498 | 557 | /* Compress */ |
|
499 | 558 | { size_t const cSize = (singleStream) ? |
|
500 | HUF_compress1X_usingCTable(op, oend - op, src, srcSize, CTable) : /* single segment */ | |
|
501 | HUF_compress4X_usingCTable(op, oend - op, src, srcSize, CTable); | |
|
559 | HUF_compress1X_usingCTable(op, oend - op, src, srcSize, table.CTable) : /* single segment */ | |
|
560 | HUF_compress4X_usingCTable(op, oend - op, src, srcSize, table.CTable); | |
|
502 | 561 | if (HUF_isError(cSize)) return cSize; |
|
503 | 562 | if (cSize==0) return 0; /* uncompressible */ |
|
504 | 563 | op += cSize; |
|
505 | 564 | } |
|
506 | 565 | |
|
507 | 566 | /* check compressibility */ |
|
508 | 567 | if ((size_t)(op-ostart) >= srcSize-1) |
|
509 | 568 | return 0; |
|
510 | 569 | |
|
511 | 570 | return op-ostart; |
|
512 | 571 | } |
|
513 | 572 | |
|
514 | 573 | |
|
574 | size_t HUF_compress1X_wksp (void* dst, size_t dstSize, | |
|
575 | const void* src, size_t srcSize, | |
|
576 | unsigned maxSymbolValue, unsigned huffLog, | |
|
577 | void* workSpace, size_t wkspSize) | |
|
578 | { | |
|
579 | return HUF_compress_internal(dst, dstSize, src, srcSize, maxSymbolValue, huffLog, 1 /* single stream */, workSpace, wkspSize); | |
|
580 | } | |
|
581 | ||
|
515 | 582 | size_t HUF_compress1X (void* dst, size_t dstSize, |
|
516 | 583 | const void* src, size_t srcSize, |
|
517 | 584 | unsigned maxSymbolValue, unsigned huffLog) |
|
518 | 585 | { |
|
519 | return HUF_compress_internal(dst, dstSize, src, srcSize, maxSymbolValue, huffLog, 1); | |
|
586 | unsigned workSpace[1024]; | |
|
587 | return HUF_compress1X_wksp(dst, dstSize, src, srcSize, maxSymbolValue, huffLog, workSpace, sizeof(workSpace)); | |
|
588 | } | |
|
589 | ||
|
590 | size_t HUF_compress4X_wksp (void* dst, size_t dstSize, | |
|
591 | const void* src, size_t srcSize, | |
|
592 | unsigned maxSymbolValue, unsigned huffLog, | |
|
593 | void* workSpace, size_t wkspSize) | |
|
594 | { | |
|
595 | return HUF_compress_internal(dst, dstSize, src, srcSize, maxSymbolValue, huffLog, 0 /* 4 streams */, workSpace, wkspSize); | |
|
520 | 596 | } |
|
521 | 597 | |
|
522 | 598 | size_t HUF_compress2 (void* dst, size_t dstSize, |
|
523 | 599 | const void* src, size_t srcSize, |
|
524 | 600 | unsigned maxSymbolValue, unsigned huffLog) |
|
525 | 601 | { |
|
526 | return HUF_compress_internal(dst, dstSize, src, srcSize, maxSymbolValue, huffLog, 0); | |
|
602 | unsigned workSpace[1024]; | |
|
603 | return HUF_compress4X_wksp(dst, dstSize, src, srcSize, maxSymbolValue, huffLog, workSpace, sizeof(workSpace)); | |
|
527 | 604 | } |
|
528 | 605 | |
|
529 | ||
|
530 | 606 | size_t HUF_compress (void* dst, size_t maxDstSize, const void* src, size_t srcSize) |
|
531 | 607 | { |
|
532 | 608 | return HUF_compress2(dst, maxDstSize, src, (U32)srcSize, 255, HUF_TABLELOG_DEFAULT); |
|
533 | 609 | } |
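
A sketch of driving the new *_wksp entry points directly, so a long-lived context owns the ~4 KB scratch area instead of re-creating it on the stack per call (names invented; declarations assumed reachable through HUF_STATIC_LINKING_ONLY):

#define HUF_STATIC_LINKING_ONLY
#include "huf.h"

static size_t huf_block(void* dst, size_t dstCapacity,
                        const void* src, size_t srcSize,
                        unsigned workSpace[1024])
{
    return HUF_compress4X_wksp(dst, dstCapacity, src, srcSize,
                               255 /* maxSymbolValue */, 0 /* default huffLog */,
                               workSpace, 1024 * sizeof(unsigned));
}
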
@@ -1,3264 +1,3291 b'' | |||
|
1 | 1 | /** |
|
2 | 2 | * Copyright (c) 2016-present, Yann Collet, Facebook, Inc. |
|
3 | 3 | * All rights reserved. |
|
4 | 4 | * |
|
5 | 5 | * This source code is licensed under the BSD-style license found in the |
|
6 | 6 | * LICENSE file in the root directory of this source tree. An additional grant |
|
7 | 7 | * of patent rights can be found in the PATENTS file in the same directory. |
|
8 | 8 | */ |
|
9 | 9 | |
|
10 | 10 | |
|
11 | 11 | /*-************************************* |
|
12 | 12 | * Dependencies |
|
13 | 13 | ***************************************/ |
|
14 | 14 | #include <string.h> /* memset */ |
|
15 | 15 | #include "mem.h" |
|
16 | 16 | #define XXH_STATIC_LINKING_ONLY /* XXH64_state_t */ |
|
17 | 17 | #include "xxhash.h" /* XXH_reset, update, digest */ |
|
18 | 18 | #define FSE_STATIC_LINKING_ONLY /* FSE_encodeSymbol */ |
|
19 | 19 | #include "fse.h" |
|
20 | 20 | #define HUF_STATIC_LINKING_ONLY |
|
21 | 21 | #include "huf.h" |
|
22 | 22 | #include "zstd_internal.h" /* includes zstd.h */ |
|
23 | 23 | |
|
24 | 24 | |
|
25 | 25 | /*-************************************* |
|
26 | 26 | * Constants |
|
27 | 27 | ***************************************/ |
|
28 | 28 | static const U32 g_searchStrength = 8; /* control skip over incompressible data */ |
|
29 | 29 | #define HASH_READ_SIZE 8 |
|
30 | 30 | typedef enum { ZSTDcs_created=0, ZSTDcs_init, ZSTDcs_ongoing, ZSTDcs_ending } ZSTD_compressionStage_e; |
|
31 | 31 | |
|
32 | 32 | |
|
33 | 33 | /*-************************************* |
|
34 | 34 | * Helper functions |
|
35 | 35 | ***************************************/ |
|
36 | #define ZSTD_STATIC_ASSERT(c) { enum { ZSTD_static_assert = 1/(int)(!!(c)) }; } | |
|
36 | 37 | size_t ZSTD_compressBound(size_t srcSize) { return FSE_compressBound(srcSize) + 12; } |
|
37 | 38 | |
|
38 | 39 | |
|
39 | 40 | /*-************************************* |
|
40 | 41 | * Sequence storage |
|
41 | 42 | ***************************************/ |
|
42 | 43 | static void ZSTD_resetSeqStore(seqStore_t* ssPtr) |
|
43 | 44 | { |
|
44 | 45 | ssPtr->lit = ssPtr->litStart; |
|
45 | 46 | ssPtr->sequences = ssPtr->sequencesStart; |
|
46 | 47 | ssPtr->longLengthID = 0; |
|
47 | 48 | } |
|
48 | 49 | |
|
49 | 50 | |
|
50 | 51 | /*-************************************* |
|
51 | 52 | * Context memory management |
|
52 | 53 | ***************************************/ |
|
53 | 54 | struct ZSTD_CCtx_s |
|
54 | 55 | { |
|
55 | 56 | const BYTE* nextSrc; /* next block here to continue on current prefix */ |
|
56 | 57 | const BYTE* base; /* All regular indexes relative to this position */ |
|
57 | 58 | const BYTE* dictBase; /* extDict indexes relative to this position */ |
|
58 | 59 | U32 dictLimit; /* below that point, need extDict */ |
|
59 | 60 | U32 lowLimit; /* below that point, no more data */ |
|
60 | 61 | U32 nextToUpdate; /* index from which to continue dictionary update */ |
|
61 | 62 | U32 nextToUpdate3; /* index from which to continue dictionary update */ |
|
62 | 63 | U32 hashLog3; /* dispatch table : larger == faster, more memory */ |
|
63 | 64 | U32 loadedDictEnd; |
|
64 | 65 | ZSTD_compressionStage_e stage; |
|
65 | 66 | U32 rep[ZSTD_REP_NUM]; |
|
66 | 67 | U32 savedRep[ZSTD_REP_NUM]; |
|
67 | 68 | U32 dictID; |
|
68 | 69 | ZSTD_parameters params; |
|
69 | 70 | void* workSpace; |
|
70 | 71 | size_t workSpaceSize; |
|
71 | 72 | size_t blockSize; |
|
72 | 73 | U64 frameContentSize; |
|
73 | 74 | XXH64_state_t xxhState; |
|
74 | 75 | ZSTD_customMem customMem; |
|
75 | 76 | |
|
76 | 77 | seqStore_t seqStore; /* sequences storage ptrs */ |
|
77 | 78 | U32* hashTable; |
|
78 | 79 | U32* hashTable3; |
|
79 | 80 | U32* chainTable; |
|
80 | 81 | HUF_CElt* hufTable; |
|
81 | 82 | U32 flagStaticTables; |
|
82 | 83 | FSE_CTable offcodeCTable [FSE_CTABLE_SIZE_U32(OffFSELog, MaxOff)]; |
|
83 | 84 | FSE_CTable matchlengthCTable[FSE_CTABLE_SIZE_U32(MLFSELog, MaxML)]; |
|
84 | 85 | FSE_CTable litlengthCTable [FSE_CTABLE_SIZE_U32(LLFSELog, MaxLL)]; |
|
86 | unsigned tmpCounters[1024]; | |
|
85 | 87 | }; |
|
86 | 88 | |
|
87 | 89 | ZSTD_CCtx* ZSTD_createCCtx(void) |
|
88 | 90 | { |
|
89 | 91 | return ZSTD_createCCtx_advanced(defaultCustomMem); |
|
90 | 92 | } |
|
91 | 93 | |
|
92 | 94 | ZSTD_CCtx* ZSTD_createCCtx_advanced(ZSTD_customMem customMem) |
|
93 | 95 | { |
|
94 | 96 | ZSTD_CCtx* cctx; |
|
95 | 97 | |
|
96 | 98 | if (!customMem.customAlloc && !customMem.customFree) customMem = defaultCustomMem; |
|
97 | 99 | if (!customMem.customAlloc || !customMem.customFree) return NULL; |
|
98 | 100 | |
|
99 | 101 | cctx = (ZSTD_CCtx*) ZSTD_malloc(sizeof(ZSTD_CCtx), customMem); |
|
100 | 102 | if (!cctx) return NULL; |
|
101 | 103 | memset(cctx, 0, sizeof(ZSTD_CCtx)); |
|
102 | 104 | memcpy(&(cctx->customMem), &customMem, sizeof(customMem)); |
|
103 | 105 | return cctx; |
|
104 | 106 | } |
|
105 | 107 | |
|
106 | 108 | size_t ZSTD_freeCCtx(ZSTD_CCtx* cctx) |
|
107 | 109 | { |
|
108 | 110 | if (cctx==NULL) return 0; /* support free on NULL */ |
|
109 | 111 | ZSTD_free(cctx->workSpace, cctx->customMem); |
|
110 | 112 | ZSTD_free(cctx, cctx->customMem); |
|
111 | 113 | return 0; /* reserved as a potential error code in the future */ |
|
112 | 114 | } |
|
113 | 115 | |
|
114 | 116 | size_t ZSTD_sizeof_CCtx(const ZSTD_CCtx* cctx) |
|
115 | 117 | { |
|
116 | 118 | if (cctx==NULL) return 0; /* support sizeof on NULL */ |
|
117 | 119 | return sizeof(*cctx) + cctx->workSpaceSize; |
|
118 | 120 | } |
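
A minimal lifecycle sketch for the context management trio above (error handling elided; ZSTD_sizeof_CCtx may sit behind zstd.h's experimental section):

#include "zstd.h"

static void cctx_lifecycle(void)
{
    ZSTD_CCtx* const cctx = ZSTD_createCCtx();           /* default allocator */
    if (cctx != NULL) {
        size_t const footprint = ZSTD_sizeof_CCtx(cctx); /* struct + workspace */
        (void)footprint;
    }
    ZSTD_freeCCtx(cctx);                                 /* NULL-safe */
}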
|
119 | 121 | |
|
120 | 122 | const seqStore_t* ZSTD_getSeqStore(const ZSTD_CCtx* ctx) /* hidden interface */ |
|
121 | 123 | { |
|
122 | 124 | return &(ctx->seqStore); |
|
123 | 125 | } |
|
124 | 126 | |
|
125 | 127 | static ZSTD_parameters ZSTD_getParamsFromCCtx(const ZSTD_CCtx* cctx) |
|
126 | 128 | { |
|
127 | 129 | return cctx->params; |
|
128 | 130 | } |
|
129 | 131 | |
|
130 | 132 | |
|
131 | 133 | /** ZSTD_checkCParams() :
|
132 | 134 | ensure param values remain within authorized range. |
|
133 | 135 | @return : 0, or an error code if one value is beyond authorized range */ |
|
134 | 136 | size_t ZSTD_checkCParams(ZSTD_compressionParameters cParams) |
|
135 | 137 | { |
|
136 | 138 | # define CLAMPCHECK(val,min,max) { if ((val<min) | (val>max)) return ERROR(compressionParameter_unsupported); } |
|
137 | 139 | CLAMPCHECK(cParams.windowLog, ZSTD_WINDOWLOG_MIN, ZSTD_WINDOWLOG_MAX); |
|
138 | 140 | CLAMPCHECK(cParams.chainLog, ZSTD_CHAINLOG_MIN, ZSTD_CHAINLOG_MAX); |
|
139 | 141 | CLAMPCHECK(cParams.hashLog, ZSTD_HASHLOG_MIN, ZSTD_HASHLOG_MAX); |
|
140 | 142 | CLAMPCHECK(cParams.searchLog, ZSTD_SEARCHLOG_MIN, ZSTD_SEARCHLOG_MAX); |
|
141 | 143 | { U32 const searchLengthMin = ((cParams.strategy == ZSTD_fast) | (cParams.strategy == ZSTD_greedy)) ? ZSTD_SEARCHLENGTH_MIN+1 : ZSTD_SEARCHLENGTH_MIN; |
|
142 | 144 | U32 const searchLengthMax = (cParams.strategy == ZSTD_fast) ? ZSTD_SEARCHLENGTH_MAX : ZSTD_SEARCHLENGTH_MAX-1; |
|
143 | 145 | CLAMPCHECK(cParams.searchLength, searchLengthMin, searchLengthMax); } |
|
144 | 146 | CLAMPCHECK(cParams.targetLength, ZSTD_TARGETLENGTH_MIN, ZSTD_TARGETLENGTH_MAX); |
|
145 | 147 | if ((U32)(cParams.strategy) > (U32)ZSTD_btopt2) return ERROR(compressionParameter_unsupported); |
|
146 | 148 | return 0; |
|
147 | 149 | } |
|
148 | 150 | |
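
Note that CLAMPCHECK evaluates both bounds and combines them with a bitwise OR, so each field is validated branch-free, and any violation maps to the single compressionParameter_unsupported error. A hedged usage sketch from the caller's side, assuming ZSTD_getCParams() and ZSTD_checkCParams() from zstd.h's static-linking section:

    #define ZSTD_STATIC_LINKING_ONLY
    #include <stdio.h>
    #include "zstd.h"

    int main(void)
    {
        /* start from the defaults for level 3, unknown srcSize, no dictionary */
        ZSTD_compressionParameters cp = ZSTD_getCParams(3, 0, 0);
        cp.windowLog = 99;                       /* deliberately out of range */
        {   size_t const err = ZSTD_checkCParams(cp);
            printf("params valid? %s\n", ZSTD_isError(err) ? "no" : "yes");  /* -> no */
        }
        return 0;
    }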
|
149 | 151 | |
|
152 | /** ZSTD_cycleLog() : | |
|
153 | * condition for correct operation : hashLog > 1 */ | |
|
154 | static U32 ZSTD_cycleLog(U32 hashLog, ZSTD_strategy strat) | |
|
155 | { | |
|
156 | U32 const btScale = ((U32)strat >= (U32)ZSTD_btlazy2); | |
|
157 | return hashLog - btScale; | |
|
158 | } | |
|
159 | ||
|
150 | 160 | /** ZSTD_adjustCParams() : |
|
151 | 161 | optimize `cPar` for a given input (`srcSize` and `dictSize`). |
|
152 | 162 | mostly downsizing to reduce memory consumption and initialization. |
|
153 | 163 | Both `srcSize` and `dictSize` are optional (use 0 if unknown), |
|
154 | 164 | but if both are 0, no optimization can be done. |
|
155 | 165 | Note : cPar is considered validated at this stage. Use ZSTD_checkCParams() to ensure that. */
|
156 | 166 | ZSTD_compressionParameters ZSTD_adjustCParams(ZSTD_compressionParameters cPar, unsigned long long srcSize, size_t dictSize) |
|
157 | 167 | { |
|
158 | 168 | if (srcSize+dictSize == 0) return cPar; /* no size information available : no adjustment */ |
|
159 | 169 | |
|
160 | 170 | /* resize params, to use less memory when necessary */ |
|
161 | 171 | { U32 const minSrcSize = (srcSize==0) ? 500 : 0; |
|
162 | 172 | U64 const rSize = srcSize + dictSize + minSrcSize; |
|
163 | 173 | if (rSize < ((U64)1<<ZSTD_WINDOWLOG_MAX)) { |
|
164 | 174 | U32 const srcLog = MAX(ZSTD_HASHLOG_MIN, ZSTD_highbit32((U32)(rSize)-1) + 1); |
|
165 | 175 | if (cPar.windowLog > srcLog) cPar.windowLog = srcLog; |
|
166 | 176 | } } |
|
167 | 177 | if (cPar.hashLog > cPar.windowLog) cPar.hashLog = cPar.windowLog; |
|
168 | { U32 const btPlus = (cPar.strategy == ZSTD_btlazy2) | (cPar.strategy == ZSTD_btopt) | (cPar.strategy == ZSTD_btopt2); | |
|
169 | U32 const maxChainLog = cPar.windowLog+btPlus; | |
|
170 | if (cPar.chainLog > maxChainLog) cPar.chainLog = maxChainLog; } /* <= ZSTD_CHAINLOG_MAX */ | |
|
178 | { U32 const cycleLog = ZSTD_cycleLog(cPar.chainLog, cPar.strategy); | |
|
179 | if (cycleLog > cPar.windowLog) cPar.chainLog -= (cycleLog - cPar.windowLog); | |
|
180 | } | |
|
171 | 181 | |
|
172 | 182 | if (cPar.windowLog < ZSTD_WINDOWLOG_ABSOLUTEMIN) cPar.windowLog = ZSTD_WINDOWLOG_ABSOLUTEMIN; /* required for frame header */ |
|
173 | 183 | |
|
174 | 184 | return cPar; |
|
175 | 185 | } |
|
176 | 186 | |
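
The effect is easiest to see with a small pledged source size: windowLog collapses toward log2(srcSize), hashLog is capped by it, and chainLog follows through the new ZSTD_cycleLog() correction. A sketch using the public signature declared above:

    #define ZSTD_STATIC_LINKING_ONLY
    #include <stdio.h>
    #include "zstd.h"

    int main(void)
    {
        ZSTD_compressionParameters const cp  = ZSTD_getCParams(19, 0, 0);
        ZSTD_compressionParameters const adj = ZSTD_adjustCParams(cp, 10*1024 /*srcSize*/, 0 /*dictSize*/);
        printf("windowLog %u -> %u, chainLog %u -> %u, hashLog %u -> %u\n",
               cp.windowLog, adj.windowLog,
               cp.chainLog,  adj.chainLog,
               cp.hashLog,   adj.hashLog);
        return 0;
    }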
|
177 | 187 | |
|
178 | 188 | size_t ZSTD_estimateCCtxSize(ZSTD_compressionParameters cParams) |
|
179 | 189 | { |
|
180 | 190 | size_t const blockSize = MIN(ZSTD_BLOCKSIZE_ABSOLUTEMAX, (size_t)1 << cParams.windowLog); |
|
181 | 191 | U32 const divider = (cParams.searchLength==3) ? 3 : 4; |
|
182 | 192 | size_t const maxNbSeq = blockSize / divider; |
|
183 | 193 | size_t const tokenSpace = blockSize + 11*maxNbSeq; |
|
184 | 194 | |
|
185 | 195 | size_t const chainSize = (cParams.strategy == ZSTD_fast) ? 0 : (1 << cParams.chainLog); |
|
186 | 196 | size_t const hSize = ((size_t)1) << cParams.hashLog; |
|
187 | 197 | U32 const hashLog3 = (cParams.searchLength>3) ? 0 : MIN(ZSTD_HASHLOG3_MAX, cParams.windowLog); |
|
188 | 198 | size_t const h3Size = ((size_t)1) << hashLog3; |
|
189 | 199 | size_t const tableSpace = (chainSize + hSize + h3Size) * sizeof(U32); |
|
190 | 200 | |
|
191 | 201 | size_t const optSpace = ((MaxML+1) + (MaxLL+1) + (MaxOff+1) + (1<<Litbits))*sizeof(U32) |
|
192 | 202 | + (ZSTD_OPT_NUM+1)*(sizeof(ZSTD_match_t) + sizeof(ZSTD_optimal_t)); |
|
193 | 203 | size_t const neededSpace = tableSpace + (256*sizeof(U32)) /* huffTable */ + tokenSpace |
|
194 | 204 | + (((cParams.strategy == ZSTD_btopt) || (cParams.strategy == ZSTD_btopt2)) ? optSpace : 0); |
|
195 | 205 | |
|
196 | 206 | return sizeof(ZSTD_CCtx) + neededSpace; |
|
197 | 207 | } |
|
198 | 208 | |
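
Since the same arithmetic sizes the real workspace in ZSTD_resetCCtx_advanced() below, this estimator lets callers budget memory before creating a context. A sketch sampling a few compression levels:

    #define ZSTD_STATIC_LINKING_ONLY
    #include <stdio.h>
    #include "zstd.h"

    int main(void)
    {
        int level;
        for (level = 1; level <= 22; level += 7) {   /* sample a few levels */
            ZSTD_compressionParameters const cp = ZSTD_getCParams(level, 0, 0);
            printf("level %2d : ~%u KB\n", level,
                   (unsigned)(ZSTD_estimateCCtxSize(cp) >> 10));
        }
        return 0;
    }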
|
199 | 209 | |
|
200 | 210 | static U32 ZSTD_equivalentParams(ZSTD_parameters param1, ZSTD_parameters param2) |
|
201 | 211 | { |
|
202 | 212 | return (param1.cParams.hashLog == param2.cParams.hashLog) |
|
203 | 213 | & (param1.cParams.chainLog == param2.cParams.chainLog) |
|
204 | 214 | & (param1.cParams.strategy == param2.cParams.strategy) |
|
205 | 215 | & ((param1.cParams.searchLength==3) == (param2.cParams.searchLength==3)); |
|
206 | 216 | } |
|
207 | 217 | |
|
208 | 218 | /*! ZSTD_continueCCtx() : |
|
209 | 219 | reuse CCtx without reset (note : requires no dictionary) */ |
|
210 | 220 | static size_t ZSTD_continueCCtx(ZSTD_CCtx* cctx, ZSTD_parameters params, U64 frameContentSize) |
|
211 | 221 | { |
|
212 | 222 | U32 const end = (U32)(cctx->nextSrc - cctx->base); |
|
213 | 223 | cctx->params = params; |
|
214 | 224 | cctx->frameContentSize = frameContentSize; |
|
215 | 225 | cctx->lowLimit = end; |
|
216 | 226 | cctx->dictLimit = end; |
|
217 | 227 | cctx->nextToUpdate = end+1; |
|
218 | 228 | cctx->stage = ZSTDcs_init; |
|
219 | 229 | cctx->dictID = 0; |
|
220 | 230 | cctx->loadedDictEnd = 0; |
|
221 | 231 | { int i; for (i=0; i<ZSTD_REP_NUM; i++) cctx->rep[i] = repStartValue[i]; } |
|
222 | 232 | cctx->seqStore.litLengthSum = 0; /* force reset of btopt stats */ |
|
223 | 233 | XXH64_reset(&cctx->xxhState, 0); |
|
224 | 234 | return 0; |
|
225 | 235 | } |
|
226 | 236 | |
|
227 | 237 | typedef enum { ZSTDcrp_continue, ZSTDcrp_noMemset, ZSTDcrp_fullReset } ZSTD_compResetPolicy_e; |
|
228 | 238 | |
|
229 | 239 | /*! ZSTD_resetCCtx_advanced() : |
|
230 | 240 | note : 'params' must be validated */ |
|
231 | 241 | static size_t ZSTD_resetCCtx_advanced (ZSTD_CCtx* zc, |
|
232 | 242 | ZSTD_parameters params, U64 frameContentSize, |
|
233 | 243 | ZSTD_compResetPolicy_e const crp) |
|
234 | 244 | { |
|
235 | 245 | if (crp == ZSTDcrp_continue) |
|
236 | 246 | if (ZSTD_equivalentParams(params, zc->params)) |
|
237 | 247 | return ZSTD_continueCCtx(zc, params, frameContentSize); |
|
238 | 248 | |
|
239 | 249 | { size_t const blockSize = MIN(ZSTD_BLOCKSIZE_ABSOLUTEMAX, (size_t)1 << params.cParams.windowLog); |
|
240 | 250 | U32 const divider = (params.cParams.searchLength==3) ? 3 : 4; |
|
241 | 251 | size_t const maxNbSeq = blockSize / divider; |
|
242 | 252 | size_t const tokenSpace = blockSize + 11*maxNbSeq; |
|
243 | 253 | size_t const chainSize = (params.cParams.strategy == ZSTD_fast) ? 0 : (1 << params.cParams.chainLog); |
|
244 | 254 | size_t const hSize = ((size_t)1) << params.cParams.hashLog; |
|
245 | 255 | U32 const hashLog3 = (params.cParams.searchLength>3) ? 0 : MIN(ZSTD_HASHLOG3_MAX, params.cParams.windowLog); |
|
246 | 256 | size_t const h3Size = ((size_t)1) << hashLog3; |
|
247 | 257 | size_t const tableSpace = (chainSize + hSize + h3Size) * sizeof(U32); |
|
248 | 258 | void* ptr; |
|
249 | 259 | |
|
250 | 260 | /* Check if workSpace is large enough, alloc a new one if needed */ |
|
251 | 261 | { size_t const optSpace = ((MaxML+1) + (MaxLL+1) + (MaxOff+1) + (1<<Litbits))*sizeof(U32) |
|
252 | 262 | + (ZSTD_OPT_NUM+1)*(sizeof(ZSTD_match_t) + sizeof(ZSTD_optimal_t)); |
|
253 | 263 | size_t const neededSpace = tableSpace + (256*sizeof(U32)) /* huffTable */ + tokenSpace |
|
254 | 264 | + (((params.cParams.strategy == ZSTD_btopt) || (params.cParams.strategy == ZSTD_btopt2)) ? optSpace : 0); |
|
255 | 265 | if (zc->workSpaceSize < neededSpace) { |
|
256 | 266 | ZSTD_free(zc->workSpace, zc->customMem); |
|
257 | 267 | zc->workSpace = ZSTD_malloc(neededSpace, zc->customMem); |
|
258 | 268 | if (zc->workSpace == NULL) return ERROR(memory_allocation); |
|
259 | 269 | zc->workSpaceSize = neededSpace; |
|
260 | 270 | } } |
|
261 | 271 | |
|
262 | 272 | if (crp!=ZSTDcrp_noMemset) memset(zc->workSpace, 0, tableSpace); /* reset tables only */ |
|
263 | 273 | XXH64_reset(&zc->xxhState, 0); |
|
264 | 274 | zc->hashLog3 = hashLog3; |
|
265 | 275 | zc->hashTable = (U32*)(zc->workSpace); |
|
266 | 276 | zc->chainTable = zc->hashTable + hSize; |
|
267 | 277 | zc->hashTable3 = zc->chainTable + chainSize; |
|
268 | 278 | ptr = zc->hashTable3 + h3Size; |
|
269 | 279 | zc->hufTable = (HUF_CElt*)ptr; |
|
270 | 280 | zc->flagStaticTables = 0; |
|
271 | 281 | ptr = ((U32*)ptr) + 256; /* note : HUF_CElt* is incomplete type, size is simulated using U32 */ |
|
272 | 282 | |
|
273 | 283 | zc->nextToUpdate = 1; |
|
274 | 284 | zc->nextSrc = NULL; |
|
275 | 285 | zc->base = NULL; |
|
276 | 286 | zc->dictBase = NULL; |
|
277 | 287 | zc->dictLimit = 0; |
|
278 | 288 | zc->lowLimit = 0; |
|
279 | 289 | zc->params = params; |
|
280 | 290 | zc->blockSize = blockSize; |
|
281 | 291 | zc->frameContentSize = frameContentSize; |
|
282 | 292 | { int i; for (i=0; i<ZSTD_REP_NUM; i++) zc->rep[i] = repStartValue[i]; } |
|
283 | 293 | |
|
284 | 294 | if ((params.cParams.strategy == ZSTD_btopt) || (params.cParams.strategy == ZSTD_btopt2)) { |
|
285 | 295 | zc->seqStore.litFreq = (U32*)ptr; |
|
286 | 296 | zc->seqStore.litLengthFreq = zc->seqStore.litFreq + (1<<Litbits); |
|
287 | 297 | zc->seqStore.matchLengthFreq = zc->seqStore.litLengthFreq + (MaxLL+1); |
|
288 | 298 | zc->seqStore.offCodeFreq = zc->seqStore.matchLengthFreq + (MaxML+1); |
|
289 | 299 | ptr = zc->seqStore.offCodeFreq + (MaxOff+1); |
|
290 | 300 | zc->seqStore.matchTable = (ZSTD_match_t*)ptr; |
|
291 | 301 | ptr = zc->seqStore.matchTable + ZSTD_OPT_NUM+1; |
|
292 | 302 | zc->seqStore.priceTable = (ZSTD_optimal_t*)ptr; |
|
293 | 303 | ptr = zc->seqStore.priceTable + ZSTD_OPT_NUM+1; |
|
294 | 304 | zc->seqStore.litLengthSum = 0; |
|
295 | 305 | } |
|
296 | 306 | zc->seqStore.sequencesStart = (seqDef*)ptr; |
|
297 | 307 | ptr = zc->seqStore.sequencesStart + maxNbSeq; |
|
298 | 308 | zc->seqStore.llCode = (BYTE*) ptr; |
|
299 | 309 | zc->seqStore.mlCode = zc->seqStore.llCode + maxNbSeq; |
|
300 | 310 | zc->seqStore.ofCode = zc->seqStore.mlCode + maxNbSeq; |
|
301 | 311 | zc->seqStore.litStart = zc->seqStore.ofCode + maxNbSeq; |
|
302 | 312 | |
|
303 | 313 | zc->stage = ZSTDcs_init; |
|
304 | 314 | zc->dictID = 0; |
|
305 | 315 | zc->loadedDictEnd = 0; |
|
306 | 316 | |
|
307 | 317 | return 0; |
|
308 | 318 | } |
|
309 | 319 | } |
|
310 | 320 | |
|
311 | 321 | |
|
312 | 322 | /*! ZSTD_copyCCtx() : |
|
313 | 323 | * Duplicate an existing context `srcCCtx` into another one `dstCCtx`. |
|
314 | 324 | * Only works during stage ZSTDcs_init (i.e. after creation, but before first call to ZSTD_compressContinue()). |
|
315 | 325 | * @return : 0, or an error code */ |
|
316 | 326 | size_t ZSTD_copyCCtx(ZSTD_CCtx* dstCCtx, const ZSTD_CCtx* srcCCtx, unsigned long long pledgedSrcSize) |
|
317 | 327 | { |
|
318 | 328 | if (srcCCtx->stage!=ZSTDcs_init) return ERROR(stage_wrong); |
|
319 | 329 | |
|
320 | 330 | memcpy(&dstCCtx->customMem, &srcCCtx->customMem, sizeof(ZSTD_customMem)); |
|
321 | 331 | ZSTD_resetCCtx_advanced(dstCCtx, srcCCtx->params, pledgedSrcSize, ZSTDcrp_noMemset); |
|
322 | 332 | |
|
323 | 333 | /* copy tables */ |
|
324 | 334 | { size_t const chainSize = (srcCCtx->params.cParams.strategy == ZSTD_fast) ? 0 : (1 << srcCCtx->params.cParams.chainLog); |
|
325 | 335 | size_t const hSize = ((size_t)1) << srcCCtx->params.cParams.hashLog; |
|
326 | 336 | size_t const h3Size = (size_t)1 << srcCCtx->hashLog3; |
|
327 | 337 | size_t const tableSpace = (chainSize + hSize + h3Size) * sizeof(U32); |
|
328 | 338 | memcpy(dstCCtx->workSpace, srcCCtx->workSpace, tableSpace); |
|
329 | 339 | } |
|
330 | 340 | |
|
331 | 341 | /* copy dictionary offsets */ |
|
332 | 342 | dstCCtx->nextToUpdate = srcCCtx->nextToUpdate; |
|
333 | 343 | dstCCtx->nextToUpdate3= srcCCtx->nextToUpdate3; |
|
334 | 344 | dstCCtx->nextSrc = srcCCtx->nextSrc; |
|
335 | 345 | dstCCtx->base = srcCCtx->base; |
|
336 | 346 | dstCCtx->dictBase = srcCCtx->dictBase; |
|
337 | 347 | dstCCtx->dictLimit = srcCCtx->dictLimit; |
|
338 | 348 | dstCCtx->lowLimit = srcCCtx->lowLimit; |
|
339 | 349 | dstCCtx->loadedDictEnd= srcCCtx->loadedDictEnd; |
|
340 | 350 | dstCCtx->dictID = srcCCtx->dictID; |
|
341 | 351 | |
|
342 | 352 | /* copy entropy tables */ |
|
343 | 353 | dstCCtx->flagStaticTables = srcCCtx->flagStaticTables; |
|
344 | 354 | if (srcCCtx->flagStaticTables) { |
|
345 | 355 | memcpy(dstCCtx->hufTable, srcCCtx->hufTable, 256*4); |
|
346 | 356 | memcpy(dstCCtx->litlengthCTable, srcCCtx->litlengthCTable, sizeof(dstCCtx->litlengthCTable)); |
|
347 | 357 | memcpy(dstCCtx->matchlengthCTable, srcCCtx->matchlengthCTable, sizeof(dstCCtx->matchlengthCTable)); |
|
348 | 358 | memcpy(dstCCtx->offcodeCTable, srcCCtx->offcodeCTable, sizeof(dstCCtx->offcodeCTable)); |
|
349 | 359 | } |
|
350 | 360 | |
|
351 | 361 | return 0; |
|
352 | 362 | } |
|
353 | 363 | |
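
The ZSTDcs_init restriction makes this a cheap way to amortize dictionary loading: digest the dictionary once into a prepared context, then clone it before each frame instead of re-digesting. A hedged sketch, assuming ZSTD_compressBegin_usingDict() from the advanced one-pass API:

    #define ZSTD_STATIC_LINKING_ONLY
    #include "zstd.h"

    /* load `dict` once into `prepared`, then clone per frame instead of
       re-digesting it; returns 0 or a zstd error code */
    static size_t cloneAfterDictLoad(ZSTD_CCtx* dst, ZSTD_CCtx* prepared,
                                     const void* dict, size_t dictSize)
    {
        size_t const initErr = ZSTD_compressBegin_usingDict(prepared, dict, dictSize, 3 /*level*/);
        if (ZSTD_isError(initErr)) return initErr;
        /* must happen before any ZSTD_compressContinue() on `prepared` */
        return ZSTD_copyCCtx(dst, prepared, 0 /* pledgedSrcSize unknown */);
    }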
|
354 | 364 | |
|
355 | 365 | /*! ZSTD_reduceTable() : |
|
356 | 366 | * reduce table indexes by `reducerValue` */ |
|
357 | 367 | static void ZSTD_reduceTable (U32* const table, U32 const size, U32 const reducerValue) |
|
358 | 368 | { |
|
359 | 369 | U32 u; |
|
360 | 370 | for (u=0 ; u < size ; u++) { |
|
361 | 371 | if (table[u] < reducerValue) table[u] = 0; |
|
362 | 372 | else table[u] -= reducerValue; |
|
363 | 373 | } |
|
364 | 374 | } |
|
365 | 375 | |
|
366 | 376 | /*! ZSTD_reduceIndex() : |
|
367 | 377 | * rescale all indexes to avoid future overflow (indexes are U32) */ |
|
368 | 378 | static void ZSTD_reduceIndex (ZSTD_CCtx* zc, const U32 reducerValue) |
|
369 | 379 | { |
|
370 | 380 | { U32 const hSize = 1 << zc->params.cParams.hashLog; |
|
371 | 381 | ZSTD_reduceTable(zc->hashTable, hSize, reducerValue); } |
|
372 | 382 | |
|
373 | 383 | { U32 const chainSize = (zc->params.cParams.strategy == ZSTD_fast) ? 0 : (1 << zc->params.cParams.chainLog); |
|
374 | 384 | ZSTD_reduceTable(zc->chainTable, chainSize, reducerValue); } |
|
375 | 385 | |
|
376 | 386 | { U32 const h3Size = (zc->hashLog3) ? 1 << zc->hashLog3 : 0; |
|
377 | 387 | ZSTD_reduceTable(zc->hashTable3, h3Size, reducerValue); } |
|
378 | 388 | } |
|
379 | 389 | |
|
380 | 390 | |
|
381 | 391 | /*-******************************************************* |
|
382 | 392 | * Block entropic compression |
|
383 | 393 | *********************************************************/ |
|
384 | 394 | |
|
385 | 395 | /* See doc/zstd_compression_format.md for detailed format description */ |
|
386 | 396 | |
|
387 | 397 | size_t ZSTD_noCompressBlock (void* dst, size_t dstCapacity, const void* src, size_t srcSize) |
|
388 | 398 | { |
|
389 | 399 | if (srcSize + ZSTD_blockHeaderSize > dstCapacity) return ERROR(dstSize_tooSmall); |
|
390 | 400 | memcpy((BYTE*)dst + ZSTD_blockHeaderSize, src, srcSize); |
|
391 | 401 | MEM_writeLE24(dst, (U32)(srcSize << 2) + (U32)bt_raw); |
|
392 | 402 | return ZSTD_blockHeaderSize+srcSize; |
|
393 | 403 | } |
|
394 | 404 | |
|
395 | 405 | |
|
396 | 406 | static size_t ZSTD_noCompressLiterals (void* dst, size_t dstCapacity, const void* src, size_t srcSize) |
|
397 | 407 | { |
|
398 | 408 | BYTE* const ostart = (BYTE* const)dst; |
|
399 | 409 | U32 const flSize = 1 + (srcSize>31) + (srcSize>4095); |
|
400 | 410 | |
|
401 | 411 | if (srcSize + flSize > dstCapacity) return ERROR(dstSize_tooSmall); |
|
402 | 412 | |
|
403 | 413 | switch(flSize) |
|
404 | 414 | { |
|
405 | 415 | case 1: /* 2 - 1 - 5 */ |
|
406 | 416 | ostart[0] = (BYTE)((U32)set_basic + (srcSize<<3)); |
|
407 | 417 | break; |
|
408 | 418 | case 2: /* 2 - 2 - 12 */ |
|
409 | 419 | MEM_writeLE16(ostart, (U16)((U32)set_basic + (1<<2) + (srcSize<<4))); |
|
410 | 420 | break; |
|
411 | 421 | default: /*note : should not be necessary : flSize is within {1,2,3} */ |
|
412 | 422 | case 3: /* 2 - 2 - 20 */ |
|
413 | 423 | MEM_writeLE32(ostart, (U32)((U32)set_basic + (3<<2) + (srcSize<<4))); |
|
414 | 424 | break; |
|
415 | 425 | } |
|
416 | 426 | |
|
417 | 427 | memcpy(ostart + flSize, src, srcSize); |
|
418 | 428 | return srcSize + flSize; |
|
419 | 429 | } |
|
420 | 430 | |
|
421 | 431 | static size_t ZSTD_compressRleLiteralsBlock (void* dst, size_t dstCapacity, const void* src, size_t srcSize) |
|
422 | 432 | { |
|
423 | 433 | BYTE* const ostart = (BYTE* const)dst; |
|
424 | 434 | U32 const flSize = 1 + (srcSize>31) + (srcSize>4095); |
|
425 | 435 | |
|
426 | 436 | (void)dstCapacity; /* dstCapacity already guaranteed to be >=4, hence large enough */ |
|
427 | 437 | |
|
428 | 438 | switch(flSize) |
|
429 | 439 | { |
|
430 | 440 | case 1: /* 2 - 1 - 5 */ |
|
431 | 441 | ostart[0] = (BYTE)((U32)set_rle + (srcSize<<3)); |
|
432 | 442 | break; |
|
433 | 443 | case 2: /* 2 - 2 - 12 */ |
|
434 | 444 | MEM_writeLE16(ostart, (U16)((U32)set_rle + (1<<2) + (srcSize<<4))); |
|
435 | 445 | break; |
|
436 | 446 | default: /*note : should not be necessary : flSize is necessarily within {1,2,3} */ |
|
437 | 447 | case 3: /* 2 - 2 - 20 */ |
|
438 | 448 | MEM_writeLE32(ostart, (U32)((U32)set_rle + (3<<2) + (srcSize<<4))); |
|
439 | 449 | break; |
|
440 | 450 | } |
|
441 | 451 | |
|
442 | 452 | ostart[flSize] = *(const BYTE*)src; |
|
443 | 453 | return flSize+1; |
|
444 | 454 | } |
|
445 | 455 | |
|
446 | 456 | |
|
447 | 457 | static size_t ZSTD_minGain(size_t srcSize) { return (srcSize >> 6) + 2; } |
|
448 | 458 | |
|
449 | 459 | static size_t ZSTD_compressLiterals (ZSTD_CCtx* zc, |
|
450 | 460 | void* dst, size_t dstCapacity, |
|
451 | 461 | const void* src, size_t srcSize) |
|
452 | 462 | { |
|
453 | 463 | size_t const minGain = ZSTD_minGain(srcSize); |
|
454 | 464 | size_t const lhSize = 3 + (srcSize >= 1 KB) + (srcSize >= 16 KB); |
|
455 | 465 | BYTE* const ostart = (BYTE*)dst; |
|
456 | 466 | U32 singleStream = srcSize < 256; |
|
457 | 467 | symbolEncodingType_e hType = set_compressed; |
|
458 | 468 | size_t cLitSize; |
|
459 | 469 | |
|
460 | 470 | |
|
461 | 471 | /* small ? don't even attempt compression (speed opt) */ |
|
462 | 472 | # define LITERAL_NOENTROPY 63 |
|
463 | 473 | { size_t const minLitSize = zc->flagStaticTables ? 6 : LITERAL_NOENTROPY; |
|
464 | 474 | if (srcSize <= minLitSize) return ZSTD_noCompressLiterals(dst, dstCapacity, src, srcSize); |
|
465 | 475 | } |
|
466 | 476 | |
|
467 | 477 | if (dstCapacity < lhSize+1) return ERROR(dstSize_tooSmall); /* not enough space for compression */ |
|
468 | 478 | if (zc->flagStaticTables && (lhSize==3)) { |
|
469 | 479 | hType = set_repeat; |
|
470 | 480 | singleStream = 1; |
|
471 | 481 | cLitSize = HUF_compress1X_usingCTable(ostart+lhSize, dstCapacity-lhSize, src, srcSize, zc->hufTable); |
|
472 | 482 | } else { |
|
473 | cLitSize = singleStream ? HUF_compress1X(ostart+lhSize, dstCapacity-lhSize, src, srcSize, 255, 11) | |
|
474 | : HUF_compress2 (ostart+lhSize, dstCapacity-lhSize, src, srcSize, 255, 11); | 

483 | cLitSize = singleStream ? HUF_compress1X_wksp(ostart+lhSize, dstCapacity-lhSize, src, srcSize, 255, 11, zc->tmpCounters, sizeof(zc->tmpCounters)) | |
|
484 | : HUF_compress4X_wksp(ostart+lhSize, dstCapacity-lhSize, src, srcSize, 255, 11, zc->tmpCounters, sizeof(zc->tmpCounters)); | |
|
475 | 485 | } |
|
476 | 486 | |
|
477 | 487 | if ((cLitSize==0) | (cLitSize >= srcSize - minGain)) |
|
478 | 488 | return ZSTD_noCompressLiterals(dst, dstCapacity, src, srcSize); |
|
479 | 489 | if (cLitSize==1) |
|
480 | 490 | return ZSTD_compressRleLiteralsBlock(dst, dstCapacity, src, srcSize); |
|
481 | 491 | |
|
482 | 492 | /* Build header */ |
|
483 | 493 | switch(lhSize) |
|
484 | 494 | { |
|
485 | 495 | case 3: /* 2 - 2 - 10 - 10 */ |
|
486 | 496 | { U32 const lhc = hType + ((!singleStream) << 2) + ((U32)srcSize<<4) + ((U32)cLitSize<<14); |
|
487 | 497 | MEM_writeLE24(ostart, lhc); |
|
488 | 498 | break; |
|
489 | 499 | } |
|
490 | 500 | case 4: /* 2 - 2 - 14 - 14 */ |
|
491 | 501 | { U32 const lhc = hType + (2 << 2) + ((U32)srcSize<<4) + ((U32)cLitSize<<18); |
|
492 | 502 | MEM_writeLE32(ostart, lhc); |
|
493 | 503 | break; |
|
494 | 504 | } |
|
495 | 505 | default: /* should not be necessary, lhSize is only {3,4,5} */ |
|
496 | 506 | case 5: /* 2 - 2 - 18 - 18 */ |
|
497 | 507 | { U32 const lhc = hType + (3 << 2) + ((U32)srcSize<<4) + ((U32)cLitSize<<22); |
|
498 | 508 | MEM_writeLE32(ostart, lhc); |
|
499 | 509 | ostart[4] = (BYTE)(cLitSize >> 10); |
|
500 | 510 | break; |
|
501 | 511 | } |
|
502 | 512 | } |
|
503 | 513 | return lhSize+cLitSize; |
|
504 | 514 | } |
|
505 | 515 | |
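
All three layouts pack, little-endian: 2 bits of literals-block type, 2 bits of size format, then regenerated and compressed sizes of equal width (10, 14, or 18 bits). A standalone round-trip of the lhSize==3 case (set_compressed assumed to equal 2, per the usual symbolEncodingType_e ordering):

    #include <stdio.h>

    int main(void)
    {
        unsigned const hType = 2;                 /* set_compressed (assumed value) */
        unsigned const singleStream = 0;
        unsigned const srcSize = 600, cLitSize = 321;   /* both must fit in 10 bits */
        unsigned const lhc = hType + ((!singleStream) << 2)
                           + (srcSize << 4) + (cLitSize << 14);
        /* decode the 24-bit header again */
        printf("type=%u sizeFormat=%u regen=%u comp=%u\n",
               lhc & 3, (lhc >> 2) & 3, (lhc >> 4) & 0x3FF, (lhc >> 14) & 0x3FF);
        return 0;   /* prints: type=2 sizeFormat=1 regen=600 comp=321 */
    }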
|
506 | 516 | static const BYTE LL_Code[64] = { 0, 1, 2, 3, 4, 5, 6, 7, |
|
507 | 517 | 8, 9, 10, 11, 12, 13, 14, 15, |
|
508 | 518 | 16, 16, 17, 17, 18, 18, 19, 19, |
|
509 | 519 | 20, 20, 20, 20, 21, 21, 21, 21, |
|
510 | 520 | 22, 22, 22, 22, 22, 22, 22, 22, |
|
511 | 521 | 23, 23, 23, 23, 23, 23, 23, 23, |
|
512 | 522 | 24, 24, 24, 24, 24, 24, 24, 24, |
|
513 | 523 | 24, 24, 24, 24, 24, 24, 24, 24 }; |
|
514 | 524 | |
|
515 | 525 | static const BYTE ML_Code[128] = { 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, |
|
516 | 526 | 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, |
|
517 | 527 | 32, 32, 33, 33, 34, 34, 35, 35, 36, 36, 36, 36, 37, 37, 37, 37, |
|
518 | 528 | 38, 38, 38, 38, 38, 38, 38, 38, 39, 39, 39, 39, 39, 39, 39, 39, |
|
519 | 529 | 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, 40, |
|
520 | 530 | 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, 41, |
|
521 | 531 | 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, |
|
522 | 532 | 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42, 42 }; |
|
523 | 533 | |
|
524 | 534 | |
|
525 | 535 | void ZSTD_seqToCodes(const seqStore_t* seqStorePtr) |
|
526 | 536 | { |
|
527 | 537 | BYTE const LL_deltaCode = 19; |
|
528 | 538 | BYTE const ML_deltaCode = 36; |
|
529 | 539 | const seqDef* const sequences = seqStorePtr->sequencesStart; |
|
530 | 540 | BYTE* const llCodeTable = seqStorePtr->llCode; |
|
531 | 541 | BYTE* const ofCodeTable = seqStorePtr->ofCode; |
|
532 | 542 | BYTE* const mlCodeTable = seqStorePtr->mlCode; |
|
533 | 543 | U32 const nbSeq = (U32)(seqStorePtr->sequences - seqStorePtr->sequencesStart); |
|
534 | 544 | U32 u; |
|
535 | 545 | for (u=0; u<nbSeq; u++) { |
|
536 | 546 | U32 const llv = sequences[u].litLength; |
|
537 | 547 | U32 const mlv = sequences[u].matchLength; |
|
538 | 548 | llCodeTable[u] = (llv> 63) ? (BYTE)ZSTD_highbit32(llv) + LL_deltaCode : LL_Code[llv]; |
|
539 | 549 | ofCodeTable[u] = (BYTE)ZSTD_highbit32(sequences[u].offset); |
|
540 | 550 | mlCodeTable[u] = (mlv>127) ? (BYTE)ZSTD_highbit32(mlv) + ML_deltaCode : ML_Code[mlv]; |
|
541 | 551 | } |
|
542 | 552 | if (seqStorePtr->longLengthID==1) |
|
543 | 553 | llCodeTable[seqStorePtr->longLengthPos] = MaxLL; |
|
544 | 554 | if (seqStorePtr->longLengthID==2) |
|
545 | 555 | mlCodeTable[seqStorePtr->longLengthPos] = MaxML; |
|
546 | 556 | } |
|
547 | 557 | |
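
Literal lengths up to 63 are mapped through LL_Code; anything larger uses highbit(llv) + LL_deltaCode, so the code is effectively a log2 bucket. A standalone sketch reproducing the mapping (table copied from above):

    #include <stdio.h>

    static const unsigned char LL_Code[64] = {
         0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15,
        16, 16, 17, 17, 18, 18, 19, 19, 20, 20, 20, 20, 21, 21, 21, 21,
        22, 22, 22, 22, 22, 22, 22, 22, 23, 23, 23, 23, 23, 23, 23, 23,
        24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24, 24 };

    static unsigned highbit32(unsigned v) { unsigned r=0; while (v >>= 1) r++; return r; }

    int main(void)
    {
        unsigned const samples[6] = { 3, 17, 63, 64, 1000, 70000 };
        size_t i;
        for (i = 0; i < 6; i++) {
            unsigned const llv = samples[i];
            unsigned const code = (llv > 63) ? highbit32(llv) + 19 /* LL_deltaCode */
                                             : LL_Code[llv];
            printf("litLength %6u -> code %2u\n", llv, code);
        }
        return 0;
    }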
|
548 | 558 | |
|
549 | 559 | size_t ZSTD_compressSequences(ZSTD_CCtx* zc, |
|
550 | 560 | void* dst, size_t dstCapacity, |
|
551 | 561 | size_t srcSize) |
|
552 | 562 | { |
|
553 | 563 | const seqStore_t* seqStorePtr = &(zc->seqStore); |
|
554 | 564 | U32 count[MaxSeq+1]; |
|
555 | 565 | S16 norm[MaxSeq+1]; |
|
556 | 566 | FSE_CTable* CTable_LitLength = zc->litlengthCTable; |
|
557 | 567 | FSE_CTable* CTable_OffsetBits = zc->offcodeCTable; |
|
558 | 568 | FSE_CTable* CTable_MatchLength = zc->matchlengthCTable; |
|
559 | 569 | U32 LLtype, Offtype, MLtype; /* compressed, raw or rle */ |
|
560 | 570 | const seqDef* const sequences = seqStorePtr->sequencesStart; |
|
561 | 571 | const BYTE* const ofCodeTable = seqStorePtr->ofCode; |
|
562 | 572 | const BYTE* const llCodeTable = seqStorePtr->llCode; |
|
563 | 573 | const BYTE* const mlCodeTable = seqStorePtr->mlCode; |
|
564 | 574 | BYTE* const ostart = (BYTE*)dst; |
|
565 | 575 | BYTE* const oend = ostart + dstCapacity; |
|
566 | 576 | BYTE* op = ostart; |
|
567 | 577 | size_t const nbSeq = seqStorePtr->sequences - seqStorePtr->sequencesStart; |
|
568 | 578 | BYTE* seqHead; |
|
579 | BYTE scratchBuffer[1<<MAX(MLFSELog,LLFSELog)]; | |
|
569 | 580 | |
|
570 | 581 | /* Compress literals */ |
|
571 | 582 | { const BYTE* const literals = seqStorePtr->litStart; |
|
572 | 583 | size_t const litSize = seqStorePtr->lit - literals; |
|
573 | 584 | size_t const cSize = ZSTD_compressLiterals(zc, op, dstCapacity, literals, litSize); |
|
574 | 585 | if (ZSTD_isError(cSize)) return cSize; |
|
575 | 586 | op += cSize; |
|
576 | 587 | } |
|
577 | 588 | |
|
578 | 589 | /* Sequences Header */ |
|
579 | 590 | if ((oend-op) < 3 /*max nbSeq Size*/ + 1 /*seqHead */) return ERROR(dstSize_tooSmall); |
|
580 | 591 | if (nbSeq < 0x7F) *op++ = (BYTE)nbSeq; |
|
581 | 592 | else if (nbSeq < LONGNBSEQ) op[0] = (BYTE)((nbSeq>>8) + 0x80), op[1] = (BYTE)nbSeq, op+=2; |
|
582 | 593 | else op[0]=0xFF, MEM_writeLE16(op+1, (U16)(nbSeq - LONGNBSEQ)), op+=3; |
|
583 | 594 | if (nbSeq==0) goto _check_compressibility; |
|
584 | 595 | |
|
585 | 596 | /* seqHead : flags for FSE encoding type */ |
|
586 | 597 | seqHead = op++; |
|
587 | 598 | |
|
588 | 599 | #define MIN_SEQ_FOR_DYNAMIC_FSE 64 |
|
589 | 600 | #define MAX_SEQ_FOR_STATIC_FSE 1000 |
|
590 | 601 | |
|
591 | 602 | /* convert length/distances into codes */ |
|
592 | 603 | ZSTD_seqToCodes(seqStorePtr); |
|
593 | 604 | |
|
594 | 605 | /* CTable for Literal Lengths */ |
|
595 | 606 | { U32 max = MaxLL; |
|
596 | size_t const mostFrequent = FSE_countFast(count, &max, llCodeTable, nbSeq); | |
|
607 | size_t const mostFrequent = FSE_countFast_wksp(count, &max, llCodeTable, nbSeq, zc->tmpCounters); | |
|
597 | 608 | if ((mostFrequent == nbSeq) && (nbSeq > 2)) { |
|
598 | 609 | *op++ = llCodeTable[0]; |
|
599 | 610 | FSE_buildCTable_rle(CTable_LitLength, (BYTE)max); |
|
600 | 611 | LLtype = set_rle; |
|
601 | 612 | } else if ((zc->flagStaticTables) && (nbSeq < MAX_SEQ_FOR_STATIC_FSE)) { |
|
602 | 613 | LLtype = set_repeat; |
|
603 | 614 | } else if ((nbSeq < MIN_SEQ_FOR_DYNAMIC_FSE) || (mostFrequent < (nbSeq >> (LL_defaultNormLog-1)))) { |
|
604 | FSE_buildCTable(CTable_LitLength, LL_defaultNorm, MaxLL, LL_defaultNormLog); | |
|
615 | FSE_buildCTable_wksp(CTable_LitLength, LL_defaultNorm, MaxLL, LL_defaultNormLog, scratchBuffer, sizeof(scratchBuffer)); | |
|
605 | 616 | LLtype = set_basic; |
|
606 | 617 | } else { |
|
607 | 618 | size_t nbSeq_1 = nbSeq; |
|
608 | 619 | const U32 tableLog = FSE_optimalTableLog(LLFSELog, nbSeq, max); |
|
609 | 620 | if (count[llCodeTable[nbSeq-1]]>1) { count[llCodeTable[nbSeq-1]]--; nbSeq_1--; } |
|
610 | 621 | FSE_normalizeCount(norm, tableLog, count, nbSeq_1, max); |
|
611 | 622 | { size_t const NCountSize = FSE_writeNCount(op, oend-op, norm, max, tableLog); /* overflow protected */ |
|
612 | 623 | if (FSE_isError(NCountSize)) return ERROR(GENERIC); |
|
613 | 624 | op += NCountSize; } |
|
614 | FSE_buildCTable(CTable_LitLength, norm, max, tableLog); | |
|
625 | FSE_buildCTable_wksp(CTable_LitLength, norm, max, tableLog, scratchBuffer, sizeof(scratchBuffer)); | |
|
615 | 626 | LLtype = set_compressed; |
|
616 | 627 | } } |
|
617 | 628 | |
|
618 | 629 | /* CTable for Offsets */ |
|
619 | 630 | { U32 max = MaxOff; |
|
620 | size_t const mostFrequent = FSE_countFast(count, &max, ofCodeTable, nbSeq); | |
|
631 | size_t const mostFrequent = FSE_countFast_wksp(count, &max, ofCodeTable, nbSeq, zc->tmpCounters); | |
|
621 | 632 | if ((mostFrequent == nbSeq) && (nbSeq > 2)) { |
|
622 | 633 | *op++ = ofCodeTable[0]; |
|
623 | 634 | FSE_buildCTable_rle(CTable_OffsetBits, (BYTE)max); |
|
624 | 635 | Offtype = set_rle; |
|
625 | 636 | } else if ((zc->flagStaticTables) && (nbSeq < MAX_SEQ_FOR_STATIC_FSE)) { |
|
626 | 637 | Offtype = set_repeat; |
|
627 | 638 | } else if ((nbSeq < MIN_SEQ_FOR_DYNAMIC_FSE) || (mostFrequent < (nbSeq >> (OF_defaultNormLog-1)))) { |
|
628 | FSE_buildCTable(CTable_OffsetBits, OF_defaultNorm, MaxOff, OF_defaultNormLog); | |
|
639 | FSE_buildCTable_wksp(CTable_OffsetBits, OF_defaultNorm, MaxOff, OF_defaultNormLog, scratchBuffer, sizeof(scratchBuffer)); | |
|
629 | 640 | Offtype = set_basic; |
|
630 | 641 | } else { |
|
631 | 642 | size_t nbSeq_1 = nbSeq; |
|
632 | 643 | const U32 tableLog = FSE_optimalTableLog(OffFSELog, nbSeq, max); |
|
633 | 644 | if (count[ofCodeTable[nbSeq-1]]>1) { count[ofCodeTable[nbSeq-1]]--; nbSeq_1--; } |
|
634 | 645 | FSE_normalizeCount(norm, tableLog, count, nbSeq_1, max); |
|
635 | 646 | { size_t const NCountSize = FSE_writeNCount(op, oend-op, norm, max, tableLog); /* overflow protected */ |
|
636 | 647 | if (FSE_isError(NCountSize)) return ERROR(GENERIC); |
|
637 | 648 | op += NCountSize; } |
|
638 | FSE_buildCTable(CTable_OffsetBits, norm, max, tableLog); | |
|
649 | FSE_buildCTable_wksp(CTable_OffsetBits, norm, max, tableLog, scratchBuffer, sizeof(scratchBuffer)); | |
|
639 | 650 | Offtype = set_compressed; |
|
640 | 651 | } } |
|
641 | 652 | |
|
642 | 653 | /* CTable for MatchLengths */ |
|
643 | 654 | { U32 max = MaxML; |
|
644 | size_t const mostFrequent = FSE_countFast(count, &max, mlCodeTable, nbSeq); | |
|
655 | size_t const mostFrequent = FSE_countFast_wksp(count, &max, mlCodeTable, nbSeq, zc->tmpCounters); | |
|
645 | 656 | if ((mostFrequent == nbSeq) && (nbSeq > 2)) { |
|
646 | 657 | *op++ = *mlCodeTable; |
|
647 | 658 | FSE_buildCTable_rle(CTable_MatchLength, (BYTE)max); |
|
648 | 659 | MLtype = set_rle; |
|
649 | 660 | } else if ((zc->flagStaticTables) && (nbSeq < MAX_SEQ_FOR_STATIC_FSE)) { |
|
650 | 661 | MLtype = set_repeat; |
|
651 | 662 | } else if ((nbSeq < MIN_SEQ_FOR_DYNAMIC_FSE) || (mostFrequent < (nbSeq >> (ML_defaultNormLog-1)))) { |
|
652 | FSE_buildCTable(CTable_MatchLength, ML_defaultNorm, MaxML, ML_defaultNormLog); | |
|
663 | FSE_buildCTable_wksp(CTable_MatchLength, ML_defaultNorm, MaxML, ML_defaultNormLog, scratchBuffer, sizeof(scratchBuffer)); | |
|
653 | 664 | MLtype = set_basic; |
|
654 | 665 | } else { |
|
655 | 666 | size_t nbSeq_1 = nbSeq; |
|
656 | 667 | const U32 tableLog = FSE_optimalTableLog(MLFSELog, nbSeq, max); |
|
657 | 668 | if (count[mlCodeTable[nbSeq-1]]>1) { count[mlCodeTable[nbSeq-1]]--; nbSeq_1--; } |
|
658 | 669 | FSE_normalizeCount(norm, tableLog, count, nbSeq_1, max); |
|
659 | 670 | { size_t const NCountSize = FSE_writeNCount(op, oend-op, norm, max, tableLog); /* overflow protected */ |
|
660 | 671 | if (FSE_isError(NCountSize)) return ERROR(GENERIC); |
|
661 | 672 | op += NCountSize; } |
|
662 | FSE_buildCTable(CTable_MatchLength, norm, max, tableLog); | |
|
673 | FSE_buildCTable_wksp(CTable_MatchLength, norm, max, tableLog, scratchBuffer, sizeof(scratchBuffer)); | |
|
663 | 674 | MLtype = set_compressed; |
|
664 | 675 | } } |
|
665 | 676 | |
|
666 | 677 | *seqHead = (BYTE)((LLtype<<6) + (Offtype<<4) + (MLtype<<2)); |
|
667 | 678 | zc->flagStaticTables = 0; |
|
668 | 679 | |
|
669 | 680 | /* Encoding Sequences */ |
|
670 | 681 | { BIT_CStream_t blockStream; |
|
671 | 682 | FSE_CState_t stateMatchLength; |
|
672 | 683 | FSE_CState_t stateOffsetBits; |
|
673 | 684 | FSE_CState_t stateLitLength; |
|
674 | 685 | |
|
675 | 686 | CHECK_E(BIT_initCStream(&blockStream, op, oend-op), dstSize_tooSmall); /* not enough space remaining */ |
|
676 | 687 | |
|
677 | 688 | /* first symbols */ |
|
678 | 689 | FSE_initCState2(&stateMatchLength, CTable_MatchLength, mlCodeTable[nbSeq-1]); |
|
679 | 690 | FSE_initCState2(&stateOffsetBits, CTable_OffsetBits, ofCodeTable[nbSeq-1]); |
|
680 | 691 | FSE_initCState2(&stateLitLength, CTable_LitLength, llCodeTable[nbSeq-1]); |
|
681 | 692 | BIT_addBits(&blockStream, sequences[nbSeq-1].litLength, LL_bits[llCodeTable[nbSeq-1]]); |
|
682 | 693 | if (MEM_32bits()) BIT_flushBits(&blockStream); |
|
683 | 694 | BIT_addBits(&blockStream, sequences[nbSeq-1].matchLength, ML_bits[mlCodeTable[nbSeq-1]]); |
|
684 | 695 | if (MEM_32bits()) BIT_flushBits(&blockStream); |
|
685 | 696 | BIT_addBits(&blockStream, sequences[nbSeq-1].offset, ofCodeTable[nbSeq-1]); |
|
686 | 697 | BIT_flushBits(&blockStream); |
|
687 | 698 | |
|
688 | 699 | { size_t n; |
|
689 | 700 | for (n=nbSeq-2 ; n<nbSeq ; n--) { /* intentional underflow */ |
|
690 | 701 | BYTE const llCode = llCodeTable[n]; |
|
691 | 702 | BYTE const ofCode = ofCodeTable[n]; |
|
692 | 703 | BYTE const mlCode = mlCodeTable[n]; |
|
693 | 704 | U32 const llBits = LL_bits[llCode]; |
|
694 | 705 | U32 const ofBits = ofCode; /* 32b*/ /* 64b*/ |
|
695 | 706 | U32 const mlBits = ML_bits[mlCode]; |
|
696 | 707 | /* (7)*/ /* (7)*/ |
|
697 | 708 | FSE_encodeSymbol(&blockStream, &stateOffsetBits, ofCode); /* 15 */ /* 15 */ |
|
698 | 709 | FSE_encodeSymbol(&blockStream, &stateMatchLength, mlCode); /* 24 */ /* 24 */ |
|
699 | 710 | if (MEM_32bits()) BIT_flushBits(&blockStream); /* (7)*/ |
|
700 | 711 | FSE_encodeSymbol(&blockStream, &stateLitLength, llCode); /* 16 */ /* 33 */ |
|
701 | 712 | if (MEM_32bits() || (ofBits+mlBits+llBits >= 64-7-(LLFSELog+MLFSELog+OffFSELog))) |
|
702 | 713 | BIT_flushBits(&blockStream); /* (7)*/ |
|
703 | 714 | BIT_addBits(&blockStream, sequences[n].litLength, llBits); |
|
704 | 715 | if (MEM_32bits() && ((llBits+mlBits)>24)) BIT_flushBits(&blockStream); |
|
705 | 716 | BIT_addBits(&blockStream, sequences[n].matchLength, mlBits); |
|
706 | 717 | if (MEM_32bits()) BIT_flushBits(&blockStream); /* (7)*/ |
|
707 | 718 | BIT_addBits(&blockStream, sequences[n].offset, ofBits); /* 31 */ |
|
708 | 719 | BIT_flushBits(&blockStream); /* (7)*/ |
|
709 | 720 | } } |
|
710 | 721 | |
|
711 | 722 | FSE_flushCState(&blockStream, &stateMatchLength); |
|
712 | 723 | FSE_flushCState(&blockStream, &stateOffsetBits); |
|
713 | 724 | FSE_flushCState(&blockStream, &stateLitLength); |
|
714 | 725 | |
|
715 | 726 | { size_t const streamSize = BIT_closeCStream(&blockStream); |
|
716 | 727 | if (streamSize==0) return ERROR(dstSize_tooSmall); /* not enough space */ |
|
717 | 728 | op += streamSize; |
|
718 | 729 | } } |
|
719 | 730 | |
|
720 | 731 | /* check compressibility */ |
|
721 | 732 | _check_compressibility: |
|
722 | 733 | { size_t const minGain = ZSTD_minGain(srcSize); |
|
723 | 734 | size_t const maxCSize = srcSize - minGain; |
|
724 | 735 | if ((size_t)(op-ostart) >= maxCSize) return 0; } |
|
725 | 736 | |
|
726 | 737 | /* confirm repcodes */ |
|
727 | 738 | { int i; for (i=0; i<ZSTD_REP_NUM; i++) zc->rep[i] = zc->savedRep[i]; } |
|
728 | 739 | |
|
729 | 740 | return op - ostart; |
|
730 | 741 | } |
|
731 | 742 | |
|
732 | 743 | |
|
733 | 744 | /*! ZSTD_storeSeq() : |
|
734 | 745 | Store a sequence (literal length, literals, offset code and match length code) into seqStore_t. |
|
735 | 746 | `offsetCode` : distance to match, or 0 == repCode. |
|
736 | 747 | `matchCode` : matchLength - MINMATCH |
|
737 | 748 | */ |
|
738 | 749 | MEM_STATIC void ZSTD_storeSeq(seqStore_t* seqStorePtr, size_t litLength, const void* literals, U32 offsetCode, size_t matchCode) |
|
739 | 750 | { |
|
740 | 751 | #if 0 /* for debug */ |
|
741 | 752 | static const BYTE* g_start = NULL; |
|
742 | const U32 pos = (U32)(literals - g_start); | |
|
743 | if (g_start==NULL) g_start = literals; | |
|
753 | const U32 pos = (U32)((const BYTE*)literals - g_start); | |
|
754 | if (g_start==NULL) g_start = (const BYTE*)literals; | |
|
744 | 755 | //if ((pos > 1) && (pos < 50000)) |
|
745 | 756 | printf("Cpos %6u :%5u literals & match %3u bytes at distance %6u \n", |
|
746 | 757 | pos, (U32)litLength, (U32)matchCode+MINMATCH, (U32)offsetCode); |
|
747 | 758 | #endif |
|
748 | 759 | /* copy Literals */ |
|
749 | 760 | ZSTD_wildcopy(seqStorePtr->lit, literals, litLength); |
|
750 | 761 | seqStorePtr->lit += litLength; |
|
751 | 762 | |
|
752 | 763 | /* literal Length */ |
|
753 | 764 | if (litLength>0xFFFF) { seqStorePtr->longLengthID = 1; seqStorePtr->longLengthPos = (U32)(seqStorePtr->sequences - seqStorePtr->sequencesStart); } |
|
754 | 765 | seqStorePtr->sequences[0].litLength = (U16)litLength; |
|
755 | 766 | |
|
756 | 767 | /* match offset */ |
|
757 | 768 | seqStorePtr->sequences[0].offset = offsetCode + 1; |
|
758 | 769 | |
|
759 | 770 | /* match Length */ |
|
760 | 771 | if (matchCode>0xFFFF) { seqStorePtr->longLengthID = 2; seqStorePtr->longLengthPos = (U32)(seqStorePtr->sequences - seqStorePtr->sequencesStart); } |
|
761 | 772 | seqStorePtr->sequences[0].matchLength = (U16)matchCode; |
|
762 | 773 | |
|
763 | 774 | seqStorePtr->sequences++; |
|
764 | 775 | } |
|
765 | 776 | |
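
seqDef stores both lengths in 16 bits; at most one oversized value per block escapes through the longLengthID/longLengthPos side channel and is later patched to MaxLL/MaxML in ZSTD_seqToCodes(). A standalone sketch of the truncate-and-flag idea; the seqDef field layout here is assumed from zstd_internal.h:

    #include <stdio.h>
    #include <stdint.h>

    typedef struct { uint32_t offset; uint16_t litLength; uint16_t matchLength; } seqDef;

    int main(void)
    {
        size_t const litLength = 0x12345;      /* 74565 : does not fit in 16 bits */
        unsigned longLengthID = 0, longLengthPos = 0;
        seqDef s;
        if (litLength > 0xFFFF) { longLengthID = 1; longLengthPos = 0; }  /* flag seq #0 */
        s.litLength = (uint16_t)litLength;     /* truncation is deliberate */
        s.offset = 0; s.matchLength = 0;
        printf("stored=%u flagged=%u at seq %u (true value recovered elsewhere)\n",
               s.litLength, longLengthID, longLengthPos);
        return 0;
    }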
|
766 | 777 | |
|
767 | 778 | /*-************************************* |
|
768 | 779 | * Match length counter |
|
769 | 780 | ***************************************/ |
|
770 | 781 | static unsigned ZSTD_NbCommonBytes (register size_t val) |
|
771 | 782 | { |
|
772 | 783 | if (MEM_isLittleEndian()) { |
|
773 | 784 | if (MEM_64bits()) { |
|
774 | 785 | # if defined(_MSC_VER) && defined(_WIN64) |
|
775 | 786 | unsigned long r = 0; |
|
776 | 787 | _BitScanForward64( &r, (U64)val ); |
|
777 | 788 | return (unsigned)(r>>3); |
|
778 | 789 | # elif defined(__GNUC__) && (__GNUC__ >= 3) |
|
779 | 790 | return (__builtin_ctzll((U64)val) >> 3); |
|
780 | 791 | # else |
|
781 | 792 | static const int DeBruijnBytePos[64] = { 0, 0, 0, 0, 0, 1, 1, 2, 0, 3, 1, 3, 1, 4, 2, 7, 0, 2, 3, 6, 1, 5, 3, 5, 1, 3, 4, 4, 2, 5, 6, 7, 7, 0, 1, 2, 3, 3, 4, 6, 2, 6, 5, 5, 3, 4, 5, 6, 7, 1, 2, 4, 6, 4, 4, 5, 7, 2, 6, 5, 7, 6, 7, 7 }; |
|
782 | 793 | return DeBruijnBytePos[((U64)((val & -(long long)val) * 0x0218A392CDABBD3FULL)) >> 58]; |
|
783 | 794 | # endif |
|
784 | 795 | } else { /* 32 bits */ |
|
785 | 796 | # if defined(_MSC_VER) |
|
786 | 797 | unsigned long r=0; |
|
787 | 798 | _BitScanForward( &r, (U32)val ); |
|
788 | 799 | return (unsigned)(r>>3); |
|
789 | 800 | # elif defined(__GNUC__) && (__GNUC__ >= 3) |
|
790 | 801 | return (__builtin_ctz((U32)val) >> 3); |
|
791 | 802 | # else |
|
792 | 803 | static const int DeBruijnBytePos[32] = { 0, 0, 3, 0, 3, 1, 3, 0, 3, 2, 2, 1, 3, 2, 0, 1, 3, 3, 1, 2, 2, 2, 2, 0, 3, 1, 2, 0, 1, 0, 1, 1 }; |
|
793 | 804 | return DeBruijnBytePos[((U32)((val & -(S32)val) * 0x077CB531U)) >> 27]; |
|
794 | 805 | # endif |
|
795 | 806 | } |
|
796 | 807 | } else { /* Big Endian CPU */ |
|
797 | 808 | if (MEM_64bits()) { |
|
798 | 809 | # if defined(_MSC_VER) && defined(_WIN64) |
|
799 | 810 | unsigned long r = 0; |
|
800 | 811 | _BitScanReverse64( &r, val ); |
|
801 | 812 | return (unsigned)(r>>3); |
|
802 | 813 | # elif defined(__GNUC__) && (__GNUC__ >= 3) |
|
803 | 814 | return (__builtin_clzll(val) >> 3); |
|
804 | 815 | # else |
|
805 | 816 | unsigned r; |
|
806 | 817 | const unsigned n32 = sizeof(size_t)*4; /* calculate this way due to compiler complaining in 32-bits mode */ |
|
807 | 818 | if (!(val>>n32)) { r=4; } else { r=0; val>>=n32; } |
|
808 | 819 | if (!(val>>16)) { r+=2; val>>=8; } else { val>>=24; } |
|
809 | 820 | r += (!val); |
|
810 | 821 | return r; |
|
811 | 822 | # endif |
|
812 | 823 | } else { /* 32 bits */ |
|
813 | 824 | # if defined(_MSC_VER) |
|
814 | 825 | unsigned long r = 0; |
|
815 | 826 | _BitScanReverse( &r, (unsigned long)val ); |
|
816 | 827 | return (unsigned)(r>>3); |
|
817 | 828 | # elif defined(__GNUC__) && (__GNUC__ >= 3) |
|
818 | 829 | return (__builtin_clz((U32)val) >> 3); |
|
819 | 830 | # else |
|
820 | 831 | unsigned r; |
|
821 | 832 | if (!(val>>16)) { r=2; val>>=8; } else { r=0; val>>=24; } |
|
822 | 833 | r += (!val); |
|
823 | 834 | return r; |
|
824 | 835 | # endif |
|
825 | 836 | } } |
|
826 | 837 | } |
|
827 | 838 | |
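
On the little-endian 64-bit path this boils down to ctz(diff) >> 3: the lowest differing byte of the XOR is the first mismatching byte. A standalone check, assuming GCC/Clang builtins:

    #include <stdio.h>
    #include <string.h>
    #include <stdint.h>

    int main(void)
    {
        const char a[8] = "abcdWXY";           /* 8th byte is the implicit '\0' */
        const char b[8] = "abcdQRS";
        uint64_t va, vb;
        memcpy(&va, a, 8);
        memcpy(&vb, b, 8);
        /* va ^ vb != 0 here, so ctzll is well-defined */
        printf("common bytes: %u\n", (unsigned)(__builtin_ctzll(va ^ vb) >> 3));
        return 0;                              /* prints 4 on a little-endian CPU */
    }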
|
828 | 839 | |
|
829 | 840 | static size_t ZSTD_count(const BYTE* pIn, const BYTE* pMatch, const BYTE* const pInLimit) |
|
830 | 841 | { |
|
831 | 842 | const BYTE* const pStart = pIn; |
|
832 | 843 | const BYTE* const pInLoopLimit = pInLimit - (sizeof(size_t)-1); |
|
833 | 844 | |
|
834 | 845 | while (pIn < pInLoopLimit) { |
|
835 | 846 | size_t const diff = MEM_readST(pMatch) ^ MEM_readST(pIn); |
|
836 | 847 | if (!diff) { pIn+=sizeof(size_t); pMatch+=sizeof(size_t); continue; } |
|
837 | 848 | pIn += ZSTD_NbCommonBytes(diff); |
|
838 | 849 | return (size_t)(pIn - pStart); |
|
839 | 850 | } |
|
840 | 851 | if (MEM_64bits()) if ((pIn<(pInLimit-3)) && (MEM_read32(pMatch) == MEM_read32(pIn))) { pIn+=4; pMatch+=4; } |
|
841 | 852 | if ((pIn<(pInLimit-1)) && (MEM_read16(pMatch) == MEM_read16(pIn))) { pIn+=2; pMatch+=2; } |
|
842 | 853 | if ((pIn<pInLimit) && (*pMatch == *pIn)) pIn++; |
|
843 | 854 | return (size_t)(pIn - pStart); |
|
844 | 855 | } |
|
845 | 856 | |
|
846 | 857 | /** ZSTD_count_2segments() : |
|
847 | 858 | * can count match length with `ip` & `match` in 2 different segments. |
|
848 | 859 | * convention : on reaching mEnd, the match count continues, restarting from iStart
|
849 | 860 | */ |
|
850 | 861 | static size_t ZSTD_count_2segments(const BYTE* ip, const BYTE* match, const BYTE* iEnd, const BYTE* mEnd, const BYTE* iStart) |
|
851 | 862 | { |
|
852 | 863 | const BYTE* const vEnd = MIN( ip + (mEnd - match), iEnd); |
|
853 | 864 | size_t const matchLength = ZSTD_count(ip, match, vEnd); |
|
854 | 865 | if (match + matchLength != mEnd) return matchLength; |
|
855 | 866 | return matchLength + ZSTD_count(ip+matchLength, iStart, iEnd); |
|
856 | 867 | } |
|
857 | 868 | |
|
858 | 869 | |
|
859 | 870 | /*-************************************* |
|
860 | 871 | * Hashes |
|
861 | 872 | ***************************************/ |
|
862 | 873 | static const U32 prime3bytes = 506832829U; |
|
863 | 874 | static U32 ZSTD_hash3(U32 u, U32 h) { return ((u << (32-24)) * prime3bytes) >> (32-h) ; } |
|
864 | 875 | MEM_STATIC size_t ZSTD_hash3Ptr(const void* ptr, U32 h) { return ZSTD_hash3(MEM_readLE32(ptr), h); } /* only in zstd_opt.h */ |
|
865 | 876 | |
|
866 | 877 | static const U32 prime4bytes = 2654435761U; |
|
867 | 878 | static U32 ZSTD_hash4(U32 u, U32 h) { return (u * prime4bytes) >> (32-h) ; } |
|
868 | 879 | static size_t ZSTD_hash4Ptr(const void* ptr, U32 h) { return ZSTD_hash4(MEM_read32(ptr), h); } |
|
869 | 880 | |
|
870 | 881 | static const U64 prime5bytes = 889523592379ULL; |
|
871 | 882 | static size_t ZSTD_hash5(U64 u, U32 h) { return (size_t)(((u << (64-40)) * prime5bytes) >> (64-h)) ; } |
|
872 | 883 | static size_t ZSTD_hash5Ptr(const void* p, U32 h) { return ZSTD_hash5(MEM_readLE64(p), h); } |
|
873 | 884 | |
|
874 | 885 | static const U64 prime6bytes = 227718039650203ULL; |
|
875 | 886 | static size_t ZSTD_hash6(U64 u, U32 h) { return (size_t)(((u << (64-48)) * prime6bytes) >> (64-h)) ; } |
|
876 | 887 | static size_t ZSTD_hash6Ptr(const void* p, U32 h) { return ZSTD_hash6(MEM_readLE64(p), h); } |
|
877 | 888 | |
|
878 | 889 | static const U64 prime7bytes = 58295818150454627ULL; |
|
879 | 890 | static size_t ZSTD_hash7(U64 u, U32 h) { return (size_t)(((u << (64-56)) * prime7bytes) >> (64-h)) ; } |
|
880 | 891 | static size_t ZSTD_hash7Ptr(const void* p, U32 h) { return ZSTD_hash7(MEM_readLE64(p), h); } |
|
881 | 892 | |
|
882 | 893 | static const U64 prime8bytes = 0xCF1BBCDCB7A56463ULL; |
|
883 | 894 | static size_t ZSTD_hash8(U64 u, U32 h) { return (size_t)(((u) * prime8bytes) >> (64-h)) ; } |
|
884 | 895 | static size_t ZSTD_hash8Ptr(const void* p, U32 h) { return ZSTD_hash8(MEM_readLE64(p), h); } |
|
885 | 896 | |
|
886 | 897 | static size_t ZSTD_hashPtr(const void* p, U32 hBits, U32 mls) |
|
887 | 898 | { |
|
888 | 899 | switch(mls) |
|
889 | 900 | { |
|
890 | 901 | default: |
|
891 | 902 | case 4: return ZSTD_hash4Ptr(p, hBits); |
|
892 | 903 | case 5: return ZSTD_hash5Ptr(p, hBits); |
|
893 | 904 | case 6: return ZSTD_hash6Ptr(p, hBits); |
|
894 | 905 | case 7: return ZSTD_hash7Ptr(p, hBits); |
|
895 | 906 | case 8: return ZSTD_hash8Ptr(p, hBits); |
|
896 | 907 | } |
|
897 | 908 | } |
|
898 | 909 | |
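
Each of these is a multiplicative (Fibonacci-style) hash: shift the sampled bytes to the top of the word, multiply by a large odd constant, keep the top h bits, so every input bit can influence the kept bits. A standalone sketch of the 4-byte variant:

    #include <stdio.h>
    #include <stdint.h>

    static uint32_t hash4(uint32_t u, uint32_t h)   /* mirrors ZSTD_hash4 above */
    {
        return (u * 2654435761U) >> (32 - h);       /* keep the top h bits */
    }

    int main(void)
    {
        uint32_t const h = 17;                      /* e.g. hashLog == 17 */
        /* adjacent inputs scatter to unrelated slots */
        printf("%u %u %u\n",
               hash4(0x61626364u, h), hash4(0x61626365u, h), hash4(0x61626366u, h));
        return 0;
    }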
|
899 | 910 | |
|
900 | 911 | /*-************************************* |
|
901 | 912 | * Fast Scan |
|
902 | 913 | ***************************************/ |
|
903 | 914 | static void ZSTD_fillHashTable (ZSTD_CCtx* zc, const void* end, const U32 mls) |
|
904 | 915 | { |
|
905 | 916 | U32* const hashTable = zc->hashTable; |
|
906 | 917 | U32 const hBits = zc->params.cParams.hashLog; |
|
907 | 918 | const BYTE* const base = zc->base; |
|
908 | 919 | const BYTE* ip = base + zc->nextToUpdate; |
|
909 | 920 | const BYTE* const iend = ((const BYTE*)end) - HASH_READ_SIZE; |
|
910 | 921 | const size_t fastHashFillStep = 3; |
|
911 | 922 | |
|
912 | 923 | while(ip <= iend) { |
|
913 | 924 | hashTable[ZSTD_hashPtr(ip, hBits, mls)] = (U32)(ip - base); |
|
914 | 925 | ip += fastHashFillStep; |
|
915 | 926 | } |
|
916 | 927 | } |
|
917 | 928 | |
|
918 | 929 | |
|
919 | 930 | FORCE_INLINE |
|
920 | 931 | void ZSTD_compressBlock_fast_generic(ZSTD_CCtx* cctx, |
|
921 | 932 | const void* src, size_t srcSize, |
|
922 | 933 | const U32 mls) |
|
923 | 934 | { |
|
924 | 935 | U32* const hashTable = cctx->hashTable; |
|
925 | 936 | U32 const hBits = cctx->params.cParams.hashLog; |
|
926 | 937 | seqStore_t* seqStorePtr = &(cctx->seqStore); |
|
927 | 938 | const BYTE* const base = cctx->base; |
|
928 | 939 | const BYTE* const istart = (const BYTE*)src; |
|
929 | 940 | const BYTE* ip = istart; |
|
930 | 941 | const BYTE* anchor = istart; |
|
931 | 942 | const U32 lowestIndex = cctx->dictLimit; |
|
932 | 943 | const BYTE* const lowest = base + lowestIndex; |
|
933 | 944 | const BYTE* const iend = istart + srcSize; |
|
934 | 945 | const BYTE* const ilimit = iend - HASH_READ_SIZE; |
|
935 | 946 | U32 offset_1=cctx->rep[0], offset_2=cctx->rep[1]; |
|
936 | 947 | U32 offsetSaved = 0; |
|
937 | 948 | |
|
938 | 949 | /* init */ |
|
939 | 950 | ip += (ip==lowest); |
|
940 | 951 | { U32 const maxRep = (U32)(ip-lowest); |
|
941 | 952 | if (offset_2 > maxRep) offsetSaved = offset_2, offset_2 = 0; |
|
942 | 953 | if (offset_1 > maxRep) offsetSaved = offset_1, offset_1 = 0; |
|
943 | 954 | } |
|
944 | 955 | |
|
945 | 956 | /* Main Search Loop */ |
|
946 | 957 | while (ip < ilimit) { /* < instead of <=, because repcode check at (ip+1) */ |
|
947 | 958 | size_t mLength; |
|
948 | 959 | size_t const h = ZSTD_hashPtr(ip, hBits, mls); |
|
949 | 960 | U32 const current = (U32)(ip-base); |
|
950 | 961 | U32 const matchIndex = hashTable[h]; |
|
951 | 962 | const BYTE* match = base + matchIndex; |
|
952 | 963 | hashTable[h] = current; /* update hash table */ |
|
953 | 964 | |
|
954 | 965 | if ((offset_1 > 0) & (MEM_read32(ip+1-offset_1) == MEM_read32(ip+1))) { |
|
955 | 966 | mLength = ZSTD_count(ip+1+4, ip+1+4-offset_1, iend) + 4; |
|
956 | 967 | ip++; |
|
957 | 968 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, 0, mLength-MINMATCH); |
|
958 | 969 | } else { |
|
959 | 970 | U32 offset; |
|
960 | 971 | if ( (matchIndex <= lowestIndex) || (MEM_read32(match) != MEM_read32(ip)) ) { |
|
961 | 972 | ip += ((ip-anchor) >> g_searchStrength) + 1; |
|
962 | 973 | continue; |
|
963 | 974 | } |
|
964 | 975 | mLength = ZSTD_count(ip+4, match+4, iend) + 4; |
|
965 | 976 | offset = (U32)(ip-match); |
|
966 | 977 | while (((ip>anchor) & (match>lowest)) && (ip[-1] == match[-1])) { ip--; match--; mLength++; } /* catch up */ |
|
967 | 978 | offset_2 = offset_1; |
|
968 | 979 | offset_1 = offset; |
|
969 | 980 | |
|
970 | 981 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); |
|
971 | 982 | } |
|
972 | 983 | |
|
973 | 984 | /* match found */ |
|
974 | 985 | ip += mLength; |
|
975 | 986 | anchor = ip; |
|
976 | 987 | |
|
977 | 988 | if (ip <= ilimit) { |
|
978 | 989 | /* Fill Table */ |
|
979 | 990 | hashTable[ZSTD_hashPtr(base+current+2, hBits, mls)] = current+2; /* here because current+2 could be > iend-8 */ |
|
980 | 991 | hashTable[ZSTD_hashPtr(ip-2, hBits, mls)] = (U32)(ip-2-base); |
|
981 | 992 | /* check immediate repcode */ |
|
982 | 993 | while ( (ip <= ilimit) |
|
983 | 994 | && ( (offset_2>0) |
|
984 | 995 | & (MEM_read32(ip) == MEM_read32(ip - offset_2)) )) { |
|
985 | 996 | /* store sequence */ |
|
986 | 997 | size_t const rLength = ZSTD_count(ip+4, ip+4-offset_2, iend) + 4; |
|
987 | 998 | { U32 const tmpOff = offset_2; offset_2 = offset_1; offset_1 = tmpOff; } /* swap offset_2 <=> offset_1 */ |
|
988 | 999 | hashTable[ZSTD_hashPtr(ip, hBits, mls)] = (U32)(ip-base); |
|
989 | 1000 | ZSTD_storeSeq(seqStorePtr, 0, anchor, 0, rLength-MINMATCH); |
|
990 | 1001 | ip += rLength; |
|
991 | 1002 | anchor = ip; |
|
992 | 1003 | continue; /* faster when present ... (?) */ |
|
993 | 1004 | } } } |
|
994 | 1005 | |
|
995 | 1006 | /* save reps for next block */ |
|
996 | 1007 | cctx->savedRep[0] = offset_1 ? offset_1 : offsetSaved; |
|
997 | 1008 | cctx->savedRep[1] = offset_2 ? offset_2 : offsetSaved; |
|
998 | 1009 | |
|
999 | 1010 | /* Last Literals */ |
|
1000 | 1011 | { size_t const lastLLSize = iend - anchor; |
|
1001 | 1012 | memcpy(seqStorePtr->lit, anchor, lastLLSize); |
|
1002 | 1013 | seqStorePtr->lit += lastLLSize; |
|
1003 | 1014 | } |
|
1004 | 1015 | } |
|
1005 | 1016 | |
|
1006 | 1017 | |
|
1007 | 1018 | static void ZSTD_compressBlock_fast(ZSTD_CCtx* ctx, |
|
1008 | 1019 | const void* src, size_t srcSize) |
|
1009 | 1020 | { |
|
1010 | 1021 | const U32 mls = ctx->params.cParams.searchLength; |
|
1011 | 1022 | switch(mls) |
|
1012 | 1023 | { |
|
1013 | 1024 | default: |
|
1014 | 1025 | case 4 : |
|
1015 | 1026 | ZSTD_compressBlock_fast_generic(ctx, src, srcSize, 4); return; |
|
1016 | 1027 | case 5 : |
|
1017 | 1028 | ZSTD_compressBlock_fast_generic(ctx, src, srcSize, 5); return; |
|
1018 | 1029 | case 6 : |
|
1019 | 1030 | ZSTD_compressBlock_fast_generic(ctx, src, srcSize, 6); return; |
|
1020 | 1031 | case 7 : |
|
1021 | 1032 | ZSTD_compressBlock_fast_generic(ctx, src, srcSize, 7); return; |
|
1022 | 1033 | } |
|
1023 | 1034 | } |
|
1024 | 1035 | |
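
The switch exists so the FORCE_INLINE generic body above is instantiated once per constant mls: within each instantiation the match-length parameter is a compile-time constant, keeping variable shifts and dispatch out of the hot loop. The same pattern in miniature:

    #include <stdio.h>

    /* one inline generic body ... */
    static inline unsigned slotFor(unsigned x, const unsigned mls)
    {
        return (x * 2654435761u) >> (32 - (mls + 12));  /* mls folds to a constant */
    }

    /* ... instantiated per supported value, selected once per block */
    static unsigned slot4(unsigned x) { return slotFor(x, 4); }
    static unsigned slot5(unsigned x) { return slotFor(x, 5); }

    int main(void)
    {
        printf("%u %u\n", slot4(0xDEADBEEFu), slot5(0xDEADBEEFu));
        return 0;
    }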
|
1025 | 1036 | |
|
1026 | 1037 | static void ZSTD_compressBlock_fast_extDict_generic(ZSTD_CCtx* ctx, |
|
1027 | 1038 | const void* src, size_t srcSize, |
|
1028 | 1039 | const U32 mls) |
|
1029 | 1040 | { |
|
1030 | 1041 | U32* hashTable = ctx->hashTable; |
|
1031 | 1042 | const U32 hBits = ctx->params.cParams.hashLog; |
|
1032 | 1043 | seqStore_t* seqStorePtr = &(ctx->seqStore); |
|
1033 | 1044 | const BYTE* const base = ctx->base; |
|
1034 | 1045 | const BYTE* const dictBase = ctx->dictBase; |
|
1035 | 1046 | const BYTE* const istart = (const BYTE*)src; |
|
1036 | 1047 | const BYTE* ip = istart; |
|
1037 | 1048 | const BYTE* anchor = istart; |
|
1038 | 1049 | const U32 lowestIndex = ctx->lowLimit; |
|
1039 | 1050 | const BYTE* const dictStart = dictBase + lowestIndex; |
|
1040 | 1051 | const U32 dictLimit = ctx->dictLimit; |
|
1041 | 1052 | const BYTE* const lowPrefixPtr = base + dictLimit; |
|
1042 | 1053 | const BYTE* const dictEnd = dictBase + dictLimit; |
|
1043 | 1054 | const BYTE* const iend = istart + srcSize; |
|
1044 | 1055 | const BYTE* const ilimit = iend - 8; |
|
1045 | 1056 | U32 offset_1=ctx->rep[0], offset_2=ctx->rep[1]; |
|
1046 | 1057 | |
|
1047 | 1058 | /* Search Loop */ |
|
1048 | 1059 | while (ip < ilimit) { /* < instead of <=, because (ip+1) */ |
|
1049 | 1060 | const size_t h = ZSTD_hashPtr(ip, hBits, mls); |
|
1050 | 1061 | const U32 matchIndex = hashTable[h]; |
|
1051 | 1062 | const BYTE* matchBase = matchIndex < dictLimit ? dictBase : base; |
|
1052 | 1063 | const BYTE* match = matchBase + matchIndex; |
|
1053 | 1064 | const U32 current = (U32)(ip-base); |
|
1054 | 1065 | const U32 repIndex = current + 1 - offset_1; /* offset_1 expected <= current +1 */ |
|
1055 | 1066 | const BYTE* repBase = repIndex < dictLimit ? dictBase : base; |
|
1056 | 1067 | const BYTE* repMatch = repBase + repIndex; |
|
1057 | 1068 | size_t mLength; |
|
1058 | 1069 | hashTable[h] = current; /* update hash table */ |
|
1059 | 1070 | |
|
1060 | 1071 | if ( (((U32)((dictLimit-1) - repIndex) >= 3) /* intentional underflow */ & (repIndex > lowestIndex)) |
|
1061 | 1072 | && (MEM_read32(repMatch) == MEM_read32(ip+1)) ) { |
|
1062 | 1073 | const BYTE* repMatchEnd = repIndex < dictLimit ? dictEnd : iend; |
|
1063 | 1074 | mLength = ZSTD_count_2segments(ip+1+EQUAL_READ32, repMatch+EQUAL_READ32, iend, repMatchEnd, lowPrefixPtr) + EQUAL_READ32; |
|
1064 | 1075 | ip++; |
|
1065 | 1076 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, 0, mLength-MINMATCH); |
|
1066 | 1077 | } else { |
|
1067 | 1078 | if ( (matchIndex < lowestIndex) || |
|
1068 | 1079 | (MEM_read32(match) != MEM_read32(ip)) ) { |
|
1069 | 1080 | ip += ((ip-anchor) >> g_searchStrength) + 1; |
|
1070 | 1081 | continue; |
|
1071 | 1082 | } |
|
1072 | 1083 | { const BYTE* matchEnd = matchIndex < dictLimit ? dictEnd : iend; |
|
1073 | 1084 | const BYTE* lowMatchPtr = matchIndex < dictLimit ? dictStart : lowPrefixPtr; |
|
1074 | 1085 | U32 offset; |
|
1075 | 1086 | mLength = ZSTD_count_2segments(ip+EQUAL_READ32, match+EQUAL_READ32, iend, matchEnd, lowPrefixPtr) + EQUAL_READ32; |
|
1076 | 1087 | while (((ip>anchor) & (match>lowMatchPtr)) && (ip[-1] == match[-1])) { ip--; match--; mLength++; } /* catch up */ |
|
1077 | 1088 | offset = current - matchIndex; |
|
1078 | 1089 | offset_2 = offset_1; |
|
1079 | 1090 | offset_1 = offset; |
|
1080 | 1091 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); |
|
1081 | 1092 | } } |
|
1082 | 1093 | |
|
1083 | 1094 | /* found a match : store it */ |
|
1084 | 1095 | ip += mLength; |
|
1085 | 1096 | anchor = ip; |
|
1086 | 1097 | |
|
1087 | 1098 | if (ip <= ilimit) { |
|
1088 | 1099 | /* Fill Table */ |
|
1089 | 1100 | hashTable[ZSTD_hashPtr(base+current+2, hBits, mls)] = current+2; |
|
1090 | 1101 | hashTable[ZSTD_hashPtr(ip-2, hBits, mls)] = (U32)(ip-2-base); |
|
1091 | 1102 | /* check immediate repcode */ |
|
1092 | 1103 | while (ip <= ilimit) { |
|
1093 | 1104 | U32 const current2 = (U32)(ip-base); |
|
1094 | 1105 | U32 const repIndex2 = current2 - offset_2; |
|
1095 | 1106 | const BYTE* repMatch2 = repIndex2 < dictLimit ? dictBase + repIndex2 : base + repIndex2; |
|
1096 | 1107 | if ( (((U32)((dictLimit-1) - repIndex2) >= 3) & (repIndex2 > lowestIndex)) /* intentional overflow */ |
|
1097 | 1108 | && (MEM_read32(repMatch2) == MEM_read32(ip)) ) { |
|
1098 | 1109 | const BYTE* const repEnd2 = repIndex2 < dictLimit ? dictEnd : iend; |
|
1099 | 1110 | size_t repLength2 = ZSTD_count_2segments(ip+EQUAL_READ32, repMatch2+EQUAL_READ32, iend, repEnd2, lowPrefixPtr) + EQUAL_READ32; |
|
1100 | 1111 | U32 tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; /* swap offset_2 <=> offset_1 */ |
|
1101 | 1112 | ZSTD_storeSeq(seqStorePtr, 0, anchor, 0, repLength2-MINMATCH); |
|
1102 | 1113 | hashTable[ZSTD_hashPtr(ip, hBits, mls)] = current2; |
|
1103 | 1114 | ip += repLength2; |
|
1104 | 1115 | anchor = ip; |
|
1105 | 1116 | continue; |
|
1106 | 1117 | } |
|
1107 | 1118 | break; |
|
1108 | 1119 | } } } |
|
1109 | 1120 | |
|
1110 | 1121 | /* save reps for next block */ |
|
1111 | 1122 | ctx->savedRep[0] = offset_1; ctx->savedRep[1] = offset_2; |
|
1112 | 1123 | |
|
1113 | 1124 | /* Last Literals */ |
|
1114 | 1125 | { size_t const lastLLSize = iend - anchor; |
|
1115 | 1126 | memcpy(seqStorePtr->lit, anchor, lastLLSize); |
|
1116 | 1127 | seqStorePtr->lit += lastLLSize; |
|
1117 | 1128 | } |
|
1118 | 1129 | } |
|
1119 | 1130 | |
|
1120 | 1131 | |
|
1121 | 1132 | static void ZSTD_compressBlock_fast_extDict(ZSTD_CCtx* ctx, |
|
1122 | 1133 | const void* src, size_t srcSize) |
|
1123 | 1134 | { |
|
1124 | 1135 | U32 const mls = ctx->params.cParams.searchLength; |
|
1125 | 1136 | switch(mls) |
|
1126 | 1137 | { |
|
1127 | 1138 | default: |
|
1128 | 1139 | case 4 : |
|
1129 | 1140 | ZSTD_compressBlock_fast_extDict_generic(ctx, src, srcSize, 4); return; |
|
1130 | 1141 | case 5 : |
|
1131 | 1142 | ZSTD_compressBlock_fast_extDict_generic(ctx, src, srcSize, 5); return; |
|
1132 | 1143 | case 6 : |
|
1133 | 1144 | ZSTD_compressBlock_fast_extDict_generic(ctx, src, srcSize, 6); return; |
|
1134 | 1145 | case 7 : |
|
1135 | 1146 | ZSTD_compressBlock_fast_extDict_generic(ctx, src, srcSize, 7); return; |
|
1136 | 1147 | } |
|
1137 | 1148 | } |
|
1138 | 1149 | |
|
1139 | 1150 | |
|
1140 | 1151 | /*-************************************* |
|
1141 | 1152 | * Double Fast |
|
1142 | 1153 | ***************************************/ |
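/* note (annotation) : "double fast" keeps two hash tables over the same window :
 * hashLarge indexes 8-byte sequences (hBitsL bits wide) and hashSmall indexes
 * mls-byte sequences (hBitsS bits wide). The long table is probed first, since an
 * 8-byte anchor tends to yield a longer match for the same lookup cost; the short
 * table is the fallback, with one extra long-table probe attempted at ip+1. */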
|
1143 | 1154 | static void ZSTD_fillDoubleHashTable (ZSTD_CCtx* cctx, const void* end, const U32 mls) |
|
1144 | 1155 | { |
|
1145 | 1156 | U32* const hashLarge = cctx->hashTable; |
|
1146 | 1157 | U32 const hBitsL = cctx->params.cParams.hashLog; |
|
1147 | 1158 | U32* const hashSmall = cctx->chainTable; |
|
1148 | 1159 | U32 const hBitsS = cctx->params.cParams.chainLog; |
|
1149 | 1160 | const BYTE* const base = cctx->base; |
|
1150 | 1161 | const BYTE* ip = base + cctx->nextToUpdate; |
|
1151 | 1162 | const BYTE* const iend = ((const BYTE*)end) - HASH_READ_SIZE; |
|
1152 | 1163 | const size_t fastHashFillStep = 3; |
|
1153 | 1164 | |
|
1154 | 1165 | while(ip <= iend) { |
|
1155 | 1166 | hashSmall[ZSTD_hashPtr(ip, hBitsS, mls)] = (U32)(ip - base); |
|
1156 | 1167 | hashLarge[ZSTD_hashPtr(ip, hBitsL, 8)] = (U32)(ip - base); |
|
1157 | 1168 | ip += fastHashFillStep; |
|
1158 | 1169 | } |
|
1159 | 1170 | } |
|
1160 | 1171 | |
|
1161 | 1172 | |
|
1162 | 1173 | FORCE_INLINE |
|
1163 | 1174 | void ZSTD_compressBlock_doubleFast_generic(ZSTD_CCtx* cctx, |
|
1164 | 1175 | const void* src, size_t srcSize, |
|
1165 | 1176 | const U32 mls) |
|
1166 | 1177 | { |
|
1167 | 1178 | U32* const hashLong = cctx->hashTable; |
|
1168 | 1179 | const U32 hBitsL = cctx->params.cParams.hashLog; |
|
1169 | 1180 | U32* const hashSmall = cctx->chainTable; |
|
1170 | 1181 | const U32 hBitsS = cctx->params.cParams.chainLog; |
|
1171 | 1182 | seqStore_t* seqStorePtr = &(cctx->seqStore); |
|
1172 | 1183 | const BYTE* const base = cctx->base; |
|
1173 | 1184 | const BYTE* const istart = (const BYTE*)src; |
|
1174 | 1185 | const BYTE* ip = istart; |
|
1175 | 1186 | const BYTE* anchor = istart; |
|
1176 | 1187 | const U32 lowestIndex = cctx->dictLimit; |
|
1177 | 1188 | const BYTE* const lowest = base + lowestIndex; |
|
1178 | 1189 | const BYTE* const iend = istart + srcSize; |
|
1179 | 1190 | const BYTE* const ilimit = iend - HASH_READ_SIZE; |
|
1180 | 1191 | U32 offset_1=cctx->rep[0], offset_2=cctx->rep[1]; |
|
1181 | 1192 | U32 offsetSaved = 0; |
|
1182 | 1193 | |
|
1183 | 1194 | /* init */ |
|
1184 | 1195 | ip += (ip==lowest); |
|
1185 | 1196 | { U32 const maxRep = (U32)(ip-lowest); |
|
1186 | 1197 | if (offset_2 > maxRep) offsetSaved = offset_2, offset_2 = 0; |
|
1187 | 1198 | if (offset_1 > maxRep) offsetSaved = offset_1, offset_1 = 0; |
|
1188 | 1199 | } |
|
1189 | 1200 | |
|
1190 | 1201 | /* Main Search Loop */ |
|
1191 | 1202 | while (ip < ilimit) { /* < instead of <=, because repcode check at (ip+1) */ |
|
1192 | 1203 | size_t mLength; |
|
1193 | 1204 | size_t const h2 = ZSTD_hashPtr(ip, hBitsL, 8); |
|
1194 | 1205 | size_t const h = ZSTD_hashPtr(ip, hBitsS, mls); |
|
1195 | 1206 | U32 const current = (U32)(ip-base); |
|
1196 | 1207 | U32 const matchIndexL = hashLong[h2]; |
|
1197 | 1208 | U32 const matchIndexS = hashSmall[h]; |
|
1198 | 1209 | const BYTE* matchLong = base + matchIndexL; |
|
1199 | 1210 | const BYTE* match = base + matchIndexS; |
|
1200 | 1211 | hashLong[h2] = hashSmall[h] = current; /* update hash tables */ |
|
1201 | 1212 | |
|
1202 | 1213 | if ((offset_1 > 0) & (MEM_read32(ip+1-offset_1) == MEM_read32(ip+1))) { /* note : by construction, offset_1 <= current */ |
|
1203 | 1214 | mLength = ZSTD_count(ip+1+4, ip+1+4-offset_1, iend) + 4; |
|
1204 | 1215 | ip++; |
|
1205 | 1216 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, 0, mLength-MINMATCH); |
|
1206 | 1217 | } else { |
|
1207 | 1218 | U32 offset; |
|
1208 | 1219 | if ( (matchIndexL > lowestIndex) && (MEM_read64(matchLong) == MEM_read64(ip)) ) { |
|
1209 | 1220 | mLength = ZSTD_count(ip+8, matchLong+8, iend) + 8; |
|
1210 | 1221 | offset = (U32)(ip-matchLong); |
|
1211 | 1222 | while (((ip>anchor) & (matchLong>lowest)) && (ip[-1] == matchLong[-1])) { ip--; matchLong--; mLength++; } /* catch up */ |
|
1212 | 1223 | } else if ( (matchIndexS > lowestIndex) && (MEM_read32(match) == MEM_read32(ip)) ) { |
|
1213 | 1224 | size_t const h3 = ZSTD_hashPtr(ip+1, hBitsL, 8); |
|
1214 | 1225 | U32 const matchIndex3 = hashLong[h3]; |
|
1215 | 1226 | const BYTE* match3 = base + matchIndex3; |
|
1216 | 1227 | hashLong[h3] = current + 1; |
|
1217 | 1228 | if ( (matchIndex3 > lowestIndex) && (MEM_read64(match3) == MEM_read64(ip+1)) ) { |
|
1218 | 1229 | mLength = ZSTD_count(ip+9, match3+8, iend) + 8; |
|
1219 | 1230 | ip++; |
|
1220 | 1231 | offset = (U32)(ip-match3); |
|
1221 | 1232 | while (((ip>anchor) & (match3>lowest)) && (ip[-1] == match3[-1])) { ip--; match3--; mLength++; } /* catch up */ |
|
1222 | 1233 | } else { |
|
1223 | 1234 | mLength = ZSTD_count(ip+4, match+4, iend) + 4; |
|
1224 | 1235 | offset = (U32)(ip-match); |
|
1225 | 1236 | while (((ip>anchor) & (match>lowest)) && (ip[-1] == match[-1])) { ip--; match--; mLength++; } /* catch up */ |
|
1226 | 1237 | } |
|
1227 | 1238 | } else { |
|
1228 | 1239 | ip += ((ip-anchor) >> g_searchStrength) + 1; |
|
1229 | 1240 | continue; |
|
1230 | 1241 | } |
|
1231 | 1242 | |
|
1232 | 1243 | offset_2 = offset_1; |
|
1233 | 1244 | offset_1 = offset; |
|
1234 | 1245 | |
|
1235 | 1246 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); |
|
1236 | 1247 | } |
|
1237 | 1248 | |
|
1238 | 1249 | /* match found */ |
|
1239 | 1250 | ip += mLength; |
|
1240 | 1251 | anchor = ip; |
|
1241 | 1252 | |
|
1242 | 1253 | if (ip <= ilimit) { |
|
1243 | 1254 | /* Fill Table */ |
|
1244 | 1255 | hashLong[ZSTD_hashPtr(base+current+2, hBitsL, 8)] = |
|
1245 | 1256 | hashSmall[ZSTD_hashPtr(base+current+2, hBitsS, mls)] = current+2; /* here because current+2 could be > iend-8 */ |
|
1246 | 1257 | hashLong[ZSTD_hashPtr(ip-2, hBitsL, 8)] = |
|
1247 | 1258 | hashSmall[ZSTD_hashPtr(ip-2, hBitsS, mls)] = (U32)(ip-2-base); |
|
1248 | 1259 | |
|
1249 | 1260 | /* check immediate repcode */ |
|
1250 | 1261 | while ( (ip <= ilimit) |
|
1251 | 1262 | && ( (offset_2>0) |
|
1252 | 1263 | & (MEM_read32(ip) == MEM_read32(ip - offset_2)) )) { |
|
1253 | 1264 | /* store sequence */ |
|
1254 | 1265 | size_t const rLength = ZSTD_count(ip+4, ip+4-offset_2, iend) + 4; |
|
1255 | 1266 | { U32 const tmpOff = offset_2; offset_2 = offset_1; offset_1 = tmpOff; } /* swap offset_2 <=> offset_1 */ |
|
1256 | 1267 | hashSmall[ZSTD_hashPtr(ip, hBitsS, mls)] = (U32)(ip-base); |
|
1257 | 1268 | hashLong[ZSTD_hashPtr(ip, hBitsL, 8)] = (U32)(ip-base); |
|
1258 | 1269 | ZSTD_storeSeq(seqStorePtr, 0, anchor, 0, rLength-MINMATCH); |
|
1259 | 1270 | ip += rLength; |
|
1260 | 1271 | anchor = ip; |
|
1261 | 1272 | continue; /* faster when present ... (?) */ |
|
1262 | 1273 | } } } |
|
1263 | 1274 | |
|
1264 | 1275 | /* save reps for next block */ |
|
1265 | 1276 | cctx->savedRep[0] = offset_1 ? offset_1 : offsetSaved; |
|
1266 | 1277 | cctx->savedRep[1] = offset_2 ? offset_2 : offsetSaved; |
|
1267 | 1278 | |
|
1268 | 1279 | /* Last Literals */ |
|
1269 | 1280 | { size_t const lastLLSize = iend - anchor; |
|
1270 | 1281 | memcpy(seqStorePtr->lit, anchor, lastLLSize); |
|
1271 | 1282 | seqStorePtr->lit += lastLLSize; |
|
1272 | 1283 | } |
|
1273 | 1284 | } |
|
1274 | 1285 | |
|
1275 | 1286 | |
|
1276 | 1287 | static void ZSTD_compressBlock_doubleFast(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
1277 | 1288 | { |
|
1278 | 1289 | const U32 mls = ctx->params.cParams.searchLength; |
|
1279 | 1290 | switch(mls) |
|
1280 | 1291 | { |
|
1281 | 1292 | default: |
|
1282 | 1293 | case 4 : |
|
1283 | 1294 | ZSTD_compressBlock_doubleFast_generic(ctx, src, srcSize, 4); return; |
|
1284 | 1295 | case 5 : |
|
1285 | 1296 | ZSTD_compressBlock_doubleFast_generic(ctx, src, srcSize, 5); return; |
|
1286 | 1297 | case 6 : |
|
1287 | 1298 | ZSTD_compressBlock_doubleFast_generic(ctx, src, srcSize, 6); return; |
|
1288 | 1299 | case 7 : |
|
1289 | 1300 | ZSTD_compressBlock_doubleFast_generic(ctx, src, srcSize, 7); return; |
|
1290 | 1301 | } |
|
1291 | 1302 | } |
|
1292 | 1303 | |
|
1293 | 1304 | |
|
1294 | 1305 | static void ZSTD_compressBlock_doubleFast_extDict_generic(ZSTD_CCtx* ctx, |
|
1295 | 1306 | const void* src, size_t srcSize, |
|
1296 | 1307 | const U32 mls) |
|
1297 | 1308 | { |
|
1298 | 1309 | U32* const hashLong = ctx->hashTable; |
|
1299 | 1310 | U32 const hBitsL = ctx->params.cParams.hashLog; |
|
1300 | 1311 | U32* const hashSmall = ctx->chainTable; |
|
1301 | 1312 | U32 const hBitsS = ctx->params.cParams.chainLog; |
|
1302 | 1313 | seqStore_t* seqStorePtr = &(ctx->seqStore); |
|
1303 | 1314 | const BYTE* const base = ctx->base; |
|
1304 | 1315 | const BYTE* const dictBase = ctx->dictBase; |
|
1305 | 1316 | const BYTE* const istart = (const BYTE*)src; |
|
1306 | 1317 | const BYTE* ip = istart; |
|
1307 | 1318 | const BYTE* anchor = istart; |
|
1308 | 1319 | const U32 lowestIndex = ctx->lowLimit; |
|
1309 | 1320 | const BYTE* const dictStart = dictBase + lowestIndex; |
|
1310 | 1321 | const U32 dictLimit = ctx->dictLimit; |
|
1311 | 1322 | const BYTE* const lowPrefixPtr = base + dictLimit; |
|
1312 | 1323 | const BYTE* const dictEnd = dictBase + dictLimit; |
|
1313 | 1324 | const BYTE* const iend = istart + srcSize; |
|
1314 | 1325 | const BYTE* const ilimit = iend - 8; |
|
1315 | 1326 | U32 offset_1=ctx->rep[0], offset_2=ctx->rep[1]; |
|
1316 | 1327 | |
|
1317 | 1328 | /* Search Loop */ |
|
1318 | 1329 | while (ip < ilimit) { /* < instead of <=, because repcode check at (ip+1) */ |
|
1319 | 1330 | const size_t hSmall = ZSTD_hashPtr(ip, hBitsS, mls); |
|
1320 | 1331 | const U32 matchIndex = hashSmall[hSmall]; |
|
1321 | 1332 | const BYTE* matchBase = matchIndex < dictLimit ? dictBase : base; |
|
1322 | 1333 | const BYTE* match = matchBase + matchIndex; |
|
1323 | 1334 | |
|
1324 | 1335 | const size_t hLong = ZSTD_hashPtr(ip, hBitsL, 8); |
|
1325 | 1336 | const U32 matchLongIndex = hashLong[hLong]; |
|
1326 | 1337 | const BYTE* matchLongBase = matchLongIndex < dictLimit ? dictBase : base; |
|
1327 | 1338 | const BYTE* matchLong = matchLongBase + matchLongIndex; |
|
1328 | 1339 | |
|
1329 | 1340 | const U32 current = (U32)(ip-base); |
|
1330 | 1341 | const U32 repIndex = current + 1 - offset_1; /* offset_1 expected <= current +1 */ |
|
1331 | 1342 | const BYTE* repBase = repIndex < dictLimit ? dictBase : base; |
|
1332 | 1343 | const BYTE* repMatch = repBase + repIndex; |
|
1333 | 1344 | size_t mLength; |
|
1334 | 1345 | hashSmall[hSmall] = hashLong[hLong] = current; /* update hash table */ |
|
1335 | 1346 | |
|
1336 | 1347 | if ( (((U32)((dictLimit-1) - repIndex) >= 3) /* intentional underflow */ & (repIndex > lowestIndex)) |
|
1337 | 1348 | && (MEM_read32(repMatch) == MEM_read32(ip+1)) ) { |
|
1338 | 1349 | const BYTE* repMatchEnd = repIndex < dictLimit ? dictEnd : iend; |
|
1339 | 1350 | mLength = ZSTD_count_2segments(ip+1+4, repMatch+4, iend, repMatchEnd, lowPrefixPtr) + 4; |
|
1340 | 1351 | ip++; |
|
1341 | 1352 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, 0, mLength-MINMATCH); |
|
1342 | 1353 | } else { |
|
1343 | 1354 | if ((matchLongIndex > lowestIndex) && (MEM_read64(matchLong) == MEM_read64(ip))) { |
|
1344 | 1355 | const BYTE* matchEnd = matchLongIndex < dictLimit ? dictEnd : iend; |
|
1345 | 1356 | const BYTE* lowMatchPtr = matchLongIndex < dictLimit ? dictStart : lowPrefixPtr; |
|
1346 | 1357 | U32 offset; |
|
1347 | 1358 | mLength = ZSTD_count_2segments(ip+8, matchLong+8, iend, matchEnd, lowPrefixPtr) + 8; |
|
1348 | 1359 | offset = current - matchLongIndex; |
|
1349 | 1360 | while (((ip>anchor) & (matchLong>lowMatchPtr)) && (ip[-1] == matchLong[-1])) { ip--; matchLong--; mLength++; } /* catch up */ |
|
1350 | 1361 | offset_2 = offset_1; |
|
1351 | 1362 | offset_1 = offset; |
|
1352 | 1363 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); |
|
1353 | 1364 | |
|
1354 | 1365 | } else if ((matchIndex > lowestIndex) && (MEM_read32(match) == MEM_read32(ip))) { |
|
1355 | 1366 | size_t const h3 = ZSTD_hashPtr(ip+1, hBitsL, 8); |
|
1356 | 1367 | U32 const matchIndex3 = hashLong[h3]; |
|
1357 | 1368 | const BYTE* const match3Base = matchIndex3 < dictLimit ? dictBase : base; |
|
1358 | 1369 | const BYTE* match3 = match3Base + matchIndex3; |
|
1359 | 1370 | U32 offset; |
|
1360 | 1371 | hashLong[h3] = current + 1; |
|
1361 | 1372 | if ( (matchIndex3 > lowestIndex) && (MEM_read64(match3) == MEM_read64(ip+1)) ) { |
|
1362 | 1373 | const BYTE* matchEnd = matchIndex3 < dictLimit ? dictEnd : iend; |
|
1363 | 1374 | const BYTE* lowMatchPtr = matchIndex3 < dictLimit ? dictStart : lowPrefixPtr; |
|
1364 | 1375 | mLength = ZSTD_count_2segments(ip+9, match3+8, iend, matchEnd, lowPrefixPtr) + 8; |
|
1365 | 1376 | ip++; |
|
1366 | 1377 | offset = current+1 - matchIndex3; |
|
1367 | 1378 | while (((ip>anchor) & (match3>lowMatchPtr)) && (ip[-1] == match3[-1])) { ip--; match3--; mLength++; } /* catch up */ |
|
1368 | 1379 | } else { |
|
1369 | 1380 | const BYTE* matchEnd = matchIndex < dictLimit ? dictEnd : iend; |
|
1370 | 1381 | const BYTE* lowMatchPtr = matchIndex < dictLimit ? dictStart : lowPrefixPtr; |
|
1371 | 1382 | mLength = ZSTD_count_2segments(ip+4, match+4, iend, matchEnd, lowPrefixPtr) + 4; |
|
1372 | 1383 | offset = current - matchIndex; |
|
1373 | 1384 | while (((ip>anchor) & (match>lowMatchPtr)) && (ip[-1] == match[-1])) { ip--; match--; mLength++; } /* catch up */ |
|
1374 | 1385 | } |
|
1375 | 1386 | offset_2 = offset_1; |
|
1376 | 1387 | offset_1 = offset; |
|
1377 | 1388 | ZSTD_storeSeq(seqStorePtr, ip-anchor, anchor, offset + ZSTD_REP_MOVE, mLength-MINMATCH); |
|
1378 | 1389 | |
|
1379 | 1390 | } else { |
|
1380 | 1391 | ip += ((ip-anchor) >> g_searchStrength) + 1; |
|
1381 | 1392 | continue; |
|
1382 | 1393 | } } |
|
1383 | 1394 | |
|
1384 | 1395 | /* found a match : store it */ |
|
1385 | 1396 | ip += mLength; |
|
1386 | 1397 | anchor = ip; |
|
1387 | 1398 | |
|
1388 | 1399 | if (ip <= ilimit) { |
|
1389 | 1400 | /* Fill Table */ |
|
1390 | 1401 | hashSmall[ZSTD_hashPtr(base+current+2, hBitsS, mls)] = current+2; |
|
1391 | 1402 | hashLong[ZSTD_hashPtr(base+current+2, hBitsL, 8)] = current+2; |
|
1392 | 1403 | hashSmall[ZSTD_hashPtr(ip-2, hBitsS, mls)] = (U32)(ip-2-base); |
|
1393 | 1404 | hashLong[ZSTD_hashPtr(ip-2, hBitsL, 8)] = (U32)(ip-2-base); |
|
1394 | 1405 | /* check immediate repcode */ |
|
1395 | 1406 | while (ip <= ilimit) { |
|
1396 | 1407 | U32 const current2 = (U32)(ip-base); |
|
1397 | 1408 | U32 const repIndex2 = current2 - offset_2; |
|
1398 | 1409 | const BYTE* repMatch2 = repIndex2 < dictLimit ? dictBase + repIndex2 : base + repIndex2; |
|
1399 | 1410 | if ( (((U32)((dictLimit-1) - repIndex2) >= 3) & (repIndex2 > lowestIndex)) /* intentional overflow */ |
|
1400 | 1411 | && (MEM_read32(repMatch2) == MEM_read32(ip)) ) { |
|
1401 | 1412 | const BYTE* const repEnd2 = repIndex2 < dictLimit ? dictEnd : iend; |
|
1402 | 1413 | size_t const repLength2 = ZSTD_count_2segments(ip+EQUAL_READ32, repMatch2+EQUAL_READ32, iend, repEnd2, lowPrefixPtr) + EQUAL_READ32; |
|
1403 | 1414 | U32 tmpOffset = offset_2; offset_2 = offset_1; offset_1 = tmpOffset; /* swap offset_2 <=> offset_1 */ |
|
1404 | 1415 | ZSTD_storeSeq(seqStorePtr, 0, anchor, 0, repLength2-MINMATCH); |
|
1405 | 1416 | hashSmall[ZSTD_hashPtr(ip, hBitsS, mls)] = current2; |
|
1406 | 1417 | hashLong[ZSTD_hashPtr(ip, hBitsL, 8)] = current2; |
|
1407 | 1418 | ip += repLength2; |
|
1408 | 1419 | anchor = ip; |
|
1409 | 1420 | continue; |
|
1410 | 1421 | } |
|
1411 | 1422 | break; |
|
1412 | 1423 | } } } |
|
1413 | 1424 | |
|
1414 | 1425 | /* save reps for next block */ |
|
1415 | 1426 | ctx->savedRep[0] = offset_1; ctx->savedRep[1] = offset_2; |
|
1416 | 1427 | |
|
1417 | 1428 | /* Last Literals */ |
|
1418 | 1429 | { size_t const lastLLSize = iend - anchor; |
|
1419 | 1430 | memcpy(seqStorePtr->lit, anchor, lastLLSize); |
|
1420 | 1431 | seqStorePtr->lit += lastLLSize; |
|
1421 | 1432 | } |
|
1422 | 1433 | } |
|
1423 | 1434 | |
|
1424 | 1435 | |
|
1425 | 1436 | static void ZSTD_compressBlock_doubleFast_extDict(ZSTD_CCtx* ctx, |
|
1426 | 1437 | const void* src, size_t srcSize) |
|
1427 | 1438 | { |
|
1428 | 1439 | U32 const mls = ctx->params.cParams.searchLength; |
|
1429 | 1440 | switch(mls) |
|
1430 | 1441 | { |
|
1431 | 1442 | default: |
|
1432 | 1443 | case 4 : |
|
1433 | 1444 | ZSTD_compressBlock_doubleFast_extDict_generic(ctx, src, srcSize, 4); return; |
|
1434 | 1445 | case 5 : |
|
1435 | 1446 | ZSTD_compressBlock_doubleFast_extDict_generic(ctx, src, srcSize, 5); return; |
|
1436 | 1447 | case 6 : |
|
1437 | 1448 | ZSTD_compressBlock_doubleFast_extDict_generic(ctx, src, srcSize, 6); return; |
|
1438 | 1449 | case 7 : |
|
1439 | 1450 | ZSTD_compressBlock_doubleFast_extDict_generic(ctx, src, srcSize, 7); return; |
|
1440 | 1451 | } |
|
1441 | 1452 | } |
|
1442 | 1453 | |
|
1443 | 1454 | |
|
1444 | 1455 | /*-************************************* |
|
1445 | 1456 | * Binary Tree search |
|
1446 | 1457 | ***************************************/ |
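/* note (annotation) : chainTable doubles as the tree storage : each indexed
 * position owns two consecutive cells at bt + 2*(index & btMask), pointing to its
 * "smaller" and "larger" children in suffix order. Because the tree is kept
 * sorted, the search can carry commonLengthSmaller/commonLengthLarger as
 * guaranteed common-prefix lengths and skip re-comparing those bytes. */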
|
1447 | 1458 | /** ZSTD_insertBt1() : add one or multiple positions to tree. |
|
1448 | 1459 | * ip : assumed <= iend-8. |
|
1449 | 1460 | * @return : nb of positions added */ |
|
1450 | 1461 | static U32 ZSTD_insertBt1(ZSTD_CCtx* zc, const BYTE* const ip, const U32 mls, const BYTE* const iend, U32 nbCompares, |
|
1451 | 1462 | U32 extDict) |
|
1452 | 1463 | { |
|
1453 | 1464 | U32* const hashTable = zc->hashTable; |
|
1454 | 1465 | U32 const hashLog = zc->params.cParams.hashLog; |
|
1455 | 1466 | size_t const h = ZSTD_hashPtr(ip, hashLog, mls); |
|
1456 | 1467 | U32* const bt = zc->chainTable; |
|
1457 | 1468 | U32 const btLog = zc->params.cParams.chainLog - 1; |
|
1458 | 1469 | U32 const btMask = (1 << btLog) - 1; |
|
1459 | 1470 | U32 matchIndex = hashTable[h]; |
|
1460 | 1471 | size_t commonLengthSmaller=0, commonLengthLarger=0; |
|
1461 | 1472 | const BYTE* const base = zc->base; |
|
1462 | 1473 | const BYTE* const dictBase = zc->dictBase; |
|
1463 | 1474 | const U32 dictLimit = zc->dictLimit; |
|
1464 | 1475 | const BYTE* const dictEnd = dictBase + dictLimit; |
|
1465 | 1476 | const BYTE* const prefixStart = base + dictLimit; |
|
1466 | 1477 | const BYTE* match; |
|
1467 | 1478 | const U32 current = (U32)(ip-base); |
|
1468 | 1479 | const U32 btLow = btMask >= current ? 0 : current - btMask; |
|
1469 | 1480 | U32* smallerPtr = bt + 2*(current&btMask); |
|
1470 | 1481 | U32* largerPtr = smallerPtr + 1; |
|
1471 | 1482 | U32 dummy32; /* to be nullified at the end */ |
|
1472 | 1483 | U32 const windowLow = zc->lowLimit; |
|
1473 | 1484 | U32 matchEndIdx = current+8; |
|
1474 | 1485 | size_t bestLength = 8; |
|
1475 | 1486 | #ifdef ZSTD_C_PREDICT |
|
1476 | 1487 | U32 predictedSmall = *(bt + 2*((current-1)&btMask) + 0); |
|
1477 | 1488 | U32 predictedLarge = *(bt + 2*((current-1)&btMask) + 1); |
|
1478 | 1489 | predictedSmall += (predictedSmall>0); |
|
1479 | 1490 | predictedLarge += (predictedLarge>0); |
|
1480 | 1491 | #endif /* ZSTD_C_PREDICT */ |
|
1481 | 1492 | |
|
1482 | 1493 | hashTable[h] = current; /* Update Hash Table */ |
|
1483 | 1494 | |
|
1484 | 1495 | while (nbCompares-- && (matchIndex > windowLow)) { |
|
1485 | U32* nextPtr = bt + 2*(matchIndex & btMask); | |
|
1496 | U32* const nextPtr = bt + 2*(matchIndex & btMask); | |
|
1486 | 1497 | size_t matchLength = MIN(commonLengthSmaller, commonLengthLarger); /* guaranteed minimum nb of common bytes */ |
|
1498 | ||
|
1487 | 1499 | #ifdef ZSTD_C_PREDICT /* note : can create issues when hlog small <= 11 */ |
|
1488 | 1500 | const U32* predictPtr = bt + 2*((matchIndex-1) & btMask); /* written this way, as bt is a roll buffer */ |
|
1489 | 1501 | if (matchIndex == predictedSmall) { |
|
1490 | 1502 | /* no need to check length, result known */ |
|
1491 | 1503 | *smallerPtr = matchIndex; |
|
1492 | 1504 | if (matchIndex <= btLow) { smallerPtr=&dummy32; break; } /* beyond tree size, stop the search */ |
|
1493 | 1505 | smallerPtr = nextPtr+1; /* new "smaller" => larger of match */ |
|
1494 | 1506 | matchIndex = nextPtr[1]; /* new matchIndex larger than previous (closer to current) */ |
|
1495 | 1507 | predictedSmall = predictPtr[1] + (predictPtr[1]>0); |
|
1496 | 1508 | continue; |
|
1497 | 1509 | } |
|
1498 | 1510 | if (matchIndex == predictedLarge) { |
|
1499 | 1511 | *largerPtr = matchIndex; |
|
1500 | 1512 | if (matchIndex <= btLow) { largerPtr=&dummy32; break; } /* beyond tree size, stop the search */ |
|
1501 | 1513 | largerPtr = nextPtr; |
|
1502 | 1514 | matchIndex = nextPtr[0]; |
|
1503 | 1515 | predictedLarge = predictPtr[0] + (predictPtr[0]>0); |
|
1504 | 1516 | continue; |
|
1505 | 1517 | } |
|
1506 | 1518 | #endif |
|
1507 | 1519 | if ((!extDict) || (matchIndex+matchLength >= dictLimit)) { |
|
1508 | 1520 | match = base + matchIndex; |
|
1509 | 1521 | if (match[matchLength] == ip[matchLength]) |
|
1510 | 1522 | matchLength += ZSTD_count(ip+matchLength+1, match+matchLength+1, iend) +1; |
|
1511 | 1523 | } else { |
|
1512 | 1524 | match = dictBase + matchIndex; |
|
1513 | 1525 | matchLength += ZSTD_count_2segments(ip+matchLength, match+matchLength, iend, dictEnd, prefixStart); |
|
1514 | 1526 | if (matchIndex+matchLength >= dictLimit) |
|
1515 | 1527 | match = base + matchIndex; /* to prepare for next usage of match[matchLength] */ |
|
1516 | 1528 | } |
|
1517 | 1529 | |
|
1518 | 1530 | if (matchLength > bestLength) { |
|
1519 | 1531 | bestLength = matchLength; |
|
1520 | 1532 | if (matchLength > matchEndIdx - matchIndex) |
|
1521 | 1533 | matchEndIdx = matchIndex + (U32)matchLength; |
|
1522 | 1534 | } |
|
1523 | 1535 | |
|
1524 | 1536 | if (ip+matchLength == iend) /* equal : no way to know if inf or sup */ |
|
1525 | 1537 | break; /* drop, to guarantee consistency; misses a bit of compression, but other solutions can corrupt the tree */ |
|
1526 | 1538 | |
|
1527 | 1539 | if (match[matchLength] < ip[matchLength]) { /* necessarily within correct buffer */ |
|
1528 | 1540 | /* match is smaller than current */ |
|
1529 | 1541 | *smallerPtr = matchIndex; /* update smaller idx */ |
|
1530 | 1542 | commonLengthSmaller = matchLength; /* all smaller will now have at least this guaranteed common length */ |
|
1531 | 1543 | if (matchIndex <= btLow) { smallerPtr=&dummy32; break; } /* beyond tree size, stop the search */ |
|
1532 | 1544 | smallerPtr = nextPtr+1; /* new "smaller" => larger of match */ |
|
1533 | 1545 | matchIndex = nextPtr[1]; /* new matchIndex larger than previous (closer to current) */ |
|
1534 | 1546 | } else { |
|
1535 | 1547 | /* match is larger than current */ |
|
1536 | 1548 | *largerPtr = matchIndex; |
|
1537 | 1549 | commonLengthLarger = matchLength; |
|
1538 | 1550 | if (matchIndex <= btLow) { largerPtr=&dummy32; break; } /* beyond tree size, stop the search */ |
|
1539 | 1551 | largerPtr = nextPtr; |
|
1540 | 1552 | matchIndex = nextPtr[0]; |
|
1541 | 1553 | } } |
|
1542 | 1554 | |
|
1543 | 1555 | *smallerPtr = *largerPtr = 0; |
|
1544 | 1556 | if (bestLength > 384) return MIN(192, (U32)(bestLength - 384)); /* speed optimization */ |
|
1545 | 1557 | if (matchEndIdx > current + 8) return matchEndIdx - current - 8; |
|
1546 | 1558 | return 1; |
|
1547 | 1559 | } |
|
1548 | 1560 | |
|
1549 | 1561 | |
|
1550 | 1562 | static size_t ZSTD_insertBtAndFindBestMatch ( |
|
1551 | 1563 | ZSTD_CCtx* zc, |
|
1552 | 1564 | const BYTE* const ip, const BYTE* const iend, |
|
1553 | 1565 | size_t* offsetPtr, |
|
1554 | 1566 | U32 nbCompares, const U32 mls, |
|
1555 | 1567 | U32 extDict) |
|
1556 | 1568 | { |
|
1557 | 1569 | U32* const hashTable = zc->hashTable; |
|
1558 | 1570 | U32 const hashLog = zc->params.cParams.hashLog; |
|
1559 | 1571 | size_t const h = ZSTD_hashPtr(ip, hashLog, mls); |
|
1560 | 1572 | U32* const bt = zc->chainTable; |
|
1561 | 1573 | U32 const btLog = zc->params.cParams.chainLog - 1; |
|
1562 | 1574 | U32 const btMask = (1 << btLog) - 1; |
|
1563 | 1575 | U32 matchIndex = hashTable[h]; |
|
1564 | 1576 | size_t commonLengthSmaller=0, commonLengthLarger=0; |
|
1565 | 1577 | const BYTE* const base = zc->base; |
|
1566 | 1578 | const BYTE* const dictBase = zc->dictBase; |
|
1567 | 1579 | const U32 dictLimit = zc->dictLimit; |
|
1568 | 1580 | const BYTE* const dictEnd = dictBase + dictLimit; |
|
1569 | 1581 | const BYTE* const prefixStart = base + dictLimit; |
|
1570 | 1582 | const U32 current = (U32)(ip-base); |
|
1571 | 1583 | const U32 btLow = btMask >= current ? 0 : current - btMask; |
|
1572 | 1584 | const U32 windowLow = zc->lowLimit; |
|
1573 | 1585 | U32* smallerPtr = bt + 2*(current&btMask); |
|
1574 | 1586 | U32* largerPtr = bt + 2*(current&btMask) + 1; |
|
1575 | 1587 | U32 matchEndIdx = current+8; |
|
1576 | 1588 | U32 dummy32; /* to be nullified at the end */ |
|
1577 | 1589 | size_t bestLength = 0; |
|
1578 | 1590 | |
|
1579 | 1591 | hashTable[h] = current; /* Update Hash Table */ |
|
1580 | 1592 | |
|
1581 | 1593 | while (nbCompares-- && (matchIndex > windowLow)) { |
|
1582 | U32* nextPtr = bt + 2*(matchIndex & btMask); | |
|
1594 | U32* const nextPtr = bt + 2*(matchIndex & btMask); | |
|
1583 | 1595 | size_t matchLength = MIN(commonLengthSmaller, commonLengthLarger); /* guaranteed minimum nb of common bytes */ |
|
1584 | 1596 | const BYTE* match; |
|
1585 | 1597 | |
|
1586 | 1598 | if ((!extDict) || (matchIndex+matchLength >= dictLimit)) { |
|
1587 | 1599 | match = base + matchIndex; |
|
1588 | 1600 | if (match[matchLength] == ip[matchLength]) |
|
1589 | 1601 | matchLength += ZSTD_count(ip+matchLength+1, match+matchLength+1, iend) +1; |
|
1590 | 1602 | } else { |
|
1591 | 1603 | match = dictBase + matchIndex; |
|
1592 | 1604 | matchLength += ZSTD_count_2segments(ip+matchLength, match+matchLength, iend, dictEnd, prefixStart); |
|
1593 | 1605 | if (matchIndex+matchLength >= dictLimit) |
|
1594 | 1606 | match = base + matchIndex; /* to prepare for next usage of match[matchLength] */ |
|
1595 | 1607 | } |
|
1596 | 1608 | |
|
1597 | 1609 | if (matchLength > bestLength) { |
|
1598 | 1610 | if (matchLength > matchEndIdx - matchIndex) |
|
1599 | 1611 | matchEndIdx = matchIndex + (U32)matchLength; |
|
1600 | 1612 | if ( (4*(int)(matchLength-bestLength)) > (int)(ZSTD_highbit32(current-matchIndex+1) - ZSTD_highbit32((U32)offsetPtr[0]+1)) ) |
|
1601 | 1613 | bestLength = matchLength, *offsetPtr = ZSTD_REP_MOVE + current - matchIndex; |
|
1602 | 1614 | if (ip+matchLength == iend) /* equal : no way to know if inf or sup */ |
|
1603 | 1615 | break; /* drop, to guarantee consistency (misses a little compression) */ |
|
1604 | 1616 | } |
|
1605 | 1617 | |
|
1606 | 1618 | if (match[matchLength] < ip[matchLength]) { |
|
1607 | 1619 | /* match is smaller than current */ |
|
1608 | 1620 | *smallerPtr = matchIndex; /* update smaller idx */ |
|
1609 | 1621 | commonLengthSmaller = matchLength; /* all smaller will now have at least this guaranteed common length */ |
|
1610 | 1622 | if (matchIndex <= btLow) { smallerPtr=&dummy32; break; } /* beyond tree size, stop the search */ |
|
1611 | 1623 | smallerPtr = nextPtr+1; /* new "smaller" => larger of match */ |
|
1612 | 1624 | matchIndex = nextPtr[1]; /* new matchIndex larger than previous (closer to current) */ |
|
1613 | 1625 | } else { |
|
1614 | 1626 | /* match is larger than current */ |
|
1615 | 1627 | *largerPtr = matchIndex; |
|
1616 | 1628 | commonLengthLarger = matchLength; |
|
1617 | 1629 | if (matchIndex <= btLow) { largerPtr=&dummy32; break; } /* beyond tree size, stop the search */ |
|
1618 | 1630 | largerPtr = nextPtr; |
|
1619 | 1631 | matchIndex = nextPtr[0]; |
|
1620 | 1632 | } } |
|
1621 | 1633 | |
|
1622 | 1634 | *smallerPtr = *largerPtr = 0; |
|
1623 | 1635 | |
|
1624 | 1636 | zc->nextToUpdate = (matchEndIdx > current + 8) ? matchEndIdx - 8 : current+1; |
|
1625 | 1637 | return bestLength; |
|
1626 | 1638 | } |
|
1627 | 1639 | |
|
1628 | 1640 | |
|
1629 | 1641 | static void ZSTD_updateTree(ZSTD_CCtx* zc, const BYTE* const ip, const BYTE* const iend, const U32 nbCompares, const U32 mls) |
|
1630 | 1642 | { |
|
1631 | 1643 | const BYTE* const base = zc->base; |
|
1632 | 1644 | const U32 target = (U32)(ip - base); |
|
1633 | 1645 | U32 idx = zc->nextToUpdate; |
|
1634 | 1646 | |
|
1635 | 1647 | while(idx < target) |
|
1636 | 1648 | idx += ZSTD_insertBt1(zc, base+idx, mls, iend, nbCompares, 0); |
|
1637 | 1649 | } |
|
1638 | 1650 | |
|
1639 | 1651 | /** ZSTD_BtFindBestMatch() : Tree updater, providing best match */ |
|
1640 | 1652 | static size_t ZSTD_BtFindBestMatch ( |
|
1641 | 1653 | ZSTD_CCtx* zc, |
|
1642 | 1654 | const BYTE* const ip, const BYTE* const iLimit, |
|
1643 | 1655 | size_t* offsetPtr, |
|
1644 | 1656 | const U32 maxNbAttempts, const U32 mls) |
|
1645 | 1657 | { |
|
1646 | 1658 | if (ip < zc->base + zc->nextToUpdate) return 0; /* skipped area */ |
|
1647 | 1659 | ZSTD_updateTree(zc, ip, iLimit, maxNbAttempts, mls); |
|
1648 | 1660 | return ZSTD_insertBtAndFindBestMatch(zc, ip, iLimit, offsetPtr, maxNbAttempts, mls, 0); |
|
1649 | 1661 | } |
|
1650 | 1662 | |
|
1651 | 1663 | |
|
1652 | 1664 | static size_t ZSTD_BtFindBestMatch_selectMLS ( |
|
1653 | 1665 | ZSTD_CCtx* zc, /* Index table will be updated */ |
|
1654 | 1666 | const BYTE* ip, const BYTE* const iLimit, |
|
1655 | 1667 | size_t* offsetPtr, |
|
1656 | 1668 | const U32 maxNbAttempts, const U32 matchLengthSearch) |
|
1657 | 1669 | { |
|
1658 | 1670 | switch(matchLengthSearch) |
|
1659 | 1671 | { |
|
1660 | 1672 | default : |
|
1661 | 1673 | case 4 : return ZSTD_BtFindBestMatch(zc, ip, iLimit, offsetPtr, maxNbAttempts, 4); |
|
1662 | 1674 | case 5 : return ZSTD_BtFindBestMatch(zc, ip, iLimit, offsetPtr, maxNbAttempts, 5); |
|
1663 | 1675 | case 6 : return ZSTD_BtFindBestMatch(zc, ip, iLimit, offsetPtr, maxNbAttempts, 6); |
|
1664 | 1676 | } |
|
1665 | 1677 | } |
|
1666 | 1678 | |
|
1667 | 1679 | |
|
1668 | 1680 | static void ZSTD_updateTree_extDict(ZSTD_CCtx* zc, const BYTE* const ip, const BYTE* const iend, const U32 nbCompares, const U32 mls) |
|
1669 | 1681 | { |
|
1670 | 1682 | const BYTE* const base = zc->base; |
|
1671 | 1683 | const U32 target = (U32)(ip - base); |
|
1672 | 1684 | U32 idx = zc->nextToUpdate; |
|
1673 | 1685 | |
|
1674 | 1686 | while (idx < target) idx += ZSTD_insertBt1(zc, base+idx, mls, iend, nbCompares, 1); |
|
1675 | 1687 | } |
|
1676 | 1688 | |
|
1677 | 1689 | |
|
1678 | 1690 | /** Tree updater, providing best match */ |
|
1679 | 1691 | static size_t ZSTD_BtFindBestMatch_extDict ( |
|
1680 | 1692 | ZSTD_CCtx* zc, |
|
1681 | 1693 | const BYTE* const ip, const BYTE* const iLimit, |
|
1682 | 1694 | size_t* offsetPtr, |
|
1683 | 1695 | const U32 maxNbAttempts, const U32 mls) |
|
1684 | 1696 | { |
|
1685 | 1697 | if (ip < zc->base + zc->nextToUpdate) return 0; /* skipped area */ |
|
1686 | 1698 | ZSTD_updateTree_extDict(zc, ip, iLimit, maxNbAttempts, mls); |
|
1687 | 1699 | return ZSTD_insertBtAndFindBestMatch(zc, ip, iLimit, offsetPtr, maxNbAttempts, mls, 1); |
|
1688 | 1700 | } |
|
1689 | 1701 | |
|
1690 | 1702 | |
|
1691 | 1703 | static size_t ZSTD_BtFindBestMatch_selectMLS_extDict ( |
|
1692 | 1704 | ZSTD_CCtx* zc, /* Index table will be updated */ |
|
1693 | 1705 | const BYTE* ip, const BYTE* const iLimit, |
|
1694 | 1706 | size_t* offsetPtr, |
|
1695 | 1707 | const U32 maxNbAttempts, const U32 matchLengthSearch) |
|
1696 | 1708 | { |
|
1697 | 1709 | switch(matchLengthSearch) |
|
1698 | 1710 | { |
|
1699 | 1711 | default : |
|
1700 | 1712 | case 4 : return ZSTD_BtFindBestMatch_extDict(zc, ip, iLimit, offsetPtr, maxNbAttempts, 4); |
|
1701 | 1713 | case 5 : return ZSTD_BtFindBestMatch_extDict(zc, ip, iLimit, offsetPtr, maxNbAttempts, 5); |
|
1702 | 1714 | case 6 : return ZSTD_BtFindBestMatch_extDict(zc, ip, iLimit, offsetPtr, maxNbAttempts, 6); |
|
1703 | 1715 | } |
|
1704 | 1716 | } |
|
1705 | 1717 | |
|
1706 | 1718 | |
|
1707 | 1719 | |
|
1708 | 1720 | /* ********************************* |
|
1709 | 1721 | * Hash Chain |
|
1710 | 1722 | ***********************************/ |
|
1711 | 1723 | #define NEXT_IN_CHAIN(d, mask) chainTable[(d) & mask] |
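/* note (annotation) : chainTable here is an implicit singly-linked list : cell
 * (pos & mask) stores the previous position that hashed to the same bucket, so
 * repeatedly applying NEXT_IN_CHAIN walks candidates from most recent to oldest,
 * until the chain falls out of the window (minChain / lowLimit). */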
|
1712 | 1724 | |
|
1713 | 1725 | /* Update chains up to ip (excluded) |
|
1714 | 1726 | Assumption : always within prefix (i.e. not within extDict) */ |
|
1715 | 1727 | FORCE_INLINE |
|
1716 | 1728 | U32 ZSTD_insertAndFindFirstIndex (ZSTD_CCtx* zc, const BYTE* ip, U32 mls) |
|
1717 | 1729 | { |
|
1718 | 1730 | U32* const hashTable = zc->hashTable; |
|
1719 | 1731 | const U32 hashLog = zc->params.cParams.hashLog; |
|
1720 | 1732 | U32* const chainTable = zc->chainTable; |
|
1721 | 1733 | const U32 chainMask = (1 << zc->params.cParams.chainLog) - 1; |
|
1722 | 1734 | const BYTE* const base = zc->base; |
|
1723 | 1735 | const U32 target = (U32)(ip - base); |
|
1724 | 1736 | U32 idx = zc->nextToUpdate; |
|
1725 | 1737 | |
|
1726 | 1738 | while(idx < target) { /* catch up */ |
|
1727 | 1739 | size_t const h = ZSTD_hashPtr(base+idx, hashLog, mls); |
|
1728 | 1740 | NEXT_IN_CHAIN(idx, chainMask) = hashTable[h]; |
|
1729 | 1741 | hashTable[h] = idx; |
|
1730 | 1742 | idx++; |
|
1731 | 1743 | } |
|
1732 | 1744 | |
|
1733 | 1745 | zc->nextToUpdate = target; |
|
1734 | 1746 | return hashTable[ZSTD_hashPtr(ip, hashLog, mls)]; |
|
1735 | 1747 | } |
|
1736 | 1748 | |
|
1737 | 1749 | |
|
1738 | 1750 | |
|
1739 | 1751 | FORCE_INLINE /* inlining is important to hardwire a hot branch (template emulation) */ |
|
1740 | 1752 | size_t ZSTD_HcFindBestMatch_generic ( |
|
1741 | 1753 | ZSTD_CCtx* zc, /* Index table will be updated */ |
|
1742 | 1754 | const BYTE* const ip, const BYTE* const iLimit, |
|
1743 | 1755 | size_t* offsetPtr, |
|
1744 | 1756 | const U32 maxNbAttempts, const U32 mls, const U32 extDict) |
|
1745 | 1757 | { |
|
1746 | 1758 | U32* const chainTable = zc->chainTable; |
|
1747 | 1759 | const U32 chainSize = (1 << zc->params.cParams.chainLog); |
|
1748 | 1760 | const U32 chainMask = chainSize-1; |
|
1749 | 1761 | const BYTE* const base = zc->base; |
|
1750 | 1762 | const BYTE* const dictBase = zc->dictBase; |
|
1751 | 1763 | const U32 dictLimit = zc->dictLimit; |
|
1752 | 1764 | const BYTE* const prefixStart = base + dictLimit; |
|
1753 | 1765 | const BYTE* const dictEnd = dictBase + dictLimit; |
|
1754 | 1766 | const U32 lowLimit = zc->lowLimit; |
|
1755 | 1767 | const U32 current = (U32)(ip-base); |
|
1756 | 1768 | const U32 minChain = current > chainSize ? current - chainSize : 0; |
|
1757 | 1769 | int nbAttempts=maxNbAttempts; |
|
1758 | 1770 | size_t ml=EQUAL_READ32-1; |
|
1759 | 1771 | |
|
1760 | 1772 | /* HC4 match finder */ |
|
1761 | 1773 | U32 matchIndex = ZSTD_insertAndFindFirstIndex (zc, ip, mls); |
|
1762 | 1774 | |
|
1763 | 1775 | for ( ; (matchIndex>lowLimit) & (nbAttempts>0) ; nbAttempts--) { |
|
1764 | 1776 | const BYTE* match; |
|
1765 | 1777 | size_t currentMl=0; |
|
1766 | 1778 | if ((!extDict) || matchIndex >= dictLimit) { |
|
1767 | 1779 | match = base + matchIndex; |
|
1768 | 1780 | if (match[ml] == ip[ml]) /* potentially better */ |
|
1769 | 1781 | currentMl = ZSTD_count(ip, match, iLimit); |
|
1770 | 1782 | } else { |
|
1771 | 1783 | match = dictBase + matchIndex; |
|
1772 | 1784 | if (MEM_read32(match) == MEM_read32(ip)) /* assumption : matchIndex <= dictLimit-4 (by table construction) */ |
|
1773 | 1785 | currentMl = ZSTD_count_2segments(ip+EQUAL_READ32, match+EQUAL_READ32, iLimit, dictEnd, prefixStart) + EQUAL_READ32; |
|
1774 | 1786 | } |
|
1775 | 1787 | |
|
1776 | 1788 | /* save best solution */ |
|
1777 | 1789 | if (currentMl > ml) { ml = currentMl; *offsetPtr = current - matchIndex + ZSTD_REP_MOVE; if (ip+currentMl == iLimit) break; /* best possible, and avoid read overflow */ } |
|
1778 | 1790 | |
|
1779 | 1791 | if (matchIndex <= minChain) break; |
|
1780 | 1792 | matchIndex = NEXT_IN_CHAIN(matchIndex, chainMask); |
|
1781 | 1793 | } |
|
1782 | 1794 | |
|
1783 | 1795 | return ml; |
|
1784 | 1796 | } |
|
1785 | 1797 | |
|
1786 | 1798 | |
|
1787 | 1799 | FORCE_INLINE size_t ZSTD_HcFindBestMatch_selectMLS ( |
|
1788 | 1800 | ZSTD_CCtx* zc, |
|
1789 | 1801 | const BYTE* ip, const BYTE* const iLimit, |
|
1790 | 1802 | size_t* offsetPtr, |
|
1791 | 1803 | const U32 maxNbAttempts, const U32 matchLengthSearch) |
|
1792 | 1804 | { |
|
1793 | 1805 | switch(matchLengthSearch) |
|
1794 | 1806 | { |
|
1795 | 1807 | default : |
|
1796 | 1808 | case 4 : return ZSTD_HcFindBestMatch_generic(zc, ip, iLimit, offsetPtr, maxNbAttempts, 4, 0); |
|
1797 | 1809 | case 5 : return ZSTD_HcFindBestMatch_generic(zc, ip, iLimit, offsetPtr, maxNbAttempts, 5, 0); |
|
1798 | 1810 | case 6 : return ZSTD_HcFindBestMatch_generic(zc, ip, iLimit, offsetPtr, maxNbAttempts, 6, 0); |
|
1799 | 1811 | } |
|
1800 | 1812 | } |
|
1801 | 1813 | |
|
1802 | 1814 | |
|
1803 | 1815 | FORCE_INLINE size_t ZSTD_HcFindBestMatch_extDict_selectMLS ( |
|
1804 | 1816 | ZSTD_CCtx* zc, |
|
1805 | 1817 | const BYTE* ip, const BYTE* const iLimit, |
|
1806 | 1818 | size_t* offsetPtr, |
|
1807 | 1819 | const U32 maxNbAttempts, const U32 matchLengthSearch) |
|
1808 | 1820 | { |
|
1809 | 1821 | switch(matchLengthSearch) |
|
1810 | 1822 | { |
|
1811 | 1823 | default : |
|
1812 | 1824 | case 4 : return ZSTD_HcFindBestMatch_generic(zc, ip, iLimit, offsetPtr, maxNbAttempts, 4, 1); |
|
1813 | 1825 | case 5 : return ZSTD_HcFindBestMatch_generic(zc, ip, iLimit, offsetPtr, maxNbAttempts, 5, 1); |
|
1814 | 1826 | case 6 : return ZSTD_HcFindBestMatch_generic(zc, ip, iLimit, offsetPtr, maxNbAttempts, 6, 1); |
|
1815 | 1827 | } |
|
1816 | 1828 | } |
|
1817 | 1829 | |
|
1818 | 1830 | |
|
1819 | 1831 | /* ******************************* |
|
1820 | 1832 | * Common parser - lazy strategy |
|
1821 | 1833 | *********************************/ |
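/* note (annotation) : a single template covers greedy, lazy, lazy2 and btlazy2 :
 * searchMethod picks the match finder (1 = binary tree, 0 = hash chain) and depth
 * how lazy the parser is (0 = take the first match found, 1/2 = also re-evaluate
 * at ip+1 / ip+2 before committing). Candidates are ranked with a rough model,
 * roughly gain = 4*matchLength - ZSTD_highbit32(offset+1) : a matched byte is
 * worth ~4 points, each extra offset bit costs ~1 point, and the +1/+4/+7
 * constants bias the comparison toward the solution already found. */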
|
1822 | 1834 | FORCE_INLINE |
|
1823 | 1835 | void ZSTD_compressBlock_lazy_generic(ZSTD_CCtx* ctx, |
|
1824 | 1836 | const void* src, size_t srcSize, |
|
1825 | 1837 | const U32 searchMethod, const U32 depth) |
|
1826 | 1838 | { |
|
1827 | 1839 | seqStore_t* seqStorePtr = &(ctx->seqStore); |
|
1828 | 1840 | const BYTE* const istart = (const BYTE*)src; |
|
1829 | 1841 | const BYTE* ip = istart; |
|
1830 | 1842 | const BYTE* anchor = istart; |
|
1831 | 1843 | const BYTE* const iend = istart + srcSize; |
|
1832 | 1844 | const BYTE* const ilimit = iend - 8; |
|
1833 | 1845 | const BYTE* const base = ctx->base + ctx->dictLimit; |
|
1834 | 1846 | |
|
1835 | 1847 | U32 const maxSearches = 1 << ctx->params.cParams.searchLog; |
|
1836 | 1848 | U32 const mls = ctx->params.cParams.searchLength; |
|
1837 | 1849 | |
|
1838 | 1850 | typedef size_t (*searchMax_f)(ZSTD_CCtx* zc, const BYTE* ip, const BYTE* iLimit, |
|
1839 | 1851 | size_t* offsetPtr, |
|
1840 | 1852 | U32 maxNbAttempts, U32 matchLengthSearch); |
|
1841 | 1853 | searchMax_f const searchMax = searchMethod ? ZSTD_BtFindBestMatch_selectMLS : ZSTD_HcFindBestMatch_selectMLS; |
|
1842 | 1854 | U32 offset_1 = ctx->rep[0], offset_2 = ctx->rep[1], savedOffset=0; |
|
1843 | 1855 | |
|
1844 | 1856 | /* init */ |
|
1845 | 1857 | ip += (ip==base); |
|
1846 | 1858 | ctx->nextToUpdate3 = ctx->nextToUpdate; |
|
1847 | 1859 | { U32 const maxRep = (U32)(ip-base); |
|
1848 | 1860 | if (offset_2 > maxRep) savedOffset = offset_2, offset_2 = 0; |
|
1849 | 1861 | if (offset_1 > maxRep) savedOffset = offset_1, offset_1 = 0; |
|
1850 | 1862 | } |
|
1851 | 1863 | |
|
1852 | 1864 | /* Match Loop */ |
|
1853 | 1865 | while (ip < ilimit) { |
|
1854 | 1866 | size_t matchLength=0; |
|
1855 | 1867 | size_t offset=0; |
|
1856 | 1868 | const BYTE* start=ip+1; |
|
1857 | 1869 | |
|
1858 | 1870 | /* check repCode */ |
|
1859 | 1871 | if ((offset_1>0) & (MEM_read32(ip+1) == MEM_read32(ip+1 - offset_1))) { |
|
1860 | 1872 | /* repcode : we take it */ |
|
1861 | 1873 | matchLength = ZSTD_count(ip+1+EQUAL_READ32, ip+1+EQUAL_READ32-offset_1, iend) + EQUAL_READ32; |
|
1862 | 1874 | if (depth==0) goto _storeSequence; |
|
1863 | 1875 | } |
|
1864 | 1876 | |
|
1865 | 1877 | /* first search (depth 0) */ |
|
1866 | 1878 | { size_t offsetFound = 99999999; |
|
1867 | 1879 | size_t const ml2 = searchMax(ctx, ip, iend, &offsetFound, maxSearches, mls); |
|
1868 | 1880 | if (ml2 > matchLength) |
|
1869 | 1881 | matchLength = ml2, start = ip, offset=offsetFound; |
|
1870 | 1882 | } |
|
1871 | 1883 | |
|
1872 | 1884 | if (matchLength < EQUAL_READ32) { |
|
1873 | 1885 | ip += ((ip-anchor) >> g_searchStrength) + 1; /* jump faster over incompressible sections */ |
|
1874 | 1886 | continue; |
|
1875 | 1887 | } |
|
1876 | 1888 | |
|
1877 | 1889 | /* let's try to find a better solution */ |
|
1878 | 1890 | if (depth>=1) |
|
1879 | 1891 | while (ip<ilimit) { |
|
1880 | 1892 | ip ++; |
|
1881 | 1893 | if ((offset) && ((offset_1>0) & (MEM_read32(ip) == MEM_read32(ip - offset_1)))) { |
|
1882 | 1894 | size_t const mlRep = ZSTD_count(ip+EQUAL_READ32, ip+EQUAL_READ32-offset_1, iend) + EQUAL_READ32; |
|
1883 | 1895 | int const gain2 = (int)(mlRep * 3); |
|
1884 | 1896 | int const gain1 = (int)(matchLength*3 - ZSTD_highbit32((U32)offset+1) + 1); |
|
1885 | 1897 | if ((mlRep >= EQUAL_READ32) && (gain2 > gain1)) |
|
1886 | 1898 | matchLength = mlRep, offset = 0, start = ip; |
|
1887 | 1899 | } |
|
1888 | 1900 | { size_t offset2=99999999; |
|
1889 | 1901 | size_t const ml2 = searchMax(ctx, ip, iend, &offset2, maxSearches, mls); |
|
1890 | 1902 | int const gain2 = (int)(ml2*4 - ZSTD_highbit32((U32)offset2+1)); /* raw approx */ |
|
1891 | 1903 | int const gain1 = (int)(matchLength*4 - ZSTD_highbit32((U32)offset+1) + 4); |
|
1892 | 1904 | if ((ml2 >= EQUAL_READ32) && (gain2 > gain1)) { |
|
1893 | 1905 | matchLength = ml2, offset = offset2, start = ip; |
|
1894 | 1906 | continue; /* search a better one */ |
|
1895 | 1907 | } } |
|
1896 | 1908 | |
|
1897 | 1909 | /* let's find an even better one */ |
|
1898 | 1910 | if ((depth==2) && (ip<ilimit)) { |
|
1899 | 1911 | ip ++; |
|
1900 | 1912 | if ((offset) && ((offset_1>0) & (MEM_read32(ip) == MEM_read32(ip - offset_1)))) { |
|
1901 | 1913 | size_t const ml2 = ZSTD_count(ip+EQUAL_READ32, ip+EQUAL_READ32-offset_1, iend) + EQUAL_READ32; |
|
1902 | 1914 | int const gain2 = (int)(ml2 * 4); |
|
1903 | 1915 | int const gain1 = (int)(matchLength*4 - ZSTD_highbit32((U32)offset+1) + 1); |
|
1904 | 1916 | if ((ml2 >= EQUAL_READ32) && (gain2 > gain1)) |
|
1905 | 1917 | matchLength = ml2, offset = 0, start = ip; |
|
1906 | 1918 | } |
|
1907 | 1919 | { size_t offset2=99999999; |
|
1908 | 1920 | size_t const ml2 = searchMax(ctx, ip, iend, &offset2, maxSearches, mls); |
|
1909 | 1921 | int const gain2 = (int)(ml2*4 - ZSTD_highbit32((U32)offset2+1)); /* raw approx */ |
|
1910 | 1922 | int const gain1 = (int)(matchLength*4 - ZSTD_highbit32((U32)offset+1) + 7); |
|
1911 | 1923 | if ((ml2 >= EQUAL_READ32) && (gain2 > gain1)) { |
|
1912 | 1924 | matchLength = ml2, offset = offset2, start = ip; |
|
1913 | 1925 | continue; |
|
1914 | 1926 | } } } |
|
1915 | 1927 | break; /* nothing found : store previous solution */ |
|
1916 | 1928 | } |
|
1917 | 1929 | |
|
1918 | 1930 | /* catch up */ |
|
1919 | 1931 | if (offset) { |
|
1920 | 1932 | while ((start>anchor) && (start>base+offset-ZSTD_REP_MOVE) && (start[-1] == start[-1-offset+ZSTD_REP_MOVE])) /* only search for offset within prefix */ |
|
1921 | 1933 | { start--; matchLength++; } |
|
1922 | 1934 | offset_2 = offset_1; offset_1 = (U32)(offset - ZSTD_REP_MOVE); |
|
1923 | 1935 | } |
|
1924 | 1936 | |
|
1925 | 1937 | /* store sequence */ |
|
1926 | 1938 | _storeSequence: |
|
1927 | 1939 | { size_t const litLength = start - anchor; |
|
1928 | 1940 | ZSTD_storeSeq(seqStorePtr, litLength, anchor, (U32)offset, matchLength-MINMATCH); |
|
1929 | 1941 | anchor = ip = start + matchLength; |
|
1930 | 1942 | } |
|
1931 | 1943 | |
|
1932 | 1944 | /* check immediate repcode */ |
|
1933 | 1945 | while ( (ip <= ilimit) |
|
1934 | 1946 | && ((offset_2>0) |
|
1935 | 1947 | & (MEM_read32(ip) == MEM_read32(ip - offset_2)) )) { |
|
1936 | 1948 | /* store sequence */ |
|
1937 | 1949 | matchLength = ZSTD_count(ip+EQUAL_READ32, ip+EQUAL_READ32-offset_2, iend) + EQUAL_READ32; |
|
1938 | 1950 | offset = offset_2; offset_2 = offset_1; offset_1 = (U32)offset; /* swap repcodes */ |
|
1939 | 1951 | ZSTD_storeSeq(seqStorePtr, 0, anchor, 0, matchLength-MINMATCH); |
|
1940 | 1952 | ip += matchLength; |
|
1941 | 1953 | anchor = ip; |
|
1942 | 1954 | continue; /* faster when present ... (?) */ |
|
1943 | 1955 | } } |
|
1944 | 1956 | |
|
1945 | 1957 | /* Save reps for next block */ |
|
1946 | 1958 | ctx->savedRep[0] = offset_1 ? offset_1 : savedOffset; |
|
1947 | 1959 | ctx->savedRep[1] = offset_2 ? offset_2 : savedOffset; |
|
1948 | 1960 | |
|
1949 | 1961 | /* Last Literals */ |
|
1950 | 1962 | { size_t const lastLLSize = iend - anchor; |
|
1951 | 1963 | memcpy(seqStorePtr->lit, anchor, lastLLSize); |
|
1952 | 1964 | seqStorePtr->lit += lastLLSize; |
|
1953 | 1965 | } |
|
1954 | 1966 | } |
|
1955 | 1967 | |
|
1956 | 1968 | |
|
1957 | 1969 | static void ZSTD_compressBlock_btlazy2(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
1958 | 1970 | { |
|
1959 | 1971 | ZSTD_compressBlock_lazy_generic(ctx, src, srcSize, 1, 2); |
|
1960 | 1972 | } |
|
1961 | 1973 | |
|
1962 | 1974 | static void ZSTD_compressBlock_lazy2(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
1963 | 1975 | { |
|
1964 | 1976 | ZSTD_compressBlock_lazy_generic(ctx, src, srcSize, 0, 2); |
|
1965 | 1977 | } |
|
1966 | 1978 | |
|
1967 | 1979 | static void ZSTD_compressBlock_lazy(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
1968 | 1980 | { |
|
1969 | 1981 | ZSTD_compressBlock_lazy_generic(ctx, src, srcSize, 0, 1); |
|
1970 | 1982 | } |
|
1971 | 1983 | |
|
1972 | 1984 | static void ZSTD_compressBlock_greedy(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
1973 | 1985 | { |
|
1974 | 1986 | ZSTD_compressBlock_lazy_generic(ctx, src, srcSize, 0, 0); |
|
1975 | 1987 | } |
|
1976 | 1988 | |
|
1977 | 1989 | |
|
1978 | 1990 | FORCE_INLINE |
|
1979 | 1991 | void ZSTD_compressBlock_lazy_extDict_generic(ZSTD_CCtx* ctx, |
|
1980 | 1992 | const void* src, size_t srcSize, |
|
1981 | 1993 | const U32 searchMethod, const U32 depth) |
|
1982 | 1994 | { |
|
1983 | 1995 | seqStore_t* seqStorePtr = &(ctx->seqStore); |
|
1984 | 1996 | const BYTE* const istart = (const BYTE*)src; |
|
1985 | 1997 | const BYTE* ip = istart; |
|
1986 | 1998 | const BYTE* anchor = istart; |
|
1987 | 1999 | const BYTE* const iend = istart + srcSize; |
|
1988 | 2000 | const BYTE* const ilimit = iend - 8; |
|
1989 | 2001 | const BYTE* const base = ctx->base; |
|
1990 | 2002 | const U32 dictLimit = ctx->dictLimit; |
|
1991 | 2003 | const U32 lowestIndex = ctx->lowLimit; |
|
1992 | 2004 | const BYTE* const prefixStart = base + dictLimit; |
|
1993 | 2005 | const BYTE* const dictBase = ctx->dictBase; |
|
1994 | 2006 | const BYTE* const dictEnd = dictBase + dictLimit; |
|
1995 | 2007 | const BYTE* const dictStart = dictBase + ctx->lowLimit; |
|
1996 | 2008 | |
|
1997 | 2009 | const U32 maxSearches = 1 << ctx->params.cParams.searchLog; |
|
1998 | 2010 | const U32 mls = ctx->params.cParams.searchLength; |
|
1999 | 2011 | |
|
2000 | 2012 | typedef size_t (*searchMax_f)(ZSTD_CCtx* zc, const BYTE* ip, const BYTE* iLimit, |
|
2001 | 2013 | size_t* offsetPtr, |
|
2002 | 2014 | U32 maxNbAttempts, U32 matchLengthSearch); |
|
2003 | 2015 | searchMax_f searchMax = searchMethod ? ZSTD_BtFindBestMatch_selectMLS_extDict : ZSTD_HcFindBestMatch_extDict_selectMLS; |
|
2004 | 2016 | |
|
2005 | 2017 | U32 offset_1 = ctx->rep[0], offset_2 = ctx->rep[1]; |
|
2006 | 2018 | |
|
2007 | 2019 | /* init */ |
|
2008 | 2020 | ctx->nextToUpdate3 = ctx->nextToUpdate; |
|
2009 | 2021 | ip += (ip == prefixStart); |
|
2010 | 2022 | |
|
2011 | 2023 | /* Match Loop */ |
|
2012 | 2024 | while (ip < ilimit) { |
|
2013 | 2025 | size_t matchLength=0; |
|
2014 | 2026 | size_t offset=0; |
|
2015 | 2027 | const BYTE* start=ip+1; |
|
2016 | 2028 | U32 current = (U32)(ip-base); |
|
2017 | 2029 | |
|
2018 | 2030 | /* check repCode */ |
|
2019 | 2031 | { const U32 repIndex = (U32)(current+1 - offset_1); |
|
2020 | 2032 | const BYTE* const repBase = repIndex < dictLimit ? dictBase : base; |
|
2021 | 2033 | const BYTE* const repMatch = repBase + repIndex; |
|
2022 | 2034 | if (((U32)((dictLimit-1) - repIndex) >= 3) & (repIndex > lowestIndex)) /* intentional overflow */ |
|
2023 | 2035 | if (MEM_read32(ip+1) == MEM_read32(repMatch)) { |
|
2024 | 2036 | /* repcode detected, we should take it */ |
|
2025 | 2037 | const BYTE* const repEnd = repIndex < dictLimit ? dictEnd : iend; |
|
2026 | 2038 | matchLength = ZSTD_count_2segments(ip+1+EQUAL_READ32, repMatch+EQUAL_READ32, iend, repEnd, prefixStart) + EQUAL_READ32; |
|
2027 | 2039 | if (depth==0) goto _storeSequence; |
|
2028 | 2040 | } } |
|
2029 | 2041 | |
|
2030 | 2042 | /* first search (depth 0) */ |
|
2031 | 2043 | { size_t offsetFound = 99999999; |
|
2032 | 2044 | size_t const ml2 = searchMax(ctx, ip, iend, &offsetFound, maxSearches, mls); |
|
2033 | 2045 | if (ml2 > matchLength) |
|
2034 | 2046 | matchLength = ml2, start = ip, offset=offsetFound; |
|
2035 | 2047 | } |
|
2036 | 2048 | |
|
2037 | 2049 | if (matchLength < EQUAL_READ32) { |
|
2038 | 2050 | ip += ((ip-anchor) >> g_searchStrength) + 1; /* jump faster over incompressible sections */ |
|
2039 | 2051 | continue; |
|
2040 | 2052 | } |
|
2041 | 2053 | |
|
2042 | 2054 | /* let's try to find a better solution */ |
|
2043 | 2055 | if (depth>=1) |
|
2044 | 2056 | while (ip<ilimit) { |
|
2045 | 2057 | ip ++; |
|
2046 | 2058 | current++; |
|
2047 | 2059 | /* check repCode */ |
|
2048 | 2060 | if (offset) { |
|
2049 | 2061 | const U32 repIndex = (U32)(current - offset_1); |
|
2050 | 2062 | const BYTE* const repBase = repIndex < dictLimit ? dictBase : base; |
|
2051 | 2063 | const BYTE* const repMatch = repBase + repIndex; |
|
2052 | 2064 | if (((U32)((dictLimit-1) - repIndex) >= 3) & (repIndex > lowestIndex)) /* intentional overflow */ |
|
2053 | 2065 | if (MEM_read32(ip) == MEM_read32(repMatch)) { |
|
2054 | 2066 | /* repcode detected */ |
|
2055 | 2067 | const BYTE* const repEnd = repIndex < dictLimit ? dictEnd : iend; |
|
2056 | 2068 | size_t const repLength = ZSTD_count_2segments(ip+EQUAL_READ32, repMatch+EQUAL_READ32, iend, repEnd, prefixStart) + EQUAL_READ32; |
|
2057 | 2069 | int const gain2 = (int)(repLength * 3); |
|
2058 | 2070 | int const gain1 = (int)(matchLength*3 - ZSTD_highbit32((U32)offset+1) + 1); |
|
2059 | 2071 | if ((repLength >= EQUAL_READ32) && (gain2 > gain1)) |
|
2060 | 2072 | matchLength = repLength, offset = 0, start = ip; |
|
2061 | 2073 | } } |
|
2062 | 2074 | |
|
2063 | 2075 | /* search match, depth 1 */ |
|
2064 | 2076 | { size_t offset2=99999999; |
|
2065 | 2077 | size_t const ml2 = searchMax(ctx, ip, iend, &offset2, maxSearches, mls); |
|
2066 | 2078 | int const gain2 = (int)(ml2*4 - ZSTD_highbit32((U32)offset2+1)); /* raw approx */ |
|
2067 | 2079 | int const gain1 = (int)(matchLength*4 - ZSTD_highbit32((U32)offset+1) + 4); |
|
2068 | 2080 | if ((ml2 >= EQUAL_READ32) && (gain2 > gain1)) { |
|
2069 | 2081 | matchLength = ml2, offset = offset2, start = ip; |
|
2070 | 2082 | continue; /* search a better one */ |
|
2071 | 2083 | } } |
|
2072 | 2084 | |
|
2073 | 2085 | /* let's find an even better one */ |
|
2074 | 2086 | if ((depth==2) && (ip<ilimit)) { |
|
2075 | 2087 | ip ++; |
|
2076 | 2088 | current++; |
|
2077 | 2089 | /* check repCode */ |
|
2078 | 2090 | if (offset) { |
|
2079 | 2091 | const U32 repIndex = (U32)(current - offset_1); |
|
2080 | 2092 | const BYTE* const repBase = repIndex < dictLimit ? dictBase : base; |
|
2081 | 2093 | const BYTE* const repMatch = repBase + repIndex; |
|
2082 | 2094 | if (((U32)((dictLimit-1) - repIndex) >= 3) & (repIndex > lowestIndex)) /* intentional overflow */ |
|
2083 | 2095 | if (MEM_read32(ip) == MEM_read32(repMatch)) { |
|
2084 | 2096 | /* repcode detected */ |
|
2085 | 2097 | const BYTE* const repEnd = repIndex < dictLimit ? dictEnd : iend; |
|
2086 | 2098 | size_t repLength = ZSTD_count_2segments(ip+EQUAL_READ32, repMatch+EQUAL_READ32, iend, repEnd, prefixStart) + EQUAL_READ32; |
|
2087 | 2099 | int gain2 = (int)(repLength * 4); |
|
2088 | 2100 | int gain1 = (int)(matchLength*4 - ZSTD_highbit32((U32)offset+1) + 1); |
|
2089 | 2101 | if ((repLength >= EQUAL_READ32) && (gain2 > gain1)) |
|
2090 | 2102 | matchLength = repLength, offset = 0, start = ip; |
|
2091 | 2103 | } } |
|
2092 | 2104 | |
|
2093 | 2105 | /* search match, depth 2 */ |
|
2094 | 2106 | { size_t offset2=99999999; |
|
2095 | 2107 | size_t const ml2 = searchMax(ctx, ip, iend, &offset2, maxSearches, mls); |
|
2096 | 2108 | int const gain2 = (int)(ml2*4 - ZSTD_highbit32((U32)offset2+1)); /* raw approx */ |
|
2097 | 2109 | int const gain1 = (int)(matchLength*4 - ZSTD_highbit32((U32)offset+1) + 7); |
|
2098 | 2110 | if ((ml2 >= EQUAL_READ32) && (gain2 > gain1)) { |
|
2099 | 2111 | matchLength = ml2, offset = offset2, start = ip; |
|
2100 | 2112 | continue; |
|
2101 | 2113 | } } } |
|
2102 | 2114 | break; /* nothing found : store previous solution */ |
|
2103 | 2115 | } |
|
2104 | 2116 | |
|
2105 | 2117 | /* catch up */ |
|
2106 | 2118 | if (offset) { |
|
2107 | 2119 | U32 const matchIndex = (U32)((start-base) - (offset - ZSTD_REP_MOVE)); |
|
2108 | 2120 | const BYTE* match = (matchIndex < dictLimit) ? dictBase + matchIndex : base + matchIndex; |
|
2109 | 2121 | const BYTE* const mStart = (matchIndex < dictLimit) ? dictStart : prefixStart; |
|
2110 | 2122 | while ((start>anchor) && (match>mStart) && (start[-1] == match[-1])) { start--; match--; matchLength++; } /* catch up */ |
|
2111 | 2123 | offset_2 = offset_1; offset_1 = (U32)(offset - ZSTD_REP_MOVE); |
|
2112 | 2124 | } |
|
2113 | 2125 | |
|
2114 | 2126 | /* store sequence */ |
|
2115 | 2127 | _storeSequence: |
|
2116 | 2128 | { size_t const litLength = start - anchor; |
|
2117 | 2129 | ZSTD_storeSeq(seqStorePtr, litLength, anchor, (U32)offset, matchLength-MINMATCH); |
|
2118 | 2130 | anchor = ip = start + matchLength; |
|
2119 | 2131 | } |
|
2120 | 2132 | |
|
2121 | 2133 | /* check immediate repcode */ |
|
2122 | 2134 | while (ip <= ilimit) { |
|
2123 | 2135 | const U32 repIndex = (U32)((ip-base) - offset_2); |
|
2124 | 2136 | const BYTE* const repBase = repIndex < dictLimit ? dictBase : base; |
|
2125 | 2137 | const BYTE* const repMatch = repBase + repIndex; |
|
2126 | 2138 | if (((U32)((dictLimit-1) - repIndex) >= 3) & (repIndex > lowestIndex)) /* intentional overflow */ |
|
2127 | 2139 | if (MEM_read32(ip) == MEM_read32(repMatch)) { |
|
2128 | 2140 | /* repcode detected, we should take it */ |
|
2129 | 2141 | const BYTE* const repEnd = repIndex < dictLimit ? dictEnd : iend; |
|
2130 | 2142 | matchLength = ZSTD_count_2segments(ip+EQUAL_READ32, repMatch+EQUAL_READ32, iend, repEnd, prefixStart) + EQUAL_READ32; |
|
2131 | 2143 | offset = offset_2; offset_2 = offset_1; offset_1 = (U32)offset; /* swap offset history */ |
|
2132 | 2144 | ZSTD_storeSeq(seqStorePtr, 0, anchor, 0, matchLength-MINMATCH); |
|
2133 | 2145 | ip += matchLength; |
|
2134 | 2146 | anchor = ip; |
|
2135 | 2147 | continue; /* faster when present ... (?) */ |
|
2136 | 2148 | } |
|
2137 | 2149 | break; |
|
2138 | 2150 | } } |
|
2139 | 2151 | |
|
2140 | 2152 | /* Save reps for next block */ |
|
2141 | 2153 | ctx->savedRep[0] = offset_1; ctx->savedRep[1] = offset_2; |
|
2142 | 2154 | |
|
2143 | 2155 | /* Last Literals */ |
|
2144 | 2156 | { size_t const lastLLSize = iend - anchor; |
|
2145 | 2157 | memcpy(seqStorePtr->lit, anchor, lastLLSize); |
|
2146 | 2158 | seqStorePtr->lit += lastLLSize; |
|
2147 | 2159 | } |
|
2148 | 2160 | } |
|
2149 | 2161 | |
|
2150 | 2162 | |
|
2151 | 2163 | void ZSTD_compressBlock_greedy_extDict(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2152 | 2164 | { |
|
2153 | 2165 | ZSTD_compressBlock_lazy_extDict_generic(ctx, src, srcSize, 0, 0); |
|
2154 | 2166 | } |
|
2155 | 2167 | |
|
2156 | 2168 | static void ZSTD_compressBlock_lazy_extDict(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2157 | 2169 | { |
|
2158 | 2170 | ZSTD_compressBlock_lazy_extDict_generic(ctx, src, srcSize, 0, 1); |
|
2159 | 2171 | } |
|
2160 | 2172 | |
|
2161 | 2173 | static void ZSTD_compressBlock_lazy2_extDict(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2162 | 2174 | { |
|
2163 | 2175 | ZSTD_compressBlock_lazy_extDict_generic(ctx, src, srcSize, 0, 2); |
|
2164 | 2176 | } |
|
2165 | 2177 | |
|
2166 | 2178 | static void ZSTD_compressBlock_btlazy2_extDict(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2167 | 2179 | { |
|
2168 | 2180 | ZSTD_compressBlock_lazy_extDict_generic(ctx, src, srcSize, 1, 2); |
|
2169 | 2181 | } |
|
2170 | 2182 | |
|
2171 | 2183 | |
|
2172 | 2184 | /* The optimal parser */ |
|
2173 | 2185 | #include "zstd_opt.h" |
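/* note (annotation) : ZSTD_OPT_H_91842398743 is the include guard defined by
 * zstd_opt.h; the #ifdef stubs below degrade to no-ops in builds where the
 * optimal parser is compiled out, so the btopt/btopt2 dispatch entries stay
 * valid either way. */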
|
2174 | 2186 | |
|
2175 | 2187 | static void ZSTD_compressBlock_btopt(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2176 | 2188 | { |
|
2177 | 2189 | #ifdef ZSTD_OPT_H_91842398743 |
|
2178 | 2190 | ZSTD_compressBlock_opt_generic(ctx, src, srcSize, 0); |
|
2179 | 2191 | #else |
|
2180 | 2192 | (void)ctx; (void)src; (void)srcSize; |
|
2181 | 2193 | return; |
|
2182 | 2194 | #endif |
|
2183 | 2195 | } |
|
2184 | 2196 | |
|
2185 | 2197 | static void ZSTD_compressBlock_btopt2(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2186 | 2198 | { |
|
2187 | 2199 | #ifdef ZSTD_OPT_H_91842398743 |
|
2188 | 2200 | ZSTD_compressBlock_opt_generic(ctx, src, srcSize, 1); |
|
2189 | 2201 | #else |
|
2190 | 2202 | (void)ctx; (void)src; (void)srcSize; |
|
2191 | 2203 | return; |
|
2192 | 2204 | #endif |
|
2193 | 2205 | } |
|
2194 | 2206 | |
|
2195 | 2207 | static void ZSTD_compressBlock_btopt_extDict(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2196 | 2208 | { |
|
2197 | 2209 | #ifdef ZSTD_OPT_H_91842398743 |
|
2198 | 2210 | ZSTD_compressBlock_opt_extDict_generic(ctx, src, srcSize, 0); |
|
2199 | 2211 | #else |
|
2200 | 2212 | (void)ctx; (void)src; (void)srcSize; |
|
2201 | 2213 | return; |
|
2202 | 2214 | #endif |
|
2203 | 2215 | } |
|
2204 | 2216 | |
|
2205 | 2217 | static void ZSTD_compressBlock_btopt2_extDict(ZSTD_CCtx* ctx, const void* src, size_t srcSize) |
|
2206 | 2218 | { |
|
2207 | 2219 | #ifdef ZSTD_OPT_H_91842398743 |
|
2208 | 2220 | ZSTD_compressBlock_opt_extDict_generic(ctx, src, srcSize, 1); |
|
2209 | 2221 | #else |
|
2210 | 2222 | (void)ctx; (void)src; (void)srcSize; |
|
2211 | 2223 | return; |
|
2212 | 2224 | #endif |
|
2213 | 2225 | } |
|
2214 | 2226 | |
|
2215 | 2227 | |
|
2216 | 2228 | typedef void (*ZSTD_blockCompressor) (ZSTD_CCtx* ctx, const void* src, size_t srcSize); |
|
2217 | 2229 | |
|
2218 | 2230 | static ZSTD_blockCompressor ZSTD_selectBlockCompressor(ZSTD_strategy strat, int extDict) |
|
2219 | 2231 | { |
|
2220 | 2232 | static const ZSTD_blockCompressor blockCompressor[2][8] = { |
|
2221 | 2233 | { ZSTD_compressBlock_fast, ZSTD_compressBlock_doubleFast, ZSTD_compressBlock_greedy, ZSTD_compressBlock_lazy, ZSTD_compressBlock_lazy2, ZSTD_compressBlock_btlazy2, ZSTD_compressBlock_btopt, ZSTD_compressBlock_btopt2 }, |
|
2222 | 2234 | { ZSTD_compressBlock_fast_extDict, ZSTD_compressBlock_doubleFast_extDict, ZSTD_compressBlock_greedy_extDict, ZSTD_compressBlock_lazy_extDict,ZSTD_compressBlock_lazy2_extDict, ZSTD_compressBlock_btlazy2_extDict, ZSTD_compressBlock_btopt_extDict, ZSTD_compressBlock_btopt2_extDict } |
|
2223 | 2235 | }; |
|
2224 | 2236 | |
|
2225 | 2237 | return blockCompressor[extDict][(U32)strat]; |
|
2226 | 2238 | } |
|
2227 | 2239 | |
|
2228 | 2240 | |
|
2229 | 2241 | static size_t ZSTD_compressBlock_internal(ZSTD_CCtx* zc, void* dst, size_t dstCapacity, const void* src, size_t srcSize) |
|
2230 | 2242 | { |
|
2231 | 2243 | ZSTD_blockCompressor const blockCompressor = ZSTD_selectBlockCompressor(zc->params.cParams.strategy, zc->lowLimit < zc->dictLimit); |
|
2232 | 2244 | const BYTE* const base = zc->base; |
|
2233 | 2245 | const BYTE* const istart = (const BYTE*)src; |
|
2234 | 2246 | const U32 current = (U32)(istart-base); |
|
2235 | 2247 | if (srcSize < MIN_CBLOCK_SIZE+ZSTD_blockHeaderSize+1) return 0; /* don't even attempt compression below a certain srcSize */ |
|
2236 | 2248 | ZSTD_resetSeqStore(&(zc->seqStore)); |
|
2237 | 2249 | if (current > zc->nextToUpdate + 384) |
|
2238 | 2250 | zc->nextToUpdate = current - MIN(192, (U32)(current - zc->nextToUpdate - 384)); /* update tree not updated after finding very long rep matches */ |
|
2239 | 2251 | blockCompressor(zc, src, srcSize); |
|
2240 | 2252 | return ZSTD_compressSequences(zc, dst, dstCapacity, srcSize); |
|
2241 | 2253 | } |
|
2242 | 2254 | |
|
2243 | 2255 | |
|
2244 | 2256 | /*! ZSTD_compress_generic() : |
|
2245 | 2257 | * Compress a chunk of data into one or multiple blocks. |
|
2246 | 2258 | * All blocks will be terminated, all input will be consumed. |
|
2247 | 2259 | * Function will issue an error if there is not enough `dstCapacity` to hold the compressed content. |
|
2248 | 2260 | * The frame is assumed to be already started (header already produced).
|
2249 | 2261 | * @return : compressed size, or an error code |
|
2250 | 2262 | */ |
|
2251 | 2263 | static size_t ZSTD_compress_generic (ZSTD_CCtx* cctx, |
|
2252 | 2264 | void* dst, size_t dstCapacity, |
|
2253 | 2265 | const void* src, size_t srcSize, |
|
2254 | 2266 | U32 lastFrameChunk) |
|
2255 | 2267 | { |
|
2256 | 2268 | size_t blockSize = cctx->blockSize; |
|
2257 | 2269 | size_t remaining = srcSize; |
|
2258 | 2270 | const BYTE* ip = (const BYTE*)src; |
|
2259 | 2271 | BYTE* const ostart = (BYTE*)dst; |
|
2260 | 2272 | BYTE* op = ostart; |
|
2261 | 2273 | U32 const maxDist = 1 << cctx->params.cParams.windowLog; |
|
2262 | 2274 | |
|
2263 | 2275 | if (cctx->params.fParams.checksumFlag && srcSize) |
|
2264 | 2276 | XXH64_update(&cctx->xxhState, src, srcSize); |
|
2265 | 2277 | |
|
2266 | 2278 | while (remaining) { |
|
2267 | 2279 | U32 const lastBlock = lastFrameChunk & (blockSize >= remaining); |
|
2268 | 2280 | size_t cSize; |
|
2269 | 2281 | |
|
2270 | 2282 | if (dstCapacity < ZSTD_blockHeaderSize + MIN_CBLOCK_SIZE) return ERROR(dstSize_tooSmall); /* not enough space to store compressed block */ |
|
2271 | 2283 | if (remaining < blockSize) blockSize = remaining; |
|
2272 | 2284 | |
|
2273 | 2285 | /* preemptive overflow correction */ |
|
2274 | if (cctx->lowLimit > (1<<30)) { | 

2275 | U32 const btplus = (cctx->params.cParams.strategy == ZSTD_btlazy2) | (cctx->params.cParams.strategy == ZSTD_btopt) | (cctx->params.cParams.strategy == ZSTD_btopt2); | |
|
2276 | U32 const chainMask = (1 << (cctx->params.cParams.chainLog - btplus)) - 1; | |
|
2277 | U32 const supLog = MAX(cctx->params.cParams.chainLog, 17 /* blockSize */); | 

2278 | U32 const newLowLimit = (cctx->lowLimit & chainMask) + (1 << supLog); /* preserve position % chainSize, ensure current-repcode doesn't underflow */ | |
|
2279 | U32 const correction = cctx->lowLimit - newLowLimit; | |
|
2286 | if (cctx->lowLimit > (2U<<30)) { | |
|
2287 | U32 const cycleMask = (1 << ZSTD_cycleLog(cctx->params.cParams.hashLog, cctx->params.cParams.strategy)) - 1; | |
|
2288 | U32 const current = (U32)(ip - cctx->base); | |
|
2289 | U32 const newCurrent = (current & cycleMask) + (1 << cctx->params.cParams.windowLog); | |
|
2290 | U32 const correction = current - newCurrent; | |
|
2291 | ZSTD_STATIC_ASSERT(ZSTD_WINDOWLOG_MAX_64 <= 30); | |
|
2280 | 2292 | ZSTD_reduceIndex(cctx, correction); |
|
2281 | 2293 | cctx->base += correction; |
|
2282 | 2294 | cctx->dictBase += correction; |
|
2283 | cctx->lowLimit = newLowLimit; | 

2295 | cctx->lowLimit -= correction; | |
|
2284 | 2296 | cctx->dictLimit -= correction; |
|
2285 | 2297 | if (cctx->nextToUpdate < correction) cctx->nextToUpdate = 0; |
|
2286 | 2298 | else cctx->nextToUpdate -= correction; |
|
2287 | 2299 | } |
|
2288 | 2300 | |
|
2289 | 2301 | if ((U32)(ip+blockSize - cctx->base) > cctx->loadedDictEnd + maxDist) { |
|
2290 | 2302 | /* enforce maxDist */ |
|
2291 | 2303 | U32 const newLowLimit = (U32)(ip+blockSize - cctx->base) - maxDist; |
|
2292 | 2304 | if (cctx->lowLimit < newLowLimit) cctx->lowLimit = newLowLimit; |
|
2293 | 2305 | if (cctx->dictLimit < cctx->lowLimit) cctx->dictLimit = cctx->lowLimit; |
|
2294 | 2306 | } |
|
2295 | 2307 | |
|
2296 | 2308 | cSize = ZSTD_compressBlock_internal(cctx, op+ZSTD_blockHeaderSize, dstCapacity-ZSTD_blockHeaderSize, ip, blockSize); |
|
2297 | 2309 | if (ZSTD_isError(cSize)) return cSize; |
|
2298 | 2310 | |
|
2299 | 2311 | if (cSize == 0) { /* block is not compressible */ |
|
2300 | 2312 | U32 const cBlockHeader24 = lastBlock + (((U32)bt_raw)<<1) + (U32)(blockSize << 3); |
|
2301 | 2313 | if (blockSize + ZSTD_blockHeaderSize > dstCapacity) return ERROR(dstSize_tooSmall); |
|
2302 | 2314 | MEM_writeLE32(op, cBlockHeader24); /* no problem : 4th byte will be overwritten */
|
2303 | 2315 | memcpy(op + ZSTD_blockHeaderSize, ip, blockSize); |
|
2304 | 2316 | cSize = ZSTD_blockHeaderSize+blockSize; |
|
2305 | 2317 | } else { |
|
2306 | 2318 | U32 const cBlockHeader24 = lastBlock + (((U32)bt_compressed)<<1) + (U32)(cSize << 3); |
|
2307 | 2319 | MEM_writeLE24(op, cBlockHeader24); |
|
2308 | 2320 | cSize += ZSTD_blockHeaderSize; |
|
2309 | 2321 | } |
|
2310 | 2322 | |
|
2311 | 2323 | remaining -= blockSize; |
|
2312 | 2324 | dstCapacity -= cSize; |
|
2313 | 2325 | ip += blockSize; |
|
2314 | 2326 | op += cSize; |
|
2315 | 2327 | } |
|
2316 | 2328 | |
|
2317 | 2329 | if (lastFrameChunk && (op>ostart)) cctx->stage = ZSTDcs_ending; |
|
2318 | 2330 | return op-ostart; |
|
2319 | 2331 | } |
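
A note on the `cBlockHeader24` expressions above: every zstd block begins with a
3-byte little-endian header packing a last-block bit, a 2-bit block type, and a
21-bit size. A minimal sketch of that packing and its inverse (the helper names
are illustrative, not part of zstd):

    #include <stdint.h>

    /* bit 0 = last-block flag, bits 1-2 = block type (bt_raw or
     * bt_compressed above), bits 3-23 = block size in bytes */
    static uint32_t pack_block_header(uint32_t lastBlock, uint32_t blockType,
                                      uint32_t blockSize)
    {
        return lastBlock + (blockType << 1) + (blockSize << 3);
    }

    static void unpack_block_header(uint32_t h, uint32_t* lastBlock,
                                    uint32_t* blockType, uint32_t* blockSize)
    {
        *lastBlock = h & 1;
        *blockType = (h >> 1) & 3;
        *blockSize = h >> 3;
    }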
|
2320 | 2332 | |
|
2321 | 2333 | |
|
2322 | 2334 | static size_t ZSTD_writeFrameHeader(void* dst, size_t dstCapacity, |
|
2323 | 2335 | ZSTD_parameters params, U64 pledgedSrcSize, U32 dictID) |
|
2324 | 2336 | { BYTE* const op = (BYTE*)dst; |
|
2325 | 2337 | U32 const dictIDSizeCode = (dictID>0) + (dictID>=256) + (dictID>=65536); /* 0-3 */ |
|
2326 | 2338 | U32 const checksumFlag = params.fParams.checksumFlag>0; |
|
2327 | 2339 | U32 const windowSize = 1U << params.cParams.windowLog; |
|
2328 | 2340 | U32 const singleSegment = params.fParams.contentSizeFlag && (windowSize > (pledgedSrcSize-1)); |
|
2329 | 2341 | BYTE const windowLogByte = (BYTE)((params.cParams.windowLog - ZSTD_WINDOWLOG_ABSOLUTEMIN) << 3); |
|
2330 | 2342 | U32 const fcsCode = params.fParams.contentSizeFlag ? |
|
2331 | 2343 | (pledgedSrcSize>=256) + (pledgedSrcSize>=65536+256) + (pledgedSrcSize>=0xFFFFFFFFU) : /* 0-3 */ |
|
2332 | 2344 | 0; |
|
2333 | 2345 | BYTE const frameHeaderDescriptionByte = (BYTE)(dictIDSizeCode + (checksumFlag<<2) + (singleSegment<<5) + (fcsCode<<6) );
|
2334 | 2346 | size_t pos; |
|
2335 | 2347 | |
|
2336 | 2348 | if (dstCapacity < ZSTD_frameHeaderSize_max) return ERROR(dstSize_tooSmall); |
|
2337 | 2349 | |
|
2338 | 2350 | MEM_writeLE32(dst, ZSTD_MAGICNUMBER); |
|
2339 | 2351 | op[4] = frameHeaderDescriptionByte; pos=5;
|
2340 | 2352 | if (!singleSegment) op[pos++] = windowLogByte; |
|
2341 | 2353 | switch(dictIDSizeCode) |
|
2342 | 2354 | { |
|
2343 | 2355 | default: /* impossible */ |
|
2344 | 2356 | case 0 : break; |
|
2345 | 2357 | case 1 : op[pos] = (BYTE)(dictID); pos++; break; |
|
2346 | 2358 | case 2 : MEM_writeLE16(op+pos, (U16)dictID); pos+=2; break; |
|
2347 | 2359 | case 3 : MEM_writeLE32(op+pos, dictID); pos+=4; break; |
|
2348 | 2360 | } |
|
2349 | 2361 | switch(fcsCode) |
|
2350 | 2362 | { |
|
2351 | 2363 | default: /* impossible */ |
|
2352 | 2364 | case 0 : if (singleSegment) op[pos++] = (BYTE)(pledgedSrcSize); break; |
|
2353 | 2365 | case 1 : MEM_writeLE16(op+pos, (U16)(pledgedSrcSize-256)); pos+=2; break; |
|
2354 | 2366 | case 2 : MEM_writeLE32(op+pos, (U32)(pledgedSrcSize)); pos+=4; break; |
|
2355 | 2367 | case 3 : MEM_writeLE64(op+pos, (U64)(pledgedSrcSize)); pos+=8; break; |
|
2356 | 2368 | } |
|
2357 | 2369 | return pos; |
|
2358 | 2370 | } |
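
For reference, the frame header descriptor byte assembled above has the bit
layout shown in the comments below; a hedged sketch of the inverse decode
(type and function names are illustrative, not part of zstd):

    /* bits 0-1 : dictID size code (0, 1, 2 or 4 bytes, per the switch above)
     * bit  2   : content checksum flag
     * bit  5   : single-segment flag (window descriptor byte omitted)
     * bits 6-7 : frame content size code (field of 0, 2, 4 or 8 bytes) */
    typedef struct {
        unsigned dictIDSizeCode, checksumFlag, singleSegment, fcsCode;
    } frame_descriptor_t;

    static frame_descriptor_t decode_descriptor(unsigned char b)
    {
        frame_descriptor_t d;
        d.dictIDSizeCode = b & 3;
        d.checksumFlag   = (b >> 2) & 1;
        d.singleSegment  = (b >> 5) & 1;
        d.fcsCode        = (b >> 6) & 3;
        return d;
    }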
|
2359 | 2371 | |
|
2360 | 2372 | |
|
2361 | 2373 | static size_t ZSTD_compressContinue_internal (ZSTD_CCtx* cctx, |
|
2362 | 2374 | void* dst, size_t dstCapacity, |
|
2363 | 2375 | const void* src, size_t srcSize, |
|
2364 | 2376 | U32 frame, U32 lastFrameChunk) |
|
2365 | 2377 | { |
|
2366 | 2378 | const BYTE* const ip = (const BYTE*) src; |
|
2367 | 2379 | size_t fhSize = 0; |
|
2368 | 2380 | |
|
2369 | 2381 | if (cctx->stage==ZSTDcs_created) return ERROR(stage_wrong); /* missing init (ZSTD_compressBegin) */ |
|
2370 | 2382 | |
|
2371 | 2383 | if (frame && (cctx->stage==ZSTDcs_init)) { |
|
2372 | 2384 | fhSize = ZSTD_writeFrameHeader(dst, dstCapacity, cctx->params, cctx->frameContentSize, cctx->dictID); |
|
2373 | 2385 | if (ZSTD_isError(fhSize)) return fhSize; |
|
2374 | 2386 | dstCapacity -= fhSize; |
|
2375 | 2387 | dst = (char*)dst + fhSize; |
|
2376 | 2388 | cctx->stage = ZSTDcs_ongoing; |
|
2377 | 2389 | } |
|
2378 | 2390 | |
|
2379 | 2391 | /* Check if blocks follow each other */ |
|
2380 | 2392 | if (src != cctx->nextSrc) { |
|
2381 | 2393 | /* not contiguous */ |
|
2382 | 2394 | ptrdiff_t const delta = cctx->nextSrc - ip; |
|
2383 | 2395 | cctx->lowLimit = cctx->dictLimit; |
|
2384 | 2396 | cctx->dictLimit = (U32)(cctx->nextSrc - cctx->base); |
|
2385 | 2397 | cctx->dictBase = cctx->base; |
|
2386 | 2398 | cctx->base -= delta; |
|
2387 | 2399 | cctx->nextToUpdate = cctx->dictLimit; |
|
2388 | 2400 | if (cctx->dictLimit - cctx->lowLimit < HASH_READ_SIZE) cctx->lowLimit = cctx->dictLimit; /* too small extDict */ |
|
2389 | 2401 | } |
|
2390 | 2402 | |
|
2391 | 2403 | /* if input and dictionary overlap : reduce dictionary (area presumed modified by input) */ |
|
2392 | 2404 | if ((ip+srcSize > cctx->dictBase + cctx->lowLimit) & (ip < cctx->dictBase + cctx->dictLimit)) { |
|
2393 | 2405 | ptrdiff_t const highInputIdx = (ip + srcSize) - cctx->dictBase; |
|
2394 | 2406 | U32 const lowLimitMax = (highInputIdx > (ptrdiff_t)cctx->dictLimit) ? cctx->dictLimit : (U32)highInputIdx; |
|
2395 | 2407 | cctx->lowLimit = lowLimitMax; |
|
2396 | 2408 | } |
|
2397 | 2409 | |
|
2398 | 2410 | cctx->nextSrc = ip + srcSize; |
|
2399 | 2411 | |
|
2400 | 2412 | { size_t const cSize = frame ? |
|
2401 | 2413 | ZSTD_compress_generic (cctx, dst, dstCapacity, src, srcSize, lastFrameChunk) : |
|
2402 | 2414 | ZSTD_compressBlock_internal (cctx, dst, dstCapacity, src, srcSize); |
|
2403 | 2415 | if (ZSTD_isError(cSize)) return cSize; |
|
2404 | 2416 | return cSize + fhSize; |
|
2405 | 2417 | } |
|
2406 | 2418 | } |
|
2407 | 2419 | |
|
2408 | 2420 | |
|
2409 | 2421 | size_t ZSTD_compressContinue (ZSTD_CCtx* cctx, |
|
2410 | 2422 | void* dst, size_t dstCapacity, |
|
2411 | 2423 | const void* src, size_t srcSize) |
|
2412 | 2424 | { |
|
2413 | 2425 | return ZSTD_compressContinue_internal(cctx, dst, dstCapacity, src, srcSize, 1, 0); |
|
2414 | 2426 | } |
|
2415 | 2427 | |
|
2416 | 2428 | |
|
2417 | 2429 | size_t ZSTD_getBlockSizeMax(ZSTD_CCtx* cctx) |
|
2418 | 2430 | { |
|
2419 | 2431 | return MIN (ZSTD_BLOCKSIZE_ABSOLUTEMAX, 1 << cctx->params.cParams.windowLog); |
|
2420 | 2432 | } |
|
2421 | 2433 | |
|
2422 | 2434 | size_t ZSTD_compressBlock(ZSTD_CCtx* cctx, void* dst, size_t dstCapacity, const void* src, size_t srcSize) |
|
2423 | 2435 | { |
|
2424 | 2436 | size_t const blockSizeMax = ZSTD_getBlockSizeMax(cctx); |
|
2425 | 2437 | if (srcSize > blockSizeMax) return ERROR(srcSize_wrong); |
|
2426 | 2438 | return ZSTD_compressContinue_internal(cctx, dst, dstCapacity, src, srcSize, 0, 0); |
|
2427 | 2439 | } |
|
2428 | 2440 | |
|
2429 | 2441 | |
|
2430 | 2442 | static size_t ZSTD_loadDictionaryContent(ZSTD_CCtx* zc, const void* src, size_t srcSize) |
|
2431 | 2443 | { |
|
2432 | 2444 | const BYTE* const ip = (const BYTE*) src; |
|
2433 | 2445 | const BYTE* const iend = ip + srcSize; |
|
2434 | 2446 | |
|
2435 | 2447 | /* input becomes current prefix */ |
|
2436 | 2448 | zc->lowLimit = zc->dictLimit; |
|
2437 | 2449 | zc->dictLimit = (U32)(zc->nextSrc - zc->base); |
|
2438 | 2450 | zc->dictBase = zc->base; |
|
2439 | 2451 | zc->base += ip - zc->nextSrc; |
|
2440 | 2452 | zc->nextToUpdate = zc->dictLimit; |
|
2441 | 2453 | zc->loadedDictEnd = (U32)(iend - zc->base); |
|
2442 | 2454 | |
|
2443 | 2455 | zc->nextSrc = iend; |
|
2444 | 2456 | if (srcSize <= HASH_READ_SIZE) return 0; |
|
2445 | 2457 | |
|
2446 | 2458 | switch(zc->params.cParams.strategy) |
|
2447 | 2459 | { |
|
2448 | 2460 | case ZSTD_fast: |
|
2449 | 2461 | ZSTD_fillHashTable (zc, iend, zc->params.cParams.searchLength); |
|
2450 | 2462 | break; |
|
2451 | 2463 | |
|
2452 | 2464 | case ZSTD_dfast: |
|
2453 | 2465 | ZSTD_fillDoubleHashTable (zc, iend, zc->params.cParams.searchLength); |
|
2454 | 2466 | break; |
|
2455 | 2467 | |
|
2456 | 2468 | case ZSTD_greedy: |
|
2457 | 2469 | case ZSTD_lazy: |
|
2458 | 2470 | case ZSTD_lazy2: |
|
2459 | 2471 | ZSTD_insertAndFindFirstIndex (zc, iend-HASH_READ_SIZE, zc->params.cParams.searchLength); |
|
2460 | 2472 | break; |
|
2461 | 2473 | |
|
2462 | 2474 | case ZSTD_btlazy2: |
|
2463 | 2475 | case ZSTD_btopt: |
|
2464 | 2476 | case ZSTD_btopt2: |
|
2465 | 2477 | ZSTD_updateTree(zc, iend-HASH_READ_SIZE, iend, 1 << zc->params.cParams.searchLog, zc->params.cParams.searchLength); |
|
2466 | 2478 | break; |
|
2467 | 2479 | |
|
2468 | 2480 | default: |
|
2469 | 2481 | return ERROR(GENERIC); /* strategy doesn't exist; impossible */ |
|
2470 | 2482 | } |
|
2471 | 2483 | |
|
2472 | 2484 | zc->nextToUpdate = zc->loadedDictEnd; |
|
2473 | 2485 | return 0; |
|
2474 | 2486 | } |
|
2475 | 2487 | |
|
2476 | 2488 | |
|
2477 | 2489 | /* Dictionaries that assign zero probability to symbols that show up cause problems

2478 | 2490 | during FSE encoding. Refuse dictionaries that assign zero probability to symbols
|
2479 | 2491 | that we may encounter during compression. |
|
2480 | 2492 | NOTE: This behavior is not standard and could be improved in the future. */ |
|
2481 | 2493 | static size_t ZSTD_checkDictNCount(short* normalizedCounter, unsigned dictMaxSymbolValue, unsigned maxSymbolValue) { |
|
2482 | 2494 | U32 s; |
|
2483 | 2495 | if (dictMaxSymbolValue < maxSymbolValue) return ERROR(dictionary_corrupted); |
|
2484 | 2496 | for (s = 0; s <= maxSymbolValue; ++s) { |
|
2485 | 2497 | if (normalizedCounter[s] == 0) return ERROR(dictionary_corrupted); |
|
2486 | 2498 | } |
|
2487 | 2499 | return 0; |
|
2488 | 2500 | } |
|
2489 | 2501 | |
|
2490 | 2502 | |
|
2491 | 2503 | /* Dictionary format : |
|
2492 | 2504 | Magic == ZSTD_DICT_MAGIC (4 bytes) |
|
2493 | 2505 | HUF_writeCTable(256) |
|
2494 | 2506 | FSE_writeNCount(off) |
|
2495 | 2507 | FSE_writeNCount(ml) |
|
2496 | 2508 | FSE_writeNCount(ll) |
|
2497 | 2509 | RepOffsets |
|
2498 | 2510 | Dictionary content |
|
2499 | 2511 | */ |
|
2500 | 2512 | /*! ZSTD_loadDictEntropyStats() : |
|
2501 | 2513 | @return : size read from dictionary |
|
2502 | 2514 | note : magic number is assumed to have been checked already */
|
2503 | 2515 | static size_t ZSTD_loadDictEntropyStats(ZSTD_CCtx* cctx, const void* dict, size_t dictSize) |
|
2504 | 2516 | { |
|
2505 | 2517 | const BYTE* dictPtr = (const BYTE*)dict; |
|
2506 | 2518 | const BYTE* const dictEnd = dictPtr + dictSize; |
|
2507 | 2519 | short offcodeNCount[MaxOff+1]; |
|
2508 | 2520 | unsigned offcodeMaxValue = MaxOff; |
|
2521 | BYTE scratchBuffer[1<<MAX(MLFSELog,LLFSELog)]; | |
|
2509 | 2522 | |
|
2510 | 2523 | { size_t const hufHeaderSize = HUF_readCTable(cctx->hufTable, 255, dict, dictSize); |
|
2511 | 2524 | if (HUF_isError(hufHeaderSize)) return ERROR(dictionary_corrupted); |
|
2512 | 2525 | dictPtr += hufHeaderSize; |
|
2513 | 2526 | } |
|
2514 | 2527 | |
|
2515 | 2528 | { unsigned offcodeLog; |
|
2516 | 2529 | size_t const offcodeHeaderSize = FSE_readNCount(offcodeNCount, &offcodeMaxValue, &offcodeLog, dictPtr, dictEnd-dictPtr); |
|
2517 | 2530 | if (FSE_isError(offcodeHeaderSize)) return ERROR(dictionary_corrupted); |
|
2518 | 2531 | if (offcodeLog > OffFSELog) return ERROR(dictionary_corrupted); |
|
2519 | 2532 | /* Defer checking offcodeMaxValue because we need to know the size of the dictionary content */ |
|
2520 | CHECK_E (FSE_buildCTable(cctx->offcodeCTable, offcodeNCount, offcodeMaxValue, offcodeLog), dictionary_corrupted); | |
|
2533 | CHECK_E (FSE_buildCTable_wksp(cctx->offcodeCTable, offcodeNCount, offcodeMaxValue, offcodeLog, scratchBuffer, sizeof(scratchBuffer)), dictionary_corrupted); | |
|
2521 | 2534 | dictPtr += offcodeHeaderSize; |
|
2522 | 2535 | } |
|
2523 | 2536 | |
|
2524 | 2537 | { short matchlengthNCount[MaxML+1]; |
|
2525 | 2538 | unsigned matchlengthMaxValue = MaxML, matchlengthLog; |
|
2526 | 2539 | size_t const matchlengthHeaderSize = FSE_readNCount(matchlengthNCount, &matchlengthMaxValue, &matchlengthLog, dictPtr, dictEnd-dictPtr); |
|
2527 | 2540 | if (FSE_isError(matchlengthHeaderSize)) return ERROR(dictionary_corrupted); |
|
2528 | 2541 | if (matchlengthLog > MLFSELog) return ERROR(dictionary_corrupted); |
|
2529 | 2542 | /* Every match length code must have non-zero probability */ |
|
2530 | 2543 | CHECK_F (ZSTD_checkDictNCount(matchlengthNCount, matchlengthMaxValue, MaxML)); |
|
2531 | CHECK_E (FSE_buildCTable(cctx->matchlengthCTable, matchlengthNCount, matchlengthMaxValue, matchlengthLog), dictionary_corrupted); | |
|
2544 | CHECK_E (FSE_buildCTable_wksp(cctx->matchlengthCTable, matchlengthNCount, matchlengthMaxValue, matchlengthLog, scratchBuffer, sizeof(scratchBuffer)), dictionary_corrupted); | |
|
2532 | 2545 | dictPtr += matchlengthHeaderSize; |
|
2533 | 2546 | } |
|
2534 | 2547 | |
|
2535 | 2548 | { short litlengthNCount[MaxLL+1]; |
|
2536 | 2549 | unsigned litlengthMaxValue = MaxLL, litlengthLog; |
|
2537 | 2550 | size_t const litlengthHeaderSize = FSE_readNCount(litlengthNCount, &litlengthMaxValue, &litlengthLog, dictPtr, dictEnd-dictPtr); |
|
2538 | 2551 | if (FSE_isError(litlengthHeaderSize)) return ERROR(dictionary_corrupted); |
|
2539 | 2552 | if (litlengthLog > LLFSELog) return ERROR(dictionary_corrupted); |
|
2540 | 2553 | /* Every literal length code must have non-zero probability */ |
|
2541 | 2554 | CHECK_F (ZSTD_checkDictNCount(litlengthNCount, litlengthMaxValue, MaxLL)); |
|
2542 | CHECK_E(FSE_buildCTable(cctx->litlengthCTable, litlengthNCount, litlengthMaxValue, litlengthLog), dictionary_corrupted); | |
|
2555 | CHECK_E(FSE_buildCTable_wksp(cctx->litlengthCTable, litlengthNCount, litlengthMaxValue, litlengthLog, scratchBuffer, sizeof(scratchBuffer)), dictionary_corrupted); | |
|
2543 | 2556 | dictPtr += litlengthHeaderSize; |
|
2544 | 2557 | } |
|
2545 | 2558 | |
|
2546 | 2559 | if (dictPtr+12 > dictEnd) return ERROR(dictionary_corrupted); |
|
2547 | 2560 | cctx->rep[0] = MEM_readLE32(dictPtr+0); if (cctx->rep[0] >= dictSize) return ERROR(dictionary_corrupted); |
|
2548 | 2561 | cctx->rep[1] = MEM_readLE32(dictPtr+4); if (cctx->rep[1] >= dictSize) return ERROR(dictionary_corrupted); |
|
2549 | 2562 | cctx->rep[2] = MEM_readLE32(dictPtr+8); if (cctx->rep[2] >= dictSize) return ERROR(dictionary_corrupted); |
|
2550 | 2563 | dictPtr += 12; |
|
2551 | 2564 | |
|
2552 | 2565 | { U32 offcodeMax = MaxOff; |
|
2553 | 2566 | if ((size_t)(dictEnd - dictPtr) <= ((U32)-1) - 128 KB) { |
|
2554 | 2567 | U32 const maxOffset = (U32)(dictEnd - dictPtr) + 128 KB; /* The maximum offset that must be supported */ |
|
2555 | 2568 | /* Calculate minimum offset code required to represent maxOffset */ |
|
2556 | 2569 | offcodeMax = ZSTD_highbit32(maxOffset); |
|
2557 | 2570 | } |
|
2558 | 2571 | /* Every possible supported offset <= dictContentSize + 128 KB must be representable */ |
|
2559 | 2572 | CHECK_F (ZSTD_checkDictNCount(offcodeNCount, offcodeMaxValue, MIN(offcodeMax, MaxOff))); |
|
2560 | 2573 | } |
|
2561 | 2574 | |
|
2562 | 2575 | cctx->flagStaticTables = 1; |
|
2563 | 2576 | return dictPtr - (const BYTE*)dict; |
|
2564 | 2577 | } |
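
As a reading companion to the parser above: the entropy tables start at byte 8
of the dictionary, after a 4-byte magic and a 4-byte dictID. A minimal sketch
of reading those fixed fields (assumes a little-endian host; the helper is
illustrative, not a zstd API):

    #include <stdint.h>
    #include <string.h>

    static int read_dict_header(const unsigned char* dict, size_t dictSize,
                                uint32_t* magic, uint32_t* dictID)
    {
        if (dictSize < 8) return -1;   /* too small to carry a header */
        memcpy(magic,  dict,     4);
        memcpy(dictID, dict + 4, 4);
        return 0;                      /* entropy stats follow at dict + 8 */
    }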
|
2565 | 2578 | |
|
2566 | 2579 | /** ZSTD_compress_insertDictionary() : |
|
2567 | 2580 | * @return : 0, or an error code */ |
|
2568 | 2581 | static size_t ZSTD_compress_insertDictionary(ZSTD_CCtx* zc, const void* dict, size_t dictSize) |
|
2569 | 2582 | { |
|
2570 | 2583 | if ((dict==NULL) || (dictSize<=8)) return 0; |
|
2571 | 2584 | |
|
2572 | 2585 | /* default : dict is pure content */ |
|
2573 | 2586 | if (MEM_readLE32(dict) != ZSTD_DICT_MAGIC) return ZSTD_loadDictionaryContent(zc, dict, dictSize); |
|
2574 | 2587 | zc->dictID = zc->params.fParams.noDictIDFlag ? 0 : MEM_readLE32((const char*)dict+4); |
|
2575 | 2588 | |
|
2576 | 2589 | /* known magic number : dict is parsed for entropy stats and content */ |
|
2577 | 2590 | { size_t const loadError = ZSTD_loadDictEntropyStats(zc, (const char*)dict+8 /* skip dictHeader */, dictSize-8); |
|
2578 | 2591 | size_t const eSize = loadError + 8; |
|
2579 | 2592 | if (ZSTD_isError(loadError)) return loadError; |
|
2580 | 2593 | return ZSTD_loadDictionaryContent(zc, (const char*)dict+eSize, dictSize-eSize); |
|
2581 | 2594 | } |
|
2582 | 2595 | } |
|
2583 | 2596 | |
|
2584 | 2597 | |
|
2585 | 2598 | /*! ZSTD_compressBegin_internal() : |
|
2586 | 2599 | * @return : 0, or an error code */ |
|
2587 | 2600 | static size_t ZSTD_compressBegin_internal(ZSTD_CCtx* cctx, |
|
2588 | 2601 | const void* dict, size_t dictSize, |
|
2589 | 2602 | ZSTD_parameters params, U64 pledgedSrcSize) |
|
2590 | 2603 | { |
|
2591 | 2604 | ZSTD_compResetPolicy_e const crp = dictSize ? ZSTDcrp_fullReset : ZSTDcrp_continue; |
|
2592 | 2605 | CHECK_F(ZSTD_resetCCtx_advanced(cctx, params, pledgedSrcSize, crp)); |
|
2593 | 2606 | return ZSTD_compress_insertDictionary(cctx, dict, dictSize); |
|
2594 | 2607 | } |
|
2595 | 2608 | |
|
2596 | 2609 | |
|
2597 | 2610 | /*! ZSTD_compressBegin_advanced() : |
|
2598 | 2611 | * @return : 0, or an error code */ |
|
2599 | 2612 | size_t ZSTD_compressBegin_advanced(ZSTD_CCtx* cctx, |
|
2600 | 2613 | const void* dict, size_t dictSize, |
|
2601 | 2614 | ZSTD_parameters params, unsigned long long pledgedSrcSize) |
|
2602 | 2615 | { |
|
2603 | 2616 | /* compression parameters verification and optimization */ |
|
2604 | 2617 | CHECK_F(ZSTD_checkCParams(params.cParams)); |
|
2605 | 2618 | return ZSTD_compressBegin_internal(cctx, dict, dictSize, params, pledgedSrcSize); |
|
2606 | 2619 | } |
|
2607 | 2620 | |
|
2608 | 2621 | |
|
2609 | 2622 | size_t ZSTD_compressBegin_usingDict(ZSTD_CCtx* cctx, const void* dict, size_t dictSize, int compressionLevel) |
|
2610 | 2623 | { |
|
2611 | 2624 | ZSTD_parameters const params = ZSTD_getParams(compressionLevel, 0, dictSize); |
|
2612 | 2625 | return ZSTD_compressBegin_internal(cctx, dict, dictSize, params, 0); |
|
2613 | 2626 | } |
|
2614 | 2627 | |
|
2615 | 2628 | |
|
2616 | 2629 | size_t ZSTD_compressBegin(ZSTD_CCtx* zc, int compressionLevel) |
|
2617 | 2630 | { |
|
2618 | 2631 | return ZSTD_compressBegin_usingDict(zc, NULL, 0, compressionLevel); |
|
2619 | 2632 | } |
|
2620 | 2633 | |
|
2621 | 2634 | |
|
2622 | 2635 | /*! ZSTD_writeEpilogue() : |
|
2623 | 2636 | * Ends a frame. |
|
2624 | 2637 | * @return : nb of bytes written into dst (or an error code) */ |
|
2625 | 2638 | static size_t ZSTD_writeEpilogue(ZSTD_CCtx* cctx, void* dst, size_t dstCapacity) |
|
2626 | 2639 | { |
|
2627 | 2640 | BYTE* const ostart = (BYTE*)dst; |
|
2628 | 2641 | BYTE* op = ostart; |
|
2629 | 2642 | size_t fhSize = 0; |
|
2630 | 2643 | |
|
2631 | 2644 | if (cctx->stage == ZSTDcs_created) return ERROR(stage_wrong); /* init missing */ |
|
2632 | 2645 | |
|
2633 | 2646 | /* special case : empty frame */ |
|
2634 | 2647 | if (cctx->stage == ZSTDcs_init) { |
|
2635 | 2648 | fhSize = ZSTD_writeFrameHeader(dst, dstCapacity, cctx->params, 0, 0); |
|
2636 | 2649 | if (ZSTD_isError(fhSize)) return fhSize; |
|
2637 | 2650 | dstCapacity -= fhSize; |
|
2638 | 2651 | op += fhSize; |
|
2639 | 2652 | cctx->stage = ZSTDcs_ongoing; |
|
2640 | 2653 | } |
|
2641 | 2654 | |
|
2642 | 2655 | if (cctx->stage != ZSTDcs_ending) { |
|
2643 | 2656 | /* write one last empty block, make it the "last" block */ |
|
2644 | 2657 | U32 const cBlockHeader24 = 1 /* last block */ + (((U32)bt_raw)<<1) + 0; |
|
2645 | 2658 | if (dstCapacity<4) return ERROR(dstSize_tooSmall); |
|
2646 | 2659 | MEM_writeLE32(op, cBlockHeader24); |
|
2647 | 2660 | op += ZSTD_blockHeaderSize; |
|
2648 | 2661 | dstCapacity -= ZSTD_blockHeaderSize; |
|
2649 | 2662 | } |
|
2650 | 2663 | |
|
2651 | 2664 | if (cctx->params.fParams.checksumFlag) { |
|
2652 | 2665 | U32 const checksum = (U32) XXH64_digest(&cctx->xxhState); |
|
2653 | 2666 | if (dstCapacity<4) return ERROR(dstSize_tooSmall); |
|
2654 | 2667 | MEM_writeLE32(op, checksum); |
|
2655 | 2668 | op += 4; |
|
2656 | 2669 | } |
|
2657 | 2670 | |
|
2658 | 2671 | cctx->stage = ZSTDcs_created; /* return to "created but no init" status */ |
|
2659 | 2672 | return op-ostart; |
|
2660 | 2673 | } |
|
2661 | 2674 | |
|
2662 | 2675 | |
|
2663 | 2676 | size_t ZSTD_compressEnd (ZSTD_CCtx* cctx, |
|
2664 | 2677 | void* dst, size_t dstCapacity, |
|
2665 | 2678 | const void* src, size_t srcSize) |
|
2666 | 2679 | { |
|
2667 | 2680 | size_t endResult; |
|
2668 | 2681 | size_t const cSize = ZSTD_compressContinue_internal(cctx, dst, dstCapacity, src, srcSize, 1, 1); |
|
2669 | 2682 | if (ZSTD_isError(cSize)) return cSize; |
|
2670 | 2683 | endResult = ZSTD_writeEpilogue(cctx, (char*)dst + cSize, dstCapacity-cSize); |
|
2671 | 2684 | if (ZSTD_isError(endResult)) return endResult; |
|
2672 | 2685 | return cSize + endResult; |
|
2673 | 2686 | } |
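
The begin/continue/end trio above is zstd's low-level frame protocol:
ZSTD_compressBegin() arms the context, each ZSTD_compressContinue() emits fully
terminated blocks for one chunk, and ZSTD_compressEnd() handles the final chunk
plus the epilogue. A minimal single-chunk sketch (error handling abbreviated;
level 3 is an arbitrary choice):

    static size_t one_chunk_frame(ZSTD_CCtx* cctx,
                                  void* dst, size_t dstCapacity,
                                  const void* src, size_t srcSize)
    {
        size_t const initErr = ZSTD_compressBegin(cctx, 3);
        if (ZSTD_isError(initErr)) return initErr;
        /* multi-chunk callers would loop over ZSTD_compressContinue()
         * here before the final ZSTD_compressEnd() */
        return ZSTD_compressEnd(cctx, dst, dstCapacity, src, srcSize);
    }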
|
2674 | 2687 | |
|
2675 | 2688 | |
|
2676 | 2689 | static size_t ZSTD_compress_internal (ZSTD_CCtx* cctx, |
|
2677 | 2690 | void* dst, size_t dstCapacity, |
|
2678 | 2691 | const void* src, size_t srcSize, |
|
2679 | 2692 | const void* dict,size_t dictSize, |
|
2680 | 2693 | ZSTD_parameters params) |
|
2681 | 2694 | { |
|
2682 | 2695 | CHECK_F(ZSTD_compressBegin_internal(cctx, dict, dictSize, params, srcSize)); |
|
2683 | 2696 | return ZSTD_compressEnd(cctx, dst, dstCapacity, src, srcSize); |
|
2684 | 2697 | } |
|
2685 | 2698 | |
|
2686 | 2699 | size_t ZSTD_compress_advanced (ZSTD_CCtx* ctx, |
|
2687 | 2700 | void* dst, size_t dstCapacity, |
|
2688 | 2701 | const void* src, size_t srcSize, |
|
2689 | 2702 | const void* dict,size_t dictSize, |
|
2690 | 2703 | ZSTD_parameters params) |
|
2691 | 2704 | { |
|
2692 | 2705 | CHECK_F(ZSTD_checkCParams(params.cParams)); |
|
2693 | 2706 | return ZSTD_compress_internal(ctx, dst, dstCapacity, src, srcSize, dict, dictSize, params); |
|
2694 | 2707 | } |
|
2695 | 2708 | |
|
2696 | 2709 | size_t ZSTD_compress_usingDict(ZSTD_CCtx* ctx, void* dst, size_t dstCapacity, const void* src, size_t srcSize, const void* dict, size_t dictSize, int compressionLevel) |
|
2697 | 2710 | { |
|
2698 | ZSTD_parameters params = ZSTD_getParams(compressionLevel, srcSize, dictSize); | |
|
2711 | ZSTD_parameters params = ZSTD_getParams(compressionLevel, srcSize, dict ? dictSize : 0); | |
|
2699 | 2712 | params.fParams.contentSizeFlag = 1; |
|
2700 | 2713 | return ZSTD_compress_internal(ctx, dst, dstCapacity, src, srcSize, dict, dictSize, params); |
|
2701 | 2714 | } |
|
2702 | 2715 | |
|
2703 | 2716 | size_t ZSTD_compressCCtx (ZSTD_CCtx* ctx, void* dst, size_t dstCapacity, const void* src, size_t srcSize, int compressionLevel) |
|
2704 | 2717 | { |
|
2705 | 2718 | return ZSTD_compress_usingDict(ctx, dst, dstCapacity, src, srcSize, NULL, 0, compressionLevel); |
|
2706 | 2719 | } |
|
2707 | 2720 | |
|
2708 | 2721 | size_t ZSTD_compress(void* dst, size_t dstCapacity, const void* src, size_t srcSize, int compressionLevel) |
|
2709 | 2722 | { |
|
2710 | 2723 | size_t result; |
|
2711 | 2724 | ZSTD_CCtx ctxBody; |
|
2712 | 2725 | memset(&ctxBody, 0, sizeof(ctxBody)); |
|
2713 | 2726 | memcpy(&ctxBody.customMem, &defaultCustomMem, sizeof(ZSTD_customMem)); |
|
2714 | 2727 | result = ZSTD_compressCCtx(&ctxBody, dst, dstCapacity, src, srcSize, compressionLevel); |
|
2715 | 2728 | ZSTD_free(ctxBody.workSpace, defaultCustomMem); /* can't free ctxBody itself, as it's on stack; free only heap content */ |
|
2716 | 2729 | return result; |
|
2717 | 2730 | } |
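
A hedged usage sketch of the one-shot entry point above (buffer contents and
compression level are arbitrary):

    #include <stdio.h>
    #include <stdlib.h>
    #include <zstd.h>

    int main(void)
    {
        const char src[] = "an example payload, an example payload";
        size_t const bound = ZSTD_compressBound(sizeof(src));
        void* const dst = malloc(bound);
        if (dst == NULL) return 1;
        {   size_t const cSize = ZSTD_compress(dst, bound, src, sizeof(src), 5);
            if (ZSTD_isError(cSize)) {
                fprintf(stderr, "zstd: %s\n", ZSTD_getErrorName(cSize));
                free(dst);
                return 1;
            }
            printf("compressed %zu -> %zu bytes\n", sizeof(src), cSize);
        }
        free(dst);
        return 0;
    }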
|
2718 | 2731 | |
|
2719 | 2732 | |
|
2720 | 2733 | /* ===== Dictionary API ===== */ |
|
2721 | 2734 | |
|
2722 | 2735 | struct ZSTD_CDict_s { |
|
2723 | 2736 | void* dictContent; |
|
2724 | 2737 | size_t dictContentSize; |
|
2725 | 2738 | ZSTD_CCtx* refContext; |
|
2726 | 2739 | }; /* typedef'd to ZSTD_CDict within "zstd.h" */
|
2727 | 2740 | |
|
2728 | 2741 | size_t ZSTD_sizeof_CDict(const ZSTD_CDict* cdict) |
|
2729 | 2742 | { |
|
2730 | 2743 | if (cdict==NULL) return 0; /* support sizeof on NULL */ |
|
2731 | 2744 | return ZSTD_sizeof_CCtx(cdict->refContext) + cdict->dictContentSize; |
|
2732 | 2745 | } |
|
2733 | 2746 | |
|
2734 | 2747 | ZSTD_CDict* ZSTD_createCDict_advanced(const void* dict, size_t dictSize, ZSTD_parameters params, ZSTD_customMem customMem) |
|
2735 | 2748 | { |
|
2736 | 2749 | if (!customMem.customAlloc && !customMem.customFree) customMem = defaultCustomMem; |
|
2737 | 2750 | if (!customMem.customAlloc || !customMem.customFree) return NULL; |
|
2738 | 2751 | |
|
2739 | 2752 | { ZSTD_CDict* const cdict = (ZSTD_CDict*) ZSTD_malloc(sizeof(ZSTD_CDict), customMem); |
|
2740 | 2753 | void* const dictContent = ZSTD_malloc(dictSize, customMem); |
|
2741 | 2754 | ZSTD_CCtx* const cctx = ZSTD_createCCtx_advanced(customMem); |
|
2742 | 2755 | |
|
2743 | 2756 | if (!dictContent || !cdict || !cctx) { |
|
2744 | 2757 | ZSTD_free(dictContent, customMem); |
|
2745 | 2758 | ZSTD_free(cdict, customMem); |
|
2746 | 2759 | ZSTD_free(cctx, customMem); |
|
2747 | 2760 | return NULL; |
|
2748 | 2761 | } |
|
2749 | 2762 | |
|
2750 | 2763 | if (dictSize) { |
|
2751 | 2764 | memcpy(dictContent, dict, dictSize); |
|
2752 | 2765 | } |
|
2753 | 2766 | { size_t const errorCode = ZSTD_compressBegin_advanced(cctx, dictContent, dictSize, params, 0); |
|
2754 | 2767 | if (ZSTD_isError(errorCode)) { |
|
2755 | 2768 | ZSTD_free(dictContent, customMem); |
|
2756 | 2769 | ZSTD_free(cdict, customMem); |
|
2757 | 2770 | ZSTD_free(cctx, customMem); |
|
2758 | 2771 | return NULL; |
|
2759 | 2772 | } } |
|
2760 | 2773 | |
|
2761 | 2774 | cdict->dictContent = dictContent; |
|
2762 | 2775 | cdict->dictContentSize = dictSize; |
|
2763 | 2776 | cdict->refContext = cctx; |
|
2764 | 2777 | return cdict; |
|
2765 | 2778 | } |
|
2766 | 2779 | } |
|
2767 | 2780 | |
|
2768 | 2781 | ZSTD_CDict* ZSTD_createCDict(const void* dict, size_t dictSize, int compressionLevel) |
|
2769 | 2782 | { |
|
2770 | 2783 | ZSTD_customMem const allocator = { NULL, NULL, NULL }; |
|
2771 | 2784 | ZSTD_parameters params = ZSTD_getParams(compressionLevel, 0, dictSize); |
|
2772 | 2785 | params.fParams.contentSizeFlag = 1; |
|
2773 | 2786 | return ZSTD_createCDict_advanced(dict, dictSize, params, allocator); |
|
2774 | 2787 | } |
|
2775 | 2788 | |
|
2776 | 2789 | size_t ZSTD_freeCDict(ZSTD_CDict* cdict) |
|
2777 | 2790 | { |
|
2778 | 2791 | if (cdict==NULL) return 0; /* support free on NULL */ |
|
2779 | 2792 | { ZSTD_customMem const cMem = cdict->refContext->customMem; |
|
2780 | 2793 | ZSTD_freeCCtx(cdict->refContext); |
|
2781 | 2794 | ZSTD_free(cdict->dictContent, cMem); |
|
2782 | 2795 | ZSTD_free(cdict, cMem); |
|
2783 | 2796 | return 0; |
|
2784 | 2797 | } |
|
2785 | 2798 | } |
|
2786 | 2799 | |
|
2787 | 2800 | static ZSTD_parameters ZSTD_getParamsFromCDict(const ZSTD_CDict* cdict) { |
|
2788 | 2801 | return ZSTD_getParamsFromCCtx(cdict->refContext); |
|
2789 | 2802 | } |
|
2790 | 2803 | |
|
2791 | 2804 | size_t ZSTD_compressBegin_usingCDict(ZSTD_CCtx* cctx, const ZSTD_CDict* cdict, U64 pledgedSrcSize) |
|
2792 | 2805 | { |
|
2793 | 2806 | if (cdict->dictContentSize) CHECK_F(ZSTD_copyCCtx(cctx, cdict->refContext, pledgedSrcSize)) |
|
2794 | 2807 | else CHECK_F(ZSTD_compressBegin_advanced(cctx, NULL, 0, cdict->refContext->params, pledgedSrcSize)); |
|
2795 | 2808 | return 0; |
|
2796 | 2809 | } |
|
2797 | 2810 | |
|
2798 | 2811 | /*! ZSTD_compress_usingCDict() : |
|
2799 | 2812 | * Compression using a digested Dictionary. |
|
2800 | 2813 | * Faster startup than ZSTD_compress_usingDict(), recommended when the same dictionary is used multiple times.
|
2801 | 2814 | * Note that compression level is decided during dictionary creation */ |
|
2802 | 2815 | size_t ZSTD_compress_usingCDict(ZSTD_CCtx* cctx, |
|
2803 | 2816 | void* dst, size_t dstCapacity, |
|
2804 | 2817 | const void* src, size_t srcSize, |
|
2805 | 2818 | const ZSTD_CDict* cdict) |
|
2806 | 2819 | { |
|
2807 | 2820 | CHECK_F(ZSTD_compressBegin_usingCDict(cctx, cdict, srcSize)); |
|
2808 | 2821 | |
|
2809 | 2822 | if (cdict->refContext->params.fParams.contentSizeFlag==1) { |
|
2810 | 2823 | cctx->params.fParams.contentSizeFlag = 1; |
|
2811 | 2824 | cctx->frameContentSize = srcSize; |
|
2812 | 2825 | } |
|
2813 | 2826 | |
|
2814 | 2827 | return ZSTD_compressEnd(cctx, dst, dstCapacity, src, srcSize); |
|
2815 | 2828 | } |
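
A sketch of driving the digested-dictionary path above; in real use the CDict
and CCtx would be created once and reused across many frames (dictBuf/dictLen
are assumed to hold a dictionary built elsewhere, and the helper name is ours):

    #include <zstd.h>

    static size_t compress_with_cdict(const void* dictBuf, size_t dictLen,
                                      void* dst, size_t dstCapacity,
                                      const void* src, size_t srcSize)
    {
        ZSTD_CDict* const cdict = ZSTD_createCDict(dictBuf, dictLen, 3);
        ZSTD_CCtx*  const cctx  = ZSTD_createCCtx();
        size_t cSize = (size_t)-1;   /* reads as an error to ZSTD_isError() */
        if (cdict && cctx)
            cSize = ZSTD_compress_usingCDict(cctx, dst, dstCapacity,
                                             src, srcSize, cdict);
        ZSTD_freeCCtx(cctx);
        ZSTD_freeCDict(cdict);
        return cSize;
    }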
|
2816 | 2829 | |
|
2817 | 2830 | |
|
2818 | 2831 | |
|
2819 | 2832 | /* ****************************************************************** |
|
2820 | 2833 | * Streaming |
|
2821 | 2834 | ********************************************************************/ |
|
2822 | 2835 | |
|
2823 | 2836 | typedef enum { zcss_init, zcss_load, zcss_flush, zcss_final } ZSTD_cStreamStage; |
|
2824 | 2837 | |
|
2825 | 2838 | struct ZSTD_CStream_s { |
|
2826 | 2839 | ZSTD_CCtx* cctx; |
|
2827 | 2840 | ZSTD_CDict* cdictLocal; |
|
2828 | 2841 | const ZSTD_CDict* cdict; |
|
2829 | 2842 | char* inBuff; |
|
2830 | 2843 | size_t inBuffSize; |
|
2831 | 2844 | size_t inToCompress; |
|
2832 | 2845 | size_t inBuffPos; |
|
2833 | 2846 | size_t inBuffTarget; |
|
2834 | 2847 | size_t blockSize; |
|
2835 | 2848 | char* outBuff; |
|
2836 | 2849 | size_t outBuffSize; |
|
2837 | 2850 | size_t outBuffContentSize; |
|
2838 | 2851 | size_t outBuffFlushedSize; |
|
2839 | 2852 | ZSTD_cStreamStage stage; |
|
2840 | 2853 | U32 checksum; |
|
2841 | 2854 | U32 frameEnded; |
|
2855 | U64 pledgedSrcSize; | |
|
2856 | U64 inputProcessed; | |
|
2842 | 2857 | ZSTD_parameters params; |
|
2843 | 2858 | ZSTD_customMem customMem; |
|
2844 | 2859 | }; /* typedef'd to ZSTD_CStream within "zstd.h" */ |
|
2845 | 2860 | |
|
2846 | 2861 | ZSTD_CStream* ZSTD_createCStream(void) |
|
2847 | 2862 | { |
|
2848 | 2863 | return ZSTD_createCStream_advanced(defaultCustomMem); |
|
2849 | 2864 | } |
|
2850 | 2865 | |
|
2851 | 2866 | ZSTD_CStream* ZSTD_createCStream_advanced(ZSTD_customMem customMem) |
|
2852 | 2867 | { |
|
2853 | 2868 | ZSTD_CStream* zcs; |
|
2854 | 2869 | |
|
2855 | 2870 | if (!customMem.customAlloc && !customMem.customFree) customMem = defaultCustomMem; |
|
2856 | 2871 | if (!customMem.customAlloc || !customMem.customFree) return NULL; |
|
2857 | 2872 | |
|
2858 | 2873 | zcs = (ZSTD_CStream*)ZSTD_malloc(sizeof(ZSTD_CStream), customMem); |
|
2859 | 2874 | if (zcs==NULL) return NULL; |
|
2860 | 2875 | memset(zcs, 0, sizeof(ZSTD_CStream)); |
|
2861 | 2876 | memcpy(&zcs->customMem, &customMem, sizeof(ZSTD_customMem)); |
|
2862 | 2877 | zcs->cctx = ZSTD_createCCtx_advanced(customMem); |
|
2863 | 2878 | if (zcs->cctx == NULL) { ZSTD_freeCStream(zcs); return NULL; } |
|
2864 | 2879 | return zcs; |
|
2865 | 2880 | } |
|
2866 | 2881 | |
|
2867 | 2882 | size_t ZSTD_freeCStream(ZSTD_CStream* zcs) |
|
2868 | 2883 | { |
|
2869 | 2884 | if (zcs==NULL) return 0; /* support free on NULL */ |
|
2870 | 2885 | { ZSTD_customMem const cMem = zcs->customMem; |
|
2871 | 2886 | ZSTD_freeCCtx(zcs->cctx); |
|
2872 | 2887 | ZSTD_freeCDict(zcs->cdictLocal); |
|
2873 | 2888 | ZSTD_free(zcs->inBuff, cMem); |
|
2874 | 2889 | ZSTD_free(zcs->outBuff, cMem); |
|
2875 | 2890 | ZSTD_free(zcs, cMem); |
|
2876 | 2891 | return 0; |
|
2877 | 2892 | } |
|
2878 | 2893 | } |
|
2879 | 2894 | |
|
2880 | 2895 | |
|
2881 | 2896 | /*====== Initialization ======*/ |
|
2882 | 2897 | |
|
2883 | 2898 | size_t ZSTD_CStreamInSize(void) { return ZSTD_BLOCKSIZE_ABSOLUTEMAX; } |
|
2884 | 2899 | size_t ZSTD_CStreamOutSize(void) { return ZSTD_compressBound(ZSTD_BLOCKSIZE_ABSOLUTEMAX) + ZSTD_blockHeaderSize + 4 /* 32-bits hash */ ; } |
|
2885 | 2900 | |
|
2886 | 2901 | size_t ZSTD_resetCStream(ZSTD_CStream* zcs, unsigned long long pledgedSrcSize) |
|
2887 | 2902 | { |
|
2888 | 2903 | if (zcs->inBuffSize==0) return ERROR(stage_wrong); /* zcs has not been init at least once */ |
|
2889 | 2904 | |
|
2890 | 2905 | if (zcs->cdict) CHECK_F(ZSTD_compressBegin_usingCDict(zcs->cctx, zcs->cdict, pledgedSrcSize)) |
|
2891 | 2906 | else CHECK_F(ZSTD_compressBegin_advanced(zcs->cctx, NULL, 0, zcs->params, pledgedSrcSize)); |
|
2892 | 2907 | |
|
2893 | 2908 | zcs->inToCompress = 0; |
|
2894 | 2909 | zcs->inBuffPos = 0; |
|
2895 | 2910 | zcs->inBuffTarget = zcs->blockSize; |
|
2896 | 2911 | zcs->outBuffContentSize = zcs->outBuffFlushedSize = 0; |
|
2897 | 2912 | zcs->stage = zcss_load; |
|
2898 | 2913 | zcs->frameEnded = 0; |
|
2914 | zcs->pledgedSrcSize = pledgedSrcSize; | |
|
2915 | zcs->inputProcessed = 0; | |
|
2899 | 2916 | return 0; /* ready to go */ |
|
2900 | 2917 | } |
|
2901 | 2918 | |
|
2902 | 2919 | size_t ZSTD_initCStream_advanced(ZSTD_CStream* zcs, |
|
2903 | 2920 | const void* dict, size_t dictSize, |
|
2904 | 2921 | ZSTD_parameters params, unsigned long long pledgedSrcSize) |
|
2905 | 2922 | { |
|
2906 | 2923 | /* allocate buffers */ |
|
2907 | 2924 | { size_t const neededInBuffSize = (size_t)1 << params.cParams.windowLog; |
|
2908 | 2925 | if (zcs->inBuffSize < neededInBuffSize) { |
|
2909 | 2926 | zcs->inBuffSize = neededInBuffSize; |
|
2910 | 2927 | ZSTD_free(zcs->inBuff, zcs->customMem); |
|
2911 | 2928 | zcs->inBuff = (char*) ZSTD_malloc(neededInBuffSize, zcs->customMem); |
|
2912 | 2929 | if (zcs->inBuff == NULL) return ERROR(memory_allocation); |
|
2913 | 2930 | } |
|
2914 | 2931 | zcs->blockSize = MIN(ZSTD_BLOCKSIZE_ABSOLUTEMAX, neededInBuffSize); |
|
2915 | 2932 | } |
|
2916 | 2933 | if (zcs->outBuffSize < ZSTD_compressBound(zcs->blockSize)+1) { |
|
2917 | 2934 | zcs->outBuffSize = ZSTD_compressBound(zcs->blockSize)+1; |
|
2918 | 2935 | ZSTD_free(zcs->outBuff, zcs->customMem); |
|
2919 | 2936 | zcs->outBuff = (char*) ZSTD_malloc(zcs->outBuffSize, zcs->customMem); |
|
2920 | 2937 | if (zcs->outBuff == NULL) return ERROR(memory_allocation); |
|
2921 | 2938 | } |
|
2922 | 2939 | |
|
2923 | 2940 | if (dict) { |
|
2924 | 2941 | ZSTD_freeCDict(zcs->cdictLocal); |
|
2925 | 2942 | zcs->cdictLocal = ZSTD_createCDict_advanced(dict, dictSize, params, zcs->customMem); |
|
2926 | 2943 | if (zcs->cdictLocal == NULL) return ERROR(memory_allocation); |
|
2927 | 2944 | zcs->cdict = zcs->cdictLocal; |
|
2928 | 2945 | } else zcs->cdict = NULL; |
|
2929 | 2946 | |
|
2930 | 2947 | zcs->checksum = params.fParams.checksumFlag > 0; |
|
2931 | 2948 | zcs->params = params; |
|
2932 | 2949 | |
|
2933 | 2950 | return ZSTD_resetCStream(zcs, pledgedSrcSize); |
|
2934 | 2951 | } |
|
2935 | 2952 | |
|
2936 | 2953 | /* note : cdict must outlive compression session */ |
|
2937 | 2954 | size_t ZSTD_initCStream_usingCDict(ZSTD_CStream* zcs, const ZSTD_CDict* cdict) |
|
2938 | 2955 | { |
|
2939 | 2956 | ZSTD_parameters const params = ZSTD_getParamsFromCDict(cdict); |
|
2940 | 2957 | size_t const initError = ZSTD_initCStream_advanced(zcs, NULL, 0, params, 0); |
|
2941 | 2958 | zcs->cdict = cdict; |
|
2942 | 2959 | return initError; |
|
2943 | 2960 | } |
|
2944 | 2961 | |
|
2945 | 2962 | size_t ZSTD_initCStream_usingDict(ZSTD_CStream* zcs, const void* dict, size_t dictSize, int compressionLevel) |
|
2946 | 2963 | { |
|
2947 | 2964 | ZSTD_parameters const params = ZSTD_getParams(compressionLevel, 0, dictSize); |
|
2948 | 2965 | return ZSTD_initCStream_advanced(zcs, dict, dictSize, params, 0); |
|
2949 | 2966 | } |
|
2950 | 2967 | |
|
2968 | size_t ZSTD_initCStream_srcSize(ZSTD_CStream* zcs, int compressionLevel, unsigned long long pledgedSrcSize) | |
|
2969 | { | |
|
2970 | ZSTD_parameters const params = ZSTD_getParams(compressionLevel, pledgedSrcSize, 0); | |
|
2971 | return ZSTD_initCStream_advanced(zcs, NULL, 0, params, pledgedSrcSize); | |
|
2972 | } | |
|
2973 | ||
|
2951 | 2974 | size_t ZSTD_initCStream(ZSTD_CStream* zcs, int compressionLevel) |
|
2952 | 2975 | { |
|
2953 | 2976 | return ZSTD_initCStream_usingDict(zcs, NULL, 0, compressionLevel); |
|
2954 | 2977 | } |
|
2955 | 2978 | |
|
2956 | 2979 | size_t ZSTD_sizeof_CStream(const ZSTD_CStream* zcs) |
|
2957 | 2980 | { |
|
2958 | 2981 | if (zcs==NULL) return 0; /* support sizeof on NULL */ |
|
2959 | 2982 | return sizeof(zcs) + ZSTD_sizeof_CCtx(zcs->cctx) + ZSTD_sizeof_CDict(zcs->cdictLocal) + zcs->outBuffSize + zcs->inBuffSize; |
|
2960 | 2983 | } |
|
2961 | 2984 | |
|
2962 | 2985 | /*====== Compression ======*/ |
|
2963 | 2986 | |
|
2964 | 2987 | typedef enum { zsf_gather, zsf_flush, zsf_end } ZSTD_flush_e; |
|
2965 | 2988 | |
|
2966 | 2989 | MEM_STATIC size_t ZSTD_limitCopy(void* dst, size_t dstCapacity, const void* src, size_t srcSize) |
|
2967 | 2990 | { |
|
2968 | 2991 | size_t const length = MIN(dstCapacity, srcSize); |
|
2969 | 2992 | memcpy(dst, src, length); |
|
2970 | 2993 | return length; |
|
2971 | 2994 | } |
|
2972 | 2995 | |
|
2973 | 2996 | static size_t ZSTD_compressStream_generic(ZSTD_CStream* zcs, |
|
2974 | 2997 | void* dst, size_t* dstCapacityPtr, |
|
2975 | 2998 | const void* src, size_t* srcSizePtr, |
|
2976 | 2999 | ZSTD_flush_e const flush) |
|
2977 | 3000 | { |
|
2978 | 3001 | U32 someMoreWork = 1; |
|
2979 | 3002 | const char* const istart = (const char*)src; |
|
2980 | 3003 | const char* const iend = istart + *srcSizePtr; |
|
2981 | 3004 | const char* ip = istart; |
|
2982 | 3005 | char* const ostart = (char*)dst; |
|
2983 | 3006 | char* const oend = ostart + *dstCapacityPtr; |
|
2984 | 3007 | char* op = ostart; |
|
2985 | 3008 | |
|
2986 | 3009 | while (someMoreWork) { |
|
2987 | 3010 | switch(zcs->stage) |
|
2988 | 3011 | { |
|
2989 | 3012 | case zcss_init: return ERROR(init_missing); /* call ZSTD_initCStream() first ! */
|
2990 | 3013 | |
|
2991 | 3014 | case zcss_load: |
|
2992 | 3015 | /* complete inBuffer */ |
|
2993 | 3016 | { size_t const toLoad = zcs->inBuffTarget - zcs->inBuffPos; |
|
2994 | 3017 | size_t const loaded = ZSTD_limitCopy(zcs->inBuff + zcs->inBuffPos, toLoad, ip, iend-ip); |
|
2995 | 3018 | zcs->inBuffPos += loaded; |
|
2996 | 3019 | ip += loaded; |
|
2997 | 3020 | if ( (zcs->inBuffPos==zcs->inToCompress) || (!flush && (toLoad != loaded)) ) { |
|
2998 | 3021 | someMoreWork = 0; break; /* not enough input to get a full block : stop there, wait for more */ |
|
2999 | 3022 | } } |
|
3000 | 3023 | /* compress current block (note : this stage cannot be stopped in the middle) */ |
|
3001 | 3024 | { void* cDst; |
|
3002 | 3025 | size_t cSize; |
|
3003 | 3026 | size_t const iSize = zcs->inBuffPos - zcs->inToCompress; |
|
3004 | 3027 | size_t oSize = oend-op; |
|
3005 | 3028 | if (oSize >= ZSTD_compressBound(iSize)) |
|
3006 | 3029 | cDst = op; /* compress directly into output buffer (avoid flush stage) */ |
|
3007 | 3030 | else |
|
3008 | 3031 | cDst = zcs->outBuff, oSize = zcs->outBuffSize; |
|
3009 | 3032 | cSize = (flush == zsf_end) ? |
|
3010 | 3033 | ZSTD_compressEnd(zcs->cctx, cDst, oSize, zcs->inBuff + zcs->inToCompress, iSize) : |
|
3011 | 3034 | ZSTD_compressContinue(zcs->cctx, cDst, oSize, zcs->inBuff + zcs->inToCompress, iSize); |
|
3012 | 3035 | if (ZSTD_isError(cSize)) return cSize; |
|
3013 | 3036 | if (flush == zsf_end) zcs->frameEnded = 1; |
|
3014 | 3037 | /* prepare next block */ |
|
3015 | 3038 | zcs->inBuffTarget = zcs->inBuffPos + zcs->blockSize; |
|
3016 | 3039 | if (zcs->inBuffTarget > zcs->inBuffSize) |
|
3017 | 3040 | zcs->inBuffPos = 0, zcs->inBuffTarget = zcs->blockSize; /* note : inBuffSize >= blockSize */ |
|
3018 | 3041 | zcs->inToCompress = zcs->inBuffPos; |
|
3019 | 3042 | if (cDst == op) { op += cSize; break; } /* no need to flush */ |
|
3020 | 3043 | zcs->outBuffContentSize = cSize; |
|
3021 | 3044 | zcs->outBuffFlushedSize = 0; |
|
3022 | 3045 | zcs->stage = zcss_flush; /* pass-through to flush stage */ |
|
3023 | 3046 | } |
|
3024 | 3047 | |
|
3025 | 3048 | case zcss_flush: |
|
3026 | 3049 | { size_t const toFlush = zcs->outBuffContentSize - zcs->outBuffFlushedSize; |
|
3027 | 3050 | size_t const flushed = ZSTD_limitCopy(op, oend-op, zcs->outBuff + zcs->outBuffFlushedSize, toFlush); |
|
3028 | 3051 | op += flushed; |
|
3029 | 3052 | zcs->outBuffFlushedSize += flushed; |
|
3030 | 3053 | if (toFlush!=flushed) { someMoreWork = 0; break; } /* dst too small to store flushed data : stop there */ |
|
3031 | 3054 | zcs->outBuffContentSize = zcs->outBuffFlushedSize = 0; |
|
3032 | 3055 | zcs->stage = zcss_load; |
|
3033 | 3056 | break; |
|
3034 | 3057 | } |
|
3035 | 3058 | |
|
3036 | 3059 | case zcss_final: |
|
3037 | 3060 | someMoreWork = 0; /* do nothing */ |
|
3038 | 3061 | break; |
|
3039 | 3062 | |
|
3040 | 3063 | default: |
|
3041 | 3064 | return ERROR(GENERIC); /* impossible */ |
|
3042 | 3065 | } |
|
3043 | 3066 | } |
|
3044 | 3067 | |
|
3045 | 3068 | *srcSizePtr = ip - istart; |
|
3046 | 3069 | *dstCapacityPtr = op - ostart; |
|
3070 | zcs->inputProcessed += *srcSizePtr; | |
|
3047 | 3071 | if (zcs->frameEnded) return 0; |
|
3048 | 3072 | { size_t hintInSize = zcs->inBuffTarget - zcs->inBuffPos; |
|
3049 | 3073 | if (hintInSize==0) hintInSize = zcs->blockSize; |
|
3050 | 3074 | return hintInSize; |
|
3051 | 3075 | } |
|
3052 | 3076 | } |
|
3053 | 3077 | |
|
3054 | 3078 | size_t ZSTD_compressStream(ZSTD_CStream* zcs, ZSTD_outBuffer* output, ZSTD_inBuffer* input) |
|
3055 | 3079 | { |
|
3056 | 3080 | size_t sizeRead = input->size - input->pos; |
|
3057 | 3081 | size_t sizeWritten = output->size - output->pos; |
|
3058 | 3082 | size_t const result = ZSTD_compressStream_generic(zcs, |
|
3059 | 3083 | (char*)(output->dst) + output->pos, &sizeWritten, |
|
3060 | 3084 | (const char*)(input->src) + input->pos, &sizeRead, zsf_gather); |
|
3061 | 3085 | input->pos += sizeRead; |
|
3062 | 3086 | output->pos += sizeWritten; |
|
3063 | 3087 | return result; |
|
3064 | 3088 | } |
|
3065 | 3089 | |
|
3066 | 3090 | |
|
3067 | 3091 | /*====== Finalize ======*/ |
|
3068 | 3092 | |
|
3069 | 3093 | /*! ZSTD_flushStream() : |
|
3070 | 3094 | * @return : amount of data remaining to flush */ |
|
3071 | 3095 | size_t ZSTD_flushStream(ZSTD_CStream* zcs, ZSTD_outBuffer* output) |
|
3072 | 3096 | { |
|
3073 | 3097 | size_t srcSize = 0; |
|
3074 | 3098 | size_t sizeWritten = output->size - output->pos; |
|
3075 | 3099 | size_t const result = ZSTD_compressStream_generic(zcs, |
|
3076 | 3100 | (char*)(output->dst) + output->pos, &sizeWritten, |
|
3077 | 3101 | &srcSize, &srcSize, /* use a valid src address instead of NULL */ |
|
3078 | 3102 | zsf_flush); |
|
3079 | 3103 | output->pos += sizeWritten; |
|
3080 | 3104 | if (ZSTD_isError(result)) return result; |
|
3081 | 3105 | return zcs->outBuffContentSize - zcs->outBuffFlushedSize; /* remaining to flush */ |
|
3082 | 3106 | } |
|
3083 | 3107 | |
|
3084 | 3108 | |
|
3085 | 3109 | size_t ZSTD_endStream(ZSTD_CStream* zcs, ZSTD_outBuffer* output) |
|
3086 | 3110 | { |
|
3087 | 3111 | BYTE* const ostart = (BYTE*)(output->dst) + output->pos; |
|
3088 | 3112 | BYTE* const oend = (BYTE*)(output->dst) + output->size; |
|
3089 | 3113 | BYTE* op = ostart; |
|
3090 | 3114 | |
|
3115 | if ((zcs->pledgedSrcSize) && (zcs->inputProcessed != zcs->pledgedSrcSize)) | |
|
3116 | return ERROR(srcSize_wrong); /* pledgedSrcSize not respected */ | |
|
3117 | ||
|
3091 | 3118 | if (zcs->stage != zcss_final) { |
|
3092 | 3119 | /* flush whatever remains */ |
|
3093 | 3120 | size_t srcSize = 0; |
|
3094 | 3121 | size_t sizeWritten = output->size - output->pos; |
|
3095 | 3122 | size_t const notEnded = ZSTD_compressStream_generic(zcs, ostart, &sizeWritten, &srcSize, &srcSize, zsf_end); /* use a valid src address instead of NULL */ |
|
3096 | 3123 | size_t const remainingToFlush = zcs->outBuffContentSize - zcs->outBuffFlushedSize; |
|
3097 | 3124 | op += sizeWritten; |
|
3098 | 3125 | if (remainingToFlush) { |
|
3099 | 3126 | output->pos += sizeWritten; |
|
3100 | 3127 | return remainingToFlush + ZSTD_BLOCKHEADERSIZE /* final empty block */ + (zcs->checksum * 4); |
|
3101 | 3128 | } |
|
3102 | 3129 | /* create epilogue */ |
|
3103 | 3130 | zcs->stage = zcss_final; |
|
3104 | 3131 | zcs->outBuffContentSize = !notEnded ? 0 : |
|
3105 | 3132 | ZSTD_compressEnd(zcs->cctx, zcs->outBuff, zcs->outBuffSize, NULL, 0); /* write epilogue, including final empty block, into outBuff */ |
|
3106 | 3133 | } |
|
3107 | 3134 | |
|
3108 | 3135 | /* flush epilogue */ |
|
3109 | 3136 | { size_t const toFlush = zcs->outBuffContentSize - zcs->outBuffFlushedSize; |
|
3110 | 3137 | size_t const flushed = ZSTD_limitCopy(op, oend-op, zcs->outBuff + zcs->outBuffFlushedSize, toFlush); |
|
3111 | 3138 | op += flushed; |
|
3112 | 3139 | zcs->outBuffFlushedSize += flushed; |
|
3113 | 3140 | output->pos += op-ostart; |
|
3114 | 3141 | if (toFlush==flushed) zcs->stage = zcss_init; /* end reached */ |
|
3115 | 3142 | return toFlush - flushed; |
|
3116 | 3143 | } |
|
3117 | 3144 | } |
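
Putting the three streaming entry points together, a hedged driver sketch
(`sink` is a caller-supplied output callback, not a zstd API; level 3 is an
arbitrary choice):

    #include <zstd.h>

    static size_t stream_compress(ZSTD_CStream* zcs,
                                  const void* src, size_t srcSize,
                                  void* outBuf, size_t outCap,
                                  void (*sink)(const void*, size_t))
    {
        ZSTD_inBuffer input = { src, srcSize, 0 };
        size_t const initErr = ZSTD_initCStream(zcs, 3);
        if (ZSTD_isError(initErr)) return initErr;
        while (input.pos < input.size) {   /* load + flush until consumed */
            ZSTD_outBuffer output = { outBuf, outCap, 0 };
            size_t const r = ZSTD_compressStream(zcs, &output, &input);
            if (ZSTD_isError(r)) return r;
            sink(outBuf, output.pos);
        }
        for (;;) {                         /* drain the frame epilogue */
            ZSTD_outBuffer output = { outBuf, outCap, 0 };
            size_t const remaining = ZSTD_endStream(zcs, &output);
            if (ZSTD_isError(remaining)) return remaining;
            sink(outBuf, output.pos);
            if (remaining == 0) return 0;
        }
    }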
|
3118 | 3145 | |
|
3119 | 3146 | |
|
3120 | 3147 | |
|
3121 | 3148 | /*-===== Pre-defined compression levels =====-*/ |
|
3122 | 3149 | |
|
3123 | 3150 | #define ZSTD_DEFAULT_CLEVEL 1 |
|
3124 | 3151 | #define ZSTD_MAX_CLEVEL 22 |
|
3125 | 3152 | int ZSTD_maxCLevel(void) { return ZSTD_MAX_CLEVEL; } |
|
3126 | 3153 | |
|
3127 | 3154 | static const ZSTD_compressionParameters ZSTD_defaultCParameters[4][ZSTD_MAX_CLEVEL+1] = { |
|
3128 | 3155 | { /* "default" */ |
|
3129 | 3156 | /* W, C, H, S, L, TL, strat */ |
|
3130 | 3157 | { 18, 12, 12, 1, 7, 16, ZSTD_fast }, /* level 0 - never used */ |
|
3131 | 3158 | { 19, 13, 14, 1, 7, 16, ZSTD_fast }, /* level 1 */ |
|
3132 | 3159 | { 19, 15, 16, 1, 6, 16, ZSTD_fast }, /* level 2 */ |
|
3133 | 3160 | { 20, 16, 17, 1, 5, 16, ZSTD_dfast }, /* level 3.*/ |
|
3134 | 3161 | { 20, 18, 18, 1, 5, 16, ZSTD_dfast }, /* level 4.*/ |
|
3135 | 3162 | { 20, 15, 18, 3, 5, 16, ZSTD_greedy }, /* level 5 */ |
|
3136 | 3163 | { 21, 16, 19, 2, 5, 16, ZSTD_lazy }, /* level 6 */ |
|
3137 | 3164 | { 21, 17, 20, 3, 5, 16, ZSTD_lazy }, /* level 7 */ |
|
3138 | 3165 | { 21, 18, 20, 3, 5, 16, ZSTD_lazy2 }, /* level 8 */ |
|
3139 | 3166 | { 21, 20, 20, 3, 5, 16, ZSTD_lazy2 }, /* level 9 */ |
|
3140 | 3167 | { 21, 19, 21, 4, 5, 16, ZSTD_lazy2 }, /* level 10 */ |
|
3141 | 3168 | { 22, 20, 22, 4, 5, 16, ZSTD_lazy2 }, /* level 11 */ |
|
3142 | 3169 | { 22, 20, 22, 5, 5, 16, ZSTD_lazy2 }, /* level 12 */ |
|
3143 | 3170 | { 22, 21, 22, 5, 5, 16, ZSTD_lazy2 }, /* level 13 */ |
|
3144 | 3171 | { 22, 21, 22, 6, 5, 16, ZSTD_lazy2 }, /* level 14 */ |
|
3145 | 3172 | { 22, 21, 21, 5, 5, 16, ZSTD_btlazy2 }, /* level 15 */ |
|
3146 | 3173 | { 23, 22, 22, 5, 5, 16, ZSTD_btlazy2 }, /* level 16 */ |
|
    { 23, 21, 22,  4,  5, 24, ZSTD_btopt   },  /* level 17 */
    { 23, 23, 22,  6,  5, 32, ZSTD_btopt   },  /* level 18 */
    { 23, 23, 22,  6,  3, 48, ZSTD_btopt   },  /* level 19 */
    { 25, 25, 23,  7,  3, 64, ZSTD_btopt2  },  /* level 20 */
    { 26, 26, 23,  7,  3,256, ZSTD_btopt2  },  /* level 21 */
    { 27, 27, 25,  9,  3,512, ZSTD_btopt2  },  /* level 22 */
},
{   /* for srcSize <= 256 KB */
    /* W,  C,  H,  S,  L,  T, strat */
    {  0,  0,  0,  0,  0,  0, ZSTD_fast    },  /* level  0 - not used */
    { 18, 13, 14,  1,  6,  8, ZSTD_fast    },  /* level  1 */
    { 18, 14, 13,  1,  5,  8, ZSTD_dfast   },  /* level  2 */
    { 18, 16, 15,  1,  5,  8, ZSTD_dfast   },  /* level  3 */
    { 18, 15, 17,  1,  5,  8, ZSTD_greedy  },  /* level  4 */
    { 18, 16, 17,  4,  5,  8, ZSTD_greedy  },  /* level  5 */
    { 18, 16, 17,  3,  5,  8, ZSTD_lazy    },  /* level  6 */
    { 18, 17, 17,  4,  4,  8, ZSTD_lazy    },  /* level  7 */
    { 18, 17, 17,  4,  4,  8, ZSTD_lazy2   },  /* level  8 */
    { 18, 17, 17,  5,  4,  8, ZSTD_lazy2   },  /* level  9 */
    { 18, 17, 17,  6,  4,  8, ZSTD_lazy2   },  /* level 10 */
    { 18, 18, 17,  6,  4,  8, ZSTD_lazy2   },  /* level 11 */
    { 18, 18, 17,  7,  4,  8, ZSTD_lazy2   },  /* level 12 */
    { 18, 19, 17,  6,  4,  8, ZSTD_btlazy2 },  /* level 13 */
    { 18, 18, 18,  4,  4, 16, ZSTD_btopt   },  /* level 14 */
    { 18, 18, 18,  4,  3, 16, ZSTD_btopt   },  /* level 15 */
    { 18, 19, 18,  6,  3, 32, ZSTD_btopt   },  /* level 16 */
    { 18, 19, 18,  8,  3, 64, ZSTD_btopt   },  /* level 17 */
    { 18, 19, 18,  9,  3,128, ZSTD_btopt   },  /* level 18 */
    { 18, 19, 18, 10,  3,256, ZSTD_btopt   },  /* level 19 */
    { 18, 19, 18, 11,  3,512, ZSTD_btopt2  },  /* level 20 */
    { 18, 19, 18, 12,  3,512, ZSTD_btopt2  },  /* level 21 */
    { 18, 19, 18, 13,  3,512, ZSTD_btopt2  },  /* level 22 */
},
{   /* for srcSize <= 128 KB */
    /* W,  C,  H,  S,  L,  T, strat */
    { 17, 12, 12,  1,  7,  8, ZSTD_fast    },  /* level  0 - not used */
    { 17, 12, 13,  1,  6,  8, ZSTD_fast    },  /* level  1 */
    { 17, 13, 16,  1,  5,  8, ZSTD_fast    },  /* level  2 */
    { 17, 16, 16,  2,  5,  8, ZSTD_dfast   },  /* level  3 */
    { 17, 13, 15,  3,  4,  8, ZSTD_greedy  },  /* level  4 */
    { 17, 15, 17,  4,  4,  8, ZSTD_greedy  },  /* level  5 */
    { 17, 16, 17,  3,  4,  8, ZSTD_lazy    },  /* level  6 */
    { 17, 15, 17,  4,  4,  8, ZSTD_lazy2   },  /* level  7 */
    { 17, 17, 17,  4,  4,  8, ZSTD_lazy2   },  /* level  8 */
    { 17, 17, 17,  5,  4,  8, ZSTD_lazy2   },  /* level  9 */
    { 17, 17, 17,  6,  4,  8, ZSTD_lazy2   },  /* level 10 */
    { 17, 17, 17,  7,  4,  8, ZSTD_lazy2   },  /* level 11 */
    { 17, 17, 17,  8,  4,  8, ZSTD_lazy2   },  /* level 12 */
    { 17, 18, 17,  6,  4,  8, ZSTD_btlazy2 },  /* level 13 */
    { 17, 17, 17,  7,  3,  8, ZSTD_btopt   },  /* level 14 */
    { 17, 17, 17,  7,  3, 16, ZSTD_btopt   },  /* level 15 */
    { 17, 18, 17,  7,  3, 32, ZSTD_btopt   },  /* level 16 */
    { 17, 18, 17,  7,  3, 64, ZSTD_btopt   },  /* level 17 */
    { 17, 18, 17,  7,  3,256, ZSTD_btopt   },  /* level 18 */
    { 17, 18, 17,  8,  3,256, ZSTD_btopt   },  /* level 19 */
    { 17, 18, 17,  9,  3,256, ZSTD_btopt2  },  /* level 20 */
    { 17, 18, 17, 10,  3,256, ZSTD_btopt2  },  /* level 21 */
    { 17, 18, 17, 11,  3,512, ZSTD_btopt2  },  /* level 22 */
},
{   /* for srcSize <= 16 KB */
    /* W,  C,  H,  S,  L,  T, strat */
    { 14, 12, 12,  1,  7,  6, ZSTD_fast    },  /* level  0 - not used */
    { 14, 14, 14,  1,  6,  6, ZSTD_fast    },  /* level  1 */
    { 14, 14, 14,  1,  4,  6, ZSTD_fast    },  /* level  2 */
    { 14, 14, 14,  1,  4,  6, ZSTD_dfast   },  /* level  3 */
    { 14, 14, 14,  4,  4,  6, ZSTD_greedy  },  /* level  4 */
    { 14, 14, 14,  3,  4,  6, ZSTD_lazy    },  /* level  5 */
    { 14, 14, 14,  4,  4,  6, ZSTD_lazy2   },  /* level  6 */
    { 14, 14, 14,  5,  4,  6, ZSTD_lazy2   },  /* level  7 */
    { 14, 14, 14,  6,  4,  6, ZSTD_lazy2   },  /* level  8 */
    { 14, 15, 14,  6,  4,  6, ZSTD_btlazy2 },  /* level  9 */
    { 14, 15, 14,  3,  3,  6, ZSTD_btopt   },  /* level 10 */
    { 14, 15, 14,  6,  3,  8, ZSTD_btopt   },  /* level 11 */
    { 14, 15, 14,  6,  3, 16, ZSTD_btopt   },  /* level 12 */
    { 14, 15, 14,  6,  3, 24, ZSTD_btopt   },  /* level 13 */
    { 14, 15, 15,  6,  3, 48, ZSTD_btopt   },  /* level 14 */
    { 14, 15, 15,  6,  3, 64, ZSTD_btopt   },  /* level 15 */
    { 14, 15, 15,  6,  3, 96, ZSTD_btopt   },  /* level 16 */
    { 14, 15, 15,  6,  3,128, ZSTD_btopt   },  /* level 17 */
    { 14, 15, 15,  6,  3,256, ZSTD_btopt   },  /* level 18 */
    { 14, 15, 15,  7,  3,256, ZSTD_btopt   },  /* level 19 */
    { 14, 15, 15,  8,  3,256, ZSTD_btopt2  },  /* level 20 */
    { 14, 15, 15,  9,  3,256, ZSTD_btopt2  },  /* level 21 */
    { 14, 15, 15, 10,  3,256, ZSTD_btopt2  },  /* level 22 */
},
};
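The size-bucketed tables above drive level selection: W, C, H, S, L, and T abbreviate the windowLog, chainLog, hashLog, searchLog, searchLength, and targetLength fields of ZSTD_compressionParameters, and strat names the match-finder strategy. A minimal sketch (not part of the diff; it assumes the vendored zstd.h, and ZSTD_getCParams() is only exposed under ZSTD_STATIC_LINKING_ONLY) that prints what these tables yield for a few source-size hints:

#define ZSTD_STATIC_LINKING_ONLY   /* ZSTD_getCParams() is experimental API */
#include <zstd.h>
#include <stdio.h>

int main(void)
{
    /* 0 = unknown size, then 8 KB, 100 KB, and 10 MB hints. */
    static const unsigned long long hints[] = { 0, 8 * 1024, 100 * 1024, 10u << 20 };
    size_t i;
    for (i = 0; i < sizeof(hints) / sizeof(hints[0]); i++) {
        ZSTD_compressionParameters const cp = ZSTD_getCParams(3, hints[i], 0);
        printf("level 3, srcSize hint %llu: W=%u C=%u H=%u S=%u L=%u T=%u\n",
               hints[i],
               cp.windowLog, cp.chainLog, cp.hashLog,
               cp.searchLog, cp.searchLength, cp.targetLength);
    }
    return 0;
}

Note that the printed values can differ slightly from the raw table rows, because ZSTD_getCParams() runs the looked-up row through ZSTD_adjustCParams(), as the function below shows.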
|
/*! ZSTD_getCParams() :
*   @return ZSTD_compressionParameters structure for a selected compression level, `srcSize` and `dictSize`.
*   Size values are optional; provide 0 if not known or not applicable. */
ZSTD_compressionParameters ZSTD_getCParams(int compressionLevel, unsigned long long srcSize, size_t dictSize)
{
    ZSTD_compressionParameters cp;
    size_t const addedSize = srcSize ? 0 : 500;
    U64 const rSize = srcSize+dictSize ? srcSize+dictSize+addedSize : (U64)-1;
    U32 const tableID = (rSize <= 256 KB) + (rSize <= 128 KB) + (rSize <= 16 KB);   /* intentional underflow for srcSizeHint == 0 */
    if (compressionLevel <= 0) compressionLevel = ZSTD_DEFAULT_CLEVEL;   /* 0 == default; no negative compressionLevel yet */
    if (compressionLevel > ZSTD_MAX_CLEVEL) compressionLevel = ZSTD_MAX_CLEVEL;
    cp = ZSTD_defaultCParameters[tableID][compressionLevel];
    if (MEM_32bits()) {   /* auto-correction, for 32-bits mode */
        if (cp.windowLog > ZSTD_WINDOWLOG_MAX) cp.windowLog = ZSTD_WINDOWLOG_MAX;
        if (cp.chainLog > ZSTD_CHAINLOG_MAX) cp.chainLog = ZSTD_CHAINLOG_MAX;
        if (cp.hashLog > ZSTD_HASHLOG_MAX) cp.hashLog = ZSTD_HASHLOG_MAX;
    }
    cp = ZSTD_adjustCParams(cp, srcSize, dictSize);
    return cp;
}
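To make the tableID arithmetic concrete: each boolean comparison contributes 0 or 1, so smaller inputs select later (smaller-window) tables, while a wholly unknown size wraps rSize to (U64)-1, fails every test, and lands in the default table. A standalone sketch mirroring that expression (the table_id helper is illustrative, not a library function):

#include <assert.h>
#include <stdint.h>

/* Mirrors the tableID selection in ZSTD_getCParams(), for illustration. */
static uint32_t table_id(uint64_t rSize)
{
    return (rSize <= 256 * 1024) + (rSize <= 128 * 1024) + (rSize <= 16 * 1024);
}

int main(void)
{
    assert(table_id(100 * 1024) == 2);    /* 100 KB -> "<= 128 KB" table */
    assert(table_id(1024 * 1024) == 0);   /* 1 MB   -> default table */
    assert(table_id(UINT64_MAX) == 0);    /* unknown size (wrapped) -> default */
    assert(table_id(8 * 1024) == 3);      /* 8 KB   -> "<= 16 KB" table */
    return 0;
}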
|
/*! ZSTD_getParams() :
*   same as ZSTD_getCParams(), but @return a `ZSTD_parameters` object (instead of `ZSTD_compressionParameters`).
*   All fields of `ZSTD_frameParameters` are set to default (0) */
ZSTD_parameters ZSTD_getParams(int compressionLevel, unsigned long long srcSize, size_t dictSize) {
    ZSTD_parameters params;
    ZSTD_compressionParameters const cParams = ZSTD_getCParams(compressionLevel, srcSize, dictSize);
    memset(&params, 0, sizeof(params));
    params.cParams = cParams;
    return params;
}
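A hedged usage sketch for ZSTD_getParams(): the struct it returns is what the advanced one-shot entry point consumes. This assumes the experimental ZSTD_compress_advanced() declaration from the vendored 1.1.x headers; treat it as an illustration rather than the project's canonical API use:

#define ZSTD_STATIC_LINKING_ONLY   /* ZSTD_getParams()/ZSTD_compress_advanced() are experimental API */
#include <zstd.h>
#include <stdio.h>

int main(void)
{
    const char src[] = "hello hello hello hello hello hello";
    char dst[512];   /* generous capacity for this tiny payload */

    /* Full parameter set for level 5, hinting the true source size;
       frame parameters stay zeroed, per the function above. */
    ZSTD_parameters const params = ZSTD_getParams(5, sizeof(src), 0);

    ZSTD_CCtx* const cctx = ZSTD_createCCtx();
    size_t const written = ZSTD_compress_advanced(cctx, dst, sizeof(dst),
                                                  src, sizeof(src),
                                                  NULL, 0,   /* no dictionary */
                                                  params);
    if (ZSTD_isError(written))
        fprintf(stderr, "compress failed: %s\n", ZSTD_getErrorName(written));
    else
        printf("compressed %zu -> %zu bytes\n", sizeof(src), written);
    ZSTD_freeCCtx(cctx);
    return 0;
}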
|