From c4792b674688959f1be0538de76eaf1dda1faad6 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 27 Jul 2020 14:23:23 -0700
Subject: [PATCH 001/187] updated benchmark results on README.md

with newest measurements featuring XXH3 on recent systems
---
 README.md | 128 +++++++++++++++++++++++++-----------------------------
 1 file changed, 60 insertions(+), 68 deletions(-)

diff --git a/README.md b/README.md
index 01637f49..ecf96520 100644
--- a/README.md
+++ b/README.md
@@ -1,11 +1,11 @@
+
 xxHash - Extremely fast hash algorithm
 ======================================
 
-<!-- TODO: Update. -->
 xxHash is an Extremely fast Hash algorithm, running at RAM speed limits.
 It successfully completes the [SMHasher](https://code.google.com/p/smhasher/wiki/SMHasher) test suite
 which evaluates collision, dispersion and randomness qualities of hash functions.
-Code is highly portable, and hashes are identical on all platforms (little / big endian).
+Code is highly portable, and hashes are identical across all platforms (little / big endian).
 
 |Branch      |Status   |
 |------------|---------|
@@ -13,50 +13,70 @@ Code is highly portable, and hashes are identical on all platforms (little / big
 |dev         | [![Build Status](https://travis-ci.org/Cyan4973/xxHash.svg?branch=dev)](https://travis-ci.org/Cyan4973/xxHash?branch=dev) |
 
 
-
 Benchmarks
 -------------------------
 
-The benchmark uses SMHasher speed test, compiled with Visual 2010 on a Windows Seven 32-bit box.
-The reference system uses a Core 2 Duo @3GHz
-
-
-| Name          |   Speed            | Quality | Author            |
-|---------------|--------------------|:-------:|-------------------|
-| [xxHash]      | 5.4 GB/s           |   10    | Y.C.              |
-| MurmurHash 3a | 2.7 GB/s           |   10    | Austin Appleby    |
-| SBox          | 1.4 GB/s           |    9    | Bret Mulvey       |
-| Lookup3       | 1.2 GB/s           |    9    | Bob Jenkins       |
-| CityHash64    | 1.05 GB/s          |   10    | Pike & Alakuijala |
-| FNV           | 0.55 GB/s          |    5    | Fowler, Noll, Vo  |
-| CRC32         | 0.43 GB/s &dagger; |    9    |                   |
-| MD5-32        | 0.33 GB/s          |   10    | Ronald L.Rivest   |
-| SHA1-32       | 0.28 GB/s          |   10    |                   |
-
-[xxHash]: https://www.xxhash.com
+The benchmark is compiled with clang v10.0 and run on Ubuntu x64 20.04.
+The reference system uses Intel i7-9700K
+
+| Hash Name     | Width | Bandwidth (GB/s) | Small Data Velocity | Quality | Comment |
+| ---------     | ----- | ----------------- | ----- | --- | --- |
+| __XXH3__ (SSE2) |  64 | 31.5 GB/s         | 133.1 | 10
+| __XXH128__ (SSE2) | 128 | 29.6 GB/s       | 118.1 | 10
+| _RAM sequential read_ | N/A | 28.0 GB/s   |   N/A | N/A
+| City64        |    64 | 22.0 GB/s         |  76.6 | 10
+| T1ha2         |    64 | 22.0 GB/s         |  99.0 |  9 | Slightly worse [collision ratio]
+| City128       |   128 | 21.7 GB/s         |  57.7 | 10
+| __XXH64__     |    64 | 19.4 GB/s         |  71.0 | 10
+| SpookyHash    |    64 | 19.3 GB/s         |  53.2 | 10
+| Mum           |    64 | 18.0 GB/s         |  67.0 |  9 | Slightly worse [collision ratio]
+| __XXH32__     |    32 |  9.7 GB/s         |  71.9 | 10
+| City32        |    32 |  9.1 GB/s         |  66.0 | 10
+| Murmur3       |    32 |  3.9 GB/s         |  56.1 | 10
+| SipHash       |    64 |  3.0 GB/s         |  43.2 | 10
+| HighwayHash   |    64 |  1.4 GB/s         |   6.0 | 10
+| FNV64         |    64 |  1.2 GB/s         |  62.7 |  5 | Poor avalanche properties
+| Blake2        |   128 |  1.1 GB/s         |   5.1 | 10
+
+[collision ratio]: https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison#collision-study
+
+note: some algorithms feature _faster than RAM_ speed. In which case, they can only reach their full speed when input data is already in CPU cache (L3 or better). Otherwise, they max out on RAM speed limit.
+
+### Small data
+Performance on large data is only one part of the picture.
+Hashing is also very useful in constructions like hash tables and bloom filters.
+In these use cases, it's frequent to hash a lot of small data (starting at a few bytes).
+Algorithm's performance can be very different for such scenarios, since parts of the algorithm,
+such as initialization or finalization, become fixed cost.
+The impact of branch mis-prediction also becomes much more present.
+
+XXH3 has been designed for excellent performance on both long and small inputs,
+which can be observed in the following graph:
 
-Note &dagger;: SMHasher's CRC32 implementation is known to be slow. Faster implementations exist.
+![XXH3, latency, random size](https://user-images.githubusercontent.com/750081/61976089-aedeab00-af9f-11e9-9239-e5375d6c080f.png)
 
-Q.Score is a measure of quality of the hash function.
-It depends on successfully passing SMHasher test set.
-10 is a perfect score.
-Algorithms with a score < 5 are not listed on this table.
+For a more detailed analysis, visit the wiki :
+https://github.com/Cyan4973/xxHash/wiki/Performance-comparison#benchmarks-concentrating-on-small-data-
 
-A more recent version, XXH64, has been created thanks to [Mathias Westerdahl](https://github.com/JCash),
-which offers superior speed and dispersion for 64-bit systems.
-Note however that 32-bit applications will still run faster using the 32-bit version.
+Quality
+-------------------------
 
-SMHasher speed test, compiled using GCC 4.8.2, on Linux Mint 64-bit.
-The reference system uses a Core i5-3340M @2.7GHz
+Speed is not the only property that matters.
+Produced hash values must respect excellent dispersion and randomness properties,
+so that any sub-section of it can be used to maximally spread out a table or index,
+as well as reduce the amount of collisions to the minimal theoretical level, following the [birthday paradox].
 
-| Version    | Speed on 64-bit  | Speed on 32-bit  |
-|------------|------------------|------------------|
-| XXH64      | 13.8 GB/s        |  1.9 GB/s        |
-| XXH32      |  6.8 GB/s        |  6.0 GB/s        |
+`xxHash` has been tested with Austin Appleby's excellent SMHasher test suite,
+and passes all tests, ensuring reasonable quality levels.
+It also passes extended tests from [newer forks of SMHasher], featuring additional scenarios and conditions.
 
-This project also includes a command line utility, named `xxhsum`, offering similar features to `md5sum`,
-thanks to [Takayuki Matsuoka](https://github.com/t-mat)'s contributions.
+Finally, xxHash provides its own [massive collision tester](https://github.com/Cyan4973/xxHash/tree/dev/tests/collisions),
+able to generate and compare billions of hash to test the limits of 64-bit hash algorithms.
+On this front too, xxHash features good results, in line with the [birthday paradox].
+A more detailed analysis is documented [in the wiki](https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison).
 
+[birthday paradox]: https://en.wikipedia.org/wiki/Birthday_problem
+[newer forks of SMHasher]: https://github.com/rurban/smhasher
 
 ### License
 
@@ -64,30 +84,6 @@ The library files `xxhash.c` and `xxhash.h` are BSD licensed.
 The utility `xxhsum` is GPL licensed.
 
 
-### New hash algorithms
-
-Starting with `v0.7.0`, the library includes a new algorithm named `XXH3`,
-which is able to generate 64 and 128-bit hashes.
-
-The new algorithm is much faster than its predecessors for both long and small inputs,
-which can be observed in the following graphs:
-
-![XXH3, bargraph](https://user-images.githubusercontent.com/750081/61976096-b3a35f00-af9f-11e9-8229-e0afc506c6ec.png)
-
-![XXH3, latency, random size](https://user-images.githubusercontent.com/750081/61976089-aedeab00-af9f-11e9-9239-e5375d6c080f.png)
-
-To access these new prototypes, one needs to unlock their declaration, using the build macro `XXH_STATIC_LINKING_ONLY`.
-
-The algorithm is currently in development, meaning its return values might still change in future versions.
-However, the API is stable, and can be used in production,
-typically for generation of ephemeral hashes (produced and consumed in same session).
-
-`XXH3` has now reached "release candidate" status.
-If everything remains fine, its format will be "frozen" and become final.
-After which, return values of `XXH3` and `XXH128` will no longer change in future versions.
-`XXH3`'s return values will be officially finalized upon reaching `v0.8.0`.
-
-
 ### Build modifiers
 
 The following macros can be set at compilation time to modify libxxhash's behavior. They are generally disabled by default.
@@ -216,11 +212,7 @@ thanks to many great contributors.
 They are [listed here](https://www.xxhash.com/#other-languages).
 
 
-### Branch Policy
-
-> - The "master" branch is considered stable, at all times.
-> - The "dev" branch is the one where all contributions must be merged
-    before being promoted to master.
->   + If you plan to propose a patch, please commit into the "dev" branch,
-      or its own feature branch.
-      Direct commit to "master" are not permitted.
+### Special Thanks
+Takayuki Matsuoka, aka @t-mat, for creating `xxhsum -c` and general support during early xxh releases
+Mathias Westerdahl, aka @JCash, for introducing the first version of `XXH64`
+Devin Hussey, aka @easyaspi314, for excellent low-level optimizations on `XXH3` and `XXH128`

From 69b894eadeeecfaa429f8dabaf4d62b0e3f0de91 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 27 Jul 2020 14:29:39 -0700
Subject: [PATCH 002/187] fix minor formatting issues

especially in the "Thanks" section
---
 README.md | 17 +++++++++++------
 1 file changed, 11 insertions(+), 6 deletions(-)

diff --git a/README.md b/README.md
index ecf96520..a4832ab1 100644
--- a/README.md
+++ b/README.md
@@ -23,13 +23,13 @@ The reference system uses Intel i7-9700K
 | ---------     | ----- | ----------------- | ----- | --- | --- |
 | __XXH3__ (SSE2) |  64 | 31.5 GB/s         | 133.1 | 10
 | __XXH128__ (SSE2) | 128 | 29.6 GB/s       | 118.1 | 10
-| _RAM sequential read_ | N/A | 28.0 GB/s   |   N/A | N/A
+| _RAM sequential read_ | N/A | 28.0 GB/s   |   N/A | N/A | _for reference_
 | City64        |    64 | 22.0 GB/s         |  76.6 | 10
-| T1ha2         |    64 | 22.0 GB/s         |  99.0 |  9 | Slightly worse [collision ratio]
+| T1ha2         |    64 | 22.0 GB/s         |  99.0 |  9 | Slightly worse [collisions]
 | City128       |   128 | 21.7 GB/s         |  57.7 | 10
 | __XXH64__     |    64 | 19.4 GB/s         |  71.0 | 10
 | SpookyHash    |    64 | 19.3 GB/s         |  53.2 | 10
-| Mum           |    64 | 18.0 GB/s         |  67.0 |  9 | Slightly worse [collision ratio]
+| Mum           |    64 | 18.0 GB/s         |  67.0 |  9 | Slightly worse [collisions]
 | __XXH32__     |    32 |  9.7 GB/s         |  71.9 | 10
 | City32        |    32 |  9.1 GB/s         |  66.0 | 10
 | Murmur3       |    32 |  3.9 GB/s         |  56.1 | 10
@@ -38,9 +38,11 @@ The reference system uses Intel i7-9700K
 | FNV64         |    64 |  1.2 GB/s         |  62.7 |  5 | Poor avalanche properties
 | Blake2        |   128 |  1.1 GB/s         |   5.1 | 10
 
-[collision ratio]: https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison#collision-study
+[collisions]: https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison#collision-study
 
-note: some algorithms feature _faster than RAM_ speed. In which case, they can only reach their full speed when input data is already in CPU cache (L3 or better). Otherwise, they max out on RAM speed limit.
+note 1: Small data velocity is a rough evaluation of algorithm's efficiency on small data. For more detailed information, please refer to next paragraph.
+
+note 2: some algorithms feature _faster than RAM_ speed. In which case, they can only reach their full speed when input data is already in CPU cache (L3 or better). Otherwise, they max out on RAM speed limit.
 
 ### Small data
 Performance on large data is only one part of the picture.
@@ -209,10 +211,13 @@ XXH64_hash_t calcul_hash_streaming(FileHandler fh)
 Aside from the C reference version,
 xxHash is also available in many different programming languages,
 thanks to many great contributors.
-They are [listed here](https://www.xxhash.com/#other-languages).
+They are [listed here](http://www.xxhash.com/#other-languages).
 
 
 ### Special Thanks
+
 Takayuki Matsuoka, aka @t-mat, for creating `xxhsum -c` and general support during early xxh releases
+
 Mathias Westerdahl, aka @JCash, for introducing the first version of `XXH64`
+
 Devin Hussey, aka @easyaspi314, for excellent low-level optimizations on `XXH3` and `XXH128`

From b4dbf5fefc37b8a5f80b3bbc6b8ff639cf5f0ec2 Mon Sep 17 00:00:00 2001
From: Mattias Ellert <mattias.ellert@physics.uu.se>
Date: Tue, 28 Jul 2020 08:01:00 +0200
Subject: [PATCH 003/187] Fix empty version in .pc file

---
 Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/Makefile b/Makefile
index ef24e94c..da1ce068 100644
--- a/Makefile
+++ b/Makefile
@@ -437,7 +437,7 @@ libxxhash.pc: libxxhash.pc.in
           -e 's|@EXECPREFIX@|$(PCEXECDIR)|' \
           -e 's|@LIBDIR@|$(PCLIBDIR)|' \
           -e 's|@INCLUDEDIR@|$(PCINCDIR)|' \
-          -e 's|@VERSION@|$(VERSION)|' \
+          -e 's|@VERSION@|$(LIBVER)|' \
           $< > $@
 
 

From 50d5774bcf3a2173d85aab6a2d080e854ed49ca9 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Sun, 2 Aug 2020 02:05:57 -0700
Subject: [PATCH 004/187] removed highwayhash from benchmark summary

the summary is merely there to provide some rough guideline
providing comparative performance figures from well known hash algorithms.

highway was just added to please a 3rd party request,
but results are contested by another 3rd party,
this kind of debate is outside the scope of xxhash repository
which merely aims at presenting xxhash in context.

On the other hand, maybe I should consider adding some very well known algorithms
such as md5 or sha1 which are used very often in multiple contexts
in order to provide perspective.
---
 README.md | 72 +++++++++++++++++++++++++++++--------------------------
 1 file changed, 38 insertions(+), 34 deletions(-)

diff --git a/README.md b/README.md
index a4832ab1..a7884c06 100644
--- a/README.md
+++ b/README.md
@@ -16,35 +16,36 @@ Code is highly portable, and hashes are identical across all platforms (little /
 Benchmarks
 -------------------------
 
-The benchmark is compiled with clang v10.0 and run on Ubuntu x64 20.04.
-The reference system uses Intel i7-9700K
+The reference system uses an Intel i7-9700K cpu, and runs Ubuntu x64 20.04.
+The [open source benchmark program] is compiled with `clang` v10.0 using `-O3` flag.
 
 | Hash Name     | Width | Bandwidth (GB/s) | Small Data Velocity | Quality | Comment |
-| ---------     | ----- | ----------------- | ----- | --- | --- |
-| __XXH3__ (SSE2) |  64 | 31.5 GB/s         | 133.1 | 10
-| __XXH128__ (SSE2) | 128 | 29.6 GB/s       | 118.1 | 10
-| _RAM sequential read_ | N/A | 28.0 GB/s   |   N/A | N/A | _for reference_
-| City64        |    64 | 22.0 GB/s         |  76.6 | 10
-| T1ha2         |    64 | 22.0 GB/s         |  99.0 |  9 | Slightly worse [collisions]
-| City128       |   128 | 21.7 GB/s         |  57.7 | 10
-| __XXH64__     |    64 | 19.4 GB/s         |  71.0 | 10
-| SpookyHash    |    64 | 19.3 GB/s         |  53.2 | 10
-| Mum           |    64 | 18.0 GB/s         |  67.0 |  9 | Slightly worse [collisions]
-| __XXH32__     |    32 |  9.7 GB/s         |  71.9 | 10
-| City32        |    32 |  9.1 GB/s         |  66.0 | 10
-| Murmur3       |    32 |  3.9 GB/s         |  56.1 | 10
-| SipHash       |    64 |  3.0 GB/s         |  43.2 | 10
-| HighwayHash   |    64 |  1.4 GB/s         |   6.0 | 10
-| FNV64         |    64 |  1.2 GB/s         |  62.7 |  5 | Poor avalanche properties
-| Blake2        |   128 |  1.1 GB/s         |   5.1 | 10
-
+| ---------     | ----- | ---------------- | ----- | --- | --- |
+| __XXH3__ (SSE2) |  64 | 31.5 GB/s        | 133.1 | 10
+| __XXH128__ (SSE2) | 128 | 29.6 GB/s      | 118.1 | 10
+| _RAM sequential read_ | N/A | 28.0 GB/s  |   N/A | N/A | _for reference_
+| City64        |    64 | 22.0 GB/s        |  76.6 | 10
+| T1ha2         |    64 | 22.0 GB/s        |  99.0 |  9 | Slightly worse [collisions]
+| City128       |   128 | 21.7 GB/s        |  57.7 | 10
+| __XXH64__     |    64 | 19.4 GB/s        |  71.0 | 10
+| SpookyHash    |    64 | 19.3 GB/s        |  53.2 | 10
+| Mum           |    64 | 18.0 GB/s        |  67.0 |  9 | Slightly worse [collisions]
+| __XXH32__     |    32 |  9.7 GB/s        |  71.9 | 10
+| City32        |    32 |  9.1 GB/s        |  66.0 | 10
+| Murmur3       |    32 |  3.9 GB/s        |  56.1 | 10
+| SipHash       |    64 |  3.0 GB/s        |  43.2 | 10
+| FNV64         |    64 |  1.2 GB/s        |  62.7 |  5 | Poor avalanche properties
+| Blake2        |   128 |  1.1 GB/s        |   5.1 | 10
+
+[open source benchmark program]: https://github.com/Cyan4973/xxHash/tree/release/tests/bench
 [collisions]: https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison#collision-study
 
-note 1: Small data velocity is a rough evaluation of algorithm's efficiency on small data. For more detailed information, please refer to next paragraph.
+note 1: Small data velocity is a _rough_ evaluation of algorithm's efficiency on small data. For more detailed analysis, please refer to next paragraph.
 
 note 2: some algorithms feature _faster than RAM_ speed. In which case, they can only reach their full speed when input data is already in CPU cache (L3 or better). Otherwise, they max out on RAM speed limit.
 
 ### Small data
+
 Performance on large data is only one part of the picture.
 Hashing is also very useful in constructions like hash tables and bloom filters.
 In these use cases, it's frequent to hash a lot of small data (starting at a few bytes).
@@ -80,11 +81,6 @@ A more detailed analysis is documented [in the wiki](https://github.com/Cyan4973
 [birthday paradox]: https://en.wikipedia.org/wiki/Birthday_problem
 [newer forks of SMHasher]: https://github.com/rurban/smhasher
 
-### License
-
-The library files `xxhash.c` and `xxhash.h` are BSD licensed.
-The utility `xxhsum` is GPL licensed.
-
 
 ### Build modifiers
 
@@ -155,7 +151,8 @@ The xxHash port in vcpkg is kept up to date by Microsoft team members and commun
 
 ### Example
 
-Calling xxhash 64-bit variant from a C program:
+The simplest example calls xxhash 64-bit variant as a one-shot function
+generating a hash value from a single buffer, and invoked from a C/C++ program:
 
 ```C
 #include "xxhash.h"
@@ -165,7 +162,8 @@ Calling xxhash 64-bit variant from a C program:
 }
 ```
 
-Using streaming variant is more involved, but makes it possible to provide data incrementally:
+Streaming variant is more involved, but makes it possible to provide data incrementally:
+
 ```C
 #include "stdlib.h"   /* abort() */
 #include "xxhash.h"
@@ -187,17 +185,17 @@ XXH64_hash_t calcul_hash_streaming(FileHandler fh)
 
     /* Feed the state with input data, any size, any number of times */
     (...)
-    while ( /* any condition */ ) {
+    while ( /* some data left */ ) {
         size_t const length = get_more_data(buffer, bufferSize, fh);
         if (XXH64_update(state, buffer, length) == XXH_ERROR) abort();
         (...)
     }
     (...)
 
-    /* Get the hash */
+    /* Produce the final hash value */
     XXH64_hash_t const hash = XXH64_digest(state);
 
-    /* State can be re-used; in this example, it is simply freed  */
+    /* State could be re-used; but in this example, it is simply freed  */
     free(buffer);
     XXH64_freeState(state);
 
@@ -206,11 +204,17 @@ XXH64_hash_t calcul_hash_streaming(FileHandler fh)
 ```
 
 
+### License
+
+The library files `xxhash.c` and `xxhash.h` are BSD licensed.
+The utility `xxhsum` is GPL licensed.
+
+
 ### Other programming languages
 
-Aside from the C reference version,
-xxHash is also available in many different programming languages,
-thanks to many great contributors.
+Beyond the C reference version,
+xxHash is also available from many different programming languages,
+thanks to great contributors.
 They are [listed here](http://www.xxhash.com/#other-languages).
 
 

From 95014352caf0da5162bb3677ba5a7c0f7ddcd895 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Sun, 2 Aug 2020 02:57:25 -0700
Subject: [PATCH 005/187] added MD5 benchmark results

---
 README.md | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index a7884c06..78978826 100644
--- a/README.md
+++ b/README.md
@@ -35,7 +35,8 @@ The [open source benchmark program] is compiled with `clang` v10.0 using `-O3` f
 | Murmur3       |    32 |  3.9 GB/s        |  56.1 | 10
 | SipHash       |    64 |  3.0 GB/s        |  43.2 | 10
 | FNV64         |    64 |  1.2 GB/s        |  62.7 |  5 | Poor avalanche properties
-| Blake2        |   128 |  1.1 GB/s        |   5.1 | 10
+| Blake2        |   256 |  1.1 GB/s        |   5.1 | 10 | Cryptographic
+| MD5           |   128 |  0.6 GB/s        |   7.8 | 10 | Cryptographic but broken
 
 [open source benchmark program]: https://github.com/Cyan4973/xxHash/tree/release/tests/bench
 [collisions]: https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison#collision-study

From b2a1eba413e67426b7b177288f033ac5c1e3b805 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Sun, 2 Aug 2020 03:19:21 -0700
Subject: [PATCH 006/187] added SHA1 results

---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index 78978826..1207969a 100644
--- a/README.md
+++ b/README.md
@@ -36,6 +36,7 @@ The [open source benchmark program] is compiled with `clang` v10.0 using `-O3` f
 | SipHash       |    64 |  3.0 GB/s        |  43.2 | 10
 | FNV64         |    64 |  1.2 GB/s        |  62.7 |  5 | Poor avalanche properties
 | Blake2        |   256 |  1.1 GB/s        |   5.1 | 10 | Cryptographic
+| SHA1          |   160 |  0.8 GB/s        |   5.6 | 10 | Cryptographic but broken
 | MD5           |   128 |  0.6 GB/s        |   7.8 | 10 | Cryptographic but broken
 
 [open source benchmark program]: https://github.com/Cyan4973/xxHash/tree/release/tests/bench

From 7e1e4a7d6f7577d7850d3912c235ebf740b19517 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 13 Aug 2020 00:43:59 -0700
Subject: [PATCH 007/187] minor fix url in pkgconfig

since https doesn't work anymore ...
---
 libxxhash.pc.in | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/libxxhash.pc.in b/libxxhash.pc.in
index 0a52dde2..28c16448 100644
--- a/libxxhash.pc.in
+++ b/libxxhash.pc.in
@@ -9,7 +9,7 @@ libdir=${exec_prefix}/@LIBDIR@
 
 Name: xxhash
 Description: extremely fast hash algorithm
-URL: https://www.xxhash.com/
+URL: http://www.xxhash.com/
 Version: @VERSION@
 Libs: -L${libdir} -lxxhash
 Cflags: -I${includedir}

From 9481f42a629f315dacf6df8a6029451c1f9e3974 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Niklas=20Hamb=C3=BCchen?= <mail@nh2.me>
Date: Thu, 13 Aug 2020 18:34:19 +0200
Subject: [PATCH 008/187] xxhash.h: Update "still in development" comments

v0.8.0 marked XXH3 as stable.
---
 xxhash.h | 11 +++--------
 1 file changed, 3 insertions(+), 8 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 2d56d23c..400d3a21 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -440,19 +440,14 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
  *
  * The 128-bit version adds additional strength, but it is slightly slower.
  *
- * The XXH3 algorithm is still in development.
- * The results it produces may still change in future versions.
+ * Return values of XXH3 and XXH128 are officially finalized starting
+ * with v0.8.0 and will no longer change in future versions.
+ * Avoid storing values from before that release in long-term storage.
  *
  * Results produced by v0.7.x are not comparable with results from v0.7.y.
  * However, the API is completely stable, and it can safely be used for
  * ephemeral data (local sessions).
  *
- * Avoid storing values in long-term storage until the algorithm is finalized.
- * XXH3's return values will be officially finalized upon reaching v0.8.0.
- *
- * After which, return values of XXH3 and XXH128 will no longer change in
- * future versions.
- *
  * The API supports one-shot hashing, streaming mode, and custom secrets.
  */
 

From 96bd569adc71e01df0930836792de36cd7742883 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 2 Sep 2020 10:11:36 -0400
Subject: [PATCH 009/187] [WIP] Start prefixing most symbols in xxhsum.c

Make most symbols start with XSUM_ to prepare for future refactoring.

Some macros are no longer SCREAMING_SNAKE_CASE, they will be converted
to functions in the refactor.
---
 xxhsum.c | 767 +++++++++++++++++++++++++++----------------------------
 1 file changed, 378 insertions(+), 389 deletions(-)

diff --git a/xxhsum.c b/xxhsum.c
index 565eb998..4f790a1a 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -97,20 +97,20 @@
  || defined(__DJGPP__) \
  || defined(__MSYS__)
 #  include <unistd.h>   /* isatty */
-#  define IS_CONSOLE(stdStream) isatty(fileno(stdStream))
+#  define XSUM_isConsole(stdStream) isatty(fileno(stdStream))
 #elif defined(MSDOS) || defined(OS2)
 #  include <io.h>       /* _isatty */
-#  define IS_CONSOLE(stdStream) _isatty(_fileno(stdStream))
+#  define XSUM_isConsole(stdStream) _isatty(_fileno(stdStream))
 #elif defined(WIN32) || defined(_WIN32)
 #  include <io.h>      /* _isatty */
 #  include <windows.h> /* DeviceIoControl, HANDLE, FSCTL_SET_SPARSE */
 #  include <stdio.h>   /* FILE */
-static __inline int IS_CONSOLE(FILE* stdStream) {
+static __inline int XSUM_isConsole(FILE* stdStream) {
     DWORD dummy;
     return _isatty(_fileno(stdStream)) && GetConsoleMode((HANDLE)_get_osfhandle(_fileno(stdStream)), &dummy);
 }
 #else
-#  define IS_CONSOLE(stdStream) 0
+#  define XSUM_isConsole(stdStream) 0
 #endif
 
 #if defined(MSDOS) || defined(OS2) || defined(WIN32) || defined(_WIN32)
@@ -137,7 +137,7 @@ static __inline int IS_CONSOLE(FILE* stdStream) {
  * Converts a UTF-8 string to UTF-16. Acts like strdup. The string must be freed afterwards.
  * This version allows keeping the output length.
  */
-static wchar_t* utf8_to_utf16_len(const char* str, int* lenOut)
+static wchar_t* XSUM_widenString(const char* str, int* lenOut)
 {
     int const len = MultiByteToWideChar(CP_UTF8, 0, str, -1, NULL, 0);
     if (lenOut != NULL) *lenOut = len;
@@ -152,17 +152,11 @@ static wchar_t* utf8_to_utf16_len(const char* str, int* lenOut)
     }
 }
 
-/* Converts a UTF-8 string to UTF-16. Acts like strdup. The string must be freed afterwards. */
-static wchar_t* utf8_to_utf16(const char *str)
-{
-    return utf8_to_utf16_len(str, NULL);
-}
-
 /*
  * Converts a UTF-16 string to UTF-8. Acts like strdup. The string must be freed afterwards.
  * This version allows keeping the output length.
  */
-static char* utf16_to_utf8_len(const wchar_t *str, int *lenOut)
+static char* XSUM_narrowString(const wchar_t *str, int *lenOut)
 {
     int len = WideCharToMultiByte(CP_UTF8, 0, str, -1, NULL, 0, NULL, NULL);
     if (lenOut != NULL) *lenOut = len;
@@ -177,12 +171,6 @@ static char* utf16_to_utf8_len(const wchar_t *str, int *lenOut)
     }
 }
 
-/* Converts a UTF-16 string to UTF-8. Acts like strdup. The string must be freed afterwards. */
-static char *utf16_to_utf8(const wchar_t *str)
-{
-    return utf16_to_utf8_len(str, NULL);
-}
-
 /*
  * fopen wrapper that supports UTF-8
  *
@@ -190,9 +178,9 @@ static char *utf16_to_utf8(const wchar_t *str)
  *
  * In order to open a Unicode filename, we need to convert filenames to UTF-16 and use _wfopen.
  */
-static FILE* XXH_fopen_wrapped(const char *filename, const wchar_t *mode)
+static FILE* XSUM_fopen_wrapped(const char *filename, const wchar_t *mode)
 {
-    wchar_t* const wide_filename = utf8_to_utf16(filename);
+    wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
     if (wide_filename == NULL) return NULL;
     {   FILE* const f = _wfopen(wide_filename, mode);
         free(wide_filename);
@@ -219,7 +207,7 @@ static FILE* XXH_fopen_wrapped(const char *filename, const wchar_t *mode)
  *
  * Credit to t-mat: https://github.com/t-mat/xxHash/commit/5691423
  */
-static int fprintf_utf8(FILE *stream, const char *format, ...)
+static int XSUM_fprintf_utf8(FILE *stream, const char *format, ...)
 {
     int result;
     va_list args;
@@ -251,7 +239,7 @@ static int fprintf_utf8(FILE *stream, const char *format, ...)
             u8_str[nchar - 1] = '\0';
             if (result > 0) {
                 /*
-                 * Check if we are outputting to a console. Don't use IS_CONSOLE
+                 * Check if we are outputting to a console. Don't use XSUM_isConsole
                  * directly -- we don't need to call _get_osfhandle twice.
                  */
                 int fileNb = _fileno(stream);
@@ -269,7 +257,7 @@ static int fprintf_utf8(FILE *stream, const char *format, ...)
                      * default msvcrt.dll.
                      */
                     int len;
-                    wchar_t *const u16_buf = utf8_to_utf16_len(u8_str, &len);
+                    wchar_t *const u16_buf = XSUM_widenString(u8_str, &len);
                     if (u16_buf == NULL) {
                         result = -1;
                     } else {
@@ -298,9 +286,9 @@ static int fprintf_utf8(FILE *stream, const char *format, ...)
  * Since we always use literals in the "mode" argument, it is just easier to append "L" to
  * the string to make it UTF-16 and avoid the hassle of a second manual conversion.
  */
-#  define XXH_fopen(filename, mode) XXH_fopen_wrapped(filename, L##mode)
+#  define XSUM_fopen(filename, mode) XSUM_fopen_wrapped(filename, L##mode)
 #else
-#  define XXH_fopen(filename, mode) fopen(filename, mode)
+#  define XSUM_fopen(filename, mode) fopen(filename, mode)
 #endif
 
 /* ************************************
@@ -323,7 +311,7 @@ static int fprintf_utf8(FILE *stream, const char *format, ...)
     typedef unsigned long long U64;
 #endif /* not C++/C99 */
 
-static unsigned BMK_isLittleEndian(void)
+static unsigned XSUM_isLittleEndian(void)
 {
     const union { U32 u; U8 c[4]; } one = { 1 };   /* don't use static: performance detrimental  */
     return one.c[0];
@@ -449,7 +437,7 @@ static unsigned BMK_isLittleEndian(void)
 static const int g_nbBits = (int)(sizeof(void*)*8);
 static const char g_lename[] = "little endian";
 static const char g_bename[] = "big endian";
-#define ENDIAN_NAME (BMK_isLittleEndian() ? g_lename : g_bename)
+#define ENDIAN_NAME (XSUM_isLittleEndian() ? g_lename : g_bename)
 static const char author[] = "Yann Collet";
 #define WELCOME_MESSAGE(exename) "%s %s by %s \n", exename, PROGRAM_VERSION, author
 #define FULL_WELCOME_MESSAGE(exename) "%s %s by %s \n" \
@@ -461,7 +449,7 @@ static const char author[] = "Yann Collet";
 #define MB *( 1<<20)
 #define GB *(1U<<30)
 
-static size_t XXH_DEFAULT_SAMPLE_SIZE = 100 KB;
+static size_t XSUM_DEFAULT_SAMPLE_SIZE = 100 KB;
 #define NBLOOPS    3                              /* Default number of benchmark iterations */
 #define TIMELOOP_S 1
 #define TIMELOOP  (TIMELOOP_S * CLOCKS_PER_SEC)   /* target timing per iteration */
@@ -473,7 +461,7 @@ static size_t XXH_DEFAULT_SAMPLE_SIZE = 100 KB;
 
 static const char stdinName[] = "-";
 typedef enum { algo_xxh32=0, algo_xxh64=1, algo_xxh128=2 } AlgoSelected;
-static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & usage() */
+static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & XSUM_usage() */
 
 /* <16 hex char> <SPC> <SPC> <filename> <'\0'>
  * '4096' is typical Linux PATH_MAX configuration. */
@@ -487,14 +475,14 @@ static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & u
  *  Display macros
  **************************************/
 #ifdef _WIN32
-#define DISPLAY(...)         fprintf_utf8(stderr, __VA_ARGS__)
-#define DISPLAYRESULT(...)   fprintf_utf8(stdout, __VA_ARGS__)
+#define XSUM_log(...)         XSUM_fprintf_utf8(stderr, __VA_ARGS__)
+#define XSUM_output(...)   XSUM_fprintf_utf8(stdout, __VA_ARGS__)
 #else
-#define DISPLAY(...)         fprintf(stderr, __VA_ARGS__)
-#define DISPLAYRESULT(...)   fprintf(stdout, __VA_ARGS__)
+#define XSUM_log(...)         fprintf(stderr, __VA_ARGS__)
+#define XSUM_output(...)   fprintf(stdout, __VA_ARGS__)
 #endif
 
-#define DISPLAYLEVEL(l, ...) do { if (g_displayLevel>=l) DISPLAY(__VA_ARGS__); } while (0)
+#define XSUM_logVerbose(l, ...) do { if (g_displayLevel>=l) XSUM_log(__VA_ARGS__); } while (0)
 static int g_displayLevel = 2;
 
 
@@ -507,13 +495,13 @@ static U32 g_nbIterations = NBLOOPS;
 /* ************************************
  *  Benchmark Functions
  **************************************/
-static clock_t BMK_clockSpan( clock_t start )
+static clock_t XSUM_clockSpan( clock_t start )
 {
     return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
 }
 
 
-static size_t BMK_findMaxMem(U64 requiredMem)
+static size_t XSUM_findMaxMem(U64 requiredMem)
 {
     size_t const step = 64 MB;
     void* testmem = NULL;
@@ -537,7 +525,7 @@ static size_t BMK_findMaxMem(U64 requiredMem)
 }
 
 
-static U64 BMK_GetFileSize(const char* infilename)
+static U64 XSUM_GetFileSize(const char* infilename)
 {
     int r;
 #if defined(_MSC_VER)
@@ -555,7 +543,7 @@ static U64 BMK_GetFileSize(const char* infilename)
  * Allocates a string containing s1 and s2 concatenated. Acts like strdup.
  * The result must be freed.
  */
-static char* XXH_strcatDup(const char* s1, const char* s2)
+static char* XSUM_strcatDup(const char* s1, const char* s2)
 {
     assert(s1 != NULL);
     assert(s2 != NULL);
@@ -582,7 +570,7 @@ static char* XXH_strcatDup(const char* s1, const char* s2)
  *
  * This is used in the sanity check - its values must not be changed.
  */
-static void BMK_fillTestBuffer(U8* buffer, size_t len)
+static void XSUM_fillTestBuffer(U8* buffer, size_t len)
 {
     U64 byteGen = PRIME32;
     size_t i;
@@ -713,20 +701,20 @@ static const char k_testIDs_default[NB_TESTFUNC] = { 0,
         1 /*XXH128*/ };
 
 #define HASHNAME_MAX 29
-static void BMK_benchHash(hashFunction h, const char* hName, int testID,
+static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
                           const void* buffer, size_t bufferSize)
 {
     U32 nbh_perIteration = (U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
     unsigned iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
     double fastestH = 100000000.;
     assert(HASHNAME_MAX > 2);
-    DISPLAYLEVEL(2, "\r%80s\r", "");       /* Clean display line */
+    XSUM_logVerbose(2, "\r%80s\r", "");       /* Clean display line */
 
     for (iterationNb = 1; iterationNb <= nbIterations; iterationNb++) {
         U32 r=0;
         clock_t cStart;
 
-        DISPLAYLEVEL(2, "%2u-%-*.*s : %10u ->\r",
+        XSUM_logVerbose(2, "%2u-%-*.*s : %10u ->\r",
                         iterationNb,
                         HASHNAME_MAX, HASHNAME_MAX, hName,
                         (unsigned)bufferSize);
@@ -738,9 +726,9 @@ static void BMK_benchHash(hashFunction h, const char* hName, int testID,
             for (u=0; u<nbh_perIteration; u++)
                 r += h(buffer, bufferSize, u);
         }
-        if (r==0) DISPLAYLEVEL(3,".\r");  /* do something with r to defeat compiler "optimizing" hash away */
+        if (r==0) XSUM_logVerbose(3,".\r");  /* do something with r to defeat compiler "optimizing" hash away */
 
-        {   clock_t const nbTicks = BMK_clockSpan(cStart);
+        {   clock_t const nbTicks = XSUM_clockSpan(cStart);
             double const ticksPerHash = ((double)nbTicks / TIMELOOP) / nbh_perIteration;
             /*
              * clock() is the only decent portable timer, but it isn't very
@@ -781,7 +769,7 @@ static void BMK_benchHash(hashFunction h, const char* hName, int testID,
             }
             if (ticksPerHash < fastestH) fastestH = ticksPerHash;
             if (fastestH>0.) { /* avoid div by zero */
-                DISPLAYLEVEL(2, "%2u-%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \r",
+                XSUM_logVerbose(2, "%2u-%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \r",
                             iterationNb,
                             HASHNAME_MAX, HASHNAME_MAX, hName,
                             (unsigned)bufferSize,
@@ -793,27 +781,27 @@ static void BMK_benchHash(hashFunction h, const char* hName, int testID,
             nbh_perIteration = (U32)nbh_perSecond;
         }
     }
-    DISPLAYLEVEL(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
+    XSUM_logVerbose(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
                     testID,
                     HASHNAME_MAX, HASHNAME_MAX, hName,
                     (unsigned)bufferSize,
                     (double)1 / fastestH,
                     ((double)bufferSize / (1 MB)) / fastestH);
     if (g_displayLevel<1)
-        DISPLAYLEVEL(0, "%u, ", (unsigned)((double)1 / fastestH));
+        XSUM_logVerbose(0, "%u, ", (unsigned)((double)1 / fastestH));
 }
 
 
 /*!
- * BMK_benchMem():
+ * XSUM_benchMem():
  * buffer: Must be 16-byte aligned.
  * The real allocated size of buffer is supposed to be >= (bufferSize+3).
  * returns: 0 on success, 1 if error (invalid mode selected)
  */
-static void BMK_benchMem(const void* buffer, size_t bufferSize)
+static void XSUM_benchMem(const void* buffer, size_t bufferSize)
 {
     assert((((size_t)buffer) & 15) == 0);  /* ensure alignment */
-    BMK_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
+    XSUM_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
     {   int i;
         for (i = 1; i < NB_TESTFUNC; i++) {
             int const hashFuncID = (i-1) / 2;
@@ -821,51 +809,51 @@ static void BMK_benchMem(const void* buffer, size_t bufferSize)
             if (g_testIDs[i] == 0) continue;
             /* aligned */
             if ((i % 2) == 1) {
-                BMK_benchHash(g_hashesToBench[hashFuncID].func, g_hashesToBench[hashFuncID].name, i, buffer, bufferSize);
+                XSUM_benchHash(g_hashesToBench[hashFuncID].func, g_hashesToBench[hashFuncID].name, i, buffer, bufferSize);
             }
             /* unaligned */
             if ((i % 2) == 0) {
                 /* Append "unaligned". */
-                char* const hashNameBuf = XXH_strcatDup(g_hashesToBench[hashFuncID].name, " unaligned");
+                char* const hashNameBuf = XSUM_strcatDup(g_hashesToBench[hashFuncID].name, " unaligned");
                 assert(hashNameBuf != NULL);
-                BMK_benchHash(g_hashesToBench[hashFuncID].func, hashNameBuf, i, ((const char*)buffer)+3, bufferSize);
+                XSUM_benchHash(g_hashesToBench[hashFuncID].func, hashNameBuf, i, ((const char*)buffer)+3, bufferSize);
                 free(hashNameBuf);
             }
     }   }
 }
 
-static size_t BMK_selectBenchedSize(const char* fileName)
+static size_t XSUM_selectBenchedSize(const char* fileName)
 {
-    U64 const inFileSize = BMK_GetFileSize(fileName);
-    size_t benchedSize = (size_t) BMK_findMaxMem(inFileSize);
+    U64 const inFileSize = XSUM_GetFileSize(fileName);
+    size_t benchedSize = (size_t) XSUM_findMaxMem(inFileSize);
     if ((U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
     if (benchedSize < inFileSize) {
-        DISPLAY("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
+        XSUM_log("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
     }
     return benchedSize;
 }
 
 
-static int BMK_benchFiles(const char*const* fileNamesTable, int nbFiles)
+static int XSUM_benchFiles(const char*const* fileNamesTable, int nbFiles)
 {
     int fileIdx;
     for (fileIdx=0; fileIdx<nbFiles; fileIdx++) {
         const char* const inFileName = fileNamesTable[fileIdx];
         assert(inFileName != NULL);
 
-        {   FILE* const inFile = XXH_fopen( inFileName, "rb" );
-            size_t const benchedSize = BMK_selectBenchedSize(inFileName);
+        {   FILE* const inFile = XSUM_fopen( inFileName, "rb" );
+            size_t const benchedSize = XSUM_selectBenchedSize(inFileName);
             char* const buffer = (char*)calloc(benchedSize+16+3, 1);
             void* const alignedBuffer = (buffer+15) - (((size_t)(buffer+15)) & 0xF);  /* align on next 16 bytes */
 
             /* Checks */
             if (inFile==NULL){
-                DISPLAY("Error: Could not open '%s': %s.\n", inFileName, strerror(errno));
+                XSUM_log("Error: Could not open '%s': %s.\n", inFileName, strerror(errno));
                 free(buffer);
                 exit(11);
             }
             if(!buffer) {
-                DISPLAY("\nError: Out of memory.\n");
+                XSUM_log("\nError: Out of memory.\n");
                 fclose(inFile);
                 exit(12);
             }
@@ -874,13 +862,13 @@ static int BMK_benchFiles(const char*const* fileNamesTable, int nbFiles)
             {   size_t const readSize = fread(alignedBuffer, 1, benchedSize, inFile);
                 fclose(inFile);
                 if(readSize != benchedSize) {
-                    DISPLAY("\nError: Could not read '%s': %s.\n", inFileName, strerror(errno));
+                    XSUM_log("\nError: Could not read '%s': %s.\n", inFileName, strerror(errno));
                     free(buffer);
                     exit(13);
             }   }
 
             /* bench */
-            BMK_benchMem(alignedBuffer, benchedSize);
+            XSUM_benchMem(alignedBuffer, benchedSize);
 
             free(buffer);
     }   }
@@ -888,26 +876,26 @@ static int BMK_benchFiles(const char*const* fileNamesTable, int nbFiles)
 }
 
 
-static int BMK_benchInternal(size_t keySize)
+static int XSUM_benchInternal(size_t keySize)
 {
     void* const buffer = calloc(keySize+16+3, 1);
     if (buffer == NULL) {
-        DISPLAY("\nError: Out of memory.\n");
+        XSUM_log("\nError: Out of memory.\n");
         exit(12);
     }
 
     {   const void* const alignedBuffer = ((char*)buffer+15) - (((size_t)((char*)buffer+15)) & 0xF);  /* align on next 16 bytes */
 
         /* bench */
-        DISPLAYLEVEL(1, "Sample of ");
+        XSUM_logVerbose(1, "Sample of ");
         if (keySize > 10 KB) {
-            DISPLAYLEVEL(1, "%u KB", (unsigned)(keySize >> 10));
+            XSUM_logVerbose(1, "%u KB", (unsigned)(keySize >> 10));
         } else {
-            DISPLAYLEVEL(1, "%u bytes", (unsigned)keySize);
+            XSUM_logVerbose(1, "%u bytes", (unsigned)keySize);
         }
-        DISPLAYLEVEL(1, "...        \n");
+        XSUM_logVerbose(1, "...        \n");
 
-        BMK_benchMem(alignedBuffer, keySize);
+        XSUM_benchMem(alignedBuffer, keySize);
         free(buffer);
     }
     return 0;
@@ -919,50 +907,50 @@ static int BMK_benchInternal(size_t keySize)
  * ensure results consistency accross platforms
  *********************************************** */
 
-static void BMK_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
+static void XSUM_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
 {
     static int nbTests = 1;
     if (r1!=r2) {
-        DISPLAY("\rError: 32-bit hash test %i: Internal sanity check failed!\n", nbTests);
-        DISPLAY("\rGot 0x%08X, expected 0x%08X.\n", (unsigned)r1, (unsigned)r2);
-        DISPLAY("\rNote: If you modified the hash functions, make sure to either update the values\n"
-                  "or temporarily comment out the tests in BMK_sanityCheck.\n");
+        XSUM_log("\rError: 32-bit hash test %i: Internal sanity check failed!\n", nbTests);
+        XSUM_log("\rGot 0x%08X, expected 0x%08X.\n", (unsigned)r1, (unsigned)r2);
+        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
+                  "or temporarily comment out the tests in XSUM_sanityCheck.\n");
         exit(1);
     }
     nbTests++;
 }
 
-static void BMK_checkResult64(XXH64_hash_t r1, XXH64_hash_t r2)
+static void XSUM_checkResult64(XXH64_hash_t r1, XXH64_hash_t r2)
 {
     static int nbTests = 1;
     if (r1!=r2) {
-        DISPLAY("\rError: 64-bit hash test %i: Internal sanity check failed!\n", nbTests);
-        DISPLAY("\rGot 0x%08X%08XULL, expected 0x%08X%08XULL.\n",
+        XSUM_log("\rError: 64-bit hash test %i: Internal sanity check failed!\n", nbTests);
+        XSUM_log("\rGot 0x%08X%08XULL, expected 0x%08X%08XULL.\n",
                 (unsigned)(r1>>32), (unsigned)r1, (unsigned)(r2>>32), (unsigned)r2);
-        DISPLAY("\rNote: If you modified the hash functions, make sure to either update the values\n"
-                  "or temporarily comment out the tests in BMK_sanityCheck.\n");
+        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
+                  "or temporarily comment out the tests in XSUM_sanityCheck.\n");
         exit(1);
     }
     nbTests++;
 }
 
-static void BMK_checkResult128(XXH128_hash_t r1, XXH128_hash_t r2)
+static void XSUM_checkResult128(XXH128_hash_t r1, XXH128_hash_t r2)
 {
     static int nbTests = 1;
     if ((r1.low64 != r2.low64) || (r1.high64 != r2.high64)) {
-        DISPLAY("\rError: 128-bit hash test %i: Internal sanity check failed.\n", nbTests);
-        DISPLAY("\rGot { 0x%08X%08XULL, 0x%08X%08XULL }, expected { 0x%08X%08XULL, 0x%08X%08XULL } \n",
+        XSUM_log("\rError: 128-bit hash test %i: Internal sanity check failed.\n", nbTests);
+        XSUM_log("\rGot { 0x%08X%08XULL, 0x%08X%08XULL }, expected { 0x%08X%08XULL, 0x%08X%08XULL } \n",
                 (unsigned)(r1.low64>>32), (unsigned)r1.low64, (unsigned)(r1.high64>>32), (unsigned)r1.high64,
                 (unsigned)(r2.low64>>32), (unsigned)r2.low64, (unsigned)(r2.high64>>32), (unsigned)r2.high64 );
-        DISPLAY("\rNote: If you modified the hash functions, make sure to either update the values\n"
-                  "or temporarily comment out the tests in BMK_sanityCheck.\n");
+        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
+                  "or temporarily comment out the tests in XSUM_sanityCheck.\n");
         exit(1);
     }
     nbTests++;
 }
 
 
-static void BMK_testXXH32(const void* data, size_t len, U32 seed, U32 Nresult)
+static void XSUM_testXXH32(const void* data, size_t len, U32 seed, U32 Nresult)
 {
     XXH32_state_t *state = XXH32_createState();
     size_t pos;
@@ -970,20 +958,20 @@ static void BMK_testXXH32(const void* data, size_t len, U32 seed, U32 Nresult)
     assert(state != NULL);
     if (len>0) assert(data != NULL);
 
-    BMK_checkResult32(XXH32(data, len, seed), Nresult);
+    XSUM_checkResult32(XXH32(data, len, seed), Nresult);
 
     (void)XXH32_reset(state, seed);
     (void)XXH32_update(state, data, len);
-    BMK_checkResult32(XXH32_digest(state), Nresult);
+    XSUM_checkResult32(XXH32_digest(state), Nresult);
 
     (void)XXH32_reset(state, seed);
     for (pos=0; pos<len; pos++)
         (void)XXH32_update(state, ((const char*)data)+pos, 1);
-    BMK_checkResult32(XXH32_digest(state), Nresult);
+    XSUM_checkResult32(XXH32_digest(state), Nresult);
     XXH32_freeState(state);
 }
 
-static void BMK_testXXH64(const void* data, size_t len, U64 seed, U64 Nresult)
+static void XSUM_testXXH64(const void* data, size_t len, U64 seed, U64 Nresult)
 {
     XXH64_state_t *state = XXH64_createState();
     size_t pos;
@@ -991,20 +979,20 @@ static void BMK_testXXH64(const void* data, size_t len, U64 seed, U64 Nresult)
     assert(state != NULL);
     if (len>0) assert(data != NULL);
 
-    BMK_checkResult64(XXH64(data, len, seed), Nresult);
+    XSUM_checkResult64(XXH64(data, len, seed), Nresult);
 
     (void)XXH64_reset(state, seed);
     (void)XXH64_update(state, data, len);
-    BMK_checkResult64(XXH64_digest(state), Nresult);
+    XSUM_checkResult64(XXH64_digest(state), Nresult);
 
     (void)XXH64_reset(state, seed);
     for (pos=0; pos<len; pos++)
         (void)XXH64_update(state, ((const char*)data)+pos, 1);
-    BMK_checkResult64(XXH64_digest(state), Nresult);
+    XSUM_checkResult64(XXH64_digest(state), Nresult);
     XXH64_freeState(state);
 }
 
-static U32 BMK_rand(void)
+static U32 XSUM_rand(void)
 {
     static U64 seed = PRIME32;
     seed *= PRIME64;
@@ -1012,18 +1000,18 @@ static U32 BMK_rand(void)
 }
 
 
-void BMK_testXXH3(const void* data, size_t len, U64 seed, U64 Nresult)
+void XSUM_testXXH3(const void* data, size_t len, U64 seed, U64 Nresult)
 {
     if (len>0) assert(data != NULL);
 
     {   U64 const Dresult = XXH3_64bits_withSeed(data, len, seed);
-        BMK_checkResult64(Dresult, Nresult);
+        XSUM_checkResult64(Dresult, Nresult);
     }
 
     /* check that the no-seed variant produces same result as seed==0 */
     if (seed == 0) {
         U64 const Dresult = XXH3_64bits(data, len);
-        BMK_checkResult64(Dresult, Nresult);
+        XSUM_checkResult64(Dresult, Nresult);
     }
 
     /* streaming API test */
@@ -1032,19 +1020,19 @@ void BMK_testXXH3(const void* data, size_t len, U64 seed, U64 Nresult)
         /* single ingestion */
         (void)XXH3_64bits_reset_withSeed(state, seed);
         (void)XXH3_64bits_update(state, data, len);
-        BMK_checkResult64(XXH3_64bits_digest(state), Nresult);
+        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
 
         /* random ingestion */
         {   size_t p = 0;
             (void)XXH3_64bits_reset_withSeed(state, seed);
             while (p < len) {
                 size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(BMK_rand()) % modulo;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
                 if (p + l > len) l = len - p;
                 (void)XXH3_64bits_update(state, (const char*)data+p, l);
                 p += l;
             }
-            BMK_checkResult64(XXH3_64bits_digest(state), Nresult);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
         }
 
         /* byte by byte ingestion */
@@ -1052,18 +1040,18 @@ void BMK_testXXH3(const void* data, size_t len, U64 seed, U64 Nresult)
             (void)XXH3_64bits_reset_withSeed(state, seed);
             for (pos=0; pos<len; pos++)
                 (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
-            BMK_checkResult64(XXH3_64bits_digest(state), Nresult);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
         }
         XXH3_freeState(state);
     }
 }
 
-void BMK_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, U64 Nresult)
+void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, U64 Nresult)
 {
     if (len>0) assert(data != NULL);
 
     {   U64 const Dresult = XXH3_64bits_withSecret(data, len, secret, secretSize);
-        BMK_checkResult64(Dresult, Nresult);
+        XSUM_checkResult64(Dresult, Nresult);
     }
 
     /* streaming API test */
@@ -1071,19 +1059,19 @@ void BMK_testXXH3_withSecret(const void* data, size_t len, const void* secret, s
         assert(state != NULL);
         (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
         (void)XXH3_64bits_update(state, data, len);
-        BMK_checkResult64(XXH3_64bits_digest(state), Nresult);
+        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
 
         /* random ingestion */
         {   size_t p = 0;
             (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
             while (p < len) {
                 size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(BMK_rand()) % modulo;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
                 if (p + l > len) l = len - p;
                 (void)XXH3_64bits_update(state, (const char*)data+p, l);
                 p += l;
             }
-            BMK_checkResult64(XXH3_64bits_digest(state), Nresult);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
         }
 
         /* byte by byte ingestion */
@@ -1091,27 +1079,27 @@ void BMK_testXXH3_withSecret(const void* data, size_t len, const void* secret, s
             (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
             for (pos=0; pos<len; pos++)
                 (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
-            BMK_checkResult64(XXH3_64bits_digest(state), Nresult);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
         }
         XXH3_freeState(state);
     }
 }
 
-void BMK_testXXH128(const void* data, size_t len, U64 seed, XXH128_hash_t Nresult)
+void XSUM_testXXH128(const void* data, size_t len, U64 seed, XXH128_hash_t Nresult)
 {
     {   XXH128_hash_t const Dresult = XXH3_128bits_withSeed(data, len, seed);
-        BMK_checkResult128(Dresult, Nresult);
+        XSUM_checkResult128(Dresult, Nresult);
     }
 
     /* check that XXH128() is identical to XXH3_128bits_withSeed() */
     {   XXH128_hash_t const Dresult2 = XXH128(data, len, seed);
-        BMK_checkResult128(Dresult2, Nresult);
+        XSUM_checkResult128(Dresult2, Nresult);
     }
 
     /* check that the no-seed variant produces same result as seed==0 */
     if (seed == 0) {
         XXH128_hash_t const Dresult = XXH3_128bits(data, len);
-        BMK_checkResult128(Dresult, Nresult);
+        XSUM_checkResult128(Dresult, Nresult);
     }
 
     /* streaming API test */
@@ -1121,19 +1109,19 @@ void BMK_testXXH128(const void* data, size_t len, U64 seed, XXH128_hash_t Nresul
         /* single ingestion */
         (void)XXH3_128bits_reset_withSeed(state, seed);
         (void)XXH3_128bits_update(state, data, len);
-        BMK_checkResult128(XXH3_128bits_digest(state), Nresult);
+        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
 
         /* random ingestion */
         {   size_t p = 0;
             (void)XXH3_128bits_reset_withSeed(state, seed);
             while (p < len) {
                 size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(BMK_rand()) % modulo;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
                 if (p + l > len) l = len - p;
                 (void)XXH3_128bits_update(state, (const char*)data+p, l);
                 p += l;
             }
-            BMK_checkResult128(XXH3_128bits_digest(state), Nresult);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
         }
 
         /* byte by byte ingestion */
@@ -1141,18 +1129,18 @@ void BMK_testXXH128(const void* data, size_t len, U64 seed, XXH128_hash_t Nresul
             (void)XXH3_128bits_reset_withSeed(state, seed);
             for (pos=0; pos<len; pos++)
                 (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
-            BMK_checkResult128(XXH3_128bits_digest(state), Nresult);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
         }
         XXH3_freeState(state);
     }
 }
 
-void BMK_testXXH128_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XXH128_hash_t Nresult)
+void XSUM_testXXH128_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XXH128_hash_t Nresult)
 {
     if (len>0) assert(data != NULL);
 
     {   XXH128_hash_t const Dresult = XXH3_128bits_withSecret(data, len, secret, secretSize);
-        BMK_checkResult128(Dresult, Nresult);
+        XSUM_checkResult128(Dresult, Nresult);
     }
 
     /* streaming API test */
@@ -1160,19 +1148,19 @@ void BMK_testXXH128_withSecret(const void* data, size_t len, const void* secret,
         assert(state != NULL);
         (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
         (void)XXH3_128bits_update(state, data, len);
-        BMK_checkResult128(XXH3_128bits_digest(state), Nresult);
+        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
 
         /* random ingestion */
         {   size_t p = 0;
             (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
             while (p < len) {
                 size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(BMK_rand()) % modulo;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
                 if (p + l > len) l = len - p;
                 (void)XXH3_128bits_update(state, (const char*)data+p, l);
                 p += l;
             }
-            BMK_checkResult128(XXH3_128bits_digest(state), Nresult);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
         }
 
         /* byte by byte ingestion */
@@ -1180,7 +1168,7 @@ void BMK_testXXH128_withSecret(const void* data, size_t len, const void* secret,
             (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
             for (pos=0; pos<len; pos++)
                 (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
-            BMK_checkResult128(XXH3_128bits_digest(state), Nresult);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
         }
         XXH3_freeState(state);
     }
@@ -1189,7 +1177,7 @@ void BMK_testXXH128_withSecret(const void* data, size_t len, const void* secret,
 #define SECRET_SAMPLE_NBBYTES 4
 typedef struct { U8 byte[SECRET_SAMPLE_NBBYTES]; } verifSample_t;
 
-void BMK_testSecretGenerator(const void* customSeed, size_t len, verifSample_t result)
+void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSample_t result)
 {
     static int nbTests = 1;
     const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};
@@ -1202,8 +1190,8 @@ void BMK_testSecretGenerator(const void* customSeed, size_t len, verifSample_t r
         samples.byte[i] = secretBuffer[sampleIndex[i]];
     }
     if (memcmp(&samples, &result, sizeof(result))) {
-        DISPLAY("\rError: Secret generation test %i: Internal sanity check failed. \n", nbTests);
-        DISPLAY("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
+        XSUM_log("\rError: Secret generation test %i: Internal sanity check failed. \n", nbTests);
+        XSUM_log("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
                 samples.byte[0], samples.byte[1], samples.byte[2], samples.byte[3],
                 result.byte[0], result.byte[1], result.byte[2], result.byte[3] );
         exit(1);
@@ -1213,163 +1201,163 @@ void BMK_testSecretGenerator(const void* customSeed, size_t len, verifSample_t r
 
 
 /*!
- * BMK_sanityCheck():
+ * XSUM_sanityCheck():
  * Runs a sanity check before the benchmark.
  *
  * Exits on an incorrect output.
  */
-static void BMK_sanityCheck(void)
+static void XSUM_sanityCheck(void)
 {
 #define SANITY_BUFFER_SIZE 2367
     U8 sanityBuffer[SANITY_BUFFER_SIZE];
-    BMK_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
-
-    BMK_testXXH32(NULL,          0, 0,       0x02CC5D05);
-    BMK_testXXH32(NULL,          0, PRIME32, 0x36B78AE7);
-    BMK_testXXH32(sanityBuffer,  1, 0,       0xCF65B03E);
-    BMK_testXXH32(sanityBuffer,  1, PRIME32, 0xB4545AA4);
-    BMK_testXXH32(sanityBuffer, 14, 0,       0x1208E7E2);
-    BMK_testXXH32(sanityBuffer, 14, PRIME32, 0x6AF1D1FE);
-    BMK_testXXH32(sanityBuffer,222, 0,       0x5BD11DBD);
-    BMK_testXXH32(sanityBuffer,222, PRIME32, 0x58803C5F);
-
-    BMK_testXXH64(NULL        ,  0, 0,       0xEF46DB3751D8E999ULL);
-    BMK_testXXH64(NULL        ,  0, PRIME32, 0xAC75FDA2929B17EFULL);
-    BMK_testXXH64(sanityBuffer,  1, 0,       0xE934A84ADB052768ULL);
-    BMK_testXXH64(sanityBuffer,  1, PRIME32, 0x5014607643A9B4C3ULL);
-    BMK_testXXH64(sanityBuffer,  4, 0,       0x9136A0DCA57457EEULL);
-    BMK_testXXH64(sanityBuffer, 14, 0,       0x8282DCC4994E35C8ULL);
-    BMK_testXXH64(sanityBuffer, 14, PRIME32, 0xC3BD6BF63DEB6DF0ULL);
-    BMK_testXXH64(sanityBuffer,222, 0,       0xB641AE8CB691C174ULL);
-    BMK_testXXH64(sanityBuffer,222, PRIME32, 0x20CB8AB7AE10C14AULL);
-
-    BMK_testXXH3(NULL,           0, 0,       0x2D06800538D394C2ULL);  /* empty string */
-    BMK_testXXH3(NULL,           0, PRIME64, 0xA8A6B918B2F0364AULL);
-    BMK_testXXH3(sanityBuffer,   1, 0,       0xC44BDFF4074EECDBULL);  /*  1 -  3 */
-    BMK_testXXH3(sanityBuffer,   1, PRIME64, 0x032BE332DD766EF8ULL);  /*  1 -  3 */
-    BMK_testXXH3(sanityBuffer,   6, 0,       0x27B56A84CD2D7325ULL);  /*  4 -  8 */
-    BMK_testXXH3(sanityBuffer,   6, PRIME64, 0x84589C116AB59AB9ULL);  /*  4 -  8 */
-    BMK_testXXH3(sanityBuffer,  12, 0,       0xA713DAF0DFBB77E7ULL);  /*  9 - 16 */
-    BMK_testXXH3(sanityBuffer,  12, PRIME64, 0xE7303E1B2336DE0EULL);  /*  9 - 16 */
-    BMK_testXXH3(sanityBuffer,  24, 0,       0xA3FE70BF9D3510EBULL);  /* 17 - 32 */
-    BMK_testXXH3(sanityBuffer,  24, PRIME64, 0x850E80FC35BDD690ULL);  /* 17 - 32 */
-    BMK_testXXH3(sanityBuffer,  48, 0,       0x397DA259ECBA1F11ULL);  /* 33 - 64 */
-    BMK_testXXH3(sanityBuffer,  48, PRIME64, 0xADC2CBAA44ACC616ULL);  /* 33 - 64 */
-    BMK_testXXH3(sanityBuffer,  80, 0,       0xBCDEFBBB2C47C90AULL);  /* 65 - 96 */
-    BMK_testXXH3(sanityBuffer,  80, PRIME64, 0xC6DD0CB699532E73ULL);  /* 65 - 96 */
-    BMK_testXXH3(sanityBuffer, 195, 0,       0xCD94217EE362EC3AULL);  /* 129-240 */
-    BMK_testXXH3(sanityBuffer, 195, PRIME64, 0xBA68003D370CB3D9ULL);  /* 129-240 */
-
-    BMK_testXXH3(sanityBuffer, 403, 0,       0xCDEB804D65C6DEA4ULL);  /* one block, last stripe is overlapping */
-    BMK_testXXH3(sanityBuffer, 403, PRIME64, 0x6259F6ECFD6443FDULL);  /* one block, last stripe is overlapping */
-    BMK_testXXH3(sanityBuffer, 512, 0,       0x617E49599013CB6BULL);  /* one block, finishing at stripe boundary */
-    BMK_testXXH3(sanityBuffer, 512, PRIME64, 0x3CE457DE14C27708ULL);  /* one block, finishing at stripe boundary */
-    BMK_testXXH3(sanityBuffer,2048, 0,       0xDD59E2C3A5F038E0ULL);  /* 2 blocks, finishing at block boundary */
-    BMK_testXXH3(sanityBuffer,2048, PRIME64, 0x66F81670669ABABCULL);  /* 2 blocks, finishing at block boundary */
-    BMK_testXXH3(sanityBuffer,2240, 0,       0x6E73A90539CF2948ULL);  /* 3 blocks, finishing at stripe boundary */
-    BMK_testXXH3(sanityBuffer,2240, PRIME64, 0x757BA8487D1B5247ULL);  /* 3 blocks, finishing at stripe boundary */
-    BMK_testXXH3(sanityBuffer,2367, 0,       0xCB37AEB9E5D361EDULL);  /* 3 blocks, last stripe is overlapping */
-    BMK_testXXH3(sanityBuffer,2367, PRIME64, 0xD2DB3415B942B42AULL);  /* 3 blocks, last stripe is overlapping */
+    XSUM_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
+
+    XSUM_testXXH32(NULL,          0, 0,       0x02CC5D05);
+    XSUM_testXXH32(NULL,          0, PRIME32, 0x36B78AE7);
+    XSUM_testXXH32(sanityBuffer,  1, 0,       0xCF65B03E);
+    XSUM_testXXH32(sanityBuffer,  1, PRIME32, 0xB4545AA4);
+    XSUM_testXXH32(sanityBuffer, 14, 0,       0x1208E7E2);
+    XSUM_testXXH32(sanityBuffer, 14, PRIME32, 0x6AF1D1FE);
+    XSUM_testXXH32(sanityBuffer,222, 0,       0x5BD11DBD);
+    XSUM_testXXH32(sanityBuffer,222, PRIME32, 0x58803C5F);
+
+    XSUM_testXXH64(NULL        ,  0, 0,       0xEF46DB3751D8E999ULL);
+    XSUM_testXXH64(NULL        ,  0, PRIME32, 0xAC75FDA2929B17EFULL);
+    XSUM_testXXH64(sanityBuffer,  1, 0,       0xE934A84ADB052768ULL);
+    XSUM_testXXH64(sanityBuffer,  1, PRIME32, 0x5014607643A9B4C3ULL);
+    XSUM_testXXH64(sanityBuffer,  4, 0,       0x9136A0DCA57457EEULL);
+    XSUM_testXXH64(sanityBuffer, 14, 0,       0x8282DCC4994E35C8ULL);
+    XSUM_testXXH64(sanityBuffer, 14, PRIME32, 0xC3BD6BF63DEB6DF0ULL);
+    XSUM_testXXH64(sanityBuffer,222, 0,       0xB641AE8CB691C174ULL);
+    XSUM_testXXH64(sanityBuffer,222, PRIME32, 0x20CB8AB7AE10C14AULL);
+
+    XSUM_testXXH3(NULL,           0, 0,       0x2D06800538D394C2ULL);  /* empty string */
+    XSUM_testXXH3(NULL,           0, PRIME64, 0xA8A6B918B2F0364AULL);
+    XSUM_testXXH3(sanityBuffer,   1, 0,       0xC44BDFF4074EECDBULL);  /*  1 -  3 */
+    XSUM_testXXH3(sanityBuffer,   1, PRIME64, 0x032BE332DD766EF8ULL);  /*  1 -  3 */
+    XSUM_testXXH3(sanityBuffer,   6, 0,       0x27B56A84CD2D7325ULL);  /*  4 -  8 */
+    XSUM_testXXH3(sanityBuffer,   6, PRIME64, 0x84589C116AB59AB9ULL);  /*  4 -  8 */
+    XSUM_testXXH3(sanityBuffer,  12, 0,       0xA713DAF0DFBB77E7ULL);  /*  9 - 16 */
+    XSUM_testXXH3(sanityBuffer,  12, PRIME64, 0xE7303E1B2336DE0EULL);  /*  9 - 16 */
+    XSUM_testXXH3(sanityBuffer,  24, 0,       0xA3FE70BF9D3510EBULL);  /* 17 - 32 */
+    XSUM_testXXH3(sanityBuffer,  24, PRIME64, 0x850E80FC35BDD690ULL);  /* 17 - 32 */
+    XSUM_testXXH3(sanityBuffer,  48, 0,       0x397DA259ECBA1F11ULL);  /* 33 - 64 */
+    XSUM_testXXH3(sanityBuffer,  48, PRIME64, 0xADC2CBAA44ACC616ULL);  /* 33 - 64 */
+    XSUM_testXXH3(sanityBuffer,  80, 0,       0xBCDEFBBB2C47C90AULL);  /* 65 - 96 */
+    XSUM_testXXH3(sanityBuffer,  80, PRIME64, 0xC6DD0CB699532E73ULL);  /* 65 - 96 */
+    XSUM_testXXH3(sanityBuffer, 195, 0,       0xCD94217EE362EC3AULL);  /* 129-240 */
+    XSUM_testXXH3(sanityBuffer, 195, PRIME64, 0xBA68003D370CB3D9ULL);  /* 129-240 */
+
+    XSUM_testXXH3(sanityBuffer, 403, 0,       0xCDEB804D65C6DEA4ULL);  /* one block, last stripe is overlapping */
+    XSUM_testXXH3(sanityBuffer, 403, PRIME64, 0x6259F6ECFD6443FDULL);  /* one block, last stripe is overlapping */
+    XSUM_testXXH3(sanityBuffer, 512, 0,       0x617E49599013CB6BULL);  /* one block, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer, 512, PRIME64, 0x3CE457DE14C27708ULL);  /* one block, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer,2048, 0,       0xDD59E2C3A5F038E0ULL);  /* 2 blocks, finishing at block boundary */
+    XSUM_testXXH3(sanityBuffer,2048, PRIME64, 0x66F81670669ABABCULL);  /* 2 blocks, finishing at block boundary */
+    XSUM_testXXH3(sanityBuffer,2240, 0,       0x6E73A90539CF2948ULL);  /* 3 blocks, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer,2240, PRIME64, 0x757BA8487D1B5247ULL);  /* 3 blocks, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer,2367, 0,       0xCB37AEB9E5D361EDULL);  /* 3 blocks, last stripe is overlapping */
+    XSUM_testXXH3(sanityBuffer,2367, PRIME64, 0xD2DB3415B942B42AULL);  /* 3 blocks, last stripe is overlapping */
 
     /* XXH3 with Custom Secret */
     {   const void* const secret = sanityBuffer + 7;
         const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
         assert(sizeof(sanityBuffer) >= 7 + secretSize);
-        BMK_testXXH3_withSecret(NULL,           0, secret, secretSize, 0x3559D64878C5C66CULL);  /* empty string */
-        BMK_testXXH3_withSecret(sanityBuffer,   1, secret, secretSize, 0x8A52451418B2DA4DULL);  /*  1 -  3 */
-        BMK_testXXH3_withSecret(sanityBuffer,   6, secret, secretSize, 0x82C90AB0519369ADULL);  /*  4 -  8 */
-        BMK_testXXH3_withSecret(sanityBuffer,  12, secret, secretSize, 0x14631E773B78EC57ULL);  /*  9 - 16 */
-        BMK_testXXH3_withSecret(sanityBuffer,  24, secret, secretSize, 0xCDD5542E4A9D9FE8ULL);  /* 17 - 32 */
-        BMK_testXXH3_withSecret(sanityBuffer,  48, secret, secretSize, 0x33ABD54D094B2534ULL);  /* 33 - 64 */
-        BMK_testXXH3_withSecret(sanityBuffer,  80, secret, secretSize, 0xE687BA1684965297ULL);  /* 65 - 96 */
-        BMK_testXXH3_withSecret(sanityBuffer, 195, secret, secretSize, 0xA057273F5EECFB20ULL);  /* 129-240 */
+        XSUM_testXXH3_withSecret(NULL,           0, secret, secretSize, 0x3559D64878C5C66CULL);  /* empty string */
+        XSUM_testXXH3_withSecret(sanityBuffer,   1, secret, secretSize, 0x8A52451418B2DA4DULL);  /*  1 -  3 */
+        XSUM_testXXH3_withSecret(sanityBuffer,   6, secret, secretSize, 0x82C90AB0519369ADULL);  /*  4 -  8 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  12, secret, secretSize, 0x14631E773B78EC57ULL);  /*  9 - 16 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  24, secret, secretSize, 0xCDD5542E4A9D9FE8ULL);  /* 17 - 32 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  48, secret, secretSize, 0x33ABD54D094B2534ULL);  /* 33 - 64 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  80, secret, secretSize, 0xE687BA1684965297ULL);  /* 65 - 96 */
+        XSUM_testXXH3_withSecret(sanityBuffer, 195, secret, secretSize, 0xA057273F5EECFB20ULL);  /* 129-240 */
 
-        BMK_testXXH3_withSecret(sanityBuffer, 403, secret, secretSize, 0x14546019124D43B8ULL);  /* one block, last stripe is overlapping */
-        BMK_testXXH3_withSecret(sanityBuffer, 512, secret, secretSize, 0x7564693DD526E28DULL);  /* one block, finishing at stripe boundary */
-        BMK_testXXH3_withSecret(sanityBuffer,2048, secret, secretSize, 0xD32E975821D6519FULL);  /* >= 2 blocks, at least one scrambling */
-        BMK_testXXH3_withSecret(sanityBuffer,2367, secret, secretSize, 0x293FA8E5173BB5E7ULL);  /* >= 2 blocks, at least one scrambling, last stripe unaligned */
+        XSUM_testXXH3_withSecret(sanityBuffer, 403, secret, secretSize, 0x14546019124D43B8ULL);  /* one block, last stripe is overlapping */
+        XSUM_testXXH3_withSecret(sanityBuffer, 512, secret, secretSize, 0x7564693DD526E28DULL);  /* one block, finishing at stripe boundary */
+        XSUM_testXXH3_withSecret(sanityBuffer,2048, secret, secretSize, 0xD32E975821D6519FULL);  /* >= 2 blocks, at least one scrambling */
+        XSUM_testXXH3_withSecret(sanityBuffer,2367, secret, secretSize, 0x293FA8E5173BB5E7ULL);  /* >= 2 blocks, at least one scrambling, last stripe unaligned */
 
-        BMK_testXXH3_withSecret(sanityBuffer,64*10*3, secret, secretSize, 0x751D2EC54BC6038BULL);  /* exactly 3 full blocks, not a multiple of 256 */
+        XSUM_testXXH3_withSecret(sanityBuffer,64*10*3, secret, secretSize, 0x751D2EC54BC6038BULL);  /* exactly 3 full blocks, not a multiple of 256 */
     }
 
     /* XXH128 */
     {   XXH128_hash_t const expected = { 0x6001C324468D497FULL, 0x99AA06D3014798D8ULL };
-        BMK_testXXH128(NULL,           0, 0,     expected);         /* empty string */
+        XSUM_testXXH128(NULL,           0, 0,     expected);         /* empty string */
     }
     {   XXH128_hash_t const expected = { 0x5444F7869C671AB0ULL, 0x92220AE55E14AB50ULL };
-        BMK_testXXH128(NULL,           0, PRIME32, expected);
+        XSUM_testXXH128(NULL,           0, PRIME32, expected);
     }
     {   XXH128_hash_t const expected = { 0xC44BDFF4074EECDBULL, 0xA6CD5E9392000F6AULL };
-        BMK_testXXH128(sanityBuffer,   1, 0,       expected);       /* 1-3 */
+        XSUM_testXXH128(sanityBuffer,   1, 0,       expected);       /* 1-3 */
     }
     {   XXH128_hash_t const expected = { 0xB53D5557E7F76F8DULL, 0x89B99554BA22467CULL };
-        BMK_testXXH128(sanityBuffer,   1, PRIME32, expected);       /* 1-3 */
+        XSUM_testXXH128(sanityBuffer,   1, PRIME32, expected);       /* 1-3 */
     }
     {   XXH128_hash_t const expected = { 0x3E7039BDDA43CFC6ULL, 0x082AFE0B8162D12AULL };
-        BMK_testXXH128(sanityBuffer,   6, 0,       expected);       /* 4-8 */
+        XSUM_testXXH128(sanityBuffer,   6, 0,       expected);       /* 4-8 */
     }
     {   XXH128_hash_t const expected = { 0x269D8F70BE98856EULL, 0x5A865B5389ABD2B1ULL };
-        BMK_testXXH128(sanityBuffer,   6, PRIME32, expected);       /* 4-8 */
+        XSUM_testXXH128(sanityBuffer,   6, PRIME32, expected);       /* 4-8 */
     }
     {   XXH128_hash_t const expected = { 0x061A192713F69AD9ULL, 0x6E3EFD8FC7802B18ULL };
-        BMK_testXXH128(sanityBuffer,  12, 0,       expected);       /* 9-16 */
+        XSUM_testXXH128(sanityBuffer,  12, 0,       expected);       /* 9-16 */
     }
     {   XXH128_hash_t const expected = { 0x9BE9F9A67F3C7DFBULL, 0xD7E09D518A3405D3ULL };
-        BMK_testXXH128(sanityBuffer,  12, PRIME32, expected);       /* 9-16 */
+        XSUM_testXXH128(sanityBuffer,  12, PRIME32, expected);       /* 9-16 */
     }
     {   XXH128_hash_t const expected = { 0x1E7044D28B1B901DULL, 0x0CE966E4678D3761ULL };
-        BMK_testXXH128(sanityBuffer,  24, 0,       expected);       /* 17-32 */
+        XSUM_testXXH128(sanityBuffer,  24, 0,       expected);       /* 17-32 */
     }
     {   XXH128_hash_t const expected = { 0xD7304C54EBAD40A9ULL, 0x3162026714A6A243ULL };
-        BMK_testXXH128(sanityBuffer,  24, PRIME32, expected);       /* 17-32 */
+        XSUM_testXXH128(sanityBuffer,  24, PRIME32, expected);       /* 17-32 */
     }
     {   XXH128_hash_t const expected = { 0xF942219AED80F67BULL, 0xA002AC4E5478227EULL };
-        BMK_testXXH128(sanityBuffer,  48, 0,       expected);       /* 33-64 */
+        XSUM_testXXH128(sanityBuffer,  48, 0,       expected);       /* 33-64 */
     }
     {   XXH128_hash_t const expected = { 0x7BA3C3E453A1934EULL, 0x163ADDE36C072295ULL };
-        BMK_testXXH128(sanityBuffer,  48, PRIME32, expected);       /* 33-64 */
+        XSUM_testXXH128(sanityBuffer,  48, PRIME32, expected);       /* 33-64 */
     }
     {   XXH128_hash_t const expected = { 0x5E8BAFB9F95FB803ULL, 0x4952F58181AB0042ULL };
-        BMK_testXXH128(sanityBuffer,  81, 0,       expected);       /* 65-96 */
+        XSUM_testXXH128(sanityBuffer,  81, 0,       expected);       /* 65-96 */
     }
     {   XXH128_hash_t const expected = { 0x703FBB3D7A5F755CULL, 0x2724EC7ADC750FB6ULL };
-        BMK_testXXH128(sanityBuffer,  81, PRIME32, expected);       /* 65-96 */
+        XSUM_testXXH128(sanityBuffer,  81, PRIME32, expected);       /* 65-96 */
     }
     {   XXH128_hash_t const expected = { 0xF1AEBD597CEC6B3AULL, 0x337E09641B948717ULL };
-        BMK_testXXH128(sanityBuffer, 222, 0,       expected);       /* 129-240 */
+        XSUM_testXXH128(sanityBuffer, 222, 0,       expected);       /* 129-240 */
     }
     {   XXH128_hash_t const expected = { 0xAE995BB8AF917A8DULL, 0x91820016621E97F1ULL };
-        BMK_testXXH128(sanityBuffer, 222, PRIME32, expected);       /* 129-240 */
+        XSUM_testXXH128(sanityBuffer, 222, PRIME32, expected);       /* 129-240 */
     }
     {   XXH128_hash_t const expected = { 0xCDEB804D65C6DEA4ULL, 0x1B6DE21E332DD73DULL };
-        BMK_testXXH128(sanityBuffer, 403, 0,       expected);       /* one block, last stripe is overlapping */
+        XSUM_testXXH128(sanityBuffer, 403, 0,       expected);       /* one block, last stripe is overlapping */
     }
     {   XXH128_hash_t const expected = { 0x6259F6ECFD6443FDULL, 0xBED311971E0BE8F2ULL };
-        BMK_testXXH128(sanityBuffer, 403, PRIME64, expected);       /* one block, last stripe is overlapping */
+        XSUM_testXXH128(sanityBuffer, 403, PRIME64, expected);       /* one block, last stripe is overlapping */
     }
     {   XXH128_hash_t const expected = { 0x617E49599013CB6BULL, 0x18D2D110DCC9BCA1ULL };
-        BMK_testXXH128(sanityBuffer, 512, 0,       expected);       /* one block, finishing at stripe boundary */
+        XSUM_testXXH128(sanityBuffer, 512, 0,       expected);       /* one block, finishing at stripe boundary */
     }
     {   XXH128_hash_t const expected = { 0x3CE457DE14C27708ULL, 0x925D06B8EC5B8040ULL };
-        BMK_testXXH128(sanityBuffer, 512, PRIME64, expected);       /* one block, finishing at stripe boundary */
+        XSUM_testXXH128(sanityBuffer, 512, PRIME64, expected);       /* one block, finishing at stripe boundary */
     }
     {   XXH128_hash_t const expected = { 0xDD59E2C3A5F038E0ULL, 0xF736557FD47073A5ULL };
-        BMK_testXXH128(sanityBuffer,2048, 0,       expected);       /* two blocks, finishing at block boundary */
+        XSUM_testXXH128(sanityBuffer,2048, 0,       expected);       /* two blocks, finishing at block boundary */
     }
     {   XXH128_hash_t const expected = { 0x230D43F30206260BULL, 0x7FB03F7E7186C3EAULL };
-        BMK_testXXH128(sanityBuffer,2048, PRIME32, expected);       /* two blocks, finishing at block boundary */
+        XSUM_testXXH128(sanityBuffer,2048, PRIME32, expected);       /* two blocks, finishing at block boundary */
     }
     {   XXH128_hash_t const expected = { 0x6E73A90539CF2948ULL, 0xCCB134FBFA7CE49DULL };
-        BMK_testXXH128(sanityBuffer,2240, 0,       expected);      /* two blocks, ends at stripe boundary */
+        XSUM_testXXH128(sanityBuffer,2240, 0,       expected);      /* two blocks, ends at stripe boundary */
     }
     {   XXH128_hash_t const expected = { 0xED385111126FBA6FULL, 0x50A1FE17B338995FULL };
-        BMK_testXXH128(sanityBuffer,2240, PRIME32, expected);       /* two blocks, ends at stripe boundary */
+        XSUM_testXXH128(sanityBuffer,2240, PRIME32, expected);       /* two blocks, ends at stripe boundary */
     }
     {   XXH128_hash_t const expected = { 0xCB37AEB9E5D361EDULL, 0xE89C0F6FF369B427ULL };
-        BMK_testXXH128(sanityBuffer,2367, 0,       expected);       /* two blocks, last stripe is overlapping */
+        XSUM_testXXH128(sanityBuffer,2367, 0,       expected);       /* two blocks, last stripe is overlapping */
     }
     {   XXH128_hash_t const expected = { 0x6F5360AE69C2F406ULL, 0xD23AAE4B76C31ECBULL };
-        BMK_testXXH128(sanityBuffer,2367, PRIME32, expected);       /* two blocks, last stripe is overlapping */
+        XSUM_testXXH128(sanityBuffer,2367, PRIME32, expected);       /* two blocks, last stripe is overlapping */
     }
 
     /* XXH128 with custom Secret */
@@ -1378,38 +1366,38 @@ static void BMK_sanityCheck(void)
         assert(sizeof(sanityBuffer) >= 7 + secretSize);
 
         {   XXH128_hash_t const expected = { 0x005923CCEECBE8AEULL, 0x5F70F4EA232F1D38ULL };
-            BMK_testXXH128_withSecret(NULL,           0, secret, secretSize,     expected);         /* empty string */
+            XSUM_testXXH128_withSecret(NULL,           0, secret, secretSize,     expected);         /* empty string */
         }
         {   XXH128_hash_t const expected = { 0x8A52451418B2DA4DULL, 0x3A66AF5A9819198EULL };
-            BMK_testXXH128_withSecret(sanityBuffer,   1, secret, secretSize,       expected);       /* 1-3 */
+            XSUM_testXXH128_withSecret(sanityBuffer,   1, secret, secretSize,       expected);       /* 1-3 */
         }
         {   XXH128_hash_t const expected = { 0x0B61C8ACA7D4778FULL, 0x376BD91B6432F36DULL };
-            BMK_testXXH128_withSecret(sanityBuffer,   6, secret, secretSize,       expected);       /* 4-8 */
+            XSUM_testXXH128_withSecret(sanityBuffer,   6, secret, secretSize,       expected);       /* 4-8 */
         }
         {   XXH128_hash_t const expected = { 0xAF82F6EBA263D7D8ULL, 0x90A3C2D839F57D0FULL };
-            BMK_testXXH128_withSecret(sanityBuffer,  12, secret, secretSize,       expected);       /* 9-16 */
+            XSUM_testXXH128_withSecret(sanityBuffer,  12, secret, secretSize,       expected);       /* 9-16 */
         }
     }
 
     /* secret generator */
     {   verifSample_t const expected = { { 0xB8, 0x26, 0x83, 0x7E } };
-        BMK_testSecretGenerator(NULL, 0, expected);
+        XSUM_testSecretGenerator(NULL, 0, expected);
     }
 
     {   verifSample_t const expected = { { 0xA6, 0x16, 0x06, 0x7B } };
-        BMK_testSecretGenerator(sanityBuffer, 1, expected);
+        XSUM_testSecretGenerator(sanityBuffer, 1, expected);
     }
 
     {   verifSample_t const expected = { { 0xDA, 0x2A, 0x12, 0x11 } };
-        BMK_testSecretGenerator(sanityBuffer, XXH3_SECRET_SIZE_MIN - 1, expected);
+        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_SIZE_MIN - 1, expected);
     }
 
     {   verifSample_t const expected = { { 0x7E, 0x48, 0x0C, 0xA7 } };
-        BMK_testSecretGenerator(sanityBuffer, XXH3_SECRET_DEFAULT_SIZE + 500, expected);
+        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_DEFAULT_SIZE + 500, expected);
     }
 
-    DISPLAYLEVEL(3, "\r%70s\r", "");       /* Clean display line */
-    DISPLAYLEVEL(3, "Sanity check -- all tests ok\n");
+    XSUM_logVerbose(3, "\r%70s\r", "");       /* Clean display line */
+    XSUM_logVerbose(3, "Sanity check -- all tests ok\n");
 }
 
 
@@ -1440,20 +1428,20 @@ int XSUM_isDirectory(const char* infilename)
 }
 
 /* for support of --little-endian display mode */
-static void BMK_display_LittleEndian(const void* ptr, size_t length)
+static void XSUM_display_LittleEndian(const void* ptr, size_t length)
 {
     const U8* const p = (const U8*)ptr;
     size_t idx;
     for (idx=length-1; idx<length; idx--)    /* intentional underflow to negative to detect end */
-        DISPLAYRESULT("%02x", p[idx]);
+        XSUM_output("%02x", p[idx]);
 }
 
-static void BMK_display_BigEndian(const void* ptr, size_t length)
+static void XSUM_display_BigEndian(const void* ptr, size_t length)
 {
     const U8* const p = (const U8*)ptr;
     size_t idx;
     for (idx=0; idx<length; idx++)
-        DISPLAYRESULT("%02x", p[idx]);
+        XSUM_output("%02x", p[idx]);
 }
 
 typedef union {
@@ -1500,7 +1488,7 @@ XSUM_hashStream(FILE* inFile,
             }
         }
         if (ferror(inFile)) {
-            DISPLAY("Error: a failure occurred reading the input file.\n");
+            XSUM_log("Error: a failure occurred reading the input file.\n");
             exit(1);
     }   }
 
@@ -1540,19 +1528,19 @@ static void XSUM_printLine_BSD_internal(const char* filename,
     assert(0 <= hashType && hashType <= XSUM_TABLE_ELT_SIZE(XSUM_algoName));
     {   const char* const typeString = algoString[hashType];
         const size_t hashLength = XSUM_algoLength[hashType];
-        DISPLAYRESULT("%s (%s) = ", typeString, filename);
+        XSUM_output("%s (%s) = ", typeString, filename);
         f_displayHash(canonicalHash, hashLength);
-        DISPLAYRESULT("\n");
+        XSUM_output("\n");
 }   }
 
 static void XSUM_printLine_BSD_LE(const char* filename, const void* canonicalHash, const AlgoSelected hashType)
 {
-    XSUM_printLine_BSD_internal(filename, canonicalHash, hashType, XSUM_algoLE_name, BMK_display_LittleEndian);
+    XSUM_printLine_BSD_internal(filename, canonicalHash, hashType, XSUM_algoLE_name, XSUM_display_LittleEndian);
 }
 
 static void XSUM_printLine_BSD(const char* filename, const void* canonicalHash, const AlgoSelected hashType)
 {
-    XSUM_printLine_BSD_internal(filename, canonicalHash, hashType, XSUM_algoName, BMK_display_BigEndian);
+    XSUM_printLine_BSD_internal(filename, canonicalHash, hashType, XSUM_algoName, XSUM_display_BigEndian);
 }
 
 static void XSUM_printLine_GNU_internal(const char* filename,
@@ -1562,19 +1550,19 @@ static void XSUM_printLine_GNU_internal(const char* filename,
     assert(0 <= hashType && hashType <= XSUM_TABLE_ELT_SIZE(XSUM_algoName));
     {   const size_t hashLength = XSUM_algoLength[hashType];
         f_displayHash(canonicalHash, hashLength);
-        DISPLAYRESULT("  %s\n", filename);
+        XSUM_output("  %s\n", filename);
 }   }
 
 static void XSUM_printLine_GNU(const char* filename,
                                const void* canonicalHash, const AlgoSelected hashType)
 {
-    XSUM_printLine_GNU_internal(filename, canonicalHash, hashType, BMK_display_BigEndian);
+    XSUM_printLine_GNU_internal(filename, canonicalHash, hashType, XSUM_display_BigEndian);
 }
 
 static void XSUM_printLine_GNU_LE(const char* filename,
                                   const void* canonicalHash, const AlgoSelected hashType)
 {
-    XSUM_printLine_GNU_internal(filename, canonicalHash, hashType, BMK_display_LittleEndian);
+    XSUM_printLine_GNU_internal(filename, canonicalHash, hashType, XSUM_display_LittleEndian);
 }
 
 typedef enum { big_endian, little_endian} Display_endianess;
@@ -1607,19 +1595,19 @@ static int XSUM_hashFile(const char* fileName,
         SET_BINARY_MODE(stdin);
     } else {
         if (XSUM_isDirectory(fileName)) {
-            DISPLAY("xxhsum: %s: Is a directory \n", fileName);
+            XSUM_log("xxhsum: %s: Is a directory \n", fileName);
             return 1;
         }
-        inFile = XXH_fopen( fileName, "rb" );
+        inFile = XSUM_fopen( fileName, "rb" );
         if (inFile==NULL) {
-            DISPLAY("Error: Could not open '%s': %s. \n", fileName, strerror(errno));
+            XSUM_log("Error: Could not open '%s': %s. \n", fileName, strerror(errno));
             return 1;
     }   }
 
     /* Memory allocation & streaming */
     {   void* const buffer = malloc(blockSize);
         if (buffer == NULL) {
-            DISPLAY("\nError: Out of memory.\n");
+            XSUM_log("\nError: Out of memory.\n");
             fclose(inFile);
             return 1;
         }
@@ -1677,7 +1665,7 @@ static int XSUM_hashFiles(const char*const * fnList, int fnTotal,
 
     for (fnNb=0; fnNb<fnTotal; fnNb++)
         result |= XSUM_hashFile(fnList[fnNb], hashType, displayEndianess, convention);
-    DISPLAYLEVEL(2, "\r%70s\r", "");
+    XSUM_logVerbose(2, "\r%70s\r", "");
     return result;
 }
 
@@ -1748,7 +1736,7 @@ typedef struct {
  * Returns GetLine_exceedMaxLineLength, if line length is longer than MAX_LINE_LENGTH.
  * Returns GetLine_outOfMemory, if line buffer memory allocation failed.
  */
-static GetLineResult getLine(char** lineBuf, int* lineMax, FILE* inFile)
+static GetLineResult XSUM_getLine(char** lineBuf, int* lineMax, FILE* inFile)
 {
     GetLineResult result = GetLine_ok;
     size_t len = 0;
@@ -1822,10 +1810,10 @@ static int charToHex(char c)
  * Returns CanonicalFromString_invalidFormat if hashStr is not well formatted.
  * Returns CanonicalFromString_ok if hashStr is parsed successfully.
  */
-static CanonicalFromStringResult canonicalFromString(unsigned char* dst,
-                                                     size_t dstSize,
-                                                     const char* hashStr,
-                                                     int reverseBytes)
+static CanonicalFromStringResult XSUM_canonicalFromString(unsigned char* dst,
+                                                          size_t dstSize,
+                                                          const char* hashStr,
+                                                          int reverseBytes)
 {
     size_t i;
     for (i = 0; i < dstSize; ++i) {
@@ -1848,7 +1836,7 @@ static CanonicalFromStringResult canonicalFromString(unsigned char* dst,
  * Parse single line of xxHash checksum file.
  * Returns ParseLine_invalidFormat if the line is not well formatted.
  * Returns ParseLine_ok if the line is parsed successfully.
- * And members of parseLine will be filled by parsed values.
+ * And members of XSUM_parseLine will be filled by parsed values.
  *
  *  - line must be terminated with '\0' without a trailing newline.
  *  - Since parsedLine.filename will point within given argument `line`,
@@ -1863,7 +1851,7 @@ static CanonicalFromStringResult canonicalFromString(unsigned char* dst,
  *
  *      <algorithm> <' ('> <filename> <') = '> <hexstring> <'\0'>
  */
-static ParseLineResult parseLine(ParsedLine* parsedLine, char* line, int rev)
+static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int rev)
 {
     char* const firstSpace = strchr(line, ' ');
     const char* hash_ptr;
@@ -1895,7 +1883,7 @@ static ParseLineResult parseLine(ParsedLine* parsedLine, char* line, int rev)
     {
     case 8:
         {   XXH32_canonical_t* xxh32c = &parsedLine->canonical.xxh32;
-            if (canonicalFromString(xxh32c->digest, sizeof(xxh32c->digest), hash_ptr, rev)
+            if (XSUM_canonicalFromString(xxh32c->digest, sizeof(xxh32c->digest), hash_ptr, rev)
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
@@ -1905,7 +1893,7 @@ static ParseLineResult parseLine(ParsedLine* parsedLine, char* line, int rev)
 
     case 16:
         {   XXH64_canonical_t* xxh64c = &parsedLine->canonical.xxh64;
-            if (canonicalFromString(xxh64c->digest, sizeof(xxh64c->digest), hash_ptr, rev)
+            if (XSUM_canonicalFromString(xxh64c->digest, sizeof(xxh64c->digest), hash_ptr, rev)
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
@@ -1915,7 +1903,7 @@ static ParseLineResult parseLine(ParsedLine* parsedLine, char* line, int rev)
 
     case 32:
         {   XXH128_canonical_t* xxh128c = &parsedLine->canonical.xxh128;
-            if (canonicalFromString(xxh128c->digest, sizeof(xxh128c->digest), hash_ptr, rev)
+            if (XSUM_canonicalFromString(xxh128c->digest, sizeof(xxh128c->digest), hash_ptr, rev)
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
@@ -1938,10 +1926,10 @@ static ParseLineResult parseLine(ParsedLine* parsedLine, char* line, int rev)
 /*!
  * Parse xxHash checksum file.
  */
-static void parseFile1(ParseFileArg* parseFileArg, int rev)
+static void XSUM_parseFile1(ParseFileArg* XSUM_parseFileArg, int rev)
 {
-    const char* const inFileName = parseFileArg->inFileName;
-    ParseFileReport* const report = &parseFileArg->report;
+    const char* const inFileName = XSUM_parseFileArg->inFileName;
+    ParseFileReport* const report = &XSUM_parseFileArg->report;
 
     unsigned long lineNumber = 0;
     memset(report, 0, sizeof(*report));
@@ -1954,46 +1942,46 @@ static void parseFile1(ParseFileArg* parseFileArg, int rev)
         lineNumber++;
         if (lineNumber == 0) {
             /* This is unlikely happen, but md5sum.c has this error check. */
-            DISPLAY("%s: Error: Too many checksum lines\n", inFileName);
+            XSUM_log("%s: Error: Too many checksum lines\n", inFileName);
             report->quit = 1;
             break;
         }
 
-        {   GetLineResult const getLineResult = getLine(&parseFileArg->lineBuf,
-                                                        &parseFileArg->lineMax,
-                                                         parseFileArg->inFile);
-            if (getLineResult != GetLine_ok) {
-                if (getLineResult == GetLine_eof) break;
+        {   GetLineResult const XSUM_getLineResult = XSUM_getLine(&XSUM_parseFileArg->lineBuf,
+                                                        &XSUM_parseFileArg->lineMax,
+                                                         XSUM_parseFileArg->inFile);
+            if (XSUM_getLineResult != GetLine_ok) {
+                if (XSUM_getLineResult == GetLine_eof) break;
 
-                switch (getLineResult)
+                switch (XSUM_getLineResult)
                 {
                 case GetLine_ok:
                 case GetLine_eof:
-                    /* These cases never happen.  See above getLineResult related "if"s.
+                    /* These cases never happen.  See above XSUM_getLineResult related "if"s.
                        They exist just for make gcc's -Wswitch-enum happy. */
                     assert(0);
                     break;
 
                 default:
-                    DISPLAY("%s:%lu: Error: Unknown error.\n", inFileName, lineNumber);
+                    XSUM_log("%s:%lu: Error: Unknown error.\n", inFileName, lineNumber);
                     break;
 
                 case GetLine_exceedMaxLineLength:
-                    DISPLAY("%s:%lu: Error: Line too long.\n", inFileName, lineNumber);
+                    XSUM_log("%s:%lu: Error: Line too long.\n", inFileName, lineNumber);
                     break;
 
                 case GetLine_outOfMemory:
-                    DISPLAY("%s:%lu: Error: Out of memory.\n", inFileName, lineNumber);
+                    XSUM_log("%s:%lu: Error: Out of memory.\n", inFileName, lineNumber);
                     break;
                 }
                 report->quit = 1;
                 break;
         }   }
 
-        if (parseLine(&parsedLine, parseFileArg->lineBuf, rev) != ParseLine_ok) {
+        if (XSUM_parseLine(&parsedLine, XSUM_parseFileArg->lineBuf, rev) != ParseLine_ok) {
             report->nImproperlyFormattedLines++;
-            if (parseFileArg->warn) {
-                DISPLAY("%s:%lu: Error: Improperly formatted checksum line.\n",
+            if (XSUM_parseFileArg->warn) {
+                XSUM_log("%s:%lu: Error: Improperly formatted checksum line.\n",
                         inFileName, lineNumber);
             }
             continue;
@@ -2002,7 +1990,7 @@ static void parseFile1(ParseFileArg* parseFileArg, int rev)
         report->nProperlyFormattedLines++;
 
         do {
-            FILE* const fp = XXH_fopen(parsedLine.filename, "rb");
+            FILE* const fp = XSUM_fopen(parsedLine.filename, "rb");
             if (fp == NULL) {
                 lineStatus = LineStatus_failedToOpen;
                 break;
@@ -2011,21 +1999,21 @@ static void parseFile1(ParseFileArg* parseFileArg, int rev)
             switch (parsedLine.xxhBits)
             {
             case 32:
-                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh32, parseFileArg->blockBuf, parseFileArg->blockSize);
+                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh32, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
                     if (xxh.xxh32 == XXH32_hashFromCanonical(&parsedLine.canonical.xxh32)) {
                         lineStatus = LineStatus_hashOk;
                 }   }
                 break;
 
             case 64:
-                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh64, parseFileArg->blockBuf, parseFileArg->blockSize);
+                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh64, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
                     if (xxh.xxh64 == XXH64_hashFromCanonical(&parsedLine.canonical.xxh64)) {
                         lineStatus = LineStatus_hashOk;
                 }   }
                 break;
 
             case 128:
-                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh128, parseFileArg->blockBuf, parseFileArg->blockSize);
+                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh128, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
                     if (XXH128_isEqual(xxh.xxh128, XXH128_hashFromCanonical(&parsedLine.canonical.xxh128))) {
                         lineStatus = LineStatus_hashOk;
                 }   }
@@ -2040,14 +2028,14 @@ static void parseFile1(ParseFileArg* parseFileArg, int rev)
         switch (lineStatus)
         {
         default:
-            DISPLAY("%s: Error: Unknown error.\n", inFileName);
+            XSUM_log("%s: Error: Unknown error.\n", inFileName);
             report->quit = 1;
             break;
 
         case LineStatus_failedToOpen:
             report->nOpenOrReadFailures++;
-            if (!parseFileArg->statusOnly) {
-                DISPLAYRESULT("%s:%lu: Could not open or read '%s': %s.\n",
+            if (!XSUM_parseFileArg->statusOnly) {
+                XSUM_output("%s:%lu: Could not open or read '%s': %s.\n",
                     inFileName, lineNumber, parsedLine.filename, strerror(errno));
             }
             break;
@@ -2057,13 +2045,13 @@ static void parseFile1(ParseFileArg* parseFileArg, int rev)
             {   int b = 1;
                 if (lineStatus == LineStatus_hashOk) {
                     /* If --quiet is specified, don't display "OK" */
-                    if (parseFileArg->quiet) b = 0;
+                    if (XSUM_parseFileArg->quiet) b = 0;
                 } else {
                     report->nMismatchedChecksums++;
                 }
 
-                if (b && !parseFileArg->statusOnly) {
-                    DISPLAYRESULT("%s: %s\n", parsedLine.filename
+                if (b && !XSUM_parseFileArg->statusOnly) {
+                    XSUM_output("%s: %s\n", parsedLine.filename
                         , lineStatus == LineStatus_hashOk ? "OK" : "FAILED");
             }   }
             break;
@@ -2087,18 +2075,18 @@ static void parseFile1(ParseFileArg* parseFileArg, int rev)
  *    - All hash values match with its content.
  *    - (strict mode) All lines in checksum file are consistent and well formatted.
  */
-static int checkFile(const char* inFileName,
-                     const Display_endianess displayEndianess,
-                     U32 strictMode,
-                     U32 statusOnly,
-                     U32 warn,
-                     U32 quiet)
+static int XSUM_checkFile(const char* inFileName,
+                          const Display_endianess displayEndianess,
+                          U32 strictMode,
+                          U32 statusOnly,
+                          U32 warn,
+                          U32 quiet)
 {
     int result = 0;
     FILE* inFile = NULL;
-    ParseFileArg parseFileArgBody;
-    ParseFileArg* const parseFileArg = &parseFileArgBody;
-    ParseFileReport* const report = &parseFileArg->report;
+    ParseFileArg XSUM_parseFileArgBody;
+    ParseFileArg* const XSUM_parseFileArg = &XSUM_parseFileArgBody;
+    ParseFileReport* const report = &XSUM_parseFileArg->report;
 
     /* note: stdinName is special constant pointer.  It is not a string. */
     if (inFileName == stdinName) {
@@ -2109,54 +2097,54 @@ static int checkFile(const char* inFileName,
         inFileName = "stdin";
         inFile = stdin;
     } else {
-        inFile = XXH_fopen( inFileName, "rt" );
+        inFile = XSUM_fopen( inFileName, "rt" );
     }
 
     if (inFile == NULL) {
-        DISPLAY("Error: Could not open '%s': %s\n", inFileName, strerror(errno));
+        XSUM_log("Error: Could not open '%s': %s\n", inFileName, strerror(errno));
         return 0;
     }
 
-    parseFileArg->inFileName  = inFileName;
-    parseFileArg->inFile      = inFile;
-    parseFileArg->lineMax     = DEFAULT_LINE_LENGTH;
-    parseFileArg->lineBuf     = (char*) malloc((size_t)parseFileArg->lineMax);
-    parseFileArg->blockSize   = 64 * 1024;
-    parseFileArg->blockBuf    = (char*) malloc(parseFileArg->blockSize);
-    parseFileArg->strictMode  = strictMode;
-    parseFileArg->statusOnly  = statusOnly;
-    parseFileArg->warn        = warn;
-    parseFileArg->quiet       = quiet;
-
-    if ( (parseFileArg->lineBuf == NULL)
-      || (parseFileArg->blockBuf == NULL) ) {
-        DISPLAY("Error: : memory allocation failed \n");
+    XSUM_parseFileArg->inFileName  = inFileName;
+    XSUM_parseFileArg->inFile      = inFile;
+    XSUM_parseFileArg->lineMax     = DEFAULT_LINE_LENGTH;
+    XSUM_parseFileArg->lineBuf     = (char*) malloc((size_t)XSUM_parseFileArg->lineMax);
+    XSUM_parseFileArg->blockSize   = 64 * 1024;
+    XSUM_parseFileArg->blockBuf    = (char*) malloc(XSUM_parseFileArg->blockSize);
+    XSUM_parseFileArg->strictMode  = strictMode;
+    XSUM_parseFileArg->statusOnly  = statusOnly;
+    XSUM_parseFileArg->warn        = warn;
+    XSUM_parseFileArg->quiet       = quiet;
+
+    if ( (XSUM_parseFileArg->lineBuf == NULL)
+      || (XSUM_parseFileArg->blockBuf == NULL) ) {
+        XSUM_log("Error: : memory allocation failed \n");
         exit(1);
     }
-    parseFile1(parseFileArg, displayEndianess != big_endian);
+    XSUM_parseFile1(XSUM_parseFileArg, displayEndianess != big_endian);
 
-    free(parseFileArg->blockBuf);
-    free(parseFileArg->lineBuf);
+    free(XSUM_parseFileArg->blockBuf);
+    free(XSUM_parseFileArg->lineBuf);
 
     if (inFile != stdin) fclose(inFile);
 
     /* Show error/warning messages.  All messages are copied from md5sum.c
      */
     if (report->nProperlyFormattedLines == 0) {
-        DISPLAY("%s: no properly formatted xxHash checksum lines found\n", inFileName);
+        XSUM_log("%s: no properly formatted xxHash checksum lines found\n", inFileName);
     } else if (!statusOnly) {
         if (report->nImproperlyFormattedLines) {
-            DISPLAYRESULT("%lu %s improperly formatted\n"
+            XSUM_output("%lu %s improperly formatted\n"
                 , report->nImproperlyFormattedLines
                 , report->nImproperlyFormattedLines == 1 ? "line is" : "lines are");
         }
         if (report->nOpenOrReadFailures) {
-            DISPLAYRESULT("%lu listed %s could not be read\n"
+            XSUM_output("%lu listed %s could not be read\n"
                 , report->nOpenOrReadFailures
                 , report->nOpenOrReadFailures == 1 ? "file" : "files");
         }
         if (report->nMismatchedChecksums) {
-            DISPLAYRESULT("%lu computed %s did NOT match\n"
+            XSUM_output("%lu computed %s did NOT match\n"
                 , report->nMismatchedChecksums
                 , report->nMismatchedChecksums == 1 ? "checksum" : "checksums");
     }   }
@@ -2172,23 +2160,23 @@ static int checkFile(const char* inFileName,
 }
 
 
-static int checkFiles(const char*const* fnList, int fnTotal,
-                      const Display_endianess displayEndianess,
-                      U32 strictMode,
-                      U32 statusOnly,
-                      U32 warn,
-                      U32 quiet)
+static int XSUM_checkFiles(const char*const* fnList, int fnTotal,
+                           const Display_endianess displayEndianess,
+                           U32 strictMode,
+                           U32 statusOnly,
+                           U32 warn,
+                           U32 quiet)
 {
     int ok = 1;
 
     /* Special case for stdinName "-",
      * note: stdinName is not a string.  It's special pointer. */
     if (fnTotal==0) {
-        ok &= checkFile(stdinName, displayEndianess, strictMode, statusOnly, warn, quiet);
+        ok &= XSUM_checkFile(stdinName, displayEndianess, strictMode, statusOnly, warn, quiet);
     } else {
         int fnNb;
         for (fnNb=0; fnNb<fnTotal; fnNb++)
-            ok &= checkFile(fnList[fnNb], displayEndianess, strictMode, statusOnly, warn, quiet);
+            ok &= XSUM_checkFile(fnList[fnNb], displayEndianess, strictMode, statusOnly, warn, quiet);
     }
     return ok ? 0 : 1;
 }
@@ -2198,53 +2186,54 @@ static int checkFiles(const char*const* fnList, int fnTotal,
 *  Main
 **********************************************************/
 
-static int usage(const char* exename)
+static int XSUM_usage(const char* exename)
 {
-    DISPLAY( WELCOME_MESSAGE(exename) );
-    DISPLAY( "Print or verify checksums using fast non-cryptographic algorithm xxHash \n\n" );
-    DISPLAY( "Usage: %s [options] [files] \n\n", exename);
-    DISPLAY( "When no filename provided or when '-' is provided, uses stdin as input. \n");
-    DISPLAY( "Options: \n");
-    DISPLAY( "  -H#         algorithm selection: 0,1,2 or 32,64,128 (default: %i) \n", (int)g_defaultAlgo);
-    DISPLAY( "  -c, --check read xxHash checksum from [files] and check them \n");
-    DISPLAY( "  -h, --help  display a long help page about advanced options \n");
+    XSUM_log( WELCOME_MESSAGE(exename) );
+    XSUM_log( "Print or verify checksums using fast non-cryptographic algorithm xxHash \n\n" );
+    XSUM_log( "Usage: %s [options] [files] \n\n", exename);
+    XSUM_log( "When no filename provided or when '-' is provided, uses stdin as input. \n");
+    XSUM_log( "Options: \n");
+    XSUM_log( "  -H#         algorithm selection: 0,1,2 or 32,64,128 (default: %i) \n", (int)g_defaultAlgo);
+    XSUM_log( "  -c, --check read xxHash checksum from [files] and check them \n");
+    XSUM_log( "  -h, --help  display a long help page about advanced options \n");
     return 0;
 }
 
 
-static int usage_advanced(const char* exename)
-{
-    usage(exename);
-    DISPLAY( "Advanced :\n");
-    DISPLAY( "  -V, --version        Display version information \n");
-    DISPLAY( "      --tag            Produce BSD-style checksum lines \n");
-    DISPLAY( "      --little-endian  Checksum values use little endian convention (default: big endian) \n");
-    DISPLAY( "  -b                   Run benchmark \n");
-    DISPLAY( "  -b#                  Bench only algorithm variant # \n");
-    DISPLAY( "  -i#                  Number of times to run the benchmark (default: %u) \n", (unsigned)g_nbIterations);
-    DISPLAY( "  -q, --quiet          Don't display version header in benchmark mode \n");
-    DISPLAY( "\n");
-    DISPLAY( "The following four options are useful only when verifying checksums (-c): \n");
-    DISPLAY( "  -q, --quiet          Don't print OK for each successfully verified file \n");
-    DISPLAY( "      --status         Don't output anything, status code shows success \n");
-    DISPLAY( "      --strict         Exit non-zero for improperly formatted checksum lines \n");
-    DISPLAY( "      --warn           Warn about improperly formatted checksum lines \n");
+static int XSUM_usage_advanced(const char* exename)
+{
+    XSUM_usage(exename);
+    XSUM_log( "Advanced :\n");
+    XSUM_log( "  -V, --version        Display version information \n");
+    XSUM_log( "      --tag            Produce BSD-style checksum lines \n");
+    XSUM_log( "      --little-endian  Checksum values use little endian convention (default: big endian) \n");
+    XSUM_log( "  -b                   Run benchmark \n");
+    XSUM_log( "  -b#                  Bench only algorithm variant # \n");
+    XSUM_log( "  -i#                  Number of times to run the benchmark (default: %u) \n", (unsigned)g_nbIterations);
+    XSUM_log( "  -q, --quiet          Don't display version header in benchmark mode \n");
+    XSUM_log( "\n");
+    XSUM_log( "The following four options are useful only when verifying checksums (-c): \n");
+    XSUM_log( "  -q, --quiet          Don't print OK for each successfully verified file \n");
+    XSUM_log( "      --status         Don't output anything, status code shows success \n");
+    XSUM_log( "      --strict         Exit non-zero for improperly formatted checksum lines \n");
+    XSUM_log( "      --warn           Warn about improperly formatted checksum lines \n");
     return 0;
 }
 
-static int badusage(const char* exename)
+static int XSUM_badusage(const char* exename)
 {
-    DISPLAY("Wrong parameters\n\n");
-    usage(exename);
+    XSUM_log("Wrong parameters\n\n");
+    XSUM_usage(exename);
     return 1;
 }
 
 static void errorOut(const char* msg)
 {
-    DISPLAY("%s \n", msg); exit(1);
+    XSUM_log("%s \n", msg);
+    exit(1);
 }
 
-static const char* lastNameFromPath(const char* path)
+static const char* XSUM_lastNameFromPath(const char* path)
 {
     const char* name = path;
     if (strrchr(name, '/')) name = strrchr(name, '/') + 1;
@@ -2253,13 +2242,13 @@ static const char* lastNameFromPath(const char* path)
 }
 
 /*!
- * readU32FromCharChecked():
+ * XSUM_readU32FromCharChecked():
  * @return 0 if success, and store the result in *value.
  * Allows and interprets K, KB, KiB, M, MB and MiB suffix.
  * Will also modify `*stringPtr`, advancing it to position where it stopped reading.
  * @return 1 if an overflow error occurs
  */
-static int readU32FromCharChecked(const char** stringPtr, U32* value)
+static int XSUM_readU32FromCharChecked(const char** stringPtr, U32* value)
 {
     static const U32 max = (((U32)(-1)) / 10) - 1;
     U32 result = 0;
@@ -2286,25 +2275,25 @@ static int readU32FromCharChecked(const char** stringPtr, U32* value)
 }
 
 /*!
- * readU32FromChar():
+ * XSUM_readU32FromChar():
  * @return: unsigned integer value read from input in `char` format.
  *  allows and interprets K, KB, KiB, M, MB and MiB suffix.
  *  Will also modify `*stringPtr`, advancing it to position where it stopped reading.
  *  Note: function will exit() program if digit sequence overflows
  */
-static U32 readU32FromChar(const char** stringPtr) {
+static U32 XSUM_readU32FromChar(const char** stringPtr) {
     U32 result;
-    if (readU32FromCharChecked(stringPtr, &result)) {
+    if (XSUM_readU32FromCharChecked(stringPtr, &result)) {
         static const char errorMsg[] = "Error: numeric value too large";
         errorOut(errorMsg);
     }
     return result;
 }
 
-static int XXH_main(int argc, const char* const* argv)
+static int XSUM_main(int argc, const char* const* argv)
 {
     int i, filenamesStart = 0;
-    const char* const exename = lastNameFromPath(argv[0]);
+    const char* const exename = XSUM_lastNameFromPath(argv[0]);
     U32 benchmarkMode = 0;
     U32 fileCheckMode = 0;
     U32 strictMode    = 0;
@@ -2313,7 +2302,7 @@ static int XXH_main(int argc, const char* const* argv)
     int explicitStdin = 0;
     U32 selectBenchIDs= 0;  /* 0 == use default k_testIDs_default, kBenchAll == bench all */
     static const U32 kBenchAll = 99;
-    size_t keySize    = XXH_DEFAULT_SAMPLE_SIZE;
+    size_t keySize    = XSUM_DEFAULT_SAMPLE_SIZE;
     AlgoSelected algo     = g_defaultAlgo;
     Display_endianess displayEndianess = big_endian;
     Display_convention convention = display_gnu;
@@ -2335,8 +2324,8 @@ static int XXH_main(int argc, const char* const* argv)
         if (!strcmp(argument, "--strict")) { strictMode = 1; continue; }
         if (!strcmp(argument, "--status")) { statusOnly = 1; continue; }
         if (!strcmp(argument, "--warn")) { warn = 1; continue; }
-        if (!strcmp(argument, "--help")) { return usage_advanced(exename); }
-        if (!strcmp(argument, "--version")) { DISPLAY(FULL_WELCOME_MESSAGE(exename)); BMK_sanityCheck(); return 0; }
+        if (!strcmp(argument, "--help")) { return XSUM_usage_advanced(exename); }
+        if (!strcmp(argument, "--version")) { XSUM_log(FULL_WELCOME_MESSAGE(exename)); XSUM_sanityCheck(); return 0; }
         if (!strcmp(argument, "--tag")) { convention = display_bsd; continue; }
 
         if (!strcmp(argument, "--")) {
@@ -2357,15 +2346,15 @@ static int XXH_main(int argc, const char* const* argv)
             {
             /* Display version */
             case 'V':
-                DISPLAY(FULL_WELCOME_MESSAGE(exename)); return 0;
+                XSUM_log(FULL_WELCOME_MESSAGE(exename)); return 0;
 
-            /* Display help on usage */
+            /* Display help on XSUM_usage */
             case 'h':
-                return usage_advanced(exename);
+                return XSUM_usage_advanced(exename);
 
             /* select hash algorithm */
             case 'H': argument++;
-                switch(readU32FromChar(&argument)) {
+                switch(XSUM_readU32FromChar(&argument)) {
                     case 0 :
                     case 32: algo = algo_xxh32; break;
                     case 1 :
@@ -2373,7 +2362,7 @@ static int XXH_main(int argc, const char* const* argv)
                     case 2 :
                     case 128: algo = algo_xxh128; break;
                     default:
-                        return badusage(exename);
+                        return XSUM_badusage(exename);
                 }
                 break;
 
@@ -2395,7 +2384,7 @@ static int XXH_main(int argc, const char* const* argv)
                 benchmarkMode = 1;
                 do {
                     if (*argument == ',') argument++;
-                    selectBenchIDs = readU32FromChar(&argument); /* select one specific test */
+                    selectBenchIDs = XSUM_readU32FromChar(&argument); /* select one specific test */
                     if (selectBenchIDs < NB_TESTFUNC) {
                         g_testIDs[selectBenchIDs] = 1;
                     } else
@@ -2406,13 +2395,13 @@ static int XXH_main(int argc, const char* const* argv)
             /* Modify Nb Iterations (benchmark only) */
             case 'i':
                 argument++;
-                g_nbIterations = readU32FromChar(&argument);
+                g_nbIterations = XSUM_readU32FromChar(&argument);
                 break;
 
             /* Modify Block size (benchmark only) */
             case 'B':
                 argument++;
-                keySize = readU32FromChar(&argument);
+                keySize = XSUM_readU32FromChar(&argument);
                 break;
 
             /* Modify verbosity of benchmark output (hidden option) */
@@ -2422,28 +2411,28 @@ static int XXH_main(int argc, const char* const* argv)
                 break;
 
             default:
-                return badusage(exename);
+                return XSUM_badusage(exename);
             }
         }
     }   /* for(i=1; i<argc; i++) */
 
     /* Check benchmark mode */
     if (benchmarkMode) {
-        DISPLAYLEVEL(2, FULL_WELCOME_MESSAGE(exename) );
-        BMK_sanityCheck();
+        XSUM_logVerbose(2, FULL_WELCOME_MESSAGE(exename) );
+        XSUM_sanityCheck();
         if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, sizeof(g_testIDs));
         if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, sizeof(g_testIDs));
-        if (filenamesStart==0) return BMK_benchInternal(keySize);
-        return BMK_benchFiles(argv+filenamesStart, argc-filenamesStart);
+        if (filenamesStart==0) return XSUM_benchInternal(keySize);
+        return XSUM_benchFiles(argv+filenamesStart, argc-filenamesStart);
     }
 
     /* Check if input is defined as console; trigger an error in this case */
-    if ( (filenamesStart==0) && IS_CONSOLE(stdin) && !explicitStdin)
-        return badusage(exename);
+    if ( (filenamesStart==0) && XSUM_isConsole(stdin) && !explicitStdin)
+        return XSUM_badusage(exename);
 
     if (filenamesStart==0) filenamesStart = argc;
     if (fileCheckMode) {
-        return checkFiles(argv+filenamesStart, argc-filenamesStart,
+        return XSUM_checkFiles(argv+filenamesStart, argc-filenamesStart,
                           displayEndianess, strictMode, statusOnly, warn, (g_displayLevel < 2) /*quiet*/);
     } else {
         return XSUM_hashFiles(argv+filenamesStart, argc-filenamesStart, algo, displayEndianess, convention);
@@ -2453,20 +2442,20 @@ static int XXH_main(int argc, const char* const* argv)
 /* Windows main wrapper which properly handles UTF-8 command line arguments. */
 #ifdef _WIN32
 /* Converts a UTF-16 argv to UTF-8. */
-static char** convert_argv(int argc, const wchar_t* const utf16_argv[])
+static char** XSUM_convertArgv(int argc, const wchar_t* const utf16_argv[])
 {
     char** const utf8_argv = (char**)malloc((size_t)(argc + 1) * sizeof(char*));
     if (utf8_argv != NULL) {
         int i;
         for (i = 0; i < argc; i++) {
-            utf8_argv[i] = utf16_to_utf8(utf16_argv[i]);
+            utf8_argv[i] = XSUM_narrowString(utf16_argv[i], NULL);
         }
         utf8_argv[argc] = NULL;
     }
     return utf8_argv;
 }
-/* Frees arguments returned by convert_argv */
-static void free_argv(int argc, char** argv)
+/* Frees arguments returned by XSUM_convertArgv */
+static void freeargv(int argc, char** argv)
 {
     int i;
     if (argv == NULL) {
@@ -2491,10 +2480,10 @@ static void free_argv(int argc, char** argv)
  * This function is wrapped by `__wgetmainargs()` and `main()` below on MinGW
  * with Unicode disabled, but if possible, we try to use `wmain()`.
  */
-static int XXH_wmain(int argc, const wchar_t* const utf16_argv[])
+static int XSUM_wmain(int argc, const wchar_t* const utf16_argv[])
 {
     /* Convert the UTF-16 arguments to UTF-8. */
-    char** utf8_argv = convert_argv(argc, utf16_argv);
+    char** utf8_argv = XSUM_convertArgv(argc, utf16_argv);
 
     if (utf8_argv == NULL) {
         /* An unfortunate but incredibly unlikely error, */
@@ -2514,10 +2503,10 @@ static int XXH_wmain(int argc, const wchar_t* const utf16_argv[])
         setvbuf(stderr, NULL, _IONBF, 0);
 
         /* Call our real main function */
-        ret = XXH_main(argc, (const char* const *) utf8_argv);
+        ret = XSUM_main(argc, (const char* const *) utf8_argv);
 
         /* Cleanup */
-        free_argv(argc, utf8_argv);
+        XSUM_freeArgv(argc, utf8_argv);
         return ret;
     }
 }
@@ -2531,13 +2520,13 @@ extern "C"
 #endif
 int wmain(int argc, const wchar_t* utf16_argv[])
 {
-    return XXH_wmain(argc, utf16_argv);
+    return XSUM_wmain(argc, utf16_argv);
 }
 
 #else /* Non-Unicode MinGW */
 
 /*
- * Wrap `XXH_wmain()` using `main()` and `__wgetmainargs()` on MinGW without
+ * Wrap `XSUM_wmain()` using `main()` and `__wgetmainargs()` on MinGW without
  * Unicode support.
  *
  * `__wgetmainargs()` is used in the CRT startup to retrieve the arguments for
@@ -2579,10 +2568,10 @@ int main(int ansi_argc, const char* ansi_argv[])
     /* Get wmain's UTF-16 arguments. Make sure we expand wildcards. */
     if (__wgetmainargs(&utf16_argc, &utf16_argv, &utf16_envp, 1, &startinfo) < 0)
         /* In the very unlikely case of an error, use the ANSI arguments. */
-        return XXH_main(ansi_argc, ansi_argv);
+        return XSUM_main(ansi_argc, ansi_argv);
 
-    /* Call XXH_wmain with our UTF-16 arguments */
-    return XXH_wmain(utf16_argc, (const wchar_t* const *)utf16_argv);
+    /* Call XSUM_wmain with our UTF-16 arguments */
+    return XSUM_wmain(utf16_argc, (const wchar_t* const *)utf16_argv);
 }
 
 #endif /* Non-Unicode MinGW */
@@ -2592,6 +2581,6 @@ int main(int ansi_argc, const char* ansi_argv[])
 /* Wrap main normally on non-Windows platforms. */
 int main(int argc, const char* argv[])
 {
-    return XXH_main(argc, argv);
+    return XSUM_main(argc, argv);
 }
 #endif /* !Windows */

From 4929b7dc9c69d8eed92a5599c2565bb16eed4213 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 2 Sep 2020 10:26:26 -0400
Subject: [PATCH 010/187] Fix typo

---
 xxhsum.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhsum.c b/xxhsum.c
index 4f790a1a..748a403d 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -2455,7 +2455,7 @@ static char** XSUM_convertArgv(int argc, const wchar_t* const utf16_argv[])
     return utf8_argv;
 }
 /* Frees arguments returned by XSUM_convertArgv */
-static void freeargv(int argc, char** argv)
+static void XSUM_freeArgv(int argc, char** argv)
 {
     int i;
     if (argv == NULL) {

From 67d35c7d34e867f35dbb493972c4c1cf048f829b Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 9 Sep 2020 13:59:29 -0400
Subject: [PATCH 011/187] [xxhsum] Begin splitting xxhsum.c

Starting with the macros

Still need to implement the Windows feature tests.
---
 .gitignore                    |   1 +
 Makefile                      |   3 +-
 programs/xxhsum/xsum_arch.h   | 153 +++++++++++++++++++++++++++++
 programs/xxhsum/xsum_config.h | 169 ++++++++++++++++++++++++++++++++
 xxhsum.c                      | 175 ++--------------------------------
 5 files changed, 333 insertions(+), 168 deletions(-)
 create mode 100644 programs/xxhsum/xsum_arch.h
 create mode 100644 programs/xxhsum/xsum_config.h

diff --git a/.gitignore b/.gitignore
index d0ce9aac..a2209534 100644
--- a/.gitignore
+++ b/.gitignore
@@ -13,6 +13,7 @@ xxh32sum
 xxh64sum
 xxh128sum
 xxhsum
+!programs/xxhsum
 xxhsum32
 xxhsum_privateXXH
 xxhsum_inlinedXXH
diff --git a/Makefile b/Makefile
index da1ce068..42e23b82 100644
--- a/Makefile
+++ b/Makefile
@@ -98,7 +98,8 @@ dispatch: xxhash.o xxh_x86dispatch.o xxhsum.c
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhash.o: xxhash.c xxhash.h
-xxhsum.o: xxhsum.c xxhash.h xxh_x86dispatch.h
+xxhsum.o: xxhsum.c programs/xxhsum/xsum_config.h programs/xxhsum/xsum_arch.h \
+    xxhash.h xxh_x86dispatch.h
 xxh_x86dispatch.o: xxh_x86dispatch.c xxh_x86dispatch.h xxhash.h
 
 .PHONY: xxhsum_and_links
diff --git a/programs/xxhsum/xsum_arch.h b/programs/xxhsum/xsum_arch.h
new file mode 100644
index 00000000..1fb9a634
--- /dev/null
+++ b/programs/xxhsum/xsum_arch.h
@@ -0,0 +1,153 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+/*
+ * Checks for predefined macros by the compiler to try and get both the arch
+ * and the compiler version.
+ */
+#ifndef XSUM_ARCH_H
+#define XSUM_ARCH_H
+
+#include "xsum_config.h"
+
+#define XSUM_LIB_VERSION XXH_VERSION_MAJOR.XXH_VERSION_MINOR.XXH_VERSION_RELEASE
+#define XSUM_QUOTE(str) #str
+#define XSUM_EXPAND_AND_QUOTE(str) XSUM_QUOTE(str)
+#define XSUM_PROGRAM_VERSION XSUM_EXPAND_AND_QUOTE(XSUM_LIB_VERSION)
+
+
+/* Show compiler versions in WELCOME_MESSAGE. XSUM_CC_VERSION_FMT will return the printf specifiers,
+ * and VERSION will contain the comma separated list of arguments to the XSUM_CC_VERSION_FMT string. */
+#if defined(__clang_version__)
+/* Clang does its own thing. */
+#  ifdef __apple_build_version__
+#    define XSUM_CC_VERSION_FMT "Apple Clang %s"
+#  else
+#    define XSUM_CC_VERSION_FMT "Clang %s"
+#  endif
+#  define XSUM_CC_VERSION  __clang_version__
+#elif defined(__VERSION__)
+/* GCC and ICC */
+#  define XSUM_CC_VERSION_FMT "%s"
+#  ifdef __INTEL_COMPILER /* icc adds its prefix */
+#    define XSUM_CC_VERSION __VERSION__
+#  else /* assume GCC */
+#    define XSUM_CC_VERSION "GCC " __VERSION__
+#  endif
+#elif defined(_MSC_FULL_VER) && defined(_MSC_BUILD)
+/*
+ * MSVC
+ *  "For example, if the version number of the Visual C++ compiler is
+ *   15.00.20706.01, the _MSC_FULL_VER macro evaluates to 150020706."
+ *
+ *   https://docs.microsoft.com/en-us/cpp/preprocessor/predefined-macros?view=vs-2017
+ */
+#  define XSUM_CC_VERSION_FMT "MSVC %02i.%02i.%05i.%02i"
+#  define XSUM_CC_VERSION  _MSC_FULL_VER / 10000000 % 100, _MSC_FULL_VER / 100000 % 100, _MSC_FULL_VER % 100000, _MSC_BUILD
+#elif defined(_MSC_VER) /* old MSVC */
+#  define XSUM_CC_VERSION_FMT "MSVC %02i.%02i"
+#  define XSUM_CC_VERSION _MSC_VER / 100, _MSC_VER % 100
+#elif defined(__TINYC__)
+/* tcc stores its version in the __TINYC__ macro. */
+#  define XSUM_CC_VERSION_FMT "tcc %i.%i.%i"
+#  define XSUM_CC_VERSION __TINYC__ / 10000 % 100, __TINYC__ / 100 % 100, __TINYC__ % 100
+#else
+#  define XSUM_CC_VERSION_FMT "%s"
+#  define XSUM_CC_VERSION "unknown compiler"
+#endif
+
+/* makes the next part easier */
+#if defined(__x86_64__) || defined(_M_AMD64) || defined(_M_X64)
+#   define XSUM_ARCH_X64 1
+#   define XSUM_ARCH_X86 "x86_64"
+#elif defined(__i386__) || defined(_M_IX86) || defined(_M_IX86_FP)
+#   define XSUM_ARCH_X86 "i386"
+#endif
+
+/* Try to detect the architecture. */
+#if defined(ARCH_X86)
+#  if defined(XXHSUM_DISPATCH)
+#    define XSUM_ARCH XSUM_ARCH_X86 " autoVec"
+#  elif defined(__AVX512F__)
+#    define XSUM_ARCH XSUM_ARCH_X86 " + AVX512"
+#  elif defined(__AVX2__)
+#    define XSUM_ARCH XSUM_ARCH_X86 " + AVX2"
+#  elif defined(__AVX__)
+#    define XSUM_ARCH XSUM_ARCH_X86 " + AVX"
+#  elif defined(_M_X64) || defined(_M_AMD64) || defined(__x86_64__) \
+      || defined(__SSE2__) || (defined(_M_IX86_FP) && _M_IX86_FP == 2)
+#     define XSUM_ARCH XSUM_ARCH_X86 " + SSE2"
+#  else
+#     define XSUM_ARCH XSUM_ARCH_X86
+#  endif
+#elif defined(__aarch64__) || defined(__arm64__) || defined(_M_ARM64)
+#  define XSUM_ARCH "aarch64 + NEON"
+#elif defined(__arm__) || defined(__thumb__) || defined(__thumb2__) || defined(_M_ARM)
+/* ARM has a lot of different features that can change xxHash significantly. */
+#  if defined(__thumb2__) || (defined(__thumb__) && (__thumb__ == 2 || __ARM_ARCH >= 7))
+#    define XSUM_ARCH_THUMB " Thumb-2"
+#  elif defined(__thumb__)
+#    define XSUM_ARCH_THUMB " Thumb-1"
+#  else
+#    define XSUM_ARCH_THUMB ""
+#  endif
+/* ARMv7 has unaligned by default */
+#  if defined(__ARM_FEATURE_UNALIGNED) || __ARM_ARCH >= 7 || defined(_M_ARMV7VE)
+#    define XSUM_ARCH_UNALIGNED " + unaligned"
+#  else
+#    define XSUM_ARCH_UNALIGNED ""
+#  endif
+#  if defined(__ARM_NEON) || defined(__ARM_NEON__)
+#    define XSUM_ARCH_NEON " + NEON"
+#  else
+#    define XSUM_ARCH_NEON ""
+#  endif
+#  define XSUM_ARCH "ARMv" XSUM_EXPAND_AND_QUOTE(__ARM_ARCH) XSUM_ARCH_THUMB XSUM_ARCH_NEON XSUM_ARCH_UNALIGNED
+#elif defined(__powerpc64__) || defined(__ppc64__) || defined(__PPC64__)
+#  if defined(__GNUC__) && defined(__POWER9_VECTOR__)
+#    define XSUM_ARCH "ppc64 + POWER9 vector"
+#  elif defined(__GNUC__) && defined(__POWER8_VECTOR__)
+#    define XSUM_ARCH "ppc64 + POWER8 vector"
+#  else
+#    define XSUM_ARCH "ppc64"
+#  endif
+#elif defined(__powerpc__) || defined(__ppc__) || defined(__PPC__)
+#  define XSUM_ARCH "ppc"
+#elif defined(__AVR)
+#  define XSUM_ARCH "AVR"
+#elif defined(__mips64)
+#  define XSUM_ARCH "mips64"
+#elif defined(__mips)
+#  define XSUM_ARCH "mips"
+#elif defined(__s390x__)
+#  define XSUM_ARCH "s390x"
+#elif defined(__s390__)
+#  define XSUM_ARCH "s390"
+#else
+#  define XSUM_ARCH "unknown"
+#endif
+
+
+#endif /* XSUM_ARCH_H */
diff --git a/programs/xxhsum/xsum_config.h b/programs/xxhsum/xsum_config.h
new file mode 100644
index 00000000..1f28f9cb
--- /dev/null
+++ b/programs/xxhsum/xsum_config.h
@@ -0,0 +1,169 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+/*
+ * This contains various configuration parameters and feature detection for
+ * xxhsum.
+ *
+ * Similar to config.h in Autotools, this should be the first header included.
+ */
+
+#ifndef XSUM_CONFIG_H
+#define XSUM_CONFIG_H
+
+
+/* ************************************
+ *  Compiler Options
+ **************************************/
+/*
+ * Disable Visual C's warnings when using the "insecure" CRT functions instead
+ * of the "secure" _s functions.
+ *
+ * These functions are not portable, and aren't necessary if you are using the
+ * original functions properly.
+ */
+#if defined(_MSC_VER) || defined(_WIN32)
+#  ifndef _CRT_SECURE_NO_WARNINGS
+#    define _CRT_SECURE_NO_WARNINGS
+#  endif
+#endif
+
+/* Under Linux at least, pull in the *64 commands */
+#ifndef _LARGEFILE64_SOURCE
+#  define _LARGEFILE64_SOURCE
+#endif
+
+/*
+ * So we can use __attribute__((__format__))
+ */
+#ifdef __GNUC__
+#  define XSUM_ATRRIBUTE(x) __attribute__(x)
+#else
+#  define XSUM_ATTRIBUTE(x)
+#endif
+
+#if !defined(_WIN32) && (defined(__unix__) || defined(__unix) || (defined(__APPLE__) && defined(__MACH__)) /* UNIX-like OS */ \
+   || defined(__midipix__) || defined(__VMS))
+#  if (defined(__APPLE__) && defined(__MACH__)) || defined(__SVR4) || defined(_AIX) || defined(__hpux) /* POSIX.1-2001 (SUSv3) conformant */ \
+     || defined(__DragonFly__) || defined(__FreeBSD__) || defined(__NetBSD__) || defined(__OpenBSD__)  /* BSD distros */
+#    define XSUM_PLATFORM_POSIX_VERSION 200112L
+#  else
+#    if defined(__linux__) || defined(__linux)
+#      ifndef _POSIX_C_SOURCE
+#        define _POSIX_C_SOURCE 200112L  /* use feature test macro */
+#      endif
+#    endif
+#    include <unistd.h>  /* declares _POSIX_VERSION */
+#    if defined(_POSIX_VERSION)  /* POSIX compliant */
+#      define XSUM_PLATFORM_POSIX_VERSION _POSIX_VERSION
+#    else
+#      define XSUM_PLATFORM_POSIX_VERSION 0
+#    endif
+#  endif
+#endif
+#if !defined(XSUM_PLATFORM_POSIX_VERSION)
+#  define XSUM_PLATFORM_POSIX_VERSION -1
+#endif
+
+#if !defined(S_ISREG)
+#  define S_ISREG(x) (((x) & S_IFMT) == S_IFREG)
+#endif
+
+
+/* ************************************
+ * Windows helpers
+ **************************************/
+
+/*
+ * Whether to use the Windows UTF-16 APIs instead of the portable libc 8-bit
+ * ("ANSI") APIs.
+ *
+ * Windows is not UTF-8 clean by default, and the only way to access every file
+ * on the OS is to use UTF-16.
+ *
+ * Do note that xxhsum uses UTF-8 internally and only uses UTF-16 for command
+ * line arguments, console I/O, and opening files.
+ *
+ * Additionally, this guarantees all piped output is UTF-8.
+ */
+#if defined(XSUM_WIN32_USE_WCHAR) && !defined(_WIN32)
+/* We use Windows APIs, only use this on Windows. */
+#  undef XSUM_WIN32_USE_WCHAR
+#endif
+
+#ifndef XSUM_WIN32_USE_WCHAR
+#  if defined(_WIN32)
+#    include <wchar.h>
+#    if WCHAR_MAX == 0xFFFFU /* UTF-16 wchar_t */
+#       define XSUM_WIN32_USE_WCHAR 1
+#    else
+#       define XSUM_WIN32_USE_WCHAR 0
+#    endif
+#  else
+#    define XSUM_WIN32_USE_WCHAR 0
+#  endif
+#endif
+
+#if !XSUM_WIN32_USE_WCHAR
+/*
+ * It doesn't make sense to have one without the other.
+ * Due to XSUM_WIN32_USE_WCHAR being undef'd, this also handles
+ * non-WIN32 platforms.
+ */
+#  undef  XSUM_WIN32_USE_WMAIN
+#  define XSUM_WIN32_USE_WMAIN 0
+#else
+/*
+ * Whether to use wmain() or main().
+ *
+ * wmain() is preferred because we don't have to mess with internal hidden
+ * APIs.
+ *
+ * It always works on MSVC, but in MinGW, it only works on MinGW-w64 with the
+ * -municode flag.
+ *
+ * Therefore we have to use main() -- there is no better option.
+ */
+#  ifndef XSUM_WIN32_USE_WMAIN
+#    if defined(_UNICODE) || defined(UNICODE) /* MinGW -municode */ \
+        || defined(_MSC_VER) /* MSVC */
+#      define XSUM_WIN32_USE_WMAIN 1
+#    else
+#      define XSUM_WIN32_USE_WMAIN 0
+#    endif
+#  endif
+/*
+ * It is always good practice to define these to prevent accidental use of the
+ * ANSI APIs, even if the program primarily uses UTF-8.
+ */
+#  ifndef _UNICODE
+#    define _UNICODE
+#  endif
+#  ifndef UNICODE
+#    define UNICODE
+#  endif
+#endif /* XSUM_WIN32_USE_WCHAR */
+
+#endif /* XSUM_CONFIG_H */
diff --git a/xxhsum.c b/xxhsum.c
index 748a403d..0f906ae1 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -29,21 +29,8 @@
  * Display convention is Big Endian, for both 32 and 64 bits algorithms
  */
 
-
-/* ************************************
- *  Compiler Options
- **************************************/
-/* MS Visual */
-#if defined(_MSC_VER) || defined(_WIN32)
-#  ifndef _CRT_SECURE_NO_WARNINGS
-#    define _CRT_SECURE_NO_WARNINGS   /* removes visual warnings */
-#  endif
-#endif
-
-/* Under Linux at least, pull in the *64 commands */
-#ifndef _LARGEFILE64_SOURCE
-#  define _LARGEFILE64_SOURCE
-#endif
+#include "programs/xxhsum/xsum_config.h"
+#include "programs/xxhsum/xsum_arch.h"
 
 /* ************************************
  *  Includes
@@ -65,35 +52,8 @@
 #  include "xxh_x86dispatch.h"
 #endif
 
-
-/* ************************************
- *  OS-Specific Includes
- **************************************/
-#if !defined(_WIN32) && (defined(__unix__) || defined(__unix) || (defined(__APPLE__) && defined(__MACH__)) /* UNIX-like OS */ \
-   || defined(__midipix__) || defined(__VMS))
-#  if (defined(__APPLE__) && defined(__MACH__)) || defined(__SVR4) || defined(_AIX) || defined(__hpux) /* POSIX.1-2001 (SUSv3) conformant */ \
-     || defined(__DragonFly__) || defined(__FreeBSD__) || defined(__NetBSD__) || defined(__OpenBSD__)  /* BSD distros */
-#    define PLATFORM_POSIX_VERSION 200112L
-#  else
-#    if defined(__linux__) || defined(__linux)
-#      ifndef _POSIX_C_SOURCE
-#        define _POSIX_C_SOURCE 200112L  /* use feature test macro */
-#      endif
-#    endif
-#    include <unistd.h>  /* declares _POSIX_VERSION */
-#    if defined(_POSIX_VERSION)  /* POSIX compliant */
-#      define PLATFORM_POSIX_VERSION _POSIX_VERSION
-#    else
-#      define PLATFORM_POSIX_VERSION 0
-#    endif
-#  endif
-#endif
-#if !defined(PLATFORM_POSIX_VERSION)
-#  define PLATFORM_POSIX_VERSION -1
-#endif
-
-#if (defined(__linux__) && (PLATFORM_POSIX_VERSION >= 1)) \
- || (PLATFORM_POSIX_VERSION >= 200112L) \
+#if (defined(__linux__) && (XSUM_PLATFORM_POSIX_VERSION >= 1)) \
+ || (XSUM_PLATFORM_POSIX_VERSION >= 200112L) \
  || defined(__DJGPP__) \
  || defined(__MSYS__)
 #  include <unistd.h>   /* isatty */
@@ -127,10 +87,6 @@ static __inline int XSUM_isConsole(FILE* stdStream) {
 #  define SET_BINARY_MODE(file)
 #endif
 
-#if !defined(S_ISREG)
-#  define S_ISREG(x) (((x) & S_IFMT) == S_IFREG)
-#endif
-
 /* Unicode helpers for Windows to make UTF-8 act as it should. */
 #ifdef _WIN32
 /*
@@ -318,132 +274,17 @@ static unsigned XSUM_isLittleEndian(void)
 }
 
 
-/* *************************************
- *  Constants
- ***************************************/
-#define LIB_VERSION XXH_VERSION_MAJOR.XXH_VERSION_MINOR.XXH_VERSION_RELEASE
-#define QUOTE(str) #str
-#define EXPAND_AND_QUOTE(str) QUOTE(str)
-#define PROGRAM_VERSION EXPAND_AND_QUOTE(LIB_VERSION)
-
-/* Show compiler versions in WELCOME_MESSAGE. CC_VERSION_FMT will return the printf specifiers,
- * and VERSION will contain the comma separated list of arguments to the CC_VERSION_FMT string. */
-#if defined(__clang_version__)
-/* Clang does its own thing. */
-#  ifdef __apple_build_version__
-#    define CC_VERSION_FMT "Apple Clang %s"
-#  else
-#    define CC_VERSION_FMT "Clang %s"
-#  endif
-#  define CC_VERSION  __clang_version__
-#elif defined(__VERSION__)
-/* GCC and ICC */
-#  define CC_VERSION_FMT "%s"
-#  ifdef __INTEL_COMPILER /* icc adds its prefix */
-#    define CC_VERSION __VERSION__
-#  else /* assume GCC */
-#    define CC_VERSION "GCC " __VERSION__
-#  endif
-#elif defined(_MSC_FULL_VER) && defined(_MSC_BUILD)
-/*
- * MSVC
- *  "For example, if the version number of the Visual C++ compiler is
- *   15.00.20706.01, the _MSC_FULL_VER macro evaluates to 150020706."
- *
- *   https://docs.microsoft.com/en-us/cpp/preprocessor/predefined-macros?view=vs-2017
- */
-#  define CC_VERSION_FMT "MSVC %02i.%02i.%05i.%02i"
-#  define CC_VERSION  _MSC_FULL_VER / 10000000 % 100, _MSC_FULL_VER / 100000 % 100, _MSC_FULL_VER % 100000, _MSC_BUILD
-#elif defined(__TINYC__)
-/* tcc stores its version in the __TINYC__ macro. */
-#  define CC_VERSION_FMT "tcc %i.%i.%i"
-#  define CC_VERSION __TINYC__ / 10000 % 100, __TINYC__ / 100 % 100, __TINYC__ % 100
-#else
-#  define CC_VERSION_FMT "%s"
-#  define CC_VERSION "unknown compiler"
-#endif
-
-/* makes the next part easier */
-#if defined(__x86_64__) || defined(_M_AMD64) || defined(_M_X64)
-#   define ARCH_X64 1
-#   define ARCH_X86 "x86_64"
-#elif defined(__i386__) || defined(_M_IX86) || defined(_M_IX86_FP)
-#   define ARCH_X86 "i386"
-#endif
-
-/* Try to detect the architecture. */
-#if defined(ARCH_X86)
-#  if defined(XXHSUM_DISPATCH)
-#    define ARCH ARCH_X86 " autoVec"
-#  elif defined(__AVX512F__)
-#    define ARCH ARCH_X86 " + AVX512"
-#  elif defined(__AVX2__)
-#    define ARCH ARCH_X86 " + AVX2"
-#  elif defined(__AVX__)
-#    define ARCH ARCH_X86 " + AVX"
-#  elif defined(_M_X64) || defined(_M_AMD64) || defined(__x86_64__) \
-      || defined(__SSE2__) || (defined(_M_IX86_FP) && _M_IX86_FP == 2)
-#     define ARCH ARCH_X86 " + SSE2"
-#  else
-#     define ARCH ARCH_X86
-#  endif
-#elif defined(__aarch64__) || defined(__arm64__) || defined(_M_ARM64)
-#  define ARCH "aarch64 + NEON"
-#elif defined(__arm__) || defined(__thumb__) || defined(__thumb2__) || defined(_M_ARM)
-/* ARM has a lot of different features that can change xxHash significantly. */
-#  if defined(__thumb2__) || (defined(__thumb__) && (__thumb__ == 2 || __ARM_ARCH >= 7))
-#    define ARCH_THUMB " Thumb-2"
-#  elif defined(__thumb__)
-#    define ARCH_THUMB " Thumb-1"
-#  else
-#    define ARCH_THUMB ""
-#  endif
-/* ARMv7 has unaligned by default */
-#  if defined(__ARM_FEATURE_UNALIGNED) || __ARM_ARCH >= 7 || defined(_M_ARMV7VE)
-#    define ARCH_UNALIGNED " + unaligned"
-#  else
-#    define ARCH_UNALIGNED ""
-#  endif
-#  if defined(__ARM_NEON) || defined(__ARM_NEON__)
-#    define ARCH_NEON " + NEON"
-#  else
-#    define ARCH_NEON ""
-#  endif
-#  define ARCH "ARMv" EXPAND_AND_QUOTE(__ARM_ARCH) ARCH_THUMB ARCH_NEON ARCH_UNALIGNED
-#elif defined(__powerpc64__) || defined(__ppc64__) || defined(__PPC64__)
-#  if defined(__GNUC__) && defined(__POWER9_VECTOR__)
-#    define ARCH "ppc64 + POWER9 vector"
-#  elif defined(__GNUC__) && defined(__POWER8_VECTOR__)
-#    define ARCH "ppc64 + POWER8 vector"
-#  else
-#    define ARCH "ppc64"
-#  endif
-#elif defined(__powerpc__) || defined(__ppc__) || defined(__PPC__)
-#  define ARCH "ppc"
-#elif defined(__AVR)
-#  define ARCH "AVR"
-#elif defined(__mips64)
-#  define ARCH "mips64"
-#elif defined(__mips)
-#  define ARCH "mips"
-#elif defined(__s390x__)
-#  define ARCH "s390x"
-#elif defined(__s390__)
-#  define ARCH "s390"
-#else
-#  define ARCH "unknown"
-#endif
 
 static const int g_nbBits = (int)(sizeof(void*)*8);
 static const char g_lename[] = "little endian";
 static const char g_bename[] = "big endian";
 #define ENDIAN_NAME (XSUM_isLittleEndian() ? g_lename : g_bename)
 static const char author[] = "Yann Collet";
-#define WELCOME_MESSAGE(exename) "%s %s by %s \n", exename, PROGRAM_VERSION, author
+#define WELCOME_MESSAGE(exename) "%s %s by %s \n", exename, XSUM_PROGRAM_VERSION, author
 #define FULL_WELCOME_MESSAGE(exename) "%s %s by %s \n" \
-                    "compiled as %i-bit %s %s with " CC_VERSION_FMT " \n", \
-                    exename, PROGRAM_VERSION, author, \
-                    g_nbBits, ARCH, ENDIAN_NAME, CC_VERSION
+                    "compiled as %i-bit %s %s with " XSUM_CC_VERSION_FMT " \n", \
+                    exename, XSUM_PROGRAM_VERSION, author, \
+                    g_nbBits, XSUM_ARCH, ENDIAN_NAME, XSUM_CC_VERSION
 
 #define KB *( 1<<10)
 #define MB *( 1<<20)

From 7ee691763bf55669217d8b74846e4992a558fe9c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 15 Sep 2020 11:21:58 -0700
Subject: [PATCH 012/187] refactor xxh3 presentation in code comments

hopefully clarifying #449
---
 xxhash.h | 43 ++++++++++++++++++-------------------------
 1 file changed, 18 insertions(+), 25 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 400d3a21..2ba034a5 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -406,7 +406,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 ************************************************************************/
 
 /* ************************************************************************
- * XXH3 is a new hash algorithm featuring:
+ * XXH3 is a more recent hash algorithm featuring:
  *  - Improved speed for both small and large inputs
  *  - True 64-bit and 128-bit outputs
  *  - SIMD acceleration
@@ -416,38 +416,31 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
  *
  *    https://fastcompression.blogspot.com/2019/03/presenting-xxh3.html
  *
- * In general, expect XXH3 to run about ~2x faster on large inputs and >3x
- * faster on small ones compared to XXH64, though exact differences depend on
- * the platform.
+ * Compared to XXH64, expect XXH3 to run approximately
+ * ~2x faster on large inputs and >3x faster on small ones,
+ * exact differences vary depending on platform.
  *
- * The algorithm is portable: Like XXH32 and XXH64, it generates the same hash
- * on all platforms.
- *
- * It benefits greatly from SIMD and 64-bit arithmetic, but does not require it.
- *
- * Almost all 32-bit and 64-bit targets that can run XXH32 smoothly can run
- * XXH3 at competitive speeds, even if XXH64 runs slowly. Further details are
- * explained in the implementation.
+ * XXH3's speed benefits greatly from SIMD and 64-bit arithmetic,
+ * but does not require it.
+ * Any 32-bit and 64-bit targets that can run XXH32 smoothly
+ * can run XXH3 at competitive speeds, even without vector support.
+ * Further details are explained in the implementation.
  *
  * Optimized implementations are provided for AVX512, AVX2, SSE2, NEON, POWER8,
- * ZVector and scalar targets. This can be controlled with the XXH_VECTOR macro.
+ * ZVector and scalar targets. This can be controlled via the XXH_VECTOR macro.
+ *
+ * XXH3 implementation is portable:
+ * it has a generic C90 formulation that can be compiled on any platform,
+ * all implementations generage exactly the same hash value on all platforms.
+ * Starting from v0.8.0, it's also labelled "stable", meaning that
+ * any future version will also generate the same hash value.
  *
  * XXH3 offers 2 variants, _64bits and _128bits.
- * When only 64 bits are needed, prefer calling the _64bits variant, as it
- * reduces the amount of mixing, resulting in faster speed on small inputs.
  *
+ * When only 64 bits are needed, prefer invoking the _64bits variant, as it
+ * reduces the amount of mixing, resulting in faster speed on small inputs.
  * It's also generally simpler to manipulate a scalar return type than a struct.
  *
- * The 128-bit version adds additional strength, but it is slightly slower.
- *
- * Return values of XXH3 and XXH128 are officially finalized starting
- * with v0.8.0 and will no longer change in future versions.
- * Avoid storing values from before that release in long-term storage.
- *
- * Results produced by v0.7.x are not comparable with results from v0.7.y.
- * However, the API is completely stable, and it can safely be used for
- * ephemeral data (local sessions).
- *
  * The API supports one-shot hashing, streaming mode, and custom secrets.
  */
 

From 1c5402c4f5ca1a1a97b5df794cf5aafe8afc883c Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 21:30:07 -0400
Subject: [PATCH 013/187] xxhsum: Split most OS/wrapper functions

Most, if not all platform-specific code is in its own file now.

Also, minor, fix Windows ANSI API usage on XSUM_isDirectory()

Didn't test Windows yet, waiting for CI.

A new change is that XSUM_fprintf_utf8 (now XSUM_vfprintf) separates
the printing and the formatting routines, instead using a vasprintf
implementation. Functionally the same, but much cleaner now.
---
 Makefile                           |  15 +-
 cmake_unofficial/CMakeLists.txt    |   5 +-
 programs/xxhsum/xsum_config.h      |   2 +-
 programs/xxhsum/xsum_os_specific.c | 467 +++++++++++++++++++++++++++++
 programs/xxhsum/xsum_os_specific.h |  84 ++++++
 programs/xxhsum/xsum_output.c      |  67 +++++
 programs/xxhsum/xsum_output.h      |  62 ++++
 xxhsum.c                           | 397 +-----------------------
 8 files changed, 708 insertions(+), 391 deletions(-)
 create mode 100644 programs/xxhsum/xsum_os_specific.c
 create mode 100644 programs/xxhsum/xsum_os_specific.h
 create mode 100644 programs/xxhsum/xsum_output.c
 create mode 100644 programs/xxhsum/xsum_output.h

diff --git a/Makefile b/Makefile
index 42e23b82..63349695 100644
--- a/Makefile
+++ b/Makefile
@@ -70,7 +70,13 @@ else
 endif
 
 LIBXXH = libxxhash.$(SHARED_EXT_VER)
-
+XXHSUM_OBJS = xxhsum.o \
+              programs/xxhsum/xsum_os_specific.o \
+              programs/xxhsum/xsum_output.o
+XXHSUM_HEADERS = programs/xxhsum/xsum_config.h \
+                 programs/xxhsum/xsum_arch.h \
+                 programs/xxhsum/xsum_os_specific.h \
+                 programs/xxhsum/xsum_output.h
 
 ## generate CLI and libraries in release mode (default for `make`)
 .PHONY: default
@@ -85,7 +91,7 @@ ifeq ($(DISPATCH),1)
 xxhsum: CPPFLAGS += -DXXHSUM_DISPATCH=1
 xxhsum: xxh_x86dispatch.o
 endif
-xxhsum: xxhash.o xxhsum.o
+xxhsum: xxhash.o $(XXHSUM_OBJS)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhsum32: CFLAGS += -m32  ## generate CLI in 32-bits mode
@@ -98,7 +104,7 @@ dispatch: xxhash.o xxh_x86dispatch.o xxhsum.c
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhash.o: xxhash.c xxhash.h
-xxhsum.o: xxhsum.c programs/xxhsum/xsum_config.h programs/xxhsum/xsum_arch.h \
+xxhsum.o: xxhsum.c $(XXHSUM_HEADERS) \
     xxhash.h xxh_x86dispatch.h
 xxh_x86dispatch.o: xxh_x86dispatch.c xxh_x86dispatch.h xxhash.h
 
@@ -158,9 +164,10 @@ help:  ## list documented targets
 .PHONY: clean
 clean:  ## remove all build artifacts
 	$(Q)$(RM) -r *.dSYM   # Mac OS-X specific
-	$(Q)$(RM) core *.o *.$(SHARED_EXT) *.$(SHARED_EXT).* *.a libxxhash.pc
+	$(Q)$(RM) core *.o *.obj *.$(SHARED_EXT) *.$(SHARED_EXT).* *.a libxxhash.pc
 	$(Q)$(RM) xxhsum$(EXT) xxhsum32$(EXT) xxhsum_inlinedXXH$(EXT) dispatch$(EXT)
 	$(Q)$(RM) xxh32sum$(EXT) xxh64sum$(EXT) xxh128sum$(EXT)
+	$(Q)$(RM) programs/xxhsum/*.o programs/xxhsum/*.obj
 	@echo cleaning completed
 
 
diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index d9a8636f..3a5086c8 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -83,7 +83,10 @@ set_target_properties(xxhash PROPERTIES
 
 if(XXHASH_BUILD_XXHSUM)
   # xxhsum
-  add_executable(xxhsum "${XXHASH_DIR}/xxhsum.c")
+  add_executable(xxhsum "${XXHASH_DIR}/xxhsum.c"
+                        "${XXHASH_DIR}/programs/xxhsum/xsum_os_specific.c"
+                        "${XXHASH_DIR}/programs/xxhsum/xsum_output.c"
+                )
   add_executable(${PROJECT_NAME}::xxhsum ALIAS xxhsum)
 
   target_link_libraries(xxhsum PRIVATE xxhash)
diff --git a/programs/xxhsum/xsum_config.h b/programs/xxhsum/xsum_config.h
index 1f28f9cb..f49bf394 100644
--- a/programs/xxhsum/xsum_config.h
+++ b/programs/xxhsum/xsum_config.h
@@ -59,7 +59,7 @@
  * So we can use __attribute__((__format__))
  */
 #ifdef __GNUC__
-#  define XSUM_ATRRIBUTE(x) __attribute__(x)
+#  define XSUM_ATTRIBUTE(x) __attribute__(x)
 #else
 #  define XSUM_ATTRIBUTE(x)
 #endif
diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
new file mode 100644
index 00000000..45a5896f
--- /dev/null
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -0,0 +1,467 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#include "xsum_config.h"
+#include "xsum_os_specific.h"
+#include <stdio.h>
+#include <stdarg.h>
+#include <stdlib.h>
+#include <sys/types.h>  /* struct stat / __wstat64 */
+#include <sys/stat.h>   /* stat() / _stat64() */
+
+/*
+ * This file contains all of the ugly boilerplate to make xxhsum work across
+ * platforms.
+ */
+#if defined(_MSC_VER)
+    typedef struct __stat64 stat_t;
+    typedef int mode_t;
+#else
+    typedef struct stat stat_t;
+#endif
+
+#if (defined(__linux__) && (XSUM_PLATFORM_POSIX_VERSION >= 1)) \
+ || (XSUM_PLATFORM_POSIX_VERSION >= 200112L) \
+ || defined(__DJGPP__) \
+ || defined(__MSYS__)
+#  include <unistd.h>   /* isatty */
+#  define XSUM_IS_CONSOLE(stdStream) isatty(fileno(stdStream))
+#elif defined(MSDOS) || defined(OS2)
+#  include <io.h>       /* _isatty */
+#  define XSUM_IS_CONSOLE(stdStream) _isatty(_fileno(stdStream))
+#elif defined(WIN32) || defined(_WIN32)
+#  include <io.h>      /* _isatty */
+#  include <windows.h> /* DeviceIoControl, HANDLE, FSCTL_SET_SPARSE */
+#  include <stdio.h>   /* FILE */
+static __inline int XSUM_IS_CONSOLE(FILE* stdStream)
+{
+    DWORD dummy;
+    return _isatty(_fileno(stdStream)) && GetConsoleMode((HANDLE)_get_osfhandle(_fileno(stdStream)), &dummy);
+}
+#else
+#  define XSUM_IS_CONSOLE(stdStream) 0
+#endif
+
+#if defined(MSDOS) || defined(OS2) || defined(WIN32) || defined(_WIN32)
+#  include <fcntl.h>   /* _O_BINARY */
+#  include <io.h>      /* _setmode, _fileno, _get_osfhandle */
+#  if !defined(__DJGPP__)
+#    include <windows.h> /* DeviceIoControl, HANDLE, FSCTL_SET_SPARSE */
+#    include <winioctl.h> /* FSCTL_SET_SPARSE */
+#    define XSUM_SET_BINARY_MODE(file) { int const unused=_setmode(_fileno(file), _O_BINARY); (void)unused; }
+#  else
+#    define XSUM_SET_BINARY_MODE(file) setmode(fileno(file), O_BINARY)
+#  endif
+#else
+#  define XSUM_SET_BINARY_MODE(file) ((void)file)
+#endif
+
+int XSUM_isConsole(FILE* stream)
+{
+    return XSUM_IS_CONSOLE(stream);
+}
+
+void XSUM_setBinaryMode(FILE* stream)
+{
+    XSUM_SET_BINARY_MODE(stream);
+}
+
+#if !XSUM_WIN32_USE_WCHAR
+
+FILE* XSUM_fopen(const char* filename, const char* mode)
+{
+    return fopen(filename, mode);
+}
+XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
+int XSUM_vfprintf(FILE* stream, const char* format, va_list ap)
+{
+    return vfprintf(stream, format, ap);
+}
+
+int XSUM_isDirectory(const char* infilename)
+{
+    stat_t statbuf;
+#if defined(_MSC_VER)
+    int const r = _stat64(infilename, &statbuf);
+    if (!r && (statbuf.st_mode & _S_IFDIR)) return 1;
+#else
+    int const r = stat(infilename, &statbuf);
+    if (!r && S_ISDIR(statbuf.st_mode)) return 1;
+#endif
+    return 0;
+}
+
+#ifndef XSUM_NO_MAIN
+int main(int argc, char* argv[])
+{
+    return XSUM_main(argc, argv);
+}
+#endif
+
+/* Unicode helpers for Windows to make UTF-8 act as it should. */
+#else
+#  include <windows.h>
+#  include <wchar.h>
+
+/*****************************************************************************
+ *                       Unicode conversion tools
+ *****************************************************************************/
+
+/*
+ * Converts a UTF-8 string to UTF-16. Acts like strdup. The string must be freed afterwards.
+ * This version allows keeping the output length.
+ */
+static wchar_t* XSUM_widenString(const char* str, int* lenOut)
+{
+    int const len = MultiByteToWideChar(CP_UTF8, 0, str, -1, NULL, 0);
+    if (lenOut != NULL) *lenOut = len;
+    if (len == 0) return NULL;
+    {   wchar_t* buf = (wchar_t*)malloc((size_t)len * sizeof(wchar_t));
+        if (buf != NULL) {
+            if (MultiByteToWideChar(CP_UTF8, 0, str, -1, buf, len) == 0) {
+                free(buf);
+                return NULL;
+       }    }
+       return buf;
+    }
+}
+
+/*
+ * Converts a UTF-16 string to UTF-8. Acts like strdup. The string must be freed afterwards.
+ * This version allows keeping the output length.
+ */
+static char* XSUM_narrowString(const wchar_t *str, int *lenOut)
+{
+    int len = WideCharToMultiByte(CP_UTF8, 0, str, -1, NULL, 0, NULL, NULL);
+    if (lenOut != NULL) *lenOut = len;
+    if (len == 0) return NULL;
+    {   char* const buf = (char*)malloc((size_t)len * sizeof(char));
+        if (buf != NULL) {
+            if (WideCharToMultiByte(CP_UTF8, 0, str, -1, buf, len, NULL, NULL) == 0) {
+                free(buf);
+                return NULL;
+        }    }
+        return buf;
+    }
+}
+
+
+
+/*****************************************************************************
+ *                             File helpers
+ *****************************************************************************/
+/*
+ * fopen wrapper that supports UTF-8
+ *
+ * fopen will only accept ANSI filenames, which means that we can't open Unicode filenames.
+ *
+ * In order to open a Unicode filename, we need to convert filenames to UTF-16 and use _wfopen.
+ */
+FILE* XSUM_fopen(const char* filename, const char* mode)
+{
+    FILE* f = NULL;
+    wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
+    if (wide_filename != NULL) {
+        wchar_t* const wide_mode = XSUM_widenString(mode, NULL);
+        if (wide_mode != NULL) {
+            f = _wfopen(wide_filename, wide_mode);
+            free(wide_mode);
+        }
+        free(wide_filename);
+    }
+    return f;
+}
+
+/*
+ * Determines whether the file at path is a directory.
+ *
+ * Accepts UTF-8 filenames, unlike _stat64.
+ */
+int XSUM_isDirectory(const char* filename)
+{
+    struct __stat64 statbuf;
+    int result = 0;
+    wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
+    if (wide_filename != NULL) {
+        if (_wstat64(wide_filename, &statbuf) == 0 /* stat fail is ok */
+              && (statbuf.st_mode & _S_IFDIR)) {
+            result = 1;
+        }
+        free(wide_filename);
+    }
+    return result;
+}
+
+/*
+ * In case it isn't available, this is what MSVC 2019 defines in stdarg.h.
+ */
+#if defined(_MSC_VER) && !defined(__clang__) && !defined(va_copy)
+#  define XSUM_va_copy(destination, source) ((destination) = (source))
+#else
+#  define XSUM_va_copy(destination, source) va_copy(destination, source)
+#endif
+
+/*
+ * vasprintf for Windows.
+ */
+static int XSUM_vasprintf(char** strp, const char* format, va_list ap)
+{
+    int ret;
+    int size;
+    va_list copy;
+    /*
+     * To be safe, make a va_copy.
+     *
+     * Note that Microsoft doesn't use va_copy in its sample code:
+     *   https://docs.microsoft.com/en-us/cpp/c-runtime-library/reference/vsprintf-vsprintf-l-vswprintf-vswprintf-l-vswprintf-l?view=vs-2019
+     */
+    XSUM_va_copy(copy, ap);
+    /* Calculate how many characters we need */
+    size = _vscprintf(format, ap);
+    va_end(copy);
+
+    if (size < 0) {
+        *strp = NULL;
+        return size;
+    } else {
+        *strp = (char*) malloc((size_t)size + 1);
+        if (*strp == NULL) {
+            return -1;
+        }
+        /* vsprintf into the new buffer */
+        ret = vsprintf(*strp, format, ap);
+        if (ret < 0) {
+            free(*strp);
+            *strp = NULL;
+        }
+        return ret;
+    }
+}
+
+/*
+ * fprintf wrapper that supports UTF-8.
+ *
+ * fprintf doesn't properly handle Unicode on Windows.
+ *
+ * Additionally, it is codepage sensitive on console and may crash the program.
+ *
+ * Instead, we use vsnprintf, and either print with fwrite or convert to UTF-16
+ * for console output and use the codepage-independent WriteConsoleW.
+ *
+ * Credit to t-mat: https://github.com/t-mat/xxHash/commit/5691423
+ */
+XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
+int XSUM_vfprintf(FILE *stream, const char *format, va_list ap)
+{
+    int result;
+    char* u8_str = NULL;
+
+    /*
+     * Generate the UTF-8 output string with vasprintf.
+     */
+    result = XSUM_vasprintf(&u8_str, format, ap);
+
+    if (result >= 0) {
+        const size_t nchar = (size_t)result + 1;
+
+        /*
+         * Check if we are outputting to a console. Don't use XSUM_isConsole
+         * directly -- we don't need to call _get_osfhandle twice.
+         */
+        int fileNb = _fileno(stream);
+        intptr_t handle_raw = _get_osfhandle(fileNb);
+        HANDLE handle = (HANDLE)handle_raw;
+        DWORD dwTemp;
+
+        if (handle_raw < 0) {
+             result = -1;
+        } else if (_isatty(fileNb) && GetConsoleMode(handle, &dwTemp)) {
+            /*
+             * Convert to UTF-16 and output with WriteConsoleW.
+             *
+             * This is codepage independent and works on Windows XP's default
+             * msvcrt.dll.
+             */
+            int len;
+            wchar_t* const u16_buf = XSUM_widenString(u8_str, &len);
+            if (u16_buf == NULL) {
+                result = -1;
+            } else {
+                if (WriteConsoleW(handle, u16_buf, (DWORD)len - 1, &dwTemp, NULL)) {
+                    result = (int)dwTemp;
+                } else {
+                    result = -1;
+                }
+                free(u16_buf);
+            }
+        } else {
+            /* fwrite the UTF-8 string if we are printing to a file */
+            result = (int)fwrite(u8_str, 1, nchar - 1, stream);
+            if (result == 0) {
+                result = -1;
+            }
+        }
+        free(u8_str);
+    }
+    return result;
+}
+
+#ifndef XSUM_NO_MAIN
+/*****************************************************************************
+ *                    Command Line argument parsing
+ *****************************************************************************/
+
+/* Converts a UTF-16 argv to UTF-8. */
+static char** XSUM_convertArgv(int argc, wchar_t* utf16_argv[])
+{
+    char** const utf8_argv = (char**)malloc((size_t)(argc + 1) * sizeof(char*));
+    if (utf8_argv != NULL) {
+        int i;
+        for (i = 0; i < argc; i++) {
+            utf8_argv[i] = XSUM_narrowString(utf16_argv[i], NULL);
+            if (utf8_argv[i] == NULL) {
+                /* Out of memory, whoops. */
+                while (i-- > 0) {
+                    free(utf8_argv[i]);
+                }
+                free(utf8_argv);
+                return NULL;
+            }
+        }
+        utf8_argv[argc] = NULL;
+    }
+    return utf8_argv;
+}
+/* Frees arguments returned by XSUM_convertArgv */
+static void XSUM_freeArgv(int argc, char** argv)
+{
+    int i;
+    if (argv == NULL) {
+        return;
+    }
+    for (i = 0; i < argc; i++) {
+        free(argv[i]);
+    }
+    free(argv);
+}
+
+static int XSUM_wmain(int argc, wchar_t* utf16_argv[])
+{
+    /* Convert the UTF-16 arguments to UTF-8. */
+    char** utf8_argv = XSUM_convertArgv(argc, utf16_argv);
+
+    if (utf8_argv == NULL) {
+        /* An unfortunate but incredibly unlikely error. */
+        fprintf(stderr, "xxhsum: error converting command line arguments!\n");
+        abort();
+    } else {
+        int ret;
+
+        /*
+         * MinGW's terminal uses full block buffering for stderr.
+         *
+         * This is nonstandard behavior and causes text to not display until
+         * the buffer fills.
+         *
+         * `setvbuf()` can easily correct this to make text display instantly.
+         */
+        setvbuf(stderr, NULL, _IONBF, 0);
+
+        /* Call our real main function */
+        ret = XSUM_main(argc, utf8_argv);
+
+        /* Cleanup */
+        XSUM_freeArgv(argc, utf8_argv);
+        return ret;
+    }
+}
+
+#if XSUM_WIN32_USE_WMAIN
+
+/*
+ * The preferred method of obtaining the real UTF-16 arguments. Always works
+ * on MSVC, sometimes works on MinGW-w64 depending on the compiler flags.
+ */
+#ifdef __cplusplus
+extern "C"
+#endif
+int __cdecl wmain(int argc, wchar_t* utf16_argv[])
+{
+    return XSUM_wmain(argc, utf16_argv);
+}
+#else /* !XSUM_WIN32_USE_WMAIN */
+
+/*
+ * Wrap `XSUM_wmain()` using `main()` and `__wgetmainargs()` on MinGW without
+ * Unicode support.
+ *
+ * `__wgetmainargs()` is used in the CRT startup to retrieve the arguments for
+ * `wmain()`, so we use it on MinGW to emulate `wmain()`.
+ *
+ * It is an internal function and not declared in any public headers, so we
+ * have to declare it manually.
+ *
+ * An alternative that doesn't mess with internal APIs is `GetCommandLineW()`
+ * with `CommandLineToArgvW()`, but the former doesn't expand wildcards and the
+ * latter requires linking to Shell32.dll and its numerous dependencies.
+ *
+ * This method keeps our dependencies to kernel32.dll and the CRT.
+ *
+ * https://docs.microsoft.com/en-us/cpp/c-runtime-library/getmainargs-wgetmainargs?view=vs-2019
+ */
+typedef struct {
+    int newmode;
+} _startupinfo;
+
+#ifdef __cplusplus
+extern "C"
+#endif
+int __cdecl __wgetmainargs(
+    int*          Argc,
+    wchar_t***    Argv,
+    wchar_t***    Env,
+    int           DoWildCard,
+    _startupinfo* StartInfo
+);
+
+int main(int ansi_argc, char* ansi_argv[])
+{
+    int       utf16_argc;
+    wchar_t** utf16_argv;
+    wchar_t** utf16_envp;         /* Unused but required */
+    _startupinfo startinfo = {0}; /* 0 == don't change new mode */
+
+    /* Get wmain's UTF-16 arguments. Make sure we expand wildcards. */
+    if (__wgetmainargs(&utf16_argc, &utf16_argv, &utf16_envp, 1, &startinfo) < 0)
+        /* In the very unlikely case of an error, use the ANSI arguments. */
+        return XSUM_main(ansi_argc, ansi_argv);
+
+    /* Call XSUM_wmain with our UTF-16 arguments */
+    return XSUM_wmain(utf16_argc, utf16_argv);
+}
+
+#endif /* !XSUM_WIN32_USE_WMAIN */
+#endif /* !XSUM_NO_MAIN */
+#endif /* XSUM_WIN32_USE_WCHAR */
diff --git a/programs/xxhsum/xsum_os_specific.h b/programs/xxhsum/xsum_os_specific.h
new file mode 100644
index 00000000..695e6463
--- /dev/null
+++ b/programs/xxhsum/xsum_os_specific.h
@@ -0,0 +1,84 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#ifndef XSUM_OS_SPECIFIC_H
+#define XSUM_OS_SPECIFIC_H
+
+#include "xsum_config.h"
+#include <stdio.h>
+#include <stdarg.h>
+
+#ifdef __cplusplus
+extern "C" {
+#endif
+
+/*
+ * Declared here to be implemented in user code.
+ *
+ * Functions like main(), but is passed UTF-8 arguments even on Windows.
+ */
+int XSUM_main(int argc, char* argv[]);
+
+/*
+ * Returns whether stream is a console.
+ *
+ * Functionally equivalent to isatty(fileno(stream)).
+ */
+int XSUM_isConsole(FILE* stream);
+
+/*
+ * Sets stream to pure binary mode (a.k.a. no CRLF conversions).
+ */
+void XSUM_setBinaryMode(FILE* stream);
+
+/*
+ * Returns whether the file at filename is a directory.
+ */
+int XSUM_isDirectory(const char* filename);
+
+/*
+ * UTF-8 stdio wrappers primarily for Windows
+ */
+
+/*
+ * fopen() wrapper. Accepts UTF-8 filenames on Windows.
+ *
+ * Specifically, on Windows, the arguments will be converted to UTF-16
+ * and passed to _wfopen().
+ */
+FILE* XSUM_fopen(const char* filename, const char* mode);
+
+/*
+ * vfprintf() wrapper which prints UTF-8 strings to Windows consoles
+ * if applicable.
+ */
+XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
+int XSUM_vfprintf(FILE* stream, const char* format, va_list ap);
+
+#ifdef __cplusplus
+}
+#endif
+
+#endif /* XSUM_OS_SPECIFIC_H */
diff --git a/programs/xxhsum/xsum_output.c b/programs/xxhsum/xsum_output.c
new file mode 100644
index 00000000..83e6e7bd
--- /dev/null
+++ b/programs/xxhsum/xsum_output.c
@@ -0,0 +1,67 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#include "xsum_output.h"
+#include "xsum_os_specific.h"
+#include <stdio.h>
+
+int XSUM_logLevel = 2;
+
+XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
+int XSUM_log(const char* format, ...)
+{
+    int ret;
+    va_list ap;
+    va_start(ap, format);
+    ret = XSUM_vfprintf(stderr, format, ap);
+    va_end(ap);
+    return ret;
+}
+
+
+XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
+int XSUM_output(const char* format, ...)
+{
+    int ret;
+    va_list ap;
+    va_start(ap, format);
+    ret = XSUM_vfprintf(stdout, format, ap);
+    va_end(ap);
+    return ret;
+}
+
+XSUM_ATTRIBUTE((__format__(__printf__, 2, 3)))
+int XSUM_logVerbose(int minLevel, const char* format, ...)
+{
+    if (XSUM_logLevel >= minLevel) {
+        int ret;
+        va_list ap;
+        va_start(ap, format);
+        ret = XSUM_vfprintf(stderr, format, ap);
+        va_end(ap);
+        return ret;
+    }
+    return 0;
+}
diff --git a/programs/xxhsum/xsum_output.h b/programs/xxhsum/xsum_output.h
new file mode 100644
index 00000000..80fec2f0
--- /dev/null
+++ b/programs/xxhsum/xsum_output.h
@@ -0,0 +1,62 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#ifndef XSUM_OUTPUT_H
+#define XSUM_OUTPUT_H
+
+#include "xsum_config.h"
+
+#ifdef __cplusplus
+extern "C" {
+#endif
+
+/*
+ * How verbose the output is.
+ */
+extern int XSUM_logLevel;
+
+/*
+ * Same as fprintf(stderr, format, ...)
+ */
+XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
+int XSUM_log(const char *format, ...);
+
+/*
+ * Like XSUM_log, but only outputs if XSUM_logLevel >= minLevel.
+ */
+XSUM_ATTRIBUTE((__format__(__printf__, 2, 3)))
+int XSUM_logVerbose(int minLevel, const char *format, ...);
+
+/*
+ * Same as printf(format, ...)
+ */
+XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
+int XSUM_output(const char *format, ...);
+
+#ifdef __cplusplus
+}
+#endif
+
+#endif /* XSUM_OUTPUT_H */
diff --git a/xxhsum.c b/xxhsum.c
index 0f906ae1..64bc6c79 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -29,8 +29,11 @@
  * Display convention is Big Endian, for both 32 and 64 bits algorithms
  */
 
+/* Transitional headers */
 #include "programs/xxhsum/xsum_config.h"
 #include "programs/xxhsum/xsum_arch.h"
+#include "programs/xxhsum/xsum_os_specific.h"
+#include "programs/xxhsum/xsum_output.h"
 
 /* ************************************
  *  Includes
@@ -52,201 +55,6 @@
 #  include "xxh_x86dispatch.h"
 #endif
 
-#if (defined(__linux__) && (XSUM_PLATFORM_POSIX_VERSION >= 1)) \
- || (XSUM_PLATFORM_POSIX_VERSION >= 200112L) \
- || defined(__DJGPP__) \
- || defined(__MSYS__)
-#  include <unistd.h>   /* isatty */
-#  define XSUM_isConsole(stdStream) isatty(fileno(stdStream))
-#elif defined(MSDOS) || defined(OS2)
-#  include <io.h>       /* _isatty */
-#  define XSUM_isConsole(stdStream) _isatty(_fileno(stdStream))
-#elif defined(WIN32) || defined(_WIN32)
-#  include <io.h>      /* _isatty */
-#  include <windows.h> /* DeviceIoControl, HANDLE, FSCTL_SET_SPARSE */
-#  include <stdio.h>   /* FILE */
-static __inline int XSUM_isConsole(FILE* stdStream) {
-    DWORD dummy;
-    return _isatty(_fileno(stdStream)) && GetConsoleMode((HANDLE)_get_osfhandle(_fileno(stdStream)), &dummy);
-}
-#else
-#  define XSUM_isConsole(stdStream) 0
-#endif
-
-#if defined(MSDOS) || defined(OS2) || defined(WIN32) || defined(_WIN32)
-#  include <fcntl.h>   /* _O_BINARY */
-#  include <io.h>      /* _setmode, _fileno, _get_osfhandle */
-#  if !defined(__DJGPP__)
-#    include <windows.h> /* DeviceIoControl, HANDLE, FSCTL_SET_SPARSE */
-#    include <winioctl.h> /* FSCTL_SET_SPARSE */
-#    define SET_BINARY_MODE(file) { int const unused=_setmode(_fileno(file), _O_BINARY); (void)unused; }
-#  else
-#    define SET_BINARY_MODE(file) setmode(fileno(file), O_BINARY)
-#  endif
-#else
-#  define SET_BINARY_MODE(file)
-#endif
-
-/* Unicode helpers for Windows to make UTF-8 act as it should. */
-#ifdef _WIN32
-/*
- * Converts a UTF-8 string to UTF-16. Acts like strdup. The string must be freed afterwards.
- * This version allows keeping the output length.
- */
-static wchar_t* XSUM_widenString(const char* str, int* lenOut)
-{
-    int const len = MultiByteToWideChar(CP_UTF8, 0, str, -1, NULL, 0);
-    if (lenOut != NULL) *lenOut = len;
-    if (len == 0) return NULL;
-    {   wchar_t* buf = (wchar_t*)malloc((size_t)len * sizeof(wchar_t));
-        if (buf != NULL) {
-            if (MultiByteToWideChar(CP_UTF8, 0, str, -1, buf, len) == 0) {
-                free(buf);
-                return NULL;
-       }    }
-       return buf;
-    }
-}
-
-/*
- * Converts a UTF-16 string to UTF-8. Acts like strdup. The string must be freed afterwards.
- * This version allows keeping the output length.
- */
-static char* XSUM_narrowString(const wchar_t *str, int *lenOut)
-{
-    int len = WideCharToMultiByte(CP_UTF8, 0, str, -1, NULL, 0, NULL, NULL);
-    if (lenOut != NULL) *lenOut = len;
-    if (len == 0) return NULL;
-    {   char* const buf = (char*)malloc((size_t)len * sizeof(char));
-        if (buf != NULL) {
-            if (WideCharToMultiByte(CP_UTF8, 0, str, -1, buf, len, NULL, NULL) == 0) {
-                free(buf);
-                return NULL;
-        }    }
-        return buf;
-    }
-}
-
-/*
- * fopen wrapper that supports UTF-8
- *
- * fopen will only accept ANSI filenames, which means that we can't open Unicode filenames.
- *
- * In order to open a Unicode filename, we need to convert filenames to UTF-16 and use _wfopen.
- */
-static FILE* XSUM_fopen_wrapped(const char *filename, const wchar_t *mode)
-{
-    wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
-    if (wide_filename == NULL) return NULL;
-    {   FILE* const f = _wfopen(wide_filename, mode);
-        free(wide_filename);
-        return f;
-    }
-}
-
-/*
- * In case it isn't available, this is what MSVC 2019 defines in stdarg.h.
- */
-#if defined(_MSC_VER) && !defined(__clang__) && !defined(va_copy)
-#  define va_copy(destination, source) ((destination) = (source))
-#endif
-
-/*
- * fprintf wrapper that supports UTF-8.
- *
- * fprintf doesn't properly handle Unicode on Windows.
- *
- * Additionally, it is codepage sensitive on console and may crash the program.
- *
- * Instead, we use vsnprintf, and either print with fwrite or convert to UTF-16
- * for console output and use the codepage-independent WriteConsoleW.
- *
- * Credit to t-mat: https://github.com/t-mat/xxHash/commit/5691423
- */
-static int XSUM_fprintf_utf8(FILE *stream, const char *format, ...)
-{
-    int result;
-    va_list args;
-    va_list copy;
-
-    va_start(args, format);
-
-    /*
-     * To be safe, make a va_copy.
-     *
-     * Note that Microsoft doesn't use va_copy in its sample code:
-     *   https://docs.microsoft.com/en-us/cpp/c-runtime-library/reference/vsprintf-vsprintf-l-vswprintf-vswprintf-l-vswprintf-l?view=vs-2019
-     */
-    va_copy(copy, args);
-    /* Counts the number of characters needed for vsnprintf. */
-    result = _vscprintf(format, copy);
-    va_end(copy);
-
-    if (result > 0) {
-        /* Create a buffer for vsnprintf */
-        const size_t nchar = (size_t)result + 1;
-        char* u8_str = (char*)malloc(nchar * sizeof(u8_str[0]));
-
-        if (u8_str == NULL) {
-            result = -1;
-        } else {
-            /* Generate the UTF-8 string with vsnprintf. */
-            result = _vsnprintf(u8_str, nchar - 1, format, args);
-            u8_str[nchar - 1] = '\0';
-            if (result > 0) {
-                /*
-                 * Check if we are outputting to a console. Don't use XSUM_isConsole
-                 * directly -- we don't need to call _get_osfhandle twice.
-                 */
-                int fileNb = _fileno(stream);
-                intptr_t handle_raw = _get_osfhandle(fileNb);
-                HANDLE handle = (HANDLE)handle_raw;
-                DWORD dwTemp;
-
-                if (handle_raw < 0) {
-                     result = -1;
-                } else if (_isatty(fileNb) && GetConsoleMode(handle, &dwTemp)) {
-                    /*
-                     * Convert to UTF-16 and output with WriteConsoleW.
-                     *
-                     * This is codepage independent and works on Windows XP's
-                     * default msvcrt.dll.
-                     */
-                    int len;
-                    wchar_t *const u16_buf = XSUM_widenString(u8_str, &len);
-                    if (u16_buf == NULL) {
-                        result = -1;
-                    } else {
-                        if (WriteConsoleW(handle, u16_buf, (DWORD)len - 1, &dwTemp, NULL)) {
-                            result = (int)dwTemp;
-                        } else {
-                            result = -1;
-                        }
-                        free(u16_buf);
-                    }
-                } else {
-                    /* fwrite the UTF-8 string if we are printing to a file */
-                    result = (int)fwrite(u8_str, 1, nchar - 1, stream);
-                    if (result == 0) {
-                        result = -1;
-                    }
-                }
-            }
-            free(u8_str);
-        }
-    }
-    va_end(args);
-    return result;
-}
-/*
- * Since we always use literals in the "mode" argument, it is just easier to append "L" to
- * the string to make it UTF-16 and avoid the hassle of a second manual conversion.
- */
-#  define XSUM_fopen(filename, mode) XSUM_fopen_wrapped(filename, L##mode)
-#else
-#  define XSUM_fopen(filename, mode) fopen(filename, mode)
-#endif
-
 /* ************************************
 *  Basic Types
 **************************************/
@@ -273,8 +81,6 @@ static unsigned XSUM_isLittleEndian(void)
     return one.c[0];
 }
 
-
-
 static const int g_nbBits = (int)(sizeof(void*)*8);
 static const char g_lename[] = "little endian";
 static const char g_bename[] = "big endian";
@@ -315,16 +121,6 @@ static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & X
 /* ************************************
  *  Display macros
  **************************************/
-#ifdef _WIN32
-#define XSUM_log(...)         XSUM_fprintf_utf8(stderr, __VA_ARGS__)
-#define XSUM_output(...)   XSUM_fprintf_utf8(stdout, __VA_ARGS__)
-#else
-#define XSUM_log(...)         fprintf(stderr, __VA_ARGS__)
-#define XSUM_output(...)   fprintf(stdout, __VA_ARGS__)
-#endif
-
-#define XSUM_logVerbose(l, ...) do { if (g_displayLevel>=l) XSUM_log(__VA_ARGS__); } while (0)
-static int g_displayLevel = 2;
 
 
 /* ************************************
@@ -341,7 +137,6 @@ static clock_t XSUM_clockSpan( clock_t start )
     return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
 }
 
-
 static size_t XSUM_findMaxMem(U64 requiredMem)
 {
     size_t const step = 64 MB;
@@ -628,7 +423,7 @@ static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
                     (unsigned)bufferSize,
                     (double)1 / fastestH,
                     ((double)bufferSize / (1 MB)) / fastestH);
-    if (g_displayLevel<1)
+    if (XSUM_logLevel<1)
         XSUM_logVerbose(0, "%u, ", (unsigned)((double)1 / fastestH));
 }
 
@@ -675,7 +470,7 @@ static size_t XSUM_selectBenchedSize(const char* fileName)
 }
 
 
-static int XSUM_benchFiles(const char*const* fileNamesTable, int nbFiles)
+static int XSUM_benchFiles(char*const* fileNamesTable, int nbFiles)
 {
     int fileIdx;
     for (fileIdx=0; fileIdx<nbFiles; fileIdx++) {
@@ -1245,28 +1040,6 @@ static void XSUM_sanityCheck(void)
 /* ********************************************************
 *  File Hashing
 **********************************************************/
-#if defined(_MSC_VER)
-    typedef struct __stat64 stat_t;
-    typedef int mode_t;
-#else
-    typedef struct stat stat_t;
-#endif
-
-#include <sys/types.h>  /* struct stat / __start64 */
-#include <sys/stat.h>   /* stat() / _stat64() */
-
-int XSUM_isDirectory(const char* infilename)
-{
-    stat_t statbuf;
-#if defined(_MSC_VER)
-    int const r = _stat64(infilename, &statbuf);
-    if (!r && (statbuf.st_mode & _S_IFDIR)) return 1;
-#else
-    int const r = stat(infilename, &statbuf);
-    if (!r && S_ISDIR(statbuf.st_mode)) return 1;
-#endif
-    return 0;
-}
 
 /* for support of --little-endian display mode */
 static void XSUM_display_LittleEndian(const void* ptr, size_t length)
@@ -1433,7 +1206,7 @@ static int XSUM_hashFile(const char* fileName,
     if (fileName == stdinName) {
         inFile = stdin;
         fileName = "stdin";
-        SET_BINARY_MODE(stdin);
+        XSUM_setBinaryMode(stdin);
     } else {
         if (XSUM_isDirectory(fileName)) {
             XSUM_log("xxhsum: %s: Is a directory \n", fileName);
@@ -1493,7 +1266,7 @@ static int XSUM_hashFile(const char* fileName,
  * XSUM_hashFiles:
  * If fnTotal==0, read from stdin instead.
  */
-static int XSUM_hashFiles(const char*const * fnList, int fnTotal,
+static int XSUM_hashFiles(char*const * fnList, int fnTotal,
                           AlgoSelected hashType,
                           Display_endianess displayEndianess,
                           Display_convention convention)
@@ -2001,7 +1774,7 @@ static int XSUM_checkFile(const char* inFileName,
 }
 
 
-static int XSUM_checkFiles(const char*const* fnList, int fnTotal,
+static int XSUM_checkFiles(char*const* fnList, int fnTotal,
                            const Display_endianess displayEndianess,
                            U32 strictMode,
                            U32 statusOnly,
@@ -2131,7 +1904,7 @@ static U32 XSUM_readU32FromChar(const char** stringPtr) {
     return result;
 }
 
-static int XSUM_main(int argc, const char* const* argv)
+int XSUM_main(int argc, char* argv[])
 {
     int i, filenamesStart = 0;
     const char* const exename = XSUM_lastNameFromPath(argv[0]);
@@ -2160,7 +1933,7 @@ static int XSUM_main(int argc, const char* const* argv)
         if (!strcmp(argument, "--check")) { fileCheckMode = 1; continue; }
         if (!strcmp(argument, "--benchmark-all")) { benchmarkMode = 1; selectBenchIDs = kBenchAll; continue; }
         if (!strcmp(argument, "--bench-all")) { benchmarkMode = 1; selectBenchIDs = kBenchAll; continue; }
-        if (!strcmp(argument, "--quiet")) { g_displayLevel--; continue; }
+        if (!strcmp(argument, "--quiet")) { XSUM_logLevel--; continue; }
         if (!strcmp(argument, "--little-endian")) { displayEndianess = little_endian; continue; }
         if (!strcmp(argument, "--strict")) { strictMode = 1; continue; }
         if (!strcmp(argument, "--status")) { statusOnly = 1; continue; }
@@ -2248,7 +2021,7 @@ static int XSUM_main(int argc, const char* const* argv)
             /* Modify verbosity of benchmark output (hidden option) */
             case 'q':
                 argument++;
-                g_displayLevel--;
+                XSUM_logLevel--;
                 break;
 
             default:
@@ -2274,154 +2047,8 @@ static int XSUM_main(int argc, const char* const* argv)
     if (filenamesStart==0) filenamesStart = argc;
     if (fileCheckMode) {
         return XSUM_checkFiles(argv+filenamesStart, argc-filenamesStart,
-                          displayEndianess, strictMode, statusOnly, warn, (g_displayLevel < 2) /*quiet*/);
+                          displayEndianess, strictMode, statusOnly, warn, (XSUM_logLevel < 2) /*quiet*/);
     } else {
         return XSUM_hashFiles(argv+filenamesStart, argc-filenamesStart, algo, displayEndianess, convention);
     }
 }
-
-/* Windows main wrapper which properly handles UTF-8 command line arguments. */
-#ifdef _WIN32
-/* Converts a UTF-16 argv to UTF-8. */
-static char** XSUM_convertArgv(int argc, const wchar_t* const utf16_argv[])
-{
-    char** const utf8_argv = (char**)malloc((size_t)(argc + 1) * sizeof(char*));
-    if (utf8_argv != NULL) {
-        int i;
-        for (i = 0; i < argc; i++) {
-            utf8_argv[i] = XSUM_narrowString(utf16_argv[i], NULL);
-        }
-        utf8_argv[argc] = NULL;
-    }
-    return utf8_argv;
-}
-/* Frees arguments returned by XSUM_convertArgv */
-static void XSUM_freeArgv(int argc, char** argv)
-{
-    int i;
-    if (argv == NULL) {
-        return;
-    }
-    for (i = 0; i < argc; i++) {
-        free(argv[i]);
-    }
-    free(argv);
-}
-
-
-/*
- * On Windows, main's argv parameter is useless. Instead of UTF-8, you get ANSI
- * encoding, and any unknown characters will show up as mojibake.
- *
- * While this doesn't affect most programs, what does happen is that we can't
- * open any files with Unicode filenames.
- *
- * We instead convert wmain's arguments to UTF-8, preserving Unicode arguments.
- *
- * This function is wrapped by `__wgetmainargs()` and `main()` below on MinGW
- * with Unicode disabled, but if possible, we try to use `wmain()`.
- */
-static int XSUM_wmain(int argc, const wchar_t* const utf16_argv[])
-{
-    /* Convert the UTF-16 arguments to UTF-8. */
-    char** utf8_argv = XSUM_convertArgv(argc, utf16_argv);
-
-    if (utf8_argv == NULL) {
-        /* An unfortunate but incredibly unlikely error, */
-        fprintf(stderr, "Error converting command line arguments!\n");
-        return 1;
-    } else {
-        int ret;
-
-        /*
-         * MinGW's terminal uses full block buffering for stderr.
-         *
-         * This is nonstandard behavior and causes text to not display until
-         * the buffer fills.
-         *
-         * `setvbuf()` can easily correct this to make text display instantly.
-         */
-        setvbuf(stderr, NULL, _IONBF, 0);
-
-        /* Call our real main function */
-        ret = XSUM_main(argc, (const char* const *) utf8_argv);
-
-        /* Cleanup */
-        XSUM_freeArgv(argc, utf8_argv);
-        return ret;
-    }
-}
-
-#if defined(_MSC_VER)                     /* MSVC always accepts wmain */ \
- || defined(_UNICODE) || defined(UNICODE) /* defined with -municode on MinGW-w64 */
-
-/* Preferred: Use the real `wmain()`. */
-#if defined(__cplusplus)
-extern "C"
-#endif
-int wmain(int argc, const wchar_t* utf16_argv[])
-{
-    return XSUM_wmain(argc, utf16_argv);
-}
-
-#else /* Non-Unicode MinGW */
-
-/*
- * Wrap `XSUM_wmain()` using `main()` and `__wgetmainargs()` on MinGW without
- * Unicode support.
- *
- * `__wgetmainargs()` is used in the CRT startup to retrieve the arguments for
- * `wmain()`, so we use it on MinGW to emulate `wmain()`.
- *
- * It is an internal function and not declared in any public headers, so we
- * have to declare it manually.
- *
- * An alternative that doesn't mess with internal APIs is `GetCommandLineW()`
- * with `CommandLineToArgvW()`, but the former doesn't expand wildcards and the
- * latter requires linking to Shell32.dll and its numerous dependencies.
- *
- * This method keeps our dependencies to kernel32.dll and the CRT.
- *
- * https://docs.microsoft.com/en-us/cpp/c-runtime-library/getmainargs-wgetmainargs?view=vs-2019
- */
-typedef struct {
-    int newmode;
-} _startupinfo;
-
-#ifdef __cplusplus
-extern "C"
-#endif
-int __cdecl __wgetmainargs(
-    int*          Argc,
-    wchar_t***    Argv,
-    wchar_t***    Env,
-    int           DoWildCard,
-    _startupinfo* StartInfo
-);
-
-int main(int ansi_argc, const char* ansi_argv[])
-{
-    int       utf16_argc;
-    wchar_t** utf16_argv;
-    wchar_t** utf16_envp;         /* Unused but required */
-    _startupinfo startinfo = {0}; /* 0 == don't change new mode */
-
-    /* Get wmain's UTF-16 arguments. Make sure we expand wildcards. */
-    if (__wgetmainargs(&utf16_argc, &utf16_argv, &utf16_envp, 1, &startinfo) < 0)
-        /* In the very unlikely case of an error, use the ANSI arguments. */
-        return XSUM_main(ansi_argc, ansi_argv);
-
-    /* Call XSUM_wmain with our UTF-16 arguments */
-    return XSUM_wmain(utf16_argc, (const wchar_t* const *)utf16_argv);
-}
-
-#endif /* Non-Unicode MinGW */
-
-#else /* Not Windows */
-
-/* Wrap main normally on non-Windows platforms. */
-int main(int argc, const char* argv[])
-{
-    return XSUM_main(argc, argv);
-}
-#endif /* !Windows */

From aaf6716f666665b13cfbe5f2a96a57c95d887215 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 21:38:59 -0400
Subject: [PATCH 014/187] Fix #451

Define _FILE_OFFSET_BITS to 64
---
 programs/xxhsum/xsum_config.h | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/programs/xxhsum/xsum_config.h b/programs/xxhsum/xsum_config.h
index f49bf394..dd797069 100644
--- a/programs/xxhsum/xsum_config.h
+++ b/programs/xxhsum/xsum_config.h
@@ -54,6 +54,9 @@
 #ifndef _LARGEFILE64_SOURCE
 #  define _LARGEFILE64_SOURCE
 #endif
+#ifndef _FILE_OFFSET_BITS
+#  define _FILE_OFFSET_BITS 64
+#endif
 
 /*
  * So we can use __attribute__((__format__))

From 5634a1f3fe70145a5ed25537fc544430333c42b7 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 22:06:24 -0400
Subject: [PATCH 015/187] Fix dependency issues, XXH_INLINE_ALL affects xxhsum
 files

---
 Makefile                           | 19 ++++++++++---------
 programs/xxhsum/xsum_config.h      |  5 +++++
 programs/xxhsum/xsum_os_specific.c | 14 +++++++-------
 programs/xxhsum/xsum_os_specific.h | 12 ++++++------
 programs/xxhsum/xsum_output.c      |  6 +++---
 programs/xxhsum/xsum_output.h      |  6 +++---
 xxhsum.c                           |  6 +++++-
 7 files changed, 39 insertions(+), 29 deletions(-)

diff --git a/Makefile b/Makefile
index 63349695..15a6c23d 100644
--- a/Makefile
+++ b/Makefile
@@ -70,9 +70,10 @@ else
 endif
 
 LIBXXH = libxxhash.$(SHARED_EXT_VER)
-XXHSUM_OBJS = xxhsum.o \
-              programs/xxhsum/xsum_os_specific.o \
-              programs/xxhsum/xsum_output.o
+
+XXHSUM_SPLIT_SRCS = programs/xxhsum/xsum_os_specific.c \
+                    programs/xxhsum/xsum_output.c
+XXHSUM_SPLIT_OBJS = $(XXHSUM_SPLIT_SRCS:.c=.o)
 XXHSUM_HEADERS = programs/xxhsum/xsum_config.h \
                  programs/xxhsum/xsum_arch.h \
                  programs/xxhsum/xsum_os_specific.h \
@@ -91,16 +92,16 @@ ifeq ($(DISPATCH),1)
 xxhsum: CPPFLAGS += -DXXHSUM_DISPATCH=1
 xxhsum: xxh_x86dispatch.o
 endif
-xxhsum: xxhash.o $(XXHSUM_OBJS)
+xxhsum: xxhash.o xxhsum.o $(XXHSUM_SPLIT_OBJS)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhsum32: CFLAGS += -m32  ## generate CLI in 32-bits mode
-xxhsum32: xxhash.c xxhsum.c  ## do not generate object (avoid mixing different ABI)
+xxhsum32: xxhash.c xxhsum.c $(XXHSUM_SPLIT_SRCS) ## do not generate object (avoid mixing different ABI)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 ## dispatch only works for x86/x64 systems
 dispatch: CPPFLAGS += -DXXHSUM_DISPATCH=1
-dispatch: xxhash.o xxh_x86dispatch.o xxhsum.c
+dispatch: xxhash.o xxh_x86dispatch.o xxhsum.c $(XXHSUM_SPLIT_SRCS)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhash.o: xxhash.c xxhash.h
@@ -115,8 +116,8 @@ xxh32sum xxh64sum xxh128sum: xxhsum
 	ln -sf $<$(EXT) $@$(EXT)
 
 xxhsum_inlinedXXH: CPPFLAGS += -DXXH_INLINE_ALL
-xxhsum_inlinedXXH: xxhsum.c
-	$(CC) $(FLAGS) $^ -o $@$(EXT)
+xxhsum_inlinedXXH: xxhsum.c $(XXHSUM_SPLIT_SRCS)
+	$(CC) $(FLAGS) $< -o $@$(EXT)
 
 
 # library
@@ -329,7 +330,7 @@ cppcheck:  ## check C source files using $(CPPCHECK) static analyzer
 namespaceTest:  ## ensure XXH_NAMESPACE redefines all public symbols
 	$(CC) -c xxhash.c
 	$(CC) -DXXH_NAMESPACE=TEST_ -c xxhash.c -o xxhash2.o
-	$(CC) xxhash.o xxhash2.o xxhsum.c -o xxhsum2  # will fail if one namespace missing (symbol collision)
+	$(CC) xxhash.o xxhash2.o xxhsum.c $(XXHSUM_SPLIT_SRCS)  -o xxhsum2  # will fail if one namespace missing (symbol collision)
 	$(RM) *.o xxhsum2  # clean
 
 MD2ROFF ?= ronn
diff --git a/programs/xxhsum/xsum_config.h b/programs/xxhsum/xsum_config.h
index dd797069..ef21a5ae 100644
--- a/programs/xxhsum/xsum_config.h
+++ b/programs/xxhsum/xsum_config.h
@@ -169,4 +169,9 @@
 #  endif
 #endif /* XSUM_WIN32_USE_WCHAR */
 
+#ifdef XXH_INLINE_ALL
+#  define XSUM_API static
+#else
+#  define XSUM_API
+#endif
 #endif /* XSUM_CONFIG_H */
diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
index 45a5896f..684f8473 100644
--- a/programs/xxhsum/xsum_os_specific.c
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -78,24 +78,24 @@ static __inline int XSUM_IS_CONSOLE(FILE* stdStream)
 #  define XSUM_SET_BINARY_MODE(file) ((void)file)
 #endif
 
-int XSUM_isConsole(FILE* stream)
+XSUM_API int XSUM_isConsole(FILE* stream)
 {
     return XSUM_IS_CONSOLE(stream);
 }
 
-void XSUM_setBinaryMode(FILE* stream)
+XSUM_API void XSUM_setBinaryMode(FILE* stream)
 {
     XSUM_SET_BINARY_MODE(stream);
 }
 
 #if !XSUM_WIN32_USE_WCHAR
 
-FILE* XSUM_fopen(const char* filename, const char* mode)
+XSUM_API FILE* XSUM_fopen(const char* filename, const char* mode)
 {
     return fopen(filename, mode);
 }
 XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
-int XSUM_vfprintf(FILE* stream, const char* format, va_list ap)
+XSUM_API int XSUM_vfprintf(FILE* stream, const char* format, va_list ap)
 {
     return vfprintf(stream, format, ap);
 }
@@ -179,7 +179,7 @@ static char* XSUM_narrowString(const wchar_t *str, int *lenOut)
  *
  * In order to open a Unicode filename, we need to convert filenames to UTF-16 and use _wfopen.
  */
-FILE* XSUM_fopen(const char* filename, const char* mode)
+XSUM_API FILE* XSUM_fopen(const char* filename, const char* mode)
 {
     FILE* f = NULL;
     wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
@@ -199,7 +199,7 @@ FILE* XSUM_fopen(const char* filename, const char* mode)
  *
  * Accepts UTF-8 filenames, unlike _stat64.
  */
-int XSUM_isDirectory(const char* filename)
+XSUM_API int XSUM_isDirectory(const char* filename)
 {
     struct __stat64 statbuf;
     int result = 0;
@@ -273,7 +273,7 @@ static int XSUM_vasprintf(char** strp, const char* format, va_list ap)
  * Credit to t-mat: https://github.com/t-mat/xxHash/commit/5691423
  */
 XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
-int XSUM_vfprintf(FILE *stream, const char *format, va_list ap)
+XSUM_API int XSUM_vfprintf(FILE *stream, const char *format, va_list ap)
 {
     int result;
     char* u8_str = NULL;
diff --git a/programs/xxhsum/xsum_os_specific.h b/programs/xxhsum/xsum_os_specific.h
index 695e6463..58fb63c1 100644
--- a/programs/xxhsum/xsum_os_specific.h
+++ b/programs/xxhsum/xsum_os_specific.h
@@ -39,24 +39,24 @@ extern "C" {
  *
  * Functions like main(), but is passed UTF-8 arguments even on Windows.
  */
-int XSUM_main(int argc, char* argv[]);
+XSUM_API int XSUM_main(int argc, char* argv[]);
 
 /*
  * Returns whether stream is a console.
  *
  * Functionally equivalent to isatty(fileno(stream)).
  */
-int XSUM_isConsole(FILE* stream);
+XSUM_API int XSUM_isConsole(FILE* stream);
 
 /*
  * Sets stream to pure binary mode (a.k.a. no CRLF conversions).
  */
-void XSUM_setBinaryMode(FILE* stream);
+XSUM_API void XSUM_setBinaryMode(FILE* stream);
 
 /*
  * Returns whether the file at filename is a directory.
  */
-int XSUM_isDirectory(const char* filename);
+XSUM_API int XSUM_isDirectory(const char* filename);
 
 /*
  * UTF-8 stdio wrappers primarily for Windows
@@ -68,14 +68,14 @@ int XSUM_isDirectory(const char* filename);
  * Specifically, on Windows, the arguments will be converted to UTF-16
  * and passed to _wfopen().
  */
-FILE* XSUM_fopen(const char* filename, const char* mode);
+XSUM_API FILE* XSUM_fopen(const char* filename, const char* mode);
 
 /*
  * vfprintf() wrapper which prints UTF-8 strings to Windows consoles
  * if applicable.
  */
 XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
-int XSUM_vfprintf(FILE* stream, const char* format, va_list ap);
+XSUM_API int XSUM_vfprintf(FILE* stream, const char* format, va_list ap);
 
 #ifdef __cplusplus
 }
diff --git a/programs/xxhsum/xsum_output.c b/programs/xxhsum/xsum_output.c
index 83e6e7bd..a4d74115 100644
--- a/programs/xxhsum/xsum_output.c
+++ b/programs/xxhsum/xsum_output.c
@@ -30,7 +30,7 @@
 int XSUM_logLevel = 2;
 
 XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
-int XSUM_log(const char* format, ...)
+XSUM_API int XSUM_log(const char* format, ...)
 {
     int ret;
     va_list ap;
@@ -42,7 +42,7 @@ int XSUM_log(const char* format, ...)
 
 
 XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
-int XSUM_output(const char* format, ...)
+XSUM_API int XSUM_output(const char* format, ...)
 {
     int ret;
     va_list ap;
@@ -53,7 +53,7 @@ int XSUM_output(const char* format, ...)
 }
 
 XSUM_ATTRIBUTE((__format__(__printf__, 2, 3)))
-int XSUM_logVerbose(int minLevel, const char* format, ...)
+XSUM_API int XSUM_logVerbose(int minLevel, const char* format, ...)
 {
     if (XSUM_logLevel >= minLevel) {
         int ret;
diff --git a/programs/xxhsum/xsum_output.h b/programs/xxhsum/xsum_output.h
index 80fec2f0..8a02c1b7 100644
--- a/programs/xxhsum/xsum_output.h
+++ b/programs/xxhsum/xsum_output.h
@@ -41,19 +41,19 @@ extern int XSUM_logLevel;
  * Same as fprintf(stderr, format, ...)
  */
 XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
-int XSUM_log(const char *format, ...);
+XSUM_API int XSUM_log(const char *format, ...);
 
 /*
  * Like XSUM_log, but only outputs if XSUM_logLevel >= minLevel.
  */
 XSUM_ATTRIBUTE((__format__(__printf__, 2, 3)))
-int XSUM_logVerbose(int minLevel, const char *format, ...);
+XSUM_API int XSUM_logVerbose(int minLevel, const char *format, ...);
 
 /*
  * Same as printf(format, ...)
  */
 XSUM_ATTRIBUTE((__format__(__printf__, 1, 2)))
-int XSUM_output(const char *format, ...);
+XSUM_API int XSUM_output(const char *format, ...);
 
 #ifdef __cplusplus
 }
diff --git a/xxhsum.c b/xxhsum.c
index 64bc6c79..0e4a87c6 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -34,6 +34,10 @@
 #include "programs/xxhsum/xsum_arch.h"
 #include "programs/xxhsum/xsum_os_specific.h"
 #include "programs/xxhsum/xsum_output.h"
+#ifdef XXH_INLINE_ALL
+#  include "programs/xxhsum/xsum_os_specific.c"
+#  include "programs/xxhsum/xsum_output.c"
+#endif
 
 /* ************************************
  *  Includes
@@ -1904,7 +1908,7 @@ static U32 XSUM_readU32FromChar(const char** stringPtr) {
     return result;
 }
 
-int XSUM_main(int argc, char* argv[])
+XSUM_API int XSUM_main(int argc, char* argv[])
 {
     int i, filenamesStart = 0;
     const char* const exename = XSUM_lastNameFromPath(argv[0]);

From 7cd2f455e7370757678f293e7a1c3af3ed7b27f4 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 22:30:18 -0400
Subject: [PATCH 016/187] xxhsum: Move XSUM_getFileSize

---
 programs/xxhsum/xsum_config.h      |  32 ++++-
 programs/xxhsum/xsum_os_specific.c |  57 ++++++---
 programs/xxhsum/xsum_os_specific.h |   5 +
 xxhsum.c                           | 195 ++++++++++++-----------------
 4 files changed, 150 insertions(+), 139 deletions(-)

diff --git a/programs/xxhsum/xsum_config.h b/programs/xxhsum/xsum_config.h
index ef21a5ae..9895744a 100644
--- a/programs/xxhsum/xsum_config.h
+++ b/programs/xxhsum/xsum_config.h
@@ -169,9 +169,33 @@
 #  endif
 #endif /* XSUM_WIN32_USE_WCHAR */
 
-#ifdef XXH_INLINE_ALL
-#  define XSUM_API static
-#else
-#  define XSUM_API
+#ifndef XSUM_API
+#  ifdef XXH_INLINE_ALL
+#    define XSUM_API static
+#  else
+#    define XSUM_API
+#  endif
 #endif
+
+/* ***************************
+ * Basic types
+ * ***************************/
+
+#if defined(__cplusplus) /* C++ */ \
+ || (defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L)  /* C99 */
+#  include <stdint.h>
+    typedef uint8_t  XSUM_U8;
+    typedef uint32_t XSUM_U32;
+    typedef uint64_t XSUM_U64;
+# else
+#   include <limits.h>
+    typedef unsigned char      XSUM_U8;
+#   if UINT_MAX == 0xFFFFFFFFUL
+      typedef unsigned int     XSUM_U32;
+#   else
+      typedef unsigned long    XSUM_U32;
+#   endif
+    typedef unsigned long long XSUM_U64;
+#endif /* not C++/C99 */
+
 #endif /* XSUM_CONFIG_H */
diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
index 684f8473..5dde3668 100644
--- a/programs/xxhsum/xsum_os_specific.c
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -36,10 +36,10 @@
  * platforms.
  */
 #if defined(_MSC_VER)
-    typedef struct __stat64 stat_t;
+    typedef struct __stat64 XSUM_stat_t;
     typedef int mode_t;
 #else
-    typedef struct stat stat_t;
+    typedef struct stat XSUM_stat_t;
 #endif
 
 #if (defined(__linux__) && (XSUM_PLATFORM_POSIX_VERSION >= 1)) \
@@ -100,17 +100,13 @@ XSUM_API int XSUM_vfprintf(FILE* stream, const char* format, va_list ap)
     return vfprintf(stream, format, ap);
 }
 
-int XSUM_isDirectory(const char* infilename)
+static int XSUM_stat(const char* infilename, XSUM_stat_t* statbuf)
 {
-    stat_t statbuf;
 #if defined(_MSC_VER)
-    int const r = _stat64(infilename, &statbuf);
-    if (!r && (statbuf.st_mode & _S_IFDIR)) return 1;
+    return _stat64(infilename, statbuf);
 #else
-    int const r = stat(infilename, &statbuf);
-    if (!r && S_ISDIR(statbuf.st_mode)) return 1;
+    return stat(infilename, statbuf);
 #endif
-    return 0;
 }
 
 #ifndef XSUM_NO_MAIN
@@ -195,23 +191,17 @@ XSUM_API FILE* XSUM_fopen(const char* filename, const char* mode)
 }
 
 /*
- * Determines whether the file at path is a directory.
- *
- * Accepts UTF-8 filenames, unlike _stat64.
+ * stat() wrapper which supports UTF-8 filenames.
  */
-XSUM_API int XSUM_isDirectory(const char* filename)
+static int XSUM_stat(const char* infilename, XSUM_stat_t* statbuf)
 {
-    struct __stat64 statbuf;
-    int result = 0;
+    int r = -1;
     wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
     if (wide_filename != NULL) {
-        if (_wstat64(wide_filename, &statbuf) == 0 /* stat fail is ok */
-              && (statbuf.st_mode & _S_IFDIR)) {
-            result = 1;
-        }
+        r = _wstat64(wide_filename, statbuf);
         free(wide_filename);
     }
-    return result;
+    return r;
 }
 
 /*
@@ -465,3 +455,30 @@ int main(int ansi_argc, char* ansi_argv[])
 #endif /* !XSUM_WIN32_USE_WMAIN */
 #endif /* !XSUM_NO_MAIN */
 #endif /* XSUM_WIN32_USE_WCHAR */
+
+
+/*
+ * Determines whether the file at filename is a directory.
+ */
+XSUM_API int XSUM_isDirectory(const char* filename)
+{
+    XSUM_stat_t statbuf;
+    int r = XSUM_stat(filename, &statbuf);
+#ifdef _MSC_VER
+    if (!r && (statbuf.st_mode & _S_IFDIR)) return 1;
+#else
+    if (!r && S_ISDIR(statbuf.st_mode)) return 1;
+#endif
+    return 0;
+}
+
+/*
+ * Returns the filesize of the file at filename.
+ */
+XSUM_API XSUM_U64 XSUM_getFileSize(const char* filename)
+{
+    XSUM_stat_t statbuf;
+    int r = XSUM_stat(filename, &statbuf);
+    if (r || !S_ISREG(statbuf.st_mode)) return 0;   /* No good... */
+    return (XSUM_U64)statbuf.st_size;
+}
diff --git a/programs/xxhsum/xsum_os_specific.h b/programs/xxhsum/xsum_os_specific.h
index 58fb63c1..b3562b26 100644
--- a/programs/xxhsum/xsum_os_specific.h
+++ b/programs/xxhsum/xsum_os_specific.h
@@ -58,6 +58,11 @@ XSUM_API void XSUM_setBinaryMode(FILE* stream);
  */
 XSUM_API int XSUM_isDirectory(const char* filename);
 
+/*
+ * Returns the file size of the file at filename.
+ */
+XSUM_API XSUM_U64 XSUM_getFileSize(const char* filename);
+
 /*
  * UTF-8 stdio wrappers primarily for Windows
  */
diff --git a/xxhsum.c b/xxhsum.c
index 0e4a87c6..96131f62 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -59,29 +59,9 @@
 #  include "xxh_x86dispatch.h"
 #endif
 
-/* ************************************
-*  Basic Types
-**************************************/
-#if defined(__cplusplus) /* C++ */ \
- || (defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L)  /* C99 */
-#  include <stdint.h>
-    typedef uint8_t  U8;
-    typedef uint32_t U32;
-    typedef uint64_t U64;
-# else
-#   include <limits.h>
-    typedef unsigned char      U8;
-#   if UINT_MAX == 0xFFFFFFFFUL
-      typedef unsigned int     U32;
-#   else
-      typedef unsigned long    U32;
-#   endif
-    typedef unsigned long long U64;
-#endif /* not C++/C99 */
-
 static unsigned XSUM_isLittleEndian(void)
 {
-    const union { U32 u; U8 c[4]; } one = { 1 };   /* don't use static: performance detrimental  */
+    const union { XSUM_U32 u; XSUM_U8 c[4]; } one = { 1 };   /* don't use static: performance detrimental  */
     return one.c[0];
 }
 
@@ -130,7 +110,7 @@ static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & X
 /* ************************************
  *  Local variables
  **************************************/
-static U32 g_nbIterations = NBLOOPS;
+static XSUM_U32 g_nbIterations = NBLOOPS;
 
 
 /* ************************************
@@ -141,7 +121,7 @@ static clock_t XSUM_clockSpan( clock_t start )
     return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
 }
 
-static size_t XSUM_findMaxMem(U64 requiredMem)
+static size_t XSUM_findMaxMem(XSUM_U64 requiredMem)
 {
     size_t const step = 64 MB;
     void* testmem = NULL;
@@ -164,21 +144,6 @@ static size_t XSUM_findMaxMem(U64 requiredMem)
     return (size_t)requiredMem;
 }
 
-
-static U64 XSUM_GetFileSize(const char* infilename)
-{
-    int r;
-#if defined(_MSC_VER)
-    struct _stat64 statbuf;
-    r = _stat64(infilename, &statbuf);
-#else
-    struct stat statbuf;
-    r = stat(infilename, &statbuf);
-#endif
-    if (r || !S_ISREG(statbuf.st_mode)) return 0;   /* No good... */
-    return (U64)statbuf.st_size;
-}
-
 /*
  * Allocates a string containing s1 and s2 concatenated. Acts like strdup.
  * The result must be freed.
@@ -210,15 +175,15 @@ static char* XSUM_strcatDup(const char* s1, const char* s2)
  *
  * This is used in the sanity check - its values must not be changed.
  */
-static void XSUM_fillTestBuffer(U8* buffer, size_t len)
+static void XSUM_fillTestBuffer(XSUM_U8* buffer, size_t len)
 {
-    U64 byteGen = PRIME32;
+    XSUM_U64 byteGen = PRIME32;
     size_t i;
 
     assert(buffer != NULL);
 
     for (i=0; i<len; i++) {
-        buffer[i] = (U8)(byteGen>>56);
+        buffer[i] = (XSUM_U8)(byteGen>>56);
         byteGen *= PRIME64;
     }
 }
@@ -231,7 +196,7 @@ static void XSUM_fillTestBuffer(U8* buffer, size_t len)
  *
  * Adding a pointer to the parameter list would be messy.
  */
-static U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
+static XSUM_U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
 
 /*
  * Wrappers for the benchmark.
@@ -239,75 +204,75 @@ static U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
  * If you would like to add other hashes to the bench, create a wrapper and add
  * it to the g_hashesToBench table. It will automatically be added.
  */
-typedef U32 (*hashFunction)(const void* buffer, size_t bufferSize, U32 seed);
+typedef XSUM_U32 (*hashFunction)(const void* buffer, size_t bufferSize, XSUM_U32 seed);
 
-static U32 localXXH32(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH32(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     return XXH32(buffer, bufferSize, seed);
 }
-static U32 localXXH64(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH64(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
-    return (U32)XXH64(buffer, bufferSize, seed);
+    return (XSUM_U32)XXH64(buffer, bufferSize, seed);
 }
-static U32 localXXH3_64b(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_64b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     (void)seed;
-    return (U32)XXH3_64bits(buffer, bufferSize);
+    return (XSUM_U32)XXH3_64bits(buffer, bufferSize);
 }
-static U32 localXXH3_64b_seeded(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_64b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
-    return (U32)XXH3_64bits_withSeed(buffer, bufferSize, seed);
+    return (XSUM_U32)XXH3_64bits_withSeed(buffer, bufferSize, seed);
 }
-static U32 localXXH3_64b_secret(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_64b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     (void)seed;
-    return (U32)XXH3_64bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf));
+    return (XSUM_U32)XXH3_64bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf));
 }
-static U32 localXXH3_128b(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_128b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     (void)seed;
-    return (U32)(XXH3_128bits(buffer, bufferSize).low64);
+    return (XSUM_U32)(XXH3_128bits(buffer, bufferSize).low64);
 }
-static U32 localXXH3_128b_seeded(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_128b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
-    return (U32)(XXH3_128bits_withSeed(buffer, bufferSize, seed).low64);
+    return (XSUM_U32)(XXH3_128bits_withSeed(buffer, bufferSize, seed).low64);
 }
-static U32 localXXH3_128b_secret(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_128b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     (void)seed;
-    return (U32)(XXH3_128bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf)).low64);
+    return (XSUM_U32)(XXH3_128bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf)).low64);
 }
-static U32 localXXH3_stream(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     XXH3_state_t state;
     (void)seed;
     XXH3_64bits_reset(&state);
     XXH3_64bits_update(&state, buffer, bufferSize);
-    return (U32)XXH3_64bits_digest(&state);
+    return (XSUM_U32)XXH3_64bits_digest(&state);
 }
-static U32 localXXH3_stream_seeded(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH3_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     XXH3_state_t state;
     XXH3_INITSTATE(&state);
     XXH3_64bits_reset_withSeed(&state, (XXH64_hash_t)seed);
     XXH3_64bits_update(&state, buffer, bufferSize);
-    return (U32)XXH3_64bits_digest(&state);
+    return (XSUM_U32)XXH3_64bits_digest(&state);
 }
-static U32 localXXH128_stream(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH128_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     XXH3_state_t state;
     (void)seed;
     XXH3_128bits_reset(&state);
     XXH3_128bits_update(&state, buffer, bufferSize);
-    return (U32)(XXH3_128bits_digest(&state).low64);
+    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
 }
-static U32 localXXH128_stream_seeded(const void* buffer, size_t bufferSize, U32 seed)
+static XSUM_U32 localXXH128_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     XXH3_state_t state;
     XXH3_INITSTATE(&state);
     XXH3_128bits_reset_withSeed(&state, (XXH64_hash_t)seed);
     XXH3_128bits_update(&state, buffer, bufferSize);
-    return (U32)(XXH3_128bits_digest(&state).low64);
+    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
 }
 
 
@@ -344,14 +309,14 @@ static const char k_testIDs_default[NB_TESTFUNC] = { 0,
 static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
                           const void* buffer, size_t bufferSize)
 {
-    U32 nbh_perIteration = (U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
+    XSUM_U32 nbh_perIteration = (XSUM_U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
     unsigned iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
     double fastestH = 100000000.;
     assert(HASHNAME_MAX > 2);
     XSUM_logVerbose(2, "\r%80s\r", "");       /* Clean display line */
 
     for (iterationNb = 1; iterationNb <= nbIterations; iterationNb++) {
-        U32 r=0;
+        XSUM_U32 r=0;
         clock_t cStart;
 
         XSUM_logVerbose(2, "%2u-%-*.*s : %10u ->\r",
@@ -362,7 +327,7 @@ static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
         while (clock() == cStart);   /* starts clock() at its exact beginning */
         cStart = clock();
 
-        {   U32 u;
+        {   XSUM_U32 u;
             for (u=0; u<nbh_perIteration; u++)
                 r += h(buffer, bufferSize, u);
         }
@@ -399,7 +364,7 @@ static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
                      */
                     double nbh_perSecond = (1 / ticksPerHash) + 1;
                     if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
-                    nbh_perIteration = (U32)nbh_perSecond;
+                    nbh_perIteration = (XSUM_U32)nbh_perSecond;
                 }
                 /* g_nbIterations==0 => quick evaluation, no claim of accuracy */
                 if (g_nbIterations>0) {
@@ -418,7 +383,7 @@ static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
         }   }
         {   double nbh_perSecond = (1 / fastestH) + 1;
             if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
-            nbh_perIteration = (U32)nbh_perSecond;
+            nbh_perIteration = (XSUM_U32)nbh_perSecond;
         }
     }
     XSUM_logVerbose(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
@@ -464,9 +429,9 @@ static void XSUM_benchMem(const void* buffer, size_t bufferSize)
 
 static size_t XSUM_selectBenchedSize(const char* fileName)
 {
-    U64 const inFileSize = XSUM_GetFileSize(fileName);
+    XSUM_U64 const inFileSize = XSUM_getFileSize(fileName);
     size_t benchedSize = (size_t) XSUM_findMaxMem(inFileSize);
-    if ((U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
+    if ((XSUM_U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
     if (benchedSize < inFileSize) {
         XSUM_log("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
     }
@@ -590,7 +555,7 @@ static void XSUM_checkResult128(XXH128_hash_t r1, XXH128_hash_t r2)
 }
 
 
-static void XSUM_testXXH32(const void* data, size_t len, U32 seed, U32 Nresult)
+static void XSUM_testXXH32(const void* data, size_t len, XSUM_U32 seed, XSUM_U32 Nresult)
 {
     XXH32_state_t *state = XXH32_createState();
     size_t pos;
@@ -611,7 +576,7 @@ static void XSUM_testXXH32(const void* data, size_t len, U32 seed, U32 Nresult)
     XXH32_freeState(state);
 }
 
-static void XSUM_testXXH64(const void* data, size_t len, U64 seed, U64 Nresult)
+static void XSUM_testXXH64(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
 {
     XXH64_state_t *state = XXH64_createState();
     size_t pos;
@@ -632,25 +597,25 @@ static void XSUM_testXXH64(const void* data, size_t len, U64 seed, U64 Nresult)
     XXH64_freeState(state);
 }
 
-static U32 XSUM_rand(void)
+static XSUM_U32 XSUM_rand(void)
 {
-    static U64 seed = PRIME32;
+    static XSUM_U64 seed = PRIME32;
     seed *= PRIME64;
-    return (U32)(seed >> 40);
+    return (XSUM_U32)(seed >> 40);
 }
 
 
-void XSUM_testXXH3(const void* data, size_t len, U64 seed, U64 Nresult)
+void XSUM_testXXH3(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
 {
     if (len>0) assert(data != NULL);
 
-    {   U64 const Dresult = XXH3_64bits_withSeed(data, len, seed);
+    {   XSUM_U64 const Dresult = XXH3_64bits_withSeed(data, len, seed);
         XSUM_checkResult64(Dresult, Nresult);
     }
 
     /* check that the no-seed variant produces same result as seed==0 */
     if (seed == 0) {
-        U64 const Dresult = XXH3_64bits(data, len);
+        XSUM_U64 const Dresult = XXH3_64bits(data, len);
         XSUM_checkResult64(Dresult, Nresult);
     }
 
@@ -686,11 +651,11 @@ void XSUM_testXXH3(const void* data, size_t len, U64 seed, U64 Nresult)
     }
 }
 
-void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, U64 Nresult)
+void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XSUM_U64 Nresult)
 {
     if (len>0) assert(data != NULL);
 
-    {   U64 const Dresult = XXH3_64bits_withSecret(data, len, secret, secretSize);
+    {   XSUM_U64 const Dresult = XXH3_64bits_withSecret(data, len, secret, secretSize);
         XSUM_checkResult64(Dresult, Nresult);
     }
 
@@ -725,7 +690,7 @@ void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret,
     }
 }
 
-void XSUM_testXXH128(const void* data, size_t len, U64 seed, XXH128_hash_t Nresult)
+void XSUM_testXXH128(const void* data, size_t len, XSUM_U64 seed, XXH128_hash_t Nresult)
 {
     {   XXH128_hash_t const Dresult = XXH3_128bits_withSeed(data, len, seed);
         XSUM_checkResult128(Dresult, Nresult);
@@ -815,13 +780,13 @@ void XSUM_testXXH128_withSecret(const void* data, size_t len, const void* secret
 }
 
 #define SECRET_SAMPLE_NBBYTES 4
-typedef struct { U8 byte[SECRET_SAMPLE_NBBYTES]; } verifSample_t;
+typedef struct { XSUM_U8 byte[SECRET_SAMPLE_NBBYTES]; } verifSample_t;
 
 void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSample_t result)
 {
     static int nbTests = 1;
     const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};
-    U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
+    XSUM_U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
     verifSample_t samples;
     int i;
 
@@ -849,7 +814,7 @@ void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSample_t
 static void XSUM_sanityCheck(void)
 {
 #define SANITY_BUFFER_SIZE 2367
-    U8 sanityBuffer[SANITY_BUFFER_SIZE];
+    XSUM_U8 sanityBuffer[SANITY_BUFFER_SIZE];
     XSUM_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
 
     XSUM_testXXH32(NULL,          0, 0,       0x02CC5D05);
@@ -1048,7 +1013,7 @@ static void XSUM_sanityCheck(void)
 /* for support of --little-endian display mode */
 static void XSUM_display_LittleEndian(const void* ptr, size_t length)
 {
-    const U8* const p = (const U8*)ptr;
+    const XSUM_U8* const p = (const XSUM_U8*)ptr;
     size_t idx;
     for (idx=length-1; idx<length; idx--)    /* intentional underflow to negative to detect end */
         XSUM_output("%02x", p[idx]);
@@ -1056,7 +1021,7 @@ static void XSUM_display_LittleEndian(const void* ptr, size_t length)
 
 static void XSUM_display_BigEndian(const void* ptr, size_t length)
 {
-    const U8* const p = (const U8*)ptr;
+    const XSUM_U8* const p = (const XSUM_U8*)ptr;
     size_t idx;
     for (idx=0; idx<length; idx++)
         XSUM_output("%02x", p[idx]);
@@ -1339,10 +1304,10 @@ typedef struct {
     char*           lineBuf;
     size_t          blockSize;
     char*           blockBuf;
-    U32             strictMode;
-    U32             statusOnly;
-    U32             warn;
-    U32             quiet;
+    XSUM_U32             strictMode;
+    XSUM_U32             statusOnly;
+    XSUM_U32             warn;
+    XSUM_U32             quiet;
     ParseFileReport report;
 } ParseFileArg;
 
@@ -1695,10 +1660,10 @@ static void XSUM_parseFile1(ParseFileArg* XSUM_parseFileArg, int rev)
  */
 static int XSUM_checkFile(const char* inFileName,
                           const Display_endianess displayEndianess,
-                          U32 strictMode,
-                          U32 statusOnly,
-                          U32 warn,
-                          U32 quiet)
+                          XSUM_U32 strictMode,
+                          XSUM_U32 statusOnly,
+                          XSUM_U32 warn,
+                          XSUM_U32 quiet)
 {
     int result = 0;
     FILE* inFile = NULL;
@@ -1780,10 +1745,10 @@ static int XSUM_checkFile(const char* inFileName,
 
 static int XSUM_checkFiles(char*const* fnList, int fnTotal,
                            const Display_endianess displayEndianess,
-                           U32 strictMode,
-                           U32 statusOnly,
-                           U32 warn,
-                           U32 quiet)
+                           XSUM_U32 strictMode,
+                           XSUM_U32 statusOnly,
+                           XSUM_U32 warn,
+                           XSUM_U32 quiet)
 {
     int ok = 1;
 
@@ -1866,18 +1831,18 @@ static const char* XSUM_lastNameFromPath(const char* path)
  * Will also modify `*stringPtr`, advancing it to position where it stopped reading.
  * @return 1 if an overflow error occurs
  */
-static int XSUM_readU32FromCharChecked(const char** stringPtr, U32* value)
+static int XSUM_readU32FromCharChecked(const char** stringPtr, XSUM_U32* value)
 {
-    static const U32 max = (((U32)(-1)) / 10) - 1;
-    U32 result = 0;
+    static const XSUM_U32 max = (((XSUM_U32)(-1)) / 10) - 1;
+    XSUM_U32 result = 0;
     while ((**stringPtr >='0') && (**stringPtr <='9')) {
         if (result > max) return 1; /* overflow error */
         result *= 10;
-        result += (U32)(**stringPtr - '0');
+        result += (XSUM_U32)(**stringPtr - '0');
         (*stringPtr)++ ;
     }
     if ((**stringPtr=='K') || (**stringPtr=='M')) {
-        U32 const maxK = ((U32)(-1)) >> 10;
+        XSUM_U32 const maxK = ((XSUM_U32)(-1)) >> 10;
         if (result > maxK) return 1; /* overflow error */
         result <<= 10;
         if (**stringPtr=='M') {
@@ -1899,8 +1864,8 @@ static int XSUM_readU32FromCharChecked(const char** stringPtr, U32* value)
  *  Will also modify `*stringPtr`, advancing it to position where it stopped reading.
  *  Note: function will exit() program if digit sequence overflows
  */
-static U32 XSUM_readU32FromChar(const char** stringPtr) {
-    U32 result;
+static XSUM_U32 XSUM_readU32FromChar(const char** stringPtr) {
+    XSUM_U32 result;
     if (XSUM_readU32FromCharChecked(stringPtr, &result)) {
         static const char errorMsg[] = "Error: numeric value too large";
         errorOut(errorMsg);
@@ -1912,14 +1877,14 @@ XSUM_API int XSUM_main(int argc, char* argv[])
 {
     int i, filenamesStart = 0;
     const char* const exename = XSUM_lastNameFromPath(argv[0]);
-    U32 benchmarkMode = 0;
-    U32 fileCheckMode = 0;
-    U32 strictMode    = 0;
-    U32 statusOnly    = 0;
-    U32 warn          = 0;
+    XSUM_U32 benchmarkMode = 0;
+    XSUM_U32 fileCheckMode = 0;
+    XSUM_U32 strictMode    = 0;
+    XSUM_U32 statusOnly    = 0;
+    XSUM_U32 warn          = 0;
     int explicitStdin = 0;
-    U32 selectBenchIDs= 0;  /* 0 == use default k_testIDs_default, kBenchAll == bench all */
-    static const U32 kBenchAll = 99;
+    XSUM_U32 selectBenchIDs= 0;  /* 0 == use default k_testIDs_default, kBenchAll == bench all */
+    static const XSUM_U32 kBenchAll = 99;
     size_t keySize    = XSUM_DEFAULT_SAMPLE_SIZE;
     AlgoSelected algo     = g_defaultAlgo;
     Display_endianess displayEndianess = big_endian;

From 276bb6a8e6b11d1f00386f602a0a3a8e387d59ee Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 22:33:25 -0400
Subject: [PATCH 017/187] Add missing printf attribute

---
 programs/xxhsum/xsum_os_specific.c | 1 +
 1 file changed, 1 insertion(+)

diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
index 5dde3668..e5079c9e 100644
--- a/programs/xxhsum/xsum_os_specific.c
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -216,6 +216,7 @@ static int XSUM_stat(const char* infilename, XSUM_stat_t* statbuf)
 /*
  * vasprintf for Windows.
  */
+XSUM_ATTRIBUTE((__format__(__printf__, 2, 0)))
 static int XSUM_vasprintf(char** strp, const char* format, va_list ap)
 {
     int ret;

From fdf2e840cbc205921d0caf8853972d0b9d3b66f3 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 22:41:36 -0400
Subject: [PATCH 018/187] Fix Unicode test

Now uses XXH_INLINE_ALL.
---
 tests/Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/Makefile b/tests/Makefile
index 361032fd..092711ad 100644
--- a/tests/Makefile
+++ b/tests/Makefile
@@ -53,7 +53,7 @@ test_ppc_redefine: ppc_define.c
 	$(CC) $(CPPFLAGS) $(CFLAGS) -c $^
 
 xxhsum$(EXT): ../xxhash.c ../xxhash.h ../xxhsum.c
-	$(CC) $(CFLAGS) $(LDFLAGS) ../xxhash.c ../xxhsum.c -o $@
+	$(CC) $(CPPFLAGS) $(CFLAGS) $(LDFLAGS) -DXXH_INLINE_ALL ../xxhsum.c -o $@
 
 # Make sure that Unicode filenames work.
 # https://github.com/Cyan4973/xxHash/issues/293

From 598e32c4902f681fe6195296eaa2280c4cc050c2 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 22:43:23 -0400
Subject: [PATCH 019/187] Fix typo

---
 programs/xxhsum/xsum_os_specific.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
index e5079c9e..f6e873a0 100644
--- a/programs/xxhsum/xsum_os_specific.c
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -196,7 +196,7 @@ XSUM_API FILE* XSUM_fopen(const char* filename, const char* mode)
 static int XSUM_stat(const char* infilename, XSUM_stat_t* statbuf)
 {
     int r = -1;
-    wchar_t* const wide_filename = XSUM_widenString(filename, NULL);
+    wchar_t* const wide_filename = XSUM_widenString(infilename, NULL);
     if (wide_filename != NULL) {
         r = _wstat64(wide_filename, statbuf);
         free(wide_filename);

From 2f9c567857d572d5ecea0d8c17f72901a70de330 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Wed, 16 Sep 2020 22:55:19 -0400
Subject: [PATCH 020/187] Use __stat64 on MinGW in wchar mode

---
 programs/xxhsum/xsum_os_specific.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
index f6e873a0..cdf44d6c 100644
--- a/programs/xxhsum/xsum_os_specific.c
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -35,7 +35,7 @@
  * This file contains all of the ugly boilerplate to make xxhsum work across
  * platforms.
  */
-#if defined(_MSC_VER)
+#if defined(_MSC_VER) || XSUM_WIN32_USE_WCHAR
     typedef struct __stat64 XSUM_stat_t;
     typedef int mode_t;
 #else

From a217d5fa464c8945261c1833863779035ff399c7 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Thu, 17 Sep 2020 10:05:37 -0400
Subject: [PATCH 021/187] xxhsum: Don't redefine mode_t on MinGW

---
 programs/xxhsum/xsum_os_specific.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/programs/xxhsum/xsum_os_specific.c b/programs/xxhsum/xsum_os_specific.c
index cdf44d6c..8f48ce07 100644
--- a/programs/xxhsum/xsum_os_specific.c
+++ b/programs/xxhsum/xsum_os_specific.c
@@ -37,7 +37,9 @@
  */
 #if defined(_MSC_VER) || XSUM_WIN32_USE_WCHAR
     typedef struct __stat64 XSUM_stat_t;
+# if defined(_MSC_VER)
     typedef int mode_t;
+# endif
 #else
     typedef struct stat XSUM_stat_t;
 #endif

From 4b0d1731f1a082942c62569e9a8368e184fc8c73 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Fri, 18 Sep 2020 17:44:15 -0400
Subject: [PATCH 022/187] xxhsum: move programs/xxhsum -> cli by request

also use XXHSUM_SRC_DIR in Makefile
---
 .gitignore                                  |  1 -
 Makefile                                    | 15 ++++++++-------
 {programs/xxhsum => cli}/xsum_arch.h        |  0
 {programs/xxhsum => cli}/xsum_config.h      |  0
 {programs/xxhsum => cli}/xsum_os_specific.c |  0
 {programs/xxhsum => cli}/xsum_os_specific.h |  0
 {programs/xxhsum => cli}/xsum_output.c      |  0
 {programs/xxhsum => cli}/xsum_output.h      |  0
 cmake_unofficial/CMakeLists.txt             |  5 +++--
 xxhsum.c                                    | 12 ++++++------
 10 files changed, 17 insertions(+), 16 deletions(-)
 rename {programs/xxhsum => cli}/xsum_arch.h (100%)
 rename {programs/xxhsum => cli}/xsum_config.h (100%)
 rename {programs/xxhsum => cli}/xsum_os_specific.c (100%)
 rename {programs/xxhsum => cli}/xsum_os_specific.h (100%)
 rename {programs/xxhsum => cli}/xsum_output.c (100%)
 rename {programs/xxhsum => cli}/xsum_output.h (100%)

diff --git a/.gitignore b/.gitignore
index a2209534..d0ce9aac 100644
--- a/.gitignore
+++ b/.gitignore
@@ -13,7 +13,6 @@ xxh32sum
 xxh64sum
 xxh128sum
 xxhsum
-!programs/xxhsum
 xxhsum32
 xxhsum_privateXXH
 xxhsum_inlinedXXH
diff --git a/Makefile b/Makefile
index 15a6c23d..4ed334d1 100644
--- a/Makefile
+++ b/Makefile
@@ -71,13 +71,14 @@ endif
 
 LIBXXH = libxxhash.$(SHARED_EXT_VER)
 
-XXHSUM_SPLIT_SRCS = programs/xxhsum/xsum_os_specific.c \
-                    programs/xxhsum/xsum_output.c
+XXHSUM_SRC_DIR = cli
+XXHSUM_SPLIT_SRCS = $(XXHSUM_SRC_DIR)/xsum_os_specific.c \
+                    $(XXHSUM_SRC_DIR)/xsum_output.c
 XXHSUM_SPLIT_OBJS = $(XXHSUM_SPLIT_SRCS:.c=.o)
-XXHSUM_HEADERS = programs/xxhsum/xsum_config.h \
-                 programs/xxhsum/xsum_arch.h \
-                 programs/xxhsum/xsum_os_specific.h \
-                 programs/xxhsum/xsum_output.h
+XXHSUM_HEADERS = $(XXHSUM_SRC_DIR)/xsum_config.h \
+                 $(XXHSUM_SRC_DIR)/xsum_arch.h \
+                 $(XXHSUM_SRC_DIR)/xsum_os_specific.h \
+                 $(XXHSUM_SRC_DIR)/xsum_output.h
 
 ## generate CLI and libraries in release mode (default for `make`)
 .PHONY: default
@@ -168,7 +169,7 @@ clean:  ## remove all build artifacts
 	$(Q)$(RM) core *.o *.obj *.$(SHARED_EXT) *.$(SHARED_EXT).* *.a libxxhash.pc
 	$(Q)$(RM) xxhsum$(EXT) xxhsum32$(EXT) xxhsum_inlinedXXH$(EXT) dispatch$(EXT)
 	$(Q)$(RM) xxh32sum$(EXT) xxh64sum$(EXT) xxh128sum$(EXT)
-	$(Q)$(RM) programs/xxhsum/*.o programs/xxhsum/*.obj
+	$(Q)$(RM) $(XXHSUM_SRC_DIR)/*.o $(XXHSUM_SRC_DIR)/*.obj
 	@echo cleaning completed
 
 
diff --git a/programs/xxhsum/xsum_arch.h b/cli/xsum_arch.h
similarity index 100%
rename from programs/xxhsum/xsum_arch.h
rename to cli/xsum_arch.h
diff --git a/programs/xxhsum/xsum_config.h b/cli/xsum_config.h
similarity index 100%
rename from programs/xxhsum/xsum_config.h
rename to cli/xsum_config.h
diff --git a/programs/xxhsum/xsum_os_specific.c b/cli/xsum_os_specific.c
similarity index 100%
rename from programs/xxhsum/xsum_os_specific.c
rename to cli/xsum_os_specific.c
diff --git a/programs/xxhsum/xsum_os_specific.h b/cli/xsum_os_specific.h
similarity index 100%
rename from programs/xxhsum/xsum_os_specific.h
rename to cli/xsum_os_specific.h
diff --git a/programs/xxhsum/xsum_output.c b/cli/xsum_output.c
similarity index 100%
rename from programs/xxhsum/xsum_output.c
rename to cli/xsum_output.c
diff --git a/programs/xxhsum/xsum_output.h b/cli/xsum_output.h
similarity index 100%
rename from programs/xxhsum/xsum_output.h
rename to cli/xsum_output.h
diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 3a5086c8..5abd0c5f 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -82,10 +82,11 @@ set_target_properties(xxhash PROPERTIES
   VERSION "${XXHASH_VERSION_STRING}")
 
 if(XXHASH_BUILD_XXHSUM)
+  set(XXHSUM_DIR "${XXHASH_DIR}/cli")
   # xxhsum
   add_executable(xxhsum "${XXHASH_DIR}/xxhsum.c"
-                        "${XXHASH_DIR}/programs/xxhsum/xsum_os_specific.c"
-                        "${XXHASH_DIR}/programs/xxhsum/xsum_output.c"
+                        "${XXHSUM_DIR}/xsum_os_specific.c"
+                        "${XXHSUM_DIR}/xsum_output.c"
                 )
   add_executable(${PROJECT_NAME}::xxhsum ALIAS xxhsum)
 
diff --git a/xxhsum.c b/xxhsum.c
index 96131f62..bac55b20 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -30,13 +30,13 @@
  */
 
 /* Transitional headers */
-#include "programs/xxhsum/xsum_config.h"
-#include "programs/xxhsum/xsum_arch.h"
-#include "programs/xxhsum/xsum_os_specific.h"
-#include "programs/xxhsum/xsum_output.h"
+#include "cli/xsum_config.h"
+#include "cli/xsum_arch.h"
+#include "cli/xsum_os_specific.h"
+#include "cli/xsum_output.h"
 #ifdef XXH_INLINE_ALL
-#  include "programs/xxhsum/xsum_os_specific.c"
-#  include "programs/xxhsum/xsum_output.c"
+#  include "cli/xsum_os_specific.c"
+#  include "cli/xsum_output.c"
 #endif
 
 /* ************************************

From 1d00c512374055d81848a7a427f4a1bfb297c96f Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Sat, 19 Sep 2020 19:27:05 -0400
Subject: [PATCH 023/187] xxhsum: split sanity check

It now exists in xsum_sanity_check.c

Also add `XSUM_NO_TESTS` option, which instead prints "This version of xxhsum
is not verified." to stderr whenever XSUM_sanityCheck() is called.
---
 Makefile                        |   6 +-
 cli/xsum_config.h               |   4 +
 cli/xsum_sanity_check.c         | 564 ++++++++++++++++++++++++++++++++
 cli/xsum_sanity_check.h         |  57 ++++
 cmake_unofficial/CMakeLists.txt |   1 +
 xxhsum.c                        | 524 +----------------------------
 6 files changed, 632 insertions(+), 524 deletions(-)
 create mode 100644 cli/xsum_sanity_check.c
 create mode 100644 cli/xsum_sanity_check.h

diff --git a/Makefile b/Makefile
index 4ed334d1..83092358 100644
--- a/Makefile
+++ b/Makefile
@@ -73,12 +73,14 @@ LIBXXH = libxxhash.$(SHARED_EXT_VER)
 
 XXHSUM_SRC_DIR = cli
 XXHSUM_SPLIT_SRCS = $(XXHSUM_SRC_DIR)/xsum_os_specific.c \
-                    $(XXHSUM_SRC_DIR)/xsum_output.c
+                    $(XXHSUM_SRC_DIR)/xsum_output.c \
+                    $(XXHSUM_SRC_DIR)/xsum_sanity_check.c
 XXHSUM_SPLIT_OBJS = $(XXHSUM_SPLIT_SRCS:.c=.o)
 XXHSUM_HEADERS = $(XXHSUM_SRC_DIR)/xsum_config.h \
                  $(XXHSUM_SRC_DIR)/xsum_arch.h \
                  $(XXHSUM_SRC_DIR)/xsum_os_specific.h \
-                 $(XXHSUM_SRC_DIR)/xsum_output.h
+                 $(XXHSUM_SRC_DIR)/xsum_output.h \
+                 $(XXHSUM_SRC_DIR)/xsum_sanity_check.h
 
 ## generate CLI and libraries in release mode (default for `make`)
 .PHONY: default
diff --git a/cli/xsum_config.h b/cli/xsum_config.h
index 9895744a..9222144d 100644
--- a/cli/xsum_config.h
+++ b/cli/xsum_config.h
@@ -177,6 +177,10 @@
 #  endif
 #endif
 
+#ifndef XSUM_NO_TESTS
+#  define XSUM_NO_TESTS 0
+#endif
+
 /* ***************************
  * Basic types
  * ***************************/
diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
new file mode 100644
index 00000000..6d9c6c13
--- /dev/null
+++ b/cli/xsum_sanity_check.c
@@ -0,0 +1,564 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#include "xsum_config.h"
+#include "xsum_sanity_check.h"
+#include "xsum_output.h"
+#include <stdlib.h>
+#include <assert.h>
+#include <string.h>
+#ifndef XXH_STATIC_LINKING_ONLY
+#  define XXH_STATIC_LINKING_ONLY
+#endif
+#include "../xxhash.h"
+
+/* use #define to make them constant, required for initialization */
+#define PRIME32 2654435761U
+#define PRIME64 11400714785074694797ULL
+
+/*
+ * Fills a test buffer with pseudorandom data.
+ *
+ * This is used in the sanity check - its values must not be changed.
+ */
+XSUM_API void XSUM_fillTestBuffer(XSUM_U8* buffer, size_t len)
+{
+    XSUM_U64 byteGen = PRIME32;
+    size_t i;
+
+    assert(buffer != NULL);
+
+    for (i=0; i<len; i++) {
+        buffer[i] = (XSUM_U8)(byteGen>>56);
+        byteGen *= PRIME64;
+    }
+}
+
+
+
+/* ************************************************
+ * Self-test:
+ * ensure results consistency accross platforms
+ *********************************************** */
+#if XSUM_NO_TESTS
+XSUM_API void XSUM_sanityCheck(void)
+{
+    XSUM_log("This version of xxhsum is not verified.\n");
+}
+#else
+static void XSUM_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
+{
+    static int nbTests = 1;
+    if (r1!=r2) {
+        XSUM_log("\rError: 32-bit hash test %i: Internal sanity check failed!\n", nbTests);
+        XSUM_log("\rGot 0x%08X, expected 0x%08X.\n", (unsigned)r1, (unsigned)r2);
+        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
+                  "or temporarily recompile with XSUM_NO_TESTS=1.\n");
+        exit(1);
+    }
+    nbTests++;
+}
+
+static void XSUM_checkResult64(XXH64_hash_t r1, XXH64_hash_t r2)
+{
+    static int nbTests = 1;
+    if (r1!=r2) {
+        XSUM_log("\rError: 64-bit hash test %i: Internal sanity check failed!\n", nbTests);
+        XSUM_log("\rGot 0x%08X%08XULL, expected 0x%08X%08XULL.\n",
+                (unsigned)(r1>>32), (unsigned)r1, (unsigned)(r2>>32), (unsigned)r2);
+        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
+                  "or temporarily recompile with XSUM_NO_TESTS=1.\n");
+        exit(1);
+    }
+    nbTests++;
+}
+
+static void XSUM_checkResult128(XXH128_hash_t r1, XXH128_hash_t r2)
+{
+    static int nbTests = 1;
+    if ((r1.low64 != r2.low64) || (r1.high64 != r2.high64)) {
+        XSUM_log("\rError: 128-bit hash test %i: Internal sanity check failed.\n", nbTests);
+        XSUM_log("\rGot { 0x%08X%08XULL, 0x%08X%08XULL }, expected { 0x%08X%08XULL, 0x%08X%08XULL } \n",
+                (unsigned)(r1.low64>>32), (unsigned)r1.low64, (unsigned)(r1.high64>>32), (unsigned)r1.high64,
+                (unsigned)(r2.low64>>32), (unsigned)r2.low64, (unsigned)(r2.high64>>32), (unsigned)r2.high64 );
+        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
+                  "or temporarily recompile with XSUM_NO_TESTS=1.\n");
+        exit(1);
+    }
+    nbTests++;
+}
+
+
+static void XSUM_testXXH32(const void* data, size_t len, XSUM_U32 seed, XSUM_U32 Nresult)
+{
+    XXH32_state_t *state = XXH32_createState();
+    size_t pos;
+
+    assert(state != NULL);
+    if (len>0) assert(data != NULL);
+
+    XSUM_checkResult32(XXH32(data, len, seed), Nresult);
+
+    (void)XXH32_reset(state, seed);
+    (void)XXH32_update(state, data, len);
+    XSUM_checkResult32(XXH32_digest(state), Nresult);
+
+    (void)XXH32_reset(state, seed);
+    for (pos=0; pos<len; pos++)
+        (void)XXH32_update(state, ((const char*)data)+pos, 1);
+    XSUM_checkResult32(XXH32_digest(state), Nresult);
+    XXH32_freeState(state);
+}
+
+static void XSUM_testXXH64(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
+{
+    XXH64_state_t *state = XXH64_createState();
+    size_t pos;
+
+    assert(state != NULL);
+    if (len>0) assert(data != NULL);
+
+    XSUM_checkResult64(XXH64(data, len, seed), Nresult);
+
+    (void)XXH64_reset(state, seed);
+    (void)XXH64_update(state, data, len);
+    XSUM_checkResult64(XXH64_digest(state), Nresult);
+
+    (void)XXH64_reset(state, seed);
+    for (pos=0; pos<len; pos++)
+        (void)XXH64_update(state, ((const char*)data)+pos, 1);
+    XSUM_checkResult64(XXH64_digest(state), Nresult);
+    XXH64_freeState(state);
+}
+
+static XSUM_U32 XSUM_rand(void)
+{
+    static XSUM_U64 seed = PRIME32;
+    seed *= PRIME64;
+    return (XSUM_U32)(seed >> 40);
+}
+
+
+static void XSUM_testXXH3(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
+{
+    if (len>0) assert(data != NULL);
+
+    {   XSUM_U64 const Dresult = XXH3_64bits_withSeed(data, len, seed);
+        XSUM_checkResult64(Dresult, Nresult);
+    }
+
+    /* check that the no-seed variant produces same result as seed==0 */
+    if (seed == 0) {
+        XSUM_U64 const Dresult = XXH3_64bits(data, len);
+        XSUM_checkResult64(Dresult, Nresult);
+    }
+
+    /* streaming API test */
+    {   XXH3_state_t* const state = XXH3_createState();
+        assert(state != NULL);
+        /* single ingestion */
+        (void)XXH3_64bits_reset_withSeed(state, seed);
+        (void)XXH3_64bits_update(state, data, len);
+        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+
+        /* random ingestion */
+        {   size_t p = 0;
+            (void)XXH3_64bits_reset_withSeed(state, seed);
+            while (p < len) {
+                size_t const modulo = len > 2 ? len : 2;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
+                if (p + l > len) l = len - p;
+                (void)XXH3_64bits_update(state, (const char*)data+p, l);
+                p += l;
+            }
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+        }
+
+        /* byte by byte ingestion */
+        {   size_t pos;
+            (void)XXH3_64bits_reset_withSeed(state, seed);
+            for (pos=0; pos<len; pos++)
+                (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+        }
+        XXH3_freeState(state);
+    }
+}
+
+static void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XSUM_U64 Nresult)
+{
+    if (len>0) assert(data != NULL);
+
+    {   XSUM_U64 const Dresult = XXH3_64bits_withSecret(data, len, secret, secretSize);
+        XSUM_checkResult64(Dresult, Nresult);
+    }
+
+    /* streaming API test */
+    {   XXH3_state_t *state = XXH3_createState();
+        assert(state != NULL);
+        (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
+        (void)XXH3_64bits_update(state, data, len);
+        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+
+        /* random ingestion */
+        {   size_t p = 0;
+            (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
+            while (p < len) {
+                size_t const modulo = len > 2 ? len : 2;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
+                if (p + l > len) l = len - p;
+                (void)XXH3_64bits_update(state, (const char*)data+p, l);
+                p += l;
+            }
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+        }
+
+        /* byte by byte ingestion */
+        {   size_t pos;
+            (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
+            for (pos=0; pos<len; pos++)
+                (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+        }
+        XXH3_freeState(state);
+    }
+}
+
+static void XSUM_testXXH128(const void* data, size_t len, XSUM_U64 seed, XXH128_hash_t Nresult)
+{
+    {   XXH128_hash_t const Dresult = XXH3_128bits_withSeed(data, len, seed);
+        XSUM_checkResult128(Dresult, Nresult);
+    }
+
+    /* check that XXH128() is identical to XXH3_128bits_withSeed() */
+    {   XXH128_hash_t const Dresult2 = XXH128(data, len, seed);
+        XSUM_checkResult128(Dresult2, Nresult);
+    }
+
+    /* check that the no-seed variant produces same result as seed==0 */
+    if (seed == 0) {
+        XXH128_hash_t const Dresult = XXH3_128bits(data, len);
+        XSUM_checkResult128(Dresult, Nresult);
+    }
+
+    /* streaming API test */
+    {   XXH3_state_t *state = XXH3_createState();
+        assert(state != NULL);
+
+        /* single ingestion */
+        (void)XXH3_128bits_reset_withSeed(state, seed);
+        (void)XXH3_128bits_update(state, data, len);
+        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+
+        /* random ingestion */
+        {   size_t p = 0;
+            (void)XXH3_128bits_reset_withSeed(state, seed);
+            while (p < len) {
+                size_t const modulo = len > 2 ? len : 2;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
+                if (p + l > len) l = len - p;
+                (void)XXH3_128bits_update(state, (const char*)data+p, l);
+                p += l;
+            }
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+        }
+
+        /* byte by byte ingestion */
+        {   size_t pos;
+            (void)XXH3_128bits_reset_withSeed(state, seed);
+            for (pos=0; pos<len; pos++)
+                (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+        }
+        XXH3_freeState(state);
+    }
+}
+
+static void XSUM_testXXH128_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XXH128_hash_t Nresult)
+{
+    if (len>0) assert(data != NULL);
+
+    {   XXH128_hash_t const Dresult = XXH3_128bits_withSecret(data, len, secret, secretSize);
+        XSUM_checkResult128(Dresult, Nresult);
+    }
+
+    /* streaming API test */
+    {   XXH3_state_t* const state = XXH3_createState();
+        assert(state != NULL);
+        (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
+        (void)XXH3_128bits_update(state, data, len);
+        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+
+        /* random ingestion */
+        {   size_t p = 0;
+            (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
+            while (p < len) {
+                size_t const modulo = len > 2 ? len : 2;
+                size_t l = (size_t)(XSUM_rand()) % modulo;
+                if (p + l > len) l = len - p;
+                (void)XXH3_128bits_update(state, (const char*)data+p, l);
+                p += l;
+            }
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+        }
+
+        /* byte by byte ingestion */
+        {   size_t pos;
+            (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
+            for (pos=0; pos<len; pos++)
+                (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+        }
+        XXH3_freeState(state);
+    }
+}
+
+#define SECRET_SAMPLE_NBBYTES 4
+typedef struct { XSUM_U8 byte[SECRET_SAMPLE_NBBYTES]; } verifSample_t;
+
+static void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSample_t result)
+{
+    static int nbTests = 1;
+    const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};
+    XSUM_U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
+    verifSample_t samples;
+    int i;
+
+    XXH3_generateSecret(secretBuffer, customSeed, len);
+    for (i=0; i<SECRET_SAMPLE_NBBYTES; i++) {
+        samples.byte[i] = secretBuffer[sampleIndex[i]];
+    }
+    if (memcmp(&samples, &result, sizeof(result))) {
+        XSUM_log("\rError: Secret generation test %i: Internal sanity check failed. \n", nbTests);
+        XSUM_log("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
+                samples.byte[0], samples.byte[1], samples.byte[2], samples.byte[3],
+                result.byte[0], result.byte[1], result.byte[2], result.byte[3] );
+        exit(1);
+    }
+    nbTests++;
+}
+
+
+/*!
+ * XSUM_sanityCheck():
+ * Runs a sanity check before the benchmark.
+ *
+ * Exits on an incorrect output.
+ */
+XSUM_API void XSUM_sanityCheck(void)
+{
+#define SANITY_BUFFER_SIZE 2367
+    XSUM_U8 sanityBuffer[SANITY_BUFFER_SIZE];
+    XSUM_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
+
+    XSUM_testXXH32(NULL,          0, 0,       0x02CC5D05);
+    XSUM_testXXH32(NULL,          0, PRIME32, 0x36B78AE7);
+    XSUM_testXXH32(sanityBuffer,  1, 0,       0xCF65B03E);
+    XSUM_testXXH32(sanityBuffer,  1, PRIME32, 0xB4545AA4);
+    XSUM_testXXH32(sanityBuffer, 14, 0,       0x1208E7E2);
+    XSUM_testXXH32(sanityBuffer, 14, PRIME32, 0x6AF1D1FE);
+    XSUM_testXXH32(sanityBuffer,222, 0,       0x5BD11DBD);
+    XSUM_testXXH32(sanityBuffer,222, PRIME32, 0x58803C5F);
+
+    XSUM_testXXH64(NULL        ,  0, 0,       0xEF46DB3751D8E999ULL);
+    XSUM_testXXH64(NULL        ,  0, PRIME32, 0xAC75FDA2929B17EFULL);
+    XSUM_testXXH64(sanityBuffer,  1, 0,       0xE934A84ADB052768ULL);
+    XSUM_testXXH64(sanityBuffer,  1, PRIME32, 0x5014607643A9B4C3ULL);
+    XSUM_testXXH64(sanityBuffer,  4, 0,       0x9136A0DCA57457EEULL);
+    XSUM_testXXH64(sanityBuffer, 14, 0,       0x8282DCC4994E35C8ULL);
+    XSUM_testXXH64(sanityBuffer, 14, PRIME32, 0xC3BD6BF63DEB6DF0ULL);
+    XSUM_testXXH64(sanityBuffer,222, 0,       0xB641AE8CB691C174ULL);
+    XSUM_testXXH64(sanityBuffer,222, PRIME32, 0x20CB8AB7AE10C14AULL);
+
+    XSUM_testXXH3(NULL,           0, 0,       0x2D06800538D394C2ULL);  /* empty string */
+    XSUM_testXXH3(NULL,           0, PRIME64, 0xA8A6B918B2F0364AULL);
+    XSUM_testXXH3(sanityBuffer,   1, 0,       0xC44BDFF4074EECDBULL);  /*  1 -  3 */
+    XSUM_testXXH3(sanityBuffer,   1, PRIME64, 0x032BE332DD766EF8ULL);  /*  1 -  3 */
+    XSUM_testXXH3(sanityBuffer,   6, 0,       0x27B56A84CD2D7325ULL);  /*  4 -  8 */
+    XSUM_testXXH3(sanityBuffer,   6, PRIME64, 0x84589C116AB59AB9ULL);  /*  4 -  8 */
+    XSUM_testXXH3(sanityBuffer,  12, 0,       0xA713DAF0DFBB77E7ULL);  /*  9 - 16 */
+    XSUM_testXXH3(sanityBuffer,  12, PRIME64, 0xE7303E1B2336DE0EULL);  /*  9 - 16 */
+    XSUM_testXXH3(sanityBuffer,  24, 0,       0xA3FE70BF9D3510EBULL);  /* 17 - 32 */
+    XSUM_testXXH3(sanityBuffer,  24, PRIME64, 0x850E80FC35BDD690ULL);  /* 17 - 32 */
+    XSUM_testXXH3(sanityBuffer,  48, 0,       0x397DA259ECBA1F11ULL);  /* 33 - 64 */
+    XSUM_testXXH3(sanityBuffer,  48, PRIME64, 0xADC2CBAA44ACC616ULL);  /* 33 - 64 */
+    XSUM_testXXH3(sanityBuffer,  80, 0,       0xBCDEFBBB2C47C90AULL);  /* 65 - 96 */
+    XSUM_testXXH3(sanityBuffer,  80, PRIME64, 0xC6DD0CB699532E73ULL);  /* 65 - 96 */
+    XSUM_testXXH3(sanityBuffer, 195, 0,       0xCD94217EE362EC3AULL);  /* 129-240 */
+    XSUM_testXXH3(sanityBuffer, 195, PRIME64, 0xBA68003D370CB3D9ULL);  /* 129-240 */
+
+    XSUM_testXXH3(sanityBuffer, 403, 0,       0xCDEB804D65C6DEA4ULL);  /* one block, last stripe is overlapping */
+    XSUM_testXXH3(sanityBuffer, 403, PRIME64, 0x6259F6ECFD6443FDULL);  /* one block, last stripe is overlapping */
+    XSUM_testXXH3(sanityBuffer, 512, 0,       0x617E49599013CB6BULL);  /* one block, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer, 512, PRIME64, 0x3CE457DE14C27708ULL);  /* one block, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer,2048, 0,       0xDD59E2C3A5F038E0ULL);  /* 2 blocks, finishing at block boundary */
+    XSUM_testXXH3(sanityBuffer,2048, PRIME64, 0x66F81670669ABABCULL);  /* 2 blocks, finishing at block boundary */
+    XSUM_testXXH3(sanityBuffer,2240, 0,       0x6E73A90539CF2948ULL);  /* 3 blocks, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer,2240, PRIME64, 0x757BA8487D1B5247ULL);  /* 3 blocks, finishing at stripe boundary */
+    XSUM_testXXH3(sanityBuffer,2367, 0,       0xCB37AEB9E5D361EDULL);  /* 3 blocks, last stripe is overlapping */
+    XSUM_testXXH3(sanityBuffer,2367, PRIME64, 0xD2DB3415B942B42AULL);  /* 3 blocks, last stripe is overlapping */
+
+    /* XXH3 with Custom Secret */
+    {   const void* const secret = sanityBuffer + 7;
+        const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
+        assert(sizeof(sanityBuffer) >= 7 + secretSize);
+        XSUM_testXXH3_withSecret(NULL,           0, secret, secretSize, 0x3559D64878C5C66CULL);  /* empty string */
+        XSUM_testXXH3_withSecret(sanityBuffer,   1, secret, secretSize, 0x8A52451418B2DA4DULL);  /*  1 -  3 */
+        XSUM_testXXH3_withSecret(sanityBuffer,   6, secret, secretSize, 0x82C90AB0519369ADULL);  /*  4 -  8 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  12, secret, secretSize, 0x14631E773B78EC57ULL);  /*  9 - 16 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  24, secret, secretSize, 0xCDD5542E4A9D9FE8ULL);  /* 17 - 32 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  48, secret, secretSize, 0x33ABD54D094B2534ULL);  /* 33 - 64 */
+        XSUM_testXXH3_withSecret(sanityBuffer,  80, secret, secretSize, 0xE687BA1684965297ULL);  /* 65 - 96 */
+        XSUM_testXXH3_withSecret(sanityBuffer, 195, secret, secretSize, 0xA057273F5EECFB20ULL);  /* 129-240 */
+
+        XSUM_testXXH3_withSecret(sanityBuffer, 403, secret, secretSize, 0x14546019124D43B8ULL);  /* one block, last stripe is overlapping */
+        XSUM_testXXH3_withSecret(sanityBuffer, 512, secret, secretSize, 0x7564693DD526E28DULL);  /* one block, finishing at stripe boundary */
+        XSUM_testXXH3_withSecret(sanityBuffer,2048, secret, secretSize, 0xD32E975821D6519FULL);  /* >= 2 blocks, at least one scrambling */
+        XSUM_testXXH3_withSecret(sanityBuffer,2367, secret, secretSize, 0x293FA8E5173BB5E7ULL);  /* >= 2 blocks, at least one scrambling, last stripe unaligned */
+
+        XSUM_testXXH3_withSecret(sanityBuffer,64*10*3, secret, secretSize, 0x751D2EC54BC6038BULL);  /* exactly 3 full blocks, not a multiple of 256 */
+    }
+
+    /* XXH128 */
+    {   XXH128_hash_t const expected = { 0x6001C324468D497FULL, 0x99AA06D3014798D8ULL };
+        XSUM_testXXH128(NULL,           0, 0,     expected);         /* empty string */
+    }
+    {   XXH128_hash_t const expected = { 0x5444F7869C671AB0ULL, 0x92220AE55E14AB50ULL };
+        XSUM_testXXH128(NULL,           0, PRIME32, expected);
+    }
+    {   XXH128_hash_t const expected = { 0xC44BDFF4074EECDBULL, 0xA6CD5E9392000F6AULL };
+        XSUM_testXXH128(sanityBuffer,   1, 0,       expected);       /* 1-3 */
+    }
+    {   XXH128_hash_t const expected = { 0xB53D5557E7F76F8DULL, 0x89B99554BA22467CULL };
+        XSUM_testXXH128(sanityBuffer,   1, PRIME32, expected);       /* 1-3 */
+    }
+    {   XXH128_hash_t const expected = { 0x3E7039BDDA43CFC6ULL, 0x082AFE0B8162D12AULL };
+        XSUM_testXXH128(sanityBuffer,   6, 0,       expected);       /* 4-8 */
+    }
+    {   XXH128_hash_t const expected = { 0x269D8F70BE98856EULL, 0x5A865B5389ABD2B1ULL };
+        XSUM_testXXH128(sanityBuffer,   6, PRIME32, expected);       /* 4-8 */
+    }
+    {   XXH128_hash_t const expected = { 0x061A192713F69AD9ULL, 0x6E3EFD8FC7802B18ULL };
+        XSUM_testXXH128(sanityBuffer,  12, 0,       expected);       /* 9-16 */
+    }
+    {   XXH128_hash_t const expected = { 0x9BE9F9A67F3C7DFBULL, 0xD7E09D518A3405D3ULL };
+        XSUM_testXXH128(sanityBuffer,  12, PRIME32, expected);       /* 9-16 */
+    }
+    {   XXH128_hash_t const expected = { 0x1E7044D28B1B901DULL, 0x0CE966E4678D3761ULL };
+        XSUM_testXXH128(sanityBuffer,  24, 0,       expected);       /* 17-32 */
+    }
+    {   XXH128_hash_t const expected = { 0xD7304C54EBAD40A9ULL, 0x3162026714A6A243ULL };
+        XSUM_testXXH128(sanityBuffer,  24, PRIME32, expected);       /* 17-32 */
+    }
+    {   XXH128_hash_t const expected = { 0xF942219AED80F67BULL, 0xA002AC4E5478227EULL };
+        XSUM_testXXH128(sanityBuffer,  48, 0,       expected);       /* 33-64 */
+    }
+    {   XXH128_hash_t const expected = { 0x7BA3C3E453A1934EULL, 0x163ADDE36C072295ULL };
+        XSUM_testXXH128(sanityBuffer,  48, PRIME32, expected);       /* 33-64 */
+    }
+    {   XXH128_hash_t const expected = { 0x5E8BAFB9F95FB803ULL, 0x4952F58181AB0042ULL };
+        XSUM_testXXH128(sanityBuffer,  81, 0,       expected);       /* 65-96 */
+    }
+    {   XXH128_hash_t const expected = { 0x703FBB3D7A5F755CULL, 0x2724EC7ADC750FB6ULL };
+        XSUM_testXXH128(sanityBuffer,  81, PRIME32, expected);       /* 65-96 */
+    }
+    {   XXH128_hash_t const expected = { 0xF1AEBD597CEC6B3AULL, 0x337E09641B948717ULL };
+        XSUM_testXXH128(sanityBuffer, 222, 0,       expected);       /* 129-240 */
+    }
+    {   XXH128_hash_t const expected = { 0xAE995BB8AF917A8DULL, 0x91820016621E97F1ULL };
+        XSUM_testXXH128(sanityBuffer, 222, PRIME32, expected);       /* 129-240 */
+    }
+    {   XXH128_hash_t const expected = { 0xCDEB804D65C6DEA4ULL, 0x1B6DE21E332DD73DULL };
+        XSUM_testXXH128(sanityBuffer, 403, 0,       expected);       /* one block, last stripe is overlapping */
+    }
+    {   XXH128_hash_t const expected = { 0x6259F6ECFD6443FDULL, 0xBED311971E0BE8F2ULL };
+        XSUM_testXXH128(sanityBuffer, 403, PRIME64, expected);       /* one block, last stripe is overlapping */
+    }
+    {   XXH128_hash_t const expected = { 0x617E49599013CB6BULL, 0x18D2D110DCC9BCA1ULL };
+        XSUM_testXXH128(sanityBuffer, 512, 0,       expected);       /* one block, finishing at stripe boundary */
+    }
+    {   XXH128_hash_t const expected = { 0x3CE457DE14C27708ULL, 0x925D06B8EC5B8040ULL };
+        XSUM_testXXH128(sanityBuffer, 512, PRIME64, expected);       /* one block, finishing at stripe boundary */
+    }
+    {   XXH128_hash_t const expected = { 0xDD59E2C3A5F038E0ULL, 0xF736557FD47073A5ULL };
+        XSUM_testXXH128(sanityBuffer,2048, 0,       expected);       /* two blocks, finishing at block boundary */
+    }
+    {   XXH128_hash_t const expected = { 0x230D43F30206260BULL, 0x7FB03F7E7186C3EAULL };
+        XSUM_testXXH128(sanityBuffer,2048, PRIME32, expected);       /* two blocks, finishing at block boundary */
+    }
+    {   XXH128_hash_t const expected = { 0x6E73A90539CF2948ULL, 0xCCB134FBFA7CE49DULL };
+        XSUM_testXXH128(sanityBuffer,2240, 0,       expected);      /* two blocks, ends at stripe boundary */
+    }
+    {   XXH128_hash_t const expected = { 0xED385111126FBA6FULL, 0x50A1FE17B338995FULL };
+        XSUM_testXXH128(sanityBuffer,2240, PRIME32, expected);       /* two blocks, ends at stripe boundary */
+    }
+    {   XXH128_hash_t const expected = { 0xCB37AEB9E5D361EDULL, 0xE89C0F6FF369B427ULL };
+        XSUM_testXXH128(sanityBuffer,2367, 0,       expected);       /* two blocks, last stripe is overlapping */
+    }
+    {   XXH128_hash_t const expected = { 0x6F5360AE69C2F406ULL, 0xD23AAE4B76C31ECBULL };
+        XSUM_testXXH128(sanityBuffer,2367, PRIME32, expected);       /* two blocks, last stripe is overlapping */
+    }
+
+    /* XXH128 with custom Secret */
+    {   const void* const secret = sanityBuffer + 7;
+        const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
+        assert(sizeof(sanityBuffer) >= 7 + secretSize);
+
+        {   XXH128_hash_t const expected = { 0x005923CCEECBE8AEULL, 0x5F70F4EA232F1D38ULL };
+            XSUM_testXXH128_withSecret(NULL,           0, secret, secretSize,     expected);         /* empty string */
+        }
+        {   XXH128_hash_t const expected = { 0x8A52451418B2DA4DULL, 0x3A66AF5A9819198EULL };
+            XSUM_testXXH128_withSecret(sanityBuffer,   1, secret, secretSize,       expected);       /* 1-3 */
+        }
+        {   XXH128_hash_t const expected = { 0x0B61C8ACA7D4778FULL, 0x376BD91B6432F36DULL };
+            XSUM_testXXH128_withSecret(sanityBuffer,   6, secret, secretSize,       expected);       /* 4-8 */
+        }
+        {   XXH128_hash_t const expected = { 0xAF82F6EBA263D7D8ULL, 0x90A3C2D839F57D0FULL };
+            XSUM_testXXH128_withSecret(sanityBuffer,  12, secret, secretSize,       expected);       /* 9-16 */
+        }
+    }
+
+    /* secret generator */
+    {   verifSample_t const expected = { { 0xB8, 0x26, 0x83, 0x7E } };
+        XSUM_testSecretGenerator(NULL, 0, expected);
+    }
+
+    {   verifSample_t const expected = { { 0xA6, 0x16, 0x06, 0x7B } };
+        XSUM_testSecretGenerator(sanityBuffer, 1, expected);
+    }
+
+    {   verifSample_t const expected = { { 0xDA, 0x2A, 0x12, 0x11 } };
+        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_SIZE_MIN - 1, expected);
+    }
+
+    {   verifSample_t const expected = { { 0x7E, 0x48, 0x0C, 0xA7 } };
+        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_DEFAULT_SIZE + 500, expected);
+    }
+
+    XSUM_logVerbose(3, "\r%70s\r", "");       /* Clean display line */
+    XSUM_logVerbose(3, "Sanity check -- all tests ok\n");
+}
+
+#endif /* !XSUM_NO_TESTS */
diff --git a/cli/xsum_sanity_check.h b/cli/xsum_sanity_check.h
new file mode 100644
index 00000000..a3f57a16
--- /dev/null
+++ b/cli/xsum_sanity_check.h
@@ -0,0 +1,57 @@
+/*
+ * xxhsum - Command line interface for xxhash algorithms
+ * Copyright (C) 2013-2020 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#ifndef XSUM_SANITY_CHECK_H
+#define XSUM_SANITY_CHECK_H
+
+#include "xsum_config.h"
+
+#ifdef __cplusplus
+extern "C" {
+#endif
+
+/*
+ * Runs a series of self-tests.
+ *
+ * Exits if any of these tests fail, printing a message to stderr.
+ *
+ * If XSUM_NO_TESTS is defined to non-zero, this will instead print a warning
+ * if this is called (e.g. via xxhsum -b).
+ */
+XSUM_API void XSUM_sanityCheck(void);
+
+/*
+ * Fills a test buffer with pseudorandom data.
+ *
+ * This is used in the sanity check and the benchmarks - its values must not be
+ * changed.
+ */
+XSUM_API void XSUM_fillTestBuffer(XSUM_U8* buffer, size_t len);
+
+#ifdef __cplusplus
+}
+#endif
+
+#endif /* XSUM_SANITY_CHECK_H */
diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 5abd0c5f..cd38be4b 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -87,6 +87,7 @@ if(XXHASH_BUILD_XXHSUM)
   add_executable(xxhsum "${XXHASH_DIR}/xxhsum.c"
                         "${XXHSUM_DIR}/xsum_os_specific.c"
                         "${XXHSUM_DIR}/xsum_output.c"
+                        "${XXHSUM_DIR}/xsum_sanity_check.c"
                 )
   add_executable(${PROJECT_NAME}::xxhsum ALIAS xxhsum)
 
diff --git a/xxhsum.c b/xxhsum.c
index bac55b20..82324091 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -34,9 +34,11 @@
 #include "cli/xsum_arch.h"
 #include "cli/xsum_os_specific.h"
 #include "cli/xsum_output.h"
+#include "cli/xsum_sanity_check.h"
 #ifdef XXH_INLINE_ALL
 #  include "cli/xsum_os_specific.c"
 #  include "cli/xsum_output.c"
+#  include "cli/xsum_sanity_check.c"
 #endif
 
 /* ************************************
@@ -166,28 +168,6 @@ static char* XSUM_strcatDup(const char* s1, const char* s2)
 }
 
 
-/* use #define to make them constant, required for initialization */
-#define PRIME32 2654435761U
-#define PRIME64 11400714785074694797ULL
-
-/*
- * Fills a test buffer with pseudorandom data.
- *
- * This is used in the sanity check - its values must not be changed.
- */
-static void XSUM_fillTestBuffer(XSUM_U8* buffer, size_t len)
-{
-    XSUM_U64 byteGen = PRIME32;
-    size_t i;
-
-    assert(buffer != NULL);
-
-    for (i=0; i<len; i++) {
-        buffer[i] = (XSUM_U8)(byteGen>>56);
-        byteGen *= PRIME64;
-    }
-}
-
 /*
  * A secret buffer used for benchmarking XXH3's withSecret variants.
  *
@@ -506,506 +486,6 @@ static int XSUM_benchInternal(size_t keySize)
     return 0;
 }
 
-
-/* ************************************************
- * Self-test:
- * ensure results consistency accross platforms
- *********************************************** */
-
-static void XSUM_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
-{
-    static int nbTests = 1;
-    if (r1!=r2) {
-        XSUM_log("\rError: 32-bit hash test %i: Internal sanity check failed!\n", nbTests);
-        XSUM_log("\rGot 0x%08X, expected 0x%08X.\n", (unsigned)r1, (unsigned)r2);
-        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
-                  "or temporarily comment out the tests in XSUM_sanityCheck.\n");
-        exit(1);
-    }
-    nbTests++;
-}
-
-static void XSUM_checkResult64(XXH64_hash_t r1, XXH64_hash_t r2)
-{
-    static int nbTests = 1;
-    if (r1!=r2) {
-        XSUM_log("\rError: 64-bit hash test %i: Internal sanity check failed!\n", nbTests);
-        XSUM_log("\rGot 0x%08X%08XULL, expected 0x%08X%08XULL.\n",
-                (unsigned)(r1>>32), (unsigned)r1, (unsigned)(r2>>32), (unsigned)r2);
-        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
-                  "or temporarily comment out the tests in XSUM_sanityCheck.\n");
-        exit(1);
-    }
-    nbTests++;
-}
-
-static void XSUM_checkResult128(XXH128_hash_t r1, XXH128_hash_t r2)
-{
-    static int nbTests = 1;
-    if ((r1.low64 != r2.low64) || (r1.high64 != r2.high64)) {
-        XSUM_log("\rError: 128-bit hash test %i: Internal sanity check failed.\n", nbTests);
-        XSUM_log("\rGot { 0x%08X%08XULL, 0x%08X%08XULL }, expected { 0x%08X%08XULL, 0x%08X%08XULL } \n",
-                (unsigned)(r1.low64>>32), (unsigned)r1.low64, (unsigned)(r1.high64>>32), (unsigned)r1.high64,
-                (unsigned)(r2.low64>>32), (unsigned)r2.low64, (unsigned)(r2.high64>>32), (unsigned)r2.high64 );
-        XSUM_log("\rNote: If you modified the hash functions, make sure to either update the values\n"
-                  "or temporarily comment out the tests in XSUM_sanityCheck.\n");
-        exit(1);
-    }
-    nbTests++;
-}
-
-
-static void XSUM_testXXH32(const void* data, size_t len, XSUM_U32 seed, XSUM_U32 Nresult)
-{
-    XXH32_state_t *state = XXH32_createState();
-    size_t pos;
-
-    assert(state != NULL);
-    if (len>0) assert(data != NULL);
-
-    XSUM_checkResult32(XXH32(data, len, seed), Nresult);
-
-    (void)XXH32_reset(state, seed);
-    (void)XXH32_update(state, data, len);
-    XSUM_checkResult32(XXH32_digest(state), Nresult);
-
-    (void)XXH32_reset(state, seed);
-    for (pos=0; pos<len; pos++)
-        (void)XXH32_update(state, ((const char*)data)+pos, 1);
-    XSUM_checkResult32(XXH32_digest(state), Nresult);
-    XXH32_freeState(state);
-}
-
-static void XSUM_testXXH64(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
-{
-    XXH64_state_t *state = XXH64_createState();
-    size_t pos;
-
-    assert(state != NULL);
-    if (len>0) assert(data != NULL);
-
-    XSUM_checkResult64(XXH64(data, len, seed), Nresult);
-
-    (void)XXH64_reset(state, seed);
-    (void)XXH64_update(state, data, len);
-    XSUM_checkResult64(XXH64_digest(state), Nresult);
-
-    (void)XXH64_reset(state, seed);
-    for (pos=0; pos<len; pos++)
-        (void)XXH64_update(state, ((const char*)data)+pos, 1);
-    XSUM_checkResult64(XXH64_digest(state), Nresult);
-    XXH64_freeState(state);
-}
-
-static XSUM_U32 XSUM_rand(void)
-{
-    static XSUM_U64 seed = PRIME32;
-    seed *= PRIME64;
-    return (XSUM_U32)(seed >> 40);
-}
-
-
-void XSUM_testXXH3(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
-{
-    if (len>0) assert(data != NULL);
-
-    {   XSUM_U64 const Dresult = XXH3_64bits_withSeed(data, len, seed);
-        XSUM_checkResult64(Dresult, Nresult);
-    }
-
-    /* check that the no-seed variant produces same result as seed==0 */
-    if (seed == 0) {
-        XSUM_U64 const Dresult = XXH3_64bits(data, len);
-        XSUM_checkResult64(Dresult, Nresult);
-    }
-
-    /* streaming API test */
-    {   XXH3_state_t* const state = XXH3_createState();
-        assert(state != NULL);
-        /* single ingestion */
-        (void)XXH3_64bits_reset_withSeed(state, seed);
-        (void)XXH3_64bits_update(state, data, len);
-        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-
-        /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_64bits_reset_withSeed(state, seed);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_64bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-        }
-
-        /* byte by byte ingestion */
-        {   size_t pos;
-            (void)XXH3_64bits_reset_withSeed(state, seed);
-            for (pos=0; pos<len; pos++)
-                (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
-            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-        }
-        XXH3_freeState(state);
-    }
-}
-
-void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XSUM_U64 Nresult)
-{
-    if (len>0) assert(data != NULL);
-
-    {   XSUM_U64 const Dresult = XXH3_64bits_withSecret(data, len, secret, secretSize);
-        XSUM_checkResult64(Dresult, Nresult);
-    }
-
-    /* streaming API test */
-    {   XXH3_state_t *state = XXH3_createState();
-        assert(state != NULL);
-        (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
-        (void)XXH3_64bits_update(state, data, len);
-        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-
-        /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_64bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-        }
-
-        /* byte by byte ingestion */
-        {   size_t pos;
-            (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
-            for (pos=0; pos<len; pos++)
-                (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
-            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-        }
-        XXH3_freeState(state);
-    }
-}
-
-void XSUM_testXXH128(const void* data, size_t len, XSUM_U64 seed, XXH128_hash_t Nresult)
-{
-    {   XXH128_hash_t const Dresult = XXH3_128bits_withSeed(data, len, seed);
-        XSUM_checkResult128(Dresult, Nresult);
-    }
-
-    /* check that XXH128() is identical to XXH3_128bits_withSeed() */
-    {   XXH128_hash_t const Dresult2 = XXH128(data, len, seed);
-        XSUM_checkResult128(Dresult2, Nresult);
-    }
-
-    /* check that the no-seed variant produces same result as seed==0 */
-    if (seed == 0) {
-        XXH128_hash_t const Dresult = XXH3_128bits(data, len);
-        XSUM_checkResult128(Dresult, Nresult);
-    }
-
-    /* streaming API test */
-    {   XXH3_state_t *state = XXH3_createState();
-        assert(state != NULL);
-
-        /* single ingestion */
-        (void)XXH3_128bits_reset_withSeed(state, seed);
-        (void)XXH3_128bits_update(state, data, len);
-        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-
-        /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_128bits_reset_withSeed(state, seed);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_128bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-        }
-
-        /* byte by byte ingestion */
-        {   size_t pos;
-            (void)XXH3_128bits_reset_withSeed(state, seed);
-            for (pos=0; pos<len; pos++)
-                (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
-            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-        }
-        XXH3_freeState(state);
-    }
-}
-
-void XSUM_testXXH128_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XXH128_hash_t Nresult)
-{
-    if (len>0) assert(data != NULL);
-
-    {   XXH128_hash_t const Dresult = XXH3_128bits_withSecret(data, len, secret, secretSize);
-        XSUM_checkResult128(Dresult, Nresult);
-    }
-
-    /* streaming API test */
-    {   XXH3_state_t* const state = XXH3_createState();
-        assert(state != NULL);
-        (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
-        (void)XXH3_128bits_update(state, data, len);
-        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-
-        /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_128bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-        }
-
-        /* byte by byte ingestion */
-        {   size_t pos;
-            (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
-            for (pos=0; pos<len; pos++)
-                (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
-            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-        }
-        XXH3_freeState(state);
-    }
-}
-
-#define SECRET_SAMPLE_NBBYTES 4
-typedef struct { XSUM_U8 byte[SECRET_SAMPLE_NBBYTES]; } verifSample_t;
-
-void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSample_t result)
-{
-    static int nbTests = 1;
-    const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};
-    XSUM_U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
-    verifSample_t samples;
-    int i;
-
-    XXH3_generateSecret(secretBuffer, customSeed, len);
-    for (i=0; i<SECRET_SAMPLE_NBBYTES; i++) {
-        samples.byte[i] = secretBuffer[sampleIndex[i]];
-    }
-    if (memcmp(&samples, &result, sizeof(result))) {
-        XSUM_log("\rError: Secret generation test %i: Internal sanity check failed. \n", nbTests);
-        XSUM_log("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
-                samples.byte[0], samples.byte[1], samples.byte[2], samples.byte[3],
-                result.byte[0], result.byte[1], result.byte[2], result.byte[3] );
-        exit(1);
-    }
-    nbTests++;
-}
-
-
-/*!
- * XSUM_sanityCheck():
- * Runs a sanity check before the benchmark.
- *
- * Exits on an incorrect output.
- */
-static void XSUM_sanityCheck(void)
-{
-#define SANITY_BUFFER_SIZE 2367
-    XSUM_U8 sanityBuffer[SANITY_BUFFER_SIZE];
-    XSUM_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
-
-    XSUM_testXXH32(NULL,          0, 0,       0x02CC5D05);
-    XSUM_testXXH32(NULL,          0, PRIME32, 0x36B78AE7);
-    XSUM_testXXH32(sanityBuffer,  1, 0,       0xCF65B03E);
-    XSUM_testXXH32(sanityBuffer,  1, PRIME32, 0xB4545AA4);
-    XSUM_testXXH32(sanityBuffer, 14, 0,       0x1208E7E2);
-    XSUM_testXXH32(sanityBuffer, 14, PRIME32, 0x6AF1D1FE);
-    XSUM_testXXH32(sanityBuffer,222, 0,       0x5BD11DBD);
-    XSUM_testXXH32(sanityBuffer,222, PRIME32, 0x58803C5F);
-
-    XSUM_testXXH64(NULL        ,  0, 0,       0xEF46DB3751D8E999ULL);
-    XSUM_testXXH64(NULL        ,  0, PRIME32, 0xAC75FDA2929B17EFULL);
-    XSUM_testXXH64(sanityBuffer,  1, 0,       0xE934A84ADB052768ULL);
-    XSUM_testXXH64(sanityBuffer,  1, PRIME32, 0x5014607643A9B4C3ULL);
-    XSUM_testXXH64(sanityBuffer,  4, 0,       0x9136A0DCA57457EEULL);
-    XSUM_testXXH64(sanityBuffer, 14, 0,       0x8282DCC4994E35C8ULL);
-    XSUM_testXXH64(sanityBuffer, 14, PRIME32, 0xC3BD6BF63DEB6DF0ULL);
-    XSUM_testXXH64(sanityBuffer,222, 0,       0xB641AE8CB691C174ULL);
-    XSUM_testXXH64(sanityBuffer,222, PRIME32, 0x20CB8AB7AE10C14AULL);
-
-    XSUM_testXXH3(NULL,           0, 0,       0x2D06800538D394C2ULL);  /* empty string */
-    XSUM_testXXH3(NULL,           0, PRIME64, 0xA8A6B918B2F0364AULL);
-    XSUM_testXXH3(sanityBuffer,   1, 0,       0xC44BDFF4074EECDBULL);  /*  1 -  3 */
-    XSUM_testXXH3(sanityBuffer,   1, PRIME64, 0x032BE332DD766EF8ULL);  /*  1 -  3 */
-    XSUM_testXXH3(sanityBuffer,   6, 0,       0x27B56A84CD2D7325ULL);  /*  4 -  8 */
-    XSUM_testXXH3(sanityBuffer,   6, PRIME64, 0x84589C116AB59AB9ULL);  /*  4 -  8 */
-    XSUM_testXXH3(sanityBuffer,  12, 0,       0xA713DAF0DFBB77E7ULL);  /*  9 - 16 */
-    XSUM_testXXH3(sanityBuffer,  12, PRIME64, 0xE7303E1B2336DE0EULL);  /*  9 - 16 */
-    XSUM_testXXH3(sanityBuffer,  24, 0,       0xA3FE70BF9D3510EBULL);  /* 17 - 32 */
-    XSUM_testXXH3(sanityBuffer,  24, PRIME64, 0x850E80FC35BDD690ULL);  /* 17 - 32 */
-    XSUM_testXXH3(sanityBuffer,  48, 0,       0x397DA259ECBA1F11ULL);  /* 33 - 64 */
-    XSUM_testXXH3(sanityBuffer,  48, PRIME64, 0xADC2CBAA44ACC616ULL);  /* 33 - 64 */
-    XSUM_testXXH3(sanityBuffer,  80, 0,       0xBCDEFBBB2C47C90AULL);  /* 65 - 96 */
-    XSUM_testXXH3(sanityBuffer,  80, PRIME64, 0xC6DD0CB699532E73ULL);  /* 65 - 96 */
-    XSUM_testXXH3(sanityBuffer, 195, 0,       0xCD94217EE362EC3AULL);  /* 129-240 */
-    XSUM_testXXH3(sanityBuffer, 195, PRIME64, 0xBA68003D370CB3D9ULL);  /* 129-240 */
-
-    XSUM_testXXH3(sanityBuffer, 403, 0,       0xCDEB804D65C6DEA4ULL);  /* one block, last stripe is overlapping */
-    XSUM_testXXH3(sanityBuffer, 403, PRIME64, 0x6259F6ECFD6443FDULL);  /* one block, last stripe is overlapping */
-    XSUM_testXXH3(sanityBuffer, 512, 0,       0x617E49599013CB6BULL);  /* one block, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer, 512, PRIME64, 0x3CE457DE14C27708ULL);  /* one block, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer,2048, 0,       0xDD59E2C3A5F038E0ULL);  /* 2 blocks, finishing at block boundary */
-    XSUM_testXXH3(sanityBuffer,2048, PRIME64, 0x66F81670669ABABCULL);  /* 2 blocks, finishing at block boundary */
-    XSUM_testXXH3(sanityBuffer,2240, 0,       0x6E73A90539CF2948ULL);  /* 3 blocks, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer,2240, PRIME64, 0x757BA8487D1B5247ULL);  /* 3 blocks, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer,2367, 0,       0xCB37AEB9E5D361EDULL);  /* 3 blocks, last stripe is overlapping */
-    XSUM_testXXH3(sanityBuffer,2367, PRIME64, 0xD2DB3415B942B42AULL);  /* 3 blocks, last stripe is overlapping */
-
-    /* XXH3 with Custom Secret */
-    {   const void* const secret = sanityBuffer + 7;
-        const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
-        assert(sizeof(sanityBuffer) >= 7 + secretSize);
-        XSUM_testXXH3_withSecret(NULL,           0, secret, secretSize, 0x3559D64878C5C66CULL);  /* empty string */
-        XSUM_testXXH3_withSecret(sanityBuffer,   1, secret, secretSize, 0x8A52451418B2DA4DULL);  /*  1 -  3 */
-        XSUM_testXXH3_withSecret(sanityBuffer,   6, secret, secretSize, 0x82C90AB0519369ADULL);  /*  4 -  8 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  12, secret, secretSize, 0x14631E773B78EC57ULL);  /*  9 - 16 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  24, secret, secretSize, 0xCDD5542E4A9D9FE8ULL);  /* 17 - 32 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  48, secret, secretSize, 0x33ABD54D094B2534ULL);  /* 33 - 64 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  80, secret, secretSize, 0xE687BA1684965297ULL);  /* 65 - 96 */
-        XSUM_testXXH3_withSecret(sanityBuffer, 195, secret, secretSize, 0xA057273F5EECFB20ULL);  /* 129-240 */
-
-        XSUM_testXXH3_withSecret(sanityBuffer, 403, secret, secretSize, 0x14546019124D43B8ULL);  /* one block, last stripe is overlapping */
-        XSUM_testXXH3_withSecret(sanityBuffer, 512, secret, secretSize, 0x7564693DD526E28DULL);  /* one block, finishing at stripe boundary */
-        XSUM_testXXH3_withSecret(sanityBuffer,2048, secret, secretSize, 0xD32E975821D6519FULL);  /* >= 2 blocks, at least one scrambling */
-        XSUM_testXXH3_withSecret(sanityBuffer,2367, secret, secretSize, 0x293FA8E5173BB5E7ULL);  /* >= 2 blocks, at least one scrambling, last stripe unaligned */
-
-        XSUM_testXXH3_withSecret(sanityBuffer,64*10*3, secret, secretSize, 0x751D2EC54BC6038BULL);  /* exactly 3 full blocks, not a multiple of 256 */
-    }
-
-    /* XXH128 */
-    {   XXH128_hash_t const expected = { 0x6001C324468D497FULL, 0x99AA06D3014798D8ULL };
-        XSUM_testXXH128(NULL,           0, 0,     expected);         /* empty string */
-    }
-    {   XXH128_hash_t const expected = { 0x5444F7869C671AB0ULL, 0x92220AE55E14AB50ULL };
-        XSUM_testXXH128(NULL,           0, PRIME32, expected);
-    }
-    {   XXH128_hash_t const expected = { 0xC44BDFF4074EECDBULL, 0xA6CD5E9392000F6AULL };
-        XSUM_testXXH128(sanityBuffer,   1, 0,       expected);       /* 1-3 */
-    }
-    {   XXH128_hash_t const expected = { 0xB53D5557E7F76F8DULL, 0x89B99554BA22467CULL };
-        XSUM_testXXH128(sanityBuffer,   1, PRIME32, expected);       /* 1-3 */
-    }
-    {   XXH128_hash_t const expected = { 0x3E7039BDDA43CFC6ULL, 0x082AFE0B8162D12AULL };
-        XSUM_testXXH128(sanityBuffer,   6, 0,       expected);       /* 4-8 */
-    }
-    {   XXH128_hash_t const expected = { 0x269D8F70BE98856EULL, 0x5A865B5389ABD2B1ULL };
-        XSUM_testXXH128(sanityBuffer,   6, PRIME32, expected);       /* 4-8 */
-    }
-    {   XXH128_hash_t const expected = { 0x061A192713F69AD9ULL, 0x6E3EFD8FC7802B18ULL };
-        XSUM_testXXH128(sanityBuffer,  12, 0,       expected);       /* 9-16 */
-    }
-    {   XXH128_hash_t const expected = { 0x9BE9F9A67F3C7DFBULL, 0xD7E09D518A3405D3ULL };
-        XSUM_testXXH128(sanityBuffer,  12, PRIME32, expected);       /* 9-16 */
-    }
-    {   XXH128_hash_t const expected = { 0x1E7044D28B1B901DULL, 0x0CE966E4678D3761ULL };
-        XSUM_testXXH128(sanityBuffer,  24, 0,       expected);       /* 17-32 */
-    }
-    {   XXH128_hash_t const expected = { 0xD7304C54EBAD40A9ULL, 0x3162026714A6A243ULL };
-        XSUM_testXXH128(sanityBuffer,  24, PRIME32, expected);       /* 17-32 */
-    }
-    {   XXH128_hash_t const expected = { 0xF942219AED80F67BULL, 0xA002AC4E5478227EULL };
-        XSUM_testXXH128(sanityBuffer,  48, 0,       expected);       /* 33-64 */
-    }
-    {   XXH128_hash_t const expected = { 0x7BA3C3E453A1934EULL, 0x163ADDE36C072295ULL };
-        XSUM_testXXH128(sanityBuffer,  48, PRIME32, expected);       /* 33-64 */
-    }
-    {   XXH128_hash_t const expected = { 0x5E8BAFB9F95FB803ULL, 0x4952F58181AB0042ULL };
-        XSUM_testXXH128(sanityBuffer,  81, 0,       expected);       /* 65-96 */
-    }
-    {   XXH128_hash_t const expected = { 0x703FBB3D7A5F755CULL, 0x2724EC7ADC750FB6ULL };
-        XSUM_testXXH128(sanityBuffer,  81, PRIME32, expected);       /* 65-96 */
-    }
-    {   XXH128_hash_t const expected = { 0xF1AEBD597CEC6B3AULL, 0x337E09641B948717ULL };
-        XSUM_testXXH128(sanityBuffer, 222, 0,       expected);       /* 129-240 */
-    }
-    {   XXH128_hash_t const expected = { 0xAE995BB8AF917A8DULL, 0x91820016621E97F1ULL };
-        XSUM_testXXH128(sanityBuffer, 222, PRIME32, expected);       /* 129-240 */
-    }
-    {   XXH128_hash_t const expected = { 0xCDEB804D65C6DEA4ULL, 0x1B6DE21E332DD73DULL };
-        XSUM_testXXH128(sanityBuffer, 403, 0,       expected);       /* one block, last stripe is overlapping */
-    }
-    {   XXH128_hash_t const expected = { 0x6259F6ECFD6443FDULL, 0xBED311971E0BE8F2ULL };
-        XSUM_testXXH128(sanityBuffer, 403, PRIME64, expected);       /* one block, last stripe is overlapping */
-    }
-    {   XXH128_hash_t const expected = { 0x617E49599013CB6BULL, 0x18D2D110DCC9BCA1ULL };
-        XSUM_testXXH128(sanityBuffer, 512, 0,       expected);       /* one block, finishing at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0x3CE457DE14C27708ULL, 0x925D06B8EC5B8040ULL };
-        XSUM_testXXH128(sanityBuffer, 512, PRIME64, expected);       /* one block, finishing at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0xDD59E2C3A5F038E0ULL, 0xF736557FD47073A5ULL };
-        XSUM_testXXH128(sanityBuffer,2048, 0,       expected);       /* two blocks, finishing at block boundary */
-    }
-    {   XXH128_hash_t const expected = { 0x230D43F30206260BULL, 0x7FB03F7E7186C3EAULL };
-        XSUM_testXXH128(sanityBuffer,2048, PRIME32, expected);       /* two blocks, finishing at block boundary */
-    }
-    {   XXH128_hash_t const expected = { 0x6E73A90539CF2948ULL, 0xCCB134FBFA7CE49DULL };
-        XSUM_testXXH128(sanityBuffer,2240, 0,       expected);      /* two blocks, ends at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0xED385111126FBA6FULL, 0x50A1FE17B338995FULL };
-        XSUM_testXXH128(sanityBuffer,2240, PRIME32, expected);       /* two blocks, ends at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0xCB37AEB9E5D361EDULL, 0xE89C0F6FF369B427ULL };
-        XSUM_testXXH128(sanityBuffer,2367, 0,       expected);       /* two blocks, last stripe is overlapping */
-    }
-    {   XXH128_hash_t const expected = { 0x6F5360AE69C2F406ULL, 0xD23AAE4B76C31ECBULL };
-        XSUM_testXXH128(sanityBuffer,2367, PRIME32, expected);       /* two blocks, last stripe is overlapping */
-    }
-
-    /* XXH128 with custom Secret */
-    {   const void* const secret = sanityBuffer + 7;
-        const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
-        assert(sizeof(sanityBuffer) >= 7 + secretSize);
-
-        {   XXH128_hash_t const expected = { 0x005923CCEECBE8AEULL, 0x5F70F4EA232F1D38ULL };
-            XSUM_testXXH128_withSecret(NULL,           0, secret, secretSize,     expected);         /* empty string */
-        }
-        {   XXH128_hash_t const expected = { 0x8A52451418B2DA4DULL, 0x3A66AF5A9819198EULL };
-            XSUM_testXXH128_withSecret(sanityBuffer,   1, secret, secretSize,       expected);       /* 1-3 */
-        }
-        {   XXH128_hash_t const expected = { 0x0B61C8ACA7D4778FULL, 0x376BD91B6432F36DULL };
-            XSUM_testXXH128_withSecret(sanityBuffer,   6, secret, secretSize,       expected);       /* 4-8 */
-        }
-        {   XXH128_hash_t const expected = { 0xAF82F6EBA263D7D8ULL, 0x90A3C2D839F57D0FULL };
-            XSUM_testXXH128_withSecret(sanityBuffer,  12, secret, secretSize,       expected);       /* 9-16 */
-        }
-    }
-
-    /* secret generator */
-    {   verifSample_t const expected = { { 0xB8, 0x26, 0x83, 0x7E } };
-        XSUM_testSecretGenerator(NULL, 0, expected);
-    }
-
-    {   verifSample_t const expected = { { 0xA6, 0x16, 0x06, 0x7B } };
-        XSUM_testSecretGenerator(sanityBuffer, 1, expected);
-    }
-
-    {   verifSample_t const expected = { { 0xDA, 0x2A, 0x12, 0x11 } };
-        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_SIZE_MIN - 1, expected);
-    }
-
-    {   verifSample_t const expected = { { 0x7E, 0x48, 0x0C, 0xA7 } };
-        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_DEFAULT_SIZE + 500, expected);
-    }
-
-    XSUM_logVerbose(3, "\r%70s\r", "");       /* Clean display line */
-    XSUM_logVerbose(3, "Sanity check -- all tests ok\n");
-}
-
-
 /* ********************************************************
 *  File Hashing
 **********************************************************/

From 684812267dcaf99d8571c00bd5a215eca0c4daaa Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Sun, 20 Sep 2020 14:16:06 -0400
Subject: [PATCH 024/187] xsum_sanity_check.c: Refactor to use tables

Refactored xsum_sanity_check.c to use a table instead of direct
function calls. This is cleaner and has smaller code size.

Additionally, did some cleanup to reduce repetition a bit. There is
still a lot of DRY that could be applied to this file.
---
 cli/xsum_sanity_check.c | 520 +++++++++++++++++++++-------------------
 1 file changed, 279 insertions(+), 241 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 6d9c6c13..347d1db5 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -68,6 +68,158 @@ XSUM_API void XSUM_sanityCheck(void)
     XSUM_log("This version of xxhsum is not verified.\n");
 }
 #else
+
+/*
+ * Test data vectors
+ */
+typedef struct {
+    XSUM_U32 len;
+    XSUM_U32 seed;
+    XSUM_U32 Nresult;
+} XSUM_testdata32_t;
+
+typedef struct {
+    XSUM_U32 len;
+    XSUM_U64 seed;
+    XSUM_U64 Nresult;
+} XSUM_testdata64_t;
+
+typedef struct {
+    XSUM_U32 len;
+    XSUM_U64 seed;
+    XXH128_hash_t Nresult;
+} XSUM_testdata128_t;
+
+#define SECRET_SAMPLE_NBBYTES 4
+typedef struct {
+    XSUM_U32 len;
+    XSUM_U8 byte[SECRET_SAMPLE_NBBYTES];
+} XSUM_testdata_sample_t;
+
+/* XXH32 */
+static const XSUM_testdata32_t XSUM_XXH32_testdata[] = {
+    {   0,       0, 0x02CC5D05U },
+    {   0, PRIME32, 0x36B78AE7U },
+    {   1,       0, 0xCF65B03EU },
+    {   1, PRIME32, 0xB4545AA4U },
+    {  14,       0, 0x1208E7E2U },
+    {  14, PRIME32, 0x6AF1D1FEU },
+    { 222,       0, 0x5BD11DBDU },
+    { 222, PRIME32, 0x58803C5FU }
+};
+
+/* XXH64 */
+static const XSUM_testdata64_t XSUM_XXH64_testdata[] = {
+    {   0,       0, 0xEF46DB3751D8E999ULL },
+    {   0, PRIME32, 0xAC75FDA2929B17EFULL },
+    {   1,       0, 0xE934A84ADB052768ULL },
+    {   1, PRIME32, 0x5014607643A9B4C3ULL },
+    {   4,       0, 0x9136A0DCA57457EEULL },
+    {  14,       0, 0x8282DCC4994E35C8ULL },
+    {  14, PRIME32, 0xC3BD6BF63DEB6DF0ULL },
+    { 222,       0, 0xB641AE8CB691C174ULL },
+    { 222, PRIME32, 0x20CB8AB7AE10C14AULL }
+};
+/*
+ * XXH3:
+ * Due to being a more complex hash function with specializations for certain
+ * lengths, a more extensive test is used for XXH3.
+ */
+
+/* XXH3_64bits, seeded */
+static const XSUM_testdata64_t XSUM_XXH3_testdata[] = {
+    {    0,       0, 0x2D06800538D394C2ULL },  /* empty string */
+    {    0, PRIME64, 0xA8A6B918B2F0364AULL },  /* empty string */
+    {    1,       0, 0xC44BDFF4074EECDBULL },  /*  1 -  3 */
+    {    1, PRIME64, 0x032BE332DD766EF8ULL },  /*  1 -  3 */
+    {    6,       0, 0x27B56A84CD2D7325ULL },  /*  4 -  8 */
+    {    6, PRIME64, 0x84589C116AB59AB9ULL },  /*  4 -  8 */
+    {   12,       0, 0xA713DAF0DFBB77E7ULL },  /*  9 - 16 */
+    {   12, PRIME64, 0xE7303E1B2336DE0EULL },  /*  9 - 16 */
+    {   24,       0, 0xA3FE70BF9D3510EBULL },  /* 17 - 32 */
+    {   24, PRIME64, 0x850E80FC35BDD690ULL },  /* 17 - 32 */
+    {   48,       0, 0x397DA259ECBA1F11ULL },  /* 33 - 64 */
+    {   48, PRIME64, 0xADC2CBAA44ACC616ULL },  /* 33 - 64 */
+    {   80,       0, 0xBCDEFBBB2C47C90AULL },  /* 65 - 96 */
+    {   80, PRIME64, 0xC6DD0CB699532E73ULL },  /* 65 - 96 */
+    {  195,       0, 0xCD94217EE362EC3AULL },  /* 129-240 */
+    {  195, PRIME64, 0xBA68003D370CB3D9ULL },  /* 129-240 */
+
+    {  403,       0, 0xCDEB804D65C6DEA4ULL },  /* one block, last stripe is overlapping */
+    {  403, PRIME64, 0x6259F6ECFD6443FDULL },  /* one block, last stripe is overlapping */
+    {  512,       0, 0x617E49599013CB6BULL },  /* one block, finishing at stripe boundary */
+    {  512, PRIME64, 0x3CE457DE14C27708ULL },  /* one block, finishing at stripe boundary */
+    { 2048,       0, 0xDD59E2C3A5F038E0ULL },  /* 2 blocks, finishing at block boundary */
+    { 2048, PRIME64, 0x66F81670669ABABCULL },  /* 2 blocks, finishing at block boundary */
+    { 2240,       0, 0x6E73A90539CF2948ULL },  /* 3 blocks, finishing at stripe boundary */
+    { 2240, PRIME64, 0x757BA8487D1B5247ULL },  /* 3 blocks, finishing at stripe boundary */
+    { 2367,       0, 0xCB37AEB9E5D361EDULL },  /* 3 blocks, last stripe is overlapping */
+    { 2367, PRIME64, 0xD2DB3415B942B42AULL }   /* 3 blocks, last stripe is overlapping */
+};
+/* XXH3_64bits, custom secret */
+static const XSUM_testdata64_t XSUM_XXH3_withSecret_testdata[] = {
+    {       0, 0, 0x3559D64878C5C66CULL },  /* empty string */
+    {       1, 0, 0x8A52451418B2DA4DULL },  /*  1 -  3 */
+    {       6, 0, 0x82C90AB0519369ADULL },  /*  4 -  8 */
+    {      12, 0, 0x14631E773B78EC57ULL },  /*  9 - 16 */
+    {      24, 0, 0xCDD5542E4A9D9FE8ULL },  /* 17 - 32 */
+    {      48, 0, 0x33ABD54D094B2534ULL },  /* 33 - 64 */
+    {      80, 0, 0xE687BA1684965297ULL },  /* 65 - 96 */
+    {     195, 0, 0xA057273F5EECFB20ULL },  /* 129-240 */
+
+    {     403, 0, 0x14546019124D43B8ULL },  /* one block, last stripe is overlapping */
+    {     512, 0, 0x7564693DD526E28DULL },  /* one block, finishing at stripe boundary */
+    {    2048, 0, 0xD32E975821D6519FULL },  /* >= 2 blodcks, at least one scrambling */
+    {    2367, 0, 0x293FA8E5173BB5E7ULL },  /* >= 2 blocks, at least one scrambling, last stripe unaligned */
+
+    { 64*10*3, 0, 0x751D2EC54BC6038BULL }   /* exactly 3 full blocks, not a multiple of 256 */
+};
+/* XXH3_128bits, seeded */
+static const XSUM_testdata128_t XSUM_XXH128_testdata[] = {
+    {    0,       0, { 0x6001C324468D497FULL, 0x99AA06D3014798D8ULL } },  /* empty string */
+    {    0, PRIME32, { 0x5444F7869C671AB0ULL, 0x92220AE55E14AB50ULL } },  /* empty string */
+    {    1,       0, { 0xC44BDFF4074EECDBULL, 0xA6CD5E9392000F6AULL } },  /*  1 -  3 */
+    {    1, PRIME32, { 0xB53D5557E7F76F8DULL, 0x89B99554BA22467CULL } },  /*  1 -  3 */
+    {    6,       0, { 0x3E7039BDDA43CFC6ULL, 0x082AFE0B8162D12AULL } },  /*  4 -  8 */
+    {    6, PRIME32, { 0x269D8F70BE98856EULL, 0x5A865B5389ABD2B1ULL } },  /*  4 -  8 */
+    {   12,       0, { 0x061A192713F69AD9ULL, 0x6E3EFD8FC7802B18ULL } },  /*  9 - 16 */
+    {   12, PRIME32, { 0x9BE9F9A67F3C7DFBULL, 0xD7E09D518A3405D3ULL } },  /*  9 - 16 */
+    {   24,       0, { 0x1E7044D28B1B901DULL, 0x0CE966E4678D3761ULL } },  /* 17 - 32 */
+    {   24, PRIME32, { 0xD7304C54EBAD40A9ULL, 0x3162026714A6A243ULL } },  /* 17 - 32 */
+    {   48,       0, { 0xF942219AED80F67BULL, 0xA002AC4E5478227EULL } },  /* 33 - 64 */
+    {   48, PRIME32, { 0x7BA3C3E453A1934EULL, 0x163ADDE36C072295ULL } },  /* 33 - 64 */
+    {   81,       0, { 0x5E8BAFB9F95FB803ULL, 0x4952F58181AB0042ULL } },  /* 65 - 96 */
+    {   81, PRIME32, { 0x703FBB3D7A5F755CULL, 0x2724EC7ADC750FB6ULL } },  /* 65 - 96 */
+    {  222,       0, { 0xF1AEBD597CEC6B3AULL, 0x337E09641B948717ULL } },  /* 129-240 */
+    {  222, PRIME32, { 0xAE995BB8AF917A8DULL, 0x91820016621E97F1ULL } },  /* 129-240 */
+
+    {  403,       0, { 0xCDEB804D65C6DEA4ULL, 0x1B6DE21E332DD73DULL } },  /* one block, last stripe is overlapping */
+    {  403, PRIME64, { 0x6259F6ECFD6443FDULL, 0xBED311971E0BE8F2ULL } },  /* one block, last stripe is overlapping */
+    {  512,       0, { 0x617E49599013CB6BULL, 0x18D2D110DCC9BCA1ULL } },  /* one block, finishing at stripe boundary */
+    {  512, PRIME64, { 0x3CE457DE14C27708ULL, 0x925D06B8EC5B8040ULL } },  /* one block, finishing at stripe boundary */
+    { 2048,       0, { 0xDD59E2C3A5F038E0ULL, 0xF736557FD47073A5ULL } },  /* 2 blocks, finishing at block boundary */
+    { 2048, PRIME32, { 0x230D43F30206260BULL, 0x7FB03F7E7186C3EAULL } },  /* 2 blocks, finishing at block boundary */
+    { 2240,       0, { 0x6E73A90539CF2948ULL, 0xCCB134FBFA7CE49DULL } },  /* 3 blocks, finishing at stripe boundary */
+    { 2240, PRIME32, { 0xED385111126FBA6FULL, 0x50A1FE17B338995FULL } },  /* 3 blocks, finishing at stripe boundary */
+    { 2367,       0, { 0xCB37AEB9E5D361EDULL, 0xE89C0F6FF369B427ULL } },  /* 3 blocks, last stripe is overlapping */
+    { 2367, PRIME32, { 0x6F5360AE69C2F406ULL, 0xD23AAE4B76C31ECBULL } }   /* 3 blocks, last stripe is overlapping */
+};
+
+/* XXH128, custom secret */
+static const XSUM_testdata128_t XSUM_XXH128_withSecret_testdata[] = {
+    {  0, 0, { 0x005923CCEECBE8AEULL, 0x5F70F4EA232F1D38ULL } },  /* empty string */
+    {  1, 0, { 0x8A52451418B2DA4DULL, 0x3A66AF5A9819198EULL } },  /*  1 -  3 */
+    {  6, 0, { 0x0B61C8ACA7D4778FULL, 0x376BD91B6432F36DULL } },  /*  4 -  8 */
+    { 12, 0, { 0xAF82F6EBA263D7D8ULL, 0x90A3C2D839F57D0FULL } }   /*  9 - 16 */
+};
+
+static const XSUM_testdata_sample_t XSUM_XXH3_generateSecret_testdata[] = {
+    {                              0, { 0xB8, 0x26, 0x83, 0x7E } },
+    {                              1, { 0xA6, 0x16, 0x06, 0x7B } },
+    {     XXH3_SECRET_SIZE_MIN -   1, { 0xDA, 0x2A, 0x12, 0x11 } },
+    { XXH3_SECRET_DEFAULT_SIZE + 500, { 0x7E, 0x48, 0x0C, 0xA7 } }
+};
+
 static void XSUM_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
 {
     static int nbTests = 1;
@@ -111,13 +263,22 @@ static void XSUM_checkResult128(XXH128_hash_t r1, XXH128_hash_t r2)
 }
 
 
-static void XSUM_testXXH32(const void* data, size_t len, XSUM_U32 seed, XSUM_U32 Nresult)
+static void XSUM_testXXH32(const void* data, const XSUM_testdata32_t* testData)
 {
     XXH32_state_t *state = XXH32_createState();
     size_t pos;
 
+    size_t len = testData->len;
+    XSUM_U32 seed = testData->seed;
+    XSUM_U32 Nresult = testData->Nresult;
+
+    if (len == 0) {
+        data = NULL;
+    } else {
+        assert(data != NULL);
+    }
+
     assert(state != NULL);
-    if (len>0) assert(data != NULL);
 
     XSUM_checkResult32(XXH32(data, len, seed), Nresult);
 
@@ -132,13 +293,22 @@ static void XSUM_testXXH32(const void* data, size_t len, XSUM_U32 seed, XSUM_U32
     XXH32_freeState(state);
 }
 
-static void XSUM_testXXH64(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
+static void XSUM_testXXH64(const void* data, const XSUM_testdata64_t* testData)
 {
     XXH64_state_t *state = XXH64_createState();
     size_t pos;
 
+    size_t len = (size_t)testData->len;
+    XSUM_U64 seed = testData->seed;
+    XSUM_U64 Nresult = testData->Nresult;
+
+    if (len == 0) {
+        data = NULL;
+    } else {
+        assert(data != NULL);
+    }
+
     assert(state != NULL);
-    if (len>0) assert(data != NULL);
 
     XSUM_checkResult64(XXH64(data, len, seed), Nresult);
 
@@ -153,6 +323,10 @@ static void XSUM_testXXH64(const void* data, size_t len, XSUM_U64 seed, XSUM_U64
     XXH64_freeState(state);
 }
 
+/*
+ * Used to get "random" (but actually 100% reproducible) lengths for
+ * XSUM_XXH3_randomUpdate.
+ */
 static XSUM_U32 XSUM_rand(void)
 {
     static XSUM_U64 seed = PRIME32;
@@ -160,11 +334,40 @@ static XSUM_U32 XSUM_rand(void)
     return (XSUM_U32)(seed >> 40);
 }
 
+/*
+ * Technically, XXH3_64bits_update is identical to XXH3_128bits_update as of
+ * v0.8.0, but we treat them as separate.
+ */
+typedef XXH_errorcode (*XSUM_XXH3_update_t)(XXH3_state_t* state, const void* input, size_t length);
 
-static void XSUM_testXXH3(const void* data, size_t len, XSUM_U64 seed, XSUM_U64 Nresult)
+/*
+ * Runs the passed XXH3_update variant on random lengths. This is to test the
+ * more complex logic of the update function, catching bugs like this one:
+ *    https://github.com/Cyan4973/xxHash/issues/378
+ */
+static void XSUM_XXH3_randomUpdate(XXH3_state_t* state, const void* data,
+                                   size_t len, XSUM_XXH3_update_t update_fn)
 {
-    if (len>0) assert(data != NULL);
+    size_t p = 0;
+    while (p < len) {
+        size_t const modulo = len > 2 ? len : 2;
+        size_t l = (size_t)(XSUM_rand()) % modulo;
+        if (p + l > len) l = len - p;
+        (void)update_fn(state, (const char*)data+p, l);
+        p += l;
+    }
+}
 
+static void XSUM_testXXH3(const void* data, const XSUM_testdata64_t* testData)
+{
+    size_t len = testData->len;
+    XSUM_U64 seed = testData->seed;
+    XSUM_U64 Nresult = testData->Nresult;
+    if (len == 0) {
+        data = NULL;
+    } else {
+        assert(data != NULL);
+    }
     {   XSUM_U64 const Dresult = XXH3_64bits_withSeed(data, len, seed);
         XSUM_checkResult64(Dresult, Nresult);
     }
@@ -184,17 +387,9 @@ static void XSUM_testXXH3(const void* data, size_t len, XSUM_U64 seed, XSUM_U64
         XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
 
         /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_64bits_reset_withSeed(state, seed);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_64bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-        }
+        (void)XXH3_64bits_reset_withSeed(state, seed);
+        XSUM_XXH3_randomUpdate(state, data, len, &XXH3_64bits_update);
+        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
 
         /* byte by byte ingestion */
         {   size_t pos;
@@ -207,10 +402,17 @@ static void XSUM_testXXH3(const void* data, size_t len, XSUM_U64 seed, XSUM_U64
     }
 }
 
-static void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XSUM_U64 Nresult)
+static void XSUM_testXXH3_withSecret(const void* data, const void* secret,
+                                     size_t secretSize, const XSUM_testdata64_t* testData)
 {
-    if (len>0) assert(data != NULL);
+    size_t len = (size_t)testData->len;
+    XSUM_U64 Nresult = testData->Nresult;
 
+    if (len == 0) {
+        data = NULL;
+    } else {
+        assert(data != NULL);
+    }
     {   XSUM_U64 const Dresult = XXH3_64bits_withSecret(data, len, secret, secretSize);
         XSUM_checkResult64(Dresult, Nresult);
     }
@@ -223,17 +425,9 @@ static void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* s
         XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
 
         /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_64bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
-        }
+        (void)XXH3_64bits_reset_withSecret(state, secret, secretSize);
+        XSUM_XXH3_randomUpdate(state, data, len, &XXH3_64bits_update);
+        XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
 
         /* byte by byte ingestion */
         {   size_t pos;
@@ -246,8 +440,17 @@ static void XSUM_testXXH3_withSecret(const void* data, size_t len, const void* s
     }
 }
 
-static void XSUM_testXXH128(const void* data, size_t len, XSUM_U64 seed, XXH128_hash_t Nresult)
+static void XSUM_testXXH128(const void* data, const XSUM_testdata128_t* testData)
 {
+    size_t len = (size_t)testData->len;
+    XSUM_U64 seed = testData->seed;
+    XXH128_hash_t const Nresult = testData->Nresult;
+    if (len == 0) {
+        data = NULL;
+    } else {
+        assert(data != NULL);
+    }
+
     {   XXH128_hash_t const Dresult = XXH3_128bits_withSeed(data, len, seed);
         XSUM_checkResult128(Dresult, Nresult);
     }
@@ -273,17 +476,9 @@ static void XSUM_testXXH128(const void* data, size_t len, XSUM_U64 seed, XXH128_
         XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
 
         /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_128bits_reset_withSeed(state, seed);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_128bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-        }
+        (void)XXH3_128bits_reset_withSeed(state, seed);
+        XSUM_XXH3_randomUpdate(state, data, len, &XXH3_128bits_update);
+        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
 
         /* byte by byte ingestion */
         {   size_t pos;
@@ -296,10 +491,15 @@ static void XSUM_testXXH128(const void* data, size_t len, XSUM_U64 seed, XXH128_
     }
 }
 
-static void XSUM_testXXH128_withSecret(const void* data, size_t len, const void* secret, size_t secretSize, XXH128_hash_t Nresult)
+static void XSUM_testXXH128_withSecret(const void* data, const void* secret, size_t secretSize, const XSUM_testdata128_t* testData)
 {
-    if (len>0) assert(data != NULL);
-
+    size_t len = testData->len;
+    XXH128_hash_t Nresult = testData->Nresult;
+    if (len == 0) {
+        data = NULL;
+    } else if (len>0) {
+        assert(data != NULL);
+    }
     {   XXH128_hash_t const Dresult = XXH3_128bits_withSecret(data, len, secret, secretSize);
         XSUM_checkResult128(Dresult, Nresult);
     }
@@ -312,17 +512,9 @@ static void XSUM_testXXH128_withSecret(const void* data, size_t len, const void*
         XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
 
         /* random ingestion */
-        {   size_t p = 0;
-            (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
-            while (p < len) {
-                size_t const modulo = len > 2 ? len : 2;
-                size_t l = (size_t)(XSUM_rand()) % modulo;
-                if (p + l > len) l = len - p;
-                (void)XXH3_128bits_update(state, (const char*)data+p, l);
-                p += l;
-            }
-            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
-        }
+        (void)XXH3_128bits_reset_withSecret(state, secret, secretSize);
+        XSUM_XXH3_randomUpdate(state, data, len, &XXH3_128bits_update);
+        XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
 
         /* byte by byte ingestion */
         {   size_t pos;
@@ -335,32 +527,28 @@ static void XSUM_testXXH128_withSecret(const void* data, size_t len, const void*
     }
 }
 
-#define SECRET_SAMPLE_NBBYTES 4
-typedef struct { XSUM_U8 byte[SECRET_SAMPLE_NBBYTES]; } verifSample_t;
-
-static void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSample_t result)
+static void XSUM_testSecretGenerator(const void* customSeed, const XSUM_testdata_sample_t* testData)
 {
     static int nbTests = 1;
     const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};
     XSUM_U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
-    verifSample_t samples;
+    XSUM_U8 samples[SECRET_SAMPLE_NBBYTES];
     int i;
 
-    XXH3_generateSecret(secretBuffer, customSeed, len);
+    XXH3_generateSecret(secretBuffer, customSeed, testData->len);
     for (i=0; i<SECRET_SAMPLE_NBBYTES; i++) {
-        samples.byte[i] = secretBuffer[sampleIndex[i]];
+        samples[i] = secretBuffer[sampleIndex[i]];
     }
-    if (memcmp(&samples, &result, sizeof(result))) {
+    if (memcmp(samples, testData->byte, sizeof(testData->byte))) {
         XSUM_log("\rError: Secret generation test %i: Internal sanity check failed. \n", nbTests);
         XSUM_log("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
-                samples.byte[0], samples.byte[1], samples.byte[2], samples.byte[3],
-                result.byte[0], result.byte[1], result.byte[2], result.byte[3] );
+                samples[0], samples[1], samples[2], samples[3],
+                testData->byte[0], testData->byte[1], testData->byte[2], testData->byte[3] );
         exit(1);
     }
     nbTests++;
 }
 
-
 /*!
  * XSUM_sanityCheck():
  * Runs a sanity check before the benchmark.
@@ -369,192 +557,42 @@ static void XSUM_testSecretGenerator(const void* customSeed, size_t len, verifSa
  */
 XSUM_API void XSUM_sanityCheck(void)
 {
+    size_t i;
 #define SANITY_BUFFER_SIZE 2367
     XSUM_U8 sanityBuffer[SANITY_BUFFER_SIZE];
-    XSUM_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
+    const void* const secret = sanityBuffer + 7;
+    const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
+    assert(sizeof(sanityBuffer) >= 7 + secretSize);
 
-    XSUM_testXXH32(NULL,          0, 0,       0x02CC5D05);
-    XSUM_testXXH32(NULL,          0, PRIME32, 0x36B78AE7);
-    XSUM_testXXH32(sanityBuffer,  1, 0,       0xCF65B03E);
-    XSUM_testXXH32(sanityBuffer,  1, PRIME32, 0xB4545AA4);
-    XSUM_testXXH32(sanityBuffer, 14, 0,       0x1208E7E2);
-    XSUM_testXXH32(sanityBuffer, 14, PRIME32, 0x6AF1D1FE);
-    XSUM_testXXH32(sanityBuffer,222, 0,       0x5BD11DBD);
-    XSUM_testXXH32(sanityBuffer,222, PRIME32, 0x58803C5F);
-
-    XSUM_testXXH64(NULL        ,  0, 0,       0xEF46DB3751D8E999ULL);
-    XSUM_testXXH64(NULL        ,  0, PRIME32, 0xAC75FDA2929B17EFULL);
-    XSUM_testXXH64(sanityBuffer,  1, 0,       0xE934A84ADB052768ULL);
-    XSUM_testXXH64(sanityBuffer,  1, PRIME32, 0x5014607643A9B4C3ULL);
-    XSUM_testXXH64(sanityBuffer,  4, 0,       0x9136A0DCA57457EEULL);
-    XSUM_testXXH64(sanityBuffer, 14, 0,       0x8282DCC4994E35C8ULL);
-    XSUM_testXXH64(sanityBuffer, 14, PRIME32, 0xC3BD6BF63DEB6DF0ULL);
-    XSUM_testXXH64(sanityBuffer,222, 0,       0xB641AE8CB691C174ULL);
-    XSUM_testXXH64(sanityBuffer,222, PRIME32, 0x20CB8AB7AE10C14AULL);
-
-    XSUM_testXXH3(NULL,           0, 0,       0x2D06800538D394C2ULL);  /* empty string */
-    XSUM_testXXH3(NULL,           0, PRIME64, 0xA8A6B918B2F0364AULL);
-    XSUM_testXXH3(sanityBuffer,   1, 0,       0xC44BDFF4074EECDBULL);  /*  1 -  3 */
-    XSUM_testXXH3(sanityBuffer,   1, PRIME64, 0x032BE332DD766EF8ULL);  /*  1 -  3 */
-    XSUM_testXXH3(sanityBuffer,   6, 0,       0x27B56A84CD2D7325ULL);  /*  4 -  8 */
-    XSUM_testXXH3(sanityBuffer,   6, PRIME64, 0x84589C116AB59AB9ULL);  /*  4 -  8 */
-    XSUM_testXXH3(sanityBuffer,  12, 0,       0xA713DAF0DFBB77E7ULL);  /*  9 - 16 */
-    XSUM_testXXH3(sanityBuffer,  12, PRIME64, 0xE7303E1B2336DE0EULL);  /*  9 - 16 */
-    XSUM_testXXH3(sanityBuffer,  24, 0,       0xA3FE70BF9D3510EBULL);  /* 17 - 32 */
-    XSUM_testXXH3(sanityBuffer,  24, PRIME64, 0x850E80FC35BDD690ULL);  /* 17 - 32 */
-    XSUM_testXXH3(sanityBuffer,  48, 0,       0x397DA259ECBA1F11ULL);  /* 33 - 64 */
-    XSUM_testXXH3(sanityBuffer,  48, PRIME64, 0xADC2CBAA44ACC616ULL);  /* 33 - 64 */
-    XSUM_testXXH3(sanityBuffer,  80, 0,       0xBCDEFBBB2C47C90AULL);  /* 65 - 96 */
-    XSUM_testXXH3(sanityBuffer,  80, PRIME64, 0xC6DD0CB699532E73ULL);  /* 65 - 96 */
-    XSUM_testXXH3(sanityBuffer, 195, 0,       0xCD94217EE362EC3AULL);  /* 129-240 */
-    XSUM_testXXH3(sanityBuffer, 195, PRIME64, 0xBA68003D370CB3D9ULL);  /* 129-240 */
-
-    XSUM_testXXH3(sanityBuffer, 403, 0,       0xCDEB804D65C6DEA4ULL);  /* one block, last stripe is overlapping */
-    XSUM_testXXH3(sanityBuffer, 403, PRIME64, 0x6259F6ECFD6443FDULL);  /* one block, last stripe is overlapping */
-    XSUM_testXXH3(sanityBuffer, 512, 0,       0x617E49599013CB6BULL);  /* one block, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer, 512, PRIME64, 0x3CE457DE14C27708ULL);  /* one block, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer,2048, 0,       0xDD59E2C3A5F038E0ULL);  /* 2 blocks, finishing at block boundary */
-    XSUM_testXXH3(sanityBuffer,2048, PRIME64, 0x66F81670669ABABCULL);  /* 2 blocks, finishing at block boundary */
-    XSUM_testXXH3(sanityBuffer,2240, 0,       0x6E73A90539CF2948ULL);  /* 3 blocks, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer,2240, PRIME64, 0x757BA8487D1B5247ULL);  /* 3 blocks, finishing at stripe boundary */
-    XSUM_testXXH3(sanityBuffer,2367, 0,       0xCB37AEB9E5D361EDULL);  /* 3 blocks, last stripe is overlapping */
-    XSUM_testXXH3(sanityBuffer,2367, PRIME64, 0xD2DB3415B942B42AULL);  /* 3 blocks, last stripe is overlapping */
-
-    /* XXH3 with Custom Secret */
-    {   const void* const secret = sanityBuffer + 7;
-        const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
-        assert(sizeof(sanityBuffer) >= 7 + secretSize);
-        XSUM_testXXH3_withSecret(NULL,           0, secret, secretSize, 0x3559D64878C5C66CULL);  /* empty string */
-        XSUM_testXXH3_withSecret(sanityBuffer,   1, secret, secretSize, 0x8A52451418B2DA4DULL);  /*  1 -  3 */
-        XSUM_testXXH3_withSecret(sanityBuffer,   6, secret, secretSize, 0x82C90AB0519369ADULL);  /*  4 -  8 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  12, secret, secretSize, 0x14631E773B78EC57ULL);  /*  9 - 16 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  24, secret, secretSize, 0xCDD5542E4A9D9FE8ULL);  /* 17 - 32 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  48, secret, secretSize, 0x33ABD54D094B2534ULL);  /* 33 - 64 */
-        XSUM_testXXH3_withSecret(sanityBuffer,  80, secret, secretSize, 0xE687BA1684965297ULL);  /* 65 - 96 */
-        XSUM_testXXH3_withSecret(sanityBuffer, 195, secret, secretSize, 0xA057273F5EECFB20ULL);  /* 129-240 */
-
-        XSUM_testXXH3_withSecret(sanityBuffer, 403, secret, secretSize, 0x14546019124D43B8ULL);  /* one block, last stripe is overlapping */
-        XSUM_testXXH3_withSecret(sanityBuffer, 512, secret, secretSize, 0x7564693DD526E28DULL);  /* one block, finishing at stripe boundary */
-        XSUM_testXXH3_withSecret(sanityBuffer,2048, secret, secretSize, 0xD32E975821D6519FULL);  /* >= 2 blocks, at least one scrambling */
-        XSUM_testXXH3_withSecret(sanityBuffer,2367, secret, secretSize, 0x293FA8E5173BB5E7ULL);  /* >= 2 blocks, at least one scrambling, last stripe unaligned */
-
-        XSUM_testXXH3_withSecret(sanityBuffer,64*10*3, secret, secretSize, 0x751D2EC54BC6038BULL);  /* exactly 3 full blocks, not a multiple of 256 */
-    }
+    XSUM_fillTestBuffer(sanityBuffer, sizeof(sanityBuffer));
 
-    /* XXH128 */
-    {   XXH128_hash_t const expected = { 0x6001C324468D497FULL, 0x99AA06D3014798D8ULL };
-        XSUM_testXXH128(NULL,           0, 0,     expected);         /* empty string */
-    }
-    {   XXH128_hash_t const expected = { 0x5444F7869C671AB0ULL, 0x92220AE55E14AB50ULL };
-        XSUM_testXXH128(NULL,           0, PRIME32, expected);
-    }
-    {   XXH128_hash_t const expected = { 0xC44BDFF4074EECDBULL, 0xA6CD5E9392000F6AULL };
-        XSUM_testXXH128(sanityBuffer,   1, 0,       expected);       /* 1-3 */
-    }
-    {   XXH128_hash_t const expected = { 0xB53D5557E7F76F8DULL, 0x89B99554BA22467CULL };
-        XSUM_testXXH128(sanityBuffer,   1, PRIME32, expected);       /* 1-3 */
-    }
-    {   XXH128_hash_t const expected = { 0x3E7039BDDA43CFC6ULL, 0x082AFE0B8162D12AULL };
-        XSUM_testXXH128(sanityBuffer,   6, 0,       expected);       /* 4-8 */
-    }
-    {   XXH128_hash_t const expected = { 0x269D8F70BE98856EULL, 0x5A865B5389ABD2B1ULL };
-        XSUM_testXXH128(sanityBuffer,   6, PRIME32, expected);       /* 4-8 */
-    }
-    {   XXH128_hash_t const expected = { 0x061A192713F69AD9ULL, 0x6E3EFD8FC7802B18ULL };
-        XSUM_testXXH128(sanityBuffer,  12, 0,       expected);       /* 9-16 */
-    }
-    {   XXH128_hash_t const expected = { 0x9BE9F9A67F3C7DFBULL, 0xD7E09D518A3405D3ULL };
-        XSUM_testXXH128(sanityBuffer,  12, PRIME32, expected);       /* 9-16 */
-    }
-    {   XXH128_hash_t const expected = { 0x1E7044D28B1B901DULL, 0x0CE966E4678D3761ULL };
-        XSUM_testXXH128(sanityBuffer,  24, 0,       expected);       /* 17-32 */
-    }
-    {   XXH128_hash_t const expected = { 0xD7304C54EBAD40A9ULL, 0x3162026714A6A243ULL };
-        XSUM_testXXH128(sanityBuffer,  24, PRIME32, expected);       /* 17-32 */
-    }
-    {   XXH128_hash_t const expected = { 0xF942219AED80F67BULL, 0xA002AC4E5478227EULL };
-        XSUM_testXXH128(sanityBuffer,  48, 0,       expected);       /* 33-64 */
-    }
-    {   XXH128_hash_t const expected = { 0x7BA3C3E453A1934EULL, 0x163ADDE36C072295ULL };
-        XSUM_testXXH128(sanityBuffer,  48, PRIME32, expected);       /* 33-64 */
-    }
-    {   XXH128_hash_t const expected = { 0x5E8BAFB9F95FB803ULL, 0x4952F58181AB0042ULL };
-        XSUM_testXXH128(sanityBuffer,  81, 0,       expected);       /* 65-96 */
+    /* XXH32 */
+    for (i = 0; i < (sizeof(XSUM_XXH32_testdata)/sizeof(XSUM_XXH32_testdata[0])); i++) {
+        XSUM_testXXH32(sanityBuffer, &XSUM_XXH32_testdata[i]);
     }
-    {   XXH128_hash_t const expected = { 0x703FBB3D7A5F755CULL, 0x2724EC7ADC750FB6ULL };
-        XSUM_testXXH128(sanityBuffer,  81, PRIME32, expected);       /* 65-96 */
+    /* XXH64 */
+    for (i = 0; i < (sizeof(XSUM_XXH64_testdata)/sizeof(XSUM_XXH64_testdata[0])); i++) {
+        XSUM_testXXH64(sanityBuffer, &XSUM_XXH64_testdata[i]);
     }
-    {   XXH128_hash_t const expected = { 0xF1AEBD597CEC6B3AULL, 0x337E09641B948717ULL };
-        XSUM_testXXH128(sanityBuffer, 222, 0,       expected);       /* 129-240 */
+    /* XXH3_64bits, seeded */
+    for (i = 0; i < (sizeof(XSUM_XXH3_testdata)/sizeof(XSUM_XXH3_testdata[0])); i++) {
+        XSUM_testXXH3(sanityBuffer, &XSUM_XXH3_testdata[i]);
     }
-    {   XXH128_hash_t const expected = { 0xAE995BB8AF917A8DULL, 0x91820016621E97F1ULL };
-        XSUM_testXXH128(sanityBuffer, 222, PRIME32, expected);       /* 129-240 */
+    /* XXH3_64bits, custom secret */
+    for (i = 0; i < (sizeof(XSUM_XXH3_withSecret_testdata)/sizeof(XSUM_XXH3_withSecret_testdata[0])); i++) {
+        XSUM_testXXH3_withSecret(sanityBuffer, secret, secretSize, &XSUM_XXH3_withSecret_testdata[i]);
     }
-    {   XXH128_hash_t const expected = { 0xCDEB804D65C6DEA4ULL, 0x1B6DE21E332DD73DULL };
-        XSUM_testXXH128(sanityBuffer, 403, 0,       expected);       /* one block, last stripe is overlapping */
-    }
-    {   XXH128_hash_t const expected = { 0x6259F6ECFD6443FDULL, 0xBED311971E0BE8F2ULL };
-        XSUM_testXXH128(sanityBuffer, 403, PRIME64, expected);       /* one block, last stripe is overlapping */
-    }
-    {   XXH128_hash_t const expected = { 0x617E49599013CB6BULL, 0x18D2D110DCC9BCA1ULL };
-        XSUM_testXXH128(sanityBuffer, 512, 0,       expected);       /* one block, finishing at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0x3CE457DE14C27708ULL, 0x925D06B8EC5B8040ULL };
-        XSUM_testXXH128(sanityBuffer, 512, PRIME64, expected);       /* one block, finishing at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0xDD59E2C3A5F038E0ULL, 0xF736557FD47073A5ULL };
-        XSUM_testXXH128(sanityBuffer,2048, 0,       expected);       /* two blocks, finishing at block boundary */
-    }
-    {   XXH128_hash_t const expected = { 0x230D43F30206260BULL, 0x7FB03F7E7186C3EAULL };
-        XSUM_testXXH128(sanityBuffer,2048, PRIME32, expected);       /* two blocks, finishing at block boundary */
-    }
-    {   XXH128_hash_t const expected = { 0x6E73A90539CF2948ULL, 0xCCB134FBFA7CE49DULL };
-        XSUM_testXXH128(sanityBuffer,2240, 0,       expected);      /* two blocks, ends at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0xED385111126FBA6FULL, 0x50A1FE17B338995FULL };
-        XSUM_testXXH128(sanityBuffer,2240, PRIME32, expected);       /* two blocks, ends at stripe boundary */
-    }
-    {   XXH128_hash_t const expected = { 0xCB37AEB9E5D361EDULL, 0xE89C0F6FF369B427ULL };
-        XSUM_testXXH128(sanityBuffer,2367, 0,       expected);       /* two blocks, last stripe is overlapping */
-    }
-    {   XXH128_hash_t const expected = { 0x6F5360AE69C2F406ULL, 0xD23AAE4B76C31ECBULL };
-        XSUM_testXXH128(sanityBuffer,2367, PRIME32, expected);       /* two blocks, last stripe is overlapping */
+    /* XXH128 */
+    for (i = 0; i < (sizeof(XSUM_XXH128_testdata)/sizeof(XSUM_XXH128_testdata[0])); i++) {
+        XSUM_testXXH128(sanityBuffer, &XSUM_XXH128_testdata[i]);
     }
-
     /* XXH128 with custom Secret */
-    {   const void* const secret = sanityBuffer + 7;
-        const size_t secretSize = XXH3_SECRET_SIZE_MIN + 11;
-        assert(sizeof(sanityBuffer) >= 7 + secretSize);
-
-        {   XXH128_hash_t const expected = { 0x005923CCEECBE8AEULL, 0x5F70F4EA232F1D38ULL };
-            XSUM_testXXH128_withSecret(NULL,           0, secret, secretSize,     expected);         /* empty string */
-        }
-        {   XXH128_hash_t const expected = { 0x8A52451418B2DA4DULL, 0x3A66AF5A9819198EULL };
-            XSUM_testXXH128_withSecret(sanityBuffer,   1, secret, secretSize,       expected);       /* 1-3 */
-        }
-        {   XXH128_hash_t const expected = { 0x0B61C8ACA7D4778FULL, 0x376BD91B6432F36DULL };
-            XSUM_testXXH128_withSecret(sanityBuffer,   6, secret, secretSize,       expected);       /* 4-8 */
-        }
-        {   XXH128_hash_t const expected = { 0xAF82F6EBA263D7D8ULL, 0x90A3C2D839F57D0FULL };
-            XSUM_testXXH128_withSecret(sanityBuffer,  12, secret, secretSize,       expected);       /* 9-16 */
-        }
+    for (i = 0; i < (sizeof(XSUM_XXH128_withSecret_testdata)/sizeof(XSUM_XXH128_withSecret_testdata[0])); i++) {
+        XSUM_testXXH128_withSecret(sanityBuffer, secret, secretSize, &XSUM_XXH128_withSecret_testdata[i]);
     }
-
     /* secret generator */
-    {   verifSample_t const expected = { { 0xB8, 0x26, 0x83, 0x7E } };
-        XSUM_testSecretGenerator(NULL, 0, expected);
-    }
-
-    {   verifSample_t const expected = { { 0xA6, 0x16, 0x06, 0x7B } };
-        XSUM_testSecretGenerator(sanityBuffer, 1, expected);
-    }
-
-    {   verifSample_t const expected = { { 0xDA, 0x2A, 0x12, 0x11 } };
-        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_SIZE_MIN - 1, expected);
-    }
-
-    {   verifSample_t const expected = { { 0x7E, 0x48, 0x0C, 0xA7 } };
-        XSUM_testSecretGenerator(sanityBuffer, XXH3_SECRET_DEFAULT_SIZE + 500, expected);
+    for (i = 0; i < (sizeof(XSUM_XXH3_generateSecret_testdata)/sizeof(XSUM_XXH3_generateSecret_testdata[0])); i++) {
+        XSUM_testSecretGenerator(sanityBuffer, &XSUM_XXH3_generateSecret_testdata[i]);
     }
 
     XSUM_logVerbose(3, "\r%70s\r", "");       /* Clean display line */

From 15f61dbe5504b755f752b04aa8e294ce46dda07a Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 26 Sep 2020 10:24:57 -0700
Subject: [PATCH 025/187] fix compilation on macOS

added macOS test to travisCI
---
 .travis.yml             | 10 +++++++++-
 cli/xsum_sanity_check.h |  5 ++++-
 2 files changed, 13 insertions(+), 2 deletions(-)

diff --git a/.travis.yml b/.travis.yml
index 2f3a2168..db7380e5 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -42,6 +42,14 @@ matrix:
         - CPPFLAGS=-DXXH_REROLL=1 make check   # reroll code path (#240)
         - make -C tests/bench
 
+    - name: macOS General Test
+      os: osx
+      compiler: clang
+      script:
+        - make   # test library build
+        - make clean
+        - make test MOREFLAGS='-Werror' | tee # test scenario where `stdout` is not the console
+
     - name: ARM compilation and consistency checks (Qemu)
       dist: xenial
       arch: amd64
@@ -111,7 +119,7 @@ matrix:
         - CPPFLAGS=-DXXH_VECTOR=XXH_VSX CFLAGS="-O3 -maltivec -mvsx -mpower8-vector -mcpu=power8" LDFLAGS="-static" make check
         # altivec.h redefinition issue #426
         - make clean
-        - CPPFLAGS=-DXXH_VECTOR=XXH_VSX CFLAGS="-maltivec -mvsx -mcpu=power8 -mpower8-vector" make -C tests test_ppc_redefine 
+        - CPPFLAGS=-DXXH_VECTOR=XXH_VSX CFLAGS="-maltivec -mvsx -mcpu=power8 -mpower8-vector" make -C tests test_ppc_redefine
 
     - name: IBM s390x compilation and consistency checks
       dist: bionic
diff --git a/cli/xsum_sanity_check.h b/cli/xsum_sanity_check.h
index a3f57a16..9f3f2b85 100644
--- a/cli/xsum_sanity_check.h
+++ b/cli/xsum_sanity_check.h
@@ -26,7 +26,10 @@
 #ifndef XSUM_SANITY_CHECK_H
 #define XSUM_SANITY_CHECK_H
 
-#include "xsum_config.h"
+#include "xsum_config.h"  /* XSUM_API, XSUM_U8 */
+
+#include <stddef.h>   /* size_t */
+
 
 #ifdef __cplusplus
 extern "C" {

From 56ea9135893dea4205833e8948bf32aec632f168 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 26 Sep 2020 10:38:27 -0700
Subject: [PATCH 026/187] fixed cat /proc/cpuinfo for macOS

---
 .travis.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.travis.yml b/.travis.yml
index db7380e5..675965f7 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -2,7 +2,7 @@ language: c
 
 # Dump CPU info before start
 before_install:
-  - cat /proc/cpuinfo
+  - cat /proc/cpuinfo || echo /proc/cpuinfo is not present
 
 matrix:
   fast_finish: true

From c6eb8210738ad4e3e6c74497288580a20231eee2 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 28 Sep 2020 09:21:51 -0700
Subject: [PATCH 027/187] added repology status

---
 README.md | 15 +++++++++++----
 1 file changed, 11 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index 1207969a..b976e3d1 100644
--- a/README.md
+++ b/README.md
@@ -220,10 +220,17 @@ thanks to great contributors.
 They are [listed here](http://www.xxhash.com/#other-languages).
 
 
-### Special Thanks
+### Packaging status
+
+Many distributions bundle a package manager
+which allows easy xxhash installation as both a `libxxhash` library
+and `xxhsum` command line interface.
 
-Takayuki Matsuoka, aka @t-mat, for creating `xxhsum -c` and general support during early xxh releases
+[![Packaging status](https://repology.org/badge/vertical-allrepos/xxhash.svg)](https://repology.org/project/xxhash/versions)
 
-Mathias Westerdahl, aka @JCash, for introducing the first version of `XXH64`
 
-Devin Hussey, aka @easyaspi314, for excellent low-level optimizations on `XXH3` and `XXH128`
+### Special Thanks
+
+- Takayuki Matsuoka, aka @t-mat, for creating `xxhsum -c` and great support during early xxh releases
+- Mathias Westerdahl, aka @JCash, for introducing the first version of `XXH64`
+- Devin Hussey, aka @easyaspi314, for incredible low-level optimizations on `XXH3` and `XXH128`

From d83cce5b91acf299535b8d3762285cc451b5c34a Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Wed, 30 Sep 2020 15:56:07 -0700
Subject: [PATCH 028/187] fix prefetching for x86 32-bit target on MSVC

---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 2ba034a5..b5e57b1b 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2481,7 +2481,7 @@ XXH_FORCE_INLINE xxh_u64x2 XXH_vec_mule(xxh_u32x4 a, xxh_u32x4 b)
 #if defined(XXH_NO_PREFETCH)
 #  define XXH_PREFETCH(ptr)  (void)(ptr)  /* disabled */
 #else
-#  if defined(_MSC_VER) && (defined(_M_X64) || defined(_M_I86))  /* _mm_prefetch() is not defined outside of x86/x64 */
+#  if defined(_MSC_VER) && (defined(_M_X64) || defined(_M_IX86))  /* _mm_prefetch() not defined outside of x86/x64 */
 #    include <mmintrin.h>   /* https://msdn.microsoft.com/fr-fr/library/84szxsww(v=vs.90).aspx */
 #    define XXH_PREFETCH(ptr)  _mm_prefetch((const char*)(ptr), _MM_HINT_T0)
 #  elif defined(__GNUC__) && ( (__GNUC__ >= 4) || ( (__GNUC__ == 3) && (__GNUC_MINOR__ >= 1) ) )

From ba6fdd9d21e16fdea5a28560bedc69dc91bcf846 Mon Sep 17 00:00:00 2001
From: Alex Pankratov <ap@swapped.ch>
Date: Thu, 8 Oct 2020 12:33:13 +0200
Subject: [PATCH 029/187] Fix xxh_x86dispatch build with MSVC

---
 xxh_x86dispatch.c | 25 +++++++++++++------------
 1 file changed, 13 insertions(+), 12 deletions(-)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index a618ae89..384004d2 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -44,8 +44,19 @@ extern "C" {
 #  error "Dispatching is currently only supported on x86 and x86_64."
 #endif
 
-#ifndef __GNUC__
-#  error "Dispatching requires __attribute__((__target__)) capability"
+#if defined(__GNUC__)
+#  include <immintrin.h> /* sse2 */
+#  include <emmintrin.h> /* avx2 */
+#  define XXH_TARGET_AVX512 __attribute__((__target__("avx512f")))
+#  define XXH_TARGET_AVX2 __attribute__((__target__("avx2")))
+#  define XXH_TARGET_SSE2 __attribute__((__target__("sse2")))
+#elif defined(_MSC_VER)
+#  include <intrin.h>
+#  define XXH_TARGET_AVX512
+#  define XXH_TARGET_AVX2
+#  define XXH_TARGET_SSE2
+#else
+#  error "Dispatching is currently not supported for your compiler."
 #endif
 
 #define XXH_DISPATCH_AVX2    /* enable dispatch towards AVX2 */
@@ -62,18 +73,8 @@ extern "C" {
 #endif
 #include <assert.h>
 
-#if defined(__GNUC__)
-#  include <immintrin.h> /* sse2 */
-#  include <emmintrin.h> /* avx2 */
-#elif defined(_MSC_VER)
-#  include <intrin.h>
-#endif
-
 #define XXH_INLINE_ALL
 #define XXH_X86DISPATCH
-#define XXH_TARGET_AVX512 __attribute__((__target__("avx512f")))
-#define XXH_TARGET_AVX2 __attribute__((__target__("avx2")))
-#define XXH_TARGET_SSE2 __attribute__((__target__("sse2")))
 #include "xxhash.h"
 
 /*

From 45b2cb406dcd16ebdc5887f33a18c7a1c9c7f6fa Mon Sep 17 00:00:00 2001
From: Alex Pankratov <ap@swapped.ch>
Date: Thu, 8 Oct 2020 12:36:27 +0200
Subject: [PATCH 030/187] remove duplicate 'intrin.h' include, move Intel's
 guide link to a proper place

---
 xxh_x86dispatch.c | 14 +++++---------
 1 file changed, 5 insertions(+), 9 deletions(-)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 384004d2..1fc6fac8 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -77,14 +77,6 @@ extern "C" {
 #define XXH_X86DISPATCH
 #include "xxhash.h"
 
-/*
- * Modified version of Intel's guide
- * https://software.intel.com/en-us/articles/how-to-detect-new-instruction-support-in-the-4th-generation-intel-core-processor-family
- */
-#if defined(_MSC_VER)
-# include <intrin.h>
-#endif
-
 /*
  * Support both AT&T and Intel dialects
  *
@@ -103,7 +95,6 @@ extern "C" {
 #  define I_ATT(intel, att) "{" att "|" intel "}\n\t"
 #endif
 
-
 static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
 {
 #if defined(_MSC_VER)
@@ -134,6 +125,11 @@ static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
 #endif
 }
 
+/*
+ * Modified version of Intel's guide
+ * https://software.intel.com/en-us/articles/how-to-detect-new-instruction-support-in-the-4th-generation-intel-core-processor-family
+ */
+
 #if defined(XXH_DISPATCH_AVX2) || defined(XXH_DISPATCH_AVX512)
 /*
  * While the CPU may support AVX2, the operating system might not properly save

From e1784dda98688221509d9562a18827fae685e6f0 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Thu, 15 Oct 2020 18:33:27 -0400
Subject: [PATCH 031/187] [WIP] Doxygenize and redocument some funcs

XXH32's API, XXH*_state_s, the config macros, and some other things are
documented in Doxygen format, basic Doxyfile added.

Some functions were also redocumented in detail.

Still need to work on structuring the main page (set it to README.md?).

How to view docs for now:

 $ sudo apt install doxygen
 $ npm install -g http-server # or you can use LLVM's scan-view
 $ doxygen && hs doxygen/html -s -c-1 -o
---
 Doxyfile | 2589 ++++++++++++++++++++++++++++++++++++++++++++++++++++++
 xxhash.h |  881 +++++++++++++++----
 2 files changed, 3302 insertions(+), 168 deletions(-)
 create mode 100644 Doxyfile

diff --git a/Doxyfile b/Doxyfile
new file mode 100644
index 00000000..373986a5
--- /dev/null
+++ b/Doxyfile
@@ -0,0 +1,2589 @@
+# Doxyfile 1.8.20
+
+# This file describes the settings to be used by the documentation system
+# doxygen (www.doxygen.org) for a project.
+#
+# All text after a double hash (##) is considered a comment and is placed in
+# front of the TAG it is preceding.
+#
+# All text after a single hash (#) is considered a comment and will be ignored.
+# The format is:
+# TAG = value [value, ...]
+# For lists, items can also be appended using:
+# TAG += value [value, ...]
+# Values that contain spaces should be placed between quotes (\" \").
+
+#---------------------------------------------------------------------------
+# Project related configuration options
+#---------------------------------------------------------------------------
+
+# This tag specifies the encoding used for all characters in the configuration
+# file that follow. The default is UTF-8 which is also the encoding used for all
+# text before the first occurrence of this tag. Doxygen uses libiconv (or the
+# iconv built into libc) for the transcoding. See
+# https://www.gnu.org/software/libiconv/ for the list of possible encodings.
+# The default value is: UTF-8.
+
+DOXYFILE_ENCODING      = UTF-8
+
+# The PROJECT_NAME tag is a single word (or a sequence of words surrounded by
+# double-quotes, unless you are using Doxywizard) that should identify the
+# project for which the documentation is generated. This name is used in the
+# title of most generated pages and in a few other places.
+# The default value is: My Project.
+
+PROJECT_NAME           = "xxHash"
+
+# The PROJECT_NUMBER tag can be used to enter a project or revision number. This
+# could be handy for archiving the generated documentation or if some version
+# control system is used.
+
+PROJECT_NUMBER         = "0.8.0"
+
+# Using the PROJECT_BRIEF tag one can provide an optional one line description
+# for a project that appears at the top of each page and should give viewer a
+# quick idea about the purpose of the project. Keep the description short.
+
+PROJECT_BRIEF          = "Extremely fast non-cryptographic hash function"
+
+# With the PROJECT_LOGO tag one can specify a logo or an icon that is included
+# in the documentation. The maximum height of the logo should not exceed 55
+# pixels and the maximum width should not exceed 200 pixels. Doxygen will copy
+# the logo to the output directory.
+
+PROJECT_LOGO           =
+
+# The OUTPUT_DIRECTORY tag is used to specify the (relative or absolute) path
+# into which the generated documentation will be written. If a relative path is
+# entered, it will be relative to the location where doxygen was started. If
+# left blank the current directory will be used.
+
+OUTPUT_DIRECTORY       = doxygen
+
+# If the CREATE_SUBDIRS tag is set to YES then doxygen will create 4096 sub-
+# directories (in 2 levels) under the output directory of each output format and
+# will distribute the generated files over these directories. Enabling this
+# option can be useful when feeding doxygen a huge amount of source files, where
+# putting all generated files in the same directory would otherwise causes
+# performance problems for the file system.
+# The default value is: NO.
+
+CREATE_SUBDIRS         = NO
+
+# If the ALLOW_UNICODE_NAMES tag is set to YES, doxygen will allow non-ASCII
+# characters to appear in the names of generated files. If set to NO, non-ASCII
+# characters will be escaped, for example _xE3_x81_x84 will be used for Unicode
+# U+3044.
+# The default value is: NO.
+
+ALLOW_UNICODE_NAMES    = NO
+
+# The OUTPUT_LANGUAGE tag is used to specify the language in which all
+# documentation generated by doxygen is written. Doxygen will use this
+# information to generate all constant output in the proper language.
+# Possible values are: Afrikaans, Arabic, Armenian, Brazilian, Catalan, Chinese,
+# Chinese-Traditional, Croatian, Czech, Danish, Dutch, English (United States),
+# Esperanto, Farsi (Persian), Finnish, French, German, Greek, Hungarian,
+# Indonesian, Italian, Japanese, Japanese-en (Japanese with English messages),
+# Korean, Korean-en (Korean with English messages), Latvian, Lithuanian,
+# Macedonian, Norwegian, Persian (Farsi), Polish, Portuguese, Romanian, Russian,
+# Serbian, Serbian-Cyrillic, Slovak, Slovene, Spanish, Swedish, Turkish,
+# Ukrainian and Vietnamese.
+# The default value is: English.
+
+OUTPUT_LANGUAGE        = English
+
+# The OUTPUT_TEXT_DIRECTION tag is used to specify the direction in which all
+# documentation generated by doxygen is written. Doxygen will use this
+# information to generate all generated output in the proper direction.
+# Possible values are: None, LTR, RTL and Context.
+# The default value is: None.
+
+OUTPUT_TEXT_DIRECTION  = None
+
+# If the BRIEF_MEMBER_DESC tag is set to YES, doxygen will include brief member
+# descriptions after the members that are listed in the file and class
+# documentation (similar to Javadoc). Set to NO to disable this.
+# The default value is: YES.
+
+BRIEF_MEMBER_DESC      = YES
+
+# If the REPEAT_BRIEF tag is set to YES, doxygen will prepend the brief
+# description of a member or function before the detailed description
+#
+# Note: If both HIDE_UNDOC_MEMBERS and BRIEF_MEMBER_DESC are set to NO, the
+# brief descriptions will be completely suppressed.
+# The default value is: YES.
+
+REPEAT_BRIEF           = YES
+
+# This tag implements a quasi-intelligent brief description abbreviator that is
+# used to form the text in various listings. Each string in this list, if found
+# as the leading text of the brief description, will be stripped from the text
+# and the result, after processing the whole list, is used as the annotated
+# text. Otherwise, the brief description is used as-is. If left blank, the
+# following values are used ($name is automatically replaced with the name of
+# the entity):The $name class, The $name widget, The $name file, is, provides,
+# specifies, contains, represents, a, an and the.
+
+ABBREVIATE_BRIEF       = "The $name class" \
+                         "The $name widget" \
+                         "The $name file" \
+                         is \
+                         provides \
+                         specifies \
+                         contains \
+                         represents \
+                         a \
+                         an \
+                         the
+
+# If the ALWAYS_DETAILED_SEC and REPEAT_BRIEF tags are both set to YES then
+# doxygen will generate a detailed section even if there is only a brief
+# description.
+# The default value is: NO.
+
+ALWAYS_DETAILED_SEC    = NO
+
+# If the INLINE_INHERITED_MEMB tag is set to YES, doxygen will show all
+# inherited members of a class in the documentation of that class as if those
+# members were ordinary class members. Constructors, destructors and assignment
+# operators of the base classes will not be shown.
+# The default value is: NO.
+
+INLINE_INHERITED_MEMB  = NO
+
+# If the FULL_PATH_NAMES tag is set to YES, doxygen will prepend the full path
+# before files name in the file list and in the header files. If set to NO the
+# shortest path that makes the file name unique will be used
+# The default value is: YES.
+
+FULL_PATH_NAMES        = YES
+
+# The STRIP_FROM_PATH tag can be used to strip a user-defined part of the path.
+# Stripping is only done if one of the specified strings matches the left-hand
+# part of the path. The tag can be used to show relative paths in the file list.
+# If left blank the directory from which doxygen is run is used as the path to
+# strip.
+#
+# Note that you can specify absolute paths here, but also relative paths, which
+# will be relative from the directory where doxygen is started.
+# This tag requires that the tag FULL_PATH_NAMES is set to YES.
+
+STRIP_FROM_PATH        =
+
+# The STRIP_FROM_INC_PATH tag can be used to strip a user-defined part of the
+# path mentioned in the documentation of a class, which tells the reader which
+# header file to include in order to use a class. If left blank only the name of
+# the header file containing the class definition is used. Otherwise one should
+# specify the list of include paths that are normally passed to the compiler
+# using the -I flag.
+
+STRIP_FROM_INC_PATH    =
+
+# If the SHORT_NAMES tag is set to YES, doxygen will generate much shorter (but
+# less readable) file names. This can be useful is your file systems doesn't
+# support long names like on DOS, Mac, or CD-ROM.
+# The default value is: NO.
+
+SHORT_NAMES            = NO
+
+# If the JAVADOC_AUTOBRIEF tag is set to YES then doxygen will interpret the
+# first line (until the first dot) of a Javadoc-style comment as the brief
+# description. If set to NO, the Javadoc-style will behave just like regular Qt-
+# style comments (thus requiring an explicit @brief command for a brief
+# description.)
+# The default value is: NO.
+
+JAVADOC_AUTOBRIEF      = NO
+
+# If the JAVADOC_BANNER tag is set to YES then doxygen will interpret a line
+# such as
+# /***************
+# as being the beginning of a Javadoc-style comment "banner". If set to NO, the
+# Javadoc-style will behave just like regular comments and it will not be
+# interpreted by doxygen.
+# The default value is: NO.
+
+JAVADOC_BANNER         = NO
+
+# If the QT_AUTOBRIEF tag is set to YES then doxygen will interpret the first
+# line (until the first dot) of a Qt-style comment as the brief description. If
+# set to NO, the Qt-style will behave just like regular Qt-style comments (thus
+# requiring an explicit \brief command for a brief description.)
+# The default value is: NO.
+
+QT_AUTOBRIEF           = NO
+
+# The MULTILINE_CPP_IS_BRIEF tag can be set to YES to make doxygen treat a
+# multi-line C++ special comment block (i.e. a block of //! or /// comments) as
+# a brief description. This used to be the default behavior. The new default is
+# to treat a multi-line C++ comment block as a detailed description. Set this
+# tag to YES if you prefer the old behavior instead.
+#
+# Note that setting this tag to YES also means that rational rose comments are
+# not recognized any more.
+# The default value is: NO.
+
+MULTILINE_CPP_IS_BRIEF = NO
+
+# By default Python docstrings are displayed as preformatted text and doxygen's
+# special commands cannot be used. By setting PYTHON_DOCSTRING to NO the
+# doxygen's special commands can be used and the contents of the docstring
+# documentation blocks is shown as doxygen documentation.
+# The default value is: YES.
+
+PYTHON_DOCSTRING       = YES
+
+# If the INHERIT_DOCS tag is set to YES then an undocumented member inherits the
+# documentation from any documented member that it re-implements.
+# The default value is: YES.
+
+INHERIT_DOCS           = YES
+
+# If the SEPARATE_MEMBER_PAGES tag is set to YES then doxygen will produce a new
+# page for each member. If set to NO, the documentation of a member will be part
+# of the file/class/namespace that contains it.
+# The default value is: NO.
+
+SEPARATE_MEMBER_PAGES  = NO
+
+# The TAB_SIZE tag can be used to set the number of spaces in a tab. Doxygen
+# uses this value to replace tabs by spaces in code fragments.
+# Minimum value: 1, maximum value: 16, default value: 4.
+
+TAB_SIZE               = 4
+
+# This tag can be used to specify a number of aliases that act as commands in
+# the documentation. An alias has the form:
+# name=value
+# For example adding
+# "sideeffect=@par Side Effects:\n"
+# will allow you to put the command \sideeffect (or @sideeffect) in the
+# documentation, which will result in a user-defined paragraph with heading
+# "Side Effects:". You can put \n's in the value part of an alias to insert
+# newlines (in the resulting output). You can put ^^ in the value part of an
+# alias to insert a newline as if a physical newline was in the original file.
+# When you need a literal { or } or , in the value part of an alias you have to
+# escape them by means of a backslash (\), this can lead to conflicts with the
+# commands \{ and \} for these it is advised to use the version @{ and @} or use
+# a double escape (\\{ and \\})
+
+ALIASES                =
+
+# Set the OPTIMIZE_OUTPUT_FOR_C tag to YES if your project consists of C sources
+# only. Doxygen will then generate output that is more tailored for C. For
+# instance, some of the names that are used will be different. The list of all
+# members will be omitted, etc.
+# The default value is: NO.
+
+OPTIMIZE_OUTPUT_FOR_C  = YES
+
+# Set the OPTIMIZE_OUTPUT_JAVA tag to YES if your project consists of Java or
+# Python sources only. Doxygen will then generate output that is more tailored
+# for that language. For instance, namespaces will be presented as packages,
+# qualified scopes will look different, etc.
+# The default value is: NO.
+
+OPTIMIZE_OUTPUT_JAVA   = NO
+
+# Set the OPTIMIZE_FOR_FORTRAN tag to YES if your project consists of Fortran
+# sources. Doxygen will then generate output that is tailored for Fortran.
+# The default value is: NO.
+
+OPTIMIZE_FOR_FORTRAN   = NO
+
+# Set the OPTIMIZE_OUTPUT_VHDL tag to YES if your project consists of VHDL
+# sources. Doxygen will then generate output that is tailored for VHDL.
+# The default value is: NO.
+
+OPTIMIZE_OUTPUT_VHDL   = NO
+
+# Set the OPTIMIZE_OUTPUT_SLICE tag to YES if your project consists of Slice
+# sources only. Doxygen will then generate output that is more tailored for that
+# language. For instance, namespaces will be presented as modules, types will be
+# separated into more groups, etc.
+# The default value is: NO.
+
+OPTIMIZE_OUTPUT_SLICE  = NO
+
+# Doxygen selects the parser to use depending on the extension of the files it
+# parses. With this tag you can assign which parser to use for a given
+# extension. Doxygen has a built-in mapping, but you can override or extend it
+# using this tag. The format is ext=language, where ext is a file extension, and
+# language is one of the parsers supported by doxygen: IDL, Java, JavaScript,
+# Csharp (C#), C, C++, D, PHP, md (Markdown), Objective-C, Python, Slice, VHDL,
+# Fortran (fixed format Fortran: FortranFixed, free formatted Fortran:
+# FortranFree, unknown formatted Fortran: Fortran. In the later case the parser
+# tries to guess whether the code is fixed or free formatted code, this is the
+# default for Fortran type files). For instance to make doxygen treat .inc files
+# as Fortran files (default is PHP), and .f files as C (default is Fortran),
+# use: inc=Fortran f=C.
+#
+# Note: For files without extension you can use no_extension as a placeholder.
+#
+# Note that for custom extensions you also need to set FILE_PATTERNS otherwise
+# the files are not read by doxygen.
+
+EXTENSION_MAPPING      =
+
+# If the MARKDOWN_SUPPORT tag is enabled then doxygen pre-processes all comments
+# according to the Markdown format, which allows for more readable
+# documentation. See https://daringfireball.net/projects/markdown/ for details.
+# The output of markdown processing is further processed by doxygen, so you can
+# mix doxygen, HTML, and XML commands with Markdown formatting. Disable only in
+# case of backward compatibilities issues.
+# The default value is: YES.
+
+MARKDOWN_SUPPORT       = YES
+
+# When the TOC_INCLUDE_HEADINGS tag is set to a non-zero value, all headings up
+# to that level are automatically included in the table of contents, even if
+# they do not have an id attribute.
+# Note: This feature currently applies only to Markdown headings.
+# Minimum value: 0, maximum value: 99, default value: 5.
+# This tag requires that the tag MARKDOWN_SUPPORT is set to YES.
+
+TOC_INCLUDE_HEADINGS   = 5
+
+# When enabled doxygen tries to link words that correspond to documented
+# classes, or namespaces to their corresponding documentation. Such a link can
+# be prevented in individual cases by putting a % sign in front of the word or
+# globally by setting AUTOLINK_SUPPORT to NO.
+# The default value is: YES.
+
+AUTOLINK_SUPPORT       = YES
+
+# If you use STL classes (i.e. std::string, std::vector, etc.) but do not want
+# to include (a tag file for) the STL sources as input, then you should set this
+# tag to YES in order to let doxygen match functions declarations and
+# definitions whose arguments contain STL classes (e.g. func(std::string);
+# versus func(std::string) {}). This also make the inheritance and collaboration
+# diagrams that involve STL classes more complete and accurate.
+# The default value is: NO.
+
+BUILTIN_STL_SUPPORT    = NO
+
+# If you use Microsoft's C++/CLI language, you should set this option to YES to
+# enable parsing support.
+# The default value is: NO.
+
+CPP_CLI_SUPPORT        = NO
+
+# Set the SIP_SUPPORT tag to YES if your project consists of sip (see:
+# https://www.riverbankcomputing.com/software/sip/intro) sources only. Doxygen
+# will parse them like normal C++ but will assume all classes use public instead
+# of private inheritance when no explicit protection keyword is present.
+# The default value is: NO.
+
+SIP_SUPPORT            = NO
+
+# For Microsoft's IDL there are propget and propput attributes to indicate
+# getter and setter methods for a property. Setting this option to YES will make
+# doxygen to replace the get and set methods by a property in the documentation.
+# This will only work if the methods are indeed getting or setting a simple
+# type. If this is not the case, or you want to show the methods anyway, you
+# should set this option to NO.
+# The default value is: YES.
+
+IDL_PROPERTY_SUPPORT   = YES
+
+# If member grouping is used in the documentation and the DISTRIBUTE_GROUP_DOC
+# tag is set to YES then doxygen will reuse the documentation of the first
+# member in the group (if any) for the other members of the group. By default
+# all members of a group must be documented explicitly.
+# The default value is: NO.
+
+DISTRIBUTE_GROUP_DOC   = NO
+
+# If one adds a struct or class to a group and this option is enabled, then also
+# any nested class or struct is added to the same group. By default this option
+# is disabled and one has to add nested compounds explicitly via \ingroup.
+# The default value is: NO.
+
+GROUP_NESTED_COMPOUNDS = NO
+
+# Set the SUBGROUPING tag to YES to allow class member groups of the same type
+# (for instance a group of public functions) to be put as a subgroup of that
+# type (e.g. under the Public Functions section). Set it to NO to prevent
+# subgrouping. Alternatively, this can be done per class using the
+# \nosubgrouping command.
+# The default value is: YES.
+
+SUBGROUPING            = YES
+
+# When the INLINE_GROUPED_CLASSES tag is set to YES, classes, structs and unions
+# are shown inside the group in which they are included (e.g. using \ingroup)
+# instead of on a separate page (for HTML and Man pages) or section (for LaTeX
+# and RTF).
+#
+# Note that this feature does not work in combination with
+# SEPARATE_MEMBER_PAGES.
+# The default value is: NO.
+
+INLINE_GROUPED_CLASSES = NO
+
+# When the INLINE_SIMPLE_STRUCTS tag is set to YES, structs, classes, and unions
+# with only public data fields or simple typedef fields will be shown inline in
+# the documentation of the scope in which they are defined (i.e. file,
+# namespace, or group documentation), provided this scope is documented. If set
+# to NO, structs, classes, and unions are shown on a separate page (for HTML and
+# Man pages) or section (for LaTeX and RTF).
+# The default value is: NO.
+
+INLINE_SIMPLE_STRUCTS  = NO
+
+# When TYPEDEF_HIDES_STRUCT tag is enabled, a typedef of a struct, union, or
+# enum is documented as struct, union, or enum with the name of the typedef. So
+# typedef struct TypeS {} TypeT, will appear in the documentation as a struct
+# with name TypeT. When disabled the typedef will appear as a member of a file,
+# namespace, or class. And the struct will be named TypeS. This can typically be
+# useful for C code in case the coding convention dictates that all compound
+# types are typedef'ed and only the typedef is referenced, never the tag name.
+# The default value is: NO.
+
+TYPEDEF_HIDES_STRUCT   = NO
+
+# The size of the symbol lookup cache can be set using LOOKUP_CACHE_SIZE. This
+# cache is used to resolve symbols given their name and scope. Since this can be
+# an expensive process and often the same symbol appears multiple times in the
+# code, doxygen keeps a cache of pre-resolved symbols. If the cache is too small
+# doxygen will become slower. If the cache is too large, memory is wasted. The
+# cache size is given by this formula: 2^(16+LOOKUP_CACHE_SIZE). The valid range
+# is 0..9, the default is 0, corresponding to a cache size of 2^16=65536
+# symbols. At the end of a run doxygen will report the cache usage and suggest
+# the optimal cache size from a speed point of view.
+# Minimum value: 0, maximum value: 9, default value: 0.
+
+LOOKUP_CACHE_SIZE      = 0
+
+# The NUM_PROC_THREADS specifies the number threads doxygen is allowed to use
+# during processing. When set to 0 doxygen will based this on the number of
+# cores available in the system. You can set it explicitly to a value larger
+# than 0 to get more control over the balance between CPU load and processing
+# speed. At this moment only the input processing can be done using multiple
+# threads. Since this is still an experimental feature the default is set to 1,
+# which efficively disables parallel processing. Please report any issues you
+# encounter. Generating dot graphs in parallel is controlled by the
+# DOT_NUM_THREADS setting.
+# Minimum value: 0, maximum value: 32, default value: 1.
+
+NUM_PROC_THREADS       = 1
+
+#---------------------------------------------------------------------------
+# Build related configuration options
+#---------------------------------------------------------------------------
+
+# If the EXTRACT_ALL tag is set to YES, doxygen will assume all entities in
+# documentation are documented, even if no documentation was available. Private
+# class members and static file members will be hidden unless the
+# EXTRACT_PRIVATE respectively EXTRACT_STATIC tags are set to YES.
+# Note: This will also disable the warnings about undocumented members that are
+# normally produced when WARNINGS is set to YES.
+# The default value is: NO.
+
+EXTRACT_ALL            = NO
+
+# If the EXTRACT_PRIVATE tag is set to YES, all private members of a class will
+# be included in the documentation.
+# The default value is: NO.
+
+EXTRACT_PRIVATE        = NO
+
+# If the EXTRACT_PRIV_VIRTUAL tag is set to YES, documented private virtual
+# methods of a class will be included in the documentation.
+# The default value is: NO.
+
+EXTRACT_PRIV_VIRTUAL   = NO
+
+# If the EXTRACT_PACKAGE tag is set to YES, all members with package or internal
+# scope will be included in the documentation.
+# The default value is: NO.
+
+EXTRACT_PACKAGE        = NO
+
+# If the EXTRACT_STATIC tag is set to YES, all static members of a file will be
+# included in the documentation.
+# The default value is: NO.
+
+EXTRACT_STATIC         = YES
+
+# If the EXTRACT_LOCAL_CLASSES tag is set to YES, classes (and structs) defined
+# locally in source files will be included in the documentation. If set to NO,
+# only classes defined in header files are included. Does not have any effect
+# for Java sources.
+# The default value is: YES.
+
+EXTRACT_LOCAL_CLASSES  = YES
+
+# This flag is only useful for Objective-C code. If set to YES, local methods,
+# which are defined in the implementation section but not in the interface are
+# included in the documentation. If set to NO, only methods in the interface are
+# included.
+# The default value is: NO.
+
+EXTRACT_LOCAL_METHODS  = NO
+
+# If this flag is set to YES, the members of anonymous namespaces will be
+# extracted and appear in the documentation as a namespace called
+# 'anonymous_namespace{file}', where file will be replaced with the base name of
+# the file that contains the anonymous namespace. By default anonymous namespace
+# are hidden.
+# The default value is: NO.
+
+EXTRACT_ANON_NSPACES   = NO
+
+# If the HIDE_UNDOC_MEMBERS tag is set to YES, doxygen will hide all
+# undocumented members inside documented classes or files. If set to NO these
+# members will be included in the various overviews, but no documentation
+# section is generated. This option has no effect if EXTRACT_ALL is enabled.
+# The default value is: NO.
+
+HIDE_UNDOC_MEMBERS     = NO
+
+# If the HIDE_UNDOC_CLASSES tag is set to YES, doxygen will hide all
+# undocumented classes that are normally visible in the class hierarchy. If set
+# to NO, these classes will be included in the various overviews. This option
+# has no effect if EXTRACT_ALL is enabled.
+# The default value is: NO.
+
+HIDE_UNDOC_CLASSES     = NO
+
+# If the HIDE_FRIEND_COMPOUNDS tag is set to YES, doxygen will hide all friend
+# declarations. If set to NO, these declarations will be included in the
+# documentation.
+# The default value is: NO.
+
+HIDE_FRIEND_COMPOUNDS  = NO
+
+# If the HIDE_IN_BODY_DOCS tag is set to YES, doxygen will hide any
+# documentation blocks found inside the body of a function. If set to NO, these
+# blocks will be appended to the function's detailed documentation block.
+# The default value is: NO.
+
+HIDE_IN_BODY_DOCS      = NO
+
+# The INTERNAL_DOCS tag determines if documentation that is typed after a
+# \internal command is included. If the tag is set to NO then the documentation
+# will be excluded. Set it to YES to include the internal documentation.
+# The default value is: NO.
+
+INTERNAL_DOCS          = YES
+
+# If the CASE_SENSE_NAMES tag is set to NO then doxygen will only generate file
+# names in lower-case letters. If set to YES, upper-case letters are also
+# allowed. This is useful if you have classes or files whose names only differ
+# in case and if your file system supports case sensitive file names. Windows
+# (including Cygwin) and Mac users are advised to set this option to NO.
+# The default value is: system dependent.
+
+CASE_SENSE_NAMES       = YES
+
+# If the HIDE_SCOPE_NAMES tag is set to NO then doxygen will show members with
+# their full class and namespace scopes in the documentation. If set to YES, the
+# scope will be hidden.
+# The default value is: NO.
+
+HIDE_SCOPE_NAMES       = NO
+
+# If the HIDE_COMPOUND_REFERENCE tag is set to NO (default) then doxygen will
+# append additional text to a page's title, such as Class Reference. If set to
+# YES the compound reference will be hidden.
+# The default value is: NO.
+
+HIDE_COMPOUND_REFERENCE= NO
+
+# If the SHOW_INCLUDE_FILES tag is set to YES then doxygen will put a list of
+# the files that are included by a file in the documentation of that file.
+# The default value is: YES.
+
+SHOW_INCLUDE_FILES     = YES
+
+# If the SHOW_GROUPED_MEMB_INC tag is set to YES then Doxygen will add for each
+# grouped member an include statement to the documentation, telling the reader
+# which file to include in order to use the member.
+# The default value is: NO.
+
+SHOW_GROUPED_MEMB_INC  = NO
+
+# If the FORCE_LOCAL_INCLUDES tag is set to YES then doxygen will list include
+# files with double quotes in the documentation rather than with sharp brackets.
+# The default value is: NO.
+
+FORCE_LOCAL_INCLUDES   = NO
+
+# If the INLINE_INFO tag is set to YES then a tag [inline] is inserted in the
+# documentation for inline members.
+# The default value is: YES.
+
+INLINE_INFO            = YES
+
+# If the SORT_MEMBER_DOCS tag is set to YES then doxygen will sort the
+# (detailed) documentation of file and class members alphabetically by member
+# name. If set to NO, the members will appear in declaration order.
+# The default value is: YES.
+
+SORT_MEMBER_DOCS       = NO
+
+# If the SORT_BRIEF_DOCS tag is set to YES then doxygen will sort the brief
+# descriptions of file, namespace and class members alphabetically by member
+# name. If set to NO, the members will appear in declaration order. Note that
+# this will also influence the order of the classes in the class list.
+# The default value is: NO.
+
+SORT_BRIEF_DOCS        = NO
+
+# If the SORT_MEMBERS_CTORS_1ST tag is set to YES then doxygen will sort the
+# (brief and detailed) documentation of class members so that constructors and
+# destructors are listed first. If set to NO the constructors will appear in the
+# respective orders defined by SORT_BRIEF_DOCS and SORT_MEMBER_DOCS.
+# Note: If SORT_BRIEF_DOCS is set to NO this option is ignored for sorting brief
+# member documentation.
+# Note: If SORT_MEMBER_DOCS is set to NO this option is ignored for sorting
+# detailed member documentation.
+# The default value is: NO.
+
+SORT_MEMBERS_CTORS_1ST = NO
+
+# If the SORT_GROUP_NAMES tag is set to YES then doxygen will sort the hierarchy
+# of group names into alphabetical order. If set to NO the group names will
+# appear in their defined order.
+# The default value is: NO.
+
+SORT_GROUP_NAMES       = NO
+
+# If the SORT_BY_SCOPE_NAME tag is set to YES, the class list will be sorted by
+# fully-qualified names, including namespaces. If set to NO, the class list will
+# be sorted only by class name, not including the namespace part.
+# Note: This option is not very useful if HIDE_SCOPE_NAMES is set to YES.
+# Note: This option applies only to the class list, not to the alphabetical
+# list.
+# The default value is: NO.
+
+SORT_BY_SCOPE_NAME     = NO
+
+# If the STRICT_PROTO_MATCHING option is enabled and doxygen fails to do proper
+# type resolution of all parameters of a function it will reject a match between
+# the prototype and the implementation of a member function even if there is
+# only one candidate or it is obvious which candidate to choose by doing a
+# simple string match. By disabling STRICT_PROTO_MATCHING doxygen will still
+# accept a match between prototype and implementation in such cases.
+# The default value is: NO.
+
+STRICT_PROTO_MATCHING  = NO
+
+# The GENERATE_TODOLIST tag can be used to enable (YES) or disable (NO) the todo
+# list. This list is created by putting \todo commands in the documentation.
+# The default value is: YES.
+
+GENERATE_TODOLIST      = YES
+
+# The GENERATE_TESTLIST tag can be used to enable (YES) or disable (NO) the test
+# list. This list is created by putting \test commands in the documentation.
+# The default value is: YES.
+
+GENERATE_TESTLIST      = YES
+
+# The GENERATE_BUGLIST tag can be used to enable (YES) or disable (NO) the bug
+# list. This list is created by putting \bug commands in the documentation.
+# The default value is: YES.
+
+GENERATE_BUGLIST       = YES
+
+# The GENERATE_DEPRECATEDLIST tag can be used to enable (YES) or disable (NO)
+# the deprecated list. This list is created by putting \deprecated commands in
+# the documentation.
+# The default value is: YES.
+
+GENERATE_DEPRECATEDLIST= YES
+
+# The ENABLED_SECTIONS tag can be used to enable conditional documentation
+# sections, marked by \if <section_label> ... \endif and \cond <section_label>
+# ... \endcond blocks.
+
+ENABLED_SECTIONS       =
+
+# The MAX_INITIALIZER_LINES tag determines the maximum number of lines that the
+# initial value of a variable or macro / define can have for it to appear in the
+# documentation. If the initializer consists of more lines than specified here
+# it will be hidden. Use a value of 0 to hide initializers completely. The
+# appearance of the value of individual variables and macros / defines can be
+# controlled using \showinitializer or \hideinitializer command in the
+# documentation regardless of this setting.
+# Minimum value: 0, maximum value: 10000, default value: 30.
+
+MAX_INITIALIZER_LINES  = 30
+
+# Set the SHOW_USED_FILES tag to NO to disable the list of files generated at
+# the bottom of the documentation of classes and structs. If set to YES, the
+# list will mention the files that were used to generate the documentation.
+# The default value is: YES.
+
+SHOW_USED_FILES        = YES
+
+# Set the SHOW_FILES tag to NO to disable the generation of the Files page. This
+# will remove the Files entry from the Quick Index and from the Folder Tree View
+# (if specified).
+# The default value is: YES.
+
+SHOW_FILES             = YES
+
+# Set the SHOW_NAMESPACES tag to NO to disable the generation of the Namespaces
+# page. This will remove the Namespaces entry from the Quick Index and from the
+# Folder Tree View (if specified).
+# The default value is: YES.
+
+SHOW_NAMESPACES        = YES
+
+# The FILE_VERSION_FILTER tag can be used to specify a program or script that
+# doxygen should invoke to get the current version for each file (typically from
+# the version control system). Doxygen will invoke the program by executing (via
+# popen()) the command command input-file, where command is the value of the
+# FILE_VERSION_FILTER tag, and input-file is the name of an input file provided
+# by doxygen. Whatever the program writes to standard output is used as the file
+# version. For an example see the documentation.
+
+FILE_VERSION_FILTER    =
+
+# The LAYOUT_FILE tag can be used to specify a layout file which will be parsed
+# by doxygen. The layout file controls the global structure of the generated
+# output files in an output format independent way. To create the layout file
+# that represents doxygen's defaults, run doxygen with the -l option. You can
+# optionally specify a file name after the option, if omitted DoxygenLayout.xml
+# will be used as the name of the layout file.
+#
+# Note that if you run doxygen from a directory containing a file called
+# DoxygenLayout.xml, doxygen will parse it automatically even if the LAYOUT_FILE
+# tag is left empty.
+
+LAYOUT_FILE            =
+
+# The CITE_BIB_FILES tag can be used to specify one or more bib files containing
+# the reference definitions. This must be a list of .bib files. The .bib
+# extension is automatically appended if omitted. This requires the bibtex tool
+# to be installed. See also https://en.wikipedia.org/wiki/BibTeX for more info.
+# For LaTeX the style of the bibliography can be controlled using
+# LATEX_BIB_STYLE. To use this feature you need bibtex and perl available in the
+# search path. See also \cite for info how to create references.
+
+CITE_BIB_FILES         =
+
+#---------------------------------------------------------------------------
+# Configuration options related to warning and progress messages
+#---------------------------------------------------------------------------
+
+# The QUIET tag can be used to turn on/off the messages that are generated to
+# standard output by doxygen. If QUIET is set to YES this implies that the
+# messages are off.
+# The default value is: NO.
+
+QUIET                  = YES
+
+# The WARNINGS tag can be used to turn on/off the warning messages that are
+# generated to standard error (stderr) by doxygen. If WARNINGS is set to YES
+# this implies that the warnings are on.
+#
+# Tip: Turn warnings on while writing the documentation.
+# The default value is: YES.
+
+WARNINGS               = YES
+
+# If the WARN_IF_UNDOCUMENTED tag is set to YES then doxygen will generate
+# warnings for undocumented members. If EXTRACT_ALL is set to YES then this flag
+# will automatically be disabled.
+# The default value is: YES.
+
+WARN_IF_UNDOCUMENTED   = NO
+
+# If the WARN_IF_DOC_ERROR tag is set to YES, doxygen will generate warnings for
+# potential errors in the documentation, such as not documenting some parameters
+# in a documented function, or documenting parameters that don't exist or using
+# markup commands wrongly.
+# The default value is: YES.
+
+WARN_IF_DOC_ERROR      = YES
+
+# This WARN_NO_PARAMDOC option can be enabled to get warnings for functions that
+# are documented, but have no documentation for their parameters or return
+# value. If set to NO, doxygen will only warn about wrong or incomplete
+# parameter documentation, but not about the absence of documentation. If
+# EXTRACT_ALL is set to YES then this flag will automatically be disabled.
+# The default value is: NO.
+
+WARN_NO_PARAMDOC       = NO
+
+# If the WARN_AS_ERROR tag is set to YES then doxygen will immediately stop when
+# a warning is encountered.
+# The default value is: NO.
+
+WARN_AS_ERROR          = NO
+
+# The WARN_FORMAT tag determines the format of the warning messages that doxygen
+# can produce. The string should contain the $file, $line, and $text tags, which
+# will be replaced by the file and line number from which the warning originated
+# and the warning text. Optionally the format may contain $version, which will
+# be replaced by the version of the file (if it could be obtained via
+# FILE_VERSION_FILTER)
+# The default value is: $file:$line: $text.
+
+WARN_FORMAT            = "$file:$line: $text"
+
+# The WARN_LOGFILE tag can be used to specify a file to which warning and error
+# messages should be written. If left blank the output is written to standard
+# error (stderr).
+
+WARN_LOGFILE           =
+
+#---------------------------------------------------------------------------
+# Configuration options related to the input files
+#---------------------------------------------------------------------------
+
+# The INPUT tag is used to specify the files and/or directories that contain
+# documented source files. You may enter file names like myfile.cpp or
+# directories like /usr/src/myproject. Separate the files or directories with
+# spaces. See also FILE_PATTERNS and EXTENSION_MAPPING
+# Note: If this tag is empty the current directory is searched.
+
+INPUT                  =
+
+# This tag can be used to specify the character encoding of the source files
+# that doxygen parses. Internally doxygen uses the UTF-8 encoding. Doxygen uses
+# libiconv (or the iconv built into libc) for the transcoding. See the libiconv
+# documentation (see: https://www.gnu.org/software/libiconv/) for the list of
+# possible encodings.
+# The default value is: UTF-8.
+
+INPUT_ENCODING         = UTF-8
+
+# If the value of the INPUT tag contains directories, you can use the
+# FILE_PATTERNS tag to specify one or more wildcard patterns (like *.cpp and
+# *.h) to filter out the source-files in the directories.
+#
+# Note that for custom extensions or not directly supported extensions you also
+# need to set EXTENSION_MAPPING for the extension otherwise the files are not
+# read by doxygen.
+#
+# If left blank the following patterns are tested:*.c, *.cc, *.cxx, *.cpp,
+# *.c++, *.java, *.ii, *.ixx, *.ipp, *.i++, *.inl, *.idl, *.ddl, *.odl, *.h,
+# *.hh, *.hxx, *.hpp, *.h++, *.cs, *.d, *.php, *.php4, *.php5, *.phtml, *.inc,
+# *.m, *.markdown, *.md, *.mm, *.dox (to be provided as doxygen C comment),
+# *.doc (to be provided as doxygen C comment), *.txt (to be provided as doxygen
+# C comment), *.py, *.pyw, *.f90, *.f95, *.f03, *.f08, *.f18, *.f, *.for, *.vhd,
+# *.vhdl, *.ucf, *.qsf and *.ice.
+
+FILE_PATTERNS = xxhash.h
+#
+#FILE_PATTERNS          = *.c \
+#                         *.cc \
+#                         *.cxx \
+#                         *.cpp \
+#                         *.c++ \
+#                         *.java \
+#                         *.ii \
+#                         *.ixx \
+#                         *.ipp \
+#                         *.i++ \
+#                         *.inl \
+#                         *.idl \
+#                         *.ddl \
+#                         *.odl \
+#                         *.h \
+#                         *.hh \
+#                         *.hxx \
+#                         *.hpp \
+#                         *.h++ \
+#                         *.cs \
+#                         *.d \
+#                         *.php \
+#                         *.php4 \
+#                         *.php5 \
+#                         *.phtml \
+#                         *.inc \
+#                         *.m \
+#                         *.markdown \
+#                         *.md \
+#                         *.mm \
+#                         *.dox \
+#                         *.doc \
+#                         *.txt \
+#                         *.py \
+#                         *.pyw \
+#                         *.f90 \
+#                         *.f95 \
+#                         *.f03 \
+#                         *.f08 \
+#                         *.f18 \
+#                         *.f \
+#                         *.for \
+#                         *.vhd \
+#                         *.vhdl \
+#                         *.ucf \
+#                         *.qsf \
+#                         *.ice
+
+# The RECURSIVE tag can be used to specify whether or not subdirectories should
+# be searched for input files as well.
+# The default value is: NO.
+
+RECURSIVE              = NO
+
+# The EXCLUDE tag can be used to specify files and/or directories that should be
+# excluded from the INPUT source files. This way you can easily exclude a
+# subdirectory from a directory tree whose root is specified with the INPUT tag.
+#
+# Note that relative paths are relative to the directory from which doxygen is
+# run.
+
+EXCLUDE                =
+
+# The EXCLUDE_SYMLINKS tag can be used to select whether or not files or
+# directories that are symbolic links (a Unix file system feature) are excluded
+# from the input.
+# The default value is: NO.
+
+EXCLUDE_SYMLINKS       = NO
+
+# If the value of the INPUT tag contains directories, you can use the
+# EXCLUDE_PATTERNS tag to specify one or more wildcard patterns to exclude
+# certain files from those directories.
+#
+# Note that the wildcards are matched against the file with absolute path, so to
+# exclude all test directories for example use the pattern */test/*
+
+EXCLUDE_PATTERNS       =
+
+# The EXCLUDE_SYMBOLS tag can be used to specify one or more symbol names
+# (namespaces, classes, functions, etc.) that should be excluded from the
+# output. The symbol name can be a fully qualified name, a word, or if the
+# wildcard * is used, a substring. Examples: ANamespace, AClass,
+# AClass::ANamespace, ANamespace::*Test
+#
+# Note that the wildcards are matched against the file with absolute path, so to
+# exclude all test directories use the pattern */test/*
+
+EXCLUDE_SYMBOLS        =
+
+# The EXAMPLE_PATH tag can be used to specify one or more files or directories
+# that contain example code fragments that are included (see the \include
+# command).
+
+EXAMPLE_PATH           =
+
+# If the value of the EXAMPLE_PATH tag contains directories, you can use the
+# EXAMPLE_PATTERNS tag to specify one or more wildcard pattern (like *.cpp and
+# *.h) to filter out the source-files in the directories. If left blank all
+# files are included.
+
+EXAMPLE_PATTERNS       = *
+
+# If the EXAMPLE_RECURSIVE tag is set to YES then subdirectories will be
+# searched for input files to be used with the \include or \dontinclude commands
+# irrespective of the value of the RECURSIVE tag.
+# The default value is: NO.
+
+EXAMPLE_RECURSIVE      = NO
+
+# The IMAGE_PATH tag can be used to specify one or more files or directories
+# that contain images that are to be included in the documentation (see the
+# \image command).
+
+IMAGE_PATH             =
+
+# The INPUT_FILTER tag can be used to specify a program that doxygen should
+# invoke to filter for each input file. Doxygen will invoke the filter program
+# by executing (via popen()) the command:
+#
+# <filter> <input-file>
+#
+# where <filter> is the value of the INPUT_FILTER tag, and <input-file> is the
+# name of an input file. Doxygen will then use the output that the filter
+# program writes to standard output. If FILTER_PATTERNS is specified, this tag
+# will be ignored.
+#
+# Note that the filter must not add or remove lines; it is applied before the
+# code is scanned, but not when the output code is generated. If lines are added
+# or removed, the anchors will not be placed correctly.
+#
+# Note that for custom extensions or not directly supported extensions you also
+# need to set EXTENSION_MAPPING for the extension otherwise the files are not
+# properly processed by doxygen.
+
+INPUT_FILTER           =
+
+# The FILTER_PATTERNS tag can be used to specify filters on a per file pattern
+# basis. Doxygen will compare the file name with each pattern and apply the
+# filter if there is a match. The filters are a list of the form: pattern=filter
+# (like *.cpp=my_cpp_filter). See INPUT_FILTER for further information on how
+# filters are used. If the FILTER_PATTERNS tag is empty or if none of the
+# patterns match the file name, INPUT_FILTER is applied.
+#
+# Note that for custom extensions or not directly supported extensions you also
+# need to set EXTENSION_MAPPING for the extension otherwise the files are not
+# properly processed by doxygen.
+
+FILTER_PATTERNS        =
+
+# If the FILTER_SOURCE_FILES tag is set to YES, the input filter (if set using
+# INPUT_FILTER) will also be used to filter the input files that are used for
+# producing the source files to browse (i.e. when SOURCE_BROWSER is set to YES).
+# The default value is: NO.
+
+FILTER_SOURCE_FILES    = NO
+
+# The FILTER_SOURCE_PATTERNS tag can be used to specify source filters per file
+# pattern. A pattern will override the setting for FILTER_PATTERN (if any) and
+# it is also possible to disable source filtering for a specific pattern using
+# *.ext= (so without naming a filter).
+# This tag requires that the tag FILTER_SOURCE_FILES is set to YES.
+
+FILTER_SOURCE_PATTERNS =
+
+# If the USE_MDFILE_AS_MAINPAGE tag refers to the name of a markdown file that
+# is part of the input, its contents will be placed on the main page
+# (index.html). This can be useful if you have a project on for instance GitHub
+# and want to reuse the introduction page also for the doxygen output.
+
+USE_MDFILE_AS_MAINPAGE =
+
+#---------------------------------------------------------------------------
+# Configuration options related to source browsing
+#---------------------------------------------------------------------------
+
+# If the SOURCE_BROWSER tag is set to YES then a list of source files will be
+# generated. Documented entities will be cross-referenced with these sources.
+#
+# Note: To get rid of all source code in the generated output, make sure that
+# also VERBATIM_HEADERS is set to NO.
+# The default value is: NO.
+
+SOURCE_BROWSER         = NO
+
+# Setting the INLINE_SOURCES tag to YES will include the body of functions,
+# classes and enums directly into the documentation.
+# The default value is: NO.
+
+INLINE_SOURCES         = NO
+
+# Setting the STRIP_CODE_COMMENTS tag to YES will instruct doxygen to hide any
+# special comment blocks from generated source code fragments. Normal C, C++ and
+# Fortran comments will always remain visible.
+# The default value is: YES.
+
+STRIP_CODE_COMMENTS    = YES
+
+# If the REFERENCED_BY_RELATION tag is set to YES then for each documented
+# entity all documented functions referencing it will be listed.
+# The default value is: NO.
+
+REFERENCED_BY_RELATION = NO
+
+# If the REFERENCES_RELATION tag is set to YES then for each documented function
+# all documented entities called/used by that function will be listed.
+# The default value is: NO.
+
+REFERENCES_RELATION    = NO
+
+# If the REFERENCES_LINK_SOURCE tag is set to YES and SOURCE_BROWSER tag is set
+# to YES then the hyperlinks from functions in REFERENCES_RELATION and
+# REFERENCED_BY_RELATION lists will link to the source code. Otherwise they will
+# link to the documentation.
+# The default value is: YES.
+
+REFERENCES_LINK_SOURCE = YES
+
+# If SOURCE_TOOLTIPS is enabled (the default) then hovering a hyperlink in the
+# source code will show a tooltip with additional information such as prototype,
+# brief description and links to the definition and documentation. Since this
+# will make the HTML file larger and loading of large files a bit slower, you
+# can opt to disable this feature.
+# The default value is: YES.
+# This tag requires that the tag SOURCE_BROWSER is set to YES.
+
+SOURCE_TOOLTIPS        = YES
+
+# If the USE_HTAGS tag is set to YES then the references to source code will
+# point to the HTML generated by the htags(1) tool instead of doxygen built-in
+# source browser. The htags tool is part of GNU's global source tagging system
+# (see https://www.gnu.org/software/global/global.html). You will need version
+# 4.8.6 or higher.
+#
+# To use it do the following:
+# - Install the latest version of global
+# - Enable SOURCE_BROWSER and USE_HTAGS in the configuration file
+# - Make sure the INPUT points to the root of the source tree
+# - Run doxygen as normal
+#
+# Doxygen will invoke htags (and that will in turn invoke gtags), so these
+# tools must be available from the command line (i.e. in the search path).
+#
+# The result: instead of the source browser generated by doxygen, the links to
+# source code will now point to the output of htags.
+# The default value is: NO.
+# This tag requires that the tag SOURCE_BROWSER is set to YES.
+
+USE_HTAGS              = NO
+
+# If the VERBATIM_HEADERS tag is set the YES then doxygen will generate a
+# verbatim copy of the header file for each class for which an include is
+# specified. Set to NO to disable this.
+# See also: Section \class.
+# The default value is: YES.
+
+VERBATIM_HEADERS       = YES
+
+#---------------------------------------------------------------------------
+# Configuration options related to the alphabetical class index
+#---------------------------------------------------------------------------
+
+# If the ALPHABETICAL_INDEX tag is set to YES, an alphabetical index of all
+# compounds will be generated. Enable this if the project contains a lot of
+# classes, structs, unions or interfaces.
+# The default value is: YES.
+
+ALPHABETICAL_INDEX     = YES
+
+# The COLS_IN_ALPHA_INDEX tag can be used to specify the number of columns in
+# which the alphabetical index list will be split.
+# Minimum value: 1, maximum value: 20, default value: 5.
+# This tag requires that the tag ALPHABETICAL_INDEX is set to YES.
+
+COLS_IN_ALPHA_INDEX    = 5
+
+# In case all classes in a project start with a common prefix, all classes will
+# be put under the same header in the alphabetical index. The IGNORE_PREFIX tag
+# can be used to specify a prefix (or a list of prefixes) that should be ignored
+# while generating the index headers.
+# This tag requires that the tag ALPHABETICAL_INDEX is set to YES.
+
+IGNORE_PREFIX          =
+
+#---------------------------------------------------------------------------
+# Configuration options related to the HTML output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_HTML tag is set to YES, doxygen will generate HTML output
+# The default value is: YES.
+
+GENERATE_HTML          = YES
+
+# The HTML_OUTPUT tag is used to specify where the HTML docs will be put. If a
+# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
+# it.
+# The default directory is: html.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_OUTPUT            = html
+
+# The HTML_FILE_EXTENSION tag can be used to specify the file extension for each
+# generated HTML page (for example: .htm, .php, .asp).
+# The default value is: .html.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_FILE_EXTENSION    = .html
+
+# The HTML_HEADER tag can be used to specify a user-defined HTML header file for
+# each generated HTML page. If the tag is left blank doxygen will generate a
+# standard header.
+#
+# To get valid HTML the header file that includes any scripts and style sheets
+# that doxygen needs, which is dependent on the configuration options used (e.g.
+# the setting GENERATE_TREEVIEW). It is highly recommended to start with a
+# default header using
+# doxygen -w html new_header.html new_footer.html new_stylesheet.css
+# YourConfigFile
+# and then modify the file new_header.html. See also section "Doxygen usage"
+# for information on how to generate the default header that doxygen normally
+# uses.
+# Note: The header is subject to change so you typically have to regenerate the
+# default header when upgrading to a newer version of doxygen. For a description
+# of the possible markers and block names see the documentation.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_HEADER            =
+
+# The HTML_FOOTER tag can be used to specify a user-defined HTML footer for each
+# generated HTML page. If the tag is left blank doxygen will generate a standard
+# footer. See HTML_HEADER for more information on how to generate a default
+# footer and what special commands can be used inside the footer. See also
+# section "Doxygen usage" for information on how to generate the default footer
+# that doxygen normally uses.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_FOOTER            =
+
+# The HTML_STYLESHEET tag can be used to specify a user-defined cascading style
+# sheet that is used by each HTML page. It can be used to fine-tune the look of
+# the HTML output. If left blank doxygen will generate a default style sheet.
+# See also section "Doxygen usage" for information on how to generate the style
+# sheet that doxygen normally uses.
+# Note: It is recommended to use HTML_EXTRA_STYLESHEET instead of this tag, as
+# it is more robust and this tag (HTML_STYLESHEET) will in the future become
+# obsolete.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_STYLESHEET        =
+
+# The HTML_EXTRA_STYLESHEET tag can be used to specify additional user-defined
+# cascading style sheets that are included after the standard style sheets
+# created by doxygen. Using this option one can overrule certain style aspects.
+# This is preferred over using HTML_STYLESHEET since it does not replace the
+# standard style sheet and is therefore more robust against future updates.
+# Doxygen will copy the style sheet files to the output directory.
+# Note: The order of the extra style sheet files is of importance (e.g. the last
+# style sheet in the list overrules the setting of the previous ones in the
+# list). For an example see the documentation.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_EXTRA_STYLESHEET  =
+
+# The HTML_EXTRA_FILES tag can be used to specify one or more extra images or
+# other source files which should be copied to the HTML output directory. Note
+# that these files will be copied to the base HTML output directory. Use the
+# $relpath^ marker in the HTML_HEADER and/or HTML_FOOTER files to load these
+# files. In the HTML_STYLESHEET file, use the file name only. Also note that the
+# files will be copied as-is; there are no commands or markers available.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_EXTRA_FILES       =
+
+# The HTML_COLORSTYLE_HUE tag controls the color of the HTML output. Doxygen
+# will adjust the colors in the style sheet and background images according to
+# this color. Hue is specified as an angle on a colorwheel, see
+# https://en.wikipedia.org/wiki/Hue for more information. For instance the value
+# 0 represents red, 60 is yellow, 120 is green, 180 is cyan, 240 is blue, 300
+# purple, and 360 is red again.
+# Minimum value: 0, maximum value: 359, default value: 220.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_COLORSTYLE_HUE    = 220
+
+# The HTML_COLORSTYLE_SAT tag controls the purity (or saturation) of the colors
+# in the HTML output. For a value of 0 the output will use grayscales only. A
+# value of 255 will produce the most vivid colors.
+# Minimum value: 0, maximum value: 255, default value: 100.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_COLORSTYLE_SAT    = 100
+
+# The HTML_COLORSTYLE_GAMMA tag controls the gamma correction applied to the
+# luminance component of the colors in the HTML output. Values below 100
+# gradually make the output lighter, whereas values above 100 make the output
+# darker. The value divided by 100 is the actual gamma applied, so 80 represents
+# a gamma of 0.8, The value 220 represents a gamma of 2.2, and 100 does not
+# change the gamma.
+# Minimum value: 40, maximum value: 240, default value: 80.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_COLORSTYLE_GAMMA  = 100
+
+# If the HTML_TIMESTAMP tag is set to YES then the footer of each generated HTML
+# page will contain the date and time when the page was generated. Setting this
+# to YES can help to show when doxygen was last run and thus if the
+# documentation is up to date.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_TIMESTAMP         = NO
+
+# If the HTML_DYNAMIC_MENUS tag is set to YES then the generated HTML
+# documentation will contain a main index with vertical navigation menus that
+# are dynamically created via JavaScript. If disabled, the navigation index will
+# consists of multiple levels of tabs that are statically embedded in every HTML
+# page. Disable this option to support browsers that do not have JavaScript,
+# like the Qt help browser.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_DYNAMIC_MENUS     = YES
+
+# If the HTML_DYNAMIC_SECTIONS tag is set to YES then the generated HTML
+# documentation will contain sections that can be hidden and shown after the
+# page has loaded.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_DYNAMIC_SECTIONS  = YES
+
+# With HTML_INDEX_NUM_ENTRIES one can control the preferred number of entries
+# shown in the various tree structured indices initially; the user can expand
+# and collapse entries dynamically later on. Doxygen will expand the tree to
+# such a level that at most the specified number of entries are visible (unless
+# a fully collapsed tree already exceeds this amount). So setting the number of
+# entries 1 will produce a full collapsed tree by default. 0 is a special value
+# representing an infinite number of entries and will result in a full expanded
+# tree by default.
+# Minimum value: 0, maximum value: 9999, default value: 100.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_INDEX_NUM_ENTRIES = 100
+
+# If the GENERATE_DOCSET tag is set to YES, additional index files will be
+# generated that can be used as input for Apple's Xcode 3 integrated development
+# environment (see: https://developer.apple.com/xcode/), introduced with OSX
+# 10.5 (Leopard). To create a documentation set, doxygen will generate a
+# Makefile in the HTML output directory. Running make will produce the docset in
+# that directory and running make install will install the docset in
+# ~/Library/Developer/Shared/Documentation/DocSets so that Xcode will find it at
+# startup. See https://developer.apple.com/library/archive/featuredarticles/Doxy
+# genXcode/_index.html for more information.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+GENERATE_DOCSET        = NO
+
+# This tag determines the name of the docset feed. A documentation feed provides
+# an umbrella under which multiple documentation sets from a single provider
+# (such as a company or product suite) can be grouped.
+# The default value is: Doxygen generated docs.
+# This tag requires that the tag GENERATE_DOCSET is set to YES.
+
+DOCSET_FEEDNAME        = "Doxygen generated docs"
+
+# This tag specifies a string that should uniquely identify the documentation
+# set bundle. This should be a reverse domain-name style string, e.g.
+# com.mycompany.MyDocSet. Doxygen will append .docset to the name.
+# The default value is: org.doxygen.Project.
+# This tag requires that the tag GENERATE_DOCSET is set to YES.
+
+DOCSET_BUNDLE_ID       = org.doxygen.Project
+
+# The DOCSET_PUBLISHER_ID tag specifies a string that should uniquely identify
+# the documentation publisher. This should be a reverse domain-name style
+# string, e.g. com.mycompany.MyDocSet.documentation.
+# The default value is: org.doxygen.Publisher.
+# This tag requires that the tag GENERATE_DOCSET is set to YES.
+
+DOCSET_PUBLISHER_ID    = org.doxygen.Publisher
+
+# The DOCSET_PUBLISHER_NAME tag identifies the documentation publisher.
+# The default value is: Publisher.
+# This tag requires that the tag GENERATE_DOCSET is set to YES.
+
+DOCSET_PUBLISHER_NAME  = Publisher
+
+# If the GENERATE_HTMLHELP tag is set to YES then doxygen generates three
+# additional HTML index files: index.hhp, index.hhc, and index.hhk. The
+# index.hhp is a project file that can be read by Microsoft's HTML Help Workshop
+# (see: https://www.microsoft.com/en-us/download/details.aspx?id=21138) on
+# Windows.
+#
+# The HTML Help Workshop contains a compiler that can convert all HTML output
+# generated by doxygen into a single compiled HTML file (.chm). Compiled HTML
+# files are now used as the Windows 98 help format, and will replace the old
+# Windows help format (.hlp) on all Windows platforms in the future. Compressed
+# HTML files also contain an index, a table of contents, and you can search for
+# words in the documentation. The HTML workshop also contains a viewer for
+# compressed HTML files.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+GENERATE_HTMLHELP      = NO
+
+# The CHM_FILE tag can be used to specify the file name of the resulting .chm
+# file. You can add a path in front of the file if the result should not be
+# written to the html output directory.
+# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
+
+CHM_FILE               =
+
+# The HHC_LOCATION tag can be used to specify the location (absolute path
+# including file name) of the HTML help compiler (hhc.exe). If non-empty,
+# doxygen will try to run the HTML help compiler on the generated index.hhp.
+# The file has to be specified with full path.
+# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
+
+HHC_LOCATION           =
+
+# The GENERATE_CHI flag controls if a separate .chi index file is generated
+# (YES) or that it should be included in the main .chm file (NO).
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
+
+GENERATE_CHI           = NO
+
+# The CHM_INDEX_ENCODING is used to encode HtmlHelp index (hhk), content (hhc)
+# and project file content.
+# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
+
+CHM_INDEX_ENCODING     =
+
+# The BINARY_TOC flag controls whether a binary table of contents is generated
+# (YES) or a normal table of contents (NO) in the .chm file. Furthermore it
+# enables the Previous and Next buttons.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
+
+BINARY_TOC             = NO
+
+# The TOC_EXPAND flag can be set to YES to add extra items for group members to
+# the table of contents of the HTML help documentation and to the tree view.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
+
+TOC_EXPAND             = NO
+
+# If the GENERATE_QHP tag is set to YES and both QHP_NAMESPACE and
+# QHP_VIRTUAL_FOLDER are set, an additional index file will be generated that
+# can be used as input for Qt's qhelpgenerator to generate a Qt Compressed Help
+# (.qch) of the generated HTML documentation.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+GENERATE_QHP           = NO
+
+# If the QHG_LOCATION tag is specified, the QCH_FILE tag can be used to specify
+# the file name of the resulting .qch file. The path specified is relative to
+# the HTML output folder.
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QCH_FILE               =
+
+# The QHP_NAMESPACE tag specifies the namespace to use when generating Qt Help
+# Project output. For more information please see Qt Help Project / Namespace
+# (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#namespace).
+# The default value is: org.doxygen.Project.
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QHP_NAMESPACE          = org.doxygen.Project
+
+# The QHP_VIRTUAL_FOLDER tag specifies the namespace to use when generating Qt
+# Help Project output. For more information please see Qt Help Project / Virtual
+# Folders (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#virtual-
+# folders).
+# The default value is: doc.
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QHP_VIRTUAL_FOLDER     = doc
+
+# If the QHP_CUST_FILTER_NAME tag is set, it specifies the name of a custom
+# filter to add. For more information please see Qt Help Project / Custom
+# Filters (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#custom-
+# filters).
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QHP_CUST_FILTER_NAME   =
+
+# The QHP_CUST_FILTER_ATTRS tag specifies the list of the attributes of the
+# custom filter to add. For more information please see Qt Help Project / Custom
+# Filters (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#custom-
+# filters).
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QHP_CUST_FILTER_ATTRS  =
+
+# The QHP_SECT_FILTER_ATTRS tag specifies the list of the attributes this
+# project's filter section matches. Qt Help Project / Filter Attributes (see:
+# https://doc.qt.io/archives/qt-4.8/qthelpproject.html#filter-attributes).
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QHP_SECT_FILTER_ATTRS  =
+
+# The QHG_LOCATION tag can be used to specify the location of Qt's
+# qhelpgenerator. If non-empty doxygen will try to run qhelpgenerator on the
+# generated .qhp file.
+# This tag requires that the tag GENERATE_QHP is set to YES.
+
+QHG_LOCATION           =
+
+# If the GENERATE_ECLIPSEHELP tag is set to YES, additional index files will be
+# generated, together with the HTML files, they form an Eclipse help plugin. To
+# install this plugin and make it available under the help contents menu in
+# Eclipse, the contents of the directory containing the HTML and XML files needs
+# to be copied into the plugins directory of eclipse. The name of the directory
+# within the plugins directory should be the same as the ECLIPSE_DOC_ID value.
+# After copying Eclipse needs to be restarted before the help appears.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+GENERATE_ECLIPSEHELP   = NO
+
+# A unique identifier for the Eclipse help plugin. When installing the plugin
+# the directory name containing the HTML and XML files should also have this
+# name. Each documentation set should have its own identifier.
+# The default value is: org.doxygen.Project.
+# This tag requires that the tag GENERATE_ECLIPSEHELP is set to YES.
+
+ECLIPSE_DOC_ID         = org.doxygen.Project
+
+# If you want full control over the layout of the generated HTML pages it might
+# be necessary to disable the index and replace it with your own. The
+# DISABLE_INDEX tag can be used to turn on/off the condensed index (tabs) at top
+# of each HTML page. A value of NO enables the index and the value YES disables
+# it. Since the tabs in the index contain the same information as the navigation
+# tree, you can set this option to YES if you also set GENERATE_TREEVIEW to YES.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+DISABLE_INDEX          = NO
+
+# The GENERATE_TREEVIEW tag is used to specify whether a tree-like index
+# structure should be generated to display hierarchical information. If the tag
+# value is set to YES, a side panel will be generated containing a tree-like
+# index structure (just like the one that is generated for HTML Help). For this
+# to work a browser that supports JavaScript, DHTML, CSS and frames is required
+# (i.e. any modern browser). Windows users are probably better off using the
+# HTML help feature. Via custom style sheets (see HTML_EXTRA_STYLESHEET) one can
+# further fine-tune the look of the index. As an example, the default style
+# sheet generated by doxygen has an example that shows how to put an image at
+# the root of the tree instead of the PROJECT_NAME. Since the tree basically has
+# the same information as the tab index, you could consider setting
+# DISABLE_INDEX to YES when enabling this option.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+GENERATE_TREEVIEW      = YES
+
+# The ENUM_VALUES_PER_LINE tag can be used to set the number of enum values that
+# doxygen will group on one line in the generated HTML documentation.
+#
+# Note that a value of 0 will completely suppress the enum values from appearing
+# in the overview section.
+# Minimum value: 0, maximum value: 20, default value: 4.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+ENUM_VALUES_PER_LINE   = 4
+
+# If the treeview is enabled (see GENERATE_TREEVIEW) then this tag can be used
+# to set the initial width (in pixels) of the frame in which the tree is shown.
+# Minimum value: 0, maximum value: 1500, default value: 250.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+TREEVIEW_WIDTH         = 250
+
+# If the EXT_LINKS_IN_WINDOW option is set to YES, doxygen will open links to
+# external symbols imported via tag files in a separate window.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+EXT_LINKS_IN_WINDOW    = NO
+
+# If the HTML_FORMULA_FORMAT option is set to svg, doxygen will use the pdf2svg
+# tool (see https://github.com/dawbarton/pdf2svg) or inkscape (see
+# https://inkscape.org) to generate formulas as SVG images instead of PNGs for
+# the HTML output. These images will generally look nicer at scaled resolutions.
+# Possible values are: png (the default) and svg (looks nicer but requires the
+# pdf2svg or inkscape tool).
+# The default value is: png.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+HTML_FORMULA_FORMAT    = png
+
+# Use this tag to change the font size of LaTeX formulas included as images in
+# the HTML documentation. When you change the font size after a successful
+# doxygen run you need to manually remove any form_*.png images from the HTML
+# output directory to force them to be regenerated.
+# Minimum value: 8, maximum value: 50, default value: 10.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+FORMULA_FONTSIZE       = 10
+
+# Use the FORMULA_TRANSPARENT tag to determine whether or not the images
+# generated for formulas are transparent PNGs. Transparent PNGs are not
+# supported properly for IE 6.0, but are supported on all modern browsers.
+#
+# Note that when changing this option you need to delete any form_*.png files in
+# the HTML output directory before the changes have effect.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+FORMULA_TRANSPARENT    = YES
+
+# The FORMULA_MACROFILE can contain LaTeX \newcommand and \renewcommand commands
+# to create new LaTeX commands to be used in formulas as building blocks. See
+# the section "Including formulas" for details.
+
+FORMULA_MACROFILE      =
+
+# Enable the USE_MATHJAX option to render LaTeX formulas using MathJax (see
+# https://www.mathjax.org) which uses client side JavaScript for the rendering
+# instead of using pre-rendered bitmaps. Use this if you do not have LaTeX
+# installed or if you want to formulas look prettier in the HTML output. When
+# enabled you may also need to install MathJax separately and configure the path
+# to it using the MATHJAX_RELPATH option.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+USE_MATHJAX            = NO
+
+# When MathJax is enabled you can set the default output format to be used for
+# the MathJax output. See the MathJax site (see:
+# http://docs.mathjax.org/en/latest/output.html) for more details.
+# Possible values are: HTML-CSS (which is slower, but has the best
+# compatibility), NativeMML (i.e. MathML) and SVG.
+# The default value is: HTML-CSS.
+# This tag requires that the tag USE_MATHJAX is set to YES.
+
+MATHJAX_FORMAT         = HTML-CSS
+
+# When MathJax is enabled you need to specify the location relative to the HTML
+# output directory using the MATHJAX_RELPATH option. The destination directory
+# should contain the MathJax.js script. For instance, if the mathjax directory
+# is located at the same level as the HTML output directory, then
+# MATHJAX_RELPATH should be ../mathjax. The default value points to the MathJax
+# Content Delivery Network so you can quickly see the result without installing
+# MathJax. However, it is strongly recommended to install a local copy of
+# MathJax from https://www.mathjax.org before deployment.
+# The default value is: https://cdn.jsdelivr.net/npm/mathjax@2.
+# This tag requires that the tag USE_MATHJAX is set to YES.
+
+MATHJAX_RELPATH        = https://cdn.jsdelivr.net/npm/mathjax@2
+
+# The MATHJAX_EXTENSIONS tag can be used to specify one or more MathJax
+# extension names that should be enabled during MathJax rendering. For example
+# MATHJAX_EXTENSIONS = TeX/AMSmath TeX/AMSsymbols
+# This tag requires that the tag USE_MATHJAX is set to YES.
+
+MATHJAX_EXTENSIONS     =
+
+# The MATHJAX_CODEFILE tag can be used to specify a file with javascript pieces
+# of code that will be used on startup of the MathJax code. See the MathJax site
+# (see: http://docs.mathjax.org/en/latest/output.html) for more details. For an
+# example see the documentation.
+# This tag requires that the tag USE_MATHJAX is set to YES.
+
+MATHJAX_CODEFILE       =
+
+# When the SEARCHENGINE tag is enabled doxygen will generate a search box for
+# the HTML output. The underlying search engine uses javascript and DHTML and
+# should work on any modern browser. Note that when using HTML help
+# (GENERATE_HTMLHELP), Qt help (GENERATE_QHP), or docsets (GENERATE_DOCSET)
+# there is already a search function so this one should typically be disabled.
+# For large projects the javascript based search engine can be slow, then
+# enabling SERVER_BASED_SEARCH may provide a better solution. It is possible to
+# search using the keyboard; to jump to the search box use <access key> + S
+# (what the <access key> is depends on the OS and browser, but it is typically
+# <CTRL>, <ALT>/<option>, or both). Inside the search box use the <cursor down
+# key> to jump into the search results window, the results can be navigated
+# using the <cursor keys>. Press <Enter> to select an item or <escape> to cancel
+# the search. The filter options can be selected when the cursor is inside the
+# search box by pressing <Shift>+<cursor down>. Also here use the <cursor keys>
+# to select a filter and <Enter> or <escape> to activate or cancel the filter
+# option.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_HTML is set to YES.
+
+SEARCHENGINE           = YES
+
+# When the SERVER_BASED_SEARCH tag is enabled the search engine will be
+# implemented using a web server instead of a web client using JavaScript. There
+# are two flavors of web server based searching depending on the EXTERNAL_SEARCH
+# setting. When disabled, doxygen will generate a PHP script for searching and
+# an index file used by the script. When EXTERNAL_SEARCH is enabled the indexing
+# and searching needs to be provided by external tools. See the section
+# "External Indexing and Searching" for details.
+# The default value is: NO.
+# This tag requires that the tag SEARCHENGINE is set to YES.
+
+SERVER_BASED_SEARCH    = NO
+
+# When EXTERNAL_SEARCH tag is enabled doxygen will no longer generate the PHP
+# script for searching. Instead the search results are written to an XML file
+# which needs to be processed by an external indexer. Doxygen will invoke an
+# external search engine pointed to by the SEARCHENGINE_URL option to obtain the
+# search results.
+#
+# Doxygen ships with an example indexer (doxyindexer) and search engine
+# (doxysearch.cgi) which are based on the open source search engine library
+# Xapian (see: https://xapian.org/).
+#
+# See the section "External Indexing and Searching" for details.
+# The default value is: NO.
+# This tag requires that the tag SEARCHENGINE is set to YES.
+
+EXTERNAL_SEARCH        = NO
+
+# The SEARCHENGINE_URL should point to a search engine hosted by a web server
+# which will return the search results when EXTERNAL_SEARCH is enabled.
+#
+# Doxygen ships with an example indexer (doxyindexer) and search engine
+# (doxysearch.cgi) which are based on the open source search engine library
+# Xapian (see: https://xapian.org/). See the section "External Indexing and
+# Searching" for details.
+# This tag requires that the tag SEARCHENGINE is set to YES.
+
+SEARCHENGINE_URL       =
+
+# When SERVER_BASED_SEARCH and EXTERNAL_SEARCH are both enabled the unindexed
+# search data is written to a file for indexing by an external tool. With the
+# SEARCHDATA_FILE tag the name of this file can be specified.
+# The default file is: searchdata.xml.
+# This tag requires that the tag SEARCHENGINE is set to YES.
+
+SEARCHDATA_FILE        = searchdata.xml
+
+# When SERVER_BASED_SEARCH and EXTERNAL_SEARCH are both enabled the
+# EXTERNAL_SEARCH_ID tag can be used as an identifier for the project. This is
+# useful in combination with EXTRA_SEARCH_MAPPINGS to search through multiple
+# projects and redirect the results back to the right project.
+# This tag requires that the tag SEARCHENGINE is set to YES.
+
+EXTERNAL_SEARCH_ID     =
+
+# The EXTRA_SEARCH_MAPPINGS tag can be used to enable searching through doxygen
+# projects other than the one defined by this configuration file, but that are
+# all added to the same external search index. Each project needs to have a
+# unique id set via EXTERNAL_SEARCH_ID. The search mapping then maps the id of
+# to a relative location where the documentation can be found. The format is:
+# EXTRA_SEARCH_MAPPINGS = tagname1=loc1 tagname2=loc2 ...
+# This tag requires that the tag SEARCHENGINE is set to YES.
+
+EXTRA_SEARCH_MAPPINGS  =
+
+#---------------------------------------------------------------------------
+# Configuration options related to the LaTeX output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_LATEX tag is set to YES, doxygen will generate LaTeX output.
+# The default value is: YES.
+
+GENERATE_LATEX         = YES
+
+# The LATEX_OUTPUT tag is used to specify where the LaTeX docs will be put. If a
+# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
+# it.
+# The default directory is: latex.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_OUTPUT           = latex
+
+# The LATEX_CMD_NAME tag can be used to specify the LaTeX command name to be
+# invoked.
+#
+# Note that when not enabling USE_PDFLATEX the default is latex when enabling
+# USE_PDFLATEX the default is pdflatex and when in the later case latex is
+# chosen this is overwritten by pdflatex. For specific output languages the
+# default can have been set differently, this depends on the implementation of
+# the output language.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_CMD_NAME         =
+
+# The MAKEINDEX_CMD_NAME tag can be used to specify the command name to generate
+# index for LaTeX.
+# Note: This tag is used in the Makefile / make.bat.
+# See also: LATEX_MAKEINDEX_CMD for the part in the generated output file
+# (.tex).
+# The default file is: makeindex.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+MAKEINDEX_CMD_NAME     = makeindex
+
+# The LATEX_MAKEINDEX_CMD tag can be used to specify the command name to
+# generate index for LaTeX. In case there is no backslash (\) as first character
+# it will be automatically added in the LaTeX code.
+# Note: This tag is used in the generated output file (.tex).
+# See also: MAKEINDEX_CMD_NAME for the part in the Makefile / make.bat.
+# The default value is: makeindex.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_MAKEINDEX_CMD    = makeindex
+
+# If the COMPACT_LATEX tag is set to YES, doxygen generates more compact LaTeX
+# documents. This may be useful for small projects and may help to save some
+# trees in general.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+COMPACT_LATEX          = NO
+
+# The PAPER_TYPE tag can be used to set the paper type that is used by the
+# printer.
+# Possible values are: a4 (210 x 297 mm), letter (8.5 x 11 inches), legal (8.5 x
+# 14 inches) and executive (7.25 x 10.5 inches).
+# The default value is: a4.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+PAPER_TYPE             = a4
+
+# The EXTRA_PACKAGES tag can be used to specify one or more LaTeX package names
+# that should be included in the LaTeX output. The package can be specified just
+# by its name or with the correct syntax as to be used with the LaTeX
+# \usepackage command. To get the times font for instance you can specify :
+# EXTRA_PACKAGES=times or EXTRA_PACKAGES={times}
+# To use the option intlimits with the amsmath package you can specify:
+# EXTRA_PACKAGES=[intlimits]{amsmath}
+# If left blank no extra packages will be included.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+EXTRA_PACKAGES         =
+
+# The LATEX_HEADER tag can be used to specify a personal LaTeX header for the
+# generated LaTeX document. The header should contain everything until the first
+# chapter. If it is left blank doxygen will generate a standard header. See
+# section "Doxygen usage" for information on how to let doxygen write the
+# default header to a separate file.
+#
+# Note: Only use a user-defined header if you know what you are doing! The
+# following commands have a special meaning inside the header: $title,
+# $datetime, $date, $doxygenversion, $projectname, $projectnumber,
+# $projectbrief, $projectlogo. Doxygen will replace $title with the empty
+# string, for the replacement values of the other commands the user is referred
+# to HTML_HEADER.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_HEADER           =
+
+# The LATEX_FOOTER tag can be used to specify a personal LaTeX footer for the
+# generated LaTeX document. The footer should contain everything after the last
+# chapter. If it is left blank doxygen will generate a standard footer. See
+# LATEX_HEADER for more information on how to generate a default footer and what
+# special commands can be used inside the footer.
+#
+# Note: Only use a user-defined footer if you know what you are doing!
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_FOOTER           =
+
+# The LATEX_EXTRA_STYLESHEET tag can be used to specify additional user-defined
+# LaTeX style sheets that are included after the standard style sheets created
+# by doxygen. Using this option one can overrule certain style aspects. Doxygen
+# will copy the style sheet files to the output directory.
+# Note: The order of the extra style sheet files is of importance (e.g. the last
+# style sheet in the list overrules the setting of the previous ones in the
+# list).
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_EXTRA_STYLESHEET =
+
+# The LATEX_EXTRA_FILES tag can be used to specify one or more extra images or
+# other source files which should be copied to the LATEX_OUTPUT output
+# directory. Note that the files will be copied as-is; there are no commands or
+# markers available.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_EXTRA_FILES      =
+
+# If the PDF_HYPERLINKS tag is set to YES, the LaTeX that is generated is
+# prepared for conversion to PDF (using ps2pdf or pdflatex). The PDF file will
+# contain links (just like the HTML output) instead of page references. This
+# makes the output suitable for online browsing using a PDF viewer.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+PDF_HYPERLINKS         = YES
+
+# If the USE_PDFLATEX tag is set to YES, doxygen will use the engine as
+# specified with LATEX_CMD_NAME to generate the PDF file directly from the LaTeX
+# files. Set this option to YES, to get a higher quality PDF documentation.
+#
+# See also section LATEX_CMD_NAME for selecting the engine.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+USE_PDFLATEX           = YES
+
+# If the LATEX_BATCHMODE tag is set to YES, doxygen will add the \batchmode
+# command to the generated LaTeX files. This will instruct LaTeX to keep running
+# if errors occur, instead of asking the user for help. This option is also used
+# when generating formulas in HTML.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_BATCHMODE        = NO
+
+# If the LATEX_HIDE_INDICES tag is set to YES then doxygen will not include the
+# index chapters (such as File Index, Compound Index, etc.) in the output.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_HIDE_INDICES     = NO
+
+# If the LATEX_SOURCE_CODE tag is set to YES then doxygen will include source
+# code with syntax highlighting in the LaTeX output.
+#
+# Note that which sources are shown also depends on other settings such as
+# SOURCE_BROWSER.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_SOURCE_CODE      = NO
+
+# The LATEX_BIB_STYLE tag can be used to specify the style to use for the
+# bibliography, e.g. plainnat, or ieeetr. See
+# https://en.wikipedia.org/wiki/BibTeX and \cite for more info.
+# The default value is: plain.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_BIB_STYLE        = plain
+
+# If the LATEX_TIMESTAMP tag is set to YES then the footer of each generated
+# page will contain the date and time when the page was generated. Setting this
+# to NO can help when comparing the output of multiple runs.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_TIMESTAMP        = NO
+
+# The LATEX_EMOJI_DIRECTORY tag is used to specify the (relative or absolute)
+# path from which the emoji images will be read. If a relative path is entered,
+# it will be relative to the LATEX_OUTPUT directory. If left blank the
+# LATEX_OUTPUT directory will be used.
+# This tag requires that the tag GENERATE_LATEX is set to YES.
+
+LATEX_EMOJI_DIRECTORY  =
+
+#---------------------------------------------------------------------------
+# Configuration options related to the RTF output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_RTF tag is set to YES, doxygen will generate RTF output. The
+# RTF output is optimized for Word 97 and may not look too pretty with other RTF
+# readers/editors.
+# The default value is: NO.
+
+GENERATE_RTF           = NO
+
+# The RTF_OUTPUT tag is used to specify where the RTF docs will be put. If a
+# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
+# it.
+# The default directory is: rtf.
+# This tag requires that the tag GENERATE_RTF is set to YES.
+
+RTF_OUTPUT             = rtf
+
+# If the COMPACT_RTF tag is set to YES, doxygen generates more compact RTF
+# documents. This may be useful for small projects and may help to save some
+# trees in general.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_RTF is set to YES.
+
+COMPACT_RTF            = NO
+
+# If the RTF_HYPERLINKS tag is set to YES, the RTF that is generated will
+# contain hyperlink fields. The RTF file will contain links (just like the HTML
+# output) instead of page references. This makes the output suitable for online
+# browsing using Word or some other Word compatible readers that support those
+# fields.
+#
+# Note: WordPad (write) and others do not support links.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_RTF is set to YES.
+
+RTF_HYPERLINKS         = NO
+
+# Load stylesheet definitions from file. Syntax is similar to doxygen's
+# configuration file, i.e. a series of assignments. You only have to provide
+# replacements, missing definitions are set to their default value.
+#
+# See also section "Doxygen usage" for information on how to generate the
+# default style sheet that doxygen normally uses.
+# This tag requires that the tag GENERATE_RTF is set to YES.
+
+RTF_STYLESHEET_FILE    =
+
+# Set optional variables used in the generation of an RTF document. Syntax is
+# similar to doxygen's configuration file. A template extensions file can be
+# generated using doxygen -e rtf extensionFile.
+# This tag requires that the tag GENERATE_RTF is set to YES.
+
+RTF_EXTENSIONS_FILE    =
+
+# If the RTF_SOURCE_CODE tag is set to YES then doxygen will include source code
+# with syntax highlighting in the RTF output.
+#
+# Note that which sources are shown also depends on other settings such as
+# SOURCE_BROWSER.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_RTF is set to YES.
+
+RTF_SOURCE_CODE        = NO
+
+#---------------------------------------------------------------------------
+# Configuration options related to the man page output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_MAN tag is set to YES, doxygen will generate man pages for
+# classes and files.
+# The default value is: NO.
+
+GENERATE_MAN           = NO
+
+# The MAN_OUTPUT tag is used to specify where the man pages will be put. If a
+# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
+# it. A directory man3 will be created inside the directory specified by
+# MAN_OUTPUT.
+# The default directory is: man.
+# This tag requires that the tag GENERATE_MAN is set to YES.
+
+MAN_OUTPUT             = man
+
+# The MAN_EXTENSION tag determines the extension that is added to the generated
+# man pages. In case the manual section does not start with a number, the number
+# 3 is prepended. The dot (.) at the beginning of the MAN_EXTENSION tag is
+# optional.
+# The default value is: .3.
+# This tag requires that the tag GENERATE_MAN is set to YES.
+
+MAN_EXTENSION          = .3
+
+# The MAN_SUBDIR tag determines the name of the directory created within
+# MAN_OUTPUT in which the man pages are placed. If defaults to man followed by
+# MAN_EXTENSION with the initial . removed.
+# This tag requires that the tag GENERATE_MAN is set to YES.
+
+MAN_SUBDIR             =
+
+# If the MAN_LINKS tag is set to YES and doxygen generates man output, then it
+# will generate one additional man file for each entity documented in the real
+# man page(s). These additional files only source the real man page, but without
+# them the man command would be unable to find the correct page.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_MAN is set to YES.
+
+MAN_LINKS              = NO
+
+#---------------------------------------------------------------------------
+# Configuration options related to the XML output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_XML tag is set to YES, doxygen will generate an XML file that
+# captures the structure of the code including all documentation.
+# The default value is: NO.
+
+GENERATE_XML           = NO
+
+# The XML_OUTPUT tag is used to specify where the XML pages will be put. If a
+# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
+# it.
+# The default directory is: xml.
+# This tag requires that the tag GENERATE_XML is set to YES.
+
+XML_OUTPUT             = xml
+
+# If the XML_PROGRAMLISTING tag is set to YES, doxygen will dump the program
+# listings (including syntax highlighting and cross-referencing information) to
+# the XML output. Note that enabling this will significantly increase the size
+# of the XML output.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_XML is set to YES.
+
+XML_PROGRAMLISTING     = YES
+
+# If the XML_NS_MEMB_FILE_SCOPE tag is set to YES, doxygen will include
+# namespace members in file scope as well, matching the HTML output.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_XML is set to YES.
+
+XML_NS_MEMB_FILE_SCOPE = NO
+
+#---------------------------------------------------------------------------
+# Configuration options related to the DOCBOOK output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_DOCBOOK tag is set to YES, doxygen will generate Docbook files
+# that can be used to generate PDF.
+# The default value is: NO.
+
+GENERATE_DOCBOOK       = NO
+
+# The DOCBOOK_OUTPUT tag is used to specify where the Docbook pages will be put.
+# If a relative path is entered the value of OUTPUT_DIRECTORY will be put in
+# front of it.
+# The default directory is: docbook.
+# This tag requires that the tag GENERATE_DOCBOOK is set to YES.
+
+DOCBOOK_OUTPUT         = docbook
+
+# If the DOCBOOK_PROGRAMLISTING tag is set to YES, doxygen will include the
+# program listings (including syntax highlighting and cross-referencing
+# information) to the DOCBOOK output. Note that enabling this will significantly
+# increase the size of the DOCBOOK output.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_DOCBOOK is set to YES.
+
+DOCBOOK_PROGRAMLISTING = NO
+
+#---------------------------------------------------------------------------
+# Configuration options for the AutoGen Definitions output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_AUTOGEN_DEF tag is set to YES, doxygen will generate an
+# AutoGen Definitions (see http://autogen.sourceforge.net/) file that captures
+# the structure of the code including all documentation. Note that this feature
+# is still experimental and incomplete at the moment.
+# The default value is: NO.
+
+GENERATE_AUTOGEN_DEF   = NO
+
+#---------------------------------------------------------------------------
+# Configuration options related to the Perl module output
+#---------------------------------------------------------------------------
+
+# If the GENERATE_PERLMOD tag is set to YES, doxygen will generate a Perl module
+# file that captures the structure of the code including all documentation.
+#
+# Note that this feature is still experimental and incomplete at the moment.
+# The default value is: NO.
+
+GENERATE_PERLMOD       = NO
+
+# If the PERLMOD_LATEX tag is set to YES, doxygen will generate the necessary
+# Makefile rules, Perl scripts and LaTeX code to be able to generate PDF and DVI
+# output from the Perl module output.
+# The default value is: NO.
+# This tag requires that the tag GENERATE_PERLMOD is set to YES.
+
+PERLMOD_LATEX          = NO
+
+# If the PERLMOD_PRETTY tag is set to YES, the Perl module output will be nicely
+# formatted so it can be parsed by a human reader. This is useful if you want to
+# understand what is going on. On the other hand, if this tag is set to NO, the
+# size of the Perl module output will be much smaller and Perl will parse it
+# just the same.
+# The default value is: YES.
+# This tag requires that the tag GENERATE_PERLMOD is set to YES.
+
+PERLMOD_PRETTY         = YES
+
+# The names of the make variables in the generated doxyrules.make file are
+# prefixed with the string contained in PERLMOD_MAKEVAR_PREFIX. This is useful
+# so different doxyrules.make files included by the same Makefile don't
+# overwrite each other's variables.
+# This tag requires that the tag GENERATE_PERLMOD is set to YES.
+
+PERLMOD_MAKEVAR_PREFIX =
+
+#---------------------------------------------------------------------------
+# Configuration options related to the preprocessor
+#---------------------------------------------------------------------------
+
+# If the ENABLE_PREPROCESSING tag is set to YES, doxygen will evaluate all
+# C-preprocessor directives found in the sources and include files.
+# The default value is: YES.
+
+ENABLE_PREPROCESSING   = YES
+
+# If the MACRO_EXPANSION tag is set to YES, doxygen will expand all macro names
+# in the source code. If set to NO, only conditional compilation will be
+# performed. Macro expansion can be done in a controlled way by setting
+# EXPAND_ONLY_PREDEF to YES.
+# The default value is: NO.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+MACRO_EXPANSION        = YES
+
+# If the EXPAND_ONLY_PREDEF and MACRO_EXPANSION tags are both set to YES then
+# the macro expansion is limited to the macros specified with the PREDEFINED and
+# EXPAND_AS_DEFINED tags.
+# The default value is: NO.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+EXPAND_ONLY_PREDEF     = YES
+
+# If the SEARCH_INCLUDES tag is set to YES, the include files in the
+# INCLUDE_PATH will be searched if a #include is found.
+# The default value is: YES.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+SEARCH_INCLUDES        = YES
+
+# The INCLUDE_PATH tag can be used to specify one or more directories that
+# contain include files that are not input files but should be processed by the
+# preprocessor.
+# This tag requires that the tag SEARCH_INCLUDES is set to YES.
+
+INCLUDE_PATH           =
+
+# You can use the INCLUDE_FILE_PATTERNS tag to specify one or more wildcard
+# patterns (like *.h and *.hpp) to filter out the header-files in the
+# directories. If left blank, the patterns specified with FILE_PATTERNS will be
+# used.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+INCLUDE_FILE_PATTERNS  =
+
+# The PREDEFINED tag can be used to specify one or more macro names that are
+# defined before the preprocessor is started (similar to the -D option of e.g.
+# gcc). The argument of the tag is a list of macros of the form: name or
+# name=definition (no spaces). If the definition and the "=" are omitted, "=1"
+# is assumed. To prevent a macro definition from being undefined via #undef or
+# recursively expanded use the := operator instead of the = operator.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+PREDEFINED             = "XXH_PUBLIC_API=" \
+                         "XXH_FORCE_INLINE=static inline" \
+                         "XXH_NO_INLINE=static" \
+                         "XXH_RESTRICT=restrict" \
+                         "XSUM_API=" \
+                         "XXH_DOXYGEN=" \
+                         "XXH_STATIC_LINKING_ONLY" \
+                         "XXH_IMPLEMENTATION" \
+                         "XXH_ALIGN(N)=alignas(N)" \
+                         "XXH_ALIGN_MEMBER(align,type)=alignas(align) type"
+
+
+# If the MACRO_EXPANSION and EXPAND_ONLY_PREDEF tags are set to YES then this
+# tag can be used to specify a list of macro names that should be expanded. The
+# macro definition that is found in the sources will be used. Use the PREDEFINED
+# tag if you want to use a different macro definition that overrules the
+# definition found in the source code.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+EXPAND_AS_DEFINED      =
+
+
+# If the SKIP_FUNCTION_MACROS tag is set to YES then doxygen's preprocessor will
+# remove all references to function-like macros that are alone on a line, have
+# an all uppercase name, and do not end with a semicolon. Such function macros
+# are typically used for boiler-plate code, and will confuse the parser if not
+# removed.
+# The default value is: YES.
+# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
+
+SKIP_FUNCTION_MACROS   = YES
+
+#---------------------------------------------------------------------------
+# Configuration options related to external references
+#---------------------------------------------------------------------------
+
+# The TAGFILES tag can be used to specify one or more tag files. For each tag
+# file the location of the external documentation should be added. The format of
+# a tag file without this location is as follows:
+# TAGFILES = file1 file2 ...
+# Adding location for the tag files is done as follows:
+# TAGFILES = file1=loc1 "file2 = loc2" ...
+# where loc1 and loc2 can be relative or absolute paths or URLs. See the
+# section "Linking to external documentation" for more information about the use
+# of tag files.
+# Note: Each tag file must have a unique name (where the name does NOT include
+# the path). If a tag file is not located in the directory in which doxygen is
+# run, you must also specify the path to the tagfile here.
+
+TAGFILES               =
+
+# When a file name is specified after GENERATE_TAGFILE, doxygen will create a
+# tag file that is based on the input files it reads. See section "Linking to
+# external documentation" for more information about the usage of tag files.
+
+GENERATE_TAGFILE       =
+
+# If the ALLEXTERNALS tag is set to YES, all external class will be listed in
+# the class index. If set to NO, only the inherited external classes will be
+# listed.
+# The default value is: NO.
+
+ALLEXTERNALS           = NO
+
+# If the EXTERNAL_GROUPS tag is set to YES, all external groups will be listed
+# in the modules index. If set to NO, only the current project's groups will be
+# listed.
+# The default value is: YES.
+
+EXTERNAL_GROUPS        = YES
+
+# If the EXTERNAL_PAGES tag is set to YES, all external pages will be listed in
+# the related pages index. If set to NO, only the current project's pages will
+# be listed.
+# The default value is: YES.
+
+EXTERNAL_PAGES         = YES
+
+#---------------------------------------------------------------------------
+# Configuration options related to the dot tool
+#---------------------------------------------------------------------------
+
+# If the CLASS_DIAGRAMS tag is set to YES, doxygen will generate a class diagram
+# (in HTML and LaTeX) for classes with base or super classes. Setting the tag to
+# NO turns the diagrams off. Note that this option also works with HAVE_DOT
+# disabled, but it is recommended to install and use dot, since it yields more
+# powerful graphs.
+# The default value is: YES.
+
+CLASS_DIAGRAMS         = YES
+
+# You can include diagrams made with dia in doxygen documentation. Doxygen will
+# then run dia to produce the diagram and insert it in the documentation. The
+# DIA_PATH tag allows you to specify the directory where the dia binary resides.
+# If left empty dia is assumed to be found in the default search path.
+
+DIA_PATH               =
+
+# If set to YES the inheritance and collaboration graphs will hide inheritance
+# and usage relations if the target is undocumented or is not a class.
+# The default value is: YES.
+
+HIDE_UNDOC_RELATIONS   = YES
+
+# If you set the HAVE_DOT tag to YES then doxygen will assume the dot tool is
+# available from the path. This tool is part of Graphviz (see:
+# http://www.graphviz.org/), a graph visualization toolkit from AT&T and Lucent
+# Bell Labs. The other options in this section have no effect if this option is
+# set to NO
+# The default value is: NO.
+
+HAVE_DOT               = NO
+
+# The DOT_NUM_THREADS specifies the number of dot invocations doxygen is allowed
+# to run in parallel. When set to 0 doxygen will base this on the number of
+# processors available in the system. You can set it explicitly to a value
+# larger than 0 to get control over the balance between CPU load and processing
+# speed.
+# Minimum value: 0, maximum value: 32, default value: 0.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_NUM_THREADS        = 0
+
+# When you want a differently looking font in the dot files that doxygen
+# generates you can specify the font name using DOT_FONTNAME. You need to make
+# sure dot is able to find the font, which can be done by putting it in a
+# standard location or by setting the DOTFONTPATH environment variable or by
+# setting DOT_FONTPATH to the directory containing the font.
+# The default value is: Helvetica.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_FONTNAME           = Helvetica
+
+# The DOT_FONTSIZE tag can be used to set the size (in points) of the font of
+# dot graphs.
+# Minimum value: 4, maximum value: 24, default value: 10.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_FONTSIZE           = 10
+
+# By default doxygen will tell dot to use the default font as specified with
+# DOT_FONTNAME. If you specify a different font using DOT_FONTNAME you can set
+# the path where dot can find it using this tag.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_FONTPATH           =
+
+# If the CLASS_GRAPH tag is set to YES then doxygen will generate a graph for
+# each documented class showing the direct and indirect inheritance relations.
+# Setting this tag to YES will force the CLASS_DIAGRAMS tag to NO.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+CLASS_GRAPH            = YES
+
+# If the COLLABORATION_GRAPH tag is set to YES then doxygen will generate a
+# graph for each documented class showing the direct and indirect implementation
+# dependencies (inheritance, containment, and class references variables) of the
+# class with other documented classes.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+COLLABORATION_GRAPH    = YES
+
+# If the GROUP_GRAPHS tag is set to YES then doxygen will generate a graph for
+# groups, showing the direct groups dependencies.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+GROUP_GRAPHS           = YES
+
+# If the UML_LOOK tag is set to YES, doxygen will generate inheritance and
+# collaboration diagrams in a style similar to the OMG's Unified Modeling
+# Language.
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+UML_LOOK               = NO
+
+# If the UML_LOOK tag is enabled, the fields and methods are shown inside the
+# class node. If there are many fields or methods and many nodes the graph may
+# become too big to be useful. The UML_LIMIT_NUM_FIELDS threshold limits the
+# number of items for each type to make the size more manageable. Set this to 0
+# for no limit. Note that the threshold may be exceeded by 50% before the limit
+# is enforced. So when you set the threshold to 10, up to 15 fields may appear,
+# but if the number exceeds 15, the total amount of fields shown is limited to
+# 10.
+# Minimum value: 0, maximum value: 100, default value: 10.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+UML_LIMIT_NUM_FIELDS   = 10
+
+# If the TEMPLATE_RELATIONS tag is set to YES then the inheritance and
+# collaboration graphs will show the relations between templates and their
+# instances.
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+TEMPLATE_RELATIONS     = NO
+
+# If the INCLUDE_GRAPH, ENABLE_PREPROCESSING and SEARCH_INCLUDES tags are set to
+# YES then doxygen will generate a graph for each documented file showing the
+# direct and indirect include dependencies of the file with other documented
+# files.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+INCLUDE_GRAPH          = YES
+
+# If the INCLUDED_BY_GRAPH, ENABLE_PREPROCESSING and SEARCH_INCLUDES tags are
+# set to YES then doxygen will generate a graph for each documented file showing
+# the direct and indirect include dependencies of the file with other documented
+# files.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+INCLUDED_BY_GRAPH      = YES
+
+# If the CALL_GRAPH tag is set to YES then doxygen will generate a call
+# dependency graph for every global function or class method.
+#
+# Note that enabling this option will significantly increase the time of a run.
+# So in most cases it will be better to enable call graphs for selected
+# functions only using the \callgraph command. Disabling a call graph can be
+# accomplished by means of the command \hidecallgraph.
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+CALL_GRAPH             = NO
+
+# If the CALLER_GRAPH tag is set to YES then doxygen will generate a caller
+# dependency graph for every global function or class method.
+#
+# Note that enabling this option will significantly increase the time of a run.
+# So in most cases it will be better to enable caller graphs for selected
+# functions only using the \callergraph command. Disabling a caller graph can be
+# accomplished by means of the command \hidecallergraph.
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+CALLER_GRAPH           = NO
+
+# If the GRAPHICAL_HIERARCHY tag is set to YES then doxygen will graphical
+# hierarchy of all classes instead of a textual one.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+GRAPHICAL_HIERARCHY    = YES
+
+# If the DIRECTORY_GRAPH tag is set to YES then doxygen will show the
+# dependencies a directory has on other directories in a graphical way. The
+# dependency relations are determined by the #include relations between the
+# files in the directories.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DIRECTORY_GRAPH        = YES
+
+# The DOT_IMAGE_FORMAT tag can be used to set the image format of the images
+# generated by dot. For an explanation of the image formats see the section
+# output formats in the documentation of the dot tool (Graphviz (see:
+# http://www.graphviz.org/)).
+# Note: If you choose svg you need to set HTML_FILE_EXTENSION to xhtml in order
+# to make the SVG files visible in IE 9+ (other browsers do not have this
+# requirement).
+# Possible values are: png, jpg, gif, svg, png:gd, png:gd:gd, png:cairo,
+# png:cairo:gd, png:cairo:cairo, png:cairo:gdiplus, png:gdiplus and
+# png:gdiplus:gdiplus.
+# The default value is: png.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_IMAGE_FORMAT       = png
+
+# If DOT_IMAGE_FORMAT is set to svg, then this option can be set to YES to
+# enable generation of interactive SVG images that allow zooming and panning.
+#
+# Note that this requires a modern browser other than Internet Explorer. Tested
+# and working are Firefox, Chrome, Safari, and Opera.
+# Note: For IE 9+ you need to set HTML_FILE_EXTENSION to xhtml in order to make
+# the SVG files visible. Older versions of IE do not have SVG support.
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+INTERACTIVE_SVG        = NO
+
+# The DOT_PATH tag can be used to specify the path where the dot tool can be
+# found. If left blank, it is assumed the dot tool can be found in the path.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_PATH               =
+
+# The DOTFILE_DIRS tag can be used to specify one or more directories that
+# contain dot files that are included in the documentation (see the \dotfile
+# command).
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOTFILE_DIRS           =
+
+# The MSCFILE_DIRS tag can be used to specify one or more directories that
+# contain msc files that are included in the documentation (see the \mscfile
+# command).
+
+MSCFILE_DIRS           =
+
+# The DIAFILE_DIRS tag can be used to specify one or more directories that
+# contain dia files that are included in the documentation (see the \diafile
+# command).
+
+DIAFILE_DIRS           =
+
+# When using plantuml, the PLANTUML_JAR_PATH tag should be used to specify the
+# path where java can find the plantuml.jar file. If left blank, it is assumed
+# PlantUML is not used or called during a preprocessing step. Doxygen will
+# generate a warning when it encounters a \startuml command in this case and
+# will not generate output for the diagram.
+
+PLANTUML_JAR_PATH      =
+
+# When using plantuml, the PLANTUML_CFG_FILE tag can be used to specify a
+# configuration file for plantuml.
+
+PLANTUML_CFG_FILE      =
+
+# When using plantuml, the specified paths are searched for files specified by
+# the !include statement in a plantuml block.
+
+PLANTUML_INCLUDE_PATH  =
+
+# The DOT_GRAPH_MAX_NODES tag can be used to set the maximum number of nodes
+# that will be shown in the graph. If the number of nodes in a graph becomes
+# larger than this value, doxygen will truncate the graph, which is visualized
+# by representing a node as a red box. Note that doxygen if the number of direct
+# children of the root node in a graph is already larger than
+# DOT_GRAPH_MAX_NODES then the graph will not be shown at all. Also note that
+# the size of a graph can be further restricted by MAX_DOT_GRAPH_DEPTH.
+# Minimum value: 0, maximum value: 10000, default value: 50.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_GRAPH_MAX_NODES    = 50
+
+# The MAX_DOT_GRAPH_DEPTH tag can be used to set the maximum depth of the graphs
+# generated by dot. A depth value of 3 means that only nodes reachable from the
+# root by following a path via at most 3 edges will be shown. Nodes that lay
+# further from the root node will be omitted. Note that setting this option to 1
+# or 2 may greatly reduce the computation time needed for large code bases. Also
+# note that the size of a graph can be further restricted by
+# DOT_GRAPH_MAX_NODES. Using a depth of 0 means no depth restriction.
+# Minimum value: 0, maximum value: 1000, default value: 0.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+MAX_DOT_GRAPH_DEPTH    = 0
+
+# Set the DOT_TRANSPARENT tag to YES to generate images with a transparent
+# background. This is disabled by default, because dot on Windows does not seem
+# to support this out of the box.
+#
+# Warning: Depending on the platform used, enabling this option may lead to
+# badly anti-aliased labels on the edges of a graph (i.e. they become hard to
+# read).
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_TRANSPARENT        = NO
+
+# Set the DOT_MULTI_TARGETS tag to YES to allow dot to generate multiple output
+# files in one run (i.e. multiple -o and -T options on the command line). This
+# makes dot run faster, but since only newer versions of dot (>1.8.10) support
+# this, this feature is disabled by default.
+# The default value is: NO.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_MULTI_TARGETS      = NO
+
+# If the GENERATE_LEGEND tag is set to YES doxygen will generate a legend page
+# explaining the meaning of the various boxes and arrows in the dot generated
+# graphs.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+GENERATE_LEGEND        = YES
+
+# If the DOT_CLEANUP tag is set to YES, doxygen will remove the intermediate dot
+# files that are used to generate the various graphs.
+# The default value is: YES.
+# This tag requires that the tag HAVE_DOT is set to YES.
+
+DOT_CLEANUP            = YES
diff --git a/xxhash.h b/xxhash.h
index b5e57b1b..7bbf5e11 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -32,7 +32,12 @@
  *   - xxHash homepage: https://www.xxhash.com
  *   - xxHash source repository: https://github.com/Cyan4973/xxHash
  */
-
+/*!
+ * @mainpage xxHash
+ *
+ * @file xxhash.h
+ * xxHash prototypes and implementation
+ */
 /* TODO: update */
 /* Notice extracted from xxHash homepage:
 
@@ -165,6 +170,12 @@ extern "C" {
 #ifndef XXHASH_H_5627135585666179
 #define XXHASH_H_5627135585666179 1
 
+
+/*!
+ * @defgroup public Public API
+ * Contains details on the public xxHash functions.
+ * @{
+ */
 /* specific declaration modes for Windows */
 #if !defined(XXH_INLINE_ALL) && !defined(XXH_PRIVATE_API)
 #  if defined(WIN32) && defined(_MSC_VER) && (defined(XXH_IMPORT) || defined(XXH_EXPORT))
@@ -178,8 +189,9 @@ extern "C" {
 #  endif
 #endif
 
+#ifdef XXH_DOXYGEN
 /*!
- * XXH_NAMESPACE, aka Namespace Emulation:
+ * @brief Emulate a namespace by transparently prefixing all symbols.
  *
  * If you want to include _and expose_ xxHash functions from within your own
  * library, but also want to avoid symbol collisions with other libraries which
@@ -191,6 +203,10 @@ extern "C" {
  * includes `xxhash.h`: Regular symbol names will be automatically translated
  * by this header.
  */
+#  define XXH_NAMESPACE /* YOUR NAME HERE */
+#  undef XXH_NAMESPACE
+#endif
+
 #ifdef XXH_NAMESPACE
 #  define XXH_CAT(A,B) A##B
 #  define XXH_NAME2(A,B) XXH_CAT(A,B)
@@ -252,6 +268,14 @@ extern "C" {
 #define XXH_VERSION_MINOR    8
 #define XXH_VERSION_RELEASE  0
 #define XXH_VERSION_NUMBER  (XXH_VERSION_MAJOR *100*100 + XXH_VERSION_MINOR *100 + XXH_VERSION_RELEASE)
+
+/*!
+ * @brief Obtains the xxHash version that was compiles.
+ *
+ * This is only uaeful when xxHash is compiled as a shared library.
+ *
+ * @return `XXH_VERSION_NUMBER` as of when it was compiled.
+ */
 XXH_PUBLIC_API unsigned XXH_versionNumber (void);
 
 
@@ -265,7 +289,14 @@ typedef enum { XXH_OK=0, XXH_ERROR } XXH_errorcode;
 /*-**********************************************************************
 *  32-bit hash
 ************************************************************************/
-#if !defined (__VMS) \
+#if defined(XXH_DOXYGEN) /* Don't show <stdint.h> include */
+/*!
+ * @brief An unsigned 32-bit integer.
+ *
+ * Not necessarily defined to `uint32_t` but functionally equivalent.
+ */
+typedef uint32_t XXH32_hash_t;
+#elif !defined (__VMS) \
   && (defined (__cplusplus) \
   || (defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 199901L) /* C99 */) )
 #   include <stdint.h>
@@ -284,21 +315,48 @@ typedef enum { XXH_OK=0, XXH_ERROR } XXH_errorcode;
 #endif
 
 /*!
- * XXH32():
- *  Calculate the 32-bit hash of sequence "length" bytes stored at memory address "input".
- *  The memory between input & input+length must be valid (allocated and read-accessible).
- *  "seed" can be used to alter the result predictably.
- *  Speed on Core 2 Duo @ 3 GHz (single thread, SMHasher benchmark): 5.4 GB/s
- *
- * Note: XXH3 provides competitive speed for both 32-bit and 64-bit systems,
- * and offers true 64/128 bit hash results. It provides a superior level of
- * dispersion, and greatly reduces the risks of collisions.
+ * @}
+ *
+ * @defgroup xxh32_family XXH32 family
+ * @ingroup public
+ * Contains functions used in the classic 32-bit xxHash algorithm.
+ *
+ * @note
+ *   XXH32 is considered rather weak by today's standards.
+ *   The @ref xxh3_family provides competitive speed for both 32-bit and 64-bit
+ *   systems, and offers true 64/128 bit hash results. It provides a superior
+ *   level of dispersion, and greatly reduces the risks of collisions.
+ *
+ * @see @ref xxh64_family, @ref xxh3_family: Other xxHash families
+ * @see @ref xxh32_impl for implementation details
+ * @{
  */
-XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t length, XXH32_hash_t seed);
 
-/*******   Streaming   *******/
+/*!
+ * @brief Calculates the 32-bit hash of @p input using xxHash32.
+ *
+ * Speed on Core 2 Duo @ 3 GHz (single thread, SMHasher benchmark): 5.4 GB/s
+ *
+ * @param input The block of data to be hashed, at least @p length bytes in size
+ * @param length The length of @p input in bytes
+ * @param seed The 32-bit seed to alter the hash's output predictably.
+ *
+ * @pre
+ *   The memory between @p input and @p input + @p length must be valid,
+ *   readable, contiguous memory. However, if @p length is `0`, @p input may be
+ *   `NULL`. In C++, this also must be *TriviallyCopyable*.
+ *
+ * @return The calculated 32-bit hash
+ *
+ * @see
+ *    XXH64(), XXH3_64bits_withSeed(), XXH3_128bits_withSeed(), XXH128():
+ *    Direct equivalents for the other variants of xxHash
+ * @see
+ *    XXH32_createState(), XXH32_update(), XXH32_digest(): Streaming version.
+ */
+XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t length, XXH32_hash_t seed);
 
-/*
+/*!
  * Streaming functions generate the xxHash value from an incrememtal input.
  * This method is slower than single-call functions, due to state management.
  * For small inputs, prefer `XXH32()` and `XXH64()`, which are better optimized.
@@ -319,15 +377,119 @@ XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t length, XXH32_hash_
  * digest, and generate new hash values later on by invoking `XXH*_digest()`.
  *
  * When done, release the state using `XXH*_freeState()`.
+ *
+ * Example code for incrementally hashing a file:
+ * @code{.c}
+ *    #include <stdio.h>
+ *    #include <xxhash.h>
+ *    #define BUFFER_SIZE 256
+ *
+ *    // Note: XXH64 and XXH3 use the same interface.
+ *    XXH32_hash_t
+ *    hashFile(FILE* stream)
+ *    {
+ *        XXH32_state_t* state;
+ *        unsigned char buf[BUFFER_SIZE];
+ *        size_t amt;
+ *        XXH32_hash_t hash;
+ *
+ *        state = XXH32_createState();       // Create a state
+ *        assert(state != NULL);             // Error check here
+ *        XXH32_reset(state, 0xbaad5eed);    // Reset state with our seed
+ *        while ((amt = fread(buf, 1, sizeof(buf), stream)) != 0) {
+ *            XXH32_update(state, buf, amt); // Hash the file in chunks
+ *        }
+ *        hash = XXH32_digest(state);        // Finalize the hash
+ *        XXH32_freeState(state);            // Clean up
+ *        return hash;
+ *    }
+ * @endcode
+ *
+ */
+
+
+/*!
+ * @typedef struct XXH32_state_s XXH32_state_t
+ * @brief The opaque state struct for the XXH32 streaming API.
+ *
+ * @see XXH32_state_s for details.
  */
+typedef struct XXH32_state_s XXH32_state_t;
 
-typedef struct XXH32_state_s XXH32_state_t;   /* incomplete type */
+/*!
+ * @brief Allocates an @ref XXH32_state_t.
+ *
+ * Must be freed with XXH32_freeState().
+ * @return An allocated XXH32_state_t on success, `NULL` on failure.
+ */
 XXH_PUBLIC_API XXH32_state_t* XXH32_createState(void);
+/*!
+ * @brief Frees an @ref XXH32_state_t.
+ *
+ * Must be allocated with XXH32_createState().
+ * @param statePtr A pointer to an @ref XXH32_state_t allocated with @ref XXH32_createState().
+ * @return XXH_OK.
+ */
 XXH_PUBLIC_API XXH_errorcode  XXH32_freeState(XXH32_state_t* statePtr);
+/*!
+ * @brief Copies one @ref XXH32_state_t to another.
+ *
+ * @param dst_state The state to copy to
+ * @param src_state The state to copy from
+ * @pre
+ *   @p dst_state and @p src_state must not be `NULL` and must not overlap.
+ */
 XXH_PUBLIC_API void XXH32_copyState(XXH32_state_t* dst_state, const XXH32_state_t* src_state);
 
+/*!
+ * @brief Resets an @ref XXH32_state_t to begin a new hash.
+ *
+ * This function resets and seeds a state. Call it before @ref XXH32_update().
+ *
+ * @param statePtr The state struct to reset.
+ * @param seed The 32-bit seed to alter the hash result predictably.
+ *
+ * @pre
+ *   @p statePtr must not be `NULL`.
+ *
+ * @return @ref XXH_OK on success, @ref XXH_ERROR on failure
+ */
 XXH_PUBLIC_API XXH_errorcode XXH32_reset  (XXH32_state_t* statePtr, XXH32_hash_t seed);
+
+/*!
+ * @brief Consumes a block of @p input to an @ref XXH32_state_t.
+ *
+ * Call this to incrementally consume blocks of data.
+ *
+ * @param statePtr The state struct to update.
+ * @param input The block of data to be hashed, at least @p length bytes in size
+ * @param length The length of @p input in bytes
+ *
+ * @pre
+ *   @p statePtr must not be `NULL`.
+ * @pre
+ *   The memory between @p input and @p input + @p length must be valid,
+ *   readable, contiguous memory. However, if @p length is `0`, @p input may be
+ *   `NULL`. In C++, this also must be *TriviallyCopyable*.
+ *
+ * @return @ref XXH_OK on success, @ref XXH_ERROR on failure.
+ */
 XXH_PUBLIC_API XXH_errorcode XXH32_update (XXH32_state_t* statePtr, const void* input, size_t length);
+
+/*!
+ * @brief Returns the calculated hash value from an @ref XXH32_state_t.
+ *
+ * @note
+ *   Calling XXH32_digest() will not affect @p statePtr, so you can update,
+ *   digest, and update again.
+ *
+ * @param statePtr The state struct to calculate the hash from.
+ *
+ * @pre
+ *  @p statePtr must not be `NULL`.
+ *
+ * @return The calculated xxHash32 value from that state.
+ */
 XXH_PUBLIC_API XXH32_hash_t  XXH32_digest (const XXH32_state_t* statePtr);
 
 /*******   Canonical representation   *******/
@@ -351,41 +513,116 @@ XXH_PUBLIC_API XXH32_hash_t  XXH32_digest (const XXH32_state_t* statePtr);
  * canonical format.
  */
 
-typedef struct { unsigned char digest[4]; } XXH32_canonical_t;
+/*!
+ * @brief Canonical (big endian) representation of @ref XXH32_hash_t.
+ */
+typedef struct {
+    unsigned char digest[4]; /*!< Hash bytes, big endian */
+} XXH32_canonical_t;
+
+/*!
+ * @brief Converts an @ref XXH32_hash_t to a big endian @ref XXH32_canonical_t.
+ *
+ * @param dst The @ref XXH32_canonical_t pointer to be stored to
+ * @param hash The @ref XXH32_hash_t to be converted.
+ *
+ * @pre
+ *   @p dst must not be `NULL`.
+ */
 XXH_PUBLIC_API void XXH32_canonicalFromHash(XXH32_canonical_t* dst, XXH32_hash_t hash);
+
+/*!
+ * @brief Converts an @ref XXH32_canonical_t to a native @ref XXH32_hash_t.
+ *
+ * @param src The @ref XXH32_canonical_t to convert.
+ *
+ * @pre
+ *   @p src must not be `NULL`.
+ *
+ * @return The converted hash.
+ */
 XXH_PUBLIC_API XXH32_hash_t XXH32_hashFromCanonical(const XXH32_canonical_t* src);
 
 
+/*!
+ * @}
+ * @ingroup public
+ * @{
+ */
+
 #ifndef XXH_NO_LONG_LONG
 /*-**********************************************************************
 *  64-bit hash
 ************************************************************************/
-#if !defined (__VMS) \
+#if defined(XXH_DOXYGEN) /* don't include <stdint.h> */
+/*!
+ * @brief An unsigned 64-bit integer.
+ *
+ * Not necessarily defined to `uint64_t` but functionally equivalent.
+ */
+typedef uint64_t XXH64_hash_t;
+#elif !defined (__VMS) \
   && (defined (__cplusplus) \
   || (defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 199901L) /* C99 */) )
-#   include <stdint.h>
-    typedef uint64_t XXH64_hash_t;
+#  include <stdint.h>
+   typedef uint64_t XXH64_hash_t;
 #else
-    /* the following type must have a width of 64-bit */
-    typedef unsigned long long XXH64_hash_t;
+#  include <limits.h>
+#  if defined(__LP64__) && ULONG_MAX == 0xFFFFFFFFFFFFFFFFULL
+     /* LP64 ABI says uint64_t is unsigned long */
+     typedef unsigned long XXH64_hash_t;
+#  else
+     /* the following type must have a width of 64-bit */
+     typedef unsigned long long XXH64_hash_t;
+#  endif
 #endif
 
 /*!
- * XXH64():
- * Returns the 64-bit hash of sequence of length @length stored at memory
- * address @input.
- * @seed can be used to alter the result predictably.
+ * @}
+ *
+ * @defgroup xxh64_family XXH64 family
+ * @ingroup public
+ * @{
+ * Contains functions used in the classic 64-bit xxHash algorithm.
+ *
+ * @note
+ *   XXH3 provides competitive speed for both 32-bit and 64-bit systems,
+ *   and offers true 64/128 bit hash results. It provides a superior level of
+ *   dispersion, and greatly reduces the risks of collisions.
+ */
+
+
+/*!
+ * @brief Calculates the 64-bit hash of @p input using xxHash64.
  *
  * This function usually runs faster on 64-bit systems, but slower on 32-bit
  * systems (see benchmark).
  *
- * Note: XXH3 provides competitive speed for both 32-bit and 64-bit systems,
- * and offers true 64/128 bit hash results. It provides a superior level of
- * dispersion, and greatly reduces the risks of collisions.
+ * @param input The block of data to be hashed, at least @p length bytes in size.
+ * @param length The length of @p input, in bytes.
+ * @param seed The 64-bit seed to alter the hash's output predictably.
+ *
+ * @pre
+ *   The memory between @p input and @p input + @p length must be valid,
+ *   readable, contiguous memory. However, if @p length is `0`, @p input may be
+ *   `NULL`. In C++, this also must be *TriviallyCopyable*.
+ *
+ * @return The calculated 64-bit hash.
+ *
+ * @see
+ *    XXH32(), XXH3_64bits_withSeed(), XXH3_128bits_withSeed(), XXH128():
+ *    Direct equivalents for the other variants of xxHash
+ * @see
+ *    XXH64_createState(), XXH64_update(), XXH64_digest(): Streaming version.
  */
-XXH_PUBLIC_API XXH64_hash_t XXH64 (const void* input, size_t length, XXH64_hash_t seed);
+XXH_PUBLIC_API XXH64_hash_t XXH64(const void* input, size_t length, XXH64_hash_t seed);
 
 /*******   Streaming   *******/
+/*!
+ * @brief The opaque state struct for the XXH64 streaming API.
+ *
+ * @see XXH64_state_s for details.
+ */
 typedef struct XXH64_state_s XXH64_state_t;   /* incomplete type */
 XXH_PUBLIC_API XXH64_state_t* XXH64_createState(void);
 XXH_PUBLIC_API XXH_errorcode  XXH64_freeState(XXH64_state_t* statePtr);
@@ -400,12 +637,13 @@ typedef struct { unsigned char digest[sizeof(XXH64_hash_t)]; } XXH64_canonical_t
 XXH_PUBLIC_API void XXH64_canonicalFromHash(XXH64_canonical_t* dst, XXH64_hash_t hash);
 XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src);
 
-
-/*-**********************************************************************
-*  XXH3 64-bit variant
-************************************************************************/
-
-/* ************************************************************************
+/*!
+ * @}
+ * ************************************************************************
+ * @defgroup xxh3_family XXH3 family
+ * @ingroup public
+ * @{
+ *
  * XXH3 is a more recent hash algorithm featuring:
  *  - Improved speed for both small and large inputs
  *  - True 64-bit and 128-bit outputs
@@ -444,6 +682,10 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
  * The API supports one-shot hashing, streaming mode, and custom secrets.
  */
 
+/*-**********************************************************************
+*  XXH3 64-bit variant
+************************************************************************/
+
 /* XXH3_64bits():
  * default 64-bit variant, using default secret and default seed of 0.
  * It's the fastest variant. */
@@ -458,6 +700,15 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits(const void* data, size_t len);
  */
 XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSeed(const void* data, size_t len, XXH64_hash_t seed);
 
+/*!
+ * The bare minimum size for a custom secret.
+ *
+ * @see
+ *  XXH3_64bits_withSecret(), XXH3_64bits_reset_withSecret(),
+ *  XXH3_128bits_withSecret(), XXH3_128bits_reset_withSecret().
+ */
+#define XXH3_SECRET_SIZE_MIN 136
+
 /*
  * XXH3_64bits_withSecret():
  * It's possible to provide any blob of bytes as a "secret" to generate the hash.
@@ -471,7 +722,6 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSeed(const void* data, size_t len, X
  * and employ "XXH3_generateSecret()" (see below)
  * to generate a high entropy secret derived from the custom seed.
  */
-#define XXH3_SECRET_SIZE_MIN 136
 XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSecret(const void* data, size_t len, const void* secret, size_t secretSize);
 
 
@@ -482,6 +732,12 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSecret(const void* data, size_t len,
  * As a consequence, streaming is slower than one-shot hashing.
  * For better performance, prefer one-shot functions whenever applicable.
  */
+
+/*!
+ * @brief The state struct for the XXH3 streaming API.
+ *
+ * @see XXH3_state_s for details.
+ */
 typedef struct XXH3_state_s XXH3_state_t;
 XXH_PUBLIC_API XXH3_state_t* XXH3_createState(void);
 XXH_PUBLIC_API XXH_errorcode XXH3_freeState(XXH3_state_t* statePtr);
@@ -521,9 +777,15 @@ XXH_PUBLIC_API XXH64_hash_t  XXH3_64bits_digest (const XXH3_state_t* statePtr);
 *  XXH3 128-bit variant
 ************************************************************************/
 
+/*!
+ * @brief The return value from 128-bit hashes.
+ *
+ * Stored in little endian order, although the fields themselves are in native
+ * endianness.
+ */
 typedef struct {
- XXH64_hash_t low64;
- XXH64_hash_t high64;
+    XXH64_hash_t low64;   /*!< `value & 0xFFFFFFFFFFFFFFFF` */
+    XXH64_hash_t high64;  /*!< `value >> 64` */
 } XXH128_hash_t;
 
 XXH_PUBLIC_API XXH128_hash_t XXH3_128bits(const void* data, size_t len);
@@ -580,6 +842,9 @@ XXH_PUBLIC_API XXH128_hash_t XXH128_hashFromCanonical(const XXH128_canonical_t*
 
 #endif  /* XXH_NO_LONG_LONG */
 
+/*!
+ * @}
+ */
 #endif /* XXHASH_H_5627135585666179 */
 
 
@@ -600,31 +865,55 @@ XXH_PUBLIC_API XXH128_hash_t XXH128_hashFromCanonical(const XXH128_canonical_t*
  * Never **ever** access their members directly.
  */
 
+/*!
+ * @internal
+ * @brief Structure for XXH32 streaming API.
+ *
+ * @note This is only defined when @ref XXH_STATIC_LINKING_ONLY,
+ * @ref XXH_INLINE_ALL, or @ref XXH_IMPLEMENTATION is defined. Otherwise it is
+ * an opaque type. This allows fields to safely be changed.
+ *
+ * Typedef'd to @ref XXH32_state_t.
+ * Do not access the members of this struct directly.
+ * @see XXH64_state_s, XXH3_state_s
+ */
 struct XXH32_state_s {
-   XXH32_hash_t total_len_32;
-   XXH32_hash_t large_len;
-   XXH32_hash_t v1;
-   XXH32_hash_t v2;
-   XXH32_hash_t v3;
-   XXH32_hash_t v4;
-   XXH32_hash_t mem32[4];
-   XXH32_hash_t memsize;
-   XXH32_hash_t reserved;   /* never read nor write, might be removed in a future version */
+   XXH32_hash_t total_len_32; /*!< Total length hashed, modulo 2^32 */
+   XXH32_hash_t large_len;    /*!< Whether the hash is >= 16 (handles @ref total_len_32 overflow) */
+   XXH32_hash_t v1;           /*!< First accumulator lane */
+   XXH32_hash_t v2;           /*!< Second accumulator lane */
+   XXH32_hash_t v3;           /*!< Third accumulator lane */
+   XXH32_hash_t v4;           /*!< Fourth accumulator lane */
+   XXH32_hash_t mem32[4];     /*!< Internal buffer for partial reads. Treated as unsigned char[16]. */
+   XXH32_hash_t memsize;      /*!< Amount of data in @ref mem32 */
+   XXH32_hash_t reserved;     /*!< Reserved field. Do not read or write to it, it may be removed. */
 };   /* typedef'd to XXH32_state_t */
 
 
 #ifndef XXH_NO_LONG_LONG  /* defined when there is no 64-bit support */
 
+/*!
+ * @internal
+ * @brief Structure for XXH64 streaming API.
+ *
+ * @note This is only defined when @ref XXH_STATIC_LINKING_ONLY,
+ * @ref XXH_INLINE_ALL, or @ref XXH_IMPLEMENTATION is defined. Otherwise it is
+ * an opaque type. This allows fields to safely be changed.
+ *
+ * Typedef'd to @ref XXH64_state_t.
+ * Do not access the members of this struct directly.
+ * @see XXH32_state_s, XXH3_state_s
+ */
 struct XXH64_state_s {
-   XXH64_hash_t total_len;
-   XXH64_hash_t v1;
-   XXH64_hash_t v2;
-   XXH64_hash_t v3;
-   XXH64_hash_t v4;
-   XXH64_hash_t mem64[4];
-   XXH32_hash_t memsize;
-   XXH32_hash_t reserved32;  /* required for padding anyway */
-   XXH64_hash_t reserved64;  /* never read nor write, might be removed in a future version */
+   XXH64_hash_t total_len;    /*!< Total length hashed. This is always 64-bit. */
+   XXH64_hash_t v1;           /*!< First accumulator lane */
+   XXH64_hash_t v2;           /*!< Second accumulator lane */
+   XXH64_hash_t v3;           /*!< Third accumulator lane */
+   XXH64_hash_t v4;           /*!< Fourth accumulator lane */
+   XXH64_hash_t mem64[4];     /*!< Internal buffer for partial reads. Treated as unsigned char[32]. */
+   XXH32_hash_t memsize;      /*!< Amount of data in @ref mem64 */
+   XXH32_hash_t reserved32;   /*!< Reserved field, needed for padding anyways*/
+   XXH64_hash_t reserved64;   /*!< Reserved field. Do not read or write to it, it may be removed. */
 };   /* typedef'd to XXH64_state_t */
 
 #if defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 201112L)   /* C11+ */
@@ -646,29 +935,79 @@ struct XXH64_state_s {
 #   define XXH_ALIGN_MEMBER(align, type) XXH_ALIGN(align) type
 #endif
 
+/*!
+ * @brief The size of the internal XXH3 buffer.
+ *
+ * This is the optimal update size for incremental hashing.
+ *
+ * @see XXH3_64b_update(), XXH3_128b_update().
+ */
 #define XXH3_INTERNALBUFFER_SIZE 256
+
+/*!
+ * @brief Default size of the secret buffer (and @ref XXH3_kSecret).
+ *
+ * This is the size used in @ref XXH3_kSecret and the seeded functions.
+ *
+ * Not to be confused with @ref XXH3_SECRET_SIZE_MIN.
+ */
 #define XXH3_SECRET_DEFAULT_SIZE 192
+
+/*!
+ * @internal
+ * @brief Structure for XXH3 streaming API.
+ *
+ * @note This is only defined when @ref XXH_STATIC_LINKING_ONLY,
+ * @ref XXH_INLINE_ALL, or @ref XXH_IMPLEMENTATION is defined. Otherwise it is
+ * an opaque type. This allows fields to safely be changed.
+ *
+ * @note **This structure has a strict alignment requirement of 64 bytes.** Do
+ * not allocate this with `malloc()` or `new`, it will not be sufficiently
+ * aligned. Use @ref XXH3_createState(), @ref XXH3_freeState, or stack
+ * allocation.
+ *
+ * Typedef'd to @ref XXH3_state_t.
+ * Do not access the members of this struct directly.
+ *
+ * @see XXH3_INITSTATE() for stack initialization.
+ * @see XXH3_createState(), XXH3_freeState().
+ * @see XXH32_state_s, XXH64_state_s
+ */
 struct XXH3_state_s {
    XXH_ALIGN_MEMBER(64, XXH64_hash_t acc[8]);
-   /* used to store a custom secret generated from a seed */
+       /*!< The 8 accumulators. Similar to `vN` in @ref XXH32_state_s::v1 and @ref XXH64_state_s */
    XXH_ALIGN_MEMBER(64, unsigned char customSecret[XXH3_SECRET_DEFAULT_SIZE]);
+       /*!< Used to store a custom secret generated from a seed. */
    XXH_ALIGN_MEMBER(64, unsigned char buffer[XXH3_INTERNALBUFFER_SIZE]);
+       /*!< The internal buffer. @see XXH32_state_s::mem32 */
    XXH32_hash_t bufferedSize;
+       /*!< The amount of memory in @ref buffer, @see XXH32_state_s::memsize */
    XXH32_hash_t reserved32;
+       /*!< Reserved field. Needed for padding on 64-bit. */
    size_t nbStripesSoFar;
+       /*!< Number or stripes processed. */
    XXH64_hash_t totalLen;
+       /*!< Total length hashed. 64-bit even on 32-bit targets. */
    size_t nbStripesPerBlock;
+       /*!< Number of stripes per block. */
    size_t secretLimit;
+       /*!< Size of @ref customSecret or @ref extSecret */
    XXH64_hash_t seed;
+       /*!< Seed for _withSeed variants. Must be zero otherwise, @see XXH3_INITSTATE() */
    XXH64_hash_t reserved64;
-   const unsigned char* extSecret;  /* reference to external secret;
-                                     * if == NULL, use .customSecret instead */
+       /*!< Reserved field. */
+   const unsigned char* extSecret;
+       /*!< Reference to an external secret for the _withSecret variants, NULL
+        *   for other variants. */
    /* note: there may be some padding at the end due to alignment on 64 bytes */
 }; /* typedef'd to XXH3_state_t */
 
 #undef XXH_ALIGN_MEMBER
 
-/* When the XXH3_state_t structure is merely emplaced on stack,
+/*!
+ * @brief Initializes a stack-allocated `XXH3_state_s`.
+ *
+ * When the @ref XXH3_state_t structure is merely emplaced on stack,
  * it should be initialized with XXH3_INITSTATE() or a memset()
  * in case its first reset uses XXH3_NNbits_reset_withSeed().
  * This init can be omitted if the first reset uses default or _withSecret mode.
@@ -719,8 +1058,6 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 
 
 #endif  /* XXH_NO_LONG_LONG */
-
-
 #if defined(XXH_INLINE_ALL) || defined(XXH_PRIVATE_API)
 #  define XXH_IMPLEMENTATION
 #endif
@@ -762,8 +1099,24 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 /* *************************************
 *  Tuning parameters
 ***************************************/
+
+/*!
+ * @defgroup tuning Tuning parameters
+ * @{
+ *
+ * Various macros to control xxHash's behavior.
+ */
+#ifdef XXH_DOXYGEN
+/*!
+ * @brief Define this to disable 64-bit code.
+ *
+ * Useful if only using the @ref xxh32_family and you have a strict C90 compiler.
+ */
+#  define XXH_NO_LONG_LONG
+#  undef XXH_NO_LONG_LONG /* don't actually */
 /*!
- * XXH_FORCE_MEMORY_ACCESS:
+ * @brief Controls how unaligned memory is accessed.
+ *
  * By default, access to unaligned memory is controlled by `memcpy()`, which is
  * safe and portable.
  *
@@ -772,58 +1125,76 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  *
  * The below switch allow selection of a different access method
  * in the search for improved performance.
- * Method 0 (default):
- *     Use `memcpy()`. Safe and portable. Default.
- * Method 1:
- *     `__attribute__((packed))` statement. It depends on compiler extensions
- *     and is therefore not portable.
+ *
+ * @par Possible options:
+ *
+ *  - `XXH_FORCE_MEMORY_ACCESS=0` (default): `memcpy`
+ *   @par
+ *     Use `memcpy()`. Safe and portable. Note that most modern compilers will
+ *     eliminate the function call and treat it as an unaligned access.
+ *
+ *  - `XXH_FORCE_MEMORY_ACCESS=1`: `__attribute__((packed))`
+ *   @par
+ *     Depends on compiler extensions and is therefore not portable.
  *     This method is safe if your compiler supports it, and *generally* as
  *     fast or faster than `memcpy`.
- * Method 2:
- *     Direct access via cast. This method doesn't depend on the compiler but
- *     violates the C standard.
- *     It can generate buggy code on targets which do not support unaligned
- *     memory accesses.
- *     But in some circumstances, it's the only known way to get the most
- *     performance (example: GCC + ARMv6)
- * Method 3:
- *     Byteshift. This can generate the best code on old compilers which don't
+ *
+ *  - `XXH_FORCE_MEMORY_ACCESS=2`: Direct cast
+ *  @par
+ *     Casts directly and dereferences. This method doesn't depend on the
+ *     compiler, but it violates the C standard as it directly dereferences an
+ *     unaligned pointer. It can generate buggy code on targets which do not
+ *     support unaligned memory accesses, but in some circumstances, it's the
+ *     only known way to get the most performance (example: GCC + ARMv6).
+ *
+ *  - `XXH_FORCE_MEMORY_ACCESS=3`: Byteshift
+ *  @par
+ *     Also portable. This can generate the best code on old compilers which don't
  *     inline small `memcpy()` calls, and it might also be faster on big-endian
- *     systems which lack a native byteswap instruction.
+ *     systems which lack a native byteswap instruction. However, some compilers
+ *     will emit literal byteshifts even if the target supports unaligned access.
+ *
+ *  .
+ *
+ * @warning
+ *   Methods 1 and 2 rely on implementation-defined behavior. Use these with
+ *   care, as what works on one compiler/platform/optimization level may cause
+ *   another to read garbage data or even crash.
+ *
  * See https://stackoverflow.com/a/32095106/646947 for details.
- * Prefer these methods in priority order (0 > 1 > 2 > 3)
+ *
+ * Prefer these methods in priority order (0 > 3 > 1 > 2)
  */
-#ifndef XXH_FORCE_MEMORY_ACCESS   /* can be defined externally, on command line for example */
-#  if !defined(__clang__) && defined(__GNUC__) && defined(__ARM_FEATURE_UNALIGNED) && defined(__ARM_ARCH) && (__ARM_ARCH == 6)
-#    define XXH_FORCE_MEMORY_ACCESS 2
-#  elif !defined(__clang__) && ((defined(__INTEL_COMPILER) && !defined(_WIN32)) || \
-  (defined(__GNUC__) && (defined(__ARM_ARCH) && __ARM_ARCH >= 7)))
-#    define XXH_FORCE_MEMORY_ACCESS 1
-#  endif
-#endif
-
+#  define XXH_FORCE_MEMORY_ACCESS 0
 /*!
- * XXH_ACCEPT_NULL_INPUT_POINTER:
- * If the input pointer is NULL, xxHash's default behavior is to dereference it,
- * triggering a segfault.
+ * @def XXH_ACCEPT_NULL_INPUT_POINTER
+ * @brief Whether to add explicit `NULL` checks.
+ *
+ * If the input pointer is `NULL` and the length is non-zero, xxHash's default
+ * behavior is to dereference it, triggering a segfault.
+ *
  * When this macro is enabled, xxHash actively checks the input for a null pointer.
  * If it is, the result for null input pointers is the same as a zero-length input.
  */
-#ifndef XXH_ACCEPT_NULL_INPUT_POINTER   /* can be defined externally */
 #  define XXH_ACCEPT_NULL_INPUT_POINTER 0
-#endif
-
 /*!
- * XXH_FORCE_ALIGN_CHECK:
- * This is an important performance trick
- * for architectures without decent unaligned memory access performance.
- * It checks for input alignment, and when conditions are met,
- * uses a "fast path" employing direct 32-bit/64-bit read,
- * resulting in _dramatically faster_ read speed.
+ * @def XXH_FORCE_ALIGN_CHECK
+ * @brief If defined to non-zero, adds a special path for aligned inputs (XXH32()
+ * and XXH64() only).
  *
- * The check costs one initial branch per hash, which is generally negligible, but not zero.
- * Moreover, it's not useful to generate binary for an additional code path
- * if memory access uses same instruction for both aligned and unaligned adresses.
+ * This is an important performance trick for architectures without decent
+ * unaligned memory access performance.
+ *
+ * It checks for input alignment, and when conditions are met, uses a "fast
+ * path" employing direct 32-bit/64-bit reads, resulting in _dramatically
+ * faster_ read speed.
+ *
+ * The check costs one initial branch per hash, which is generally negligible,
+ * but not zero.
+ *
+ * Moreover, it's not useful to generate an additional code path if memory
+ * access uses the same instruction for both aligned and unaligned
+ * adresses (e.g. x86 and aarch64).
  *
  * In these cases, the alignment check can be removed by setting this macro to 0.
  * Then the code will always use unaligned memory access.
@@ -832,17 +1203,11 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  *
  * This option does not affect XXH3 (only XXH32 and XXH64).
  */
-#ifndef XXH_FORCE_ALIGN_CHECK  /* can be defined externally */
-#  if defined(__i386)  || defined(__x86_64__) || defined(__aarch64__) \
-   || defined(_M_IX86) || defined(_M_X64)     || defined(_M_ARM64) /* visual */
-#    define XXH_FORCE_ALIGN_CHECK 0
-#  else
-#    define XXH_FORCE_ALIGN_CHECK 1
-#  endif
-#endif
+#  define XXH_FORCE_ALIGN_CHECK 0
 
 /*!
- * XXH_NO_INLINE_HINTS:
+ * @def XXH_NO_INLINE_HINTS
+ * @brief When non-zero, sets all functions to `static`.
  *
  * By default, xxHash tries to force the compiler to inline almost all internal
  * functions.
@@ -860,6 +1225,57 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  * When not optimizing (-O0), optimizing for size (-Os, -Oz), or using
  * -fno-inline with GCC or Clang, this will automatically be defined.
  */
+#  define XXH_NO_INLINE_HINTS 0
+
+/*!
+ * @def XXH_REROLL
+ * @brief Whether to reroll `XXH32_finalize` and `XXH64_finalize`.
+ *
+ * For performance, `XXH32_finalize` and `XXH64_finalize` use an unrolled loop
+ * in the form of a switch statement.
+ *
+ * This is not always desirable, as it generates larger code, and depending on
+ * the architecture, may even be slower
+ *
+ * This is automatically defined with `-Os`/`-Oz` on GCC and Clang.
+ */
+#  define XXH_REROLL 0
+
+/*!
+ * @brief Redefines old internal names.
+ *
+ * For compatibility with code that uses xxHash's internals before the names
+ * were changed to improve namespacing. There is no other reason to use this.
+ */
+#  define XXH_OLD_NAMES
+#  undef XXH_OLD_NAMES /* don't actually use, it is ugly. */
+#endif /* XXH_DOXYGEN */
+/*!
+ * @}
+ */
+
+#ifndef XXH_FORCE_MEMORY_ACCESS   /* can be defined externally, on command line for example */
+#  if !defined(__clang__) && defined(__GNUC__) && defined(__ARM_FEATURE_UNALIGNED) && defined(__ARM_ARCH) && (__ARM_ARCH == 6)
+#    define XXH_FORCE_MEMORY_ACCESS 2
+#  elif !defined(__clang__) && ((defined(__INTEL_COMPILER) && !defined(_WIN32)) || \
+  (defined(__GNUC__) && (defined(__ARM_ARCH) && __ARM_ARCH >= 7)))
+#    define XXH_FORCE_MEMORY_ACCESS 1
+#  endif
+#endif
+
+#ifndef XXH_ACCEPT_NULL_INPUT_POINTER   /* can be defined externally */
+#  define XXH_ACCEPT_NULL_INPUT_POINTER 0
+#endif
+
+#ifndef XXH_FORCE_ALIGN_CHECK  /* can be defined externally */
+#  if defined(__i386)  || defined(__x86_64__) || defined(__aarch64__) \
+   || defined(_M_IX86) || defined(_M_X64)     || defined(_M_ARM64) /* visual */
+#    define XXH_FORCE_ALIGN_CHECK 0
+#  else
+#    define XXH_FORCE_ALIGN_CHECK 1
+#  endif
+#endif
+
 #ifndef XXH_NO_INLINE_HINTS
 #  if defined(__OPTIMIZE_SIZE__) /* -Os, -Oz */ \
    || defined(__NO_INLINE__)     /* -O0, -fno-inline */
@@ -869,13 +1285,6 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 #  endif
 #endif
 
-/*!
- * XXH_REROLL:
- * Whether to reroll XXH32_finalize, and XXH64_finalize,
- * instead of using an unrolled jump table/if statement loop.
- *
- * This is automatically defined on -Os/-Oz on GCC and Clang.
- */
 #ifndef XXH_REROLL
 #  if defined(__OPTIMIZE_SIZE__)
 #    define XXH_REROLL 1
@@ -884,6 +1293,11 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 #  endif
 #endif
 
+/*!
+ * @defgroup impl Implementation
+ * @{
+ */
+
 
 /* *************************************
 *  Includes & Memory related functions
@@ -942,7 +1356,11 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 /* *************************************
 *  Debug
 ***************************************/
-/*
+/*!
+ * @ingroup tuning
+ * @def XXH_DEBUGLEVEL
+ * @brief Sets the debugging level.
+ *
  * XXH_DEBUGLEVEL is expected to be defined externally, typically via the
  * compiler's command line options. The value must be a number.
  */
@@ -1170,12 +1588,19 @@ XXH_readLE32_align(const void* ptr, XXH_alignment align)
 /* *************************************
 *  Misc
 ***************************************/
+/*! @ingroup public */
 XXH_PUBLIC_API unsigned XXH_versionNumber (void) { return XXH_VERSION_NUMBER; }
 
 
 /* *******************************************************************
 *  32-bit hash functions
 *********************************************************************/
+/*!
+ * @}
+ * @defgroup xxh32_impl XXH32 implementation
+ * @ingroup impl
+ * @{
+ */
 static const xxh_u32 XXH_PRIME32_1 = 0x9E3779B1U;   /* 0b10011110001101110111100110110001 */
 static const xxh_u32 XXH_PRIME32_2 = 0x85EBCA77U;   /* 0b10000101111010111100101001110111 */
 static const xxh_u32 XXH_PRIME32_3 = 0xC2B2AE3DU;   /* 0b11000010101100101010111000111101 */
@@ -1376,7 +1801,9 @@ XXH32_endian_align(const xxh_u8* input, size_t len, xxh_u32 seed, XXH_alignment
     return XXH32_finalize(h32, input, len&15, align);
 }
 
-
+/*!
+ * @ingroup xxh32_family
+ */
 XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t len, XXH32_hash_t seed)
 {
 #if 0
@@ -1385,9 +1812,7 @@ XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t len, XXH32_hash_t s
     XXH32_reset(&state, seed);
     XXH32_update(&state, (const xxh_u8*)input, len);
     return XXH32_digest(&state);
-
 #else
-
     if (XXH_FORCE_ALIGN_CHECK) {
         if ((((size_t)input) & 3) == 0) {   /* Input is 4-bytes aligned, leverage the speed benefit */
             return XXH32_endian_align((const xxh_u8*)input, len, seed, XXH_aligned);
@@ -1400,22 +1825,27 @@ XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t len, XXH32_hash_t s
 
 
 /*******   Hash streaming   *******/
-
+/*!
+ * @ingroup xxh32_family
+ */
 XXH_PUBLIC_API XXH32_state_t* XXH32_createState(void)
 {
     return (XXH32_state_t*)XXH_malloc(sizeof(XXH32_state_t));
 }
+/*! @ingroup xxh32_family */
 XXH_PUBLIC_API XXH_errorcode XXH32_freeState(XXH32_state_t* statePtr)
 {
     XXH_free(statePtr);
     return XXH_OK;
 }
 
+/*! @ingroup xxh32_family */
 XXH_PUBLIC_API void XXH32_copyState(XXH32_state_t* dstState, const XXH32_state_t* srcState)
 {
     memcpy(dstState, srcState, sizeof(*dstState));
 }
 
+/*! @ingroup xxh32_family */
 XXH_PUBLIC_API XXH_errorcode XXH32_reset(XXH32_state_t* statePtr, XXH32_hash_t seed)
 {
     XXH32_state_t state;   /* using a local state to memcpy() in order to avoid strict-aliasing warnings */
@@ -1430,6 +1860,7 @@ XXH_PUBLIC_API XXH_errorcode XXH32_reset(XXH32_state_t* statePtr, XXH32_hash_t s
 }
 
 
+/*! @ingroup xxh32_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH32_update(XXH32_state_t* state, const void* input, size_t len)
 {
@@ -1494,7 +1925,8 @@ XXH32_update(XXH32_state_t* state, const void* input, size_t len)
 }
 
 
-XXH_PUBLIC_API XXH32_hash_t XXH32_digest (const XXH32_state_t* state)
+/*! @ingroup xxh32_family */
+XXH_PUBLIC_API XXH32_hash_t XXH32_digest(const XXH32_state_t* state)
 {
     xxh_u32 h32;
 
@@ -1515,7 +1947,8 @@ XXH_PUBLIC_API XXH32_hash_t XXH32_digest (const XXH32_state_t* state)
 
 /*******   Canonical representation   *******/
 
-/*
+/*!
+ * @ingroup xxh32_family
  * The default return values from XXH functions are unsigned 32 and 64 bit
  * integers.
  *
@@ -1534,7 +1967,7 @@ XXH_PUBLIC_API void XXH32_canonicalFromHash(XXH32_canonical_t* dst, XXH32_hash_t
     if (XXH_CPU_LITTLE_ENDIAN) hash = XXH_swap32(hash);
     memcpy(dst, &hash, sizeof(*dst));
 }
-
+/*! @ingroup xxh32_family */
 XXH_PUBLIC_API XXH32_hash_t XXH32_hashFromCanonical(const XXH32_canonical_t* src)
 {
     return XXH_readBE32(src);
@@ -1546,7 +1979,11 @@ XXH_PUBLIC_API XXH32_hash_t XXH32_hashFromCanonical(const XXH32_canonical_t* src
 /* *******************************************************************
 *  64-bit hash functions
 *********************************************************************/
-
+/*!
+ * @}
+ * @ingroup impl
+ * @{
+ */
 /*******   Memory access   *******/
 
 typedef XXH64_hash_t xxh_u64;
@@ -1592,7 +2029,10 @@ typedef XXH64_hash_t xxh_u64;
 #elif (defined(XXH_FORCE_MEMORY_ACCESS) && (XXH_FORCE_MEMORY_ACCESS==2))
 
 /* Force direct memory access. Only works on CPU which support unaligned memory access in hardware */
-static xxh_u64 XXH_read64(const void* memPtr) { return *(const xxh_u64*) memPtr; }
+static xxh_u64 XXH_read64(const void* memPtr)
+{
+    return *(const xxh_u64*) memPtr;
+}
 
 #elif (defined(XXH_FORCE_MEMORY_ACCESS) && (XXH_FORCE_MEMORY_ACCESS==1))
 
@@ -1631,7 +2071,7 @@ static xxh_u64 XXH_read64(const void* memPtr)
 #elif XXH_GCC_VERSION >= 403
 #  define XXH_swap64 __builtin_bswap64
 #else
-static xxh_u64 XXH_swap64 (xxh_u64 x)
+static xxh_u64 XXH_swap64(xxh_u64 x)
 {
     return  ((x << 56) & 0xff00000000000000ULL) |
             ((x << 40) & 0x00ff000000000000ULL) |
@@ -1697,12 +2137,17 @@ XXH_readLE64_align(const void* ptr, XXH_alignment align)
 
 
 /*******   xxh64   *******/
-
-static const xxh_u64 XXH_PRIME64_1 = 0x9E3779B185EBCA87ULL;   /* 0b1001111000110111011110011011000110000101111010111100101010000111 */
-static const xxh_u64 XXH_PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;   /* 0b1100001010110010101011100011110100100111110101001110101101001111 */
-static const xxh_u64 XXH_PRIME64_3 = 0x165667B19E3779F9ULL;   /* 0b0001011001010110011001111011000110011110001101110111100111111001 */
-static const xxh_u64 XXH_PRIME64_4 = 0x85EBCA77C2B2AE63ULL;   /* 0b1000010111101011110010100111011111000010101100101010111001100011 */
-static const xxh_u64 XXH_PRIME64_5 = 0x27D4EB2F165667C5ULL;   /* 0b0010011111010100111010110010111100010110010101100110011111000101 */
+/*!
+ * @}
+ * @defgroup xxh64_impl XXH64 implementation
+ * @ingroup impl
+ * @{
+ */
+static const xxh_u64 XXH_PRIME64_1 = 0x9E3779B185EBCA87ULL;   /*!< 0b1001111000110111011110011011000110000101111010111100101010000111 */
+static const xxh_u64 XXH_PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;   /*!< 0b1100001010110010101011100011110100100111110101001110101101001111 */
+static const xxh_u64 XXH_PRIME64_3 = 0x165667B19E3779F9ULL;   /*!< 0b0001011001010110011001111011000110011110001101110111100111111001 */
+static const xxh_u64 XXH_PRIME64_4 = 0x85EBCA77C2B2AE63ULL;   /*!< 0b1000010111101011110010100111011111000010101100101010111001100011 */
+static const xxh_u64 XXH_PRIME64_5 = 0x27D4EB2F165667C5ULL;   /*!< 0b0010011111010100111010110010111100010110010101100110011111000101 */
 
 #ifdef XXH_OLD_NAMES
 #  define PRIME64_1 XXH_PRIME64_1
@@ -1919,6 +2364,7 @@ XXH64_endian_align(const xxh_u8* input, size_t len, xxh_u64 seed, XXH_alignment
 }
 
 
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API XXH64_hash_t XXH64 (const void* input, size_t len, XXH64_hash_t seed)
 {
 #if 0
@@ -1927,9 +2373,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH64 (const void* input, size_t len, XXH64_hash_t s
     XXH64_reset(&state, seed);
     XXH64_update(&state, (const xxh_u8*)input, len);
     return XXH64_digest(&state);
-
 #else
-
     if (XXH_FORCE_ALIGN_CHECK) {
         if ((((size_t)input) & 7)==0) {  /* Input is aligned, let's leverage the speed advantage */
             return XXH64_endian_align((const xxh_u8*)input, len, seed, XXH_aligned);
@@ -1942,21 +2386,25 @@ XXH_PUBLIC_API XXH64_hash_t XXH64 (const void* input, size_t len, XXH64_hash_t s
 
 /*******   Hash Streaming   *******/
 
+/*! @ingroup xxh64_family*/
 XXH_PUBLIC_API XXH64_state_t* XXH64_createState(void)
 {
     return (XXH64_state_t*)XXH_malloc(sizeof(XXH64_state_t));
 }
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API XXH_errorcode XXH64_freeState(XXH64_state_t* statePtr)
 {
     XXH_free(statePtr);
     return XXH_OK;
 }
 
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API void XXH64_copyState(XXH64_state_t* dstState, const XXH64_state_t* srcState)
 {
     memcpy(dstState, srcState, sizeof(*dstState));
 }
 
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API XXH_errorcode XXH64_reset(XXH64_state_t* statePtr, XXH64_hash_t seed)
 {
     XXH64_state_t state;   /* use a local state to memcpy() in order to avoid strict-aliasing warnings */
@@ -1970,6 +2418,7 @@ XXH_PUBLIC_API XXH_errorcode XXH64_reset(XXH64_state_t* statePtr, XXH64_hash_t s
     return XXH_OK;
 }
 
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH64_update (XXH64_state_t* state, const void* input, size_t len)
 {
@@ -1997,7 +2446,7 @@ XXH64_update (XXH64_state_t* state, const void* input, size_t len)
             state->v2 = XXH64_round(state->v2, XXH_readLE64(state->mem64+1));
             state->v3 = XXH64_round(state->v3, XXH_readLE64(state->mem64+2));
             state->v4 = XXH64_round(state->v4, XXH_readLE64(state->mem64+3));
-            p += 32-state->memsize;
+            p += 32 - state->memsize;
             state->memsize = 0;
         }
 
@@ -2031,7 +2480,8 @@ XXH64_update (XXH64_state_t* state, const void* input, size_t len)
 }
 
 
-XXH_PUBLIC_API XXH64_hash_t XXH64_digest (const XXH64_state_t* state)
+/*! @ingroup xxh64_family */
+XXH_PUBLIC_API XXH64_hash_t XXH64_digest(const XXH64_state_t* state)
 {
     xxh_u64 h64;
 
@@ -2058,6 +2508,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_digest (const XXH64_state_t* state)
 
 /******* Canonical representation   *******/
 
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API void XXH64_canonicalFromHash(XXH64_canonical_t* dst, XXH64_hash_t hash)
 {
     XXH_STATIC_ASSERT(sizeof(XXH64_canonical_t) == sizeof(XXH64_hash_t));
@@ -2065,6 +2516,7 @@ XXH_PUBLIC_API void XXH64_canonicalFromHash(XXH64_canonical_t* dst, XXH64_hash_t
     memcpy(dst, &hash, sizeof(*dst));
 }
 
+/*! @ingroup xxh64_family */
 XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src)
 {
     return XXH_readBE64(src);
@@ -2186,12 +2638,62 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 /* ==========================================
  * Vectorization detection
  * ========================================== */
-#define XXH_SCALAR 0  /* Portable scalar version */
-#define XXH_SSE2   1  /* SSE2 for Pentium 4 and all x86_64 */
-#define XXH_AVX2   2  /* AVX2 for Haswell and Bulldozer */
-#define XXH_AVX512 3  /* AVX512 for Skylake and Icelake */
-#define XXH_NEON   4  /* NEON for most ARMv7-A and all AArch64 */
-#define XXH_VSX    5  /* VSX and ZVector for POWER8/z13 */
+
+#ifdef XXH_DOXYGEN
+/*!
+ * @ingroup tuning
+ * @brief Overrides the vectorization implementation chosen for XXH3.
+ *
+ * Can be defined to 0 to disable SIMD or any of the values mentioned in
+ * @ref XXH_VECTOR_TYPE.
+ *
+ * If this is not defined, it uses predefined macros to determine the best
+ * implementation.
+ */
+#  define XXH_VECTOR XXH_SCALAR
+/*!
+ * @ingroup tuning
+ * @brief Possible values for @ref XXH_VECTOR.
+ *
+ * Note that these are actually implemented as macros.
+ *
+ * If this is not defined, it is detected automatically.
+ * @ref XXH_X86DISPATCH overrides this.
+ */
+enum XXH_VECTOR_TYPE /* fake enum */ {
+    XXH_SCALAR = 0,  /*!< Portable scalar version */
+    XXH_SSE2   = 1,  /*!<
+                      * SSE2 for Pentium 4, Opteron, all x86_64.
+                      *
+                      * @note SSE2 is also guaranteed on Windows 10, macOS, and
+                      * Android x86.
+                      */
+    XXH_AVX2   = 2,  /*!< AVX2 for Haswell and Bulldozer */
+    XXH_AVX512 = 3,  /*!< AVX512 for Skylake and Icelake */
+    XXH_NEON   = 4,  /*!< NEON for most ARMv7-A and all AArch64 */
+    XXH_VSX    = 5,  /*!< VSX and ZVector for POWER8/z13 (64-bit) */
+};
+/*!
+ * @ingroup tuning
+ * @brief Selects the minumum alignment for XXH3's accumulators.
+ *
+ * When using SIMD, this should match the alignment reqired for said vector
+ * type, so, for example, 32 for AVX2.
+ *
+ * Default: Auto detected.
+ */
+#  define XXH_ACC_ALIGN 8
+#endif
+
+/* Actual definition */
+#ifndef XXH_DOXYGEN
+#  define XXH_SCALAR 0
+#  define XXH_SSE2   1
+#  define XXH_AVX2   2
+#  define XXH_AVX512 3
+#  define XXH_NEON   4
+#  define XXH_VSX    5
+#endif
 
 #ifndef XXH_VECTOR    /* can be defined on command line */
 #  if defined(__AVX512F__)
@@ -2344,7 +2846,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
  * This is available on ARMv7-A, but is less efficient than a single VZIP.32.
  */
 
-/*
+/*!
  * Function-like macro:
  * void XXH_SPLIT_IN_PLACE(uint64x2_t &in, uint32x2_t &outLo, uint32x2_t &outHi)
  * {
@@ -2417,10 +2919,12 @@ typedef __vector unsigned xxh_u32x4;
 # endif /* !defined(XXH_VSX_BE) */
 
 # if XXH_VSX_BE
-/* A wrapper for POWER9's vec_revb. */
 #  if defined(__POWER9_VECTOR__) || (defined(__clang__) && defined(__s390x__))
 #    define XXH_vec_revb vec_revb
 #  else
+/*!
+ * A polyfill for POWER9's vec_revb().
+ */
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_revb(xxh_u64x2 val)
 {
     xxh_u8x16 const vByteSwap = { 0x07, 0x06, 0x05, 0x04, 0x03, 0x02, 0x01, 0x00,
@@ -2430,8 +2934,8 @@ XXH_FORCE_INLINE xxh_u64x2 XXH_vec_revb(xxh_u64x2 val)
 #  endif
 # endif /* XXH_VSX_BE */
 
-/*
- * Performs an unaligned load and byte swaps it on big endian.
+/*!
+ * Performs an unaligned vector load and byte swaps it on big endian.
  */
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_loadu(const void *ptr)
 {
@@ -2502,7 +3006,7 @@ XXH_FORCE_INLINE xxh_u64x2 XXH_vec_mule(xxh_u32x4 a, xxh_u32x4 b)
 #  error "default keyset is not large enough"
 #endif
 
-/* Pseudorandom secret taken directly from FARSH */
+/*! Pseudorandom secret taken directly from FARSH. */
 XXH_ALIGN(64) static const xxh_u8 XXH3_kSecret[XXH_SECRET_DEFAULT_SIZE] = {
     0xb8, 0xfe, 0x6c, 0x39, 0x23, 0xa4, 0x4b, 0xbe, 0x7c, 0x01, 0x81, 0x2c, 0xf7, 0x21, 0xad, 0x1c,
     0xde, 0xd4, 0x6d, 0xe9, 0x83, 0x90, 0x97, 0xdb, 0x72, 0x40, 0xa4, 0xa4, 0xb7, 0xb3, 0x67, 0x1f,
@@ -2523,23 +3027,29 @@ XXH_ALIGN(64) static const xxh_u8 XXH3_kSecret[XXH_SECRET_DEFAULT_SIZE] = {
 #  define kSecret XXH3_kSecret
 #endif
 
-/*
- * Calculates a 32-bit to 64-bit long multiply.
+#ifdef XXH_DOXYGEN
+/*!
+ * @brief Calculates a 32-bit to 64-bit long multiply.
+ *
+ * Implemented as a macro.
  *
- * Wraps __emulu on MSVC x86 because it tends to call __allmul when it doesn't
+ * Wraps `__emulu` on MSVC x86 because it tends to call `__allmul` when it doesn't
  * need to (but it shouldn't need to anyways, it is about 7 instructions to do
- * a 64x64 multiply...). Since we know that this will _always_ emit MULL, we
+ * a 64x64 multiply...). Since we know that this will _always_ emit `MULL`, we
  * use that instead of the normal method.
  *
  * If you are compiling for platforms like Thumb-1 and don't have a better option,
  * you may also want to write your own long multiply routine here.
  *
- * XXH_FORCE_INLINE xxh_u64 XXH_mult32to64(xxh_u64 x, xxh_u64 y)
- * {
- *    return (x & 0xFFFFFFFF) * (y & 0xFFFFFFFF);
- * }
+ * @param x, y Numbers to be multiplied
+ * @return 64-bit product of the low 32 bits of @p x and @p y.
  */
-#if defined(_MSC_VER) && defined(_M_IX86)
+XXH_FORCE_INLINE xxh_u64
+XXH_mult32to64(xxh_u64 x, xxh_u64 y)
+{
+   return (x & 0xFFFFFFFF) * (y & 0xFFFFFFFF);
+}
+#elif defined(_MSC_VER) && defined(_M_IX86)
 #    include <intrin.h>
 #    define XXH_mult32to64(x, y) __emulu((unsigned)(x), (unsigned)(y))
 #else
@@ -2553,10 +3063,14 @@ XXH_ALIGN(64) static const xxh_u8 XXH3_kSecret[XXH_SECRET_DEFAULT_SIZE] = {
 #    define XXH_mult32to64(x, y) ((xxh_u64)(xxh_u32)(x) * (xxh_u64)(xxh_u32)(y))
 #endif
 
-/*
- * Calculates a 64->128-bit long multiply.
+/*!
+ * @brief Calculates a 64->128-bit long multiply.
+ *
+ * Uses `__uint128_t` and `_umul128` if available, otherwise uses a scalar
+ * version.
  *
- * Uses __uint128_t and _umul128 if available, otherwise uses a scalar version.
+ * @param lhs, rhs The 64-bit integers to be multiplied
+ * @return The 128-bit result represented in an @ref XXH128_hash_t.
  */
 static XXH128_hash_t
 XXH_mult64to128(xxh_u64 lhs, xxh_u64 rhs)
@@ -2667,11 +3181,15 @@ XXH_mult64to128(xxh_u64 lhs, xxh_u64 rhs)
 #endif
 }
 
-/*
- * Does a 64-bit to 128-bit multiply, then XOR folds it.
+/*!
+ * @brief Calculates a 64-bit to 128-bit multiply, then XOR folds it.
  *
  * The reason for the separate function is to prevent passing too many structs
  * around by value. This will hopefully inline the multiply, but we don't force it.
+ *
+ * @param lhs, rhs The 64-bit integers to multiply
+ * @return The low 64 bits of the product XOR'd by the high 64 bits.
+ * @see XXH_mult64to128()
  */
 static xxh_u64
 XXH3_mul128_fold64(xxh_u64 lhs, xxh_u64 rhs)
@@ -2680,7 +3198,7 @@ XXH3_mul128_fold64(xxh_u64 lhs, xxh_u64 rhs)
     return product.low64 ^ product.high64;
 }
 
-/* Seems to produce slightly better code on GCC for some reason. */
+/*! Seems to produce slightly better code on GCC for some reason. */
 XXH_FORCE_INLINE xxh_u64 XXH_xorshift64(xxh_u64 v64, int shift)
 {
     XXH_ASSERT(0 <= shift && shift < 64);
@@ -3851,17 +4369,20 @@ XXH3_64bits_internal(const void* XXH_RESTRICT input, size_t len,
 
 /* ===   Public entry point   === */
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH64_hash_t XXH3_64bits(const void* input, size_t len)
 {
     return XXH3_64bits_internal(input, len, 0, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_hashLong_64b_default);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH64_hash_t
 XXH3_64bits_withSecret(const void* input, size_t len, const void* secret, size_t secretSize)
 {
     return XXH3_64bits_internal(input, len, 0, secret, secretSize, XXH3_hashLong_64b_withSecret);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH64_hash_t
 XXH3_64bits_withSeed(const void* input, size_t len, XXH64_hash_t seed)
 {
@@ -3936,6 +4457,7 @@ static void XXH_alignedFree(void* p)
         XXH_free(base);
     }
 }
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH3_state_t* XXH3_createState(void)
 {
     XXH3_state_t* const state = (XXH3_state_t*)XXH_alignedMalloc(sizeof(XXH3_state_t), 64);
@@ -3944,12 +4466,14 @@ XXH_PUBLIC_API XXH3_state_t* XXH3_createState(void)
     return state;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode XXH3_freeState(XXH3_state_t* statePtr)
 {
     XXH_alignedFree(statePtr);
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API void
 XXH3_copyState(XXH3_state_t* dst_state, const XXH3_state_t* src_state)
 {
@@ -3982,6 +4506,7 @@ XXH3_64bits_reset_internal(XXH3_state_t* statePtr,
     statePtr->nbStripesPerBlock = statePtr->secretLimit / XXH_SECRET_CONSUME_RATE;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_64bits_reset(XXH3_state_t* statePtr)
 {
@@ -3990,6 +4515,7 @@ XXH3_64bits_reset(XXH3_state_t* statePtr)
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_64bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t secretSize)
 {
@@ -4000,6 +4526,7 @@ XXH3_64bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_64bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
 {
@@ -4109,6 +4636,7 @@ XXH3_update(XXH3_state_t* state,
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_64bits_update(XXH3_state_t* state, const void* input, size_t len)
 {
@@ -4151,6 +4679,7 @@ XXH3_digest_long (XXH64_hash_t* acc,
     }
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_digest (const XXH3_state_t* state)
 {
     const unsigned char* const secret = (state->extSecret == NULL) ? state->customSecret : state->extSecret;
@@ -4171,6 +4700,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_digest (const XXH3_state_t* state)
 
 #define XXH_MIN(x, y) (((x) > (y)) ? (y) : (x))
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API void
 XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSeedSize)
 {
@@ -4583,6 +5113,7 @@ XXH3_128bits_internal(const void* input, size_t len,
 
 /* ===   Public XXH128 API   === */
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t XXH3_128bits(const void* input, size_t len)
 {
     return XXH3_128bits_internal(input, len, 0,
@@ -4590,6 +5121,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH3_128bits(const void* input, size_t len)
                                  XXH3_hashLong_128b_default);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t
 XXH3_128bits_withSecret(const void* input, size_t len, const void* secret, size_t secretSize)
 {
@@ -4598,6 +5130,7 @@ XXH3_128bits_withSecret(const void* input, size_t len, const void* secret, size_
                                  XXH3_hashLong_128b_withSecret);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t
 XXH3_128bits_withSeed(const void* input, size_t len, XXH64_hash_t seed)
 {
@@ -4606,6 +5139,7 @@ XXH3_128bits_withSeed(const void* input, size_t len, XXH64_hash_t seed)
                                  XXH3_hashLong_128b_withSeed);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t
 XXH128(const void* input, size_t len, XXH64_hash_t seed)
 {
@@ -4628,6 +5162,7 @@ XXH3_128bits_reset_internal(XXH3_state_t* statePtr,
     XXH3_64bits_reset_internal(statePtr, seed, secret, secretSize);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset(XXH3_state_t* statePtr)
 {
@@ -4636,6 +5171,7 @@ XXH3_128bits_reset(XXH3_state_t* statePtr)
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t secretSize)
 {
@@ -4646,6 +5182,7 @@ XXH3_128bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
 {
@@ -4656,6 +5193,7 @@ XXH3_128bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_update(XXH3_state_t* state, const void* input, size_t len)
 {
@@ -4663,6 +5201,7 @@ XXH3_128bits_update(XXH3_state_t* state, const void* input, size_t len)
                        XXH3_accumulate_512, XXH3_scrambleAcc);
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t XXH3_128bits_digest (const XXH3_state_t* state)
 {
     const unsigned char* const secret = (state->extSecret == NULL) ? state->customSecret : state->extSecret;
@@ -4693,6 +5232,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH3_128bits_digest (const XXH3_state_t* state)
 #include <string.h>   /* memcmp, memcpy */
 
 /* return : 1 is equal, 0 if different */
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API int XXH128_isEqual(XXH128_hash_t h1, XXH128_hash_t h2)
 {
     /* note : XXH128_hash_t is compact, it has no padding byte */
@@ -4703,6 +5243,7 @@ XXH_PUBLIC_API int XXH128_isEqual(XXH128_hash_t h1, XXH128_hash_t h2)
  * return : >0 if *h128_1  > *h128_2
  *          <0 if *h128_1  < *h128_2
  *          =0 if *h128_1 == *h128_2  */
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API int XXH128_cmp(const void* h128_1, const void* h128_2)
 {
     XXH128_hash_t const h1 = *(const XXH128_hash_t*)h128_1;
@@ -4715,6 +5256,7 @@ XXH_PUBLIC_API int XXH128_cmp(const void* h128_1, const void* h128_2)
 
 
 /*======   Canonical representation   ======*/
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API void
 XXH128_canonicalFromHash(XXH128_canonical_t* dst, XXH128_hash_t hash)
 {
@@ -4727,6 +5269,7 @@ XXH128_canonicalFromHash(XXH128_canonical_t* dst, XXH128_hash_t hash)
     memcpy((char*)dst + sizeof(hash.high64), &hash.low64, sizeof(hash.low64));
 }
 
+/*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t
 XXH128_hashFromCanonical(const XXH128_canonical_t* src)
 {
@@ -4745,7 +5288,9 @@ XXH128_hashFromCanonical(const XXH128_canonical_t* src)
 
 #endif  /* XXH_NO_LONG_LONG */
 
-
+/*!
+ * @}
+ */
 #endif  /* XXH_IMPLEMENTATION */
 
 

From 7179055461467d08c159716de1d73a0ec3732825 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Thu, 15 Oct 2020 20:14:44 -0400
Subject: [PATCH 032/187] Fix comment typos

---
 xxhash.h | 31 +++++++++++++++----------------
 1 file changed, 15 insertions(+), 16 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 7bbf5e11..006825eb 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -270,11 +270,12 @@ extern "C" {
 #define XXH_VERSION_NUMBER  (XXH_VERSION_MAJOR *100*100 + XXH_VERSION_MINOR *100 + XXH_VERSION_RELEASE)
 
 /*!
- * @brief Obtains the xxHash version that was compiles.
+ * @brief Obtains the xxHash version.
  *
- * This is only uaeful when xxHash is compiled as a shared library.
+ * This is only useful when xxHash is compiled as a shared library, as it is
+ * independent of the version defined in the header.
  *
- * @return `XXH_VERSION_NUMBER` as of when it was compiled.
+ * @return `XXH_VERSION_NUMBER` as of when the function was compiled.
  */
 XXH_PUBLIC_API unsigned XXH_versionNumber (void);
 
@@ -337,8 +338,8 @@ typedef uint32_t XXH32_hash_t;
  *
  * Speed on Core 2 Duo @ 3 GHz (single thread, SMHasher benchmark): 5.4 GB/s
  *
- * @param input The block of data to be hashed, at least @p length bytes in size
- * @param length The length of @p input in bytes
+ * @param input The block of data to be hashed, at least @p length bytes in size.
+ * @param length The length of @p input, in bytes.
  * @param seed The 32-bit seed to alter the hash's output predictably.
  *
  * @pre
@@ -346,11 +347,11 @@ typedef uint32_t XXH32_hash_t;
  *   readable, contiguous memory. However, if @p length is `0`, @p input may be
  *   `NULL`. In C++, this also must be *TriviallyCopyable*.
  *
- * @return The calculated 32-bit hash
+ * @return The calculated 32-bit hash value.
  *
  * @see
  *    XXH64(), XXH3_64bits_withSeed(), XXH3_128bits_withSeed(), XXH128():
- *    Direct equivalents for the other variants of xxHash
+ *    Direct equivalents for the other variants of xxHash.
  * @see
  *    XXH32_createState(), XXH32_update(), XXH32_digest(): Streaming version.
  */
@@ -404,10 +405,8 @@ XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t length, XXH32_hash_
  *        return hash;
  *    }
  * @endcode
- *
  */
 
-
 /*!
  * @typedef struct XXH32_state_s XXH32_state_t
  * @brief The opaque state struct for the XXH32 streaming API.
@@ -434,8 +433,8 @@ XXH_PUBLIC_API XXH_errorcode  XXH32_freeState(XXH32_state_t* statePtr);
 /*!
  * @brief Copies one @ref XXH32_state_t to another.
  *
- * @param dst_state The state to copy to
- * @param src_state The state to copy from
+ * @param dst_state The state to copy to.
+ * @param src_state The state to copy from.
  * @pre
  *   @p dst_state and @p src_state must not be `NULL` and must not overlap.
  */
@@ -452,7 +451,7 @@ XXH_PUBLIC_API void XXH32_copyState(XXH32_state_t* dst_state, const XXH32_state_
  * @pre
  *   @p statePtr must not be `NULL`.
  *
- * @return @ref XXH_OK on success, @ref XXH_ERROR on failure
+ * @return @ref XXH_OK on success, @ref XXH_ERROR on failure.
  */
 XXH_PUBLIC_API XXH_errorcode XXH32_reset  (XXH32_state_t* statePtr, XXH32_hash_t seed);
 
@@ -462,8 +461,8 @@ XXH_PUBLIC_API XXH_errorcode XXH32_reset  (XXH32_state_t* statePtr, XXH32_hash_t
  * Call this to incrementally consume blocks of data.
  *
  * @param statePtr The state struct to update.
- * @param input The block of data to be hashed, at least @p length bytes in size
- * @param length The length of @p input in bytes
+ * @param input The block of data to be hashed, at least @p length bytes in size.
+ * @param length The length of @p input, in bytes.
  *
  * @pre
  *   @p statePtr must not be `NULL`.
@@ -523,7 +522,7 @@ typedef struct {
 /*!
  * @brief Converts an @ref XXH32_hash_t to a big endian @ref XXH32_canonical_t.
  *
- * @param dst The @ref XXH32_canonical_t pointer to be stored to
+ * @param dst The @ref XXH32_canonical_t pointer to be stored to.
  * @param hash The @ref XXH32_hash_t to be converted.
  *
  * @pre
@@ -611,7 +610,7 @@ typedef uint64_t XXH64_hash_t;
  *
  * @see
  *    XXH32(), XXH3_64bits_withSeed(), XXH3_128bits_withSeed(), XXH128():
- *    Direct equivalents for the other variants of xxHash
+ *    Direct equivalents for the other variants of xxHash.
  * @see
  *    XXH64_createState(), XXH64_update(), XXH64_digest(): Streaming version.
  */

From 960709597b5e488493074eeea4e7ef4ad87890b9 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Fri, 16 Oct 2020 15:44:35 -0400
Subject: [PATCH 033/187] Finish XXH32 internal documentation.

---
 xxhash.h | 176 ++++++++++++++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 160 insertions(+), 16 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 006825eb..a58c7c22 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -962,7 +962,7 @@ struct XXH64_state_s {
  *
  * @note **This structure has a strict alignment requirement of 64 bytes.** Do
  * not allocate this with `malloc()` or `new`, it will not be sufficiently
- * aligned. Use @ref XXH3_createState(), @ref XXH3_freeState, or stack
+ * aligned. Use @ref XXH3_createState() and @ref XXH3_freeState(), or stack
  * allocation.
  *
  * Typedef'd to @ref XXH3_state_t.
@@ -1241,6 +1241,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 #  define XXH_REROLL 0
 
 /*!
+ * @internal
  * @brief Redefines old internal names.
  *
  * For compatibility with code that uses xxHash's internals before the names
@@ -1301,17 +1302,30 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 /* *************************************
 *  Includes & Memory related functions
 ***************************************/
-/*!
+/*
  * Modify the local functions below should you wish to use
  * different memory routines for malloc() and free()
  */
 #include <stdlib.h>
 
+/*!
+ * @internal
+ * @brief Modify this function to use a different routine than malloc().
+ */
 static void* XXH_malloc(size_t s) { return malloc(s); }
+
+/*!
+ * @internal
+ * @brief Modify this function to use a different routine than free().
+ */
 static void XXH_free(void* p) { free(p); }
 
-/*! and for memcpy() */
 #include <string.h>
+
+/*!
+ * @internal
+ * @brief Modify this function to use a different routine than memcpy().
+ */
 static void* XXH_memcpy(void* dest, const void* src, size_t size)
 {
     return memcpy(dest,src,size);
@@ -1403,6 +1417,56 @@ typedef XXH32_hash_t xxh_u32;
 
 /* ***   Memory access   *** */
 
+/*!
+ * @internal
+ * @fn xxh_u32 XXH_read32(const void* ptr)
+ * @brief Reads an unaligned 32-bit integer from @p ptr in native endianness.
+ *
+ * Affected by @ref XXH_FORCE_MEMORY_ACCESS.
+ *
+ * @param ptr The pointer to read from.
+ * @return The 32-bit native endian integer from the bytes at @p ptr.
+ */
+
+/*!
+ * @internal
+ * @fn xxh_u32 XXH_readLE32(const void* ptr)
+ * @brief Reads an unaligned 32-bit little endian integer from @p ptr.
+ *
+ * Affected by @ref XXH_FORCE_MEMORY_ACCESS.
+ *
+ * @param ptr The pointer to read from.
+ * @return The 32-bit little endian integer from the bytes at @p ptr.
+ */
+
+/*!
+ * @internal
+ * @fn xxh_u32 XXH_readBE32(const void* ptr)
+ * @brief Reads an unaligned 32-bit big endian integer from @p ptr.
+ *
+ * Affected by @ref XXH_FORCE_MEMORY_ACCESS.
+ *
+ * @param ptr The pointer to read from.
+ * @return The 32-bit big endian integer from the bytes at @p ptr.
+ */
+
+/*!
+ * @internal
+ * @fn xxh_u32 XXH_readLE32_align(const void* ptr, XXH_alignment align)
+ * @brief Like @ref XXH_readLE32(), but has an option for aligned reads.
+ *
+ * Affected by @ref XXH_FORCE_MEMORY_ACCESS.
+ * Note that when @ref XXH_FORCE_ALIGN_CHECK == 0, the @p align parameter is
+ * always @ref XXH_alignment::XXH_unaligned.
+ *
+ * @param ptr The pointer to read from.
+ * @param align Whether @p ptr is aligned.
+ * @pre
+ *   If @p align == @ref XXH_alignment::XXH_aligned, @p ptr must be 4 byte
+ *   aligned.
+ * @return The 32-bit little endian integer from the bytes at @p ptr.
+ */
+
 #if (defined(XXH_FORCE_MEMORY_ACCESS) && (XXH_FORCE_MEMORY_ACCESS==3))
 /*
  * Manual byteshift. Best for old compilers which don't inline memcpy.
@@ -1453,12 +1517,20 @@ static xxh_u32 XXH_read32(const void* memPtr)
 typedef enum { XXH_bigEndian=0, XXH_littleEndian=1 } XXH_endianess;
 
 /*!
- * XXH_CPU_LITTLE_ENDIAN:
+ * @ingroup tuning
+ * @def XXH_CPU_LITTLE_ENDIAN
+ * @brief Whether the target is little endian.
+ *
  * Defined to 1 if the target is little endian, or 0 if it is big endian.
  * It can be defined externally, for example on the compiler command line.
  *
  * If it is not defined, a runtime check (which is usually constant folded)
  * is used instead.
+ *
+ * @note
+ *   This is not necessarily defined to an integer constant.
+ *
+ * @see XXH_isLittleEndian() for the runtime check.
  */
 #ifndef XXH_CPU_LITTLE_ENDIAN
 /*
@@ -1473,8 +1545,11 @@ typedef enum { XXH_bigEndian=0, XXH_littleEndian=1 } XXH_endianess;
      || (defined(__BYTE_ORDER__) && __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__)
 #    define XXH_CPU_LITTLE_ENDIAN 0
 #  else
-/*
- * runtime test, presumed to simplify to a constant by compiler
+/*!
+ * @internal
+ * @brief Runtime check for @ref XXH_CPU_LITTLE_ENDIAN.
+ *
+ * Most compilers will constant fold this.
  */
 static int XXH_isLittleEndian(void)
 {
@@ -1503,6 +1578,19 @@ static int XXH_isLittleEndian(void)
 #  define XXH_HAS_BUILTIN(x) 0
 #endif
 
+/*!
+ * @internal
+ * @def XXH_rotl32(x,r)
+ * @brief 32-bit rotate left.
+ *
+ * @param x The 32-bit integer to be rotated.
+ * @param r The number of bits to rotate.
+ * @pre
+ *   @p r > 0 && @p r < 32
+ * @note
+ *   @p x and @p r may be evaluated multiple times.
+ * @return The rotated result.
+ */
 #if !defined(NO_CLANG_BUILTIN) && XXH_HAS_BUILTIN(__builtin_rotateleft32) \
                                && XXH_HAS_BUILTIN(__builtin_rotateleft64)
 #  define XXH_rotl32 __builtin_rotateleft32
@@ -1516,6 +1604,14 @@ static int XXH_isLittleEndian(void)
 #  define XXH_rotl64(x,r) (((x) << (r)) | ((x) >> (64 - (r))))
 #endif
 
+/*!
+ * @internal
+ * @fn xxh_u32 XXH_swap32(xxh_u32 x)
+ * @brief A 32-bit byteswap.
+ *
+ * @param x The 32-bit integer to byteswap.
+ * @return @p x, byteswapped.
+ */
 #if defined(_MSC_VER)     /* Visual Studio */
 #  define XXH_swap32 _byteswap_ulong
 #elif XXH_GCC_VERSION >= 403
@@ -1534,7 +1630,15 @@ static xxh_u32 XXH_swap32 (xxh_u32 x)
 /* ***************************
 *  Memory reads
 *****************************/
-typedef enum { XXH_aligned, XXH_unaligned } XXH_alignment;
+
+/*!
+ * @internal
+ * @brief Enum to indicate whether a pointer is aligned.
+ */
+typedef enum {
+    XXH_aligned,  /*!< Aligned */
+    XXH_unaligned /*!< Possibly unaligned */
+} XXH_alignment;
 
 /*
  * XXH_FORCE_MEMORY_ACCESS==3 is an endian-independent byteshift load.
@@ -1600,11 +1704,11 @@ XXH_PUBLIC_API unsigned XXH_versionNumber (void) { return XXH_VERSION_NUMBER; }
  * @ingroup impl
  * @{
  */
-static const xxh_u32 XXH_PRIME32_1 = 0x9E3779B1U;   /* 0b10011110001101110111100110110001 */
-static const xxh_u32 XXH_PRIME32_2 = 0x85EBCA77U;   /* 0b10000101111010111100101001110111 */
-static const xxh_u32 XXH_PRIME32_3 = 0xC2B2AE3DU;   /* 0b11000010101100101010111000111101 */
-static const xxh_u32 XXH_PRIME32_4 = 0x27D4EB2FU;   /* 0b00100111110101001110101100101111 */
-static const xxh_u32 XXH_PRIME32_5 = 0x165667B1U;   /* 0b00010110010101100110011110110001 */
+static const xxh_u32 XXH_PRIME32_1 = 0x9E3779B1U;   /*!< 0b10011110001101110111100110110001 */
+static const xxh_u32 XXH_PRIME32_2 = 0x85EBCA77U;   /*!< 0b10000101111010111100101001110111 */
+static const xxh_u32 XXH_PRIME32_3 = 0xC2B2AE3DU;   /*!< 0b11000010101100101010111000111101 */
+static const xxh_u32 XXH_PRIME32_4 = 0x27D4EB2FU;   /*!< 0b00100111110101001110101100101111 */
+static const xxh_u32 XXH_PRIME32_5 = 0x165667B1U;   /*!< 0b00010110010101100110011110110001 */
 
 #ifdef XXH_OLD_NAMES
 #  define PRIME32_1 XXH_PRIME32_1
@@ -1614,6 +1718,17 @@ static const xxh_u32 XXH_PRIME32_5 = 0x165667B1U;   /* 0b00010110010101100110011
 #  define PRIME32_5 XXH_PRIME32_5
 #endif
 
+/*!
+ * @internal
+ * @brief Normal stripe processing routine.
+ *
+ * This shuffles the bits so that any bit from @p input impacts several bits in
+ * @p acc.
+ *
+ * @param acc The accumulator lane.
+ * @param input The stripe of input to mix.
+ * @return The mixed accumulator lane.
+ */
 static xxh_u32 XXH32_round(xxh_u32 acc, xxh_u32 input)
 {
     acc += input * XXH_PRIME32_2;
@@ -1670,7 +1785,16 @@ static xxh_u32 XXH32_round(xxh_u32 acc, xxh_u32 input)
     return acc;
 }
 
-/* mix all bits */
+/*!
+ * @internal
+ * @brief Mixes all bits to finalize the hash.
+ *
+ * The final mix ensures that all input bits have a chance to impact any bit in
+ * the output digest, resulting in an unbiased distribution.
+ *
+ * @param h32 The hash to avalanche.
+ * @return The avalanched hash.
+ */
 static xxh_u32 XXH32_avalanche(xxh_u32 h32)
 {
     h32 ^= h32 >> 15;
@@ -1683,6 +1807,20 @@ static xxh_u32 XXH32_avalanche(xxh_u32 h32)
 
 #define XXH_get32bits(p) XXH_readLE32_align(p, align)
 
+/*!
+ * @internal
+ * @brief Processes the last 0-15 bytes of @p ptr.
+ *
+ * There may be up to 15 bytes remaining to consume from the input.
+ * This final stage will digest them to ensure that all input bytes are present
+ * in the final mix.
+ *
+ * @param h32 The hash to finalize.
+ * @param ptr The pointer to the remaining input.
+ * @param len The remaining length, modulo 16.
+ * @param align Whether @p ptr is aligned.
+ * @return The finalized hash.
+ */
 static xxh_u32
 XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
 {
@@ -1762,6 +1900,14 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
 #  undef XXH_PROCESS4
 #endif
 
+/*!
+ * @internal
+ * @brief The implementation for @ref XXH32().
+ *
+ * @param input, len, seed Directly passed from @ref XXH32().
+ * @param align Whether @p input is aligned.
+ * @return The calculated hash.
+ */
 XXH_FORCE_INLINE xxh_u32
 XXH32_endian_align(const xxh_u8* input, size_t len, xxh_u32 seed, XXH_alignment align)
 {
@@ -1800,9 +1946,7 @@ XXH32_endian_align(const xxh_u8* input, size_t len, xxh_u32 seed, XXH_alignment
     return XXH32_finalize(h32, input, len&15, align);
 }
 
-/*!
- * @ingroup xxh32_family
- */
+/*! @ingroup xxh32_family */
 XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t len, XXH32_hash_t seed)
 {
 #if 0

From f9a4ab51230c01624062e89d660b0ddda9db32eb Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Fri, 16 Oct 2020 15:47:45 -0400
Subject: [PATCH 034/187] Add missing XXH3 implementation group

---
 xxhash.h | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index a58c7c22..5e65fea1 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -328,7 +328,7 @@ typedef uint32_t XXH32_hash_t;
  *   systems, and offers true 64/128 bit hash results. It provides a superior
  *   level of dispersion, and greatly reduces the risks of collisions.
  *
- * @see @ref xxh64_family, @ref xxh3_family: Other xxHash families
+ * @see @ref xxh64_family, @ref xxh3_family : Other xxHash families
  * @see @ref xxh32_impl for implementation details
  * @{
  */
@@ -2671,6 +2671,12 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 *  XXH3
 *  New generation hash designed for speed on small keys and vectorization
 ************************************************************************ */
+/*!
+ * @}
+ * @defgroup xxh3_impl XXH3 implementation
+ * @ingroup impl
+ * @{
+ */
 
 /* ===   Compiler specifics   === */
 

From acc692c9c768e428532327fa1be47101c388796c Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Fri, 16 Oct 2020 16:08:25 -0400
Subject: [PATCH 035/187] Clean up Doxyfile

Rely on the defaults.
Short, clean, and concise.
---
 Doxyfile | 2599 +-----------------------------------------------------
 1 file changed, 32 insertions(+), 2567 deletions(-)

diff --git a/Doxyfile b/Doxyfile
index 373986a5..1ce66981 100644
--- a/Doxyfile
+++ b/Doxyfile
@@ -1,2589 +1,54 @@
-# Doxyfile 1.8.20
-
-# This file describes the settings to be used by the documentation system
-# doxygen (www.doxygen.org) for a project.
-#
-# All text after a double hash (##) is considered a comment and is placed in
-# front of the TAG it is preceding.
-#
-# All text after a single hash (#) is considered a comment and will be ignored.
-# The format is:
-# TAG = value [value, ...]
-# For lists, items can also be appended using:
-# TAG += value [value, ...]
-# Values that contain spaces should be placed between quotes (\" \").
-
-#---------------------------------------------------------------------------
-# Project related configuration options
-#---------------------------------------------------------------------------
-
-# This tag specifies the encoding used for all characters in the configuration
-# file that follow. The default is UTF-8 which is also the encoding used for all
-# text before the first occurrence of this tag. Doxygen uses libiconv (or the
-# iconv built into libc) for the transcoding. See
-# https://www.gnu.org/software/libiconv/ for the list of possible encodings.
-# The default value is: UTF-8.
-
+# Doxygen config for xxHash
 DOXYFILE_ENCODING      = UTF-8
 
-# The PROJECT_NAME tag is a single word (or a sequence of words surrounded by
-# double-quotes, unless you are using Doxywizard) that should identify the
-# project for which the documentation is generated. This name is used in the
-# title of most generated pages and in a few other places.
-# The default value is: My Project.
-
 PROJECT_NAME           = "xxHash"
-
-# The PROJECT_NUMBER tag can be used to enter a project or revision number. This
-# could be handy for archiving the generated documentation or if some version
-# control system is used.
-
 PROJECT_NUMBER         = "0.8.0"
-
-# Using the PROJECT_BRIEF tag one can provide an optional one line description
-# for a project that appears at the top of each page and should give viewer a
-# quick idea about the purpose of the project. Keep the description short.
-
 PROJECT_BRIEF          = "Extremely fast non-cryptographic hash function"
-
-# With the PROJECT_LOGO tag one can specify a logo or an icon that is included
-# in the documentation. The maximum height of the logo should not exceed 55
-# pixels and the maximum width should not exceed 200 pixels. Doxygen will copy
-# the logo to the output directory.
-
-PROJECT_LOGO           =
-
-# The OUTPUT_DIRECTORY tag is used to specify the (relative or absolute) path
-# into which the generated documentation will be written. If a relative path is
-# entered, it will be relative to the location where doxygen was started. If
-# left blank the current directory will be used.
-
 OUTPUT_DIRECTORY       = doxygen
-
-# If the CREATE_SUBDIRS tag is set to YES then doxygen will create 4096 sub-
-# directories (in 2 levels) under the output directory of each output format and
-# will distribute the generated files over these directories. Enabling this
-# option can be useful when feeding doxygen a huge amount of source files, where
-# putting all generated files in the same directory would otherwise causes
-# performance problems for the file system.
-# The default value is: NO.
-
-CREATE_SUBDIRS         = NO
-
-# If the ALLOW_UNICODE_NAMES tag is set to YES, doxygen will allow non-ASCII
-# characters to appear in the names of generated files. If set to NO, non-ASCII
-# characters will be escaped, for example _xE3_x81_x84 will be used for Unicode
-# U+3044.
-# The default value is: NO.
-
-ALLOW_UNICODE_NAMES    = NO
-
-# The OUTPUT_LANGUAGE tag is used to specify the language in which all
-# documentation generated by doxygen is written. Doxygen will use this
-# information to generate all constant output in the proper language.
-# Possible values are: Afrikaans, Arabic, Armenian, Brazilian, Catalan, Chinese,
-# Chinese-Traditional, Croatian, Czech, Danish, Dutch, English (United States),
-# Esperanto, Farsi (Persian), Finnish, French, German, Greek, Hungarian,
-# Indonesian, Italian, Japanese, Japanese-en (Japanese with English messages),
-# Korean, Korean-en (Korean with English messages), Latvian, Lithuanian,
-# Macedonian, Norwegian, Persian (Farsi), Polish, Portuguese, Romanian, Russian,
-# Serbian, Serbian-Cyrillic, Slovak, Slovene, Spanish, Swedish, Turkish,
-# Ukrainian and Vietnamese.
-# The default value is: English.
-
 OUTPUT_LANGUAGE        = English
 
-# The OUTPUT_TEXT_DIRECTION tag is used to specify the direction in which all
-# documentation generated by doxygen is written. Doxygen will use this
-# information to generate all generated output in the proper direction.
-# Possible values are: None, LTR, RTL and Context.
-# The default value is: None.
-
-OUTPUT_TEXT_DIRECTION  = None
-
-# If the BRIEF_MEMBER_DESC tag is set to YES, doxygen will include brief member
-# descriptions after the members that are listed in the file and class
-# documentation (similar to Javadoc). Set to NO to disable this.
-# The default value is: YES.
-
-BRIEF_MEMBER_DESC      = YES
-
-# If the REPEAT_BRIEF tag is set to YES, doxygen will prepend the brief
-# description of a member or function before the detailed description
-#
-# Note: If both HIDE_UNDOC_MEMBERS and BRIEF_MEMBER_DESC are set to NO, the
-# brief descriptions will be completely suppressed.
-# The default value is: YES.
-
-REPEAT_BRIEF           = YES
-
-# This tag implements a quasi-intelligent brief description abbreviator that is
-# used to form the text in various listings. Each string in this list, if found
-# as the leading text of the brief description, will be stripped from the text
-# and the result, after processing the whole list, is used as the annotated
-# text. Otherwise, the brief description is used as-is. If left blank, the
-# following values are used ($name is automatically replaced with the name of
-# the entity):The $name class, The $name widget, The $name file, is, provides,
-# specifies, contains, represents, a, an and the.
-
-ABBREVIATE_BRIEF       = "The $name class" \
-                         "The $name widget" \
-                         "The $name file" \
-                         is \
-                         provides \
-                         specifies \
-                         contains \
-                         represents \
-                         a \
-                         an \
-                         the
-
-# If the ALWAYS_DETAILED_SEC and REPEAT_BRIEF tags are both set to YES then
-# doxygen will generate a detailed section even if there is only a brief
-# description.
-# The default value is: NO.
-
-ALWAYS_DETAILED_SEC    = NO
-
-# If the INLINE_INHERITED_MEMB tag is set to YES, doxygen will show all
-# inherited members of a class in the documentation of that class as if those
-# members were ordinary class members. Constructors, destructors and assignment
-# operators of the base classes will not be shown.
-# The default value is: NO.
-
-INLINE_INHERITED_MEMB  = NO
-
-# If the FULL_PATH_NAMES tag is set to YES, doxygen will prepend the full path
-# before files name in the file list and in the header files. If set to NO the
-# shortest path that makes the file name unique will be used
-# The default value is: YES.
-
-FULL_PATH_NAMES        = YES
-
-# The STRIP_FROM_PATH tag can be used to strip a user-defined part of the path.
-# Stripping is only done if one of the specified strings matches the left-hand
-# part of the path. The tag can be used to show relative paths in the file list.
-# If left blank the directory from which doxygen is run is used as the path to
-# strip.
-#
-# Note that you can specify absolute paths here, but also relative paths, which
-# will be relative from the directory where doxygen is started.
-# This tag requires that the tag FULL_PATH_NAMES is set to YES.
-
-STRIP_FROM_PATH        =
-
-# The STRIP_FROM_INC_PATH tag can be used to strip a user-defined part of the
-# path mentioned in the documentation of a class, which tells the reader which
-# header file to include in order to use a class. If left blank only the name of
-# the header file containing the class definition is used. Otherwise one should
-# specify the list of include paths that are normally passed to the compiler
-# using the -I flag.
-
-STRIP_FROM_INC_PATH    =
-
-# If the SHORT_NAMES tag is set to YES, doxygen will generate much shorter (but
-# less readable) file names. This can be useful is your file systems doesn't
-# support long names like on DOS, Mac, or CD-ROM.
-# The default value is: NO.
-
-SHORT_NAMES            = NO
-
-# If the JAVADOC_AUTOBRIEF tag is set to YES then doxygen will interpret the
-# first line (until the first dot) of a Javadoc-style comment as the brief
-# description. If set to NO, the Javadoc-style will behave just like regular Qt-
-# style comments (thus requiring an explicit @brief command for a brief
-# description.)
-# The default value is: NO.
-
-JAVADOC_AUTOBRIEF      = NO
-
-# If the JAVADOC_BANNER tag is set to YES then doxygen will interpret a line
-# such as
-# /***************
-# as being the beginning of a Javadoc-style comment "banner". If set to NO, the
-# Javadoc-style will behave just like regular comments and it will not be
-# interpreted by doxygen.
-# The default value is: NO.
-
-JAVADOC_BANNER         = NO
-
-# If the QT_AUTOBRIEF tag is set to YES then doxygen will interpret the first
-# line (until the first dot) of a Qt-style comment as the brief description. If
-# set to NO, the Qt-style will behave just like regular Qt-style comments (thus
-# requiring an explicit \brief command for a brief description.)
-# The default value is: NO.
-
-QT_AUTOBRIEF           = NO
-
-# The MULTILINE_CPP_IS_BRIEF tag can be set to YES to make doxygen treat a
-# multi-line C++ special comment block (i.e. a block of //! or /// comments) as
-# a brief description. This used to be the default behavior. The new default is
-# to treat a multi-line C++ comment block as a detailed description. Set this
-# tag to YES if you prefer the old behavior instead.
-#
-# Note that setting this tag to YES also means that rational rose comments are
-# not recognized any more.
-# The default value is: NO.
-
-MULTILINE_CPP_IS_BRIEF = NO
-
-# By default Python docstrings are displayed as preformatted text and doxygen's
-# special commands cannot be used. By setting PYTHON_DOCSTRING to NO the
-# doxygen's special commands can be used and the contents of the docstring
-# documentation blocks is shown as doxygen documentation.
-# The default value is: YES.
-
-PYTHON_DOCSTRING       = YES
-
-# If the INHERIT_DOCS tag is set to YES then an undocumented member inherits the
-# documentation from any documented member that it re-implements.
-# The default value is: YES.
-
-INHERIT_DOCS           = YES
-
-# If the SEPARATE_MEMBER_PAGES tag is set to YES then doxygen will produce a new
-# page for each member. If set to NO, the documentation of a member will be part
-# of the file/class/namespace that contains it.
-# The default value is: NO.
-
-SEPARATE_MEMBER_PAGES  = NO
-
-# The TAB_SIZE tag can be used to set the number of spaces in a tab. Doxygen
-# uses this value to replace tabs by spaces in code fragments.
-# Minimum value: 1, maximum value: 16, default value: 4.
-
-TAB_SIZE               = 4
-
-# This tag can be used to specify a number of aliases that act as commands in
-# the documentation. An alias has the form:
-# name=value
-# For example adding
-# "sideeffect=@par Side Effects:\n"
-# will allow you to put the command \sideeffect (or @sideeffect) in the
-# documentation, which will result in a user-defined paragraph with heading
-# "Side Effects:". You can put \n's in the value part of an alias to insert
-# newlines (in the resulting output). You can put ^^ in the value part of an
-# alias to insert a newline as if a physical newline was in the original file.
-# When you need a literal { or } or , in the value part of an alias you have to
-# escape them by means of a backslash (\), this can lead to conflicts with the
-# commands \{ and \} for these it is advised to use the version @{ and @} or use
-# a double escape (\\{ and \\})
-
-ALIASES                =
-
-# Set the OPTIMIZE_OUTPUT_FOR_C tag to YES if your project consists of C sources
-# only. Doxygen will then generate output that is more tailored for C. For
-# instance, some of the names that are used will be different. The list of all
-# members will be omitted, etc.
-# The default value is: NO.
-
-OPTIMIZE_OUTPUT_FOR_C  = YES
-
-# Set the OPTIMIZE_OUTPUT_JAVA tag to YES if your project consists of Java or
-# Python sources only. Doxygen will then generate output that is more tailored
-# for that language. For instance, namespaces will be presented as packages,
-# qualified scopes will look different, etc.
-# The default value is: NO.
-
-OPTIMIZE_OUTPUT_JAVA   = NO
-
-# Set the OPTIMIZE_FOR_FORTRAN tag to YES if your project consists of Fortran
-# sources. Doxygen will then generate output that is tailored for Fortran.
-# The default value is: NO.
-
-OPTIMIZE_FOR_FORTRAN   = NO
-
-# Set the OPTIMIZE_OUTPUT_VHDL tag to YES if your project consists of VHDL
-# sources. Doxygen will then generate output that is tailored for VHDL.
-# The default value is: NO.
-
-OPTIMIZE_OUTPUT_VHDL   = NO
-
-# Set the OPTIMIZE_OUTPUT_SLICE tag to YES if your project consists of Slice
-# sources only. Doxygen will then generate output that is more tailored for that
-# language. For instance, namespaces will be presented as modules, types will be
-# separated into more groups, etc.
-# The default value is: NO.
-
-OPTIMIZE_OUTPUT_SLICE  = NO
-
-# Doxygen selects the parser to use depending on the extension of the files it
-# parses. With this tag you can assign which parser to use for a given
-# extension. Doxygen has a built-in mapping, but you can override or extend it
-# using this tag. The format is ext=language, where ext is a file extension, and
-# language is one of the parsers supported by doxygen: IDL, Java, JavaScript,
-# Csharp (C#), C, C++, D, PHP, md (Markdown), Objective-C, Python, Slice, VHDL,
-# Fortran (fixed format Fortran: FortranFixed, free formatted Fortran:
-# FortranFree, unknown formatted Fortran: Fortran. In the later case the parser
-# tries to guess whether the code is fixed or free formatted code, this is the
-# default for Fortran type files). For instance to make doxygen treat .inc files
-# as Fortran files (default is PHP), and .f files as C (default is Fortran),
-# use: inc=Fortran f=C.
-#
-# Note: For files without extension you can use no_extension as a placeholder.
-#
-# Note that for custom extensions you also need to set FILE_PATTERNS otherwise
-# the files are not read by doxygen.
-
-EXTENSION_MAPPING      =
-
-# If the MARKDOWN_SUPPORT tag is enabled then doxygen pre-processes all comments
-# according to the Markdown format, which allows for more readable
-# documentation. See https://daringfireball.net/projects/markdown/ for details.
-# The output of markdown processing is further processed by doxygen, so you can
-# mix doxygen, HTML, and XML commands with Markdown formatting. Disable only in
-# case of backward compatibilities issues.
-# The default value is: YES.
-
-MARKDOWN_SUPPORT       = YES
-
-# When the TOC_INCLUDE_HEADINGS tag is set to a non-zero value, all headings up
-# to that level are automatically included in the table of contents, even if
-# they do not have an id attribute.
-# Note: This feature currently applies only to Markdown headings.
-# Minimum value: 0, maximum value: 99, default value: 5.
-# This tag requires that the tag MARKDOWN_SUPPORT is set to YES.
-
-TOC_INCLUDE_HEADINGS   = 5
-
-# When enabled doxygen tries to link words that correspond to documented
-# classes, or namespaces to their corresponding documentation. Such a link can
-# be prevented in individual cases by putting a % sign in front of the word or
-# globally by setting AUTOLINK_SUPPORT to NO.
-# The default value is: YES.
-
-AUTOLINK_SUPPORT       = YES
-
-# If you use STL classes (i.e. std::string, std::vector, etc.) but do not want
-# to include (a tag file for) the STL sources as input, then you should set this
-# tag to YES in order to let doxygen match functions declarations and
-# definitions whose arguments contain STL classes (e.g. func(std::string);
-# versus func(std::string) {}). This also make the inheritance and collaboration
-# diagrams that involve STL classes more complete and accurate.
-# The default value is: NO.
-
-BUILTIN_STL_SUPPORT    = NO
-
-# If you use Microsoft's C++/CLI language, you should set this option to YES to
-# enable parsing support.
-# The default value is: NO.
-
-CPP_CLI_SUPPORT        = NO
-
-# Set the SIP_SUPPORT tag to YES if your project consists of sip (see:
-# https://www.riverbankcomputing.com/software/sip/intro) sources only. Doxygen
-# will parse them like normal C++ but will assume all classes use public instead
-# of private inheritance when no explicit protection keyword is present.
-# The default value is: NO.
-
-SIP_SUPPORT            = NO
-
-# For Microsoft's IDL there are propget and propput attributes to indicate
-# getter and setter methods for a property. Setting this option to YES will make
-# doxygen to replace the get and set methods by a property in the documentation.
-# This will only work if the methods are indeed getting or setting a simple
-# type. If this is not the case, or you want to show the methods anyway, you
-# should set this option to NO.
-# The default value is: YES.
-
-IDL_PROPERTY_SUPPORT   = YES
-
-# If member grouping is used in the documentation and the DISTRIBUTE_GROUP_DOC
-# tag is set to YES then doxygen will reuse the documentation of the first
-# member in the group (if any) for the other members of the group. By default
-# all members of a group must be documented explicitly.
-# The default value is: NO.
-
-DISTRIBUTE_GROUP_DOC   = NO
-
-# If one adds a struct or class to a group and this option is enabled, then also
-# any nested class or struct is added to the same group. By default this option
-# is disabled and one has to add nested compounds explicitly via \ingroup.
-# The default value is: NO.
-
-GROUP_NESTED_COMPOUNDS = NO
-
-# Set the SUBGROUPING tag to YES to allow class member groups of the same type
-# (for instance a group of public functions) to be put as a subgroup of that
-# type (e.g. under the Public Functions section). Set it to NO to prevent
-# subgrouping. Alternatively, this can be done per class using the
-# \nosubgrouping command.
-# The default value is: YES.
-
-SUBGROUPING            = YES
-
-# When the INLINE_GROUPED_CLASSES tag is set to YES, classes, structs and unions
-# are shown inside the group in which they are included (e.g. using \ingroup)
-# instead of on a separate page (for HTML and Man pages) or section (for LaTeX
-# and RTF).
-#
-# Note that this feature does not work in combination with
-# SEPARATE_MEMBER_PAGES.
-# The default value is: NO.
-
-INLINE_GROUPED_CLASSES = NO
-
-# When the INLINE_SIMPLE_STRUCTS tag is set to YES, structs, classes, and unions
-# with only public data fields or simple typedef fields will be shown inline in
-# the documentation of the scope in which they are defined (i.e. file,
-# namespace, or group documentation), provided this scope is documented. If set
-# to NO, structs, classes, and unions are shown on a separate page (for HTML and
-# Man pages) or section (for LaTeX and RTF).
-# The default value is: NO.
-
-INLINE_SIMPLE_STRUCTS  = NO
-
-# When TYPEDEF_HIDES_STRUCT tag is enabled, a typedef of a struct, union, or
-# enum is documented as struct, union, or enum with the name of the typedef. So
-# typedef struct TypeS {} TypeT, will appear in the documentation as a struct
-# with name TypeT. When disabled the typedef will appear as a member of a file,
-# namespace, or class. And the struct will be named TypeS. This can typically be
-# useful for C code in case the coding convention dictates that all compound
-# types are typedef'ed and only the typedef is referenced, never the tag name.
-# The default value is: NO.
-
-TYPEDEF_HIDES_STRUCT   = NO
-
-# The size of the symbol lookup cache can be set using LOOKUP_CACHE_SIZE. This
-# cache is used to resolve symbols given their name and scope. Since this can be
-# an expensive process and often the same symbol appears multiple times in the
-# code, doxygen keeps a cache of pre-resolved symbols. If the cache is too small
-# doxygen will become slower. If the cache is too large, memory is wasted. The
-# cache size is given by this formula: 2^(16+LOOKUP_CACHE_SIZE). The valid range
-# is 0..9, the default is 0, corresponding to a cache size of 2^16=65536
-# symbols. At the end of a run doxygen will report the cache usage and suggest
-# the optimal cache size from a speed point of view.
-# Minimum value: 0, maximum value: 9, default value: 0.
-
-LOOKUP_CACHE_SIZE      = 0
-
-# The NUM_PROC_THREADS specifies the number threads doxygen is allowed to use
-# during processing. When set to 0 doxygen will based this on the number of
-# cores available in the system. You can set it explicitly to a value larger
-# than 0 to get more control over the balance between CPU load and processing
-# speed. At this moment only the input processing can be done using multiple
-# threads. Since this is still an experimental feature the default is set to 1,
-# which efficively disables parallel processing. Please report any issues you
-# encounter. Generating dot graphs in parallel is controlled by the
-# DOT_NUM_THREADS setting.
-# Minimum value: 0, maximum value: 32, default value: 1.
-
-NUM_PROC_THREADS       = 1
-
-#---------------------------------------------------------------------------
-# Build related configuration options
-#---------------------------------------------------------------------------
-
-# If the EXTRACT_ALL tag is set to YES, doxygen will assume all entities in
-# documentation are documented, even if no documentation was available. Private
-# class members and static file members will be hidden unless the
-# EXTRACT_PRIVATE respectively EXTRACT_STATIC tags are set to YES.
-# Note: This will also disable the warnings about undocumented members that are
-# normally produced when WARNINGS is set to YES.
-# The default value is: NO.
-
-EXTRACT_ALL            = NO
-
-# If the EXTRACT_PRIVATE tag is set to YES, all private members of a class will
-# be included in the documentation.
-# The default value is: NO.
-
-EXTRACT_PRIVATE        = NO
-
-# If the EXTRACT_PRIV_VIRTUAL tag is set to YES, documented private virtual
-# methods of a class will be included in the documentation.
-# The default value is: NO.
-
-EXTRACT_PRIV_VIRTUAL   = NO
-
-# If the EXTRACT_PACKAGE tag is set to YES, all members with package or internal
-# scope will be included in the documentation.
-# The default value is: NO.
-
-EXTRACT_PACKAGE        = NO
-
-# If the EXTRACT_STATIC tag is set to YES, all static members of a file will be
-# included in the documentation.
-# The default value is: NO.
-
-EXTRACT_STATIC         = YES
-
-# If the EXTRACT_LOCAL_CLASSES tag is set to YES, classes (and structs) defined
-# locally in source files will be included in the documentation. If set to NO,
-# only classes defined in header files are included. Does not have any effect
-# for Java sources.
-# The default value is: YES.
-
-EXTRACT_LOCAL_CLASSES  = YES
-
-# This flag is only useful for Objective-C code. If set to YES, local methods,
-# which are defined in the implementation section but not in the interface are
-# included in the documentation. If set to NO, only methods in the interface are
-# included.
-# The default value is: NO.
-
-EXTRACT_LOCAL_METHODS  = NO
-
-# If this flag is set to YES, the members of anonymous namespaces will be
-# extracted and appear in the documentation as a namespace called
-# 'anonymous_namespace{file}', where file will be replaced with the base name of
-# the file that contains the anonymous namespace. By default anonymous namespace
-# are hidden.
-# The default value is: NO.
-
-EXTRACT_ANON_NSPACES   = NO
-
-# If the HIDE_UNDOC_MEMBERS tag is set to YES, doxygen will hide all
-# undocumented members inside documented classes or files. If set to NO these
-# members will be included in the various overviews, but no documentation
-# section is generated. This option has no effect if EXTRACT_ALL is enabled.
-# The default value is: NO.
-
-HIDE_UNDOC_MEMBERS     = NO
-
-# If the HIDE_UNDOC_CLASSES tag is set to YES, doxygen will hide all
-# undocumented classes that are normally visible in the class hierarchy. If set
-# to NO, these classes will be included in the various overviews. This option
-# has no effect if EXTRACT_ALL is enabled.
-# The default value is: NO.
-
-HIDE_UNDOC_CLASSES     = NO
-
-# If the HIDE_FRIEND_COMPOUNDS tag is set to YES, doxygen will hide all friend
-# declarations. If set to NO, these declarations will be included in the
-# documentation.
-# The default value is: NO.
-
-HIDE_FRIEND_COMPOUNDS  = NO
-
-# If the HIDE_IN_BODY_DOCS tag is set to YES, doxygen will hide any
-# documentation blocks found inside the body of a function. If set to NO, these
-# blocks will be appended to the function's detailed documentation block.
-# The default value is: NO.
-
-HIDE_IN_BODY_DOCS      = NO
-
-# The INTERNAL_DOCS tag determines if documentation that is typed after a
-# \internal command is included. If the tag is set to NO then the documentation
-# will be excluded. Set it to YES to include the internal documentation.
-# The default value is: NO.
-
+# We already separate the internal docs.
 INTERNAL_DOCS          = YES
-
-# If the CASE_SENSE_NAMES tag is set to NO then doxygen will only generate file
-# names in lower-case letters. If set to YES, upper-case letters are also
-# allowed. This is useful if you have classes or files whose names only differ
-# in case and if your file system supports case sensitive file names. Windows
-# (including Cygwin) and Mac users are advised to set this option to NO.
-# The default value is: system dependent.
-
-CASE_SENSE_NAMES       = YES
-
-# If the HIDE_SCOPE_NAMES tag is set to NO then doxygen will show members with
-# their full class and namespace scopes in the documentation. If set to YES, the
-# scope will be hidden.
-# The default value is: NO.
-
-HIDE_SCOPE_NAMES       = NO
-
-# If the HIDE_COMPOUND_REFERENCE tag is set to NO (default) then doxygen will
-# append additional text to a page's title, such as Class Reference. If set to
-# YES the compound reference will be hidden.
-# The default value is: NO.
-
-HIDE_COMPOUND_REFERENCE= NO
-
-# If the SHOW_INCLUDE_FILES tag is set to YES then doxygen will put a list of
-# the files that are included by a file in the documentation of that file.
-# The default value is: YES.
-
-SHOW_INCLUDE_FILES     = YES
-
-# If the SHOW_GROUPED_MEMB_INC tag is set to YES then Doxygen will add for each
-# grouped member an include statement to the documentation, telling the reader
-# which file to include in order to use the member.
-# The default value is: NO.
-
-SHOW_GROUPED_MEMB_INC  = NO
-
-# If the FORCE_LOCAL_INCLUDES tag is set to YES then doxygen will list include
-# files with double quotes in the documentation rather than with sharp brackets.
-# The default value is: NO.
-
-FORCE_LOCAL_INCLUDES   = NO
-
-# If the INLINE_INFO tag is set to YES then a tag [inline] is inserted in the
-# documentation for inline members.
-# The default value is: YES.
-
-INLINE_INFO            = YES
-
-# If the SORT_MEMBER_DOCS tag is set to YES then doxygen will sort the
-# (detailed) documentation of file and class members alphabetically by member
-# name. If set to NO, the members will appear in declaration order.
-# The default value is: YES.
-
+# Consistency
 SORT_MEMBER_DOCS       = NO
+BRIEF_MEMBER_DESC      = YES
+REPEAT_BRIEF           = YES
 
-# If the SORT_BRIEF_DOCS tag is set to YES then doxygen will sort the brief
-# descriptions of file, namespace and class members alphabetically by member
-# name. If set to NO, the members will appear in declaration order. Note that
-# this will also influence the order of the classes in the class list.
-# The default value is: NO.
-
-SORT_BRIEF_DOCS        = NO
-
-# If the SORT_MEMBERS_CTORS_1ST tag is set to YES then doxygen will sort the
-# (brief and detailed) documentation of class members so that constructors and
-# destructors are listed first. If set to NO the constructors will appear in the
-# respective orders defined by SORT_BRIEF_DOCS and SORT_MEMBER_DOCS.
-# Note: If SORT_BRIEF_DOCS is set to NO this option is ignored for sorting brief
-# member documentation.
-# Note: If SORT_MEMBER_DOCS is set to NO this option is ignored for sorting
-# detailed member documentation.
-# The default value is: NO.
-
-SORT_MEMBERS_CTORS_1ST = NO
-
-# If the SORT_GROUP_NAMES tag is set to YES then doxygen will sort the hierarchy
-# of group names into alphabetical order. If set to NO the group names will
-# appear in their defined order.
-# The default value is: NO.
-
-SORT_GROUP_NAMES       = NO
-
-# If the SORT_BY_SCOPE_NAME tag is set to YES, the class list will be sorted by
-# fully-qualified names, including namespaces. If set to NO, the class list will
-# be sorted only by class name, not including the namespace part.
-# Note: This option is not very useful if HIDE_SCOPE_NAMES is set to YES.
-# Note: This option applies only to the class list, not to the alphabetical
-# list.
-# The default value is: NO.
-
-SORT_BY_SCOPE_NAME     = NO
-
-# If the STRICT_PROTO_MATCHING option is enabled and doxygen fails to do proper
-# type resolution of all parameters of a function it will reject a match between
-# the prototype and the implementation of a member function even if there is
-# only one candidate or it is obvious which candidate to choose by doing a
-# simple string match. By disabling STRICT_PROTO_MATCHING doxygen will still
-# accept a match between prototype and implementation in such cases.
-# The default value is: NO.
-
-STRICT_PROTO_MATCHING  = NO
-
-# The GENERATE_TODOLIST tag can be used to enable (YES) or disable (NO) the todo
-# list. This list is created by putting \todo commands in the documentation.
-# The default value is: YES.
-
-GENERATE_TODOLIST      = YES
-
-# The GENERATE_TESTLIST tag can be used to enable (YES) or disable (NO) the test
-# list. This list is created by putting \test commands in the documentation.
-# The default value is: YES.
-
-GENERATE_TESTLIST      = YES
-
-# The GENERATE_BUGLIST tag can be used to enable (YES) or disable (NO) the bug
-# list. This list is created by putting \bug commands in the documentation.
-# The default value is: YES.
-
-GENERATE_BUGLIST       = YES
-
-# The GENERATE_DEPRECATEDLIST tag can be used to enable (YES) or disable (NO)
-# the deprecated list. This list is created by putting \deprecated commands in
-# the documentation.
-# The default value is: YES.
-
-GENERATE_DEPRECATEDLIST= YES
-
-# The ENABLED_SECTIONS tag can be used to enable conditional documentation
-# sections, marked by \if <section_label> ... \endif and \cond <section_label>
-# ... \endcond blocks.
-
-ENABLED_SECTIONS       =
-
-# The MAX_INITIALIZER_LINES tag determines the maximum number of lines that the
-# initial value of a variable or macro / define can have for it to appear in the
-# documentation. If the initializer consists of more lines than specified here
-# it will be hidden. Use a value of 0 to hide initializers completely. The
-# appearance of the value of individual variables and macros / defines can be
-# controlled using \showinitializer or \hideinitializer command in the
-# documentation regardless of this setting.
-# Minimum value: 0, maximum value: 10000, default value: 30.
-
-MAX_INITIALIZER_LINES  = 30
-
-# Set the SHOW_USED_FILES tag to NO to disable the list of files generated at
-# the bottom of the documentation of classes and structs. If set to YES, the
-# list will mention the files that were used to generate the documentation.
-# The default value is: YES.
-
-SHOW_USED_FILES        = YES
-
-# Set the SHOW_FILES tag to NO to disable the generation of the Files page. This
-# will remove the Files entry from the Quick Index and from the Folder Tree View
-# (if specified).
-# The default value is: YES.
-
-SHOW_FILES             = YES
-
-# Set the SHOW_NAMESPACES tag to NO to disable the generation of the Namespaces
-# page. This will remove the Namespaces entry from the Quick Index and from the
-# Folder Tree View (if specified).
-# The default value is: YES.
-
-SHOW_NAMESPACES        = YES
-
-# The FILE_VERSION_FILTER tag can be used to specify a program or script that
-# doxygen should invoke to get the current version for each file (typically from
-# the version control system). Doxygen will invoke the program by executing (via
-# popen()) the command command input-file, where command is the value of the
-# FILE_VERSION_FILTER tag, and input-file is the name of an input file provided
-# by doxygen. Whatever the program writes to standard output is used as the file
-# version. For an example see the documentation.
-
-FILE_VERSION_FILTER    =
-
-# The LAYOUT_FILE tag can be used to specify a layout file which will be parsed
-# by doxygen. The layout file controls the global structure of the generated
-# output files in an output format independent way. To create the layout file
-# that represents doxygen's defaults, run doxygen with the -l option. You can
-# optionally specify a file name after the option, if omitted DoxygenLayout.xml
-# will be used as the name of the layout file.
-#
-# Note that if you run doxygen from a directory containing a file called
-# DoxygenLayout.xml, doxygen will parse it automatically even if the LAYOUT_FILE
-# tag is left empty.
-
-LAYOUT_FILE            =
-
-# The CITE_BIB_FILES tag can be used to specify one or more bib files containing
-# the reference definitions. This must be a list of .bib files. The .bib
-# extension is automatically appended if omitted. This requires the bibtex tool
-# to be installed. See also https://en.wikipedia.org/wiki/BibTeX for more info.
-# For LaTeX the style of the bibliography can be controlled using
-# LATEX_BIB_STYLE. To use this feature you need bibtex and perl available in the
-# search path. See also \cite for info how to create references.
-
-CITE_BIB_FILES         =
-
-#---------------------------------------------------------------------------
-# Configuration options related to warning and progress messages
-#---------------------------------------------------------------------------
-
-# The QUIET tag can be used to turn on/off the messages that are generated to
-# standard output by doxygen. If QUIET is set to YES this implies that the
-# messages are off.
-# The default value is: NO.
-
+# Warnings
 QUIET                  = YES
-
-# The WARNINGS tag can be used to turn on/off the warning messages that are
-# generated to standard error (stderr) by doxygen. If WARNINGS is set to YES
-# this implies that the warnings are on.
-#
-# Tip: Turn warnings on while writing the documentation.
-# The default value is: YES.
-
-WARNINGS               = YES
-
-# If the WARN_IF_UNDOCUMENTED tag is set to YES then doxygen will generate
-# warnings for undocumented members. If EXTRACT_ALL is set to YES then this flag
-# will automatically be disabled.
-# The default value is: YES.
-
+# Until we document everything
 WARN_IF_UNDOCUMENTED   = NO
 
-# If the WARN_IF_DOC_ERROR tag is set to YES, doxygen will generate warnings for
-# potential errors in the documentation, such as not documenting some parameters
-# in a documented function, or documenting parameters that don't exist or using
-# markup commands wrongly.
-# The default value is: YES.
-
-WARN_IF_DOC_ERROR      = YES
-
-# This WARN_NO_PARAMDOC option can be enabled to get warnings for functions that
-# are documented, but have no documentation for their parameters or return
-# value. If set to NO, doxygen will only warn about wrong or incomplete
-# parameter documentation, but not about the absence of documentation. If
-# EXTRACT_ALL is set to YES then this flag will automatically be disabled.
-# The default value is: NO.
-
-WARN_NO_PARAMDOC       = NO
-
-# If the WARN_AS_ERROR tag is set to YES then doxygen will immediately stop when
-# a warning is encountered.
-# The default value is: NO.
-
-WARN_AS_ERROR          = NO
-
-# The WARN_FORMAT tag determines the format of the warning messages that doxygen
-# can produce. The string should contain the $file, $line, and $text tags, which
-# will be replaced by the file and line number from which the warning originated
-# and the warning text. Optionally the format may contain $version, which will
-# be replaced by the version of the file (if it could be obtained via
-# FILE_VERSION_FILTER)
-# The default value is: $file:$line: $text.
-
-WARN_FORMAT            = "$file:$line: $text"
-
-# The WARN_LOGFILE tag can be used to specify a file to which warning and error
-# messages should be written. If left blank the output is written to standard
-# error (stderr).
-
-WARN_LOGFILE           =
-
-#---------------------------------------------------------------------------
-# Configuration options related to the input files
-#---------------------------------------------------------------------------
-
-# The INPUT tag is used to specify the files and/or directories that contain
-# documented source files. You may enter file names like myfile.cpp or
-# directories like /usr/src/myproject. Separate the files or directories with
-# spaces. See also FILE_PATTERNS and EXTENSION_MAPPING
-# Note: If this tag is empty the current directory is searched.
-
-INPUT                  =
-
-# This tag can be used to specify the character encoding of the source files
-# that doxygen parses. Internally doxygen uses the UTF-8 encoding. Doxygen uses
-# libiconv (or the iconv built into libc) for the transcoding. See the libiconv
-# documentation (see: https://www.gnu.org/software/libiconv/) for the list of
-# possible encodings.
-# The default value is: UTF-8.
-
+# TODO: Add the other files. It is just xxhash.h for now.
+FILE_PATTERNS          = xxhash.h
+# Note: xxHash's source files are technically ASCII only.
 INPUT_ENCODING         = UTF-8
+TAB_SIZE               = 4
 
-# If the value of the INPUT tag contains directories, you can use the
-# FILE_PATTERNS tag to specify one or more wildcard patterns (like *.cpp and
-# *.h) to filter out the source-files in the directories.
-#
-# Note that for custom extensions or not directly supported extensions you also
-# need to set EXTENSION_MAPPING for the extension otherwise the files are not
-# read by doxygen.
-#
-# If left blank the following patterns are tested:*.c, *.cc, *.cxx, *.cpp,
-# *.c++, *.java, *.ii, *.ixx, *.ipp, *.i++, *.inl, *.idl, *.ddl, *.odl, *.h,
-# *.hh, *.hxx, *.hpp, *.h++, *.cs, *.d, *.php, *.php4, *.php5, *.phtml, *.inc,
-# *.m, *.markdown, *.md, *.mm, *.dox (to be provided as doxygen C comment),
-# *.doc (to be provided as doxygen C comment), *.txt (to be provided as doxygen
-# C comment), *.py, *.pyw, *.f90, *.f95, *.f03, *.f08, *.f18, *.f, *.for, *.vhd,
-# *.vhdl, *.ucf, *.qsf and *.ice.
-
-FILE_PATTERNS = xxhash.h
-#
-#FILE_PATTERNS          = *.c \
-#                         *.cc \
-#                         *.cxx \
-#                         *.cpp \
-#                         *.c++ \
-#                         *.java \
-#                         *.ii \
-#                         *.ixx \
-#                         *.ipp \
-#                         *.i++ \
-#                         *.inl \
-#                         *.idl \
-#                         *.ddl \
-#                         *.odl \
-#                         *.h \
-#                         *.hh \
-#                         *.hxx \
-#                         *.hpp \
-#                         *.h++ \
-#                         *.cs \
-#                         *.d \
-#                         *.php \
-#                         *.php4 \
-#                         *.php5 \
-#                         *.phtml \
-#                         *.inc \
-#                         *.m \
-#                         *.markdown \
-#                         *.md \
-#                         *.mm \
-#                         *.dox \
-#                         *.doc \
-#                         *.txt \
-#                         *.py \
-#                         *.pyw \
-#                         *.f90 \
-#                         *.f95 \
-#                         *.f03 \
-#                         *.f08 \
-#                         *.f18 \
-#                         *.f \
-#                         *.for \
-#                         *.vhd \
-#                         *.vhdl \
-#                         *.ucf \
-#                         *.qsf \
-#                         *.ice
-
-# The RECURSIVE tag can be used to specify whether or not subdirectories should
-# be searched for input files as well.
-# The default value is: NO.
-
-RECURSIVE              = NO
-
-# The EXCLUDE tag can be used to specify files and/or directories that should be
-# excluded from the INPUT source files. This way you can easily exclude a
-# subdirectory from a directory tree whose root is specified with the INPUT tag.
-#
-# Note that relative paths are relative to the directory from which doxygen is
-# run.
-
-EXCLUDE                =
-
-# The EXCLUDE_SYMLINKS tag can be used to select whether or not files or
-# directories that are symbolic links (a Unix file system feature) are excluded
-# from the input.
-# The default value is: NO.
-
-EXCLUDE_SYMLINKS       = NO
-
-# If the value of the INPUT tag contains directories, you can use the
-# EXCLUDE_PATTERNS tag to specify one or more wildcard patterns to exclude
-# certain files from those directories.
-#
-# Note that the wildcards are matched against the file with absolute path, so to
-# exclude all test directories for example use the pattern */test/*
-
-EXCLUDE_PATTERNS       =
-
-# The EXCLUDE_SYMBOLS tag can be used to specify one or more symbol names
-# (namespaces, classes, functions, etc.) that should be excluded from the
-# output. The symbol name can be a fully qualified name, a word, or if the
-# wildcard * is used, a substring. Examples: ANamespace, AClass,
-# AClass::ANamespace, ANamespace::*Test
-#
-# Note that the wildcards are matched against the file with absolute path, so to
-# exclude all test directories use the pattern */test/*
-
-EXCLUDE_SYMBOLS        =
-
-# The EXAMPLE_PATH tag can be used to specify one or more files or directories
-# that contain example code fragments that are included (see the \include
-# command).
-
-EXAMPLE_PATH           =
-
-# If the value of the EXAMPLE_PATH tag contains directories, you can use the
-# EXAMPLE_PATTERNS tag to specify one or more wildcard pattern (like *.cpp and
-# *.h) to filter out the source-files in the directories. If left blank all
-# files are included.
-
-EXAMPLE_PATTERNS       = *
-
-# If the EXAMPLE_RECURSIVE tag is set to YES then subdirectories will be
-# searched for input files to be used with the \include or \dontinclude commands
-# irrespective of the value of the RECURSIVE tag.
-# The default value is: NO.
-
-EXAMPLE_RECURSIVE      = NO
-
-# The IMAGE_PATH tag can be used to specify one or more files or directories
-# that contain images that are to be included in the documentation (see the
-# \image command).
-
-IMAGE_PATH             =
-
-# The INPUT_FILTER tag can be used to specify a program that doxygen should
-# invoke to filter for each input file. Doxygen will invoke the filter program
-# by executing (via popen()) the command:
-#
-# <filter> <input-file>
-#
-# where <filter> is the value of the INPUT_FILTER tag, and <input-file> is the
-# name of an input file. Doxygen will then use the output that the filter
-# program writes to standard output. If FILTER_PATTERNS is specified, this tag
-# will be ignored.
-#
-# Note that the filter must not add or remove lines; it is applied before the
-# code is scanned, but not when the output code is generated. If lines are added
-# or removed, the anchors will not be placed correctly.
-#
-# Note that for custom extensions or not directly supported extensions you also
-# need to set EXTENSION_MAPPING for the extension otherwise the files are not
-# properly processed by doxygen.
-
-INPUT_FILTER           =
-
-# The FILTER_PATTERNS tag can be used to specify filters on a per file pattern
-# basis. Doxygen will compare the file name with each pattern and apply the
-# filter if there is a match. The filters are a list of the form: pattern=filter
-# (like *.cpp=my_cpp_filter). See INPUT_FILTER for further information on how
-# filters are used. If the FILTER_PATTERNS tag is empty or if none of the
-# patterns match the file name, INPUT_FILTER is applied.
-#
-# Note that for custom extensions or not directly supported extensions you also
-# need to set EXTENSION_MAPPING for the extension otherwise the files are not
-# properly processed by doxygen.
-
-FILTER_PATTERNS        =
-
-# If the FILTER_SOURCE_FILES tag is set to YES, the input filter (if set using
-# INPUT_FILTER) will also be used to filter the input files that are used for
-# producing the source files to browse (i.e. when SOURCE_BROWSER is set to YES).
-# The default value is: NO.
-
-FILTER_SOURCE_FILES    = NO
-
-# The FILTER_SOURCE_PATTERNS tag can be used to specify source filters per file
-# pattern. A pattern will override the setting for FILTER_PATTERN (if any) and
-# it is also possible to disable source filtering for a specific pattern using
-# *.ext= (so without naming a filter).
-# This tag requires that the tag FILTER_SOURCE_FILES is set to YES.
-
-FILTER_SOURCE_PATTERNS =
-
-# If the USE_MDFILE_AS_MAINPAGE tag refers to the name of a markdown file that
-# is part of the input, its contents will be placed on the main page
-# (index.html). This can be useful if you have a project on for instance GitHub
-# and want to reuse the introduction page also for the doxygen output.
-
-USE_MDFILE_AS_MAINPAGE =
-
-#---------------------------------------------------------------------------
-# Configuration options related to source browsing
-#---------------------------------------------------------------------------
-
-# If the SOURCE_BROWSER tag is set to YES then a list of source files will be
-# generated. Documented entities will be cross-referenced with these sources.
-#
-# Note: To get rid of all source code in the generated output, make sure that
-# also VERBATIM_HEADERS is set to NO.
-# The default value is: NO.
-
-SOURCE_BROWSER         = NO
-
-# Setting the INLINE_SOURCES tag to YES will include the body of functions,
-# classes and enums directly into the documentation.
-# The default value is: NO.
-
-INLINE_SOURCES         = NO
-
-# Setting the STRIP_CODE_COMMENTS tag to YES will instruct doxygen to hide any
-# special comment blocks from generated source code fragments. Normal C, C++ and
-# Fortran comments will always remain visible.
-# The default value is: YES.
-
-STRIP_CODE_COMMENTS    = YES
-
-# If the REFERENCED_BY_RELATION tag is set to YES then for each documented
-# entity all documented functions referencing it will be listed.
-# The default value is: NO.
-
-REFERENCED_BY_RELATION = NO
-
-# If the REFERENCES_RELATION tag is set to YES then for each documented function
-# all documented entities called/used by that function will be listed.
-# The default value is: NO.
-
-REFERENCES_RELATION    = NO
-
-# If the REFERENCES_LINK_SOURCE tag is set to YES and SOURCE_BROWSER tag is set
-# to YES then the hyperlinks from functions in REFERENCES_RELATION and
-# REFERENCED_BY_RELATION lists will link to the source code. Otherwise they will
-# link to the documentation.
-# The default value is: YES.
-
-REFERENCES_LINK_SOURCE = YES
-
-# If SOURCE_TOOLTIPS is enabled (the default) then hovering a hyperlink in the
-# source code will show a tooltip with additional information such as prototype,
-# brief description and links to the definition and documentation. Since this
-# will make the HTML file larger and loading of large files a bit slower, you
-# can opt to disable this feature.
-# The default value is: YES.
-# This tag requires that the tag SOURCE_BROWSER is set to YES.
-
-SOURCE_TOOLTIPS        = YES
-
-# If the USE_HTAGS tag is set to YES then the references to source code will
-# point to the HTML generated by the htags(1) tool instead of doxygen built-in
-# source browser. The htags tool is part of GNU's global source tagging system
-# (see https://www.gnu.org/software/global/global.html). You will need version
-# 4.8.6 or higher.
-#
-# To use it do the following:
-# - Install the latest version of global
-# - Enable SOURCE_BROWSER and USE_HTAGS in the configuration file
-# - Make sure the INPUT points to the root of the source tree
-# - Run doxygen as normal
-#
-# Doxygen will invoke htags (and that will in turn invoke gtags), so these
-# tools must be available from the command line (i.e. in the search path).
-#
-# The result: instead of the source browser generated by doxygen, the links to
-# source code will now point to the output of htags.
-# The default value is: NO.
-# This tag requires that the tag SOURCE_BROWSER is set to YES.
-
-USE_HTAGS              = NO
-
-# If the VERBATIM_HEADERS tag is set the YES then doxygen will generate a
-# verbatim copy of the header file for each class for which an include is
-# specified. Set to NO to disable this.
-# See also: Section \class.
-# The default value is: YES.
-
-VERBATIM_HEADERS       = YES
-
-#---------------------------------------------------------------------------
-# Configuration options related to the alphabetical class index
-#---------------------------------------------------------------------------
-
-# If the ALPHABETICAL_INDEX tag is set to YES, an alphabetical index of all
-# compounds will be generated. Enable this if the project contains a lot of
-# classes, structs, unions or interfaces.
-# The default value is: YES.
-
-ALPHABETICAL_INDEX     = YES
-
-# The COLS_IN_ALPHA_INDEX tag can be used to specify the number of columns in
-# which the alphabetical index list will be split.
-# Minimum value: 1, maximum value: 20, default value: 5.
-# This tag requires that the tag ALPHABETICAL_INDEX is set to YES.
-
-COLS_IN_ALPHA_INDEX    = 5
-
-# In case all classes in a project start with a common prefix, all classes will
-# be put under the same header in the alphabetical index. The IGNORE_PREFIX tag
-# can be used to specify a prefix (or a list of prefixes) that should be ignored
-# while generating the index headers.
-# This tag requires that the tag ALPHABETICAL_INDEX is set to YES.
-
-IGNORE_PREFIX          =
-
-#---------------------------------------------------------------------------
-# Configuration options related to the HTML output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_HTML tag is set to YES, doxygen will generate HTML output
-# The default value is: YES.
+# xxHash is a C library
+OPTIMIZE_OUTPUT_FOR_C  = YES
+# So we can document the internals
+EXTRACT_STATIC         = YES
+# Document the macros
+MACRO_EXPANSION        = YES
+EXPAND_ONLY_PREDEF     = YES
+# Predefine some macros to clean up the output.
+PREDEFINED             = "XXH_DOXYGEN=" \
+                         "XXH_PUBLIC_API=" \
+                         "XXH_FORCE_INLINE=static inline" \
+                         "XXH_NO_INLINE=static" \
+                         "XXH_RESTRICT=restrict" \
+                         "XSUM_API=" \
+                         "XXH_STATIC_LINKING_ONLY" \
+                         "XXH_IMPLEMENTATION" \
+                         "XXH_ALIGN(N)=alignas(N)" \
+                         "XXH_ALIGN_MEMBER(align,type)=alignas(align) type"
 
+# We want HTML docs
 GENERATE_HTML          = YES
-
-# The HTML_OUTPUT tag is used to specify where the HTML docs will be put. If a
-# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
-# it.
-# The default directory is: html.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
 HTML_OUTPUT            = html
-
-# The HTML_FILE_EXTENSION tag can be used to specify the file extension for each
-# generated HTML page (for example: .htm, .php, .asp).
-# The default value is: .html.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
 HTML_FILE_EXTENSION    = .html
-
-# The HTML_HEADER tag can be used to specify a user-defined HTML header file for
-# each generated HTML page. If the tag is left blank doxygen will generate a
-# standard header.
-#
-# To get valid HTML the header file that includes any scripts and style sheets
-# that doxygen needs, which is dependent on the configuration options used (e.g.
-# the setting GENERATE_TREEVIEW). It is highly recommended to start with a
-# default header using
-# doxygen -w html new_header.html new_footer.html new_stylesheet.css
-# YourConfigFile
-# and then modify the file new_header.html. See also section "Doxygen usage"
-# for information on how to generate the default header that doxygen normally
-# uses.
-# Note: The header is subject to change so you typically have to regenerate the
-# default header when upgrading to a newer version of doxygen. For a description
-# of the possible markers and block names see the documentation.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_HEADER            =
-
-# The HTML_FOOTER tag can be used to specify a user-defined HTML footer for each
-# generated HTML page. If the tag is left blank doxygen will generate a standard
-# footer. See HTML_HEADER for more information on how to generate a default
-# footer and what special commands can be used inside the footer. See also
-# section "Doxygen usage" for information on how to generate the default footer
-# that doxygen normally uses.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_FOOTER            =
-
-# The HTML_STYLESHEET tag can be used to specify a user-defined cascading style
-# sheet that is used by each HTML page. It can be used to fine-tune the look of
-# the HTML output. If left blank doxygen will generate a default style sheet.
-# See also section "Doxygen usage" for information on how to generate the style
-# sheet that doxygen normally uses.
-# Note: It is recommended to use HTML_EXTRA_STYLESHEET instead of this tag, as
-# it is more robust and this tag (HTML_STYLESHEET) will in the future become
-# obsolete.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_STYLESHEET        =
-
-# The HTML_EXTRA_STYLESHEET tag can be used to specify additional user-defined
-# cascading style sheets that are included after the standard style sheets
-# created by doxygen. Using this option one can overrule certain style aspects.
-# This is preferred over using HTML_STYLESHEET since it does not replace the
-# standard style sheet and is therefore more robust against future updates.
-# Doxygen will copy the style sheet files to the output directory.
-# Note: The order of the extra style sheet files is of importance (e.g. the last
-# style sheet in the list overrules the setting of the previous ones in the
-# list). For an example see the documentation.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_EXTRA_STYLESHEET  =
-
-# The HTML_EXTRA_FILES tag can be used to specify one or more extra images or
-# other source files which should be copied to the HTML output directory. Note
-# that these files will be copied to the base HTML output directory. Use the
-# $relpath^ marker in the HTML_HEADER and/or HTML_FOOTER files to load these
-# files. In the HTML_STYLESHEET file, use the file name only. Also note that the
-# files will be copied as-is; there are no commands or markers available.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_EXTRA_FILES       =
-
-# The HTML_COLORSTYLE_HUE tag controls the color of the HTML output. Doxygen
-# will adjust the colors in the style sheet and background images according to
-# this color. Hue is specified as an angle on a colorwheel, see
-# https://en.wikipedia.org/wiki/Hue for more information. For instance the value
-# 0 represents red, 60 is yellow, 120 is green, 180 is cyan, 240 is blue, 300
-# purple, and 360 is red again.
-# Minimum value: 0, maximum value: 359, default value: 220.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
+# Tweak the colors a bit
 HTML_COLORSTYLE_HUE    = 220
-
-# The HTML_COLORSTYLE_SAT tag controls the purity (or saturation) of the colors
-# in the HTML output. For a value of 0 the output will use grayscales only. A
-# value of 255 will produce the most vivid colors.
-# Minimum value: 0, maximum value: 255, default value: 100.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_COLORSTYLE_SAT    = 100
-
-# The HTML_COLORSTYLE_GAMMA tag controls the gamma correction applied to the
-# luminance component of the colors in the HTML output. Values below 100
-# gradually make the output lighter, whereas values above 100 make the output
-# darker. The value divided by 100 is the actual gamma applied, so 80 represents
-# a gamma of 0.8, The value 220 represents a gamma of 2.2, and 100 does not
-# change the gamma.
-# Minimum value: 40, maximum value: 240, default value: 80.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
 HTML_COLORSTYLE_GAMMA  = 100
-
-# If the HTML_TIMESTAMP tag is set to YES then the footer of each generated HTML
-# page will contain the date and time when the page was generated. Setting this
-# to YES can help to show when doxygen was last run and thus if the
-# documentation is up to date.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_TIMESTAMP         = NO
-
-# If the HTML_DYNAMIC_MENUS tag is set to YES then the generated HTML
-# documentation will contain a main index with vertical navigation menus that
-# are dynamically created via JavaScript. If disabled, the navigation index will
-# consists of multiple levels of tabs that are statically embedded in every HTML
-# page. Disable this option to support browsers that do not have JavaScript,
-# like the Qt help browser.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_DYNAMIC_MENUS     = YES
-
-# If the HTML_DYNAMIC_SECTIONS tag is set to YES then the generated HTML
-# documentation will contain sections that can be hidden and shown after the
-# page has loaded.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_DYNAMIC_SECTIONS  = YES
-
-# With HTML_INDEX_NUM_ENTRIES one can control the preferred number of entries
-# shown in the various tree structured indices initially; the user can expand
-# and collapse entries dynamically later on. Doxygen will expand the tree to
-# such a level that at most the specified number of entries are visible (unless
-# a fully collapsed tree already exceeds this amount). So setting the number of
-# entries 1 will produce a full collapsed tree by default. 0 is a special value
-# representing an infinite number of entries and will result in a full expanded
-# tree by default.
-# Minimum value: 0, maximum value: 9999, default value: 100.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_INDEX_NUM_ENTRIES = 100
-
-# If the GENERATE_DOCSET tag is set to YES, additional index files will be
-# generated that can be used as input for Apple's Xcode 3 integrated development
-# environment (see: https://developer.apple.com/xcode/), introduced with OSX
-# 10.5 (Leopard). To create a documentation set, doxygen will generate a
-# Makefile in the HTML output directory. Running make will produce the docset in
-# that directory and running make install will install the docset in
-# ~/Library/Developer/Shared/Documentation/DocSets so that Xcode will find it at
-# startup. See https://developer.apple.com/library/archive/featuredarticles/Doxy
-# genXcode/_index.html for more information.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-GENERATE_DOCSET        = NO
-
-# This tag determines the name of the docset feed. A documentation feed provides
-# an umbrella under which multiple documentation sets from a single provider
-# (such as a company or product suite) can be grouped.
-# The default value is: Doxygen generated docs.
-# This tag requires that the tag GENERATE_DOCSET is set to YES.
-
-DOCSET_FEEDNAME        = "Doxygen generated docs"
-
-# This tag specifies a string that should uniquely identify the documentation
-# set bundle. This should be a reverse domain-name style string, e.g.
-# com.mycompany.MyDocSet. Doxygen will append .docset to the name.
-# The default value is: org.doxygen.Project.
-# This tag requires that the tag GENERATE_DOCSET is set to YES.
-
-DOCSET_BUNDLE_ID       = org.doxygen.Project
-
-# The DOCSET_PUBLISHER_ID tag specifies a string that should uniquely identify
-# the documentation publisher. This should be a reverse domain-name style
-# string, e.g. com.mycompany.MyDocSet.documentation.
-# The default value is: org.doxygen.Publisher.
-# This tag requires that the tag GENERATE_DOCSET is set to YES.
-
-DOCSET_PUBLISHER_ID    = org.doxygen.Publisher
-
-# The DOCSET_PUBLISHER_NAME tag identifies the documentation publisher.
-# The default value is: Publisher.
-# This tag requires that the tag GENERATE_DOCSET is set to YES.
-
-DOCSET_PUBLISHER_NAME  = Publisher
-
-# If the GENERATE_HTMLHELP tag is set to YES then doxygen generates three
-# additional HTML index files: index.hhp, index.hhc, and index.hhk. The
-# index.hhp is a project file that can be read by Microsoft's HTML Help Workshop
-# (see: https://www.microsoft.com/en-us/download/details.aspx?id=21138) on
-# Windows.
-#
-# The HTML Help Workshop contains a compiler that can convert all HTML output
-# generated by doxygen into a single compiled HTML file (.chm). Compiled HTML
-# files are now used as the Windows 98 help format, and will replace the old
-# Windows help format (.hlp) on all Windows platforms in the future. Compressed
-# HTML files also contain an index, a table of contents, and you can search for
-# words in the documentation. The HTML workshop also contains a viewer for
-# compressed HTML files.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-GENERATE_HTMLHELP      = NO
-
-# The CHM_FILE tag can be used to specify the file name of the resulting .chm
-# file. You can add a path in front of the file if the result should not be
-# written to the html output directory.
-# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
-
-CHM_FILE               =
-
-# The HHC_LOCATION tag can be used to specify the location (absolute path
-# including file name) of the HTML help compiler (hhc.exe). If non-empty,
-# doxygen will try to run the HTML help compiler on the generated index.hhp.
-# The file has to be specified with full path.
-# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
-
-HHC_LOCATION           =
-
-# The GENERATE_CHI flag controls if a separate .chi index file is generated
-# (YES) or that it should be included in the main .chm file (NO).
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
-
-GENERATE_CHI           = NO
-
-# The CHM_INDEX_ENCODING is used to encode HtmlHelp index (hhk), content (hhc)
-# and project file content.
-# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
-
-CHM_INDEX_ENCODING     =
-
-# The BINARY_TOC flag controls whether a binary table of contents is generated
-# (YES) or a normal table of contents (NO) in the .chm file. Furthermore it
-# enables the Previous and Next buttons.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
-
-BINARY_TOC             = NO
-
-# The TOC_EXPAND flag can be set to YES to add extra items for group members to
-# the table of contents of the HTML help documentation and to the tree view.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTMLHELP is set to YES.
-
-TOC_EXPAND             = NO
-
-# If the GENERATE_QHP tag is set to YES and both QHP_NAMESPACE and
-# QHP_VIRTUAL_FOLDER are set, an additional index file will be generated that
-# can be used as input for Qt's qhelpgenerator to generate a Qt Compressed Help
-# (.qch) of the generated HTML documentation.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-GENERATE_QHP           = NO
-
-# If the QHG_LOCATION tag is specified, the QCH_FILE tag can be used to specify
-# the file name of the resulting .qch file. The path specified is relative to
-# the HTML output folder.
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QCH_FILE               =
-
-# The QHP_NAMESPACE tag specifies the namespace to use when generating Qt Help
-# Project output. For more information please see Qt Help Project / Namespace
-# (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#namespace).
-# The default value is: org.doxygen.Project.
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QHP_NAMESPACE          = org.doxygen.Project
-
-# The QHP_VIRTUAL_FOLDER tag specifies the namespace to use when generating Qt
-# Help Project output. For more information please see Qt Help Project / Virtual
-# Folders (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#virtual-
-# folders).
-# The default value is: doc.
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QHP_VIRTUAL_FOLDER     = doc
-
-# If the QHP_CUST_FILTER_NAME tag is set, it specifies the name of a custom
-# filter to add. For more information please see Qt Help Project / Custom
-# Filters (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#custom-
-# filters).
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QHP_CUST_FILTER_NAME   =
-
-# The QHP_CUST_FILTER_ATTRS tag specifies the list of the attributes of the
-# custom filter to add. For more information please see Qt Help Project / Custom
-# Filters (see: https://doc.qt.io/archives/qt-4.8/qthelpproject.html#custom-
-# filters).
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QHP_CUST_FILTER_ATTRS  =
-
-# The QHP_SECT_FILTER_ATTRS tag specifies the list of the attributes this
-# project's filter section matches. Qt Help Project / Filter Attributes (see:
-# https://doc.qt.io/archives/qt-4.8/qthelpproject.html#filter-attributes).
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QHP_SECT_FILTER_ATTRS  =
-
-# The QHG_LOCATION tag can be used to specify the location of Qt's
-# qhelpgenerator. If non-empty doxygen will try to run qhelpgenerator on the
-# generated .qhp file.
-# This tag requires that the tag GENERATE_QHP is set to YES.
-
-QHG_LOCATION           =
-
-# If the GENERATE_ECLIPSEHELP tag is set to YES, additional index files will be
-# generated, together with the HTML files, they form an Eclipse help plugin. To
-# install this plugin and make it available under the help contents menu in
-# Eclipse, the contents of the directory containing the HTML and XML files needs
-# to be copied into the plugins directory of eclipse. The name of the directory
-# within the plugins directory should be the same as the ECLIPSE_DOC_ID value.
-# After copying Eclipse needs to be restarted before the help appears.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-GENERATE_ECLIPSEHELP   = NO
-
-# A unique identifier for the Eclipse help plugin. When installing the plugin
-# the directory name containing the HTML and XML files should also have this
-# name. Each documentation set should have its own identifier.
-# The default value is: org.doxygen.Project.
-# This tag requires that the tag GENERATE_ECLIPSEHELP is set to YES.
-
-ECLIPSE_DOC_ID         = org.doxygen.Project
-
-# If you want full control over the layout of the generated HTML pages it might
-# be necessary to disable the index and replace it with your own. The
-# DISABLE_INDEX tag can be used to turn on/off the condensed index (tabs) at top
-# of each HTML page. A value of NO enables the index and the value YES disables
-# it. Since the tabs in the index contain the same information as the navigation
-# tree, you can set this option to YES if you also set GENERATE_TREEVIEW to YES.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-DISABLE_INDEX          = NO
-
-# The GENERATE_TREEVIEW tag is used to specify whether a tree-like index
-# structure should be generated to display hierarchical information. If the tag
-# value is set to YES, a side panel will be generated containing a tree-like
-# index structure (just like the one that is generated for HTML Help). For this
-# to work a browser that supports JavaScript, DHTML, CSS and frames is required
-# (i.e. any modern browser). Windows users are probably better off using the
-# HTML help feature. Via custom style sheets (see HTML_EXTRA_STYLESHEET) one can
-# further fine-tune the look of the index. As an example, the default style
-# sheet generated by doxygen has an example that shows how to put an image at
-# the root of the tree instead of the PROJECT_NAME. Since the tree basically has
-# the same information as the tab index, you could consider setting
-# DISABLE_INDEX to YES when enabling this option.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-GENERATE_TREEVIEW      = YES
-
-# The ENUM_VALUES_PER_LINE tag can be used to set the number of enum values that
-# doxygen will group on one line in the generated HTML documentation.
-#
-# Note that a value of 0 will completely suppress the enum values from appearing
-# in the overview section.
-# Minimum value: 0, maximum value: 20, default value: 4.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-ENUM_VALUES_PER_LINE   = 4
-
-# If the treeview is enabled (see GENERATE_TREEVIEW) then this tag can be used
-# to set the initial width (in pixels) of the frame in which the tree is shown.
-# Minimum value: 0, maximum value: 1500, default value: 250.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-TREEVIEW_WIDTH         = 250
-
-# If the EXT_LINKS_IN_WINDOW option is set to YES, doxygen will open links to
-# external symbols imported via tag files in a separate window.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-EXT_LINKS_IN_WINDOW    = NO
-
-# If the HTML_FORMULA_FORMAT option is set to svg, doxygen will use the pdf2svg
-# tool (see https://github.com/dawbarton/pdf2svg) or inkscape (see
-# https://inkscape.org) to generate formulas as SVG images instead of PNGs for
-# the HTML output. These images will generally look nicer at scaled resolutions.
-# Possible values are: png (the default) and svg (looks nicer but requires the
-# pdf2svg or inkscape tool).
-# The default value is: png.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-HTML_FORMULA_FORMAT    = png
-
-# Use this tag to change the font size of LaTeX formulas included as images in
-# the HTML documentation. When you change the font size after a successful
-# doxygen run you need to manually remove any form_*.png images from the HTML
-# output directory to force them to be regenerated.
-# Minimum value: 8, maximum value: 50, default value: 10.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-FORMULA_FONTSIZE       = 10
-
-# Use the FORMULA_TRANSPARENT tag to determine whether or not the images
-# generated for formulas are transparent PNGs. Transparent PNGs are not
-# supported properly for IE 6.0, but are supported on all modern browsers.
-#
-# Note that when changing this option you need to delete any form_*.png files in
-# the HTML output directory before the changes have effect.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-FORMULA_TRANSPARENT    = YES
-
-# The FORMULA_MACROFILE can contain LaTeX \newcommand and \renewcommand commands
-# to create new LaTeX commands to be used in formulas as building blocks. See
-# the section "Including formulas" for details.
-
-FORMULA_MACROFILE      =
-
-# Enable the USE_MATHJAX option to render LaTeX formulas using MathJax (see
-# https://www.mathjax.org) which uses client side JavaScript for the rendering
-# instead of using pre-rendered bitmaps. Use this if you do not have LaTeX
-# installed or if you want to formulas look prettier in the HTML output. When
-# enabled you may also need to install MathJax separately and configure the path
-# to it using the MATHJAX_RELPATH option.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-USE_MATHJAX            = NO
-
-# When MathJax is enabled you can set the default output format to be used for
-# the MathJax output. See the MathJax site (see:
-# http://docs.mathjax.org/en/latest/output.html) for more details.
-# Possible values are: HTML-CSS (which is slower, but has the best
-# compatibility), NativeMML (i.e. MathML) and SVG.
-# The default value is: HTML-CSS.
-# This tag requires that the tag USE_MATHJAX is set to YES.
-
-MATHJAX_FORMAT         = HTML-CSS
-
-# When MathJax is enabled you need to specify the location relative to the HTML
-# output directory using the MATHJAX_RELPATH option. The destination directory
-# should contain the MathJax.js script. For instance, if the mathjax directory
-# is located at the same level as the HTML output directory, then
-# MATHJAX_RELPATH should be ../mathjax. The default value points to the MathJax
-# Content Delivery Network so you can quickly see the result without installing
-# MathJax. However, it is strongly recommended to install a local copy of
-# MathJax from https://www.mathjax.org before deployment.
-# The default value is: https://cdn.jsdelivr.net/npm/mathjax@2.
-# This tag requires that the tag USE_MATHJAX is set to YES.
-
-MATHJAX_RELPATH        = https://cdn.jsdelivr.net/npm/mathjax@2
-
-# The MATHJAX_EXTENSIONS tag can be used to specify one or more MathJax
-# extension names that should be enabled during MathJax rendering. For example
-# MATHJAX_EXTENSIONS = TeX/AMSmath TeX/AMSsymbols
-# This tag requires that the tag USE_MATHJAX is set to YES.
-
-MATHJAX_EXTENSIONS     =
-
-# The MATHJAX_CODEFILE tag can be used to specify a file with javascript pieces
-# of code that will be used on startup of the MathJax code. See the MathJax site
-# (see: http://docs.mathjax.org/en/latest/output.html) for more details. For an
-# example see the documentation.
-# This tag requires that the tag USE_MATHJAX is set to YES.
-
-MATHJAX_CODEFILE       =
-
-# When the SEARCHENGINE tag is enabled doxygen will generate a search box for
-# the HTML output. The underlying search engine uses javascript and DHTML and
-# should work on any modern browser. Note that when using HTML help
-# (GENERATE_HTMLHELP), Qt help (GENERATE_QHP), or docsets (GENERATE_DOCSET)
-# there is already a search function so this one should typically be disabled.
-# For large projects the javascript based search engine can be slow, then
-# enabling SERVER_BASED_SEARCH may provide a better solution. It is possible to
-# search using the keyboard; to jump to the search box use <access key> + S
-# (what the <access key> is depends on the OS and browser, but it is typically
-# <CTRL>, <ALT>/<option>, or both). Inside the search box use the <cursor down
-# key> to jump into the search results window, the results can be navigated
-# using the <cursor keys>. Press <Enter> to select an item or <escape> to cancel
-# the search. The filter options can be selected when the cursor is inside the
-# search box by pressing <Shift>+<cursor down>. Also here use the <cursor keys>
-# to select a filter and <Enter> or <escape> to activate or cancel the filter
-# option.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_HTML is set to YES.
-
-SEARCHENGINE           = YES
-
-# When the SERVER_BASED_SEARCH tag is enabled the search engine will be
-# implemented using a web server instead of a web client using JavaScript. There
-# are two flavors of web server based searching depending on the EXTERNAL_SEARCH
-# setting. When disabled, doxygen will generate a PHP script for searching and
-# an index file used by the script. When EXTERNAL_SEARCH is enabled the indexing
-# and searching needs to be provided by external tools. See the section
-# "External Indexing and Searching" for details.
-# The default value is: NO.
-# This tag requires that the tag SEARCHENGINE is set to YES.
-
-SERVER_BASED_SEARCH    = NO
-
-# When EXTERNAL_SEARCH tag is enabled doxygen will no longer generate the PHP
-# script for searching. Instead the search results are written to an XML file
-# which needs to be processed by an external indexer. Doxygen will invoke an
-# external search engine pointed to by the SEARCHENGINE_URL option to obtain the
-# search results.
-#
-# Doxygen ships with an example indexer (doxyindexer) and search engine
-# (doxysearch.cgi) which are based on the open source search engine library
-# Xapian (see: https://xapian.org/).
-#
-# See the section "External Indexing and Searching" for details.
-# The default value is: NO.
-# This tag requires that the tag SEARCHENGINE is set to YES.
-
-EXTERNAL_SEARCH        = NO
-
-# The SEARCHENGINE_URL should point to a search engine hosted by a web server
-# which will return the search results when EXTERNAL_SEARCH is enabled.
-#
-# Doxygen ships with an example indexer (doxyindexer) and search engine
-# (doxysearch.cgi) which are based on the open source search engine library
-# Xapian (see: https://xapian.org/). See the section "External Indexing and
-# Searching" for details.
-# This tag requires that the tag SEARCHENGINE is set to YES.
-
-SEARCHENGINE_URL       =
-
-# When SERVER_BASED_SEARCH and EXTERNAL_SEARCH are both enabled the unindexed
-# search data is written to a file for indexing by an external tool. With the
-# SEARCHDATA_FILE tag the name of this file can be specified.
-# The default file is: searchdata.xml.
-# This tag requires that the tag SEARCHENGINE is set to YES.
-
-SEARCHDATA_FILE        = searchdata.xml
-
-# When SERVER_BASED_SEARCH and EXTERNAL_SEARCH are both enabled the
-# EXTERNAL_SEARCH_ID tag can be used as an identifier for the project. This is
-# useful in combination with EXTRA_SEARCH_MAPPINGS to search through multiple
-# projects and redirect the results back to the right project.
-# This tag requires that the tag SEARCHENGINE is set to YES.
-
-EXTERNAL_SEARCH_ID     =
-
-# The EXTRA_SEARCH_MAPPINGS tag can be used to enable searching through doxygen
-# projects other than the one defined by this configuration file, but that are
-# all added to the same external search index. Each project needs to have a
-# unique id set via EXTERNAL_SEARCH_ID. The search mapping then maps the id of
-# to a relative location where the documentation can be found. The format is:
-# EXTRA_SEARCH_MAPPINGS = tagname1=loc1 tagname2=loc2 ...
-# This tag requires that the tag SEARCHENGINE is set to YES.
-
-EXTRA_SEARCH_MAPPINGS  =
-
-#---------------------------------------------------------------------------
-# Configuration options related to the LaTeX output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_LATEX tag is set to YES, doxygen will generate LaTeX output.
-# The default value is: YES.
-
-GENERATE_LATEX         = YES
-
-# The LATEX_OUTPUT tag is used to specify where the LaTeX docs will be put. If a
-# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
-# it.
-# The default directory is: latex.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_OUTPUT           = latex
-
-# The LATEX_CMD_NAME tag can be used to specify the LaTeX command name to be
-# invoked.
-#
-# Note that when not enabling USE_PDFLATEX the default is latex when enabling
-# USE_PDFLATEX the default is pdflatex and when in the later case latex is
-# chosen this is overwritten by pdflatex. For specific output languages the
-# default can have been set differently, this depends on the implementation of
-# the output language.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_CMD_NAME         =
-
-# The MAKEINDEX_CMD_NAME tag can be used to specify the command name to generate
-# index for LaTeX.
-# Note: This tag is used in the Makefile / make.bat.
-# See also: LATEX_MAKEINDEX_CMD for the part in the generated output file
-# (.tex).
-# The default file is: makeindex.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-MAKEINDEX_CMD_NAME     = makeindex
-
-# The LATEX_MAKEINDEX_CMD tag can be used to specify the command name to
-# generate index for LaTeX. In case there is no backslash (\) as first character
-# it will be automatically added in the LaTeX code.
-# Note: This tag is used in the generated output file (.tex).
-# See also: MAKEINDEX_CMD_NAME for the part in the Makefile / make.bat.
-# The default value is: makeindex.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_MAKEINDEX_CMD    = makeindex
-
-# If the COMPACT_LATEX tag is set to YES, doxygen generates more compact LaTeX
-# documents. This may be useful for small projects and may help to save some
-# trees in general.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-COMPACT_LATEX          = NO
-
-# The PAPER_TYPE tag can be used to set the paper type that is used by the
-# printer.
-# Possible values are: a4 (210 x 297 mm), letter (8.5 x 11 inches), legal (8.5 x
-# 14 inches) and executive (7.25 x 10.5 inches).
-# The default value is: a4.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-PAPER_TYPE             = a4
-
-# The EXTRA_PACKAGES tag can be used to specify one or more LaTeX package names
-# that should be included in the LaTeX output. The package can be specified just
-# by its name or with the correct syntax as to be used with the LaTeX
-# \usepackage command. To get the times font for instance you can specify :
-# EXTRA_PACKAGES=times or EXTRA_PACKAGES={times}
-# To use the option intlimits with the amsmath package you can specify:
-# EXTRA_PACKAGES=[intlimits]{amsmath}
-# If left blank no extra packages will be included.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-EXTRA_PACKAGES         =
-
-# The LATEX_HEADER tag can be used to specify a personal LaTeX header for the
-# generated LaTeX document. The header should contain everything until the first
-# chapter. If it is left blank doxygen will generate a standard header. See
-# section "Doxygen usage" for information on how to let doxygen write the
-# default header to a separate file.
-#
-# Note: Only use a user-defined header if you know what you are doing! The
-# following commands have a special meaning inside the header: $title,
-# $datetime, $date, $doxygenversion, $projectname, $projectnumber,
-# $projectbrief, $projectlogo. Doxygen will replace $title with the empty
-# string, for the replacement values of the other commands the user is referred
-# to HTML_HEADER.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_HEADER           =
-
-# The LATEX_FOOTER tag can be used to specify a personal LaTeX footer for the
-# generated LaTeX document. The footer should contain everything after the last
-# chapter. If it is left blank doxygen will generate a standard footer. See
-# LATEX_HEADER for more information on how to generate a default footer and what
-# special commands can be used inside the footer.
-#
-# Note: Only use a user-defined footer if you know what you are doing!
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_FOOTER           =
-
-# The LATEX_EXTRA_STYLESHEET tag can be used to specify additional user-defined
-# LaTeX style sheets that are included after the standard style sheets created
-# by doxygen. Using this option one can overrule certain style aspects. Doxygen
-# will copy the style sheet files to the output directory.
-# Note: The order of the extra style sheet files is of importance (e.g. the last
-# style sheet in the list overrules the setting of the previous ones in the
-# list).
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_EXTRA_STYLESHEET =
-
-# The LATEX_EXTRA_FILES tag can be used to specify one or more extra images or
-# other source files which should be copied to the LATEX_OUTPUT output
-# directory. Note that the files will be copied as-is; there are no commands or
-# markers available.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_EXTRA_FILES      =
-
-# If the PDF_HYPERLINKS tag is set to YES, the LaTeX that is generated is
-# prepared for conversion to PDF (using ps2pdf or pdflatex). The PDF file will
-# contain links (just like the HTML output) instead of page references. This
-# makes the output suitable for online browsing using a PDF viewer.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-PDF_HYPERLINKS         = YES
-
-# If the USE_PDFLATEX tag is set to YES, doxygen will use the engine as
-# specified with LATEX_CMD_NAME to generate the PDF file directly from the LaTeX
-# files. Set this option to YES, to get a higher quality PDF documentation.
-#
-# See also section LATEX_CMD_NAME for selecting the engine.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-USE_PDFLATEX           = YES
-
-# If the LATEX_BATCHMODE tag is set to YES, doxygen will add the \batchmode
-# command to the generated LaTeX files. This will instruct LaTeX to keep running
-# if errors occur, instead of asking the user for help. This option is also used
-# when generating formulas in HTML.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_BATCHMODE        = NO
-
-# If the LATEX_HIDE_INDICES tag is set to YES then doxygen will not include the
-# index chapters (such as File Index, Compound Index, etc.) in the output.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_HIDE_INDICES     = NO
-
-# If the LATEX_SOURCE_CODE tag is set to YES then doxygen will include source
-# code with syntax highlighting in the LaTeX output.
-#
-# Note that which sources are shown also depends on other settings such as
-# SOURCE_BROWSER.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_SOURCE_CODE      = NO
-
-# The LATEX_BIB_STYLE tag can be used to specify the style to use for the
-# bibliography, e.g. plainnat, or ieeetr. See
-# https://en.wikipedia.org/wiki/BibTeX and \cite for more info.
-# The default value is: plain.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_BIB_STYLE        = plain
-
-# If the LATEX_TIMESTAMP tag is set to YES then the footer of each generated
-# page will contain the date and time when the page was generated. Setting this
-# to NO can help when comparing the output of multiple runs.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_TIMESTAMP        = NO
-
-# The LATEX_EMOJI_DIRECTORY tag is used to specify the (relative or absolute)
-# path from which the emoji images will be read. If a relative path is entered,
-# it will be relative to the LATEX_OUTPUT directory. If left blank the
-# LATEX_OUTPUT directory will be used.
-# This tag requires that the tag GENERATE_LATEX is set to YES.
-
-LATEX_EMOJI_DIRECTORY  =
-
-#---------------------------------------------------------------------------
-# Configuration options related to the RTF output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_RTF tag is set to YES, doxygen will generate RTF output. The
-# RTF output is optimized for Word 97 and may not look too pretty with other RTF
-# readers/editors.
-# The default value is: NO.
-
-GENERATE_RTF           = NO
-
-# The RTF_OUTPUT tag is used to specify where the RTF docs will be put. If a
-# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
-# it.
-# The default directory is: rtf.
-# This tag requires that the tag GENERATE_RTF is set to YES.
-
-RTF_OUTPUT             = rtf
-
-# If the COMPACT_RTF tag is set to YES, doxygen generates more compact RTF
-# documents. This may be useful for small projects and may help to save some
-# trees in general.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_RTF is set to YES.
-
-COMPACT_RTF            = NO
-
-# If the RTF_HYPERLINKS tag is set to YES, the RTF that is generated will
-# contain hyperlink fields. The RTF file will contain links (just like the HTML
-# output) instead of page references. This makes the output suitable for online
-# browsing using Word or some other Word compatible readers that support those
-# fields.
-#
-# Note: WordPad (write) and others do not support links.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_RTF is set to YES.
-
-RTF_HYPERLINKS         = NO
-
-# Load stylesheet definitions from file. Syntax is similar to doxygen's
-# configuration file, i.e. a series of assignments. You only have to provide
-# replacements, missing definitions are set to their default value.
-#
-# See also section "Doxygen usage" for information on how to generate the
-# default style sheet that doxygen normally uses.
-# This tag requires that the tag GENERATE_RTF is set to YES.
-
-RTF_STYLESHEET_FILE    =
-
-# Set optional variables used in the generation of an RTF document. Syntax is
-# similar to doxygen's configuration file. A template extensions file can be
-# generated using doxygen -e rtf extensionFile.
-# This tag requires that the tag GENERATE_RTF is set to YES.
-
-RTF_EXTENSIONS_FILE    =
-
-# If the RTF_SOURCE_CODE tag is set to YES then doxygen will include source code
-# with syntax highlighting in the RTF output.
-#
-# Note that which sources are shown also depends on other settings such as
-# SOURCE_BROWSER.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_RTF is set to YES.
-
-RTF_SOURCE_CODE        = NO
-
-#---------------------------------------------------------------------------
-# Configuration options related to the man page output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_MAN tag is set to YES, doxygen will generate man pages for
-# classes and files.
-# The default value is: NO.
-
-GENERATE_MAN           = NO
-
-# The MAN_OUTPUT tag is used to specify where the man pages will be put. If a
-# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
-# it. A directory man3 will be created inside the directory specified by
-# MAN_OUTPUT.
-# The default directory is: man.
-# This tag requires that the tag GENERATE_MAN is set to YES.
-
-MAN_OUTPUT             = man
-
-# The MAN_EXTENSION tag determines the extension that is added to the generated
-# man pages. In case the manual section does not start with a number, the number
-# 3 is prepended. The dot (.) at the beginning of the MAN_EXTENSION tag is
-# optional.
-# The default value is: .3.
-# This tag requires that the tag GENERATE_MAN is set to YES.
-
-MAN_EXTENSION          = .3
-
-# The MAN_SUBDIR tag determines the name of the directory created within
-# MAN_OUTPUT in which the man pages are placed. If defaults to man followed by
-# MAN_EXTENSION with the initial . removed.
-# This tag requires that the tag GENERATE_MAN is set to YES.
-
-MAN_SUBDIR             =
-
-# If the MAN_LINKS tag is set to YES and doxygen generates man output, then it
-# will generate one additional man file for each entity documented in the real
-# man page(s). These additional files only source the real man page, but without
-# them the man command would be unable to find the correct page.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_MAN is set to YES.
-
-MAN_LINKS              = NO
-
-#---------------------------------------------------------------------------
-# Configuration options related to the XML output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_XML tag is set to YES, doxygen will generate an XML file that
-# captures the structure of the code including all documentation.
-# The default value is: NO.
-
-GENERATE_XML           = NO
-
-# The XML_OUTPUT tag is used to specify where the XML pages will be put. If a
-# relative path is entered the value of OUTPUT_DIRECTORY will be put in front of
-# it.
-# The default directory is: xml.
-# This tag requires that the tag GENERATE_XML is set to YES.
-
-XML_OUTPUT             = xml
-
-# If the XML_PROGRAMLISTING tag is set to YES, doxygen will dump the program
-# listings (including syntax highlighting and cross-referencing information) to
-# the XML output. Note that enabling this will significantly increase the size
-# of the XML output.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_XML is set to YES.
-
-XML_PROGRAMLISTING     = YES
-
-# If the XML_NS_MEMB_FILE_SCOPE tag is set to YES, doxygen will include
-# namespace members in file scope as well, matching the HTML output.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_XML is set to YES.
-
-XML_NS_MEMB_FILE_SCOPE = NO
-
-#---------------------------------------------------------------------------
-# Configuration options related to the DOCBOOK output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_DOCBOOK tag is set to YES, doxygen will generate Docbook files
-# that can be used to generate PDF.
-# The default value is: NO.
-
-GENERATE_DOCBOOK       = NO
-
-# The DOCBOOK_OUTPUT tag is used to specify where the Docbook pages will be put.
-# If a relative path is entered the value of OUTPUT_DIRECTORY will be put in
-# front of it.
-# The default directory is: docbook.
-# This tag requires that the tag GENERATE_DOCBOOK is set to YES.
-
-DOCBOOK_OUTPUT         = docbook
-
-# If the DOCBOOK_PROGRAMLISTING tag is set to YES, doxygen will include the
-# program listings (including syntax highlighting and cross-referencing
-# information) to the DOCBOOK output. Note that enabling this will significantly
-# increase the size of the DOCBOOK output.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_DOCBOOK is set to YES.
-
-DOCBOOK_PROGRAMLISTING = NO
-
-#---------------------------------------------------------------------------
-# Configuration options for the AutoGen Definitions output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_AUTOGEN_DEF tag is set to YES, doxygen will generate an
-# AutoGen Definitions (see http://autogen.sourceforge.net/) file that captures
-# the structure of the code including all documentation. Note that this feature
-# is still experimental and incomplete at the moment.
-# The default value is: NO.
-
-GENERATE_AUTOGEN_DEF   = NO
-
-#---------------------------------------------------------------------------
-# Configuration options related to the Perl module output
-#---------------------------------------------------------------------------
-
-# If the GENERATE_PERLMOD tag is set to YES, doxygen will generate a Perl module
-# file that captures the structure of the code including all documentation.
-#
-# Note that this feature is still experimental and incomplete at the moment.
-# The default value is: NO.
-
-GENERATE_PERLMOD       = NO
-
-# If the PERLMOD_LATEX tag is set to YES, doxygen will generate the necessary
-# Makefile rules, Perl scripts and LaTeX code to be able to generate PDF and DVI
-# output from the Perl module output.
-# The default value is: NO.
-# This tag requires that the tag GENERATE_PERLMOD is set to YES.
-
-PERLMOD_LATEX          = NO
-
-# If the PERLMOD_PRETTY tag is set to YES, the Perl module output will be nicely
-# formatted so it can be parsed by a human reader. This is useful if you want to
-# understand what is going on. On the other hand, if this tag is set to NO, the
-# size of the Perl module output will be much smaller and Perl will parse it
-# just the same.
-# The default value is: YES.
-# This tag requires that the tag GENERATE_PERLMOD is set to YES.
-
-PERLMOD_PRETTY         = YES
-
-# The names of the make variables in the generated doxyrules.make file are
-# prefixed with the string contained in PERLMOD_MAKEVAR_PREFIX. This is useful
-# so different doxyrules.make files included by the same Makefile don't
-# overwrite each other's variables.
-# This tag requires that the tag GENERATE_PERLMOD is set to YES.
-
-PERLMOD_MAKEVAR_PREFIX =
-
-#---------------------------------------------------------------------------
-# Configuration options related to the preprocessor
-#---------------------------------------------------------------------------
-
-# If the ENABLE_PREPROCESSING tag is set to YES, doxygen will evaluate all
-# C-preprocessor directives found in the sources and include files.
-# The default value is: YES.
-
-ENABLE_PREPROCESSING   = YES
-
-# If the MACRO_EXPANSION tag is set to YES, doxygen will expand all macro names
-# in the source code. If set to NO, only conditional compilation will be
-# performed. Macro expansion can be done in a controlled way by setting
-# EXPAND_ONLY_PREDEF to YES.
-# The default value is: NO.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-MACRO_EXPANSION        = YES
-
-# If the EXPAND_ONLY_PREDEF and MACRO_EXPANSION tags are both set to YES then
-# the macro expansion is limited to the macros specified with the PREDEFINED and
-# EXPAND_AS_DEFINED tags.
-# The default value is: NO.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-EXPAND_ONLY_PREDEF     = YES
-
-# If the SEARCH_INCLUDES tag is set to YES, the include files in the
-# INCLUDE_PATH will be searched if a #include is found.
-# The default value is: YES.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-SEARCH_INCLUDES        = YES
-
-# The INCLUDE_PATH tag can be used to specify one or more directories that
-# contain include files that are not input files but should be processed by the
-# preprocessor.
-# This tag requires that the tag SEARCH_INCLUDES is set to YES.
-
-INCLUDE_PATH           =
-
-# You can use the INCLUDE_FILE_PATTERNS tag to specify one or more wildcard
-# patterns (like *.h and *.hpp) to filter out the header-files in the
-# directories. If left blank, the patterns specified with FILE_PATTERNS will be
-# used.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-INCLUDE_FILE_PATTERNS  =
-
-# The PREDEFINED tag can be used to specify one or more macro names that are
-# defined before the preprocessor is started (similar to the -D option of e.g.
-# gcc). The argument of the tag is a list of macros of the form: name or
-# name=definition (no spaces). If the definition and the "=" are omitted, "=1"
-# is assumed. To prevent a macro definition from being undefined via #undef or
-# recursively expanded use the := operator instead of the = operator.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-PREDEFINED             = "XXH_PUBLIC_API=" \
-                         "XXH_FORCE_INLINE=static inline" \
-                         "XXH_NO_INLINE=static" \
-                         "XXH_RESTRICT=restrict" \
-                         "XSUM_API=" \
-                         "XXH_DOXYGEN=" \
-                         "XXH_STATIC_LINKING_ONLY" \
-                         "XXH_IMPLEMENTATION" \
-                         "XXH_ALIGN(N)=alignas(N)" \
-                         "XXH_ALIGN_MEMBER(align,type)=alignas(align) type"
-
-
-# If the MACRO_EXPANSION and EXPAND_ONLY_PREDEF tags are set to YES then this
-# tag can be used to specify a list of macro names that should be expanded. The
-# macro definition that is found in the sources will be used. Use the PREDEFINED
-# tag if you want to use a different macro definition that overrules the
-# definition found in the source code.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-EXPAND_AS_DEFINED      =
-
-
-# If the SKIP_FUNCTION_MACROS tag is set to YES then doxygen's preprocessor will
-# remove all references to function-like macros that are alone on a line, have
-# an all uppercase name, and do not end with a semicolon. Such function macros
-# are typically used for boiler-plate code, and will confuse the parser if not
-# removed.
-# The default value is: YES.
-# This tag requires that the tag ENABLE_PREPROCESSING is set to YES.
-
-SKIP_FUNCTION_MACROS   = YES
-
-#---------------------------------------------------------------------------
-# Configuration options related to external references
-#---------------------------------------------------------------------------
-
-# The TAGFILES tag can be used to specify one or more tag files. For each tag
-# file the location of the external documentation should be added. The format of
-# a tag file without this location is as follows:
-# TAGFILES = file1 file2 ...
-# Adding location for the tag files is done as follows:
-# TAGFILES = file1=loc1 "file2 = loc2" ...
-# where loc1 and loc2 can be relative or absolute paths or URLs. See the
-# section "Linking to external documentation" for more information about the use
-# of tag files.
-# Note: Each tag file must have a unique name (where the name does NOT include
-# the path). If a tag file is not located in the directory in which doxygen is
-# run, you must also specify the path to the tagfile here.
-
-TAGFILES               =
-
-# When a file name is specified after GENERATE_TAGFILE, doxygen will create a
-# tag file that is based on the input files it reads. See section "Linking to
-# external documentation" for more information about the usage of tag files.
-
-GENERATE_TAGFILE       =
-
-# If the ALLEXTERNALS tag is set to YES, all external class will be listed in
-# the class index. If set to NO, only the inherited external classes will be
-# listed.
-# The default value is: NO.
-
-ALLEXTERNALS           = NO
-
-# If the EXTERNAL_GROUPS tag is set to YES, all external groups will be listed
-# in the modules index. If set to NO, only the current project's groups will be
-# listed.
-# The default value is: YES.
-
-EXTERNAL_GROUPS        = YES
-
-# If the EXTERNAL_PAGES tag is set to YES, all external pages will be listed in
-# the related pages index. If set to NO, only the current project's pages will
-# be listed.
-# The default value is: YES.
-
-EXTERNAL_PAGES         = YES
-
-#---------------------------------------------------------------------------
-# Configuration options related to the dot tool
-#---------------------------------------------------------------------------
-
-# If the CLASS_DIAGRAMS tag is set to YES, doxygen will generate a class diagram
-# (in HTML and LaTeX) for classes with base or super classes. Setting the tag to
-# NO turns the diagrams off. Note that this option also works with HAVE_DOT
-# disabled, but it is recommended to install and use dot, since it yields more
-# powerful graphs.
-# The default value is: YES.
-
-CLASS_DIAGRAMS         = YES
-
-# You can include diagrams made with dia in doxygen documentation. Doxygen will
-# then run dia to produce the diagram and insert it in the documentation. The
-# DIA_PATH tag allows you to specify the directory where the dia binary resides.
-# If left empty dia is assumed to be found in the default search path.
-
-DIA_PATH               =
-
-# If set to YES the inheritance and collaboration graphs will hide inheritance
-# and usage relations if the target is undocumented or is not a class.
-# The default value is: YES.
-
-HIDE_UNDOC_RELATIONS   = YES
-
-# If you set the HAVE_DOT tag to YES then doxygen will assume the dot tool is
-# available from the path. This tool is part of Graphviz (see:
-# http://www.graphviz.org/), a graph visualization toolkit from AT&T and Lucent
-# Bell Labs. The other options in this section have no effect if this option is
-# set to NO
-# The default value is: NO.
-
-HAVE_DOT               = NO
-
-# The DOT_NUM_THREADS specifies the number of dot invocations doxygen is allowed
-# to run in parallel. When set to 0 doxygen will base this on the number of
-# processors available in the system. You can set it explicitly to a value
-# larger than 0 to get control over the balance between CPU load and processing
-# speed.
-# Minimum value: 0, maximum value: 32, default value: 0.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_NUM_THREADS        = 0
-
-# When you want a differently looking font in the dot files that doxygen
-# generates you can specify the font name using DOT_FONTNAME. You need to make
-# sure dot is able to find the font, which can be done by putting it in a
-# standard location or by setting the DOTFONTPATH environment variable or by
-# setting DOT_FONTPATH to the directory containing the font.
-# The default value is: Helvetica.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_FONTNAME           = Helvetica
-
-# The DOT_FONTSIZE tag can be used to set the size (in points) of the font of
-# dot graphs.
-# Minimum value: 4, maximum value: 24, default value: 10.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_FONTSIZE           = 10
-
-# By default doxygen will tell dot to use the default font as specified with
-# DOT_FONTNAME. If you specify a different font using DOT_FONTNAME you can set
-# the path where dot can find it using this tag.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_FONTPATH           =
-
-# If the CLASS_GRAPH tag is set to YES then doxygen will generate a graph for
-# each documented class showing the direct and indirect inheritance relations.
-# Setting this tag to YES will force the CLASS_DIAGRAMS tag to NO.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-CLASS_GRAPH            = YES
-
-# If the COLLABORATION_GRAPH tag is set to YES then doxygen will generate a
-# graph for each documented class showing the direct and indirect implementation
-# dependencies (inheritance, containment, and class references variables) of the
-# class with other documented classes.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-COLLABORATION_GRAPH    = YES
-
-# If the GROUP_GRAPHS tag is set to YES then doxygen will generate a graph for
-# groups, showing the direct groups dependencies.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-GROUP_GRAPHS           = YES
-
-# If the UML_LOOK tag is set to YES, doxygen will generate inheritance and
-# collaboration diagrams in a style similar to the OMG's Unified Modeling
-# Language.
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-UML_LOOK               = NO
-
-# If the UML_LOOK tag is enabled, the fields and methods are shown inside the
-# class node. If there are many fields or methods and many nodes the graph may
-# become too big to be useful. The UML_LIMIT_NUM_FIELDS threshold limits the
-# number of items for each type to make the size more manageable. Set this to 0
-# for no limit. Note that the threshold may be exceeded by 50% before the limit
-# is enforced. So when you set the threshold to 10, up to 15 fields may appear,
-# but if the number exceeds 15, the total amount of fields shown is limited to
-# 10.
-# Minimum value: 0, maximum value: 100, default value: 10.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-UML_LIMIT_NUM_FIELDS   = 10
-
-# If the TEMPLATE_RELATIONS tag is set to YES then the inheritance and
-# collaboration graphs will show the relations between templates and their
-# instances.
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-TEMPLATE_RELATIONS     = NO
-
-# If the INCLUDE_GRAPH, ENABLE_PREPROCESSING and SEARCH_INCLUDES tags are set to
-# YES then doxygen will generate a graph for each documented file showing the
-# direct and indirect include dependencies of the file with other documented
-# files.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-INCLUDE_GRAPH          = YES
-
-# If the INCLUDED_BY_GRAPH, ENABLE_PREPROCESSING and SEARCH_INCLUDES tags are
-# set to YES then doxygen will generate a graph for each documented file showing
-# the direct and indirect include dependencies of the file with other documented
-# files.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-INCLUDED_BY_GRAPH      = YES
-
-# If the CALL_GRAPH tag is set to YES then doxygen will generate a call
-# dependency graph for every global function or class method.
-#
-# Note that enabling this option will significantly increase the time of a run.
-# So in most cases it will be better to enable call graphs for selected
-# functions only using the \callgraph command. Disabling a call graph can be
-# accomplished by means of the command \hidecallgraph.
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-CALL_GRAPH             = NO
-
-# If the CALLER_GRAPH tag is set to YES then doxygen will generate a caller
-# dependency graph for every global function or class method.
-#
-# Note that enabling this option will significantly increase the time of a run.
-# So in most cases it will be better to enable caller graphs for selected
-# functions only using the \callergraph command. Disabling a caller graph can be
-# accomplished by means of the command \hidecallergraph.
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-CALLER_GRAPH           = NO
-
-# If the GRAPHICAL_HIERARCHY tag is set to YES then doxygen will graphical
-# hierarchy of all classes instead of a textual one.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-GRAPHICAL_HIERARCHY    = YES
-
-# If the DIRECTORY_GRAPH tag is set to YES then doxygen will show the
-# dependencies a directory has on other directories in a graphical way. The
-# dependency relations are determined by the #include relations between the
-# files in the directories.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DIRECTORY_GRAPH        = YES
-
-# The DOT_IMAGE_FORMAT tag can be used to set the image format of the images
-# generated by dot. For an explanation of the image formats see the section
-# output formats in the documentation of the dot tool (Graphviz (see:
-# http://www.graphviz.org/)).
-# Note: If you choose svg you need to set HTML_FILE_EXTENSION to xhtml in order
-# to make the SVG files visible in IE 9+ (other browsers do not have this
-# requirement).
-# Possible values are: png, jpg, gif, svg, png:gd, png:gd:gd, png:cairo,
-# png:cairo:gd, png:cairo:cairo, png:cairo:gdiplus, png:gdiplus and
-# png:gdiplus:gdiplus.
-# The default value is: png.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_IMAGE_FORMAT       = png
-
-# If DOT_IMAGE_FORMAT is set to svg, then this option can be set to YES to
-# enable generation of interactive SVG images that allow zooming and panning.
-#
-# Note that this requires a modern browser other than Internet Explorer. Tested
-# and working are Firefox, Chrome, Safari, and Opera.
-# Note: For IE 9+ you need to set HTML_FILE_EXTENSION to xhtml in order to make
-# the SVG files visible. Older versions of IE do not have SVG support.
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-INTERACTIVE_SVG        = NO
-
-# The DOT_PATH tag can be used to specify the path where the dot tool can be
-# found. If left blank, it is assumed the dot tool can be found in the path.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_PATH               =
-
-# The DOTFILE_DIRS tag can be used to specify one or more directories that
-# contain dot files that are included in the documentation (see the \dotfile
-# command).
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOTFILE_DIRS           =
-
-# The MSCFILE_DIRS tag can be used to specify one or more directories that
-# contain msc files that are included in the documentation (see the \mscfile
-# command).
-
-MSCFILE_DIRS           =
-
-# The DIAFILE_DIRS tag can be used to specify one or more directories that
-# contain dia files that are included in the documentation (see the \diafile
-# command).
-
-DIAFILE_DIRS           =
-
-# When using plantuml, the PLANTUML_JAR_PATH tag should be used to specify the
-# path where java can find the plantuml.jar file. If left blank, it is assumed
-# PlantUML is not used or called during a preprocessing step. Doxygen will
-# generate a warning when it encounters a \startuml command in this case and
-# will not generate output for the diagram.
-
-PLANTUML_JAR_PATH      =
-
-# When using plantuml, the PLANTUML_CFG_FILE tag can be used to specify a
-# configuration file for plantuml.
-
-PLANTUML_CFG_FILE      =
-
-# When using plantuml, the specified paths are searched for files specified by
-# the !include statement in a plantuml block.
-
-PLANTUML_INCLUDE_PATH  =
-
-# The DOT_GRAPH_MAX_NODES tag can be used to set the maximum number of nodes
-# that will be shown in the graph. If the number of nodes in a graph becomes
-# larger than this value, doxygen will truncate the graph, which is visualized
-# by representing a node as a red box. Note that doxygen if the number of direct
-# children of the root node in a graph is already larger than
-# DOT_GRAPH_MAX_NODES then the graph will not be shown at all. Also note that
-# the size of a graph can be further restricted by MAX_DOT_GRAPH_DEPTH.
-# Minimum value: 0, maximum value: 10000, default value: 50.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_GRAPH_MAX_NODES    = 50
-
-# The MAX_DOT_GRAPH_DEPTH tag can be used to set the maximum depth of the graphs
-# generated by dot. A depth value of 3 means that only nodes reachable from the
-# root by following a path via at most 3 edges will be shown. Nodes that lay
-# further from the root node will be omitted. Note that setting this option to 1
-# or 2 may greatly reduce the computation time needed for large code bases. Also
-# note that the size of a graph can be further restricted by
-# DOT_GRAPH_MAX_NODES. Using a depth of 0 means no depth restriction.
-# Minimum value: 0, maximum value: 1000, default value: 0.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-MAX_DOT_GRAPH_DEPTH    = 0
-
-# Set the DOT_TRANSPARENT tag to YES to generate images with a transparent
-# background. This is disabled by default, because dot on Windows does not seem
-# to support this out of the box.
-#
-# Warning: Depending on the platform used, enabling this option may lead to
-# badly anti-aliased labels on the edges of a graph (i.e. they become hard to
-# read).
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_TRANSPARENT        = NO
-
-# Set the DOT_MULTI_TARGETS tag to YES to allow dot to generate multiple output
-# files in one run (i.e. multiple -o and -T options on the command line). This
-# makes dot run faster, but since only newer versions of dot (>1.8.10) support
-# this, this feature is disabled by default.
-# The default value is: NO.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_MULTI_TARGETS      = NO
-
-# If the GENERATE_LEGEND tag is set to YES doxygen will generate a legend page
-# explaining the meaning of the various boxes and arrows in the dot generated
-# graphs.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-GENERATE_LEGEND        = YES
-
-# If the DOT_CLEANUP tag is set to YES, doxygen will remove the intermediate dot
-# files that are used to generate the various graphs.
-# The default value is: YES.
-# This tag requires that the tag HAVE_DOT is set to YES.
-
-DOT_CLEANUP            = YES
+HTML_COLORSTYLE_SAT    = 100

From b68e06ea25c262d2886e9d2391e41afe4961f220 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Fri, 16 Oct 2020 16:14:42 -0400
Subject: [PATCH 036/187] Doxyfile: Disable LaTeX

HTML *and* LaTeX is a little redundant.
---
 Doxyfile | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/Doxyfile b/Doxyfile
index 1ce66981..2c27c987 100644
--- a/Doxyfile
+++ b/Doxyfile
@@ -24,6 +24,7 @@ FILE_PATTERNS          = xxhash.h
 # Note: xxHash's source files are technically ASCII only.
 INPUT_ENCODING         = UTF-8
 TAB_SIZE               = 4
+MARKDOWN_SUPPORT       = YES
 
 # xxHash is a C library
 OPTIMIZE_OUTPUT_FOR_C  = YES
@@ -52,3 +53,6 @@ HTML_FILE_EXTENSION    = .html
 HTML_COLORSTYLE_HUE    = 220
 HTML_COLORSTYLE_GAMMA  = 100
 HTML_COLORSTYLE_SAT    = 100
+
+# We don't want LaTeX.
+GENERATE_LATEX         = NO

From d4dbf709fcbccf39a3a253435be315baba8f4da0 Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Sun, 18 Oct 2020 16:58:06 -0400
Subject: [PATCH 037/187] Various dispatcher improvements

 - Test the compiler for AVX2/AVX512 support instead of unconditionally
   defining `XXH_DISPATCH_*`(fixes #464)
   - Can also be enabled/disabled on the command line
 - Use a macro template to reduce code repetition
 - Don't dispatch the scalar path when we don't need it. It can be
   rather wasteful, especially on 32-bit.
   - Specifically, don't dispatch when SSE2 is globally enabled on the
     compiler or when it is guaranteed on the platform.
 - Add some Doxygen documentation for xxh_x86dispatch.c.
---
 Doxyfile          |   2 +-
 xxh_x86dispatch.c | 593 +++++++++++++++++++++++-----------------------
 2 files changed, 299 insertions(+), 296 deletions(-)

diff --git a/Doxyfile b/Doxyfile
index 2c27c987..634e1e1a 100644
--- a/Doxyfile
+++ b/Doxyfile
@@ -20,7 +20,7 @@ QUIET                  = YES
 WARN_IF_UNDOCUMENTED   = NO
 
 # TODO: Add the other files. It is just xxhash.h for now.
-FILE_PATTERNS          = xxhash.h
+FILE_PATTERNS          = xxhash.h xxh_x86dispatch.c
 # Note: xxHash's source files are technically ASCII only.
 INPUT_ENCODING         = UTF-8
 TAB_SIZE               = 4
diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 1fc6fac8..8b433020 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -33,35 +33,132 @@
  */
 
 
+/*!
+ * @file xxh_x86dispatch.c
+ *
+ * Automatic dispatcher code for the @ref xxh3_family on x86-based targets.
+ *
+ * Optional add-on.
+ *
+ * @defgroup dispatch x86 Dispatcher
+ * @{
+ */
+
 #if defined (__cplusplus)
 extern "C" {
 #endif
 
-/*
- * Dispatcher code for XXH3 on x86-based targets.
- */
 #if !(defined(__x86_64__) || defined(__i386__) || defined(_M_IX86) || defined(_M_X64))
 #  error "Dispatching is currently only supported on x86 and x86_64."
 #endif
 
+#ifdef __has_include
+#  define XXH_HAS_INCLUDE(header) __has_include(header)
+#else
+#  define XXH_HAS_INCLUDE(header) 0
+#endif
+
+/*!
+ * @def XXH_DISPATCH_SCALAR
+ * @brief Enables/dispatching the scalar code path.
+ *
+ * If this is defined to 0, SSE2 support is assumed. This reduces code size
+ * when the scalar path is not needed.
+ *
+ * This is automatically defined to 0 when...
+ *   - SSE2 support is enabled in the compiler
+ *   - Targeting x86_64
+ *   - Targeting Android x86
+ *   - Targeting macOS
+ */
+#ifndef XXH_DISPATCH_SCALAR
+#  if defined(__SSE2__) || (defined(_M_IX86_FP) && _M_IX86_FP >= 2) /* SSE2 on by default */ \
+     || defined(__x86_64__) || defined(_M_X64) /* x86_64 */ \
+     || defined(__ANDROID__) || defined(__APPLEv__) /* Android or macOS */
+#     define XXH_DISPATCH_SCALAR 0 /* disable */
+#  else
+#     define XXH_DISPATCH_SCALAR 1
+#  endif
+#endif
+/*!
+ * @def XXH_DISPATCH_AVX2
+ * @brief Enables/disables dispatching for AVX2.
+ *
+ * This is automatically detected if it is not defined.
+ *  - GCC 4.7 and later are known to support AVX2.
+ *  - Visual Studio 2013 Update 2 and later are known to support AVX2.
+ *  - The GCC/Clang internal header `<avx2intrin.h>` is detected. While this is
+ *    not allowed to be included directly, it still appears in the builtin
+ *    include path and is detectable with `__has_include`.
+ *
+ * @see XXH_AVX2
+ */
+#ifndef XXH_DISPATCH_AVX2
+#  if (defined(__GNUC__) \
+       && (__GNUC__ > 4 || (__GNUC__ == 4 && __GNUC_MINOR__ >= 7))) /* GCC 4.7+ */ \
+   || (defined(_MSC_VER) && _MSC_VER >= 1900) /* VS 2015+ */ \
+   || (defined(_MSC_FULL_VER) && _MSC_FULL_VER >= 180030501) /* VS 2013 Update 2 */ \
+   || XXH_HAS_INCLUDE(<avx2intrin.h>) /* GCC/Clang internal header */
+#    define XXH_DISPATCH_AVX2 1   /* enable dispatch towards AVX2 */
+#  else
+#    define XXH_DISPATCH_AVX2 0
+#  endif
+#endif /* XXH_DISPATCH_AVX2 */
+
+/*!
+ * @def XXH_DISPATCH_AVX512
+ * @brief Enables/disables dispatching for AVX512.
+ *
+ * Automatically detected if one of the following conditions is met:
+ *  - GCC 4.9 and later are known to support AVX512.
+ *  - Visual Studio 2017  and later are known to support AVX2.
+ *  - The GCC/Clang internal header `<avx512fintrin.h>` is detected. While this
+ *    is not allowed to be included directly, it still appears in the builtin
+ *    include path and is detectable with `__has_include`.
+ *
+ * @see XXH_AVX512
+ */
+#ifndef XXH_DISPATCH_AVX512
+#  if (defined(__GNUC__) \
+       && (__GNUC__ > 4 || (__GNUC__ == 4 && __GNUC_MINOR__ >= 9))) /* GCC 4.9+ */ \
+   || (defined(_MSC_VER) && _MSC_VER >= 1910) /* VS 2017+ */ \
+   || XXH_HAS_INCLUDE(<avx512fintrin.h>) /* GCC/Clang internal header */
+#    define XXH_DISPATCH_AVX512 1   /* enable dispatch towards AVX512 */
+#  else
+#    define XXH_DISPATCH_AVX512 0
+#  endif
+#endif /* XXH_DISPATCH_AVX512 */
+
+/*!
+ * @def XXH_TARGET_SSE2
+ * @brief Allows a function to be compiled with SSE2 intrinsics.
+ *
+ * Uses `__attribute__((__target__("sse2")))` on GCC to allow SSE2 to be used
+ * even with `-mno-sse2`.
+ *
+ * @def XXH_TARGET_AVX2
+ * @brief Like @ref XXH_TARGET_SSE2, but for AVX2.
+ *
+ * @def XXH_TARGET_AVX512
+ * @brief Like @ref XXH_TARGET_SSE2, but for AVX512.
+ */
 #if defined(__GNUC__)
-#  include <immintrin.h> /* sse2 */
-#  include <emmintrin.h> /* avx2 */
-#  define XXH_TARGET_AVX512 __attribute__((__target__("avx512f")))
-#  define XXH_TARGET_AVX2 __attribute__((__target__("avx2")))
+#  include <emmintrin.h> /* SSE2 */
+#  if XXH_DISPATCH_AVX2 || XXH_DISPATCH_AVX512
+#    include <immintrin.h> /* AVX2, AVX512F */
+#  endif
 #  define XXH_TARGET_SSE2 __attribute__((__target__("sse2")))
+#  define XXH_TARGET_AVX2 __attribute__((__target__("avx2")))
+#  define XXH_TARGET_AVX512 __attribute__((__target__("avx512f")))
 #elif defined(_MSC_VER)
 #  include <intrin.h>
-#  define XXH_TARGET_AVX512
-#  define XXH_TARGET_AVX2
 #  define XXH_TARGET_SSE2
+#  define XXH_TARGET_AVX2
+#  define XXH_TARGET_AVX512
 #else
 #  error "Dispatching is currently not supported for your compiler."
 #endif
 
-#define XXH_DISPATCH_AVX2    /* enable dispatch towards AVX2 */
-#define XXH_DISPATCH_AVX512  /* enable dispatch towards AVX512 */
-
 #ifdef XXH_DISPATCH_DEBUG
 /* debug logging */
 #  include <stdio.h>
@@ -95,6 +192,13 @@ extern "C" {
 #  define I_ATT(intel, att) "{" att "|" intel "}\n\t"
 #endif
 
+/*!
+ * @internal
+ * @brief Runs CPUID.
+ *
+ * @param eax, ecx The parameters to pass to CPUID, %eax and %ecx respectively.
+ * @param abcd The array to store the result in, `{ eax, ebx, ecx, edx }`
+ */
 static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
 {
 #if defined(_MSC_VER)
@@ -131,7 +235,10 @@ static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
  */
 
 #if defined(XXH_DISPATCH_AVX2) || defined(XXH_DISPATCH_AVX512)
-/*
+/*!
+ * @internal
+ * @brief Runs `XGETBV`.
+ *
  * While the CPU may support AVX2, the operating system might not properly save
  * the full YMM/ZMM registers.
  *
@@ -170,15 +277,24 @@ static xxh_u64 XXH_xgetbv(void)
 #define AVX512F_CPUID_MASK (1 << 16)
 #define AVX512F_XGETBV_MASK ((7 << 5) | (1 << 2) | (1 << 1))
 
-/* Returns the best XXH3 implementation */
+/*!
+ * @internal
+ * @brief Returns the best XXH3 implementation.
+ *
+ * Runs various CPUID/XGETBV tests to try and determine the best implementation.
+ *
+ * @ret The best @ref XXH_VECTOR implementation.
+ * @see XXH_VECTOR_TYPES
+ */
 static int XXH_featureTest(void)
 {
     xxh_u32 abcd[4];
     xxh_u32 max_leaves;
     int best = XXH_SCALAR;
-#if defined(XXH_DISPATCH_AVX2) || defined(XXH_DISPATCH_AVX512)
+#if XXH_DISPATCH_AVX2 || XXH_DISPATCH_AVX512
     xxh_u64 xgetbv_val;
 #endif
+#if XXH_DISPATCH_SCALAR
 #if defined(__GNUC__) && defined(__i386__)
     xxh_u32 cpuid_supported;
     __asm__(
@@ -239,9 +355,10 @@ static int XXH_featureTest(void)
         return best;
 
     XXH_debugPrint("SSE2 support detected.");
+#endif /* XXH_DISPATCH_SCALAR */
 
     best = XXH_SSE2;
-#if defined(XXH_DISPATCH_AVX2) || defined(XXH_DISPATCH_AVX512)
+#if XXH_DISPATCH_AVX2 || XXH_DISPATCH_AVX512
     /* Make sure we have enough leaves */
     if (XXH_unlikely(max_leaves < 7))
         return best;
@@ -254,7 +371,7 @@ static int XXH_featureTest(void)
     XXH_cpuid(7, 0, abcd);
 
     xgetbv_val = XXH_xgetbv();
-#if defined(XXH_DISPATCH_AVX2)
+#if XXH_DISPATCH_AVX2
     /* Validate that AVX2 is supported by the CPU */
     if ((abcd[1] & AVX2_CPUID_MASK) != AVX2_CPUID_MASK)
         return best;
@@ -269,7 +386,7 @@ static int XXH_featureTest(void)
     XXH_debugPrint("AVX2 support detected.");
     best = XXH_AVX2;
 #endif
-#if defined(XXH_DISPATCH_AVX512)
+#if XXH_DISPATCH_AVX512
     /* Check if AVX512F is supported by the CPU */
     if ((abcd[1] & AVX512F_CPUID_MASK) != AVX512F_CPUID_MASK) {
         XXH_debugPrint("AVX512F not supported by CPU");
@@ -293,269 +410,117 @@ static int XXH_featureTest(void)
 
 /* ===   Vector implementations   === */
 
-/* ===   XXH3, default variants   === */
-
-XXH_NO_INLINE XXH64_hash_t
-XXHL64_default_scalar(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_64b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH64_hash_t
-XXHL64_default_sse2(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_64b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH64_hash_t
-XXHL64_default_avx2(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_64b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH64_hash_t
-XXHL64_default_avx512(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_64b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512);
-}
-#endif
-
-/* ===   XXH3, Seeded variants   === */
-
-XXH_NO_INLINE XXH64_hash_t
-XXHL64_seed_scalar(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_64b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar, XXH3_initCustomSecret_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH64_hash_t
-XXHL64_seed_sse2(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_64b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2, XXH3_initCustomSecret_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH64_hash_t
-XXHL64_seed_avx2(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_64b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2, XXH3_initCustomSecret_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH64_hash_t
-XXHL64_seed_avx512(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_64b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512, XXH3_initCustomSecret_avx512);
-}
-#endif
-
-/* ===   XXH3, Secret variants   === */
-
-XXH_NO_INLINE XXH64_hash_t
-XXHL64_secret_scalar(const void* XXH_RESTRICT input, size_t len, const void* secret, size_t secretLen)
-{
-    return XXH3_hashLong_64b_internal(input, len, secret, secretLen,
-                    XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH64_hash_t
-XXHL64_secret_sse2(const void* XXH_RESTRICT input, size_t len, const void* secret, size_t secretLen)
-{
-    return XXH3_hashLong_64b_internal(input, len, secret, secretLen,
-                    XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH64_hash_t
-XXHL64_secret_avx2(const void* XXH_RESTRICT input, size_t len, const void* secret, size_t secretLen)
-{
-    return XXH3_hashLong_64b_internal(input, len, secret, secretLen,
-                    XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH64_hash_t
-XXHL64_secret_avx512(const void* XXH_RESTRICT input, size_t len, const void* secret, size_t secretLen)
-{
-    return XXH3_hashLong_64b_internal(input, len, secret, secretLen,
-                    XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512);
-}
-#endif
-
-/* ===   XXH3 update variants   === */
-
-XXH_NO_INLINE XXH_errorcode
-XXH3_64bits_update_scalar(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH_errorcode
-XXH3_64bits_update_sse2(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH_errorcode
-XXH3_64bits_update_avx2(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH_errorcode
-XXH3_64bits_update_avx512(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512);
-}
-#endif
-
-/* ===   XXH128 default variants   === */
-
-XXH_NO_INLINE XXH128_hash_t
-XXHL128_default_scalar(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_128b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH128_hash_t
-XXHL128_default_sse2(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_128b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH128_hash_t
-XXHL128_default_avx2(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_128b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH128_hash_t
-XXHL128_default_avx512(const void* XXH_RESTRICT input, size_t len)
-{
-    return XXH3_hashLong_128b_internal(input, len, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512);
-}
-#endif
-
-/* ===   XXH128 Secret variants   === */
-
-XXH_NO_INLINE XXH128_hash_t
-XXHL128_secret_scalar(const void* XXH_RESTRICT input, size_t len, const void* XXH_RESTRICT secret, size_t secretLen)
-{
-    return XXH3_hashLong_128b_internal(input, len, (const xxh_u8*)secret, secretLen,
-                    XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH128_hash_t
-XXHL128_secret_sse2(const void* XXH_RESTRICT input, size_t len, const void* XXH_RESTRICT secret, size_t secretLen)
-{
-    return XXH3_hashLong_128b_internal(input, len, (const xxh_u8*)secret, secretLen,
-                    XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH128_hash_t
-XXHL128_secret_avx2(const void* XXH_RESTRICT input, size_t len, const void* XXH_RESTRICT secret, size_t secretLen)
-{
-    return XXH3_hashLong_128b_internal(input, len, (const xxh_u8*)secret, secretLen,
-                    XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH128_hash_t
-XXHL128_secret_avx512(const void* XXH_RESTRICT input, size_t len, const void* XXH_RESTRICT secret, size_t secretLen)
-{
-    return XXH3_hashLong_128b_internal(input, len, (const xxh_u8*)secret, secretLen,
-                    XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512);
-}
-#endif
-
-/* ===   XXH128 Seeded variants   === */
-
-XXH_NO_INLINE XXH128_hash_t
-XXHL128_seed_scalar(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_128b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar, XXH3_initCustomSecret_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH128_hash_t
-XXHL128_seed_sse2(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_128b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2, XXH3_initCustomSecret_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH128_hash_t
-XXHL128_seed_avx2(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_128b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2, XXH3_initCustomSecret_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH128_hash_t
-XXHL128_seed_avx512(const void* XXH_RESTRICT input, size_t len, XXH64_hash_t seed)
-{
-    return XXH3_hashLong_128b_withSeed_internal(input, len, seed,
-                    XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512, XXH3_initCustomSecret_avx512);
-}
-#endif
-
-/* ===   XXH128 update variants   === */
-
-XXH_NO_INLINE XXH_errorcode
-XXH3_128bits_update_scalar(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_scalar, XXH3_scrambleAcc_scalar);
-}
-
-XXH_NO_INLINE XXH_TARGET_SSE2 XXH_errorcode
-XXH3_128bits_update_sse2(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_sse2, XXH3_scrambleAcc_sse2);
-}
-
-#ifdef XXH_DISPATCH_AVX2
-XXH_NO_INLINE XXH_TARGET_AVX2 XXH_errorcode
-XXH3_128bits_update_avx2(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_avx2, XXH3_scrambleAcc_avx2);
-}
-#endif
-
-#ifdef XXH_DISPATCH_AVX512
-XXH_NO_INLINE XXH_TARGET_AVX512 XXH_errorcode
-XXH3_128bits_update_avx512(XXH3_state_t* state, const void* input, size_t len)
-{
-    return XXH3_update(state, (const xxh_u8*)input, len,
-                       XXH3_accumulate_512_avx512, XXH3_scrambleAcc_avx512);
-}
-#endif
+/*!
+ * @internal
+ * @brief Defines the various dispatch functions.
+ *
+ * TODO: Consolidate?
+ *
+ * @param suffix The suffix for the functions, e.g. sse2 or scalar
+ * @param target XXH_TARGET_* or empty.
+ */
+#define XXH_DEFINE_DISPATCH_FUNCS(suffix, target)                             \
+                                                                              \
+/* ===   XXH3, default variants   === */                                      \
+                                                                              \
+XXH_NO_INLINE target XXH64_hash_t                                             \
+XXHL64_default_##suffix(const void* XXH_RESTRICT input, size_t len)           \
+{                                                                             \
+    return XXH3_hashLong_64b_internal(                                        \
+               input, len, XXH3_kSecret, sizeof(XXH3_kSecret),                \
+               XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix        \
+    );                                                                        \
+}                                                                             \
+                                                                              \
+/* ===   XXH3, Seeded variants   === */                                       \
+                                                                              \
+XXH_NO_INLINE target XXH64_hash_t                                             \
+XXHL64_seed_##suffix(const void* XXH_RESTRICT input, size_t len,              \
+                     XXH64_hash_t seed)                                       \
+{                                                                             \
+    return XXH3_hashLong_64b_withSeed_internal(                               \
+                    input, len, seed, XXH3_accumulate_512_##suffix,           \
+                    XXH3_scrambleAcc_##suffix, XXH3_initCustomSecret_##suffix \
+    );                                                                        \
+}                                                                             \
+                                                                              \
+/* ===   XXH3, Secret variants   === */                                       \
+                                                                              \
+XXH_NO_INLINE target XXH64_hash_t                                             \
+XXHL64_secret_##suffix(const void* XXH_RESTRICT input, size_t len,            \
+                       const void* secret, size_t secretLen)                  \
+{                                                                             \
+    return XXH3_hashLong_64b_internal(                                        \
+                    input, len, secret, secretLen,                            \
+                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix   \
+    );                                                                        \
+}                                                                             \
+                                                                              \
+/* ===   XXH3 update variants   === */                                        \
+                                                                              \
+XXH_NO_INLINE target XXH_errorcode                                            \
+XXH3_64bits_update_##suffix(XXH3_state_t* state, const void* input,           \
+                            size_t len)                                       \
+{                                                                             \
+    return XXH3_update(state, (const xxh_u8*)input, len,                      \
+                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix); \
+}                                                                             \
+                                                                              \
+/* ===   XXH128 default variants   === */                                     \
+                                                                              \
+XXH_NO_INLINE target XXH128_hash_t                                            \
+XXHL128_default_##suffix(const void* XXH_RESTRICT input, size_t len)          \
+{                                                                             \
+    return XXH3_hashLong_128b_internal(                                       \
+                    input, len, XXH3_kSecret, sizeof(XXH3_kSecret),           \
+                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix   \
+    );                                                                        \
+}                                                                             \
+                                                                              \
+/* ===   XXH128 Secret variants   === */                                      \
+                                                                              \
+XXH_NO_INLINE target XXH128_hash_t                                            \
+XXHL128_secret_##suffix(const void* XXH_RESTRICT input, size_t len,           \
+                        const void* XXH_RESTRICT secret, size_t secretLen)    \
+{                                                                             \
+    return XXH3_hashLong_128b_internal(                                       \
+                    input, len, (const xxh_u8*)secret, secretLen,             \
+                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix); \
+}                                                                             \
+                                                                              \
+/* ===   XXH128 Seeded variants   === */                                      \
+                                                                              \
+XXH_NO_INLINE target XXH128_hash_t                                            \
+XXHL128_seed_##suffix(const void* XXH_RESTRICT input, size_t len,             \
+                      XXH64_hash_t seed)                                      \
+{                                                                             \
+    return XXH3_hashLong_128b_withSeed_internal(input, len, seed,             \
+                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix,  \
+                    XXH3_initCustomSecret_##suffix);                          \
+}                                                                             \
+                                                                              \
+/* ===   XXH128 update variants   === */                                      \
+                                                                              \
+XXH_NO_INLINE target XXH_errorcode                                            \
+XXH3_128bits_update_##suffix(XXH3_state_t* state, const void* input,          \
+                             size_t len)                                      \
+{                                                                             \
+    return XXH3_update(state, (const xxh_u8*)input, len,                      \
+                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix); \
+}
+/* End XXH_DEFINE_DISPATCH_FUNCS */
+
+#if XXH_DISPATCH_SCALAR
+XXH_DEFINE_DISPATCH_FUNCS(scalar, /* nothing */)
+#endif
+XXH_DEFINE_DISPATCH_FUNCS(sse2, XXH_TARGET_SSE2)
+#if XXH_DISPATCH_AVX2
+XXH_DEFINE_DISPATCH_FUNCS(avx2, XXH_TARGET_AVX2)
+#endif
+#if XXH_DISPATCH_AVX512
+XXH_DEFINE_DISPATCH_FUNCS(avx512, XXH_TARGET_AVX512)
+#endif
+#undef XXH_DEFINE_DISPATCH_FUNCS
 
 /* ====    Dispatchers    ==== */
 
@@ -574,21 +539,36 @@ typedef struct {
     XXH3_dispatchx86_update                update;
 } dispatchFunctions_s;
 
+/*!
+ * @internal
+ * @brief The selected dispatch table for @ref XXH3_64bits().
+ */
 static dispatchFunctions_s g_dispatch = { NULL, NULL, NULL, NULL};
 
 #define NB_DISPATCHES 4
+
+/*!
+ * @internal
+ * @brief Table of dispatchers for @ref XXH3_64bits().
+ *
+ * @pre The indices must match @ref XXH_VECTOR_TYPE.
+ */
 static const dispatchFunctions_s k_dispatch[NB_DISPATCHES] = {
-        /* scalar */ { XXHL64_default_scalar, XXHL64_seed_scalar, XXHL64_secret_scalar, XXH3_64bits_update_scalar },
-        /* sse2   */ { XXHL64_default_sse2,   XXHL64_seed_sse2,   XXHL64_secret_sse2,   XXH3_64bits_update_sse2 },
+#if XXH_DISPATCH_SCALAR
+    /* Scalar */ { XXHL64_default_scalar, XXHL64_seed_scalar, XXHL64_secret_scalar, XXH3_64bits_update_scalar },
+#else
+    /* Scalar */ { NULL, NULL, NULL, NULL },
+#endif
+    /* SSE2   */ { XXHL64_default_sse2,   XXHL64_seed_sse2,   XXHL64_secret_sse2,   XXH3_64bits_update_sse2 },
 #ifdef XXH_DISPATCH_AVX2
-        /* avx2   */ { XXHL64_default_avx2,   XXHL64_seed_avx2,   XXHL64_secret_avx2,   XXH3_64bits_update_avx2 },
+    /* AVX2   */ { XXHL64_default_avx2,   XXHL64_seed_avx2,   XXHL64_secret_avx2,   XXH3_64bits_update_avx2 },
 #else
-        /* avx2 */ { NULL, NULL, NULL, NULL },
+    /* AVX2   */ { NULL, NULL, NULL, NULL },
 #endif
 #ifdef XXH_DISPATCH_AVX512
-        /* avx512 */ { XXHL64_default_avx512, XXHL64_seed_avx512, XXHL64_secret_avx512, XXH3_64bits_update_avx512 }
+    /* AVX512 */ { XXHL64_default_avx512, XXHL64_seed_avx512, XXHL64_secret_avx512, XXH3_64bits_update_avx512 }
 #else
-        /* avx512 */ { NULL, NULL, NULL, NULL }
+    /* AVX512 */ { NULL, NULL, NULL, NULL }
 #endif
 };
 
@@ -605,32 +585,54 @@ typedef struct {
     XXH3_dispatchx86_update                 update;
 } dispatch128Functions_s;
 
+
+/*!
+ * @internal
+ * @brief The selected dispatch table for @ref XXH3_64bits().
+ */
 static dispatch128Functions_s g_dispatch128 = { NULL, NULL, NULL, NULL };
 
+/*!
+ * @internal
+ * @brief Table of dispatchers for @ref XXH3_128bits().
+ *
+ * @pre The indices must match @ref XXH_VECTOR_TYPE.
+ */
 static const dispatch128Functions_s k_dispatch128[NB_DISPATCHES] = {
-        /* scalar */ { XXHL128_default_scalar, XXHL128_seed_scalar, XXHL128_secret_scalar, XXH3_128bits_update_scalar },
-        /* sse2   */ { XXHL128_default_sse2,   XXHL128_seed_sse2,   XXHL128_secret_sse2,   XXH3_128bits_update_sse2 },
+#if XXH_DISPATCH_SCALAR
+    /* Scalar */ { XXHL128_default_scalar, XXHL128_seed_scalar, XXHL128_secret_scalar, XXH3_128bits_update_scalar },
+#else
+    /* Scalar */ { NULL, NULL, NULL, NULL },
+#endif
+    /* SSE2   */ { XXHL128_default_sse2,   XXHL128_seed_sse2,   XXHL128_secret_sse2,   XXH3_128bits_update_sse2 },
 #ifdef XXH_DISPATCH_AVX2
-        /* avx2   */ { XXHL128_default_avx2,   XXHL128_seed_avx2,   XXHL128_secret_avx2,   XXH3_128bits_update_avx2 },
+    /* AVX2   */ { XXHL128_default_avx2,   XXHL128_seed_avx2,   XXHL128_secret_avx2,   XXH3_128bits_update_avx2 },
 #else
-        /* avx2 */ { NULL, NULL, NULL, NULL },
+    /* AVX2   */ { NULL, NULL, NULL, NULL },
 #endif
 #ifdef XXH_DISPATCH_AVX512
-        /* avx512 */ { XXHL128_default_avx512, XXHL128_seed_avx512, XXHL128_secret_avx512, XXH3_128bits_update_avx512 }
+    /* AVX512 */ { XXHL128_default_avx512, XXHL128_seed_avx512, XXHL128_secret_avx512, XXH3_128bits_update_avx512 }
 #else
-        /* avx512 */ { NULL, NULL, NULL, NULL }
+    /* AVX512 */ { NULL, NULL, NULL, NULL }
 #endif
 };
 
+/*!
+ * @internal
+ * @brief Runs a CPUID check and sets the correct dispatch tables.
+ */
 static void setDispatch(void)
 {
     int vecID = XXH_featureTest();
     XXH_STATIC_ASSERT(XXH_AVX512 == NB_DISPATCHES-1);
     assert(XXH_SCALAR <= vecID && vecID <= XXH_AVX512);
-#ifndef XXH_DISPATCH_AVX512
+#if !XXH_DISPATCH_SCALAR
+    assert(vecID != XXH_SCALAR);
+#endif
+#if !XXH_DISPATCH_AVX512
     assert(vecID != XXH_AVX512);
 #endif
-#ifndef XXH_DISPATCH_AVX2
+#if !XXH_DISPATCH_AVX2
     assert(vecID != XXH_AVX2);
 #endif
     g_dispatch = k_dispatch[vecID];
@@ -744,3 +746,4 @@ XXH3_128bits_update_dispatch(XXH3_state_t* state, const void* input, size_t len)
 #if defined (__cplusplus)
 }
 #endif
+/*! @} */

From 5a31cf5834d704c3f72459acac399417e2050e9e Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Mon, 19 Oct 2020 10:19:56 -0400
Subject: [PATCH 038/187] Add sanity check for -mavx2 etc on dispatcher

Unless XXH_X86DISPATCH_ALLOW_AVX is defined, xxh_x86dispatch.c will
now error if it is compiled with `__AVX__` defined.

This prevents the misconception that xxh_x86dispatch.c is supposed
to be compiled with -mavx2/-mavx512f/-march=haswell/etc, preventing
difficult to notice crashes when the compiler generates VEX prefix
instructions.
---
 xxh_x86dispatch.c | 33 +++++++++++++++++++++++++++++++++
 1 file changed, 33 insertions(+)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 8b433020..9f491fdd 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -40,6 +40,10 @@
  *
  * Optional add-on.
  *
+ * **Compile this file with the default flags for your target.** Do not compile
+ * with flags like `-mavx*`, `-march=native`, or `/arch:AVX*`, there will be
+ * an error. See @ref XXH_X86DISPATCH_ALLOW_AVX for details.
+ *
  * @defgroup dispatch x86 Dispatcher
  * @{
  */
@@ -52,6 +56,35 @@ extern "C" {
 #  error "Dispatching is currently only supported on x86 and x86_64."
 #endif
 
+/*!
+ * @def XXH_X86DISPATCH_ALLOW_AVX
+ * @brief Disables the AVX sanity check.
+ *
+ * Don't compile xxh_x86dispatch.c with options like `-mavx*`, `-march=native`,
+ * or `/arch:AVX*`. It is intended to be compiled for the minumum target, and
+ * it selectively enables SSE2, AVX2, and AVX512 when it is needed.
+ *
+ * Using this option _globally_ allows this feature, and therefore makes it
+ * undefined behavior to execute on any CPU without said feature.
+ *
+ * Even if the source code isn't directly using AVX intrinsics in a function,
+ * the compiler can still generate AVX code from autovectorization and by
+ * "upgrading" SSE2 intrinsics to use the VEX prefixes (a.k.a. AVX128).
+ *
+ * Use the same flags that you use to compile the rest of the program; this
+ * file will safely generate SSE2, AVX2, and AVX512 without these flags.
+ *
+ * Define XXH_X86DISPATCH_ALLOW_AVX to ignore this check, and feel free to open
+ * an issue if there is a target in the future where AVX is a default feature.
+ */
+#ifdef XXH_DOXYGEN
+#  define XXH_X86DISPATCH_ALLOW_AVX
+#endif
+
+#if defined(__AVX__) && !defined(XXH_X86DISPATCH_ALLOW_AVX)
+#  error "Do not compile xxh_x86dispatch.c with AVX enabled! See the comment above."
+#endif
+
 #ifdef __has_include
 #  define XXH_HAS_INCLUDE(header) __has_include(header)
 #else

From 81e11d991bfbba017a9cc3888fc34f4ea29fa4fa Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Mon, 19 Oct 2020 10:25:03 -0400
Subject: [PATCH 039/187] Disable disabled dispatch paths in xxhash.h

Now, when `XXH_DISPATCH_AVX*` is zero, the codepaths will be properly
disabled, preventing any issues from old compilers.

Fixes #464 (again).
---
 xxhash.h | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 5e65fea1..6be5466b 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3678,7 +3678,8 @@ XXH_FORCE_INLINE void XXH_writeLE64(void* dst, xxh_u64 v64)
  * Both XXH3_64bits and XXH3_128bits use this subroutine.
  */
 
-#if (XXH_VECTOR == XXH_AVX512) || defined(XXH_X86DISPATCH)
+#if (XXH_VECTOR == XXH_AVX512) \
+     || (defined(XXH_DISPATCH_AVX512) && XXH_DISPATCH_AVX512 != 0)
 
 #ifndef XXH_TARGET_AVX512
 # define XXH_TARGET_AVX512  /* disable attribute target */
@@ -3784,7 +3785,8 @@ XXH3_initCustomSecret_avx512(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
 
 #endif
 
-#if (XXH_VECTOR == XXH_AVX2) || defined(XXH_X86DISPATCH)
+#if (XXH_VECTOR == XXH_AVX2) \
+    || (defined(XXH_DISPATCH_AVX2) && XXH_DISPATCH_AVX2 != 0)
 
 #ifndef XXH_TARGET_AVX2
 # define XXH_TARGET_AVX2  /* disable attribute target */
@@ -3890,6 +3892,7 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void XXH3_initCustomSecret_avx2(void* XXH_RESTR
 
 #endif
 
+/* x86dispatch always generates SSE2 */
 #if (XXH_VECTOR == XXH_SSE2) || defined(XXH_X86DISPATCH)
 
 #ifndef XXH_TARGET_SSE2

From 084f72dc36f6ec36849a1a170a671f873a1036fa Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Mon, 19 Oct 2020 15:51:10 -0400
Subject: [PATCH 040/187] Dispatcher: Final fixes

 - Remove duplicate XXH3_update dispatch (Both paths are identical as of
   0.8.0)
 - Namespace everything
 - Fix some final compilation issues I missed.
---
 xxh_x86dispatch.c | 168 ++++++++++++++++++++++------------------------
 1 file changed, 79 insertions(+), 89 deletions(-)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 9f491fdd..6387619b 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -220,9 +220,9 @@ extern "C" {
  * Note: Comments are written in the inline assembly itself.
  */
 #ifdef __clang__
-#  define I_ATT(intel, att) att "\n\t"
+#  define XXH_I_ATT(intel, att) att "\n\t"
 #else
-#  define I_ATT(intel, att) "{" att "|" intel "}\n\t"
+#  define XXH_I_ATT(intel, att) "{" att "|" intel "}\n\t"
 #endif
 
 /*!
@@ -244,14 +244,14 @@ static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
         "#\n\t"
         "# On 32-bit x86 with PIC enabled, we are not allowed to overwrite\n\t"
         "# EBX, so we use EDI instead.\n\t"
-        I_ATT("mov     edi, ebx",   "movl    %%ebx, %%edi")
-        I_ATT("cpuid",              "cpuid"               )
-        I_ATT("xchg    edi, ebx",   "xchgl   %%ebx, %%edi")
+        XXH_I_ATT("mov     edi, ebx",   "movl    %%ebx, %%edi")
+        XXH_I_ATT("cpuid",              "cpuid"               )
+        XXH_I_ATT("xchg    edi, ebx",   "xchgl   %%ebx, %%edi")
         : "=D" (ebx),
 # else
     __asm__(
         "# Call CPUID\n\t"
-        I_ATT("cpuid",              "cpuid")
+        XXH_I_ATT("cpuid",              "cpuid")
         : "=b" (ebx),
 # endif
               "+a" (eax), "+c" (ecx), "=d" (edx));
@@ -267,7 +267,7 @@ static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
  * https://software.intel.com/en-us/articles/how-to-detect-new-instruction-support-in-the-4th-generation-intel-core-processor-family
  */
 
-#if defined(XXH_DISPATCH_AVX2) || defined(XXH_DISPATCH_AVX512)
+#if XXH_DISPATCH_AVX2 || XXH_DISPATCH_AVX512
 /*!
  * @internal
  * @brief Runs `XGETBV`.
@@ -303,12 +303,12 @@ static xxh_u64 XXH_xgetbv(void)
 }
 #endif
 
-#define SSE2_CPUID_MASK (1 << 26)
-#define OSXSAVE_CPUID_MASK ((1 << 26) | (1 << 27))
-#define AVX2_CPUID_MASK (1 << 5)
-#define AVX2_XGETBV_MASK ((1 << 2) | (1 << 1))
-#define AVX512F_CPUID_MASK (1 << 16)
-#define AVX512F_XGETBV_MASK ((7 << 5) | (1 << 2) | (1 << 1))
+#define XXH_SSE2_CPUID_MASK (1 << 26)
+#define XXH_OSXSAVE_CPUID_MASK ((1 << 26) | (1 << 27))
+#define XXH_AVX2_CPUID_MASK (1 << 5)
+#define XXH_AVX2_XGETBV_MASK ((1 << 2) | (1 << 1))
+#define XXH_AVX512F_CPUID_MASK (1 << 16)
+#define XXH_AVX512F_XGETBV_MASK ((7 << 5) | (1 << 2) | (1 << 1))
 
 /*!
  * @internal
@@ -343,23 +343,23 @@ static int XXH_featureTest(void)
         "# Routine is from <https://wiki.osdev.org/CPUID>.\n\t"
 
         "# Save EFLAGS\n\t"
-        I_ATT("pushfd",                           "pushfl"                    )
+        XXH_I_ATT("pushfd",                           "pushfl"                    )
         "# Store EFLAGS\n\t"
-        I_ATT("pushfd",                           "pushfl"                    )
+        XXH_I_ATT("pushfd",                           "pushfl"                    )
         "# Invert the ID bit in stored EFLAGS\n\t"
-        I_ATT("xor     dword ptr[esp], 0x200000", "xorl    $0x200000, (%%esp)")
+        XXH_I_ATT("xor     dword ptr[esp], 0x200000", "xorl    $0x200000, (%%esp)")
         "# Load stored EFLAGS (with ID bit inverted)\n\t"
-        I_ATT("popfd",                            "popfl"                     )
+        XXH_I_ATT("popfd",                            "popfl"                     )
         "# Store EFLAGS again (ID bit may or not be inverted)\n\t"
-        I_ATT("pushfd",                           "pushfl"                    )
+        XXH_I_ATT("pushfd",                           "pushfl"                    )
         "# eax = modified EFLAGS (ID bit may or may not be inverted)\n\t"
-        I_ATT("pop     eax",                      "popl    %%eax"             )
+        XXH_I_ATT("pop     eax",                      "popl    %%eax"             )
         "# eax = whichever bits were changed\n\t"
-        I_ATT("xor     eax, dword ptr[esp]",      "xorl    (%%esp), %%eax"    )
+        XXH_I_ATT("xor     eax, dword ptr[esp]",      "xorl    (%%esp), %%eax"    )
         "# Restore original EFLAGS\n\t"
-        I_ATT("popfd",                            "popfl"                     )
+        XXH_I_ATT("popfd",                            "popfl"                     )
         "# eax = zero if ID bit can't be changed, else non-zero\n\t"
-        I_ATT("and     eax, 0x200000",            "andl    $0x200000, %%eax"  )
+        XXH_I_ATT("and     eax, 0x200000",            "andl    $0x200000, %%eax"  )
         : "=a" (cpuid_supported) :: "cc");
 
     if (XXH_unlikely(!cpuid_supported)) {
@@ -384,7 +384,7 @@ static int XXH_featureTest(void)
     /*
      * Test for SSE2. The check is redundant on x86_64, but it doesn't hurt.
      */
-    if (XXH_unlikely((abcd[3] & SSE2_CPUID_MASK) != SSE2_CPUID_MASK))
+    if (XXH_unlikely((abcd[3] & XXH_SSE2_CPUID_MASK) != XXH_SSE2_CPUID_MASK))
         return best;
 
     XXH_debugPrint("SSE2 support detected.");
@@ -397,7 +397,7 @@ static int XXH_featureTest(void)
         return best;
 
     /* Test for OSXSAVE and XGETBV */
-    if ((abcd[2] & OSXSAVE_CPUID_MASK) != OSXSAVE_CPUID_MASK)
+    if ((abcd[2] & XXH_OSXSAVE_CPUID_MASK) != XXH_OSXSAVE_CPUID_MASK)
         return best;
 
     /* CPUID check for AVX features */
@@ -406,11 +406,11 @@ static int XXH_featureTest(void)
     xgetbv_val = XXH_xgetbv();
 #if XXH_DISPATCH_AVX2
     /* Validate that AVX2 is supported by the CPU */
-    if ((abcd[1] & AVX2_CPUID_MASK) != AVX2_CPUID_MASK)
+    if ((abcd[1] & XXH_AVX2_CPUID_MASK) != XXH_AVX2_CPUID_MASK)
         return best;
 
     /* Validate that the OS supports YMM registers */
-    if ((xgetbv_val & AVX2_XGETBV_MASK) != AVX2_XGETBV_MASK) {
+    if ((xgetbv_val & XXH_AVX2_XGETBV_MASK) != XXH_AVX2_XGETBV_MASK) {
         XXH_debugPrint("AVX2 supported by the CPU, but not the OS.");
         return best;
     }
@@ -421,13 +421,13 @@ static int XXH_featureTest(void)
 #endif
 #if XXH_DISPATCH_AVX512
     /* Check if AVX512F is supported by the CPU */
-    if ((abcd[1] & AVX512F_CPUID_MASK) != AVX512F_CPUID_MASK) {
+    if ((abcd[1] & XXH_AVX512F_CPUID_MASK) != XXH_AVX512F_CPUID_MASK) {
         XXH_debugPrint("AVX512F not supported by CPU");
         return best;
     }
 
     /* Validate that the OS supports ZMM registers */
-    if ((xgetbv_val & AVX512F_XGETBV_MASK) != AVX512F_XGETBV_MASK) {
+    if ((xgetbv_val & XXH_AVX512F_XGETBV_MASK) != XXH_AVX512F_XGETBV_MASK) {
         XXH_debugPrint("AVX512F supported by the CPU, but not the OS.");
         return best;
     }
@@ -492,8 +492,7 @@ XXHL64_secret_##suffix(const void* XXH_RESTRICT input, size_t len,            \
 /* ===   XXH3 update variants   === */                                        \
                                                                               \
 XXH_NO_INLINE target XXH_errorcode                                            \
-XXH3_64bits_update_##suffix(XXH3_state_t* state, const void* input,           \
-                            size_t len)                                       \
+XXH3_update_##suffix(XXH3_state_t* state, const void* input, size_t len)      \
 {                                                                             \
     return XXH3_update(state, (const xxh_u8*)input, len,                      \
                     XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix); \
@@ -530,17 +529,8 @@ XXHL128_seed_##suffix(const void* XXH_RESTRICT input, size_t len,             \
     return XXH3_hashLong_128b_withSeed_internal(input, len, seed,             \
                     XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix,  \
                     XXH3_initCustomSecret_##suffix);                          \
-}                                                                             \
-                                                                              \
-/* ===   XXH128 update variants   === */                                      \
-                                                                              \
-XXH_NO_INLINE target XXH_errorcode                                            \
-XXH3_128bits_update_##suffix(XXH3_state_t* state, const void* input,          \
-                             size_t len)                                      \
-{                                                                             \
-    return XXH3_update(state, (const xxh_u8*)input, len,                      \
-                    XXH3_accumulate_512_##suffix, XXH3_scrambleAcc_##suffix); \
 }
+
 /* End XXH_DEFINE_DISPATCH_FUNCS */
 
 #if XXH_DISPATCH_SCALAR
@@ -570,15 +560,9 @@ typedef struct {
     XXH3_dispatchx86_hashLong64_withSeed   hashLong64_seed;
     XXH3_dispatchx86_hashLong64_withSecret hashLong64_secret;
     XXH3_dispatchx86_update                update;
-} dispatchFunctions_s;
+} XXH_dispatchFunctions_s;
 
-/*!
- * @internal
- * @brief The selected dispatch table for @ref XXH3_64bits().
- */
-static dispatchFunctions_s g_dispatch = { NULL, NULL, NULL, NULL};
-
-#define NB_DISPATCHES 4
+#define XXH_NB_DISPATCHES 4
 
 /*!
  * @internal
@@ -586,24 +570,30 @@ static dispatchFunctions_s g_dispatch = { NULL, NULL, NULL, NULL};
  *
  * @pre The indices must match @ref XXH_VECTOR_TYPE.
  */
-static const dispatchFunctions_s k_dispatch[NB_DISPATCHES] = {
+static const XXH_dispatchFunctions_s XXH_kDispatch[XXH_NB_DISPATCHES] = {
 #if XXH_DISPATCH_SCALAR
-    /* Scalar */ { XXHL64_default_scalar, XXHL64_seed_scalar, XXHL64_secret_scalar, XXH3_64bits_update_scalar },
+    /* Scalar */ { XXHL64_default_scalar, XXHL64_seed_scalar, XXHL64_secret_scalar, XXH3_update_scalar },
 #else
     /* Scalar */ { NULL, NULL, NULL, NULL },
 #endif
-    /* SSE2   */ { XXHL64_default_sse2,   XXHL64_seed_sse2,   XXHL64_secret_sse2,   XXH3_64bits_update_sse2 },
-#ifdef XXH_DISPATCH_AVX2
-    /* AVX2   */ { XXHL64_default_avx2,   XXHL64_seed_avx2,   XXHL64_secret_avx2,   XXH3_64bits_update_avx2 },
+    /* SSE2   */ { XXHL64_default_sse2,   XXHL64_seed_sse2,   XXHL64_secret_sse2,   XXH3_update_sse2 },
+#if XXH_DISPATCH_AVX2
+    /* AVX2   */ { XXHL64_default_avx2,   XXHL64_seed_avx2,   XXHL64_secret_avx2,   XXH3_update_avx2 },
 #else
     /* AVX2   */ { NULL, NULL, NULL, NULL },
 #endif
-#ifdef XXH_DISPATCH_AVX512
-    /* AVX512 */ { XXHL64_default_avx512, XXHL64_seed_avx512, XXHL64_secret_avx512, XXH3_64bits_update_avx512 }
+#if XXH_DISPATCH_AVX512
+    /* AVX512 */ { XXHL64_default_avx512, XXHL64_seed_avx512, XXHL64_secret_avx512, XXH3_update_avx512 }
 #else
     /* AVX512 */ { NULL, NULL, NULL, NULL }
 #endif
 };
+/*!
+ * @internal
+ * @brief The selected dispatch table for @ref XXH3_64bits().
+ */
+static XXH_dispatchFunctions_s XXH_g_dispatch = { NULL, NULL, NULL, NULL };
+
 
 typedef XXH128_hash_t (*XXH3_dispatchx86_hashLong128_default)(const void* XXH_RESTRICT, size_t);
 
@@ -616,48 +606,48 @@ typedef struct {
     XXH3_dispatchx86_hashLong128_withSeed   hashLong128_seed;
     XXH3_dispatchx86_hashLong128_withSecret hashLong128_secret;
     XXH3_dispatchx86_update                 update;
-} dispatch128Functions_s;
+} XXH_dispatch128Functions_s;
 
 
-/*!
- * @internal
- * @brief The selected dispatch table for @ref XXH3_64bits().
- */
-static dispatch128Functions_s g_dispatch128 = { NULL, NULL, NULL, NULL };
-
 /*!
  * @internal
  * @brief Table of dispatchers for @ref XXH3_128bits().
  *
  * @pre The indices must match @ref XXH_VECTOR_TYPE.
  */
-static const dispatch128Functions_s k_dispatch128[NB_DISPATCHES] = {
+static const XXH_dispatch128Functions_s XXH_kDispatch128[XXH_NB_DISPATCHES] = {
 #if XXH_DISPATCH_SCALAR
-    /* Scalar */ { XXHL128_default_scalar, XXHL128_seed_scalar, XXHL128_secret_scalar, XXH3_128bits_update_scalar },
+    /* Scalar */ { XXHL128_default_scalar, XXHL128_seed_scalar, XXHL128_secret_scalar, XXH3_update_scalar },
 #else
     /* Scalar */ { NULL, NULL, NULL, NULL },
 #endif
-    /* SSE2   */ { XXHL128_default_sse2,   XXHL128_seed_sse2,   XXHL128_secret_sse2,   XXH3_128bits_update_sse2 },
-#ifdef XXH_DISPATCH_AVX2
-    /* AVX2   */ { XXHL128_default_avx2,   XXHL128_seed_avx2,   XXHL128_secret_avx2,   XXH3_128bits_update_avx2 },
+    /* SSE2   */ { XXHL128_default_sse2,   XXHL128_seed_sse2,   XXHL128_secret_sse2,   XXH3_update_sse2 },
+#if XXH_DISPATCH_AVX2
+    /* AVX2   */ { XXHL128_default_avx2,   XXHL128_seed_avx2,   XXHL128_secret_avx2,   XXH3_update_avx2 },
 #else
     /* AVX2   */ { NULL, NULL, NULL, NULL },
 #endif
-#ifdef XXH_DISPATCH_AVX512
-    /* AVX512 */ { XXHL128_default_avx512, XXHL128_seed_avx512, XXHL128_secret_avx512, XXH3_128bits_update_avx512 }
+#if XXH_DISPATCH_AVX512
+    /* AVX512 */ { XXHL128_default_avx512, XXHL128_seed_avx512, XXHL128_secret_avx512, XXH3_update_avx512 }
 #else
     /* AVX512 */ { NULL, NULL, NULL, NULL }
 #endif
 };
 
+/*!
+ * @internal
+ * @brief The selected dispatch table for @ref XXH3_64bits().
+ */
+static XXH_dispatch128Functions_s XXH_g_dispatch128 = { NULL, NULL, NULL, NULL };
+
 /*!
  * @internal
  * @brief Runs a CPUID check and sets the correct dispatch tables.
  */
-static void setDispatch(void)
+static void XXH_setDispatch(void)
 {
     int vecID = XXH_featureTest();
-    XXH_STATIC_ASSERT(XXH_AVX512 == NB_DISPATCHES-1);
+    XXH_STATIC_ASSERT(XXH_AVX512 == XXH_NB_DISPATCHES-1);
     assert(XXH_SCALAR <= vecID && vecID <= XXH_AVX512);
 #if !XXH_DISPATCH_SCALAR
     assert(vecID != XXH_SCALAR);
@@ -668,8 +658,8 @@ static void setDispatch(void)
 #if !XXH_DISPATCH_AVX2
     assert(vecID != XXH_AVX2);
 #endif
-    g_dispatch = k_dispatch[vecID];
-    g_dispatch128 = k_dispatch128[vecID];
+    XXH_g_dispatch = XXH_kDispatch[vecID];
+    XXH_g_dispatch128 = XXH_kDispatch128[vecID];
 }
 
 
@@ -680,8 +670,8 @@ XXH3_hashLong_64b_defaultSecret_selection(const void* input, size_t len,
                                           XXH64_hash_t seed64, const xxh_u8* secret, size_t secretLen)
 {
     (void)seed64; (void)secret; (void)secretLen;
-    if (g_dispatch.hashLong64_default == NULL) setDispatch();
-    return g_dispatch.hashLong64_default(input, len);
+    if (XXH_g_dispatch.hashLong64_default == NULL) XXH_setDispatch();
+    return XXH_g_dispatch.hashLong64_default(input, len);
 }
 
 XXH64_hash_t XXH3_64bits_dispatch(const void* input, size_t len)
@@ -694,8 +684,8 @@ XXH3_hashLong_64b_withSeed_selection(const void* input, size_t len,
                                      XXH64_hash_t seed64, const xxh_u8* secret, size_t secretLen)
 {
     (void)secret; (void)secretLen;
-    if (g_dispatch.hashLong64_seed == NULL) setDispatch();
-    return g_dispatch.hashLong64_seed(input, len, seed64);
+    if (XXH_g_dispatch.hashLong64_seed == NULL) XXH_setDispatch();
+    return XXH_g_dispatch.hashLong64_seed(input, len, seed64);
 }
 
 XXH64_hash_t XXH3_64bits_withSeed_dispatch(const void* input, size_t len, XXH64_hash_t seed)
@@ -708,8 +698,8 @@ XXH3_hashLong_64b_withSecret_selection(const void* input, size_t len,
                                        XXH64_hash_t seed64, const xxh_u8* secret, size_t secretLen)
 {
     (void)seed64;
-    if (g_dispatch.hashLong64_secret == NULL) setDispatch();
-    return g_dispatch.hashLong64_secret(input, len, secret, secretLen);
+    if (XXH_g_dispatch.hashLong64_secret == NULL) XXH_setDispatch();
+    return XXH_g_dispatch.hashLong64_secret(input, len, secret, secretLen);
 }
 
 XXH64_hash_t XXH3_64bits_withSecret_dispatch(const void* input, size_t len, const void* secret, size_t secretLen)
@@ -720,8 +710,8 @@ XXH64_hash_t XXH3_64bits_withSecret_dispatch(const void* input, size_t len, cons
 XXH_errorcode
 XXH3_64bits_update_dispatch(XXH3_state_t* state, const void* input, size_t len)
 {
-    if (g_dispatch.update == NULL) setDispatch();
-    return g_dispatch.update(state, (const xxh_u8*)input, len);
+    if (XXH_g_dispatch.update == NULL) XXH_setDispatch();
+    return XXH_g_dispatch.update(state, (const xxh_u8*)input, len);
 }
 
 
@@ -732,8 +722,8 @@ XXH3_hashLong_128b_defaultSecret_selection(const void* input, size_t len,
                                            XXH64_hash_t seed64, const void* secret, size_t secretLen)
 {
     (void)seed64; (void)secret; (void)secretLen;
-    if (g_dispatch128.hashLong128_default == NULL) setDispatch();
-    return g_dispatch128.hashLong128_default(input, len);
+    if (XXH_g_dispatch128.hashLong128_default == NULL) XXH_setDispatch();
+    return XXH_g_dispatch128.hashLong128_default(input, len);
 }
 
 XXH128_hash_t XXH3_128bits_dispatch(const void* input, size_t len)
@@ -746,8 +736,8 @@ XXH3_hashLong_128b_withSeed_selection(const void* input, size_t len,
                                      XXH64_hash_t seed64, const void* secret, size_t secretLen)
 {
     (void)secret; (void)secretLen;
-    if (g_dispatch128.hashLong128_seed == NULL) setDispatch();
-    return g_dispatch128.hashLong128_seed(input, len, seed64);
+    if (XXH_g_dispatch128.hashLong128_seed == NULL) XXH_setDispatch();
+    return XXH_g_dispatch128.hashLong128_seed(input, len, seed64);
 }
 
 XXH128_hash_t XXH3_128bits_withSeed_dispatch(const void* input, size_t len, XXH64_hash_t seed)
@@ -760,8 +750,8 @@ XXH3_hashLong_128b_withSecret_selection(const void* input, size_t len,
                                         XXH64_hash_t seed64, const void* secret, size_t secretLen)
 {
     (void)seed64;
-    if (g_dispatch128.hashLong128_secret == NULL) setDispatch();
-    return g_dispatch128.hashLong128_secret(input, len, secret, secretLen);
+    if (XXH_g_dispatch128.hashLong128_secret == NULL) XXH_setDispatch();
+    return XXH_g_dispatch128.hashLong128_secret(input, len, secret, secretLen);
 }
 
 XXH128_hash_t XXH3_128bits_withSecret_dispatch(const void* input, size_t len, const void* secret, size_t secretLen)
@@ -772,8 +762,8 @@ XXH128_hash_t XXH3_128bits_withSecret_dispatch(const void* input, size_t len, co
 XXH_errorcode
 XXH3_128bits_update_dispatch(XXH3_state_t* state, const void* input, size_t len)
 {
-    if (g_dispatch128.update == NULL) setDispatch();
-    return g_dispatch128.update(state, (const xxh_u8*)input, len);
+    if (XXH_g_dispatch128.update == NULL) XXH_setDispatch();
+    return XXH_g_dispatch128.update(state, (const xxh_u8*)input, len);
 }
 
 #if defined (__cplusplus)

From dabb850dfa4193f354f5f1a8f13edc996b217a64 Mon Sep 17 00:00:00 2001
From: Devon Powell <devon.f.powell@gmail.com>
Date: Sun, 25 Oct 2020 21:23:04 -0400
Subject: [PATCH 041/187] Fix issue with XXHASH_BUILD_XXHSUM cmake setting

Using `set(... CACHE ...)` causes `XXHASH_BUILD_XXHSUM` to be stuck `ON` even
when doing a `set(XXHASH_BUILD_XXHSUM OFF)` for some cmake version/generators.
Switching to `option(...)` fixes this issue and allows you to properly turn off
the building of xxhsum in all situations.
---
 cmake_unofficial/CMakeLists.txt | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index cd38be4b..41c71121 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -44,7 +44,7 @@ if(NOT CMAKE_CONFIGURATION_TYPES)
 endif()
 
 option(BUILD_SHARED_LIBS "Build shared library" ON)
-set(XXHASH_BUILD_XXHSUM ON CACHE BOOL "Build the xxhsum binary")
+option(XXHASH_BUILD_XXHSUM "Build the xxhsum binary" ON)
 
 # If XXHASH is being bundled in another project, we don't want to
 # install anything.  However, we want to let people override this, so

From 4122b83f992d7b5c00692b3eb29db0d736ea2474 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 12 Nov 2020 10:48:14 -0800
Subject: [PATCH 042/187] made travis tests more thorough

no longer tolerate compilation warnings
---
 .travis.yml | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

diff --git a/.travis.yml b/.travis.yml
index 675965f7..2f2106df 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -8,24 +8,22 @@ matrix:
   fast_finish: true
   include:
 
-    - name: General linux tests (Xenial)
-      dist: xenial
+    - name: General linux x64 tests
       arch: amd64
       addons:
         apt:
           packages:
-            - clang
             - g++-multilib
             - gcc-multilib
             - cppcheck
       script:
         - make -B test-all
         - make clean
-        - make dispatch
+        - CFLAGS="-Werror" make dispatch
         - make clean
-        - CC=g++ CFLAGS="-O1 -mavx512f" make
+        - CC=g++ CFLAGS="-O1 -mavx512f -Werror" make
         - make clean
-        - CC=g++ CFLAGS="-Wall -Wextra" make DISPATCH=1
+        - CC=g++ CFLAGS="-Wall -Wextra -Werror" make DISPATCH=1
 
 
     - name: Check results consistency on x64

From 21cf00d42dcdff80e8ccdafd326ad765715dfa52 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 12 Nov 2020 11:57:32 -0800
Subject: [PATCH 043/187] fixed compile time x86 cpu detection

---
 .travis.yml     | 4 ++--
 cli/xsum_arch.h | 2 +-
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/.travis.yml b/.travis.yml
index 2f2106df..0bef84d1 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -44,7 +44,7 @@ matrix:
       os: osx
       compiler: clang
       script:
-        - make   # test library build
+        - CFLAGS="-Werror" make   # test library build
         - make clean
         - make test MOREFLAGS='-Werror' | tee # test scenario where `stdout` is not the console
 
@@ -135,4 +135,4 @@ matrix:
         - mkdir build
         - cd build
         - cmake ..
-        - make
+        - CFLAGS=-Werror make
diff --git a/cli/xsum_arch.h b/cli/xsum_arch.h
index 1fb9a634..cc392979 100644
--- a/cli/xsum_arch.h
+++ b/cli/xsum_arch.h
@@ -87,7 +87,7 @@
 #endif
 
 /* Try to detect the architecture. */
-#if defined(ARCH_X86)
+#if defined(XSUM_ARCH_X86)
 #  if defined(XXHSUM_DISPATCH)
 #    define XSUM_ARCH XSUM_ARCH_X86 " autoVec"
 #  elif defined(__AVX512F__)

From 3c60072f478509ac21a3456472bce0924b683e06 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 12 Nov 2020 14:26:47 -0800
Subject: [PATCH 044/187] fix x86/x64 vector detection

was accidentally disabled in #465
---
 xxh_x86dispatch.c | 2 --
 1 file changed, 2 deletions(-)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 6387619b..4ea33409 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -327,7 +327,6 @@ static int XXH_featureTest(void)
 #if XXH_DISPATCH_AVX2 || XXH_DISPATCH_AVX512
     xxh_u64 xgetbv_val;
 #endif
-#if XXH_DISPATCH_SCALAR
 #if defined(__GNUC__) && defined(__i386__)
     xxh_u32 cpuid_supported;
     __asm__(
@@ -388,7 +387,6 @@ static int XXH_featureTest(void)
         return best;
 
     XXH_debugPrint("SSE2 support detected.");
-#endif /* XXH_DISPATCH_SCALAR */
 
     best = XXH_SSE2;
 #if XXH_DISPATCH_AVX2 || XXH_DISPATCH_AVX512

From 8aa549bf9a44a4b68537ec1efdb87efc8e507524 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 12 Nov 2020 16:46:48 -0800
Subject: [PATCH 045/187] remove sign conversion warning for gcc-5 + avx512

which is the version used on travis by default (xenial).
seems a bug in the intrinsic's definition
---
 .travis.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.travis.yml b/.travis.yml
index 0bef84d1..83476869 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -19,7 +19,7 @@ matrix:
       script:
         - make -B test-all
         - make clean
-        - CFLAGS="-Werror" make dispatch
+        - CFLAGS="-Werror" MOREFLAGS="-Wno-sign-conversion" make dispatch  # removing sign conversion warnings due to a bug in gcc-5's definition of some AVX512 intrinsics
         - make clean
         - CC=g++ CFLAGS="-O1 -mavx512f -Werror" make
         - make clean

From a37613edfe4ab56fa28617c500e627229bba0f05 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 12 Nov 2020 20:40:15 -0800
Subject: [PATCH 046/187] removed master branch badge

no longer used
---
 README.md | 1 -
 1 file changed, 1 deletion(-)

diff --git a/README.md b/README.md
index b976e3d1..7d9ae9dd 100644
--- a/README.md
+++ b/README.md
@@ -9,7 +9,6 @@ Code is highly portable, and hashes are identical across all platforms (little /
 
 |Branch      |Status   |
 |------------|---------|
-|master      | [![Build Status](https://travis-ci.org/Cyan4973/xxHash.svg?branch=master)](https://travis-ci.org/Cyan4973/xxHash?branch=master) |
 |dev         | [![Build Status](https://travis-ci.org/Cyan4973/xxHash.svg?branch=dev)](https://travis-ci.org/Cyan4973/xxHash?branch=dev) |
 
 

From 967060b1d10eb22cf3628a6847a81f51eee97cb9 Mon Sep 17 00:00:00 2001
From: jonsykkel <jonrevold@gmail.com>
Date: Fri, 13 Nov 2020 06:15:08 +0100
Subject: [PATCH 047/187] fixed CRITICAL typo in comment

---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 6be5466b..763b9651 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -49,7 +49,7 @@ Comparison (single thread, Windows Seven 32 bits, using SMHasher on a Core 2 Duo
 Name            Speed       Q.Score   Author
 xxHash          5.4 GB/s     10
 CrapWow         3.2 GB/s      2       Andrew
-MumurHash 3a    2.7 GB/s     10       Austin Appleby
+MurmurHash 3a   2.7 GB/s     10       Austin Appleby
 SpookyHash      2.0 GB/s     10       Bob Jenkins
 SBox            1.4 GB/s      9       Bret Mulvey
 Lookup3         1.2 GB/s      9       Bob Jenkins

From 417578779b14866f339415de96f8a2d3de0d1d95 Mon Sep 17 00:00:00 2001
From: Will Bryant <will.bryant@gmail.com>
Date: Sat, 14 Nov 2020 11:55:00 +1300
Subject: [PATCH 048/187] Don't attempt to dispatch to AVX2 on GCC <= 4.9
 (closes #473)

Although GCC 4.7 through 4.9 have AVX and AVX2 support and the necessary definitions, they aren't accessible to dispatcher code because you must turn on AVX/AVX2 mode to access them. We don't want to compile with that on as then the dispatcher code itself may get AVX/AVX2 optimizations and break.
---
 xxh_x86dispatch.c | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 4ea33409..f7967d1d 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -118,7 +118,8 @@ extern "C" {
  * @brief Enables/disables dispatching for AVX2.
  *
  * This is automatically detected if it is not defined.
- *  - GCC 4.7 and later are known to support AVX2.
+ *  - GCC 4.7 and later are known to support AVX2, but >4.9 is required for
+ *    to get the AVX2 intrinsics and typedefs without -mavx -mavx2.
  *  - Visual Studio 2013 Update 2 and later are known to support AVX2.
  *  - The GCC/Clang internal header `<avx2intrin.h>` is detected. While this is
  *    not allowed to be included directly, it still appears in the builtin
@@ -127,8 +128,7 @@ extern "C" {
  * @see XXH_AVX2
  */
 #ifndef XXH_DISPATCH_AVX2
-#  if (defined(__GNUC__) \
-       && (__GNUC__ > 4 || (__GNUC__ == 4 && __GNUC_MINOR__ >= 7))) /* GCC 4.7+ */ \
+#  if (defined(__GNUC__) && (__GNUC__ > 4)) /* GCC 5.0+ */ \
    || (defined(_MSC_VER) && _MSC_VER >= 1900) /* VS 2015+ */ \
    || (defined(_MSC_FULL_VER) && _MSC_FULL_VER >= 180030501) /* VS 2013 Update 2 */ \
    || XXH_HAS_INCLUDE(<avx2intrin.h>) /* GCC/Clang internal header */

From 3e8cc41d2a7b6b8e8ccddfda345408af5d46f33a Mon Sep 17 00:00:00 2001
From: Tim Gates <tim.gates@iress.com>
Date: Fri, 20 Nov 2020 08:52:07 +1100
Subject: [PATCH 049/187] docs: fix simple typo, minumum -> minimum

There is a small typo in xxh_x86dispatch.c, xxhash.h.

Should read `minimum` rather than `minumum`.
---
 xxh_x86dispatch.c | 2 +-
 xxhash.h          | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index 4ea33409..9e0f8ead 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -61,7 +61,7 @@ extern "C" {
  * @brief Disables the AVX sanity check.
  *
  * Don't compile xxh_x86dispatch.c with options like `-mavx*`, `-march=native`,
- * or `/arch:AVX*`. It is intended to be compiled for the minumum target, and
+ * or `/arch:AVX*`. It is intended to be compiled for the minimum target, and
  * it selectively enables SSE2, AVX2, and AVX512 when it is needed.
  *
  * Using this option _globally_ allows this feature, and therefore makes it
diff --git a/xxhash.h b/xxhash.h
index 763b9651..5a57f31b 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2824,7 +2824,7 @@ enum XXH_VECTOR_TYPE /* fake enum */ {
 };
 /*!
  * @ingroup tuning
- * @brief Selects the minumum alignment for XXH3's accumulators.
+ * @brief Selects the minimum alignment for XXH3's accumulators.
  *
  * When using SIMD, this should match the alignment reqired for said vector
  * type, so, for example, 32 for AVX2.

From 88c11374f2644076486e782731b0f6974a8f85b5 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Thu, 26 Nov 2020 10:19:04 -0800
Subject: [PATCH 050/187] display boundaries of small tests

---
 tests/bench/bhDisplay.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/tests/bench/bhDisplay.c b/tests/bench/bhDisplay.c
index 6cf1a537..601ca1f3 100644
--- a/tests/bench/bhDisplay.c
+++ b/tests/bench/bhDisplay.c
@@ -84,7 +84,8 @@ static void bench_throughput_oneHash_smallInputs(Bench_Entry hashDesc, size_t si
 
 void bench_throughput_smallInputs(Bench_Entry const* hashDescTable, int nbHashes, size_t sizeMin, size_t sizeMax)
 {
-    printf("Throughput small inputs of fixed size : \n");
+    printf("Throughput small inputs of fixed size (from %zu to %zu bytes): \n",
+            sizeMin, sizeMax);
     for (int i=0; i<nbHashes; i++)
         bench_throughput_oneHash_smallInputs(hashDescTable[i], sizeMin, sizeMax);
 }

From 758ed175ae8b0451d1b2065a3a4d8c5129d70c26 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Thu, 26 Nov 2020 15:02:33 -0800
Subject: [PATCH 051/187] do not display collision hash values by default

---
 tests/collisions/main.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/tests/collisions/main.c b/tests/collisions/main.c
index 3cdf5b4e..f71b859b 100644
--- a/tests/collisions/main.c
+++ b/tests/collisions/main.c
@@ -785,11 +785,13 @@ static size_t search_collisions(
     size_t collisions = 0;
     for (size_t n=1; n<nbCandidates; n++) {
         if (isEqual(hashCandidates, n, n-1, htype)) {
+#if defined(COL_DISPLAY_DUPLICATES)
             printf("collision: ");
             printHash(hashCandidates, n, htype);
             printf(" / ");
             printHash(hashCandidates, n-1, htype);
             printf(" \n");
+#endif
             collisions++;
     }   }
 

From 4c881f796d6af27ef7d9c48f87817da0d3d75dc1 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Thu, 26 Nov 2020 15:04:08 -0800
Subject: [PATCH 052/187] fix minor warning

---
 tests/collisions/main.c | 8 +-------
 1 file changed, 1 insertion(+), 7 deletions(-)

diff --git a/tests/collisions/main.c b/tests/collisions/main.c
index f71b859b..a857341b 100644
--- a/tests/collisions/main.c
+++ b/tests/collisions/main.c
@@ -69,13 +69,7 @@ static void hexRaw(const void* buffer, size_t size)
     }
 }
 
-void hexDisp(const void* buffer, size_t size)
-{
-    hexRaw(buffer, size);
-    printf("\n");
-}
-
-static void printHash(const void* table, size_t n, Htype_e htype)
+void printHash(const void* table, size_t n, Htype_e htype)
 {
     if ((htype == ht64) || (htype == ht32)){
         uint64_t const h64 = ((const uint64_t*)table)[n];

From b23f2e19e5542aa26178c8a4abc748f023a588d3 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Wed, 9 Dec 2020 15:40:03 -0800
Subject: [PATCH 053/187] simplify XXH3 reset

single function XXH3_reset_internal()
---
 tests/bench/main.c |  4 ++--
 xxhash.h           | 22 +++++++---------------
 2 files changed, 9 insertions(+), 17 deletions(-)

diff --git a/tests/bench/main.c b/tests/bench/main.c
index 85c4364b..1cf6e80f 100644
--- a/tests/bench/main.c
+++ b/tests/bench/main.c
@@ -74,7 +74,7 @@ static int readIntFromChar(const char** stringPtr)
 
 
 /**
- * longCommand():
+ * isCommand():
  * Checks if string is the same as longCommand.
  * If yes, @return 1, otherwise @return 0
  */
@@ -169,7 +169,7 @@ static int badusage(const char* exename)
     return 1;
 }
 
-int main(int argc, const char** argv)
+int main(int argc, const char* argv[])
 {
     const char* const exename = argv[0];
     int hashNb = 0;
diff --git a/xxhash.h b/xxhash.h
index 5a57f31b..b4c44877 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4633,7 +4633,7 @@ XXH3_copyState(XXH3_state_t* dst_state, const XXH3_state_t* src_state)
 }
 
 static void
-XXH3_64bits_reset_internal(XXH3_state_t* statePtr,
+XXH3_reset_internal(XXH3_state_t* statePtr,
                            XXH64_hash_t seed,
                            const void* secret, size_t secretSize)
 {
@@ -4663,7 +4663,7 @@ XXH_PUBLIC_API XXH_errorcode
 XXH3_64bits_reset(XXH3_state_t* statePtr)
 {
     if (statePtr == NULL) return XXH_ERROR;
-    XXH3_64bits_reset_internal(statePtr, 0, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
+    XXH3_reset_internal(statePtr, 0, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
     return XXH_OK;
 }
 
@@ -4672,7 +4672,7 @@ XXH_PUBLIC_API XXH_errorcode
 XXH3_64bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t secretSize)
 {
     if (statePtr == NULL) return XXH_ERROR;
-    XXH3_64bits_reset_internal(statePtr, 0, secret, secretSize);
+    XXH3_reset_internal(statePtr, 0, secret, secretSize);
     if (secret == NULL) return XXH_ERROR;
     if (secretSize < XXH3_SECRET_SIZE_MIN) return XXH_ERROR;
     return XXH_OK;
@@ -4685,7 +4685,7 @@ XXH3_64bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
     if (statePtr == NULL) return XXH_ERROR;
     if (seed==0) return XXH3_64bits_reset(statePtr);
     if (seed != statePtr->seed) XXH3_initCustomSecret(statePtr->customSecret, seed);
-    XXH3_64bits_reset_internal(statePtr, seed, NULL, XXH_SECRET_DEFAULT_SIZE);
+    XXH3_reset_internal(statePtr, seed, NULL, XXH_SECRET_DEFAULT_SIZE);
     return XXH_OK;
 }
 
@@ -5306,20 +5306,12 @@ XXH128(const void* input, size_t len, XXH64_hash_t seed)
  * The only difference is the finalizatiom routine.
  */
 
-static void
-XXH3_128bits_reset_internal(XXH3_state_t* statePtr,
-                            XXH64_hash_t seed,
-                            const void* secret, size_t secretSize)
-{
-    XXH3_64bits_reset_internal(statePtr, seed, secret, secretSize);
-}
-
 /*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset(XXH3_state_t* statePtr)
 {
     if (statePtr == NULL) return XXH_ERROR;
-    XXH3_128bits_reset_internal(statePtr, 0, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
+    XXH3_reset_internal(statePtr, 0, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
     return XXH_OK;
 }
 
@@ -5328,7 +5320,7 @@ XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t secretSize)
 {
     if (statePtr == NULL) return XXH_ERROR;
-    XXH3_128bits_reset_internal(statePtr, 0, secret, secretSize);
+    XXH3_reset_internal(statePtr, 0, secret, secretSize);
     if (secret == NULL) return XXH_ERROR;
     if (secretSize < XXH3_SECRET_SIZE_MIN) return XXH_ERROR;
     return XXH_OK;
@@ -5341,7 +5333,7 @@ XXH3_128bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
     if (statePtr == NULL) return XXH_ERROR;
     if (seed==0) return XXH3_128bits_reset(statePtr);
     if (seed != statePtr->seed) XXH3_initCustomSecret(statePtr->customSecret, seed);
-    XXH3_128bits_reset_internal(statePtr, seed, NULL, XXH_SECRET_DEFAULT_SIZE);
+    XXH3_reset_internal(statePtr, seed, NULL, XXH_SECRET_DEFAULT_SIZE);
     return XXH_OK;
 }
 

From 6b44373c2fc06b92d511e3b3099b45c6e1526d9f Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Wed, 9 Dec 2020 15:44:00 -0800
Subject: [PATCH 054/187] check that bufferedSize remains within bounds

to detect situations like #482
---
 xxhash.h | 1 +
 1 file changed, 1 insertion(+)

diff --git a/xxhash.h b/xxhash.h
index b4c44877..69a369aa 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4736,6 +4736,7 @@ XXH3_update(XXH3_state_t* state,
         const unsigned char* const secret = (state->extSecret == NULL) ? state->customSecret : state->extSecret;
 
         state->totalLen += len;
+        XXH_ASSERT(state->bufferedSize <= XXH3_INTERNALBUFFER_SIZE);
 
         if (state->bufferedSize + len <= XXH3_INTERNALBUFFER_SIZE) {  /* fill in tmp buffer */
             XXH_memcpy(state->buffer + state->bufferedSize, input, len);

From 2183a7cc82316f9e0156c5e90f6ac3e344bab069 Mon Sep 17 00:00:00 2001
From: begasus <begasus@gmail.com>
Date: Sun, 27 Dec 2020 13:21:24 +0000
Subject: [PATCH 055/187] added make install for Haiku

---
 Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/Makefile b/Makefile
index 83092358..23cf4a8c 100644
--- a/Makefile
+++ b/Makefile
@@ -381,7 +381,7 @@ trailingWhitespace:
 # =========================================================
 # make install is validated only for the following targets
 # =========================================================
-ifneq (,$(filter Linux Darwin GNU/kFreeBSD GNU OpenBSD FreeBSD NetBSD DragonFly SunOS CYGWIN% , $(UNAME)))
+ifneq (,$(filter Linux Darwin GNU/kFreeBSD GNU Haiku OpenBSD FreeBSD NetBSD DragonFly SunOS CYGWIN% , $(UNAME)))
 
 DESTDIR     ?=
 # directory variables: GNU conventions prefer lowercase

From 9f45b562e19c257f986ac752307238ef8679696f Mon Sep 17 00:00:00 2001
From: butteredmonkey <buttered.monkey@gmail.com>
Date: Tue, 5 Jan 2021 15:00:13 +0000
Subject: [PATCH 056/187] Update README.md

Fix typo
---
 cmake_unofficial/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/cmake_unofficial/README.md b/cmake_unofficial/README.md
index 554c55a2..66df7909 100644
--- a/cmake_unofficial/README.md
+++ b/cmake_unofficial/README.md
@@ -26,7 +26,7 @@ Add lines into downstream CMakeLists.txt:
 ### Way 2: Add subdirectory
 Add lines into downstream CMakeLists.txt:
 
-    option(BUILD_SHARE_LIBS "Build shared libs" OFF) #optional
+    option(BUILD_SHARED_LIBS "Build shared libs" OFF) #optional
     ...
     set(XXHASH_BUILD_ENABLE_INLINE_API OFF) #optional
     set(XXHASH_BUILD_XXHSUM OFF) #optional

From 91759f7e0183e32c12a25c721b6417b3767b8912 Mon Sep 17 00:00:00 2001
From: Thomas Waldmann <tw@waldmann-edv.de>
Date: Thu, 7 Jan 2021 18:55:54 +0100
Subject: [PATCH 057/187] fix typos (work done by Andrea Gelmini)

---
 xxhash.h | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 69a369aa..15ec2081 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -358,7 +358,7 @@ typedef uint32_t XXH32_hash_t;
 XXH_PUBLIC_API XXH32_hash_t XXH32 (const void* input, size_t length, XXH32_hash_t seed);
 
 /*!
- * Streaming functions generate the xxHash value from an incrememtal input.
+ * Streaming functions generate the xxHash value from an incremental input.
  * This method is slower than single-call functions, due to state management.
  * For small inputs, prefer `XXH32()` and `XXH64()`, which are better optimized.
  *
@@ -1193,7 +1193,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  *
  * Moreover, it's not useful to generate an additional code path if memory
  * access uses the same instruction for both aligned and unaligned
- * adresses (e.g. x86 and aarch64).
+ * addresses (e.g. x86 and aarch64).
  *
  * In these cases, the alignment check can be removed by setting this macro to 0.
  * Then the code will always use unaligned memory access.
@@ -1513,7 +1513,7 @@ static xxh_u32 XXH_read32(const void* memPtr)
 #endif   /* XXH_FORCE_DIRECT_MEMORY_ACCESS */
 
 
-/* ***   Endianess   *** */
+/* ***   Endianness   *** */
 typedef enum { XXH_bigEndian=0, XXH_littleEndian=1 } XXH_endianess;
 
 /*!
@@ -1739,7 +1739,7 @@ static xxh_u32 XXH32_round(xxh_u32 acc, xxh_u32 input)
      * UGLY HACK:
      * This inline assembly hack forces acc into a normal register. This is the
      * only thing that prevents GCC and Clang from autovectorizing the XXH32
-     * loop (pragmas and attributes don't work for some resason) without globally
+     * loop (pragmas and attributes don't work for some reason) without globally
      * disabling SSE4.1.
      *
      * The reason we want to avoid vectorization is because despite working on
@@ -5304,7 +5304,7 @@ XXH128(const void* input, size_t len, XXH64_hash_t seed)
 
 /*
  * All the functions are actually the same as for 64-bit streaming variant.
- * The only difference is the finalizatiom routine.
+ * The only difference is the finalization routine.
  */
 
 /*! @ingroup xxh3_family */

From febe7d01c87c317f6a76cdd87db6c886d9216c1d Mon Sep 17 00:00:00 2001
From: TheVice <thewinlab@hotmail.com>
Date: Sat, 23 Jan 2021 21:09:44 +0200
Subject: [PATCH 058/187] [xxhash_spec.md] added missed semicolon at the
 pseudo-code.

---
 doc/xxhash_spec.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/doc/xxhash_spec.md b/doc/xxhash_spec.md
index cd593d4e..af7ba90d 100644
--- a/doc/xxhash_spec.md
+++ b/doc/xxhash_spec.md
@@ -227,7 +227,7 @@ Note that accumulator convergence is more complex than 32-bit variant, and requi
 
     mergeAccumulator(acc,accN):
     acc  = acc xor round(0, accN);
-    acc  = acc * PRIME64_1
+    acc  = acc * PRIME64_1;
     return acc + PRIME64_4;
 
 which is then used in the convergence formula:

From a470f2ef95a87dfa7c35c7f3efc8cf3f9a812584 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Tue, 2 Feb 2021 06:40:22 -0800
Subject: [PATCH 059/187] update default memory access for armv6

Now uses `memcpy()` (method 0) by default
fix #490
---
 xxhash.h | 15 +++++++--------
 1 file changed, 7 insertions(+), 8 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 15ec2081..29c44e45 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1135,8 +1135,8 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  *  - `XXH_FORCE_MEMORY_ACCESS=1`: `__attribute__((packed))`
  *   @par
  *     Depends on compiler extensions and is therefore not portable.
- *     This method is safe if your compiler supports it, and *generally* as
- *     fast or faster than `memcpy`.
+ *     This method is safe _if_ your compiler supports it,
+ *     and *generally* as fast or faster than `memcpy`.
  *
  *  - `XXH_FORCE_MEMORY_ACCESS=2`: Direct cast
  *  @par
@@ -1144,7 +1144,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  *     compiler, but it violates the C standard as it directly dereferences an
  *     unaligned pointer. It can generate buggy code on targets which do not
  *     support unaligned memory accesses, but in some circumstances, it's the
- *     only known way to get the most performance (example: GCC + ARMv6).
+ *     only known way to get the most performance.
  *
  *  - `XXH_FORCE_MEMORY_ACCESS=3`: Byteshift
  *  @par
@@ -1152,7 +1152,6 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  *     inline small `memcpy()` calls, and it might also be faster on big-endian
  *     systems which lack a native byteswap instruction. However, some compilers
  *     will emit literal byteshifts even if the target supports unaligned access.
- *
  *  .
  *
  * @warning
@@ -1255,10 +1254,10 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  */
 
 #ifndef XXH_FORCE_MEMORY_ACCESS   /* can be defined externally, on command line for example */
-#  if !defined(__clang__) && defined(__GNUC__) && defined(__ARM_FEATURE_UNALIGNED) && defined(__ARM_ARCH) && (__ARM_ARCH == 6)
-#    define XXH_FORCE_MEMORY_ACCESS 2
-#  elif !defined(__clang__) && ((defined(__INTEL_COMPILER) && !defined(_WIN32)) || \
-  (defined(__GNUC__) && (defined(__ARM_ARCH) && __ARM_ARCH >= 7)))
+   /* prefer __packed__ structures (method 1) for gcc on armv7 and armv8 */
+#  if !defined(__clang__) && ( \
+    (defined(__INTEL_COMPILER) && !defined(_WIN32)) || \
+    (defined(__GNUC__) && (defined(__ARM_ARCH) && __ARM_ARCH >= 7)) )
 #    define XXH_FORCE_MEMORY_ACCESS 1
 #  endif
 #endif

From 01fc2e6294cee587f0b444cae94a0c79a22f6634 Mon Sep 17 00:00:00 2001
From: Travers <traversc@gmail.com>
Date: Sat, 20 Feb 2021 17:29:41 -0800
Subject: [PATCH 060/187] solaris restrict keyword fix

---
 xxhash.h | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 29c44e45..966103a5 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2679,7 +2679,10 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 
 /* ===   Compiler specifics   === */
 
-#if defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
+/* ===   Compiler specifics   === */
+#if defined(sun) || defined(__sun)
+#  define XXH_RESTRICT /* disable */
+#elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
 #  define XXH_RESTRICT   restrict
 #else
 /* Note: it might be useful to define __restrict or __restrict__ for some C++ compilers */

From c865e556c9ee9b291112dc62427033b351e0d078 Mon Sep 17 00:00:00 2001
From: Travers <traversc@gmail.com>
Date: Mon, 22 Feb 2021 09:41:20 -0800
Subject: [PATCH 061/187] fix to solaris restrict keyword with c++

---
 xxhash.h | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 966103a5..8af139d5 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2679,8 +2679,9 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 
 /* ===   Compiler specifics   === */
 
-/* ===   Compiler specifics   === */
-#if defined(sun) || defined(__sun)
+#if ((defined(sun) || defined(__sun)) && __cplusplus) /* Solaris includes __STDC_VERSION__ with C++. Tested with GCC 5.5 */
+#  define XXH_RESTRICT /* disable */
+#elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
 #  define XXH_RESTRICT /* disable */
 #elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
 #  define XXH_RESTRICT   restrict

From f79cd22a806993b4a62d8a4f1ba529a29a9d9ff5 Mon Sep 17 00:00:00 2001
From: Dmitry Kalinkin <dmitry.kalinkin@gmail.com>
Date: Mon, 22 Feb 2021 14:07:58 -0500
Subject: [PATCH 062/187] Makefile: escape special regex characters in paths

Fixes a problem with certain valid install paths:

make prefix=/tmp/a+b/
Makefile:434: *** configured libdir (/tmp/a+b//lib) is outside of exec_prefix (/tmp/a+b/), can't generate pkg-config file.  Stop.
---
 Makefile | 12 ++++++++----
 1 file changed, 8 insertions(+), 4 deletions(-)

diff --git a/Makefile b/Makefile
index 23cf4a8c..3b174314 100644
--- a/Makefile
+++ b/Makefile
@@ -423,14 +423,18 @@ INSTALL_PROGRAM ?= $(INSTALL)
 INSTALL_DATA    ?= $(INSTALL) -m 644
 
 
-PCLIBDIR ?= $(shell echo "$(LIBDIR)"     | $(SED) -n $(SED_ERE_OPT) -e "s@^$(EXEC_PREFIX)(/|$$)@@p")
-PCINCDIR ?= $(shell echo "$(INCLUDEDIR)" | $(SED) -n $(SED_ERE_OPT) -e "s@^$(PREFIX)(/|$$)@@p")
+# Escape special symbols by putting each character into its separate class
+EXEC_PREFIX_REGEX ?= $(shell echo "$(EXEC_PREFIX)" | $(SED) $(SED_ERE_OPT) -e "s/([^^])/[\1]/g" -e "s/\\^/\\\\^/g")
+PREFIX_REGEX ?= $(shell echo "$(PREFIX)" | $(SED) $(SED_ERE_OPT) -e "s/([^^])/[\1]/g" -e "s/\\^/\\\\^/g")
+
+PCLIBDIR ?= $(shell echo "$(LIBDIR)"     | $(SED) -n $(SED_ERE_OPT) -e "s@^$(EXEC_PREFIX_REGEX)(/|$$)@@p")
+PCINCDIR ?= $(shell echo "$(INCLUDEDIR)" | $(SED) -n $(SED_ERE_OPT) -e "s@^$(PREFIX_REGEX)(/|$$)@@p")
 PCEXECDIR?= $(if $(filter $(PREFIX),$(EXEC_PREFIX)),$$\{prefix\},$(EXEC_PREFIX))
 
 ifeq (,$(PCLIBDIR))
 # Additional prefix check is required, since the empty string is technically a
 # valid PCLIBDIR
-ifeq (,$(shell echo "$(LIBDIR)" | $(SED) -n $(SED_ERE_OPT) -e "\\@^$(EXEC_PREFIX)(/|$$)@ p"))
+ifeq (,$(shell echo "$(LIBDIR)" | $(SED) -n $(SED_ERE_OPT) -e "\\@^$(EXEC_PREFIX_REGEX)(/|$$)@ p"))
 $(error configured libdir ($(LIBDIR)) is outside of exec_prefix ($(EXEC_PREFIX)), can't generate pkg-config file)
 endif
 endif
@@ -438,7 +442,7 @@ endif
 ifeq (,$(PCINCDIR))
 # Additional prefix check is required, since the empty string is technically a
 # valid PCINCDIR
-ifeq (,$(shell echo "$(INCLUDEDIR)" | $(SED) -n $(SED_ERE_OPT) -e "\\@^$(PREFIX)(/|$$)@ p"))
+ifeq (,$(shell echo "$(INCLUDEDIR)" | $(SED) -n $(SED_ERE_OPT) -e "\\@^$(PREFIX_REGEX)(/|$$)@ p"))
 $(error configured includedir ($(INCLUDEDIR)) is outside of prefix ($(PREFIX)), can't generate pkg-config file)
 endif
 endif

From 4254f6e46d8dbe3d3dc7d1656b4ab3ae63bab5f6 Mon Sep 17 00:00:00 2001
From: Yann Collet <Cyan4973@users.noreply.github.com>
Date: Mon, 22 Feb 2021 14:57:36 -0800
Subject: [PATCH 063/187] Revert "solaris restrict keyword fix"

---
 xxhash.h | 6 +-----
 1 file changed, 1 insertion(+), 5 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 8af139d5..29c44e45 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2679,11 +2679,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 
 /* ===   Compiler specifics   === */
 
-#if ((defined(sun) || defined(__sun)) && __cplusplus) /* Solaris includes __STDC_VERSION__ with C++. Tested with GCC 5.5 */
-#  define XXH_RESTRICT /* disable */
-#elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
-#  define XXH_RESTRICT /* disable */
-#elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
+#if defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
 #  define XXH_RESTRICT   restrict
 #else
 /* Note: it might be useful to define __restrict or __restrict__ for some C++ compilers */

From a340845f94ad708ce920cb3afecfa55c5f0b78dd Mon Sep 17 00:00:00 2001
From: Neal Richardson <neal.p.richardson@gmail.com>
Date: Mon, 22 Feb 2021 15:52:38 -0800
Subject: [PATCH 064/187] Fix the solaris patch

---
 xxhash.h | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 29c44e45..cd4aebc9 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2679,7 +2679,9 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
 
 /* ===   Compiler specifics   === */
 
-#if defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
+#if ((defined(sun) || defined(__sun)) && __cplusplus) /* Solaris includes __STDC_VERSION__ with C++. Tested with GCC 5.5 */
+#  define XXH_RESTRICT /* disable */
+#elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 199901L   /* >= C99 */
 #  define XXH_RESTRICT   restrict
 #else
 /* Note: it might be useful to define __restrict or __restrict__ for some C++ compilers */

From 935313786515da8fc114f389bbb98b1076533ec3 Mon Sep 17 00:00:00 2001
From: Koichi Shiraishi <zchee.io@gmail.com>
Date: Mon, 1 Mar 2021 00:17:42 +0900
Subject: [PATCH 065/187] doc: add markdown fences to some C snippets

Signed-off-by: Koichi Shiraishi <zchee.io@gmail.com>
---
 doc/xxhash_spec.md | 202 ++++++++++++++++++++++++++-------------------
 1 file changed, 118 insertions(+), 84 deletions(-)

diff --git a/doc/xxhash_spec.md b/doc/xxhash_spec.md
index af7ba90d..cd13bc09 100644
--- a/doc/xxhash_spec.md
+++ b/doc/xxhash_spec.md
@@ -63,11 +63,13 @@ The algorithm collect and transform input in _stripes_ of 16 bytes. The transfor
 
 The algorithm uses 32-bits addition, multiplication, rotate, shift and xor operations. Many operations require some 32-bits prime number constants, all defined below:
 
-    static const u32 PRIME32_1 = 0x9E3779B1U;  // 0b10011110001101110111100110110001
-    static const u32 PRIME32_2 = 0x85EBCA77U;  // 0b10000101111010111100101001110111
-    static const u32 PRIME32_3 = 0xC2B2AE3DU;  // 0b11000010101100101010111000111101
-    static const u32 PRIME32_4 = 0x27D4EB2FU;  // 0b00100111110101001110101100101111
-    static const u32 PRIME32_5 = 0x165667B1U;  // 0b00010110010101100110011110110001
+```c
+static const u32 PRIME32_1 = 0x9E3779B1U;  // 0b10011110001101110111100110110001
+static const u32 PRIME32_2 = 0x85EBCA77U;  // 0b10000101111010111100101001110111
+static const u32 PRIME32_3 = 0xC2B2AE3DU;  // 0b11000010101100101010111000111101
+static const u32 PRIME32_4 = 0x27D4EB2FU;  // 0b00100111110101001110101100101111
+static const u32 PRIME32_5 = 0x165667B1U;  // 0b00010110010101100110011110110001
+```
 
 These constants are prime numbers, and feature a good mix of bits 1 and 0, neither too regular, nor too dissymmetric. These properties help dispersion capabilities.
 
@@ -75,10 +77,12 @@ These constants are prime numbers, and feature a good mix of bits 1 and 0, neith
 
 Each accumulator gets an initial value based on optional `seed` input. Since the `seed` is optional, it can be `0`.
 
-        u32 acc1 = seed + PRIME32_1 + PRIME32_2;
-        u32 acc2 = seed + PRIME32_2;
-        u32 acc3 = seed + 0;
-        u32 acc4 = seed - PRIME32_1;
+```c
+    u32 acc1 = seed + PRIME32_1 + PRIME32_2;
+    u32 acc2 = seed + PRIME32_2;
+    u32 acc3 = seed + 0;
+    u32 acc4 = seed - PRIME32_1;
+```
 
 #### Special case: input is less than 16 bytes
 
@@ -86,7 +90,9 @@ When the input is too small (< 16 bytes), the algorithm will not process any str
 
 In this case, a simplified initialization is performed, using a single accumulator:
 
-      u32 acc  = seed + PRIME32_5;
+```c
+  u32 acc  = seed + PRIME32_5;
+```
 
 The algorithm then proceeds directly to step 4.
 
@@ -100,9 +106,11 @@ Each lane read its associated 32-bit value using __little-endian__ convention.
 
 For each {lane, accumulator}, the update process is called a _round_, and applies the following formula:
 
-    accN = accN + (laneN * PRIME32_2);
-    accN = accN <<< 13;
-    accN = accN * PRIME32_1;
+```c
+accN = accN + (laneN * PRIME32_2);
+accN = accN <<< 13;
+accN = accN * PRIME32_1;
+```
 
 This shuffles the bits so that any bit from input _lane_ impacts several bits in output _accumulator_. All operations are performed modulo 2^32.
 
@@ -113,13 +121,17 @@ When that happens, move to step 3.
 
 All 4 lane accumulators from the previous steps are merged to produce a single remaining accumulator of the same width (32-bit). The associated formula is as follows:
 
-    acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
+```c
+acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
+```
 
 ### Step 4. Add input length
 
 The input total length is presumed known at this stage. This step is just about adding the length to accumulator, so that it participates to final mixing.
 
-    acc = acc + (u32)inputLength;
+```c
+acc = acc + (u32)inputLength;
+```
 
 Note that, if input length is so large that it requires more than 32-bits, only the lower 32-bits are added to the accumulator.
 
@@ -128,19 +140,21 @@ Note that, if input length is so large that it requires more than 32-bits, only
 There may be up to 15 bytes remaining to consume from the input.
 The final stage will digest them according to following pseudo-code:
 
-    while (remainingLength >= 4) {
-        lane = read_32bit_little_endian(input_ptr);
-        acc = acc + lane * PRIME32_3;
-        acc = (acc <<< 17) * PRIME32_4;
-        input_ptr += 4; remainingLength -= 4;
-    }
-
-    while (remainingLength >= 1) {
-        lane = read_byte(input_ptr);
-        acc = acc + lane * PRIME32_5;
-        acc = (acc <<< 11) * PRIME32_1;
-        input_ptr += 1; remainingLength -= 1;
-    }
+```c
+while (remainingLength >= 4) {
+    lane = read_32bit_little_endian(input_ptr);
+    acc = acc + lane * PRIME32_3;
+    acc = (acc <<< 17) * PRIME32_4;
+    input_ptr += 4; remainingLength -= 4;
+}
+
+while (remainingLength >= 1) {
+    lane = read_byte(input_ptr);
+    acc = acc + lane * PRIME32_5;
+    acc = (acc <<< 11) * PRIME32_1;
+    input_ptr += 1; remainingLength -= 1;
+}
+```
 
 This process ensures that all input bytes are present in the final mix.
 
@@ -148,11 +162,13 @@ This process ensures that all input bytes are present in the final mix.
 
 The final mix ensures that all input bits have a chance to impact any bit in the output digest, resulting in an unbiased distribution. This is also called avalanche effect.
 
-    acc = acc xor (acc >> 15);
-    acc = acc * PRIME32_2;
-    acc = acc xor (acc >> 13);
-    acc = acc * PRIME32_3;
-    acc = acc xor (acc >> 16);
+```c
+acc = acc xor (acc >> 15);
+acc = acc * PRIME32_2;
+acc = acc xor (acc >> 13);
+acc = acc * PRIME32_3;
+acc = acc xor (acc >> 16);
+```
 
 ### Step 7. Output
 
@@ -172,11 +188,13 @@ The algorithm collects and transforms input in _stripes_ of 32 bytes. The transf
 
 The algorithm uses 64-bit addition, multiplication, rotate, shift and xor operations. Many operations require some 64-bit prime number constants, all defined below:
 
-    static const u64 PRIME64_1 = 0x9E3779B185EBCA87ULL;  // 0b1001111000110111011110011011000110000101111010111100101010000111
-    static const u64 PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;  // 0b1100001010110010101011100011110100100111110101001110101101001111
-    static const u64 PRIME64_3 = 0x165667B19E3779F9ULL;  // 0b0001011001010110011001111011000110011110001101110111100111111001
-    static const u64 PRIME64_4 = 0x85EBCA77C2B2AE63ULL;  // 0b1000010111101011110010100111011111000010101100101010111001100011
-    static const u64 PRIME64_5 = 0x27D4EB2F165667C5ULL;  // 0b0010011111010100111010110010111100010110010101100110011111000101
+```c
+static const u64 PRIME64_1 = 0x9E3779B185EBCA87ULL;  // 0b1001111000110111011110011011000110000101111010111100101010000111
+static const u64 PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;  // 0b1100001010110010101011100011110100100111110101001110101101001111
+static const u64 PRIME64_3 = 0x165667B19E3779F9ULL;  // 0b0001011001010110011001111011000110011110001101110111100111111001
+static const u64 PRIME64_4 = 0x85EBCA77C2B2AE63ULL;  // 0b1000010111101011110010100111011111000010101100101010111001100011
+static const u64 PRIME64_5 = 0x27D4EB2F165667C5ULL;  // 0b0010011111010100111010110010111100010110010101100110011111000101
+```
 
 These constants are prime numbers, and feature a good mix of bits 1 and 0, neither too regular, nor too dissymmetric. These properties help dispersion capabilities.
 
@@ -184,10 +202,12 @@ These constants are prime numbers, and feature a good mix of bits 1 and 0, neith
 
 Each accumulator gets an initial value based on optional `seed` input. Since the `seed` is optional, it can be `0`.
 
-        u64 acc1 = seed + PRIME64_1 + PRIME64_2;
-        u64 acc2 = seed + PRIME64_2;
-        u64 acc3 = seed + 0;
-        u64 acc4 = seed - PRIME64_1;
+```c
+    u64 acc1 = seed + PRIME64_1 + PRIME64_2;
+    u64 acc2 = seed + PRIME64_2;
+    u64 acc3 = seed + 0;
+    u64 acc4 = seed - PRIME64_1;
+```
 
 #### Special case: input is less than 32 bytes
 
@@ -195,7 +215,9 @@ When the input is too small (< 32 bytes), the algorithm will not process any str
 
 In this case, a simplified initialization is performed, using a single accumulator:
 
-      u64 acc  = seed + PRIME64_5;
+```c
+  u64 acc  = seed + PRIME64_5;
+```
 
 The algorithm then proceeds directly to step 4.
 
@@ -209,10 +231,12 @@ Each lane read its associated 64-bit value using __little-endian__ convention.
 
 For each {lane, accumulator}, the update process is called a _round_, and applies the following formula:
 
-    round(accN,laneN):
-    accN = accN + (laneN * PRIME64_2);
-    accN = accN <<< 31;
-    return accN * PRIME64_1;
+```c
+round(accN,laneN):
+accN = accN + (laneN * PRIME64_2);
+accN = accN <<< 31;
+return accN * PRIME64_1;
+```
 
 This shuffles the bits so that any bit from input _lane_ impacts several bits in output _accumulator_. All operations are performed modulo 2^64.
 
@@ -225,52 +249,60 @@ All 4 lane accumulators from previous steps are merged to produce a single remai
 
 Note that accumulator convergence is more complex than 32-bit variant, and requires to define another function called _mergeAccumulator()_:
 
-    mergeAccumulator(acc,accN):
-    acc  = acc xor round(0, accN);
-    acc  = acc * PRIME64_1;
-    return acc + PRIME64_4;
+```c
+mergeAccumulator(acc,accN):
+acc  = acc xor round(0, accN);
+acc  = acc * PRIME64_1;
+return acc + PRIME64_4;
+```
 
 which is then used in the convergence formula:
 
-    acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
-    acc = mergeAccumulator(acc, acc1);
-    acc = mergeAccumulator(acc, acc2);
-    acc = mergeAccumulator(acc, acc3);
-    acc = mergeAccumulator(acc, acc4);
+```c
+acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
+acc = mergeAccumulator(acc, acc1);
+acc = mergeAccumulator(acc, acc2);
+acc = mergeAccumulator(acc, acc3);
+acc = mergeAccumulator(acc, acc4);
+```
 
 ### Step 4. Add input length
 
 The input total length is presumed known at this stage. This step is just about adding the length to accumulator, so that it participates to final mixing.
 
-    acc = acc + inputLength;
+```c
+acc = acc + inputLength;
+```
 
 ### Step 5. Consume remaining input
 
 There may be up to 31 bytes remaining to consume from the input.
 The final stage will digest them according to following pseudo-code:
 
-    while (remainingLength >= 8) {
-        lane = read_64bit_little_endian(input_ptr);
-        acc = acc xor round(0, lane);
-        acc = (acc <<< 27) * PRIME64_1;
-        acc = acc + PRIME64_4;
-        input_ptr += 8; remainingLength -= 8;
-    }
-
-    if (remainingLength >= 4) {
-        lane = read_32bit_little_endian(input_ptr);
-        acc = acc xor (lane * PRIME64_1);
-        acc = (acc <<< 23) * PRIME64_2;
-        acc = acc + PRIME64_3;
-        input_ptr += 4; remainingLength -= 4;
-    }
-
-    while (remainingLength >= 1) {
-        lane = read_byte(input_ptr);
-        acc = acc xor (lane * PRIME64_5);
-        acc = (acc <<< 11) * PRIME64_1;
-        input_ptr += 1; remainingLength -= 1;
-    }
+```c
+while (remainingLength >= 8) {
+    lane = read_64bit_little_endian(input_ptr);
+    acc = acc xor round(0, lane);
+    acc = (acc <<< 27) * PRIME64_1;
+    acc = acc + PRIME64_4;
+    input_ptr += 8; remainingLength -= 8;
+}
+
+if (remainingLength >= 4) {
+    lane = read_32bit_little_endian(input_ptr);
+    acc = acc xor (lane * PRIME64_1);
+    acc = (acc <<< 23) * PRIME64_2;
+    acc = acc + PRIME64_3;
+    input_ptr += 4; remainingLength -= 4;
+}
+
+while (remainingLength >= 1) {
+    lane = read_byte(input_ptr);
+    acc = acc xor (lane * PRIME64_5);
+    acc = (acc <<< 11) * PRIME64_1;
+    input_ptr += 1; remainingLength -= 1;
+}
+```
 
 This process ensures that all input bytes are present in the final mix.
 
@@ -278,11 +310,13 @@ This process ensures that all input bytes are present in the final mix.
 
 The final mix ensures that all input bits have a chance to impact any bit in the output digest, resulting in an unbiased distribution. This is also called avalanche effect.
 
-    acc = acc xor (acc >> 33);
-    acc = acc * PRIME64_2;
-    acc = acc xor (acc >> 29);
-    acc = acc * PRIME64_3;
-    acc = acc xor (acc >> 32);
+```c
+acc = acc xor (acc >> 33);
+acc = acc * PRIME64_2;
+acc = acc xor (acc >> 29);
+acc = acc * PRIME64_3;
+acc = acc xor (acc >> 32);
+```
 
 ### Step 7. Output
 

From 475bddca397e76a837d23b21d53358c7f84d039a Mon Sep 17 00:00:00 2001
From: Koichi Shiraishi <zchee.io@gmail.com>
Date: Tue, 2 Mar 2021 00:18:52 +0900
Subject: [PATCH 066/187] docs: add 2 space indent to all code snippet

---
 doc/xxhash_spec.md | 164 ++++++++++++++++++++++-----------------------
 1 file changed, 82 insertions(+), 82 deletions(-)

diff --git a/doc/xxhash_spec.md b/doc/xxhash_spec.md
index cd13bc09..1befd713 100644
--- a/doc/xxhash_spec.md
+++ b/doc/xxhash_spec.md
@@ -64,11 +64,11 @@ The algorithm collect and transform input in _stripes_ of 16 bytes. The transfor
 The algorithm uses 32-bits addition, multiplication, rotate, shift and xor operations. Many operations require some 32-bits prime number constants, all defined below:
 
 ```c
-static const u32 PRIME32_1 = 0x9E3779B1U;  // 0b10011110001101110111100110110001
-static const u32 PRIME32_2 = 0x85EBCA77U;  // 0b10000101111010111100101001110111
-static const u32 PRIME32_3 = 0xC2B2AE3DU;  // 0b11000010101100101010111000111101
-static const u32 PRIME32_4 = 0x27D4EB2FU;  // 0b00100111110101001110101100101111
-static const u32 PRIME32_5 = 0x165667B1U;  // 0b00010110010101100110011110110001
+  static const u32 PRIME32_1 = 0x9E3779B1U;  // 0b10011110001101110111100110110001
+  static const u32 PRIME32_2 = 0x85EBCA77U;  // 0b10000101111010111100101001110111
+  static const u32 PRIME32_3 = 0xC2B2AE3DU;  // 0b11000010101100101010111000111101
+  static const u32 PRIME32_4 = 0x27D4EB2FU;  // 0b00100111110101001110101100101111
+  static const u32 PRIME32_5 = 0x165667B1U;  // 0b00010110010101100110011110110001
 ```
 
 These constants are prime numbers, and feature a good mix of bits 1 and 0, neither too regular, nor too dissymmetric. These properties help dispersion capabilities.
@@ -78,10 +78,10 @@ These constants are prime numbers, and feature a good mix of bits 1 and 0, neith
 Each accumulator gets an initial value based on optional `seed` input. Since the `seed` is optional, it can be `0`.
 
 ```c
-    u32 acc1 = seed + PRIME32_1 + PRIME32_2;
-    u32 acc2 = seed + PRIME32_2;
-    u32 acc3 = seed + 0;
-    u32 acc4 = seed - PRIME32_1;
+  u32 acc1 = seed + PRIME32_1 + PRIME32_2;
+  u32 acc2 = seed + PRIME32_2;
+  u32 acc3 = seed + 0;
+  u32 acc4 = seed - PRIME32_1;
 ```
 
 #### Special case: input is less than 16 bytes
@@ -107,9 +107,9 @@ Each lane read its associated 32-bit value using __little-endian__ convention.
 For each {lane, accumulator}, the update process is called a _round_, and applies the following formula:
 
 ```c
-accN = accN + (laneN * PRIME32_2);
-accN = accN <<< 13;
-accN = accN * PRIME32_1;
+  accN = accN + (laneN * PRIME32_2);
+  accN = accN <<< 13;
+  accN = accN * PRIME32_1;
 ```
 
 This shuffles the bits so that any bit from input _lane_ impacts several bits in output _accumulator_. All operations are performed modulo 2^32.
@@ -122,7 +122,7 @@ When that happens, move to step 3.
 All 4 lane accumulators from the previous steps are merged to produce a single remaining accumulator of the same width (32-bit). The associated formula is as follows:
 
 ```c
-acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
+  acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
 ```
 
 ### Step 4. Add input length
@@ -130,7 +130,7 @@ acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
 The input total length is presumed known at this stage. This step is just about adding the length to accumulator, so that it participates to final mixing.
 
 ```c
-acc = acc + (u32)inputLength;
+  acc = acc + (u32)inputLength;
 ```
 
 Note that, if input length is so large that it requires more than 32-bits, only the lower 32-bits are added to the accumulator.
@@ -141,19 +141,19 @@ There may be up to 15 bytes remaining to consume from the input.
 The final stage will digest them according to following pseudo-code:
 
 ```c
-while (remainingLength >= 4) {
-    lane = read_32bit_little_endian(input_ptr);
-    acc = acc + lane * PRIME32_3;
-    acc = (acc <<< 17) * PRIME32_4;
-    input_ptr += 4; remainingLength -= 4;
-}
-
-while (remainingLength >= 1) {
-    lane = read_byte(input_ptr);
-    acc = acc + lane * PRIME32_5;
-    acc = (acc <<< 11) * PRIME32_1;
-    input_ptr += 1; remainingLength -= 1;
-}
+  while (remainingLength >= 4) {
+      lane = read_32bit_little_endian(input_ptr);
+      acc = acc + lane * PRIME32_3;
+      acc = (acc <<< 17) * PRIME32_4;
+      input_ptr += 4; remainingLength -= 4;
+  }
+
+  while (remainingLength >= 1) {
+      lane = read_byte(input_ptr);
+      acc = acc + lane * PRIME32_5;
+      acc = (acc <<< 11) * PRIME32_1;
+      input_ptr += 1; remainingLength -= 1;
+  }
 ```
 
 This process ensures that all input bytes are present in the final mix.
@@ -163,11 +163,11 @@ This process ensures that all input bytes are present in the final mix.
 The final mix ensures that all input bits have a chance to impact any bit in the output digest, resulting in an unbiased distribution. This is also called avalanche effect.
 
 ```c
-acc = acc xor (acc >> 15);
-acc = acc * PRIME32_2;
-acc = acc xor (acc >> 13);
-acc = acc * PRIME32_3;
-acc = acc xor (acc >> 16);
+  acc = acc xor (acc >> 15);
+  acc = acc * PRIME32_2;
+  acc = acc xor (acc >> 13);
+  acc = acc * PRIME32_3;
+  acc = acc xor (acc >> 16);
 ```
 
 ### Step 7. Output
@@ -189,11 +189,11 @@ The algorithm collects and transforms input in _stripes_ of 32 bytes. The transf
 The algorithm uses 64-bit addition, multiplication, rotate, shift and xor operations. Many operations require some 64-bit prime number constants, all defined below:
 
 ```c
-static const u64 PRIME64_1 = 0x9E3779B185EBCA87ULL;  // 0b1001111000110111011110011011000110000101111010111100101010000111
-static const u64 PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;  // 0b1100001010110010101011100011110100100111110101001110101101001111
-static const u64 PRIME64_3 = 0x165667B19E3779F9ULL;  // 0b0001011001010110011001111011000110011110001101110111100111111001
-static const u64 PRIME64_4 = 0x85EBCA77C2B2AE63ULL;  // 0b1000010111101011110010100111011111000010101100101010111001100011
-static const u64 PRIME64_5 = 0x27D4EB2F165667C5ULL;  // 0b0010011111010100111010110010111100010110010101100110011111000101
+  static const u64 PRIME64_1 = 0x9E3779B185EBCA87ULL;  // 0b1001111000110111011110011011000110000101111010111100101010000111
+  static const u64 PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;  // 0b1100001010110010101011100011110100100111110101001110101101001111
+  static const u64 PRIME64_3 = 0x165667B19E3779F9ULL;  // 0b0001011001010110011001111011000110011110001101110111100111111001
+  static const u64 PRIME64_4 = 0x85EBCA77C2B2AE63ULL;  // 0b1000010111101011110010100111011111000010101100101010111001100011
+  static const u64 PRIME64_5 = 0x27D4EB2F165667C5ULL;  // 0b0010011111010100111010110010111100010110010101100110011111000101
 ```
 
 These constants are prime numbers, and feature a good mix of bits 1 and 0, neither too regular, nor too dissymmetric. These properties help dispersion capabilities.
@@ -203,10 +203,10 @@ These constants are prime numbers, and feature a good mix of bits 1 and 0, neith
 Each accumulator gets an initial value based on optional `seed` input. Since the `seed` is optional, it can be `0`.
 
 ```c
-    u64 acc1 = seed + PRIME64_1 + PRIME64_2;
-    u64 acc2 = seed + PRIME64_2;
-    u64 acc3 = seed + 0;
-    u64 acc4 = seed - PRIME64_1;
+  u64 acc1 = seed + PRIME64_1 + PRIME64_2;
+  u64 acc2 = seed + PRIME64_2;
+  u64 acc3 = seed + 0;
+  u64 acc4 = seed - PRIME64_1;
 ```
 
 #### Special case: input is less than 32 bytes
@@ -232,10 +232,10 @@ Each lane read its associated 64-bit value using __little-endian__ convention.
 For each {lane, accumulator}, the update process is called a _round_, and applies the following formula:
 
 ```c
-round(accN,laneN):
-accN = accN + (laneN * PRIME64_2);
-accN = accN <<< 31;
-return accN * PRIME64_1;
+  round(accN,laneN):
+  accN = accN + (laneN * PRIME64_2);
+  accN = accN <<< 31;
+  return accN * PRIME64_1;
 ```
 
 This shuffles the bits so that any bit from input _lane_ impacts several bits in output _accumulator_. All operations are performed modulo 2^64.
@@ -250,20 +250,20 @@ All 4 lane accumulators from previous steps are merged to produce a single remai
 Note that accumulator convergence is more complex than 32-bit variant, and requires to define another function called _mergeAccumulator()_:
 
 ```c
-mergeAccumulator(acc,accN):
-acc  = acc xor round(0, accN);
-acc  = acc * PRIME64_1;
-return acc + PRIME64_4;
+  mergeAccumulator(acc,accN):
+  acc  = acc xor round(0, accN);
+  acc  = acc * PRIME64_1;
+  return acc + PRIME64_4;
 ```
 
 which is then used in the convergence formula:
 
 ```c
-acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
-acc = mergeAccumulator(acc, acc1);
-acc = mergeAccumulator(acc, acc2);
-acc = mergeAccumulator(acc, acc3);
-acc = mergeAccumulator(acc, acc4);
+  acc = (acc1 <<< 1) + (acc2 <<< 7) + (acc3 <<< 12) + (acc4 <<< 18);
+  acc = mergeAccumulator(acc, acc1);
+  acc = mergeAccumulator(acc, acc2);
+  acc = mergeAccumulator(acc, acc3);
+  acc = mergeAccumulator(acc, acc4);
 ```
 
 ### Step 4. Add input length
@@ -271,7 +271,7 @@ acc = mergeAccumulator(acc, acc4);
 The input total length is presumed known at this stage. This step is just about adding the length to accumulator, so that it participates to final mixing.
 
 ```c
-acc = acc + inputLength;
+  acc = acc + inputLength;
 ```
 
 ### Step 5. Consume remaining input
@@ -280,28 +280,28 @@ There may be up to 31 bytes remaining to consume from the input.
 The final stage will digest them according to following pseudo-code:
 
 ```c
-while (remainingLength >= 8) {
-    lane = read_64bit_little_endian(input_ptr);
-    acc = acc xor round(0, lane);
-    acc = (acc <<< 27) * PRIME64_1;
-    acc = acc + PRIME64_4;
-    input_ptr += 8; remainingLength -= 8;
-}
-
-if (remainingLength >= 4) {
-    lane = read_32bit_little_endian(input_ptr);
-    acc = acc xor (lane * PRIME64_1);
-    acc = (acc <<< 23) * PRIME64_2;
-    acc = acc + PRIME64_3;
-    input_ptr += 4; remainingLength -= 4;
-}
-
-while (remainingLength >= 1) {
-    lane = read_byte(input_ptr);
-    acc = acc xor (lane * PRIME64_5);
-    acc = (acc <<< 11) * PRIME64_1;
-    input_ptr += 1; remainingLength -= 1;
-}
+  while (remainingLength >= 8) {
+      lane = read_64bit_little_endian(input_ptr);
+      acc = acc xor round(0, lane);
+      acc = (acc <<< 27) * PRIME64_1;
+      acc = acc + PRIME64_4;
+      input_ptr += 8; remainingLength -= 8;
+  }
+
+  if (remainingLength >= 4) {
+      lane = read_32bit_little_endian(input_ptr);
+      acc = acc xor (lane * PRIME64_1);
+      acc = (acc <<< 23) * PRIME64_2;
+      acc = acc + PRIME64_3;
+      input_ptr += 4; remainingLength -= 4;
+  }
+
+  while (remainingLength >= 1) {
+      lane = read_byte(input_ptr);
+      acc = acc xor (lane * PRIME64_5);
+      acc = (acc <<< 11) * PRIME64_1;
+      input_ptr += 1; remainingLength -= 1;
+  }
 ```
 
 This process ensures that all input bytes are present in the final mix.
@@ -311,11 +311,11 @@ This process ensures that all input bytes are present in the final mix.
 The final mix ensures that all input bits have a chance to impact any bit in the output digest, resulting in an unbiased distribution. This is also called avalanche effect.
 
 ```c
-acc = acc xor (acc >> 33);
-acc = acc * PRIME64_2;
-acc = acc xor (acc >> 29);
-acc = acc * PRIME64_3;
-acc = acc xor (acc >> 32);
+  acc = acc xor (acc >> 33);
+  acc = acc * PRIME64_2;
+  acc = acc xor (acc >> 29);
+  acc = acc * PRIME64_3;
+  acc = acc xor (acc >> 32);
 ```
 
 ### Step 7. Output

From 164e1604709d9fcd24f20d54bcd2db9a747b02f3 Mon Sep 17 00:00:00 2001
From: Matthias Gabriel <matthias.gabriel@etit.tu-chemnitz.de>
Date: Mon, 8 Mar 2021 15:46:44 +0100
Subject: [PATCH 067/187] fix soversion, compile flags and pkg-config

---
 cmake_unofficial/CMakeLists.txt | 31 +++++++++++++------------------
 1 file changed, 13 insertions(+), 18 deletions(-)

diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 41c71121..80ecbe5e 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -78,7 +78,7 @@ if (BUILD_SHARED_LIBS)
   target_compile_definitions(xxhash PUBLIC XXH_EXPORT)
 endif ()
 set_target_properties(xxhash PROPERTIES
-  SOVERSION "${XXHASH_VERSION_STRING}"
+  SOVERSION "${XXHASH_LIB_SOVERSION}"
   VERSION "${XXHASH_VERSION_STRING}")
 
 if(XXHASH_BUILD_XXHSUM)
@@ -100,23 +100,6 @@ include (CheckCCompilerFlag)
 if (XXHASH_C_FLAGS)
   set(CMAKE_C_FLAGS "${CMAKE_C_FLAGS} ${XXHASH_C_FLAGS}")
 endif()
-foreach (flag
-    -Wall -Wextra -Wcast-qual -Wcast-align -Wshadow
-    -Wstrict-aliasing=1 -Wswitch-enum -Wdeclaration-after-statement
-    -Wstrict-prototypes -Wundef)
-  # Because https://gcc.gnu.org/wiki/FAQ#wnowarning
-  string(REGEX REPLACE "\\-Wno\\-(.+)" "-W\\1" flag_to_test "${flag}")
-  string(REGEX REPLACE "[^a-zA-Z0-9]+" "_" test_name "CFLAG_${flag_to_test}")
-
-  check_c_compiler_flag("${ADD_COMPILER_FLAGS_PREPEND} ${flag_to_test}" ${test_name})
-
-  if(${test_name})
-    set(CMAKE_C_FLAGS "${flag} ${CMAKE_C_FLAGS}")
-  endif()
-
-  unset(test_name)
-  unset(flag_to_test)
-endforeach (flag)
 
 if(NOT XXHASH_BUNDLED_MODE)
   include(GNUInstallDirs)
@@ -170,4 +153,16 @@ if(NOT XXHASH_BUNDLED_MODE)
   install(EXPORT xxHashTargets
     DESTINATION ${xxHash_CONFIG_INSTALL_DIR}
     NAMESPACE ${PROJECT_NAME}::)
+
+  # configure and install pkg-config
+  set(PREFIX ${CMAKE_INSTALL_PREFIX})
+  set(EXECPREFIX "\${prefix}")
+  set(INCLUDEDIR "${CMAKE_INSTALL_INCLUDEDIR}")
+  set(LIBDIR "${CMAKE_INSTALL_LIBDIR}")
+  set(VERSION "${XXHASH_VERSION_STRING}")
+  configure_file(${XXHASH_DIR}/libxxhash.pc.in ${CMAKE_BINARY_DIR}/libxxhash.pc @ONLY)
+
+  install(FILES ${CMAKE_BINARY_DIR}/libxxhash.pc
+    DESTINATION ${CMAKE_INSTALL_LIBDIR}/pkgconfig)
+
 endif(NOT XXHASH_BUNDLED_MODE)

From 34b51c52b94e52f0d733ad3e390f37d21a2385eb Mon Sep 17 00:00:00 2001
From: "P.M" <60963077+goodengineer@users.noreply.github.com>
Date: Fri, 12 Mar 2021 09:57:40 +0200
Subject: [PATCH 068/187] Update multiInclude.c

---
 tests/multiInclude.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tests/multiInclude.c b/tests/multiInclude.c
index 7d2bc8a9..650f38e8 100644
--- a/tests/multiInclude.c
+++ b/tests/multiInclude.c
@@ -32,7 +32,7 @@
 
 /*
  * Advanced include, gives access to experimental symbols
- * This test ensure that xxhash.h can be included multiple times and in any
+ * This test ensures that xxhash.h can be included multiple times and in any
  * order. This order is more difficult: Without care, the declaration of
  * experimental symbols could be skipped.
  */

From 4dab8579451e5e21e3c422ad740e2ebc3c926299 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Fri, 12 Mar 2021 10:44:55 -0800
Subject: [PATCH 069/187] strict c90 compliance

added travisCI tests
added minor fixes
---
 .travis.yml |  8 ++++++--
 xxhash.h    | 22 ++++++++++++----------
 2 files changed, 18 insertions(+), 12 deletions(-)

diff --git a/.travis.yml b/.travis.yml
index 83476869..9c0b4b59 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -21,9 +21,13 @@ matrix:
         - make clean
         - CFLAGS="-Werror" MOREFLAGS="-Wno-sign-conversion" make dispatch  # removing sign conversion warnings due to a bug in gcc-5's definition of some AVX512 intrinsics
         - make clean
-        - CC=g++ CFLAGS="-O1 -mavx512f -Werror" make
+        - CFLAGS="-O1 -mavx512f -Werror" make
         - make clean
-        - CC=g++ CFLAGS="-Wall -Wextra -Werror" make DISPATCH=1
+        - CFLAGS="-Wall -Wextra -Werror" make DISPATCH=1
+        - make clean
+        - CFLAGS="-std=c90 -pedantic -Wno-long-long -Werror" make xxhsum  # check C90 compliance
+        - make clean
+        - CFLAGS="-std=c90 -pedantic -Werror" CPPFLAGS="-DXXH_NO_LONG_LONG" make libxxhash  # do not use long-long type, effectively reduced to XXH32
 
 
     - name: Check results consistency on x64
diff --git a/xxhash.h b/xxhash.h
index cd4aebc9..a138ca0b 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1703,11 +1703,12 @@ XXH_PUBLIC_API unsigned XXH_versionNumber (void) { return XXH_VERSION_NUMBER; }
  * @ingroup impl
  * @{
  */
-static const xxh_u32 XXH_PRIME32_1 = 0x9E3779B1U;   /*!< 0b10011110001101110111100110110001 */
-static const xxh_u32 XXH_PRIME32_2 = 0x85EBCA77U;   /*!< 0b10000101111010111100101001110111 */
-static const xxh_u32 XXH_PRIME32_3 = 0xC2B2AE3DU;   /*!< 0b11000010101100101010111000111101 */
-static const xxh_u32 XXH_PRIME32_4 = 0x27D4EB2FU;   /*!< 0b00100111110101001110101100101111 */
-static const xxh_u32 XXH_PRIME32_5 = 0x165667B1U;   /*!< 0b00010110010101100110011110110001 */
+ /* #define instead of static const, to be used as initializers */
+#define XXH_PRIME32_1  0x9E3779B1U  /*!< 0b10011110001101110111100110110001 */
+#define XXH_PRIME32_2  0x85EBCA77U  /*!< 0b10000101111010111100101001110111 */
+#define XXH_PRIME32_3  0xC2B2AE3DU  /*!< 0b11000010101100101010111000111101 */
+#define XXH_PRIME32_4  0x27D4EB2FU  /*!< 0b00100111110101001110101100101111 */
+#define XXH_PRIME32_5  0x165667B1U  /*!< 0b00010110010101100110011110110001 */
 
 #ifdef XXH_OLD_NAMES
 #  define PRIME32_1 XXH_PRIME32_1
@@ -2285,11 +2286,12 @@ XXH_readLE64_align(const void* ptr, XXH_alignment align)
  * @ingroup impl
  * @{
  */
-static const xxh_u64 XXH_PRIME64_1 = 0x9E3779B185EBCA87ULL;   /*!< 0b1001111000110111011110011011000110000101111010111100101010000111 */
-static const xxh_u64 XXH_PRIME64_2 = 0xC2B2AE3D27D4EB4FULL;   /*!< 0b1100001010110010101011100011110100100111110101001110101101001111 */
-static const xxh_u64 XXH_PRIME64_3 = 0x165667B19E3779F9ULL;   /*!< 0b0001011001010110011001111011000110011110001101110111100111111001 */
-static const xxh_u64 XXH_PRIME64_4 = 0x85EBCA77C2B2AE63ULL;   /*!< 0b1000010111101011110010100111011111000010101100101010111001100011 */
-static const xxh_u64 XXH_PRIME64_5 = 0x27D4EB2F165667C5ULL;   /*!< 0b0010011111010100111010110010111100010110010101100110011111000101 */
+/* #define rather that static const, to be used as initializers */
+#define XXH_PRIME64_1  0x9E3779B185EBCA87ULL  /*!< 0b1001111000110111011110011011000110000101111010111100101010000111 */
+#define XXH_PRIME64_2  0xC2B2AE3D27D4EB4FULL  /*!< 0b1100001010110010101011100011110100100111110101001110101101001111 */
+#define XXH_PRIME64_3  0x165667B19E3779F9ULL  /*!< 0b0001011001010110011001111011000110011110001101110111100111111001 */
+#define XXH_PRIME64_4  0x85EBCA77C2B2AE63ULL  /*!< 0b1000010111101011110010100111011111000010101100101010111001100011 */
+#define XXH_PRIME64_5  0x27D4EB2F165667C5ULL  /*!< 0b0010011111010100111010110010111100010110010101100110011111000101 */
 
 #ifdef XXH_OLD_NAMES
 #  define PRIME64_1 XXH_PRIME64_1

From a9054f397d7f41bc505638df3853b270eb9e7493 Mon Sep 17 00:00:00 2001
From: Jan <jsteemann@users.noreply.github.com>
Date: Wed, 17 Mar 2021 17:51:40 +0100
Subject: [PATCH 070/187] Small improvements (#515)

* fix typo in README.md

* fix typo in code comment

* remove superfluous space chars at end of output strings

* partially revert changes
---
 README.md | 2 +-
 xxh3.h    | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 7d9ae9dd..df07e382 100644
--- a/README.md
+++ b/README.md
@@ -75,7 +75,7 @@ and passes all tests, ensuring reasonable quality levels.
 It also passes extended tests from [newer forks of SMHasher], featuring additional scenarios and conditions.
 
 Finally, xxHash provides its own [massive collision tester](https://github.com/Cyan4973/xxHash/tree/dev/tests/collisions),
-able to generate and compare billions of hash to test the limits of 64-bit hash algorithms.
+able to generate and compare billions of hashes to test the limits of 64-bit hash algorithms.
 On this front too, xxHash features good results, in line with the [birthday paradox].
 A more detailed analysis is documented [in the wiki](https://github.com/Cyan4973/xxHash/wiki/Collision-ratio-comparison).
 
diff --git a/xxh3.h b/xxh3.h
index 7e83e641..f7dc1959 100644
--- a/xxh3.h
+++ b/xxh3.h
@@ -42,7 +42,7 @@
  * but it is still provided for compatibility with source code
  * which used to include it directly.
  *
- * Programs are now highly discourage to include xxh3.h.
+ * Programs are now highly discouraged to include xxh3.h.
  * Include `xxhash.h` instead, which is the officially supported interface.
  *
  * In the future, xxh3.h will start to generate warnings, then errors,

From 94e7193eeaad2829f70aef9b9de4e52fac9bef5d Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Thu, 25 Mar 2021 17:59:27 -0400
Subject: [PATCH 071/187] AArch64 tuning, put asm guard in macro (#519)

- AArch64 does not benefit enough from unrolling XXH64_finalize to
   justify the code size increase.
 - Include aarch64 in the XXH32 asm guard. Clang autovectorizes this
   incorrectly. 2.5->4 GB/s on Clang 11 + Snapdragon 730G.
 - Replace the asm guards with a macro.
---
 xxhash.h | 67 +++++++++++++++++++++++++++++++-------------------------
 1 file changed, 37 insertions(+), 30 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index a138ca0b..74921992 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1394,6 +1394,27 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 /* note: use after variable declarations */
 #define XXH_STATIC_ASSERT(c)  do { enum { XXH_sa = 1/(int)(!!(c)) }; } while (0)
 
+/*!
+ * @internal
+ * @def XXH_COMPILER_GUARD(var)
+ * @brief Used to prevent unwanted optimizations for @p var.
+ *
+ * It uses an empty GCC inline assembly statement with a register constraint
+ * which forces @p var into a general purpose register (eg eax, ebx, ecx
+ * on x86) and marks it as modified.
+ *
+ * This is used in a few places to avoid unwanted autovectorization (e.g.
+ * XXH32_round()). All vectorization we want is explicit via intrinsics,
+ * and _usually_ isn't wanted elsewhere.
+ *
+ * We also use it to prevent unwanted constant folding for AArch64 in
+ * XXH3_initCustomSecret_scalar().
+ */
+#ifdef __GNUC__
+#  define XXH_COMPILER_GUARD(var) __asm__ __volatile__("" : "+r" (var))
+#else
+#  define XXH_COMPILER_GUARD(var) ((void)0)
+#endif
 
 /* *************************************
 *  Basic Types
@@ -1734,13 +1755,12 @@ static xxh_u32 XXH32_round(xxh_u32 acc, xxh_u32 input)
     acc += input * XXH_PRIME32_2;
     acc  = XXH_rotl32(acc, 13);
     acc *= XXH_PRIME32_1;
-#if defined(__GNUC__) && defined(__SSE4_1__) && !defined(XXH_ENABLE_AUTOVECTORIZE)
+#if (defined(__SSE4_1__) || defined(__aarch64__)) && !defined(XXH_ENABLE_AUTOVECTORIZE)
     /*
      * UGLY HACK:
-     * This inline assembly hack forces acc into a normal register. This is the
-     * only thing that prevents GCC and Clang from autovectorizing the XXH32
-     * loop (pragmas and attributes don't work for some reason) without globally
-     * disabling SSE4.1.
+     * A compiler fence is the only thing that prevents GCC and Clang from
+     * autovectorizing the XXH32 loop (pragmas and attributes don't work for some
+     * reason) without globally disabling SSE4.1.
      *
      * The reason we want to avoid vectorization is because despite working on
      * 4 integers at a time, there are multiple factors slowing XXH32 down on
@@ -1765,22 +1785,11 @@ static xxh_u32 XXH32_round(xxh_u32 acc, xxh_u32 input)
      *   can load data, while v3 can multiply. SSE forces them to operate
      *   together.
      *
-     * How this hack works:
-     * __asm__(""       // Declare an assembly block but don't declare any instructions
-     *          :       // However, as an Input/Output Operand,
-     *          "+r"    // constrain a read/write operand (+) as a general purpose register (r).
-     *          (acc)   // and set acc as the operand
-     * );
-     *
-     * Because of the 'r', the compiler has promised that seed will be in a
-     * general purpose register and the '+' says that it will be 'read/write',
-     * so it has to assume it has changed. It is like volatile without all the
-     * loads and stores.
-     *
-     * Since the argument has to be in a normal register (not an SSE register),
-     * each time XXH32_round is called, it is impossible to vectorize.
+     * This is also enabled on AArch64, as Clang autovectorizes it incorrectly
+     * and it is pointless writing a NEON implementation that is basically the
+     * same speed as scalar for XXH32.
      */
-    __asm__("" : "+r" (acc));
+    XXH_COMPILER_GUARD(acc);
 #endif
     return acc;
 }
@@ -2149,12 +2158,14 @@ typedef XXH64_hash_t xxh_u64;
  * also slightly faster because it fits into cache better and is more likely
  * to be inlined by the compiler.
  *
+ * Unrolling XXH64 is also disabled on AArch64. While it is a 64-bit platform,
+ * there isn't enough benefit to justify the larger code size.
+ *
  * If XXH_REROLL is defined, this is ignored and the loop is always rerolled.
  */
 #ifndef XXH_REROLL_XXH64
 #  if (defined(__ILP32__) || defined(_ILP32)) /* ILP32 is often defined on 32-bit GCC family */ \
    || !(defined(__x86_64__) || defined(_M_X64) || defined(_M_AMD64) /* x86-64 */ \
-     || defined(_M_ARM64) || defined(__aarch64__) || defined(__arm64__) /* aarch64 */ \
      || defined(__PPC64__) || defined(__PPC64LE__) || defined(__ppc64__) || defined(__powerpc64__) /* ppc64 */ \
      || defined(__mips64__) || defined(__mips64)) /* mips64 */ \
    || (!defined(SIZE_MAX) || SIZE_MAX < ULLONG_MAX) /* check limits */
@@ -3531,7 +3542,7 @@ XXH_FORCE_INLINE xxh_u64 XXH3_mix16B(const xxh_u8* XXH_RESTRICT input,
      * GCC generates much better scalar code than Clang for the rest of XXH3,
      * which is why finding a more optimal codepath is an interest.
      */
-    __asm__ ("" : "+r" (seed64));
+    XXH_COMPILER_GUARD(seed64);
 #endif
     {   xxh_u64 const input_lo = XXH_readLE64(input);
         xxh_u64 const input_hi = XXH_readLE64(input+8);
@@ -3875,12 +3886,8 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void XXH3_initCustomSecret_avx2(void* XXH_RESTR
          * On GCC & Clang, marking 'dest' as modified will cause the compiler:
          *   - do not extract the secret from sse registers in the internal loop
          *   - use less common registers, and avoid pushing these reg into stack
-         * The asm hack causes Clang to assume that XXH3_kSecretPtr aliases with
-         * customSecret, and on aarch64, this prevented LDP from merging two
-         * loads together for free. Putting the loads together before the stores
-         * properly generates LDP.
          */
-        __asm__("" : "+r" (dest));
+        XXH_COMPILER_GUARD(dest);
 #       endif
 
         /* GCC -O2 need unroll loop manually */
@@ -3989,7 +3996,7 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void XXH3_initCustomSecret_sse2(void* XXH_RESTR
          *   - do not extract the secret from sse registers in the internal loop
          *   - use less common registers, and avoid pushing these reg into stack
          */
-        __asm__("" : "+r" (dest));
+        XXH_COMPILER_GUARD(dest);
 #       endif
 
         for (i=0; i < nbRounds; ++i) {
@@ -4235,7 +4242,7 @@ XXH3_initCustomSecret_scalar(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
      *   without hack: 2654.4 MB/s
      *   with hack:    3202.9 MB/s
      */
-    __asm__("" : "+r" (kSecretPtr));
+    XXH_COMPILER_GUARD(kSecretPtr);
 #endif
     /*
      * Note: in debug mode, this overrides the asm optimization
@@ -4400,7 +4407,7 @@ XXH3_mergeAccs(const xxh_u64* XXH_RESTRICT acc, const xxh_u8* XXH_RESTRICT secre
          *   without hack: 2063.7 MB/s
          *   with hack:    2560.7 MB/s
          */
-        __asm__("" : "+r" (result64));
+        XXH_COMPILER_GUARD(result64);
 #endif
     }
 

From 7bf3d9f331d0b7d0f5856ae1894e0314e2b304c2 Mon Sep 17 00:00:00 2001
From: Yann Collet <Cyan4973@users.noreply.github.com>
Date: Fri, 26 Mar 2021 08:28:41 -0700
Subject: [PATCH 072/187] removed XXH64's switch finalizer (#521)

which performs generally worse than simpler loop finalizer
(see https://github.com/Cyan4973/xxHash/pull/519#issuecomment-807868078)
especially on 32-bit / arm systems.
The switch finalizer also largely increases the binary size of XXH64 function.

removed XXH_REROLL_XXH64 which is no longer needed.

simplifies the code base.
---
 xxhash.h | 169 +++++++------------------------------------------------
 1 file changed, 19 insertions(+), 150 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 74921992..7950691a 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2144,37 +2144,6 @@ typedef XXH64_hash_t xxh_u64;
 #  define U64 xxh_u64
 #endif
 
-/*!
- * XXH_REROLL_XXH64:
- * Whether to reroll the XXH64_finalize() loop.
- *
- * Just like XXH32, we can unroll the XXH64_finalize() loop. This can be a
- * performance gain on 64-bit hosts, as only one jump is required.
- *
- * However, on 32-bit hosts, because arithmetic needs to be done with two 32-bit
- * registers, and 64-bit arithmetic needs to be simulated, it isn't beneficial
- * to unroll. The code becomes ridiculously large (the largest function in the
- * binary on i386!), and rerolling it saves anywhere from 3kB to 20kB. It is
- * also slightly faster because it fits into cache better and is more likely
- * to be inlined by the compiler.
- *
- * Unrolling XXH64 is also disabled on AArch64. While it is a 64-bit platform,
- * there isn't enough benefit to justify the larger code size.
- *
- * If XXH_REROLL is defined, this is ignored and the loop is always rerolled.
- */
-#ifndef XXH_REROLL_XXH64
-#  if (defined(__ILP32__) || defined(_ILP32)) /* ILP32 is often defined on 32-bit GCC family */ \
-   || !(defined(__x86_64__) || defined(_M_X64) || defined(_M_AMD64) /* x86-64 */ \
-     || defined(__PPC64__) || defined(__PPC64LE__) || defined(__ppc64__) || defined(__powerpc64__) /* ppc64 */ \
-     || defined(__mips64__) || defined(__mips64)) /* mips64 */ \
-   || (!defined(SIZE_MAX) || SIZE_MAX < ULLONG_MAX) /* check limits */
-#    define XXH_REROLL_XXH64 1
-#  else
-#    define XXH_REROLL_XXH64 0
-#  endif
-#endif /* !defined(XXH_REROLL_XXH64) */
-
 #if (defined(XXH_FORCE_MEMORY_ACCESS) && (XXH_FORCE_MEMORY_ACCESS==3))
 /*
  * Manual byteshift. Best for old compilers which don't inline memcpy.
@@ -2344,126 +2313,26 @@ static xxh_u64 XXH64_avalanche(xxh_u64 h64)
 static xxh_u64
 XXH64_finalize(xxh_u64 h64, const xxh_u8* ptr, size_t len, XXH_alignment align)
 {
-#define XXH_PROCESS1_64 do {                                   \
-    h64 ^= (*ptr++) * XXH_PRIME64_5;                           \
-    h64 = XXH_rotl64(h64, 11) * XXH_PRIME64_1;                 \
-} while (0)
-
-#define XXH_PROCESS4_64 do {                                   \
-    h64 ^= (xxh_u64)(XXH_get32bits(ptr)) * XXH_PRIME64_1;      \
-    ptr += 4;                                              \
-    h64 = XXH_rotl64(h64, 23) * XXH_PRIME64_2 + XXH_PRIME64_3;     \
-} while (0)
-
-#define XXH_PROCESS8_64 do {                                   \
-    xxh_u64 const k1 = XXH64_round(0, XXH_get64bits(ptr)); \
-    ptr += 8;                                              \
-    h64 ^= k1;                                             \
-    h64  = XXH_rotl64(h64,27) * XXH_PRIME64_1 + XXH_PRIME64_4;     \
-} while (0)
-
-    /* Rerolled version for 32-bit targets is faster and much smaller. */
-    if (XXH_REROLL || XXH_REROLL_XXH64) {
-        len &= 31;
-        while (len >= 8) {
-            XXH_PROCESS8_64;
-            len -= 8;
-        }
-        if (len >= 4) {
-            XXH_PROCESS4_64;
-            len -= 4;
-        }
-        while (len > 0) {
-            XXH_PROCESS1_64;
-            --len;
-        }
-         return  XXH64_avalanche(h64);
-    } else {
-        switch(len & 31) {
-           case 24: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 16: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case  8: XXH_PROCESS8_64;
-                    return XXH64_avalanche(h64);
-
-           case 28: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 20: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 12: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case  4: XXH_PROCESS4_64;
-                    return XXH64_avalanche(h64);
-
-           case 25: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 17: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case  9: XXH_PROCESS8_64;
-                    XXH_PROCESS1_64;
-                    return XXH64_avalanche(h64);
-
-           case 29: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 21: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 13: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case  5: XXH_PROCESS4_64;
-                    XXH_PROCESS1_64;
-                    return XXH64_avalanche(h64);
-
-           case 26: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 18: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 10: XXH_PROCESS8_64;
-                    XXH_PROCESS1_64;
-                    XXH_PROCESS1_64;
-                    return XXH64_avalanche(h64);
-
-           case 30: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 22: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 14: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case  6: XXH_PROCESS4_64;
-                    XXH_PROCESS1_64;
-                    XXH_PROCESS1_64;
-                    return XXH64_avalanche(h64);
-
-           case 27: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 19: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 11: XXH_PROCESS8_64;
-                    XXH_PROCESS1_64;
-                    XXH_PROCESS1_64;
-                    XXH_PROCESS1_64;
-                    return XXH64_avalanche(h64);
-
-           case 31: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 23: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case 15: XXH_PROCESS8_64;
-                         /* fallthrough */
-           case  7: XXH_PROCESS4_64;
-                         /* fallthrough */
-           case  3: XXH_PROCESS1_64;
-                         /* fallthrough */
-           case  2: XXH_PROCESS1_64;
-                         /* fallthrough */
-           case  1: XXH_PROCESS1_64;
-                         /* fallthrough */
-           case  0: return XXH64_avalanche(h64);
-        }
+    len &= 31;
+    while (len >= 8) {
+        xxh_u64 const k1 = XXH64_round(0, XXH_get64bits(ptr));
+        ptr += 8;
+        h64 ^= k1;
+        h64  = XXH_rotl64(h64,27) * XXH_PRIME64_1 + XXH_PRIME64_4;
+        len -= 8;
+    }
+    if (len >= 4) {
+        h64 ^= (xxh_u64)(XXH_get32bits(ptr)) * XXH_PRIME64_1;
+        ptr += 4;
+        h64 = XXH_rotl64(h64, 23) * XXH_PRIME64_2 + XXH_PRIME64_3;
+        len -= 4;
+    }
+    while (len > 0) {
+        h64 ^= (*ptr++) * XXH_PRIME64_5;
+        h64 = XXH_rotl64(h64, 11) * XXH_PRIME64_1;
+        --len;
     }
-    /* impossible to reach */
-    XXH_ASSERT(0);
-    return 0;  /* unreachable, but some compilers complain without it */
+    return  XXH64_avalanche(h64);
 }
 
 #ifdef XXH_OLD_NAMES

From 1f2dc264af558c6a6d6e43647915e82c6d4ad7dd Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 8 Apr 2021 09:45:56 -0700
Subject: [PATCH 073/187] fixed incorrect assert condition

close #522
reported by @La-cu-na
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 7950691a..df16dbd9 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3325,7 +3325,7 @@ XXH3_len_4to8_64b(const xxh_u8* input, size_t len, const xxh_u8* secret, XXH64_h
 {
     XXH_ASSERT(input != NULL);
     XXH_ASSERT(secret != NULL);
-    XXH_ASSERT(4 <= len && len < 8);
+    XXH_ASSERT(4 <= len && len <= 8);
     seed ^= (xxh_u64)XXH_swap32((xxh_u32)seed) << 32;
     {   xxh_u32 const input1 = XXH_readLE32(input);
         xxh_u32 const input2 = XXH_readLE32(input + len - 4);

From 55be05e5c8ff90b6da6e9eb1fe09ced26385e165 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 18 Apr 2021 09:06:20 -0700
Subject: [PATCH 074/187] added compilation flag XXH_NO_XXH3

which removes generation of XXH3 symbols from library
resulting in smaller binary size.
---
 .travis.yml |  6 +++---
 Makefile    | 15 +++++++++++++--
 README.md   |  2 ++
 xxhash.h    |  4 +++-
 4 files changed, 21 insertions(+), 6 deletions(-)

diff --git a/.travis.yml b/.travis.yml
index 9c0b4b59..9f9e42ca 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -25,9 +25,9 @@ matrix:
         - make clean
         - CFLAGS="-Wall -Wextra -Werror" make DISPATCH=1
         - make clean
-        - CFLAGS="-std=c90 -pedantic -Wno-long-long -Werror" make xxhsum  # check C90 compliance
-        - make clean
-        - CFLAGS="-std=c90 -pedantic -Werror" CPPFLAGS="-DXXH_NO_LONG_LONG" make libxxhash  # do not use long-long type, effectively reduced to XXH32
+        - CFLAGS="-std=c90 -pedantic -Wno-long-long -Werror" make xxhsum  # check C90 + long long compliance
+        - make c90test # strict c90, with no long long support; resulting in no XXH64_* symbol
+        - make noxxh3test # check library can be compiled with XXH_NO_XXH3, resulting in no XXH3_* symbol
 
 
     - name: Check results consistency on x64
diff --git a/Makefile b/Makefile
index 3b174314..466aef70 100644
--- a/Makefile
+++ b/Makefile
@@ -151,9 +151,10 @@ lib: libxxhash.a libxxhash
 
 # helper targets
 
-AWK = awk
+AWK  = awk
 GREP = grep
 SORT = sort
+NM   = nm
 
 .PHONY: list
 list:  ## list all Makefile targets
@@ -305,10 +306,20 @@ c90test: CFLAGS += -std=c90 -Werror -pedantic
 c90test: xxhash.c
 	@echo ---- test strict C90 compilation [xxh32 only] ----
 	$(RM) xxhash.o
-	$(CC) $(FLAGS) $^ $(LDFLAGS) -c
+	$(CC) $(FLAGS) $^ -c
+	$(NM) xxhash.o | $(GREP) XXH64 ; test $$? -eq 1
 	$(RM) xxhash.o
 endif
 
+noxxh3test: CPPFLAGS += -DXXH_NO_XXH3
+noxxh3test: CFLAGS += -Werror -pedantic
+noxxh3test: xxhash.c
+	@echo ---- test compilation without XXH3 ----
+	$(RM) xxhash.o
+	$(CC) $(FLAGS) $^ -c
+	$(NM) xxhash.o | $(GREP) XXH3_ ; test $$? -eq 1
+	$(RM) xxhash.o
+
 .PHONY: usan
 usan: CC=clang
 usan: CXX=clang++
diff --git a/README.md b/README.md
index df07e382..e5732f63 100644
--- a/README.md
+++ b/README.md
@@ -125,6 +125,8 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                                    Adds one branch at the beginning of each hash.
 - `XXH_STATIC_LINKING_ONLY`: gives access to the state declaration for static allocation.
                              Incompatible with dynamic linking, due to risks of ABI changes.
+- `XXH_NO_XXH3` : removes symbols related to `XXH3` (both 64 & 128 bits) from generated binary.
+                  Useful to reduce binary size, notably for applications which do not use `XXH3`.
 - `XXH_NO_LONG_LONG`: removes compilation of algorithms relying on 64-bit types (XXH3 and XXH64). Only XXH32 will be compiled.
                       Useful for targets (architectures and compilers) without 64-bit support.
 - `XXH_IMPORT`: MSVC specific: should only be defined for dynamic linking, as it prevents linkage errors.
diff --git a/xxhash.h b/xxhash.h
index df16dbd9..12611cae 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2546,7 +2546,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_hashFromCanonical(const XXH64_canonical_t* src
     return XXH_readBE64(src);
 }
 
-
+#ifndef XXH_NO_XXH3
 
 /* *********************************************************************
 *  XXH3
@@ -5312,6 +5312,8 @@ XXH128_hashFromCanonical(const XXH128_canonical_t* src)
 
 #endif  /* XXH_NO_LONG_LONG */
 
+#endif  /* XXH_NO_XXH3 */
+
 /*!
  * @}
  */

From a6b1ea78ae88ab1dfbeda2551f7742c91114bfc1 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Mon, 24 May 2021 20:13:01 +0900
Subject: [PATCH 075/187] Fix "stdin" issue #470

This change set adds special treatment for "stdin" as a filename
in the xxhsum check file.

When `xxhsum -c` recognizes "stdin" as a filename, it automatically reads
data from stdin as a file target of the checksum line.
---
 xxhsum.c | 13 +++++++++----
 1 file changed, 9 insertions(+), 4 deletions(-)

diff --git a/xxhsum.c b/xxhsum.c
index 82324091..48c2cb85 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -93,6 +93,7 @@ static size_t XSUM_DEFAULT_SAMPLE_SIZE = 100 KB;
 #define MAX_MEM    (2 GB - 64 MB)
 
 static const char stdinName[] = "-";
+static const char stdinFileName[] = "stdin";
 typedef enum { algo_xxh32=0, algo_xxh64=1, algo_xxh128=2 } AlgoSelected;
 static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & XSUM_usage() */
 
@@ -654,7 +655,7 @@ static int XSUM_hashFile(const char* fileName,
     /* Check file existence */
     if (fileName == stdinName) {
         inFile = stdin;
-        fileName = "stdin";
+        fileName = stdinFileName; /* "stdin" */
         XSUM_setBinaryMode(stdin);
     } else {
         if (XSUM_isDirectory(fileName)) {
@@ -1053,7 +1054,11 @@ static void XSUM_parseFile1(ParseFileArg* XSUM_parseFileArg, int rev)
         report->nProperlyFormattedLines++;
 
         do {
-            FILE* const fp = XSUM_fopen(parsedLine.filename, "rb");
+            int const fnameIsStdin = (strcmp(parsedLine.filename, stdinFileName) == 0); // "stdin"
+            FILE* const fp = fnameIsStdin ? stdin : XSUM_fopen(parsedLine.filename, "rb");
+            if (fp == stdin) {
+                XSUM_setBinaryMode(stdin);
+            }
             if (fp == NULL) {
                 lineStatus = LineStatus_failedToOpen;
                 break;
@@ -1085,7 +1090,7 @@ static void XSUM_parseFile1(ParseFileArg* XSUM_parseFileArg, int rev)
             default:
                 break;
             }
-            fclose(fp);
+            if (fp != stdin) fclose(fp);
         } while (0);
 
         switch (lineStatus)
@@ -1157,7 +1162,7 @@ static int XSUM_checkFile(const char* inFileName,
          * Note: Since we expect text input for xxhash -c mode,
          * we don't set binary mode for stdin.
          */
-        inFileName = "stdin";
+        inFileName = stdinFileName; /* "stdin" */
         inFile = stdin;
     } else {
         inFile = XSUM_fopen( inFileName, "rt" );

From 5d8a8133eb07ef535de6717a0df75eae64f68ce1 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Mon, 24 May 2021 20:17:28 +0900
Subject: [PATCH 076/187] Add test for issue #470

---
 Makefile | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/Makefile b/Makefile
index 466aef70..b836fdfa 100644
--- a/Makefile
+++ b/Makefile
@@ -223,6 +223,9 @@ test-xxhsum-c: xxhsum
 	# xxhsum to/from pipe
 	./xxhsum xxh* | ./xxhsum -c -
 	./xxhsum -H0 xxh* | ./xxhsum -c -
+	# xxhsum -c is unable to verify checksum of file from STDIN (#470)
+	./xxhsum < README.md > .test.README.md.xxh
+	./xxhsum -c .test.README.md.xxh < README.md
 	# xxhsum -q does not display "Loading" message into stderr (#251)
 	! ./xxhsum -q xxh* 2>&1 | grep Loading
 	# xxhsum does not display "Loading" message into stderr either

From 1a4fd0e6c284d7592106c1a7ede4cb76783e0f7c Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Mon, 24 May 2021 20:53:40 +0900
Subject: [PATCH 077/187] Fix for C90 limitation

---
 xxhsum.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhsum.c b/xxhsum.c
index 48c2cb85..0eca8fa8 100644
--- a/xxhsum.c
+++ b/xxhsum.c
@@ -1054,7 +1054,7 @@ static void XSUM_parseFile1(ParseFileArg* XSUM_parseFileArg, int rev)
         report->nProperlyFormattedLines++;
 
         do {
-            int const fnameIsStdin = (strcmp(parsedLine.filename, stdinFileName) == 0); // "stdin"
+            int const fnameIsStdin = (strcmp(parsedLine.filename, stdinFileName) == 0); /* "stdin" */
             FILE* const fp = fnameIsStdin ? stdin : XSUM_fopen(parsedLine.filename, "rb");
             if (fp == stdin) {
                 XSUM_setBinaryMode(stdin);

From 1528d7c88352c7b23618746e184d4cff97604445 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Wed, 26 May 2021 09:38:43 +0900
Subject: [PATCH 078/187] Create GitHub Actions script ci.yml

---
 .github/workflows/ci.yml | 371 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 371 insertions(+)
 create mode 100644 .github/workflows/ci.yml

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
new file mode 100644
index 00000000..67bb2aba
--- /dev/null
+++ b/.github/workflows/ci.yml
@@ -0,0 +1,371 @@
+# Known critical issues:
+# - AVX512 related tests are disabled.  Because default environment of
+#   GitHub Actions doesn't guarantee to support AVX512.
+#   As of May 2021, they're using Xeon E5-2673 (which doesn't support
+#   AVX512) and Xeon Platinum 8171M (which supports AVX512).
+#   See also https://github.com/actions/runner/issues/1069
+#
+# Known issues:
+# - This test script ignores exit code of cppcheck which can see under
+#   Job:Linux x64 misc tests > cppcheck in the GitHub Actions report.
+#   Because xxHash project doesn't 100% follow their recommendation.
+#   Also sometimes it reports false positives.
+#
+# - GitHub Actions doesn't support Visual Studio 2015 and 2013.
+#   https://github.com/actions/virtual-environments/issues/387
+#
+# - Setup procedure for msys2 environment is painfully slow.  It takes
+#   3..5 minutes.
+#
+# - Sometimes apt-get fails to retrieve package information.
+#   I have absolutely no idea.
+#
+# Notes:
+# - You can investigate various information at the right pane of GitHub
+#   Actions report page.
+#
+#   | Item                      | Section in the right pane             |
+#   | ------------------------- | ------------------------------------- |
+#   | OS, VM                    | Set up job                            |
+#   | git repo, commit hash     | Run actions/checkout@v2               |
+#   | gcc, tools                | Environment info                      |
+#
+# - To fail earlier, oreder of tests in the same job are roughly sorted by
+#   elapsed time.
+#
+# - "ubuntu-latest" (Ubuntu 20.04) has the following software
+#   https://github.com/actions/virtual-environments/blob/main/images/linux/Ubuntu2004-README.md
+#
+# Todos:
+# - [ ] Linux: Add native ARM runner.
+# - [ ] Linux: Add native ARM64 runner.
+# - [ ] Linux: Add native PPC64LE runner.
+# - [ ] Linux: Add native S390X runner.
+# - [ ] Windows: Add VS2013.
+# - [ ] Windows: Add VS2015.
+# - [ ] Windows: Add clang for msys2.
+# - [ ] Windows: Add native or emulated ARM runner.
+# - [ ] Windows: Add native or emulated ARM64 runner.
+
+
+# Name of the workflow is also displayed as a SVG badge
+name: xxHash CI tests
+
+on: [push, pull_request]
+
+jobs:
+
+  # Linux, x64
+
+  ubuntu-general:
+    name: Linux x64
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v2 # https://github.com/actions/checkout
+
+    - name: apt-get install
+      run: |
+        sudo apt-get install gcc-multilib
+
+    - name: Environment info
+      run: |
+        echo && gcc --version
+        echo && clang --version
+        echo && make -v
+        echo && cat /proc/cpuinfo || echo /proc/cpuinfo is not present
+
+    - name: C90 + no-long-long compliance
+      run: |
+        CFLAGS="-std=c90 -pedantic -Wno-long-long -Werror" make clean xxhsum
+
+    - name: C90 + XXH_NO_LONG_LONG
+      run: |
+        # strict c90, with no long long support; resulting in no XXH64_* symbol
+        make clean c90test
+
+    - name: dispatch
+      run: |
+        # removing sign conversion warnings due to a bug in gcc-5's definition of some AVX512 intrinsics
+        CFLAGS="-Werror" MOREFLAGS="-Wno-sign-conversion" make clean dispatch
+
+    - name: DISPATCH=1
+      run: |
+        CFLAGS="-Wall -Wextra -Werror" make DISPATCH=1 clean default
+
+    - name: noxxh3test
+      run: |
+        # check library can be compiled with XXH_NO_XXH3, resulting in no XXH3_* symbol
+        make clean noxxh3test
+
+# As for AVX512, see "Known critical issues" at the top of this file
+#   - name: make avx512f
+#     run: |
+#       CFLAGS="-O1 -mavx512f -Werror" make clean default
+
+    - name: test-all
+      run: |
+        make clean test-all
+
+
+  ubuntu-consistency:
+    name: Linux x64 check results consistency
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v2 # https://github.com/actions/checkout
+
+    - name: Environment info
+      run: |
+        echo && gcc --version
+        echo && make -v
+        echo && cat /proc/cpuinfo || echo /proc/cpuinfo is not present
+
+    - name: Scalar code path
+      run: |
+        CPPFLAGS=-DXXH_VECTOR=XXH_SCALAR make clean check
+
+    - name: SSE2 code path
+      run: |
+        CPPFLAGS=-DXXH_VECTOR=XXH_SSE2 make clean check
+
+    - name: AVX2 code path
+      run: |
+        CPPFLAGS="-mavx2 -DXXH_VECTOR=XXH_AVX2" make clean check
+
+# As for AVX512, see "Known critical issues" at the top of this file
+#   - name: AVX512 code path
+#     run: |
+#       CPPFLAGS="-mavx512f -DXXH_VECTOR=XXH_AVX512" make clean check
+
+    - name: reroll code path (#240)
+      run: |
+        CPPFLAGS=-DXXH_REROLL=1 make clean check
+
+    - name: tests/bench
+      run: |
+        make -C tests/bench
+
+
+  ubuntu-misc:
+    name: Linux x64 misc tests
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v2 # https://github.com/actions/checkout
+
+    - name: apt-get install
+      run: |
+        sudo apt-get install valgrind cppcheck
+
+    - name: Environment info
+      run: |
+        echo && gcc --version
+        echo && clang --version
+        echo && valgrind --version
+        echo && cppcheck --version
+        echo && make -v
+        echo && cat /proc/cpuinfo || echo /proc/cpuinfo is not present
+
+    - name: cppcheck
+      run: |
+        make clean cppcheck || echo There are some cppcheck reports
+
+    - name: test-mem (valgrind)
+      run: |
+        make clean test-mem
+
+
+  ubuntu-cmake-unofficial:
+    name: Linux x64 cmake unofficial build test
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v2 # https://github.com/actions/checkout
+
+    - name: Environment info
+      run: |
+        echo && gcc --version
+        echo && cmake --version
+        echo && make -v
+        echo && cat /proc/cpuinfo || echo /proc/cpuinfo is not present
+
+    - name: cmake
+      run: |
+        cd cmake_unofficial
+        mkdir build
+        cd build
+        cmake ..
+        CFLAGS=-Werror make
+
+
+  # Linux, { ARM, ARM64, PPC64LE, S390X }
+  # All tests are using QEMU and gcc cross compiler.
+
+  qemu-consistency:
+    name: QEMU ${{ matrix.name }}
+    runs-on: ubuntu-latest
+    strategy:
+      fail-fast: false  # 'false' means Don't stop matrix workflows even if some matrix failed.
+      matrix:
+        include: [
+          { name: ARM,      xcc_pkg: gcc-arm-linux-gnueabi,     xcc: arm-linux-gnueabi-gcc,     xemu_pkg: qemu-system-arm,    xemu: qemu-arm-static     },
+          { name: ARM64,    xcc_pkg: gcc-aarch64-linux-gnu,     xcc: aarch64-linux-gnu-gcc,     xemu_pkg: qemu-system-arm,    xemu: qemu-aarch64-static },
+          { name: PPC64LE,  xcc_pkg: gcc-powerpc64le-linux-gnu, xcc: powerpc64le-linux-gnu-gcc, xemu_pkg: qemu-system-ppc,    xemu: qemu-ppc64le-static },
+          { name: S390X,    xcc_pkg: gcc-s390x-linux-gnu,       xcc: s390x-linux-gnu-gcc,       xemu_pkg: qemu-system-s390x,  xemu: qemu-s390x-static   },
+        ]
+    env:                        # Set environment variables
+      XCC: ${{ matrix.xcc }}
+      XEMU: ${{ matrix.xemu }}
+    steps:
+    - uses: actions/checkout@v2 # https://github.com/actions/checkout
+    - name: apt update & install
+      run: |
+        sudo apt-get update
+        sudo apt-get install gcc-multilib g++-multilib qemu-utils qemu-user-static
+        sudo apt-get install ${{ matrix.xcc_pkg }} ${{ matrix.xemu_pkg }} 
+
+    - name: Environment info
+      run: |
+        echo && which $XCC
+        echo && $XCC --version
+        echo && $XCC -v  # Show built-in specs
+        echo && which $XEMU
+        echo && $XEMU --version
+
+    - name: ARM (XXH_VECTOR=[ scalar, NEON ])
+      if: ${{ matrix.name == 'ARM' }}
+      run: |
+        CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+        CPPFLAGS="-DXXH_VECTOR=XXH_NEON" CFLAGS="-O3 -march=armv7-a -fPIC -mfloat-abi=softfp -mfpu=neon-vfpv4" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+
+    - name: ARM64 (XXH_VECTOR=[ scalar, NEON ])
+      if: ${{ matrix.name == 'ARM64' }}
+      run: |
+        CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+        CPPFLAGS="-DXXH_VECTOR=XXH_NEON" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+
+    - name: PPC64LE (XXH_VECTOR=[ scalar, VSX ])
+      if: ${{ matrix.name == 'PPC64LE' }}
+      run: |
+        CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+        CPPFLAGS="-DXXH_VECTOR=XXH_VSX" CFLAGS="-O3 -maltivec -mvsx -mpower8-vector -mcpu=power8" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+
+    - name: S390X (XXH_VECTOR=[ scalar, VSX ])
+      if: ${{ matrix.name == 'S390X' }}
+      run: |
+        CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+        CPPFLAGS=-DXXH_VECTOR=XXH_VSX CFLAGS="-O3 -march=arch11 -mzvector" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
+
+
+  # macOS
+
+  macos-latest-general:
+    name: macOS general test
+    runs-on: macos-latest
+    steps:
+    - uses: actions/checkout@v2
+
+    - name: Environment info
+      run: |
+        echo && clang --version
+        echo && sysctl -a | grep machdep.cpu   # cpuinfo
+
+    - name: make
+      run: |
+        CFLAGS="-Werror" make clean default
+
+    - name: make test
+      run: |
+        # test scenario where "stdout" is not the console
+        make clean test MOREFLAGS='-Werror' | tee
+
+
+  # Windows, { VC++2019, VC++2017 } x { x64, Win32, ARM, ARM64 }
+  #
+  # - Default shell for Windows environment is PowerShell Core.
+  #   https://docs.github.com/en/actions/reference/workflow-syntax-for-github-actions#using-a-specific-shell
+  #
+  # - "windows-2019" uses Visual Studio 2019.
+  #   https://github.com/actions/virtual-environments/blob/main/images/win/Windows2019-Readme.md#visual-studio-enterprise-2019
+  #
+  # - "windows-2016" uses Visual Studio 2017.
+  #   https://github.com/actions/virtual-environments/blob/main/images/win/Windows2016-Readme.md#visual-studio-enterprise-2017
+
+  windows-visualc-general:
+    name: ${{ matrix.system.vc }}, ${{ matrix.arch }}
+    runs-on: ${{ matrix.system.os }}   # Runs-on foreach value of strategy.matrix.system.os
+    strategy:
+      fail-fast: false  # 'false' means Don't stop matrix workflows even if some matrix failed.
+      matrix:
+        system: [
+          { os: windows-2019, vc: "VC++ 2019" },
+          { os: windows-2016, vc: "VC++ 2017" },
+        ]
+        arch: [ x64, Win32, ARM, ARM64 ]
+    steps:
+    - uses: actions/checkout@v2
+
+    - name: Build ${{ matrix.system.os }}, ${{ matrix.arch }}
+      run: |
+        cd cmake_unofficial
+        mkdir build
+        cd build
+        cmake .. -DCMAKE_BUILD_TYPE=Release -A ${{ matrix.arch }} -DXXHASH_C_FLAGS="/WX"
+        cmake --build . --config Release
+
+    - name: Test
+      # Run benchmark for testing only if target arch is x64 or Win32.
+      if: ${{ matrix.arch == 'x64' || matrix.arch == 'Win32' }}
+      run: |
+        .\cmake_unofficial\build\Release\xxhsum.exe -bi1
+
+
+  # Windows, { mingw64, mingw32 }
+  #
+  # - Shell for msys2 is sh (msys2).  defaults.run.shell is for this setting.
+  #
+  # https://github.com/msys2/MINGW-packages/blob/master/.github/workflows/main.yml
+  # https://github.com/actions/starter-workflows/issues/95
+
+  windows-msys2-general:
+    name: Windows ${{ matrix.msystem }}
+    runs-on: windows-latest
+    strategy:
+      fail-fast: false  # 'false' means Don't stop matrix workflows even if some matrix failed.
+      matrix:
+        include: [
+          { msystem: mingw64, toolchain: mingw-w64-x86_64-toolchain },
+          { msystem: mingw32, toolchain: mingw-w64-i686-toolchain },
+        ]
+    defaults:
+      run:
+        shell: msys2 {0}
+    steps:
+      - uses: actions/checkout@v2  # https://github.com/actions/checkout
+      - uses: msys2/setup-msys2@v2 # https://github.com/msys2/setup-msys2
+        with:
+          msystem: MSYS
+          install: mingw-w64-i686-make ${{ matrix.toolchain }}
+          update: true
+
+      - name: Update
+        run: |
+          pacman --noconfirm -Suuy
+          pacman --noconfirm -Suu
+
+      - name: mingw64
+        if: ${{ matrix.msystem == 'mingw64' }}
+        run: |
+          PATH=/mingw64/bin:$PATH /mingw32/bin/mingw32-make clean test MOREFLAGS=-Werror
+          PATH=/mingw64/bin:$PATH /mingw32/bin/mingw32-make -C tests/bench
+          # Abort if result of "file ./xxhsum.exe" doesn't contain 'x86-64'.
+          # Expected output is "./xxhsum.exe: PE32+ executable (console) x86-64, for MS Windows"
+          file ./xxhsum.exe | grep -q 'x86-64' || $(exit 1)
+          ./xxhsum.exe --version
+
+      - name: mingw32
+        if: ${{ matrix.msystem == 'mingw32' }}
+        run: |
+          PATH=/mingw32/bin:$PATH /mingw32/bin/mingw32-make.exe clean test MOREFLAGS=-Werror
+          PATH=/mingw32/bin:$PATH /mingw32/bin/mingw32-make.exe -C tests/bench
+          # Abort if result of "file ./xxhsum.exe" doesn't contain '80386'.
+          # Expected output is "./xxhsum.exe: PE32 executable (console) Intel 80386, for MS Windows"
+          file ./xxhsum.exe | grep -q '80386' || $(exit 1)
+          ./xxhsum.exe --version

From 219b3e3ccfd593ed09d25844dd2cfb7536251e2a Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Wed, 26 May 2021 10:14:45 +0900
Subject: [PATCH 079/187] Fix make usan runtime error

---
 xxhash.h | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 12611cae..5ff7044d 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1920,7 +1920,7 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
 XXH_FORCE_INLINE xxh_u32
 XXH32_endian_align(const xxh_u8* input, size_t len, xxh_u32 seed, XXH_alignment align)
 {
-    const xxh_u8* bEnd = input + len;
+    const xxh_u8* bEnd = input ? input + len : NULL;
     xxh_u32 h32;
 
 #if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)
@@ -2348,7 +2348,7 @@ XXH64_finalize(xxh_u64 h64, const xxh_u8* ptr, size_t len, XXH_alignment align)
 XXH_FORCE_INLINE xxh_u64
 XXH64_endian_align(const xxh_u8* input, size_t len, xxh_u64 seed, XXH_alignment align)
 {
-    const xxh_u8* bEnd = input + len;
+    const xxh_u8* bEnd = input ? input + len : NULL;
     xxh_u64 h64;
 
 #if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)

From 03b493cc2971f3bdbf53df6014768d92e7b94f4a Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Wed, 26 May 2021 17:10:10 +0900
Subject: [PATCH 080/187] Update comments in ci.yml

---
 .github/workflows/ci.yml | 13 ++++++-------
 1 file changed, 6 insertions(+), 7 deletions(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 67bb2aba..5ad61476 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -17,9 +17,6 @@
 # - Setup procedure for msys2 environment is painfully slow.  It takes
 #   3..5 minutes.
 #
-# - Sometimes apt-get fails to retrieve package information.
-#   I have absolutely no idea.
-#
 # Notes:
 # - You can investigate various information at the right pane of GitHub
 #   Actions report page.
@@ -30,12 +27,9 @@
 #   | git repo, commit hash     | Run actions/checkout@v2               |
 #   | gcc, tools                | Environment info                      |
 #
-# - To fail earlier, oreder of tests in the same job are roughly sorted by
+# - To fail earlier, order of tests in the same job are roughly sorted by
 #   elapsed time.
 #
-# - "ubuntu-latest" (Ubuntu 20.04) has the following software
-#   https://github.com/actions/virtual-environments/blob/main/images/linux/Ubuntu2004-README.md
-#
 # Todos:
 # - [ ] Linux: Add native ARM runner.
 # - [ ] Linux: Add native ARM64 runner.
@@ -56,6 +50,9 @@ on: [push, pull_request]
 jobs:
 
   # Linux, x64
+  #
+  # - "ubuntu-latest" (Ubuntu 20.04) has the following software
+  #   https://github.com/actions/virtual-environments/blob/main/images/linux/Ubuntu2004-README.md
 
   ubuntu-general:
     name: Linux x64
@@ -166,6 +163,8 @@ jobs:
 
     - name: cppcheck
       run: |
+        # This test script ignores exit code of cppcheck.  See knowin issues
+        # at the top of this file.
         make clean cppcheck || echo There are some cppcheck reports
 
     - name: test-mem (valgrind)

From 3bd63d5c5999bdaf908947a5cf1e77215966b840 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 27 May 2021 01:07:47 +0900
Subject: [PATCH 081/187] Enable AVX512 tests partially

It always build AVX512.  But runs `make check` if test runner supports AVX512.
---
 .github/workflows/ci.yml | 24 +++++++++++++++---------
 1 file changed, 15 insertions(+), 9 deletions(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 5ad61476..343baa19 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -1,10 +1,16 @@
 # Known critical issues:
-# - AVX512 related tests are disabled.  Because default environment of
+# - AVX512 related tests are incomplete.  Because default environment of
 #   GitHub Actions doesn't guarantee to support AVX512.
 #   As of May 2021, they're using Xeon E5-2673 (which doesn't support
 #   AVX512) and Xeon Platinum 8171M (which supports AVX512).
 #   See also https://github.com/actions/runner/issues/1069
 #
+#   In this CI script, it always run `make default` which compiles xxHash
+#   with AVX512 intrinsics.  But if test runner doesn't support AVX512,
+#   it doesn't run `make check` which tests runtime error/consistency.
+#   It means that this test stochastically detects a failure in AVX512
+#   code path.
+#
 # Known issues:
 # - This test script ignores exit code of cppcheck which can see under
 #   Job:Linux x64 misc tests > cppcheck in the GitHub Actions report.
@@ -94,10 +100,9 @@ jobs:
         # check library can be compiled with XXH_NO_XXH3, resulting in no XXH3_* symbol
         make clean noxxh3test
 
-# As for AVX512, see "Known critical issues" at the top of this file
-#   - name: make avx512f
-#     run: |
-#       CFLAGS="-O1 -mavx512f -Werror" make clean default
+    - name: make avx512f
+      run: |
+        CFLAGS="-O1 -mavx512f -Werror" make clean default
 
     - name: test-all
       run: |
@@ -128,10 +133,11 @@ jobs:
       run: |
         CPPFLAGS="-mavx2 -DXXH_VECTOR=XXH_AVX2" make clean check
 
-# As for AVX512, see "Known critical issues" at the top of this file
-#   - name: AVX512 code path
-#     run: |
-#       CPPFLAGS="-mavx512f -DXXH_VECTOR=XXH_AVX512" make clean check
+    # As for AVX512, see "Known critical issues" at the top of this file
+    - name: AVX512 code path
+      run: |
+        # Run "make check" if /proc/cpuinfo has flags for avx512.
+        grep -q "^flags.*\bavx512\b" /proc/cpuinfo && CPPFLAGS="-mavx512f -DXXH_VECTOR=XXH_AVX512" make clean check || (echo This test runner does not support AVX512. && $(exit 0))
 
     - name: reroll code path (#240)
       run: |

From f257e949ea3d6d7475c114ace5382f9cc544be1c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 27 Jun 2021 22:53:57 -0700
Subject: [PATCH 082/187] updated version number to v0.8.1

to distinguish from latest release during support
as we are introducing more and more changes within `dev`.
---
 xxhash.h | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 5ff7044d..742db05f 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -266,7 +266,7 @@ extern "C" {
 ***************************************/
 #define XXH_VERSION_MAJOR    0
 #define XXH_VERSION_MINOR    8
-#define XXH_VERSION_RELEASE  0
+#define XXH_VERSION_RELEASE  1
 #define XXH_VERSION_NUMBER  (XXH_VERSION_MAJOR *100*100 + XXH_VERSION_MINOR *100 + XXH_VERSION_RELEASE)
 
 /*!
@@ -275,7 +275,7 @@ extern "C" {
  * This is only useful when xxHash is compiled as a shared library, as it is
  * independent of the version defined in the header.
  *
- * @return `XXH_VERSION_NUMBER` as of when the function was compiled.
+ * @return `XXH_VERSION_NUMBER` as of when the libray was compiled.
  */
 XXH_PUBLIC_API unsigned XXH_versionNumber (void);
 

From 29b25f324a7903e7090dfc80de5cf1bf22587401 Mon Sep 17 00:00:00 2001
From: Matthew Dolan <info@mattdolan.com>
Date: Mon, 28 Jun 2021 06:58:04 +0100
Subject: [PATCH 083/187] Correct assertion

---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 742db05f..ba031cfe 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3341,7 +3341,7 @@ XXH3_len_9to16_64b(const xxh_u8* input, size_t len, const xxh_u8* secret, XXH64_
 {
     XXH_ASSERT(input != NULL);
     XXH_ASSERT(secret != NULL);
-    XXH_ASSERT(8 <= len && len <= 16);
+    XXH_ASSERT(9 <= len && len <= 16);
     {   xxh_u64 const bitflip1 = (XXH_readLE64(secret+24) ^ XXH_readLE64(secret+32)) + seed;
         xxh_u64 const bitflip2 = (XXH_readLE64(secret+40) ^ XXH_readLE64(secret+48)) - seed;
         xxh_u64 const input_lo = XXH_readLE64(input)           ^ bitflip1;

From 00b7bd1243eeeed503cc2df74ea90d14bf246c6e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 12 Jul 2021 22:34:34 -0700
Subject: [PATCH 084/187] regroup all xxhsum files into cli/

---
 Makefile                       | 15 +++++++-------
 xxhsum.1 => cli/xxhsum.1       |  0
 xxhsum.1.md => cli/xxhsum.1.md |  0
 xxhsum.c => cli/xxhsum.c       | 16 +++++++-------
 tests/Makefile                 | 38 ++++++++++++++++++++++++++++++----
 5 files changed, 50 insertions(+), 19 deletions(-)
 rename xxhsum.1 => cli/xxhsum.1 (100%)
 rename xxhsum.1.md => cli/xxhsum.1.md (100%)
 rename xxhsum.c => cli/xxhsum.c (99%)

diff --git a/Makefile b/Makefile
index b836fdfa..156bb47c 100644
--- a/Makefile
+++ b/Makefile
@@ -72,7 +72,8 @@ endif
 LIBXXH = libxxhash.$(SHARED_EXT_VER)
 
 XXHSUM_SRC_DIR = cli
-XXHSUM_SPLIT_SRCS = $(XXHSUM_SRC_DIR)/xsum_os_specific.c \
+XXHSUM_SPLIT_SRCS = $(XXHSUM_SRC_DIR)/xxhsum.c \
+                    $(XXHSUM_SRC_DIR)/xsum_os_specific.c \
                     $(XXHSUM_SRC_DIR)/xsum_output.c \
                     $(XXHSUM_SRC_DIR)/xsum_sanity_check.c
 XXHSUM_SPLIT_OBJS = $(XXHSUM_SPLIT_SRCS:.c=.o)
@@ -95,20 +96,20 @@ ifeq ($(DISPATCH),1)
 xxhsum: CPPFLAGS += -DXXHSUM_DISPATCH=1
 xxhsum: xxh_x86dispatch.o
 endif
-xxhsum: xxhash.o xxhsum.o $(XXHSUM_SPLIT_OBJS)
+xxhsum: xxhash.o $(XXHSUM_SPLIT_OBJS)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhsum32: CFLAGS += -m32  ## generate CLI in 32-bits mode
-xxhsum32: xxhash.c xxhsum.c $(XXHSUM_SPLIT_SRCS) ## do not generate object (avoid mixing different ABI)
+xxhsum32: xxhash.c $(XXHSUM_SPLIT_SRCS) ## do not generate object (avoid mixing different ABI)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 ## dispatch only works for x86/x64 systems
 dispatch: CPPFLAGS += -DXXHSUM_DISPATCH=1
-dispatch: xxhash.o xxh_x86dispatch.o xxhsum.c $(XXHSUM_SPLIT_SRCS)
+dispatch: xxhash.o xxh_x86dispatch.o $(XXHSUM_SPLIT_SRCS)
 	$(CC) $(FLAGS) $^ $(LDFLAGS) -o $@$(EXT)
 
 xxhash.o: xxhash.c xxhash.h
-xxhsum.o: xxhsum.c $(XXHSUM_HEADERS) \
+xxhsum.o: $(XXHSUM_SRC_DIR)/xxhsum.c $(XXHSUM_HEADERS) \
     xxhash.h xxh_x86dispatch.h
 xxh_x86dispatch.o: xxh_x86dispatch.c xxh_x86dispatch.h xxhash.h
 
@@ -119,7 +120,7 @@ xxh32sum xxh64sum xxh128sum: xxhsum
 	ln -sf $<$(EXT) $@$(EXT)
 
 xxhsum_inlinedXXH: CPPFLAGS += -DXXH_INLINE_ALL
-xxhsum_inlinedXXH: xxhsum.c $(XXHSUM_SPLIT_SRCS)
+xxhsum_inlinedXXH: $(XXHSUM_SPLIT_SRCS)
 	$(CC) $(FLAGS) $< -o $@$(EXT)
 
 
@@ -347,7 +348,7 @@ cppcheck:  ## check C source files using $(CPPCHECK) static analyzer
 namespaceTest:  ## ensure XXH_NAMESPACE redefines all public symbols
 	$(CC) -c xxhash.c
 	$(CC) -DXXH_NAMESPACE=TEST_ -c xxhash.c -o xxhash2.o
-	$(CC) xxhash.o xxhash2.o xxhsum.c $(XXHSUM_SPLIT_SRCS)  -o xxhsum2  # will fail if one namespace missing (symbol collision)
+	$(CC) xxhash.o xxhash2.o $(XXHSUM_SPLIT_SRCS)  -o xxhsum2  # will fail if one namespace missing (symbol collision)
 	$(RM) *.o xxhsum2  # clean
 
 MD2ROFF ?= ronn
diff --git a/xxhsum.1 b/cli/xxhsum.1
similarity index 100%
rename from xxhsum.1
rename to cli/xxhsum.1
diff --git a/xxhsum.1.md b/cli/xxhsum.1.md
similarity index 100%
rename from xxhsum.1.md
rename to cli/xxhsum.1.md
diff --git a/xxhsum.c b/cli/xxhsum.c
similarity index 99%
rename from xxhsum.c
rename to cli/xxhsum.c
index 0eca8fa8..e4d61da6 100644
--- a/xxhsum.c
+++ b/cli/xxhsum.c
@@ -30,15 +30,15 @@
  */
 
 /* Transitional headers */
-#include "cli/xsum_config.h"
-#include "cli/xsum_arch.h"
-#include "cli/xsum_os_specific.h"
-#include "cli/xsum_output.h"
-#include "cli/xsum_sanity_check.h"
+#include "xsum_config.h"
+#include "xsum_arch.h"
+#include "xsum_os_specific.h"
+#include "xsum_output.h"
+#include "xsum_sanity_check.h"
 #ifdef XXH_INLINE_ALL
-#  include "cli/xsum_os_specific.c"
-#  include "cli/xsum_output.c"
-#  include "cli/xsum_sanity_check.c"
+#  include "xsum_os_specific.c"
+#  include "xsum_output.c"
+#  include "xsum_sanity_check.c"
 #endif
 
 /* ************************************
diff --git a/tests/Makefile b/tests/Makefile
index 092711ad..75a41ded 100644
--- a/tests/Makefile
+++ b/tests/Makefile
@@ -1,7 +1,35 @@
+# ################################################################
+# xxHash Makefile
+# Copyright (C) 2012-2020 Yann Collet
+#
+# GPL v2 License
+#
+# This program is free software; you can redistribute it and/or modify
+# it under the terms of the GNU General Public License as published by
+# the Free Software Foundation; either version 2 of the License, or
+# (at your option) any later version.
+#
+# This program is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+# GNU General Public License for more details.
+#
+# You should have received a copy of the GNU General Public License along
+# with this program; if not, write to the Free Software Foundation, Inc.,
+# 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+#
+# You can contact the author at:
+#   - xxHash homepage: https://www.xxhash.com
+#   - xxHash source repository: https://github.com/Cyan4973/xxHash
+# ################################################################
+
 CFLAGS += -Wall -Wextra -Wundef -g
 
+CP = cp
 NM = nm
 GREP = grep
+XXHSUM_DIR = ..
+XXHSUM = $(XXHSUM_DIR)/xxhsum
 
 # Define *.exe as extension for Windows systems
 ifneq (,$(filter Windows%,$(OS)))
@@ -52,8 +80,10 @@ test_ppc_redefine: ppc_define.c
 	@$(MAKE) clean
 	$(CC) $(CPPFLAGS) $(CFLAGS) -c $^
 
-xxhsum$(EXT): ../xxhash.c ../xxhash.h ../xxhsum.c
-	$(CC) $(CPPFLAGS) $(CFLAGS) $(LDFLAGS) -DXXH_INLINE_ALL ../xxhsum.c -o $@
+.PHONY: $(XXHSUM)
+$(XXHSUM):
+	$(MAKE) -C $(XXHSUM_DIR) xxhsum
+	$(CP) $(XXHSUM) .
 
 # Make sure that Unicode filenames work.
 # https://github.com/Cyan4973/xxHash/issues/293
@@ -63,7 +93,7 @@ test_unicode:
 	@echo "Skipping Unicode test, your terminal doesn't appear to support UTF-8."
 	@echo "Try with ENABLE_UNICODE=1"
 else
-test_unicode: xxhsum$(EXT) generate_unicode_test.c
+test_unicode: $(XXHSUM) generate_unicode_test.c
 	# Generate a Unicode filename test dynamically
 	# to keep UTF-8 out of the source tree.
 	$(CC) $(CFLAGS) $(LDFLAGS) generate_unicode_test.c -o generate_unicode_test$(EXT)
@@ -80,4 +110,4 @@ multiInclude_withxxhash: multiInclude.o xxhash.o
 clean:
 	@$(RM) *.o
 	@$(RM) multiInclude multiInclude_withxxhash
-	@$(RM) *.unicode generate_unicode_test$(EXT) unicode_test.* xxhsum$(EXT)
+	@$(RM) *.unicode generate_unicode_test$(EXT) unicode_test.* xxhsum*

From 5f0ce255301db10a6542053b11e020fc76e1fb43 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 12 Jul 2021 22:37:01 -0700
Subject: [PATCH 085/187] fixed cmake script

new location of `xxhsum.c` into `cli/`
---
 cmake_unofficial/CMakeLists.txt | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 80ecbe5e..01d62a8c 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -84,7 +84,7 @@ set_target_properties(xxhash PROPERTIES
 if(XXHASH_BUILD_XXHSUM)
   set(XXHSUM_DIR "${XXHASH_DIR}/cli")
   # xxhsum
-  add_executable(xxhsum "${XXHASH_DIR}/xxhsum.c"
+  add_executable(xxhsum "${XXHSUM_DIR}/xxhsum.c"
                         "${XXHSUM_DIR}/xsum_os_specific.c"
                         "${XXHSUM_DIR}/xsum_output.c"
                         "${XXHSUM_DIR}/xsum_sanity_check.c"

From 6e7a7b8679fe74df79e6e82fb9166dbef5934d72 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 12 Jul 2021 22:39:40 -0700
Subject: [PATCH 086/187] fix relative include directory

`xxhash.h` expected into `../` directory.
This method does not depend on setting `-I` include directory.
However, it relies on source code preserving its original arborescence.
---
 cli/xxhsum.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index e4d61da6..cedb9355 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -55,10 +55,10 @@
 #include <errno.h>      /* errno */
 
 #define XXH_STATIC_LINKING_ONLY   /* *_state_t */
-#include "xxhash.h"
+#include "../xxhash.h"
 
 #ifdef XXHSUM_DISPATCH
-#  include "xxh_x86dispatch.h"
+#  include "../xxh_x86dispatch.h"
 #endif
 
 static unsigned XSUM_isLittleEndian(void)

From fcb2454c4303078ac1ea06ce60fed30cadbea29e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 12 Jul 2021 22:42:55 -0700
Subject: [PATCH 087/187] fixed `make check`

no more `xxhsum.*` file present at root
---
 Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/Makefile b/Makefile
index 156bb47c..6d07ba77 100644
--- a/Makefile
+++ b/Makefile
@@ -188,7 +188,7 @@ check: xxhsum   ## basic tests for xxhsum CLI, set RUN_ENV for emulated environm
 	# stdin
 	$(RUN_ENV) ./xxhsum$(EXT) < xxhash.c
 	# multiple files
-	$(RUN_ENV) ./xxhsum$(EXT) xxhash.* xxhsum.*
+	$(RUN_ENV) ./xxhsum$(EXT) xxhash.*
 	# internal bench
 	$(RUN_ENV) ./xxhsum$(EXT) -bi0
 	# long bench command

From b79a8d55f0689530cf9bd7e99f27c7cafd5e64ff Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 12 Jul 2021 23:26:34 -0700
Subject: [PATCH 088/187] speed up test32

faster tests,
pre-cleaning not required.
---
 Makefile | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/Makefile b/Makefile
index 6d07ba77..254964bd 100644
--- a/Makefile
+++ b/Makefile
@@ -215,9 +215,9 @@ test-mem: RUN_ENV = $(VALGRIND)
 test-mem: xxhsum check
 
 .PHONY: test32
-test32: clean xxhsum32
+test32: xxhsum32
 	@echo ---- test 32-bit ----
-	./xxhsum32 -bi1 xxhash.c
+	./xxhsum32 -bi0 xxhash.c
 
 .PHONY: test-xxhsum-c
 test-xxhsum-c: xxhsum

From a280d5595d17c346adcc381440dfee3b0d0bad68 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 13 Jul 2021 05:14:10 -0700
Subject: [PATCH 089/187] fixed test order

test-unicode must be played after creation of xxhsum,
and before object files get mixed.

also : fix trailingwhitespace
---
 Makefile | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/Makefile b/Makefile
index 254964bd..dd9cf643 100644
--- a/Makefile
+++ b/Makefile
@@ -377,7 +377,7 @@ test-inline:
 
 .PHONY: test-all
 test-all: CFLAGS += -Werror
-test-all: test test32 clangtest cxxtest usan test-inline listL120 trailingWhitespace test-unicode
+test-all: test test32 test-unicode clangtest cxxtest usan test-inline listL120 trailingWhitespace
 
 .PHONY: test-tools
 test-tools:
@@ -390,7 +390,7 @@ listL120:  # extract lines >= 120 characters in *.{c,h}, by Takayuki Matsuoka (n
 
 .PHONY: trailingWhitespace
 trailingWhitespace:
-	! $(GREP) -E "`printf '[ \\t]$$'`" xxhsum.1 *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
+	! $(GREP) -E "`printf '[ \\t]$$'`" cli/* *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
 
 
 # =========================================================

From 4ec7c569bab5adb77e21b93143c8285b6d72d854 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 13 Jul 2021 05:17:37 -0700
Subject: [PATCH 090/187] fixed trailingwhitespace

works well on its own,
but wildcard also catches binary object files after their creation,
failing the test.
Fixed by constraining file selection to source only
---
 Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/Makefile b/Makefile
index dd9cf643..b8afac02 100644
--- a/Makefile
+++ b/Makefile
@@ -390,7 +390,7 @@ listL120:  # extract lines >= 120 characters in *.{c,h}, by Takayuki Matsuoka (n
 
 .PHONY: trailingWhitespace
 trailingWhitespace:
-	! $(GREP) -E "`printf '[ \\t]$$'`" cli/* *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
+	! $(GREP) -E "`printf '[ \\t]$$'`" cli/*.{c,h} *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
 
 
 # =========================================================

From 4789a207b4c0284e6aff2cc50ca4f3fde373e611 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 13 Jul 2021 05:46:19 -0700
Subject: [PATCH 091/187] fix trailingwhitespace

---
 Makefile | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/Makefile b/Makefile
index b8afac02..82329431 100644
--- a/Makefile
+++ b/Makefile
@@ -351,13 +351,14 @@ namespaceTest:  ## ensure XXH_NAMESPACE redefines all public symbols
 	$(CC) xxhash.o xxhash2.o $(XXHSUM_SPLIT_SRCS)  -o xxhsum2  # will fail if one namespace missing (symbol collision)
 	$(RM) *.o xxhsum2  # clean
 
+MAN = $(XXHSUM_SRC_DIR)/xxhsum.1
 MD2ROFF ?= ronn
 MD2ROFF_FLAGS ?= --roff --warnings --manual="User Commands" --organization="xxhsum $(XXHSUM_VERSION)"
-xxhsum.1: xxhsum.1.md xxhash.h
+$(MAN): $(XXHSUM_SRC_DIR)/xxhsum.1.md xxhash.h
 	cat $< | $(MD2ROFF) $(MD2ROFF_FLAGS) | $(SED) -n '/^\.\\\".*/!p' > $@
 
 .PHONY: man
-man: xxhsum.1  ## generate man page from markdown source
+man: $(MAN)  ## generate man page from markdown source
 
 .PHONY: clean-man
 clean-man:
@@ -390,7 +391,7 @@ listL120:  # extract lines >= 120 characters in *.{c,h}, by Takayuki Matsuoka (n
 
 .PHONY: trailingWhitespace
 trailingWhitespace:
-	! $(GREP) -E "`printf '[ \\t]$$'`" cli/*.{c,h} *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
+	! $(GREP) -E "`printf '[ \\t]$$'`" cli/*.{c,h,1} *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
 
 
 # =========================================================

From eff96851ac4596a9e324813423b6df2c3a8b69b3 Mon Sep 17 00:00:00 2001
From: "W. Felix Handte" <w@felixhandte.com>
Date: Thu, 22 Jul 2021 11:06:51 -0400
Subject: [PATCH 092/187] Use `alignas()` in C++11

The previous macro test only detected C11 and failed in modern C++, which
actually goes one step further and makes `alignas` a keyword. It's not clear
that this actually improves the situation with respect to #543, but it should
be slightly more correct in some sense.
---
 xxhash.h | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index ba031cfe..c0d6f276 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -915,9 +915,12 @@ struct XXH64_state_s {
    XXH64_hash_t reserved64;   /*!< Reserved field. Do not read or write to it, it may be removed. */
 };   /* typedef'd to XXH64_state_t */
 
-#if defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 201112L)   /* C11+ */
+#if defined(__STDC_VERSION__) && (__STDC_VERSION__ >= 201112L) /* >= C11 */
 #  include <stdalign.h>
 #  define XXH_ALIGN(n)      alignas(n)
+#elif defined(__cplusplus) && (__cplusplus >= 201103L) /* >= C++11 */
+/* In C++ alignas() is a keyword */
+#  define XXH_ALIGN(n)      alignas(n)
 #elif defined(__GNUC__)
 #  define XXH_ALIGN(n)      __attribute__ ((aligned(n)))
 #elif defined(_MSC_VER)

From 445c5092da7f64f8a91d2f2f00f52fc12910373b Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Sun, 1 Aug 2021 14:16:19 +0900
Subject: [PATCH 093/187] Add GitHub Actions badge

- Remove travis-ci.org badge and link to test log.
- Add gh-actions workflow badge and link to log filter for dev branch.
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index e5732f63..18704f53 100644
--- a/README.md
+++ b/README.md
@@ -9,7 +9,7 @@ Code is highly portable, and hashes are identical across all platforms (little /
 
 |Branch      |Status   |
 |------------|---------|
-|dev         | [![Build Status](https://travis-ci.org/Cyan4973/xxHash.svg?branch=dev)](https://travis-ci.org/Cyan4973/xxHash?branch=dev) |
+|dev         | [![Build Status](https://github.com/Cyan4973/xxHash/actions/workflows/ci.yml/badge.svg?branch=dev)](https://github.com/Cyan4973/xxHash/actions?query=branch%3Adev+) |
 
 
 Benchmarks

From 4caf6243174e8c533455d50101698cd5af4f941e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Fri, 6 Aug 2021 15:17:26 -0700
Subject: [PATCH 094/187] minor : make clean should also clean tests/

---
 Makefile | 1 +
 1 file changed, 1 insertion(+)

diff --git a/Makefile b/Makefile
index 82329431..b8552a9f 100644
--- a/Makefile
+++ b/Makefile
@@ -174,6 +174,7 @@ clean:  ## remove all build artifacts
 	$(Q)$(RM) xxhsum$(EXT) xxhsum32$(EXT) xxhsum_inlinedXXH$(EXT) dispatch$(EXT)
 	$(Q)$(RM) xxh32sum$(EXT) xxh64sum$(EXT) xxh128sum$(EXT)
 	$(Q)$(RM) $(XXHSUM_SRC_DIR)/*.o $(XXHSUM_SRC_DIR)/*.obj
+	$(MAKE) -C tests clean
 	@echo cleaning completed
 
 

From e345ccaf4daa8c35698ab02c0de346fcc0ca9ef6 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Fri, 6 Aug 2021 15:29:32 -0700
Subject: [PATCH 095/187] fixed man page installation

---
 Makefile | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/Makefile b/Makefile
index b8552a9f..556a32a0 100644
--- a/Makefile
+++ b/Makefile
@@ -498,10 +498,10 @@ endif
 	$(Q)ln -sf xxhsum $(DESTDIR)$(BINDIR)/xxh64sum
 	$(Q)ln -sf xxhsum $(DESTDIR)$(BINDIR)/xxh128sum
 	@echo Installing man pages
-	$(Q)$(INSTALL_DATA) xxhsum.1 $(DESTDIR)$(MANDIR)/xxhsum.1
-	$(Q)ln -sf xxhsum.1 $(DESTDIR)$(MANDIR)/xxh32sum.1
-	$(Q)ln -sf xxhsum.1 $(DESTDIR)$(MANDIR)/xxh64sum.1
-	$(Q)ln -sf xxhsum.1 $(DESTDIR)$(MANDIR)/xxh128sum.1
+	$(Q)$(INSTALL_DATA) $(MAN) $(DESTDIR)$(MANDIR)/xxhsum.1
+	$(Q)ln -sf $(MAN) $(DESTDIR)$(MANDIR)/xxh32sum.1
+	$(Q)ln -sf $(MAN) $(DESTDIR)$(MANDIR)/xxh64sum.1
+	$(Q)ln -sf $(MAN) $(DESTDIR)$(MANDIR)/xxh128sum.1
 	@echo xxhash installation completed
 
 .PHONY: uninstall

From 3e89a6e97e593dd369210a9e3607726ae8c25c6f Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 9 Aug 2021 13:45:20 -0700
Subject: [PATCH 096/187] minor code comment clarification

---
 xxhash.h | 18 +++++++++++-------
 1 file changed, 11 insertions(+), 7 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index c0d6f276..a189717b 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -297,11 +297,13 @@ typedef enum { XXH_OK=0, XXH_ERROR } XXH_errorcode;
  * Not necessarily defined to `uint32_t` but functionally equivalent.
  */
 typedef uint32_t XXH32_hash_t;
+
 #elif !defined (__VMS) \
   && (defined (__cplusplus) \
   || (defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 199901L) /* C99 */) )
 #   include <stdint.h>
     typedef uint32_t XXH32_hash_t;
+
 #else
 #   include <limits.h>
 #   if UINT_MAX == 0xFFFFFFFFUL
@@ -960,16 +962,18 @@ struct XXH64_state_s {
  * @brief Structure for XXH3 streaming API.
  *
  * @note This is only defined when @ref XXH_STATIC_LINKING_ONLY,
- * @ref XXH_INLINE_ALL, or @ref XXH_IMPLEMENTATION is defined. Otherwise it is
- * an opaque type. This allows fields to safely be changed.
+ * @ref XXH_INLINE_ALL, or @ref XXH_IMPLEMENTATION is defined.
+ * Otherwise it is an opaque type.
+ * Never use this definition in combination with dynamic library.
+ * This allows fields to safely be changed in the future.
  *
- * @note **This structure has a strict alignment requirement of 64 bytes.** Do
- * not allocate this with `malloc()` or `new`, it will not be sufficiently
- * aligned. Use @ref XXH3_createState() and @ref XXH3_freeState(), or stack
- * allocation.
+ * @note ** This structure has a strict alignment requirement of 64 bytes!! **
+ * Do not allocate this with `malloc()` or `new`,
+ * it will not be sufficiently aligned.
+ * Use @ref XXH3_createState() and @ref XXH3_freeState(), or stack allocation.
  *
  * Typedef'd to @ref XXH3_state_t.
- * Do not access the members of this struct directly.
+ * Do never access the members of this struct directly.
  *
  * @see XXH3_INITSTATE() for stack initialization.
  * @see XXH3_createState(), XXH3_freeState().

From 45fbee7ad66ea31c41d14ac5c8da829028bf7be3 Mon Sep 17 00:00:00 2001
From: Peter Dillinger <peterd@fb.com>
Date: Tue, 10 Aug 2021 22:14:15 -0700
Subject: [PATCH 097/187] Fix technical UB negating signed

Summary: UBSAN run reported on using seed 1<<63 with XXH3 because of
`-(xxh_i64)seed64` overflow. Seen in CI for
https://github.com/facebook/rocksdb/pull/8634

To fix, negate as unsigned (well defined under/overflow) and then cast
to signed.

Test Plan: same patch fixes the report in RocksDB
---
 xxhash.h | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index ba031cfe..da74ccf2 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3649,7 +3649,7 @@ XXH3_initCustomSecret_avx512(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
     XXH_ASSERT(((size_t)customSecret & 63) == 0);
     (void)(&XXH_writeLE64);
     {   int const nbRounds = XXH_SECRET_DEFAULT_SIZE / sizeof(__m512i);
-        __m512i const seed = _mm512_mask_set1_epi64(_mm512_set1_epi64((xxh_i64)seed64), 0xAA, -(xxh_i64)seed64);
+        __m512i const seed = _mm512_mask_set1_epi64(_mm512_set1_epi64((xxh_i64)seed64), 0xAA, (xxh_i64)(0U - seed64));
 
         XXH_ALIGN(64) const __m512i* const src  = (const __m512i*) XXH3_kSecret;
         XXH_ALIGN(64)       __m512i* const dest = (      __m512i*) customSecret;
@@ -3745,7 +3745,7 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void XXH3_initCustomSecret_avx2(void* XXH_RESTR
     XXH_STATIC_ASSERT(XXH_SEC_ALIGN <= 64);
     (void)(&XXH_writeLE64);
     XXH_PREFETCH(customSecret);
-    {   __m256i const seed = _mm256_set_epi64x(-(xxh_i64)seed64, (xxh_i64)seed64, -(xxh_i64)seed64, (xxh_i64)seed64);
+    {   __m256i const seed = _mm256_set_epi64x((xxh_i64)(0U - seed64), (xxh_i64)seed64, (xxh_i64)(0U - seed64), (xxh_i64)seed64);
 
         XXH_ALIGN(64) const __m256i* const src  = (const __m256i*) XXH3_kSecret;
         XXH_ALIGN(64)       __m256i*       dest = (      __m256i*) customSecret;
@@ -3850,10 +3850,10 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void XXH3_initCustomSecret_sse2(void* XXH_RESTR
 
 #       if defined(_MSC_VER) && defined(_M_IX86) && _MSC_VER < 1900
         // MSVC 32bit mode does not support _mm_set_epi64x before 2015
-        XXH_ALIGN(16) const xxh_i64 seed64x2[2] = { (xxh_i64)seed64, -(xxh_i64)seed64 };
+        XXH_ALIGN(16) const xxh_i64 seed64x2[2] = { (xxh_i64)seed64, (xxh_i64)(0U - seed64) };
         __m128i const seed = _mm_load_si128((__m128i const*)seed64x2);
 #       else
-        __m128i const seed = _mm_set_epi64x(-(xxh_i64)seed64, (xxh_i64)seed64);
+        __m128i const seed = _mm_set_epi64x((xxh_i64)(0U - seed64), (xxh_i64)seed64);
 #       endif
         int i;
 

From 3bfced775a54b1cf0dc95668369ee565262f519f Mon Sep 17 00:00:00 2001
From: Lior Lahav <LahavLior@gmail.com>
Date: Sun, 15 Aug 2021 15:13:30 +0300
Subject: [PATCH 098/187] added support for C and CPP [[fallthrough]] attribute

---
 xxhash.h | 52 +++++++++++++++++++++++++++++++++++++++-------------
 1 file changed, 39 insertions(+), 13 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index c3cb3aaa..93ac3ef5 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -545,6 +545,32 @@ XXH_PUBLIC_API void XXH32_canonicalFromHash(XXH32_canonical_t* dst, XXH32_hash_t
 XXH_PUBLIC_API XXH32_hash_t XXH32_hashFromCanonical(const XXH32_canonical_t* src);
 
 
+/*
+Define XXH_FALLTHROUGH macro for annotating switch case with the 'fallthrough' attribute
+introduced in CPP17 and C23.
+CPP17 : https://en.cppreference.com/w/cpp/language/attributes/fallthrough
+C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
+*/
+
+#if defined (__has_c_attribute)
+#   if __has_c_attribute(fallthrough) 
+#       define XXH_FALLTHROUGH [[fallthrough]]
+#   endif
+
+#elif defined(__cplusplus) && defined(__has_cpp_attribute) 
+#   if __has_cpp_attribute(fallthrough)
+#       define XXH_FALLTHROUGH [[fallthrough]]
+#   endif
+#endif
+
+#ifndef XXH_FALLTHROUGH
+#   if defined(__GNUC__) && __GNUC__ >= 7
+#       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
+#   else
+#       define XXH_FALLTHROUGH 
+#	endif
+#endif
+
 /*!
  * @}
  * @ingroup public
@@ -1866,41 +1892,41 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
     } else {
          switch(len&15) /* or switch(bEnd - p) */ {
            case 12:      XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 8:       XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 4:       XXH_PROCESS4;
                          return XXH32_avalanche(h32);
-
+                         
            case 13:      XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 9:       XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 5:       XXH_PROCESS4;
                          XXH_PROCESS1;
                          return XXH32_avalanche(h32);
 
            case 14:      XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 10:      XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 6:       XXH_PROCESS4;
                          XXH_PROCESS1;
                          XXH_PROCESS1;
                          return XXH32_avalanche(h32);
 
            case 15:      XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 11:      XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 7:       XXH_PROCESS4;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 3:       XXH_PROCESS1;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 2:       XXH_PROCESS1;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 1:       XXH_PROCESS1;
-                         /* fallthrough */
+                         XXH_FALLTHROUGH;
            case 0:       return XXH32_avalanche(h32);
         }
         XXH_ASSERT(0);

From f480c7693a999933e5bfa28d2691bbacc1b35c1d Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 16 Aug 2021 15:53:19 -0700
Subject: [PATCH 099/187] introduce TEST_FILES variable

list file names to run xxhsum on.
Fixed a minor issue when the previous list, `xxh*`,
would include some directory that would happen to be present in the same directory.
---
 Makefile | 33 +++++++++++++++++----------------
 1 file changed, 17 insertions(+), 16 deletions(-)

diff --git a/Makefile b/Makefile
index 556a32a0..d9218873 100644
--- a/Makefile
+++ b/Makefile
@@ -220,33 +220,34 @@ test32: xxhsum32
 	@echo ---- test 32-bit ----
 	./xxhsum32 -bi0 xxhash.c
 
+TEST_FILES = xxhsum xxhash.c xxhash.h
 .PHONY: test-xxhsum-c
 test-xxhsum-c: xxhsum
 	# xxhsum to/from pipe
-	./xxhsum xxh* | ./xxhsum -c -
-	./xxhsum -H0 xxh* | ./xxhsum -c -
+	./xxhsum $(TEST_FILES) | ./xxhsum -c -
+	./xxhsum -H0 $(TEST_FILES) | ./xxhsum -c -
 	# xxhsum -c is unable to verify checksum of file from STDIN (#470)
 	./xxhsum < README.md > .test.README.md.xxh
 	./xxhsum -c .test.README.md.xxh < README.md
 	# xxhsum -q does not display "Loading" message into stderr (#251)
-	! ./xxhsum -q xxh* 2>&1 | grep Loading
+	! ./xxhsum -q $(TEST_FILES) 2>&1 | grep Loading
 	# xxhsum does not display "Loading" message into stderr either
-	! ./xxhsum xxh* 2>&1 | grep Loading
+	! ./xxhsum $(TEST_FILES) 2>&1 | grep Loading
 	# Check that xxhsum do display filename that it failed to open.
 	LC_ALL=C ./xxhsum nonexistent 2>&1 | grep "Error: Could not open 'nonexistent'"
 	# xxhsum to/from file, shell redirection
-	./xxhsum xxh* > .test.xxh64
-	./xxhsum --tag xxh* > .test.xxh64_tag
-	./xxhsum --little-endian xxh* > .test.le_xxh64
-	./xxhsum --tag --little-endian xxh* > .test.le_xxh64_tag
-	./xxhsum -H0 xxh* > .test.xxh32
-	./xxhsum -H0 --tag xxh* > .test.xxh32_tag
-	./xxhsum -H0 --little-endian xxh* > .test.le_xxh32
-	./xxhsum -H0 --tag --little-endian xxh* > .test.le_xxh32_tag
-	./xxhsum -H2 xxh* > .test.xxh128
-	./xxhsum -H2 --tag xxh* > .test.xxh128_tag
-	./xxhsum -H2 --little-endian xxh* > .test.le_xxh128
-	./xxhsum -H2 --tag --little-endian xxh* > .test.le_xxh128_tag
+	./xxhsum $(TEST_FILES) > .test.xxh64
+	./xxhsum --tag $(TEST_FILES) > .test.xxh64_tag
+	./xxhsum --little-endian $(TEST_FILES) > .test.le_xxh64
+	./xxhsum --tag --little-endian $(TEST_FILES) > .test.le_xxh64_tag
+	./xxhsum -H0 $(TEST_FILES) > .test.xxh32
+	./xxhsum -H0 --tag $(TEST_FILES) > .test.xxh32_tag
+	./xxhsum -H0 --little-endian $(TEST_FILES) > .test.le_xxh32
+	./xxhsum -H0 --tag --little-endian $(TEST_FILES) > .test.le_xxh32_tag
+	./xxhsum -H2 $(TEST_FILES) > .test.xxh128
+	./xxhsum -H2 --tag $(TEST_FILES) > .test.xxh128_tag
+	./xxhsum -H2 --little-endian $(TEST_FILES) > .test.le_xxh128
+	./xxhsum -H2 --tag --little-endian $(TEST_FILES) > .test.le_xxh128_tag
 	./xxhsum -c .test.xxh*
 	./xxhsum -c --little-endian .test.le_xxh*
 	./xxhsum -c .test.*_tag

From 56cda18e02980ed72bdd49bb52a69639ceabb0a6 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 16 Aug 2021 16:20:12 -0700
Subject: [PATCH 100/187] fixed mingw tests

reported & suggested by @t-mat
---
 Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/Makefile b/Makefile
index d9218873..4562ec30 100644
--- a/Makefile
+++ b/Makefile
@@ -220,7 +220,7 @@ test32: xxhsum32
 	@echo ---- test 32-bit ----
 	./xxhsum32 -bi0 xxhash.c
 
-TEST_FILES = xxhsum xxhash.c xxhash.h
+TEST_FILES = xxhsum$(EXT) xxhash.c xxhash.h
 .PHONY: test-xxhsum-c
 test-xxhsum-c: xxhsum
 	# xxhsum to/from pipe

From 29a7dea6c6b9d4af7eb9d0c13f0741b070023e13 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 16 Aug 2021 16:56:22 -0700
Subject: [PATCH 101/187] ensure that make clean also clean sub-projects

such as tests/bench and tests/collisions
---
 Makefile | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/Makefile b/Makefile
index 4562ec30..a5c5d392 100644
--- a/Makefile
+++ b/Makefile
@@ -175,6 +175,8 @@ clean:  ## remove all build artifacts
 	$(Q)$(RM) xxh32sum$(EXT) xxh64sum$(EXT) xxh128sum$(EXT)
 	$(Q)$(RM) $(XXHSUM_SRC_DIR)/*.o $(XXHSUM_SRC_DIR)/*.obj
 	$(MAKE) -C tests clean
+	$(MAKE) -C tests/bench clean
+	$(MAKE) -C tests/collisions clean
 	@echo cleaning completed
 
 

From c47ccf032a35104599e4fd693d474a34b9d350a6 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 16 Aug 2021 16:59:32 -0700
Subject: [PATCH 102/187] fix #557

---
 xxhash.h | 1 +
 1 file changed, 1 insertion(+)

diff --git a/xxhash.h b/xxhash.h
index c3cb3aaa..a4d7432e 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -933,6 +933,7 @@ struct XXH64_state_s {
 
 /* Old GCC versions only accept the attribute after the type in structures. */
 #if !(defined(__STDC_VERSION__) && (__STDC_VERSION__ >= 201112L))   /* C11+ */ \
+    && ! (defined(__cplusplus) && (__cplusplus >= 201103L)) /* >= C++11 */ \
     && defined(__GNUC__)
 #   define XXH_ALIGN_MEMBER(align, type) type XXH_ALIGN(align)
 #else

From 0b3df0328df06216a1bc1358c28a846c3870b58e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 16 Aug 2021 21:01:37 -0700
Subject: [PATCH 103/187] fix #560

fix noxxh3test with gcc 4.8
---
 Makefile | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/Makefile b/Makefile
index 4562ec30..ff53ea62 100644
--- a/Makefile
+++ b/Makefile
@@ -318,7 +318,7 @@ c90test: xxhash.c
 endif
 
 noxxh3test: CPPFLAGS += -DXXH_NO_XXH3
-noxxh3test: CFLAGS += -Werror -pedantic
+noxxh3test: CFLAGS += -Werror -pedantic -Wno-long-long  # XXH64 requires long long support
 noxxh3test: xxhash.c
 	@echo ---- test compilation without XXH3 ----
 	$(RM) xxhash.o

From c11b0b96f8d473fcb4ad97a080f707f029db701f Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 17 Aug 2021 08:55:46 -0700
Subject: [PATCH 104/187] fix #563

ensure that building test tools preserves compilation flags
including -std=c99 and -std=c++11
when invoked with CFLAGS pre-set
so that it properly compiles with gcc <= 4.8 .

reported by @t-mat.
---
 tests/bench/Makefile      | 3 ++-
 tests/collisions/Makefile | 7 ++++---
 2 files changed, 6 insertions(+), 4 deletions(-)

diff --git a/tests/bench/Makefile b/tests/bench/Makefile
index cdccfffd..de975cbe 100644
--- a/tests/bench/Makefile
+++ b/tests/bench/Makefile
@@ -29,7 +29,8 @@
 
 CPPFLAGS += -I../..   # directory of xxHash source files
 CFLAGS   ?= -O3
-CFLAGS   += -std=c99 -Wall -Wextra -Wstrict-aliasing=1
+CFLAGS   += -Wall -Wextra -Wstrict-aliasing=1 \
+            -std=c99
 CFLAGS   += $(MOREFLAGS)   # custom way to add flags
 CXXFLAGS ?= -O3
 LDFLAGS  += $(MOREFLAGS)
diff --git a/tests/collisions/Makefile b/tests/collisions/Makefile
index bad9835b..a070c25c 100644
--- a/tests/collisions/Makefile
+++ b/tests/collisions/Makefile
@@ -26,9 +26,10 @@
 SRC_DIRS = ./ ../../ allcodecs/
 VPATH = $(SRC_DIRS)
 CPPFLAGS += $(addprefix -I ,$(SRC_DIRS))
-CFLAGS   ?= -std=c99 \
-            -Wall -Wextra -Wconversion
-CXXFLAGS ?= -Wall -Wextra -Wconversion -std=c++11
+CFLAGS   += -Wall -Wextra -Wconversion \
+            -std=c99
+CXXFLAGS += -Wall -Wextra -Wconversion \
+            -std=c++11
 LDFLAGS  += -pthread
 TESTHASHES = 110000000
 

From f657e87c9154fdcc7ef31c91309c948bc3819680 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 17 Aug 2021 09:08:03 -0700
Subject: [PATCH 105/187] fix minor conversion warnings

---
 tests/collisions/main.c | 16 +++++++++-------
 1 file changed, 9 insertions(+), 7 deletions(-)

diff --git a/tests/collisions/main.c b/tests/collisions/main.c
index a857341b..86497cc6 100644
--- a/tests/collisions/main.c
+++ b/tests/collisions/main.c
@@ -106,7 +106,7 @@ static uint64_t avalanche64(uint64_t h64)
     return h64;
 }
 
-static unsigned char randomByte(size_t n)
+static unsigned char randomByte(uint64_t n)
 {
     uint64_t n64 = avalanche64(n+1);
     n64 *= prime64_1;
@@ -240,7 +240,7 @@ typedef struct {
     /* slab5 */
     size_t nbSlabs;
     size_t current;
-    size_t prngSeed;
+    uint64_t prngSeed;
 } sampleFactory;
 
 static void init_sampleFactory(sampleFactory* sf, uint64_t htotal)
@@ -282,7 +282,7 @@ static void free_sampleFactory(sampleFactory* sf)
 
 static void flipbit(void* buffer, uint64_t bitID)
 {
-    size_t const pos = bitID >> 3;
+    size_t const pos = (size_t)(bitID >> 3);
     unsigned char const mask = (unsigned char)(1 << (bitID & 7));
     unsigned char* const p = (unsigned char*)buffer;
     p[pos] ^= mask;
@@ -416,7 +416,7 @@ static inline int Filter_insert(Filter* bf, int bflog, uint64_t hash)
      hash >>= 8;
 
      size_t const fclmask = ((size_t)1 << (bflog-6)) - 1;
-     size_t const cacheLineNb = hash & fclmask;
+     size_t const cacheLineNb = (size_t)hash & fclmask;
 
      size_t const pos1 = (cacheLineNb << 6) + (slot1 >> 2);
      unsigned const shift1 = (slot1 & 3) * 2;
@@ -456,7 +456,7 @@ static inline int Filter_check(const Filter* bf, int bflog, uint64_t hash)
      hash >>= 8;
 
      size_t const fclmask = ((size_t)1 << (bflog-6)) - 1;
-     size_t const cacheLineNb = hash & fclmask;
+     size_t const cacheLineNb = (size_t)hash & fclmask;
 
      size_t const pos1 = (cacheLineNb << 6) + (slot1 >> 2);
      unsigned const shift1 = (slot1 & 3) * 2;
@@ -709,7 +709,7 @@ static size_t search_collisions(
 
     time_t const storeTBegin = time(NULL);
     size_t const hashByteSize = (htype == ht128) ? 16 : 8;
-    size_t const tableSize = (nbPresents+1) * hashByteSize;
+    size_t const tableSize = (size_t)((nbPresents+1) * hashByteSize);
     assert(tableSize > nbPresents);  /* check tableSize calculation overflow */
     DISPLAY(" Storing hash candidates (%i MB) \n", (int)(tableSize >> 20));
 
@@ -835,6 +835,7 @@ static size_t search_collisions(
 
 
 #if defined(__MACH__) || defined(__linux__)
+
 #include <sys/resource.h>
 static size_t getProcessMemUsage(int children)
 {
@@ -843,8 +844,9 @@ static size_t getProcessMemUsage(int children)
       return (size_t)stats.ru_maxrss;
     return 0;
 }
+
 #else
-static size_t getProcessMemUsage(int ignore) { return 0; }
+static size_t getProcessMemUsage(int ignore) { (void)ignore; return 0; }
 #endif
 
 void time_collisions(searchCollisions_parameters param)

From 8407be8ce8ff8d50062ad9a1488211d573f03eaf Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Wed, 18 Aug 2021 01:53:11 +0900
Subject: [PATCH 106/187] Fix C++ style comment

gcc-4.8 -pedantic reports the following warning (as an error).

gcc-4.8 -O3 -Wall -Wextra -Wconversion -Wcast-qual -Wcast-align -Wshadow -Wstrict-aliasing=1 -Wswitch-enum -Wdeclaration-after-statement -Wstrict-prototypes -Wundef -Wpointer-arith -Wformat-security -Wvla -Wformat=2 -Winit-self -Wfloat-equal -Wwrite-strings -Wredundant-decls -Wstrict-overflow=2  -Werror -pedantic -Wno-long-long   -DXXH_NO_XXH3 xxhash.c -c
In file included from xxhash.c:43:0:
xxhash.h:3860:9: error: C++ style comments are not allowed in ISO C90 [-Werror]
         // MSVC 32bit mode does not support _mm_set_epi64x before 2015
         ^
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index a4d7432e..f3a377ef 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3857,7 +3857,7 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void XXH3_initCustomSecret_sse2(void* XXH_RESTR
     {   int const nbRounds = XXH_SECRET_DEFAULT_SIZE / sizeof(__m128i);
 
 #       if defined(_MSC_VER) && defined(_M_IX86) && _MSC_VER < 1900
-        // MSVC 32bit mode does not support _mm_set_epi64x before 2015
+        /* MSVC 32bit mode does not support _mm_set_epi64x before 2015 */
         XXH_ALIGN(16) const xxh_i64 seed64x2[2] = { (xxh_i64)seed64, (xxh_i64)(0U - seed64) };
         __m128i const seed = _mm_load_si128((__m128i const*)seed64x2);
 #       else

From e523b457b5b6f38e9577bb06f2839fc65da0357e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 17 Aug 2021 15:31:05 -0700
Subject: [PATCH 107/187] blindfix for mingw32 conversion warning

---
 tests/collisions/main.c | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/tests/collisions/main.c b/tests/collisions/main.c
index 86497cc6..1f9318d8 100644
--- a/tests/collisions/main.c
+++ b/tests/collisions/main.c
@@ -432,8 +432,10 @@ static inline int Filter_insert(Filter* bf, int bflog, uint64_t hash)
      static const unsigned nextValue[4] = { 1, 2, 3, 3 };
 
      bf[pos1] &= (Filter)(~(3 << shift1)); /* erase previous value */
-     bf[pos1] |= (Filter)(MAX(ex1, nextValue[existing]) << shift1);
-     bf[pos2] |= (Filter)(MAX(ex2, nextValue[existing]) << shift2);
+     unsigned const max1 = MAX(ex1, nextValue[existing]);
+     bf[pos1] |= (Filter)(max1 << shift1);
+     unsigned const max2 = MAX(ex2, nextValue[existing]);
+     bf[pos2] |= (Filter)(max2 << shift2);
 
      return addCandidates[existing];
  }

From 81ae5e6b7ed991e9da2f16a1cc14e6318114005e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 17 Aug 2021 16:00:03 -0700
Subject: [PATCH 108/187] removed some flacky Appveyor tests

Their equivalent already runs fine on Github Actions.

This will also make Appveyor tests finish faster,
which matters as Appveyor is the limiting factor.
---
 appveyor.yml | 15 +++++++++------
 1 file changed, 9 insertions(+), 6 deletions(-)

diff --git a/appveyor.yml b/appveyor.yml
index 850f48b1..ab23eee8 100644
--- a/appveyor.yml
+++ b/appveyor.yml
@@ -30,12 +30,15 @@ environment:
     ARCH: "ARM64"
     APPVEYOR_BUILD_WORKER_IMAGE: Visual Studio 2017
     # note: ARM64 is not available with Visual Studio 14 2015, which is default for Appveyor
-  - COMPILER: "gcc"
-    PLATFORM: "mingw64"
-  - COMPILER: "gcc"
-    PLATFORM: "mingw32"
-  - COMPILER: "gcc"
-    PLATFORM: "clang"
+# Below tests are now disabled.
+# They are flacky on Appveyor, for various reasons.
+# Moreover, their equivalent already runs correctly on Github Actions.
+#  - COMPILER: "gcc"
+#    PLATFORM: "mingw64"
+#  - COMPILER: "gcc"
+#    PLATFORM: "mingw32"
+#  - COMPILER: "gcc"
+#    PLATFORM: "clang"
 
 install:
   - ECHO Installing %COMPILER% %PLATFORM% %ARCH%

From 1dcc4d5d3ea79c5dc77ccbcecb16c02db36388f7 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Tue, 17 Aug 2021 16:21:21 -0700
Subject: [PATCH 109/187] fix #562 : compilation with gcc-8

checked that performance remains unaffected
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index a4d7432e..d13ca280 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4653,7 +4653,7 @@ XXH3_update(XXH3_state_t* state,
         XXH_ASSERT(input < bEnd);
 
         /* Consume input by a multiple of internal buffer size */
-        if (input+XXH3_INTERNALBUFFER_SIZE < bEnd) {
+        if (bEnd - input > XXH3_INTERNALBUFFER_SIZE) {
             const xxh_u8* const limit = bEnd - XXH3_INTERNALBUFFER_SIZE;
             do {
                 XXH3_consumeStripes(state->acc,

From afc3454dd5d0e4dab8f2b7f2f99dc20881b9c50b Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 21 Aug 2021 06:40:14 -0700
Subject: [PATCH 110/187] first attempt at removing const float*

to fix #559
---
 xxhash.h | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 081fc681..f7972b1f 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3865,8 +3865,8 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void XXH3_initCustomSecret_sse2(void* XXH_RESTR
 #       endif
         int i;
 
-        XXH_ALIGN(64)        const float* const src  = (float const*) XXH3_kSecret;
-        XXH_ALIGN(XXH_SEC_ALIGN) __m128i*       dest = (__m128i*) customSecret;
+        const __m128i* const src16 = (const __m128i *)XXH3_kSecret;
+        __m128i* dest = (__m128i*) customSecret;
 #       if defined(__GNUC__) || defined(__clang__)
         /*
          * On GCC & Clang, marking 'dest' as modified will cause the compiler:
@@ -3877,7 +3877,7 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void XXH3_initCustomSecret_sse2(void* XXH_RESTR
 #       endif
 
         for (i=0; i < nbRounds; ++i) {
-            dest[i] = _mm_add_epi64(_mm_castps_si128(_mm_load_ps(src+i*4)), seed);
+            dest[i] = _mm_add_epi64(_mm_load_si128(src16+i), seed);
     }   }
 }
 

From 79458e46d12cedc9acff0b02852cbee1e2daf17c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 21 Aug 2021 06:59:27 -0700
Subject: [PATCH 111/187] added alignment check

---
 xxhash.h | 10 ++++++----
 1 file changed, 6 insertions(+), 4 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index f7972b1f..4eeb2a74 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3865,19 +3865,21 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void XXH3_initCustomSecret_sse2(void* XXH_RESTR
 #       endif
         int i;
 
-        const __m128i* const src16 = (const __m128i *)XXH3_kSecret;
-        __m128i* dest = (__m128i*) customSecret;
+        const void* const src16 = XXH3_kSecret;
+        __m128i* dst16 = (__m128i*) customSecret;
 #       if defined(__GNUC__) || defined(__clang__)
         /*
          * On GCC & Clang, marking 'dest' as modified will cause the compiler:
          *   - do not extract the secret from sse registers in the internal loop
          *   - use less common registers, and avoid pushing these reg into stack
          */
-        XXH_COMPILER_GUARD(dest);
+        XXH_COMPILER_GUARD(dst16);
 #       endif
+        XXH_ASSERT(((size_t)src16 & 15) == 0); /* control alignment */
+        XXH_ASSERT(((size_t)dst16 & 15) == 0);
 
         for (i=0; i < nbRounds; ++i) {
-            dest[i] = _mm_add_epi64(_mm_load_si128(src16+i), seed);
+            dst16[i] = _mm_add_epi64(_mm_load_si128((const __m128i *)src16+i), seed);
     }   }
 }
 

From 286752b58e4221f34226631d5eded412805ebd91 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 21 Aug 2021 09:07:11 -0700
Subject: [PATCH 112/187] remove redudant tests from Appveyor

VS 2017 tests are already run on Github Actions.

This reduce the time spent in Appveyor,
which remains the limiting factor in test speed.
---
 appveyor.yml | 23 ++++++++++++-----------
 1 file changed, 12 insertions(+), 11 deletions(-)

diff --git a/appveyor.yml b/appveyor.yml
index ab23eee8..7aef900f 100644
--- a/appveyor.yml
+++ b/appveyor.yml
@@ -13,10 +13,6 @@ environment:
   - COMPILER: "visual"
     ARCH: "x64"
     TEST_XXHSUM: "true"
-  - COMPILER: "visual"
-    ARCH: "x64"
-    APPVEYOR_BUILD_WORKER_IMAGE: Visual Studio 2017
-    TEST_XXHSUM: "true"
   - COMPILER: "visual"
     ARCH: "Win32"
     TEST_XXHSUM: "true"
@@ -26,13 +22,18 @@ environment:
     TEST_XXHSUM: "true"
   - COMPILER: "visual"
     ARCH: "ARM"
-  - COMPILER: "visual"
-    ARCH: "ARM64"
-    APPVEYOR_BUILD_WORKER_IMAGE: Visual Studio 2017
-    # note: ARM64 is not available with Visual Studio 14 2015, which is default for Appveyor
-# Below tests are now disabled.
-# They are flacky on Appveyor, for various reasons.
-# Moreover, their equivalent already runs correctly on Github Actions.
+# Below tests are now disabled due to redundancy.
+# Their equivalent already runs correctly on Github Actions.
+#  - COMPILER: "visual"
+#    ARCH: "x64"
+#    APPVEYOR_BUILD_WORKER_IMAGE: Visual Studio 2017
+#    TEST_XXHSUM: "true"
+#  - COMPILER: "visual"
+#    ARCH: "ARM64"
+#    APPVEYOR_BUILD_WORKER_IMAGE: Visual Studio 2017
+#    # note: ARM64 is not available with Visual Studio 14 2015, which is default for Appveyor
+
+# The following tests were also flacky on Appveyor, for various reasons.
 #  - COMPILER: "gcc"
 #    PLATFORM: "mingw64"
 #  - COMPILER: "gcc"

From f87aec4a7adea9973c0be5690cb34cd9f839edd7 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 21 Aug 2021 09:56:34 -0700
Subject: [PATCH 113/187] minor cleaning

As suggested by @pdillinger in #549,
`XXH_endianess` is never used and should be removed.
---
 cli/xsum_sanity_check.h | 8 ++++----
 xxhash.h                | 5 ++---
 2 files changed, 6 insertions(+), 7 deletions(-)

diff --git a/cli/xsum_sanity_check.h b/cli/xsum_sanity_check.h
index 9f3f2b85..4e3bc0f6 100644
--- a/cli/xsum_sanity_check.h
+++ b/cli/xsum_sanity_check.h
@@ -40,16 +40,16 @@ extern "C" {
  *
  * Exits if any of these tests fail, printing a message to stderr.
  *
- * If XSUM_NO_TESTS is defined to non-zero, this will instead print a warning
- * if this is called (e.g. via xxhsum -b).
+ * If XSUM_NO_TESTS is defined to non-zero,
+ * this will instead print a warning if this is called (e.g. via xxhsum -b).
  */
 XSUM_API void XSUM_sanityCheck(void);
 
 /*
  * Fills a test buffer with pseudorandom data.
  *
- * This is used in the sanity check and the benchmarks - its values must not be
- * changed.
+ * This is used in the sanity check and the benchmarks.
+ * Its values must not be changed.
  */
 XSUM_API void XSUM_fillTestBuffer(XSUM_U8* buffer, size_t len);
 
diff --git a/xxhash.h b/xxhash.h
index 081fc681..940676f0 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1542,7 +1542,6 @@ static xxh_u32 XXH_read32(const void* memPtr)
 
 
 /* ***   Endianness   *** */
-typedef enum { XXH_bigEndian=0, XXH_littleEndian=1 } XXH_endianess;
 
 /*!
  * @ingroup tuning
@@ -1552,8 +1551,8 @@ typedef enum { XXH_bigEndian=0, XXH_littleEndian=1 } XXH_endianess;
  * Defined to 1 if the target is little endian, or 0 if it is big endian.
  * It can be defined externally, for example on the compiler command line.
  *
- * If it is not defined, a runtime check (which is usually constant folded)
- * is used instead.
+ * If it is not defined,
+ * a runtime check (which is usually constant folded) is used instead.
  *
  * @note
  *   This is not necessarily defined to an integer constant.

From 7a41e52df3bcccae9ed174fd930a1ed292938290 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 23 Aug 2021 06:31:01 -0700
Subject: [PATCH 114/187] added memory access method 1 for mips on gcc

recommended by info@mobile-stream.com
---
 xxhash.h | 25 ++++++++++++++++++-------
 1 file changed, 18 insertions(+), 7 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 570eef7a..3500f1e6 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -553,11 +553,11 @@ C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 */
 
 #if defined (__has_c_attribute)
-#   if __has_c_attribute(fallthrough) 
+#   if __has_c_attribute(fallthrough)
 #       define XXH_FALLTHROUGH [[fallthrough]]
 #   endif
 
-#elif defined(__cplusplus) && defined(__has_cpp_attribute) 
+#elif defined(__cplusplus) && defined(__has_cpp_attribute)
 #   if __has_cpp_attribute(fallthrough)
 #       define XXH_FALLTHROUGH [[fallthrough]]
 #   endif
@@ -567,7 +567,7 @@ C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 #   if defined(__GNUC__) && __GNUC__ >= 7
 #       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
 #   else
-#       define XXH_FALLTHROUGH 
+#       define XXH_FALLTHROUGH
 #	endif
 #endif
 
@@ -1288,10 +1288,21 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  */
 
 #ifndef XXH_FORCE_MEMORY_ACCESS   /* can be defined externally, on command line for example */
-   /* prefer __packed__ structures (method 1) for gcc on armv7 and armv8 */
-#  if !defined(__clang__) && ( \
+   /* prefer __packed__ structures (method 1) for gcc on armv7+ and mips */
+#  if !defined(__clang__) && \
+( \
     (defined(__INTEL_COMPILER) && !defined(_WIN32)) || \
-    (defined(__GNUC__) && (defined(__ARM_ARCH) && __ARM_ARCH >= 7)) )
+    ( \
+        defined(__GNUC__) && ( \
+            (defined(__ARM_ARCH) && __ARM_ARCH >= 7) || \
+            ( \
+                defined(__mips__) && \
+                (__mips <= 5 || __mips_isa_rev < 6) && \
+                (!defined(__mips16) || defined(__mips_mips16e2)) \
+            ) \
+        ) \
+    ) \
+)
 #    define XXH_FORCE_MEMORY_ACCESS 1
 #  endif
 #endif
@@ -1897,7 +1908,7 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
                          XXH_FALLTHROUGH;
            case 4:       XXH_PROCESS4;
                          return XXH32_avalanche(h32);
-                         
+
            case 13:      XXH_PROCESS4;
                          XXH_FALLTHROUGH;
            case 9:       XXH_PROCESS4;

From e31dc898d8fa7a11b7d42a98f3ff77f8eb0f3a73 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 23 Aug 2021 06:46:31 -0700
Subject: [PATCH 115/187] add qemu-mips test to GA

---
 .github/workflows/ci.yml | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 343baa19..8a478317 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -214,6 +214,7 @@ jobs:
           { name: ARM64,    xcc_pkg: gcc-aarch64-linux-gnu,     xcc: aarch64-linux-gnu-gcc,     xemu_pkg: qemu-system-arm,    xemu: qemu-aarch64-static },
           { name: PPC64LE,  xcc_pkg: gcc-powerpc64le-linux-gnu, xcc: powerpc64le-linux-gnu-gcc, xemu_pkg: qemu-system-ppc,    xemu: qemu-ppc64le-static },
           { name: S390X,    xcc_pkg: gcc-s390x-linux-gnu,       xcc: s390x-linux-gnu-gcc,       xemu_pkg: qemu-system-s390x,  xemu: qemu-s390x-static   },
+          { name: MIPS,     xcc_pkg: gcc-mips-linux-gnu,        xcc: mips-linux-gnu-gcc,        xemu_pkg: qemu-system-mips,   xemu: qemu-mips-static    },
         ]
     env:                        # Set environment variables
       XCC: ${{ matrix.xcc }}
@@ -224,7 +225,7 @@ jobs:
       run: |
         sudo apt-get update
         sudo apt-get install gcc-multilib g++-multilib qemu-utils qemu-user-static
-        sudo apt-get install ${{ matrix.xcc_pkg }} ${{ matrix.xemu_pkg }} 
+        sudo apt-get install ${{ matrix.xcc_pkg }} ${{ matrix.xemu_pkg }}
 
     - name: Environment info
       run: |
@@ -258,6 +259,10 @@ jobs:
         CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
         CPPFLAGS=-DXXH_VECTOR=XXH_VSX CFLAGS="-O3 -march=arch11 -mzvector" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
 
+    - name: MIPS (XXH_VECTOR=[ scalar ])
+      if: ${{ matrix.name == 'MIPS' }}
+      run: |
+        LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
 
   # macOS
 

From 3231cdfb155342f3da457529c118764f28dd430c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 23 Aug 2021 11:45:34 -0700
Subject: [PATCH 116/187] XXH_INLINE_ALL can be declared after XXH_NAMESPACE

It's now available again,
and tested with `multiInclude` test
---
 tests/Makefile       | 19 +++++++--------
 tests/multiInclude.c |  9 ++++---
 xxhash.h             | 57 ++++++++++++++++++++++++++++++++++++++++++--
 3 files changed, 70 insertions(+), 15 deletions(-)

diff --git a/tests/Makefile b/tests/Makefile
index 75a41ded..936a2ac1 100644
--- a/tests/Makefile
+++ b/tests/Makefile
@@ -63,17 +63,16 @@ test_multiInclude:
 	# compile with xxhash.o, to detect duplicated symbols
 	$(MAKE) multiInclude_withxxhash
 	@$(MAKE) clean
-	# Note: XXH_INLINE_ALL with XXH_NAMESPACE is currently disabled
-	# compile with XXH_NAMESPACE
-	# CPPFLAGS=-DXXH_NAMESPACE=TESTN_ $(MAKE) multiInclude_withxxhash
-	# no symbol prefixed TESTN_ should exist
-	# ! $(NM) multiInclude_withxxhash | $(GREP) TESTN_
-	#$(MAKE) clean
-	# compile with XXH_NAMESPACE and without xxhash.o
-	# CPPFLAGS=-DXXH_NAMESPACE=TESTN_ $(MAKE) multiInclude
+	# compile with XXH_NAMESPACE before XXH_INLINE_ALL
+	CPPFLAGS=-DXXH_NAMESPACE=TESTN_ $(MAKE) multiInclude
 	# no symbol prefixed TESTN_ should exist
-	# ! $(NM) multiInclude | $(GREP) TESTN_
-	#@$(MAKE) clean
+	! $(NM) multiInclude | $(GREP) TESTN_
+	$(MAKE) clean
+	# compile with XXH_NAMESPACE
+	CPPFLAGS=-DXXH_NAMESPACE=TESTN_ $(MAKE) multiInclude_withxxhash
+	# symbols prefixed TESTN_ should exist in xxhash.o (though not be invoked)
+	$(NM) multiInclude_withxxhash | $(GREP) TESTN_
+	$(MAKE) clean
 
 .PHONY: test_ppc_redefine
 test_ppc_redefine: ppc_define.c
diff --git a/tests/multiInclude.c b/tests/multiInclude.c
index 650f38e8..fc7e46fd 100644
--- a/tests/multiInclude.c
+++ b/tests/multiInclude.c
@@ -50,9 +50,9 @@
 #include "../xxhash.h"
 
 
-int main(void)
+void hash_advanced(void)
 {
-    XXH3_state_t state;   /* part of experimental API */
+    XXH3_state_t state;   /* this type is part of experimental API */
 
     XXH3_64bits_reset(&state);
     const char input[] = "Hello World !";
@@ -61,6 +61,9 @@ int main(void)
 
     XXH64_hash_t const h = XXH3_64bits_digest(&state);
     printf("hash '%s': %08x%08x \n", input, (unsigned)(h >> 32), (unsigned)h);
+}
 
-    return 0;
+int main(void)
+{
+    hash_advanced();
 }
diff --git a/xxhash.h b/xxhash.h
index 3500f1e6..3b5d347c 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -129,7 +129,60 @@ extern "C" {
     * avoiding naming collision with previous inclusions.
     */
 #  ifdef XXH_NAMESPACE
-#    error "XXH_INLINE_ALL with XXH_NAMESPACE is not supported"
+    /* #undef all symbols, they will be redefined right after */
+#      undef XXH_versionNumber
+    /* XXH32 */
+#      undef XXH32
+#      undef XXH32_createState
+#      undef XXH32_freeState
+#      undef XXH32_reset
+#      undef XXH32_update
+#      undef XXH32_digest
+#      undef XXH32_copyState
+#      undef XXH32_canonicalFromHash
+#      undef XXH32_hashFromCanonical
+    /* XXH64 */
+#      undef XXH64
+#      undef XXH64_createState
+#      undef XXH64_freeState
+#      undef XXH64_reset
+#      undef XXH64_update
+#      undef XXH64_digest
+#      undef XXH64_copyState
+#      undef XXH64_canonicalFromHash
+#      undef XXH64_hashFromCanonical
+    /* XXH3_64bits */
+#      undef XXH3_64bits
+#      undef XXH3_64bits_withSecret
+#      undef XXH3_64bits_withSeed
+#      undef XXH3_createState
+#      undef XXH3_freeState
+#      undef XXH3_copyState
+#      undef XXH3_64bits_reset
+#      undef XXH3_64bits_reset_withSeed
+#      undef XXH3_64bits_reset_withSecret
+#      undef XXH3_64bits_update
+#      undef XXH3_64bits_digest
+#      undef XXH3_generateSecret
+    /* XXH3_128bits */
+#      undef XXH128
+#      undef XXH3_128bits
+#      undef XXH3_128bits_withSeed
+#      undef XXH3_128bits_withSecret
+#      undef XXH3_128bits_reset
+#      undef XXH3_128bits_reset_withSeed
+#      undef XXH3_128bits_reset_withSecret
+#      undef XXH3_128bits_update
+#      undef XXH3_128bits_digest
+#      undef XXH128_isEqual
+#      undef XXH128_cmp
+#      undef XXH128_canonicalFromHash
+#      undef XXH128_hashFromCanonical
+    /* Finally, free the namespace itself */
+#      undef XXH_NAMESPACE
+
+
+//#    error "XXH_INLINE_ALL with XXH_NAMESPACE is not supported"
      /*
       * Note: Alternative: #undef all symbols (it's a pretty large list).
       * Without #error: it compiles, but functions are actually not inlined.
@@ -143,7 +196,7 @@ extern "C" {
     * However, this requires some #ifdefs, and is a more dispersed action.
     * Meanwhile, renaming can be achieved in a single block
     */
-#  define XXH_IPREF(Id)   XXH_INLINE_ ## Id
+#  define XXH_IPREF(Id)   XXH_NAMESPACE ## Id
 #  define XXH_OK XXH_IPREF(XXH_OK)
 #  define XXH_ERROR XXH_IPREF(XXH_ERROR)
 #  define XXH_errorcode XXH_IPREF(XXH_errorcode)

From 81d343d04b5405244f8b42e7bf20fc60b822cb75 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 23 Aug 2021 11:59:03 -0700
Subject: [PATCH 117/187] check multiple inclusions of XXH_INLINE_ALL

and tidy up code.
---
 tests/Makefile       |   2 +-
 tests/multiInclude.c |  21 +++++---
 xxhash.h             | 116 +++++++++++++++++++++----------------------
 3 files changed, 72 insertions(+), 67 deletions(-)

diff --git a/tests/Makefile b/tests/Makefile
index 936a2ac1..37424ce4 100644
--- a/tests/Makefile
+++ b/tests/Makefile
@@ -68,7 +68,7 @@ test_multiInclude:
 	# no symbol prefixed TESTN_ should exist
 	! $(NM) multiInclude | $(GREP) TESTN_
 	$(MAKE) clean
-	# compile with XXH_NAMESPACE
+	# compile xxhash.o with XXH_NAMESPACE
 	CPPFLAGS=-DXXH_NAMESPACE=TESTN_ $(MAKE) multiInclude_withxxhash
 	# symbols prefixed TESTN_ should exist in xxhash.o (though not be invoked)
 	$(NM) multiInclude_withxxhash | $(GREP) TESTN_
diff --git a/tests/multiInclude.c b/tests/multiInclude.c
index fc7e46fd..8912771d 100644
--- a/tests/multiInclude.c
+++ b/tests/multiInclude.c
@@ -30,21 +30,30 @@
 /* Normal include, gives access to public symbols */
 #include "../xxhash.h"
 
+/* Multiple consecutive inclusions are handled properly. */
+#include "../xxhash.h"
+
 /*
  * Advanced include, gives access to experimental symbols
- * This test ensures that xxhash.h can be included multiple times and in any
- * order. This order is more difficult: Without care, the declaration of
- * experimental symbols could be skipped.
+ * This test ensures that xxhash.h can be included multiple times
+ * and in any order. The tested order is more difficult:
+ * without care, the declaration of experimental symbols could be skipped.
  */
 #define XXH_STATIC_LINKING_ONLY
 #include "../xxhash.h"
 
 /*
- * Inlining: Re-define all identifiers, keep them private to the unit.
+ * Inlining: redefine all identifiers, keep them private to the unit.
  * Note: Without specific efforts, the identifier names would collide.
  *
- * To be linked with and without xxhash.o to test the symbol's presence and
- * naming collisions.
+ * To be linked with and without xxhash.o
+ * to test the symbol's presence and naming collisions.
+ */
+#define XXH_INLINE_ALL
+#include "../xxhash.h"
+
+/*
+ * Multiple consecutive inclusions with XXH_INLINE_ALL are handled properly.
  */
 #define XXH_INLINE_ALL
 #include "../xxhash.h"
diff --git a/xxhash.h b/xxhash.h
index 3b5d347c..ced5dcc1 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -121,80 +121,76 @@ extern "C" {
 
    /*
     * This part deals with the special case where a unit wants to inline xxHash,
-    * but "xxhash.h" has previously been included without XXH_INLINE_ALL, such
-    * as part of some previously included *.h header file.
+    * but "xxhash.h" has previously been included without XXH_INLINE_ALL,
+    * such as part of some previously included *.h header file.
     * Without further action, the new include would just be ignored,
     * and functions would effectively _not_ be inlined (silent failure).
     * The following macros solve this situation by prefixing all inlined names,
     * avoiding naming collision with previous inclusions.
     */
-#  ifdef XXH_NAMESPACE
-    /* #undef all symbols, they will be redefined right after */
-#      undef XXH_versionNumber
+   /* Before that, we unconditionally #undef all symbols,
+    * in case they were already defined with XXH_NAMESPACE.
+    * They will then be redefined for XXH_INLINE_ALL
+    */
+#  undef XXH_versionNumber
     /* XXH32 */
-#      undef XXH32
-#      undef XXH32_createState
-#      undef XXH32_freeState
-#      undef XXH32_reset
-#      undef XXH32_update
-#      undef XXH32_digest
-#      undef XXH32_copyState
-#      undef XXH32_canonicalFromHash
-#      undef XXH32_hashFromCanonical
+#  undef XXH32
+#  undef XXH32_createState
+#  undef XXH32_freeState
+#  undef XXH32_reset
+#  undef XXH32_update
+#  undef XXH32_digest
+#  undef XXH32_copyState
+#  undef XXH32_canonicalFromHash
+#  undef XXH32_hashFromCanonical
     /* XXH64 */
-#      undef XXH64
-#      undef XXH64_createState
-#      undef XXH64_freeState
-#      undef XXH64_reset
-#      undef XXH64_update
-#      undef XXH64_digest
-#      undef XXH64_copyState
-#      undef XXH64_canonicalFromHash
-#      undef XXH64_hashFromCanonical
+#  undef XXH64
+#  undef XXH64_createState
+#  undef XXH64_freeState
+#  undef XXH64_reset
+#  undef XXH64_update
+#  undef XXH64_digest
+#  undef XXH64_copyState
+#  undef XXH64_canonicalFromHash
+#  undef XXH64_hashFromCanonical
     /* XXH3_64bits */
-#      undef XXH3_64bits
-#      undef XXH3_64bits_withSecret
-#      undef XXH3_64bits_withSeed
-#      undef XXH3_createState
-#      undef XXH3_freeState
-#      undef XXH3_copyState
-#      undef XXH3_64bits_reset
-#      undef XXH3_64bits_reset_withSeed
-#      undef XXH3_64bits_reset_withSecret
-#      undef XXH3_64bits_update
-#      undef XXH3_64bits_digest
-#      undef XXH3_generateSecret
+#  undef XXH3_64bits
+#  undef XXH3_64bits_withSecret
+#  undef XXH3_64bits_withSeed
+#  undef XXH3_createState
+#  undef XXH3_freeState
+#  undef XXH3_copyState
+#  undef XXH3_64bits_reset
+#  undef XXH3_64bits_reset_withSeed
+#  undef XXH3_64bits_reset_withSecret
+#  undef XXH3_64bits_update
+#  undef XXH3_64bits_digest
+#  undef XXH3_generateSecret
     /* XXH3_128bits */
-#      undef XXH128
-#      undef XXH3_128bits
-#      undef XXH3_128bits_withSeed
-#      undef XXH3_128bits_withSecret
-#      undef XXH3_128bits_reset
-#      undef XXH3_128bits_reset_withSeed
-#      undef XXH3_128bits_reset_withSecret
-#      undef XXH3_128bits_update
-#      undef XXH3_128bits_digest
-#      undef XXH128_isEqual
-#      undef XXH128_cmp
-#      undef XXH128_canonicalFromHash
-#      undef XXH128_hashFromCanonical
+#  undef XXH128
+#  undef XXH3_128bits
+#  undef XXH3_128bits_withSeed
+#  undef XXH3_128bits_withSecret
+#  undef XXH3_128bits_reset
+#  undef XXH3_128bits_reset_withSeed
+#  undef XXH3_128bits_reset_withSecret
+#  undef XXH3_128bits_update
+#  undef XXH3_128bits_digest
+#  undef XXH128_isEqual
+#  undef XXH128_cmp
+#  undef XXH128_canonicalFromHash
+#  undef XXH128_hashFromCanonical
     /* Finally, free the namespace itself */
-#      undef XXH_NAMESPACE
-
+#  undef XXH_NAMESPACE
 
-//#    error "XXH_INLINE_ALL with XXH_NAMESPACE is not supported"
-     /*
-      * Note: Alternative: #undef all symbols (it's a pretty large list).
-      * Without #error: it compiles, but functions are actually not inlined.
-      */
-#  endif
+    /* employ the namespace for XXH_INLINE_ALL */
 #  define XXH_NAMESPACE XXH_INLINE_
    /*
-    * Some identifiers (enums, type names) are not symbols, but they must
-    * still be renamed to avoid redeclaration.
+    * Some identifiers (enums, type names) are not symbols,
+    * but they must nonetheless be renamed to avoid redeclaration.
     * Alternative solution: do not redeclare them.
-    * However, this requires some #ifdefs, and is a more dispersed action.
-    * Meanwhile, renaming can be achieved in a single block
+    * However, this requires some #ifdefs, and has a more dispersed impact.
+    * Meanwhile, renaming can be achieved in a single place.
     */
 #  define XXH_IPREF(Id)   XXH_NAMESPACE ## Id
 #  define XXH_OK XXH_IPREF(XXH_OK)

From dcb69ac3ef2b47bfe2bc33a186cc070e588232b7 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 23 Aug 2021 13:57:07 -0700
Subject: [PATCH 118/187] set XXH_REROLL=1 on gcc

but not on clang.
---
 xxhash.h | 12 +++++++-----
 1 file changed, 7 insertions(+), 5 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index ced5dcc1..147c560c 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1310,13 +1310,13 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 
 /*!
  * @def XXH_REROLL
- * @brief Whether to reroll `XXH32_finalize` and `XXH64_finalize`.
+ * @brief Whether to reroll `XXH32_finalize`.
  *
- * For performance, `XXH32_finalize` and `XXH64_finalize` use an unrolled loop
+ * For performance, `XXH32_finalize` uses an unrolled loop
  * in the form of a switch statement.
  *
- * This is not always desirable, as it generates larger code, and depending on
- * the architecture, may even be slower
+ * This is not always desirable, as it generates larger code,
+ * and depending on the architecture, may even be slower
  *
  * This is automatically defined with `-Os`/`-Oz` on GCC and Clang.
  */
@@ -1379,7 +1379,9 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 #endif
 
 #ifndef XXH_REROLL
-#  if defined(__OPTIMIZE_SIZE__)
+#  if defined(__OPTIMIZE_SIZE__) /* -Os, -Oz */ || \
+     (defined(__GNUC__) && !defined(__clang__))
+     /* The if/then loop is preferable to switch/case on gcc (on x64) */
 #    define XXH_REROLL 1
 #  else
 #    define XXH_REROLL 0

From a067a4e9b9d3962505ed1ca73af04a871c6fa43e Mon Sep 17 00:00:00 2001
From: Lior Lahav <LahavLior@gmail.com>
Date: Tue, 24 Aug 2021 09:30:15 +0300
Subject: [PATCH 119/187] Fixed: fallthrough attribute for Clang

---
 xxhash.h | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/xxhash.h b/xxhash.h
index ced5dcc1..b5dceab9 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -615,6 +615,8 @@ C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 #ifndef XXH_FALLTHROUGH
 #   if defined(__GNUC__) && __GNUC__ >= 7
 #       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
+#   elif defined(__clang__) && (__clang_major__ > 3 || (__clang_major__ == 3 && __clang_minor__  >= 9))
+#       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
 #   else
 #       define XXH_FALLTHROUGH
 #	endif

From b94f4160f3fe4c76b42af9f21715d27ced689f07 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 26 Aug 2021 03:42:03 +0900
Subject: [PATCH 120/187] Fix clang version checking for fallthrough attribute

Without this change, "CC=clang-9 make clean all" fails with the following warning (as an error).

```
$ CC=clang-9 make clean all
clang-9 -O3     -c -o xxhash.o xxhash.c
In file included from xxhash.c:43:
./xxhash.h:1959:26: warning: declaration does not declare anything [-Wmissing-declarations]
                         XXH_FALLTHROUGH;
                         ^
./xxhash.h:619:32: note: expanded from macro 'XXH_FALLTHROUGH'
#       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
                               ^
```

Since clang [3.9 , 9] don't support GNU-style __attribute__((fallthrough)),
we need to fix the version cheking.

Please refer the Clang documentations below

fallthrough - Clang 10 documentation
https://releases.llvm.org/10.0.0/tools/clang/docs/AttributeReference.html#fallthrough

fallthrough - Clang 9 documentation
https://releases.llvm.org/9.0.0/tools/clang/docs/AttributeReference.html#fallthrough

fallthrough - Clang 3.9 documentation
https://releases.llvm.org/3.9.0/tools/clang/docs/AttributeReference.html#fallthrough-clang-fallthrough
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 4abc1e30..bb60f9a0 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -615,7 +615,7 @@ C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 #ifndef XXH_FALLTHROUGH
 #   if defined(__GNUC__) && __GNUC__ >= 7
 #       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
-#   elif defined(__clang__) && (__clang_major__ > 3 || (__clang_major__ == 3 && __clang_minor__  >= 9))
+#   elif defined(__clang__) && (__clang_major__ >= 10)
 #       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
 #   else
 #       define XXH_FALLTHROUGH

From 2c053739afdfd88c75cfa91c12c493bf4ba16aab Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 26 Aug 2021 03:47:12 +0900
Subject: [PATCH 121/187] Add C2x version checking for "gcc-11 -pedantic"
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

"gcc-11 -std=c90 -pedantic" reports the following warning as an error.

```
$ CC=gcc-11 make clean all

gcc-11 -std=c90 -pedantic -Wno-long-long -Werror -Wall -Wextra -Wconversion -Wcast-qual -Wcast-align -Wshadow -Wstrict-aliasing=1 -Wswitch-enum -Wdeclaration-after-statement -Wstrict-prototypes -Wundef -Wpointer-arith -Wformat-security -Wvla -Wformat=2 -Winit-self -Wfloat-equal -Wwrite-strings -Wredundant-decls -Wstrict-overflow=2 -c -o xxhash.o xxhash.c

In file included from xxhash.c:43:
xxhash.h: In function ‘XXH32_finalize’:
xxhash.h:557:32: error: ISO C does not support ‘[[]]’ attributes before C2X [-Werror=pedantic]
557 | # define XXH_FALLTHROUGH [[fallthrough]]
```

Since we can't use "[[...]]" style attributes with pre-C2X, we must check __STDC_VERSION__.
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index bb60f9a0..c79b033f 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -601,7 +601,7 @@ CPP17 : https://en.cppreference.com/w/cpp/language/attributes/fallthrough
 C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 */
 
-#if defined (__has_c_attribute)
+#if defined (__has_c_attribute) && defined (__STDC_VERSION__) && (__STDC_VERSION__ > 201710L) /* C2x */
 #   if __has_c_attribute(fallthrough)
 #       define XXH_FALLTHROUGH [[fallthrough]]
 #   endif

From aa9a586693f72d8bc12fad65619527176d9a3dd6 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 26 Aug 2021 04:02:48 +0900
Subject: [PATCH 122/187] Introduce cast via void* for AVX2 and AVX512

This redundant cast has introduced at 79458e4 to avoid
strange warning from clang-3.9 and earlier versions.

Since it's compiler's bug, we can remove this cast in future.

See also: https://github.com/Cyan4973/xxHash/pull/569
---
 xxhash.h | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 4abc1e30..82915bbe 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3748,7 +3748,7 @@ XXH3_initCustomSecret_avx512(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
     {   int const nbRounds = XXH_SECRET_DEFAULT_SIZE / sizeof(__m512i);
         __m512i const seed = _mm512_mask_set1_epi64(_mm512_set1_epi64((xxh_i64)seed64), 0xAA, (xxh_i64)(0U - seed64));
 
-        XXH_ALIGN(64) const __m512i* const src  = (const __m512i*) XXH3_kSecret;
+        XXH_ALIGN(64) const __m512i* const src  = (const __m512i*) ((const void*) XXH3_kSecret);
         XXH_ALIGN(64)       __m512i* const dest = (      __m512i*) customSecret;
         int i;
         for (i=0; i < nbRounds; ++i) {
@@ -3844,7 +3844,7 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void XXH3_initCustomSecret_avx2(void* XXH_RESTR
     XXH_PREFETCH(customSecret);
     {   __m256i const seed = _mm256_set_epi64x((xxh_i64)(0U - seed64), (xxh_i64)seed64, (xxh_i64)(0U - seed64), (xxh_i64)seed64);
 
-        XXH_ALIGN(64) const __m256i* const src  = (const __m256i*) XXH3_kSecret;
+        XXH_ALIGN(64) const __m256i* const src  = (const __m256i*) ((const void*) XXH3_kSecret);
         XXH_ALIGN(64)       __m256i*       dest = (      __m256i*) customSecret;
 
 #       if defined(__GNUC__) || defined(__clang__)

From d80f820834517bbc5beff82a08971196774debad Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 26 Aug 2021 04:11:48 +0900
Subject: [PATCH 123/187] Remove unnecessary XXH_ALIGN()

Remove unnecessary XXH_ALIGN()

These XXH_ALIGN() declares alignment of address of the pointers.
It secures

	((size_t) &src) % 64 == 0 /* true */

But it doesn't guarantee

	((size_t) src) % 64 == 0 /* ? */

Therefore, we don't need these XXH_ALIGN()s.

Also it consumes stack space for alignment (padding),
removing it reduces the amount of stack allocation.

TODO : we need to review other XXH_ALIGN() for pointers.
---
 xxhash.h | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 82915bbe..dcfff3ea 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3748,15 +3748,15 @@ XXH3_initCustomSecret_avx512(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
     {   int const nbRounds = XXH_SECRET_DEFAULT_SIZE / sizeof(__m512i);
         __m512i const seed = _mm512_mask_set1_epi64(_mm512_set1_epi64((xxh_i64)seed64), 0xAA, (xxh_i64)(0U - seed64));
 
-        XXH_ALIGN(64) const __m512i* const src  = (const __m512i*) ((const void*) XXH3_kSecret);
-        XXH_ALIGN(64)       __m512i* const dest = (      __m512i*) customSecret;
+        const __m512i* const src  = (const __m512i*) ((const void*) XXH3_kSecret);
+              __m512i* const dest = (      __m512i*) customSecret;
         int i;
         for (i=0; i < nbRounds; ++i) {
             /* GCC has a bug, _mm512_stream_load_si512 accepts 'void*', not 'void const*',
              * this will warn "discards ‘const’ qualifier". */
             union {
-                XXH_ALIGN(64) const __m512i* cp;
-                XXH_ALIGN(64) void* p;
+                const __m512i* cp;
+                void* p;
             } remote_const_void;
             remote_const_void.cp = src + i;
             dest[i] = _mm512_add_epi64(_mm512_stream_load_si512(remote_const_void.p), seed);
@@ -3844,8 +3844,8 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void XXH3_initCustomSecret_avx2(void* XXH_RESTR
     XXH_PREFETCH(customSecret);
     {   __m256i const seed = _mm256_set_epi64x((xxh_i64)(0U - seed64), (xxh_i64)seed64, (xxh_i64)(0U - seed64), (xxh_i64)seed64);
 
-        XXH_ALIGN(64) const __m256i* const src  = (const __m256i*) ((const void*) XXH3_kSecret);
-        XXH_ALIGN(64)       __m256i*       dest = (      __m256i*) customSecret;
+        const __m256i* const src  = (const __m256i*) ((const void*) XXH3_kSecret);
+              __m256i*       dest = (      __m256i*) customSecret;
 
 #       if defined(__GNUC__) || defined(__clang__)
         /*

From 68ed62099e7bd576feff454462821603e1aa6c80 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 26 Aug 2021 04:13:52 +0900
Subject: [PATCH 124/187] Add XXH_ASSERT() for alignment

---
 xxhash.h | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/xxhash.h b/xxhash.h
index dcfff3ea..38270c97 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3751,6 +3751,8 @@ XXH3_initCustomSecret_avx512(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
         const __m512i* const src  = (const __m512i*) ((const void*) XXH3_kSecret);
               __m512i* const dest = (      __m512i*) customSecret;
         int i;
+        XXH_ASSERT(((size_t)src & 63) == 0); /* control alignment */
+        XXH_ASSERT(((size_t)dest & 63) == 0);
         for (i=0; i < nbRounds; ++i) {
             /* GCC has a bug, _mm512_stream_load_si512 accepts 'void*', not 'void const*',
              * this will warn "discards ‘const’ qualifier". */
@@ -3855,6 +3857,8 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void XXH3_initCustomSecret_avx2(void* XXH_RESTR
          */
         XXH_COMPILER_GUARD(dest);
 #       endif
+        XXH_ASSERT(((size_t)src & 31) == 0); /* control alignment */
+        XXH_ASSERT(((size_t)dest & 31) == 0);
 
         /* GCC -O2 need unroll loop manually */
         dest[0] = _mm256_add_epi64(_mm256_stream_load_si256(src+0), seed);

From 889d2c3017bd8749b05dc2efd741492b345684f5 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Wed, 25 Aug 2021 14:01:02 -0700
Subject: [PATCH 125/187] fix for Apple clang

Apple clang version numbers are unlike regular versions of clang.
See https://en.wikipedia.org/wiki/Xcode#Xcode_11.x_-_13.x_(since_SwiftUI_framework)_2
---
 xxhash.h | 6 +++++-
 1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 93b5fc33..ac2e2408 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -615,7 +615,11 @@ C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 #ifndef XXH_FALLTHROUGH
 #   if defined(__GNUC__) && __GNUC__ >= 7
 #       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
-#   elif defined(__clang__) && (__clang_major__ >= 10)
+#   elif defined(__clang__) && (__clang_major__ >= 10) \
+     && (!defined(__APPLE__) || (__clang_major__ >= 12))
+     /* Apple clang 12 is effectively clang-10 ,
+      * see https://en.wikipedia.org/wiki/Xcode for details
+      */
 #       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
 #   else
 #       define XXH_FALLTHROUGH

From a916599792f01a8b998ea95c65de8b0b86b507dc Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Thu, 26 Aug 2021 06:07:07 +0900
Subject: [PATCH 126/187] Add OS/Compiler matrix to GA

This change set adds test for various version of `gcc` and `clang`.

- Adds `gcc-[4.8 , 11]`
- Adds `clang-[3.9 , 12]`
- Since `gcc-4.8` doesn't support `-mavx512f`, this change set introduces special matrix parameter `avx512`.
---
 .github/workflows/ci.yml | 69 +++++++++++++++++++++++++++++++++-------
 1 file changed, 58 insertions(+), 11 deletions(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 8a478317..37ffcf24 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -54,57 +54,104 @@ name: xxHash CI tests
 on: [push, pull_request]
 
 jobs:
+  xxhash-c-compilers:
+    name: CC=${{ matrix.cc }}, ${{ matrix.os }}
+    strategy:
+      fail-fast: false  # 'false' means Don't stop matrix workflows even if some matrix entry fails.
+      matrix:
+        include: [
+          # You can access the following values via ${{ matrix.??? }}
+          #
+          #   pkgs    : apt-get package names.  It can include multiple package names which are delimited by space.
+          #   cc      : C compiler executable.
+          #   cxx     : C++ compiler executable for `make ctocpptest`.
+          #   avx512  : Set 'true' if compiler supports avx512.  Otherwise, set 'false'.
+          #   os      : GitHub Actions YAML workflow label.  See https://github.com/actions/virtual-environments#available-environments
+
+          # cc
+          { pkgs: '',                                  cc: cc,        cxx: c++,         avx512: 'true',  os: ubuntu-latest, },
+
+          # gcc
+          { pkgs: '',                                  cc: gcc,       cxx: g++,         avx512: 'true',  os: ubuntu-latest, },
+          { pkgs: 'gcc-11  g++-11  lib32gcc-11-dev',   cc: gcc-11,    cxx: g++-11,      avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'gcc-10  g++-10  lib32gcc-10-dev',   cc: gcc-10,    cxx: g++-10,      avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'gcc-9   g++-9   lib32gcc-9-dev',    cc: gcc-9,     cxx: g++-9,       avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'gcc-8   g++-8   lib32gcc-8-dev',    cc: gcc-8,     cxx: g++-8,       avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'gcc-7   g++-7   lib32gcc-7-dev',    cc: gcc-7,     cxx: g++-7,       avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'gcc-6   g++-6   lib32gcc-6-dev',    cc: gcc-6,     cxx: g++-6,       avx512: 'true',  os: ubuntu-18.04,  },
+          { pkgs: 'gcc-5   g++-5   lib32gcc-5-dev',    cc: gcc-5,     cxx: g++-5,       avx512: 'true',  os: ubuntu-18.04,  },
+          { pkgs: 'gcc-4.8 g++-4.8 lib32gcc-4.8-dev ', cc: gcc-4.8,   cxx: g++-4.8,     avx512: 'false', os: ubuntu-18.04,  },
+
+          # clang
+          { pkgs: '',                                  cc: clang,     cxx: clang++,     avx512: 'true',  os: ubuntu-latest, },
+          { pkgs: 'clang-12',                          cc: clang-12,  cxx: clang++-12,  avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-11',                          cc: clang-11,  cxx: clang++-11,  avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-10',                          cc: clang-10,  cxx: clang++-10,  avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-9',                           cc: clang-9,   cxx: clang++-9,   avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-8',                           cc: clang-8,   cxx: clang++-8,   avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-7',                           cc: clang-7,   cxx: clang++-7,   avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-6.0',                         cc: clang-6.0, cxx: clang++-6.0, avx512: 'true',  os: ubuntu-20.04,  },
+          { pkgs: 'clang-5.0',                         cc: clang-5.0, cxx: clang++-5.0, avx512: 'true',  os: ubuntu-18.04,  },
+          { pkgs: 'clang-4.0',                         cc: clang-4.0, cxx: clang++-4.0, avx512: 'true',  os: ubuntu-18.04,  },
+          { pkgs: 'clang-3.9',                         cc: clang-3.9, cxx: clang++-3.9, avx512: 'true',  os: ubuntu-18.04,  },
+        ]
 
-  # Linux, x64
-  #
-  # - "ubuntu-latest" (Ubuntu 20.04) has the following software
-  #   https://github.com/actions/virtual-environments/blob/main/images/linux/Ubuntu2004-README.md
-
-  ubuntu-general:
-    name: Linux x64
-    runs-on: ubuntu-latest
+    runs-on: ${{ matrix.os }}
+    env:                        # Set environment variables
+      # We globally set CC and CXX to improve compatibility with .travis.yml
+      CC: ${{ matrix.cc }}
+      CXX: ${{ matrix.cxx }}
     steps:
     - uses: actions/checkout@v2 # https://github.com/actions/checkout
 
     - name: apt-get install
       run: |
+        sudo apt-get update
         sudo apt-get install gcc-multilib
+        sudo apt-get install ${{ matrix.pkgs }}
 
     - name: Environment info
       run: |
-        echo && gcc --version
-        echo && clang --version
-        echo && make -v
+        echo && type $CC && which $CC && $CC --version
+        echo && type $CXX && which $CXX && $CXX --version
+        echo && type make && make -v
         echo && cat /proc/cpuinfo || echo /proc/cpuinfo is not present
 
     - name: C90 + no-long-long compliance
+      if: always()
       run: |
         CFLAGS="-std=c90 -pedantic -Wno-long-long -Werror" make clean xxhsum
 
     - name: C90 + XXH_NO_LONG_LONG
+      if: always()
       run: |
         # strict c90, with no long long support; resulting in no XXH64_* symbol
         make clean c90test
 
     - name: dispatch
+      if: always()
       run: |
         # removing sign conversion warnings due to a bug in gcc-5's definition of some AVX512 intrinsics
         CFLAGS="-Werror" MOREFLAGS="-Wno-sign-conversion" make clean dispatch
 
     - name: DISPATCH=1
+      if: always()
       run: |
         CFLAGS="-Wall -Wextra -Werror" make DISPATCH=1 clean default
 
     - name: noxxh3test
+      if: always()
       run: |
         # check library can be compiled with XXH_NO_XXH3, resulting in no XXH3_* symbol
         make clean noxxh3test
 
     - name: make avx512f
+      if: ${{ matrix.avx512 == 'true' }}
       run: |
         CFLAGS="-O1 -mavx512f -Werror" make clean default
 
     - name: test-all
+      if: always()
       run: |
         make clean test-all
 

From 665424e72078511238fe9053159dc073244672e8 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Wed, 25 Aug 2021 15:30:42 -0700
Subject: [PATCH 127/187] added variant XXH3_64bits_withSecretandSeed()

and its companion XXH3_generateSecret_fromSeed().

The new variant uses @seed for "small" keys,
and @secret for "large" keys.
When combined with the new generator,
it provides exactly the same results as _withSeed(),
with the benefit of pre-caculated @secret for "large keys",
resulting in a speed boost for "not too large" keys (<1 KB).
---
 cli/xsum_sanity_check.c |  10 +++
 xxhash.h                | 153 ++++++++++++++++++++++++++++------------
 2 files changed, 117 insertions(+), 46 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 347d1db5..27a2337d 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -378,6 +378,16 @@ static void XSUM_testXXH3(const void* data, const XSUM_testdata64_t* testData)
         XSUM_checkResult64(Dresult, Nresult);
     }
 
+    /* check that the combination of
+     * XXH3_generateSecret_fromSeed() and XXH3_64bits_withSecretandSeed()
+     * results in exactly the same hash generation as XXH3_64bits_withSeed() */
+    {   char secretBuffer[XXH3_SECRET_DEFAULT_SIZE+1];
+        char* const secret = secretBuffer + 1;  /* intentional unalignment */
+        XXH3_generateSecret_fromSeed(secret, seed);
+        {   XSUM_U64 const Dresult = XXH3_64bits_withSecretandSeed(data, len, secret, XXH3_SECRET_DEFAULT_SIZE, seed);
+            XSUM_checkResult64(Dresult, Nresult);
+    }   }
+
     /* streaming API test */
     {   XXH3_state_t* const state = XXH3_createState();
         assert(state != NULL);
diff --git a/xxhash.h b/xxhash.h
index ac2e2408..1af5961f 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1141,6 +1141,42 @@ struct XXH3_state_s {
 XXH_PUBLIC_API void XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSeedSize);
 
 
+/*
+ * XXH3_generateSecret_fromSeed():
+ *
+ * Generate the same secret as the one built from @seed when using _withSeed() variants.
+ *
+ * The resulting secret has a length XXH3_SECRET_DEFAULT_SIZE (necessarily).
+ * @secretBuffer must be already allocated, of size at least XXH3_SECRET_DEFAULT_SIZE bytes.
+ *
+ * The generated secret can be used in combination with
+ *`*_withSecret()` and `_withSecretandSeed()` variants.
+ * This generator is notably useful for `_withSecretandSeed()`,
+ * as it makes this variant generate the same values as corresponding `_withSeed()` variant.
+ */
+XXH_PUBLIC_API void XXH3_generateSecret_fromSeed(void* secretBuffer, XXH64_hash_t seed);
+
+/*
+ * *_withSecretandSeed() :
+ * This variants generate hash values using either
+ * @seed for "short" keys (< XXH3_MIDSIZE_MAX = 240 bytes)
+ * or @secret for "large" keys (>= XXH3_MIDSIZE_MAX).
+ * This generally benefits speed, compared to `_withSeed()` or `_withSecret()`.
+ * `_withSeed()` has to generate the secret on the fly for "large" keys.
+ * It's fast, but can be perceptible for "not so large" keys < 1 KB.
+ * `_withSecret()` has to generate the masks on the fly for "small" keys,
+ * which require more instructions than _withSeed() variants.
+ * _withSecretandSeed variant therefore combines the best of both worlds.
+ * When @secret has been generated by XXH3_generateSecret_fromSeed(),
+ * this variant produces exactly the same results as `_withSeed()` variant,
+ * thus offering solely a speed benefit for "large" keys,
+ * since there is no need to regenerate the secret for every large key.
+ */
+XXH_PUBLIC_API XXH64_hash_t
+XXH3_64bits_withSecretandSeed(const void* data, size_t len,
+                              const void* secret, size_t secretSize,
+                              XXH64_hash_t seed);
+
 /* simple short-cut to pre-selected XXH3_128bits variant */
 XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t seed);
 
@@ -4527,6 +4563,14 @@ XXH3_64bits_withSeed(const void* input, size_t len, XXH64_hash_t seed)
     return XXH3_64bits_internal(input, len, seed, XXH3_kSecret, sizeof(XXH3_kSecret), XXH3_hashLong_64b_withSeed);
 }
 
+XXH_PUBLIC_API XXH64_hash_t
+XXH3_64bits_withSecretandSeed(const void* input, size_t len, const void* secret, size_t secretSize, XXH64_hash_t seed)
+{
+    if (len <= XXH3_MIDSIZE_MAX)
+        return XXH3_64bits_internal(input, len, seed, XXH3_kSecret, sizeof(XXH3_kSecret), NULL);
+    return XXH3_64bits_internal(input, len, seed, secret, secretSize, XXH3_hashLong_64b_withSecret);
+}
+
 
 /* ===   XXH3 streaming   === */
 
@@ -4837,52 +4881,6 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_digest (const XXH3_state_t* state)
 }
 
 
-#define XXH_MIN(x, y) (((x) > (y)) ? (y) : (x))
-
-/*! @ingroup xxh3_family */
-XXH_PUBLIC_API void
-XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSeedSize)
-{
-    XXH_ASSERT(secretBuffer != NULL);
-    if (customSeedSize == 0) {
-        memcpy(secretBuffer, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
-        return;
-    }
-    XXH_ASSERT(customSeed != NULL);
-
-    {   size_t const segmentSize = sizeof(XXH128_hash_t);
-        size_t const nbSegments = XXH_SECRET_DEFAULT_SIZE / segmentSize;
-        XXH128_canonical_t scrambler;
-        XXH64_hash_t seeds[12];
-        size_t segnb;
-        XXH_ASSERT(nbSegments == 12);
-        XXH_ASSERT(segmentSize * nbSegments == XXH_SECRET_DEFAULT_SIZE); /* exact multiple */
-        XXH128_canonicalFromHash(&scrambler, XXH128(customSeed, customSeedSize, 0));
-
-        /*
-        * Copy customSeed to seeds[], truncating or repeating as necessary.
-        */
-        {   size_t toFill = XXH_MIN(customSeedSize, sizeof(seeds));
-            size_t filled = toFill;
-            memcpy(seeds, customSeed, toFill);
-            while (filled < sizeof(seeds)) {
-                toFill = XXH_MIN(filled, sizeof(seeds) - filled);
-                memcpy((char*)seeds + filled, seeds, toFill);
-                filled += toFill;
-        }   }
-
-        /* generate secret */
-        memcpy(secretBuffer, &scrambler, sizeof(scrambler));
-        for (segnb=1; segnb < nbSegments; segnb++) {
-            size_t const segmentStart = segnb * segmentSize;
-            XXH128_canonical_t segment;
-            XXH128_canonicalFromHash(&segment,
-                XXH128(&scrambler, sizeof(scrambler), XXH_readLE64(seeds + segnb) + segnb) );
-            memcpy((char*)secretBuffer + segmentStart, &segment, sizeof(segment));
-    }   }
-}
-
-
 /* ==========================================
  * XXH3 128 bits (a.k.a XXH128)
  * ==========================================
@@ -5410,6 +5408,69 @@ XXH128_hashFromCanonical(const XXH128_canonical_t* src)
     return h;
 }
 
+
+
+/* ==========================================
+ * Secret generators
+ * ==========================================
+ */
+#define XXH_MIN(x, y) (((x) > (y)) ? (y) : (x))
+
+/*! @ingroup xxh3_family */
+XXH_PUBLIC_API void
+XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSeedSize)
+{
+    XXH_ASSERT(secretBuffer != NULL);
+    if (customSeedSize == 0) {
+        memcpy(secretBuffer, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
+        return;
+    }
+    XXH_ASSERT(customSeed != NULL);
+
+    {   size_t const segmentSize = sizeof(XXH128_hash_t);
+        size_t const nbSegments = XXH_SECRET_DEFAULT_SIZE / segmentSize;
+        XXH128_canonical_t scrambler;
+        XXH64_hash_t seeds[12];
+        size_t segnb;
+        XXH_ASSERT(nbSegments == 12);
+        XXH_ASSERT(segmentSize * nbSegments == XXH_SECRET_DEFAULT_SIZE); /* exact multiple */
+        XXH128_canonicalFromHash(&scrambler, XXH128(customSeed, customSeedSize, 0));
+
+        /*
+        * Copy customSeed to seeds[], truncating or repeating as necessary.
+        */
+        {   size_t toFill = XXH_MIN(customSeedSize, sizeof(seeds));
+            size_t filled = toFill;
+            memcpy(seeds, customSeed, toFill);
+            while (filled < sizeof(seeds)) {
+                toFill = XXH_MIN(filled, sizeof(seeds) - filled);
+                memcpy((char*)seeds + filled, seeds, toFill);
+                filled += toFill;
+        }   }
+
+        /* generate secret */
+        memcpy(secretBuffer, &scrambler, sizeof(scrambler));
+        for (segnb=1; segnb < nbSegments; segnb++) {
+            size_t const segmentStart = segnb * segmentSize;
+            XXH128_canonical_t segment;
+            XXH128_canonicalFromHash(&segment,
+                XXH128(&scrambler, sizeof(scrambler), XXH_readLE64(seeds + segnb) + segnb) );
+            memcpy((char*)secretBuffer + segmentStart, &segment, sizeof(segment));
+    }   }
+}
+
+/*! @ingroup xxh3_family */
+XXH_PUBLIC_API void
+XXH3_generateSecret_fromSeed(void* secretBuffer, XXH64_hash_t seed)
+{
+    XXH_ALIGN(XXH_SEC_ALIGN) xxh_u8 secret[XXH_SECRET_DEFAULT_SIZE];
+    XXH3_initCustomSecret(secret, seed);
+    XXH_ASSERT(secretBuffer != NULL);
+    memcpy(secretBuffer, secret, XXH_SECRET_DEFAULT_SIZE);
+}
+
+
+
 /* Pop our optimization override from above */
 #if XXH_VECTOR == XXH_AVX2 /* AVX2 */ \
   && defined(__GNUC__) && !defined(__clang__) /* GCC, not Clang */ \

From f1cef5b8a450a66874a916a7a8d11fe27e06e21e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Wed, 25 Aug 2021 17:38:20 -0700
Subject: [PATCH 128/187] fixed multi-include with XXH_NAMESPACE

---
 xxhash.h | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/xxhash.h b/xxhash.h
index 1af5961f..42cba795 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -284,6 +284,7 @@ extern "C" {
 #  define XXH3_64bits XXH_NAME2(XXH_NAMESPACE, XXH3_64bits)
 #  define XXH3_64bits_withSecret XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_withSecret)
 #  define XXH3_64bits_withSeed XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_withSeed)
+#  define XXH3_64bits_withSecretandSeed XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_withSecretandSeed)
 #  define XXH3_createState XXH_NAME2(XXH_NAMESPACE, XXH3_createState)
 #  define XXH3_freeState XXH_NAME2(XXH_NAMESPACE, XXH3_freeState)
 #  define XXH3_copyState XXH_NAME2(XXH_NAMESPACE, XXH3_copyState)
@@ -293,6 +294,7 @@ extern "C" {
 #  define XXH3_64bits_update XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_update)
 #  define XXH3_64bits_digest XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_digest)
 #  define XXH3_generateSecret XXH_NAME2(XXH_NAMESPACE, XXH3_generateSecret)
+#  define XXH3_generateSecret_fromSeed XXH_NAME2(XXH_NAMESPACE, XXH3_generateSecret_fromSeed)
 /* XXH3_128bits */
 #  define XXH128 XXH_NAME2(XXH_NAMESPACE, XXH128)
 #  define XXH3_128bits XXH_NAME2(XXH_NAMESPACE, XXH3_128bits)

From a4ecdedafb7872d7dd9d6a2aa1d84ae47803e069 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 26 Aug 2021 11:08:46 -0700
Subject: [PATCH 129/187] improve _withSecret() performance

for secrets of fixed size.
---
 xxhash.h | 15 ++++++++-------
 1 file changed, 8 insertions(+), 7 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 42cba795..b757c4a5 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4450,9 +4450,11 @@ XXH3_hashLong_64b_internal(const void* XXH_RESTRICT input, size_t len,
 }
 
 /*
- * It's important for performance that XXH3_hashLong is not inlined.
+ * It's important for performance to transmit secret's size (when it's static)
+ * so that the compiler can properly optimize the vectorized loop.
+ * This makes a big performance difference for "medium" keys (<1 KB) when using AVX instruction set.
  */
-XXH_NO_INLINE XXH64_hash_t
+XXH_FORCE_INLINE XXH64_hash_t
 XXH3_hashLong_64b_withSecret(const void* XXH_RESTRICT input, size_t len,
                              XXH64_hash_t seed64, const xxh_u8* XXH_RESTRICT secret, size_t secretLen)
 {
@@ -4461,11 +4463,10 @@ XXH3_hashLong_64b_withSecret(const void* XXH_RESTRICT input, size_t len,
 }
 
 /*
- * It's important for performance that XXH3_hashLong is not inlined.
- * Since the function is not inlined, the compiler may not be able to understand that,
- * in some scenarios, its `secret` argument is actually a compile time constant.
- * This variant enforces that the compiler can detect that,
- * and uses this opportunity to streamline the generated code for better performance.
+ * It's preferable for performance that XXH3_hashLong is not inlined,
+ * as it results in a smaller function for small data, easier to the instruction cache.
+ * Note that inside this no_inline function, we do inline the internal loop,
+ * and provide a statically defined secret size to allow optimization of vector loop.
  */
 XXH_NO_INLINE XXH64_hash_t
 XXH3_hashLong_64b_default(const void* XXH_RESTRICT input, size_t len,

From dfbd52684fc0a12d8e237a046a10c813bdb4ad5c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 26 Aug 2021 11:47:41 -0700
Subject: [PATCH 130/187] added XXH3_128bits_withSecretandSeed() variant

---
 cli/xsum_sanity_check.c | 10 ++++++++++
 xxhash.h                | 26 ++++++++++++++++++++++----
 2 files changed, 32 insertions(+), 4 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 27a2337d..34e4e516 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -476,6 +476,16 @@ static void XSUM_testXXH128(const void* data, const XSUM_testdata128_t* testData
         XSUM_checkResult128(Dresult, Nresult);
     }
 
+    /* check that the combination of
+     * XXH3_generateSecret_fromSeed() and XXH3_128bits_withSecretandSeed()
+     * results in exactly the same hash generation as XXH3_64bits_withSeed() */
+    {   char secretBuffer[XXH3_SECRET_DEFAULT_SIZE+1];
+        char* const secret = secretBuffer + 1;  /* intentional unalignment */
+        XXH3_generateSecret_fromSeed(secret, seed);
+        {   XXH128_hash_t const Dresult = XXH3_128bits_withSecretandSeed(data, len, secret, XXH3_SECRET_DEFAULT_SIZE, seed);
+            XSUM_checkResult128(Dresult, Nresult);
+    }   }
+
     /* streaming API test */
     {   XXH3_state_t *state = XXH3_createState();
         assert(state != NULL);
diff --git a/xxhash.h b/xxhash.h
index b757c4a5..3a544467 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -300,6 +300,7 @@ extern "C" {
 #  define XXH3_128bits XXH_NAME2(XXH_NAMESPACE, XXH3_128bits)
 #  define XXH3_128bits_withSeed XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_withSeed)
 #  define XXH3_128bits_withSecret XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_withSecret)
+#  define XXH3_128bits_withSecretandSeed XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_withSecretandSeed)
 #  define XXH3_128bits_reset XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset)
 #  define XXH3_128bits_reset_withSeed XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset_withSeed)
 #  define XXH3_128bits_reset_withSecret XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset_withSecret)
@@ -1179,7 +1180,14 @@ XXH3_64bits_withSecretandSeed(const void* data, size_t len,
                               const void* secret, size_t secretSize,
                               XXH64_hash_t seed);
 
-/* simple short-cut to pre-selected XXH3_128bits variant */
+XXH_PUBLIC_API XXH128_hash_t
+XXH3_128bits_withSecretandSeed(const void* data, size_t len,
+                              const void* secret, size_t secretSize,
+                              XXH64_hash_t seed64);
+
+/* XXH128() :
+ * simple alias to pre-selected XXH3_128bits variant
+ */
 XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t seed);
 
 
@@ -4571,7 +4579,7 @@ XXH3_64bits_withSecretandSeed(const void* input, size_t len, const void* secret,
 {
     if (len <= XXH3_MIDSIZE_MAX)
         return XXH3_64bits_internal(input, len, seed, XXH3_kSecret, sizeof(XXH3_kSecret), NULL);
-    return XXH3_64bits_internal(input, len, seed, secret, secretSize, XXH3_hashLong_64b_withSecret);
+    return XXH3_hashLong_64b_withSecret(input, len, seed, secret, secretSize);
 }
 
 
@@ -5184,9 +5192,10 @@ XXH3_hashLong_128b_default(const void* XXH_RESTRICT input, size_t len,
 }
 
 /*
- * It's important for performance that XXH3_hashLong is not inlined.
+ * It's important for performance to pass @secretLen (when it's static)
+ * to the compiler, so that it can properly optimize the vectorized loop.
  */
-XXH_NO_INLINE XXH128_hash_t
+XXH_FORCE_INLINE XXH128_hash_t
 XXH3_hashLong_128b_withSecret(const void* XXH_RESTRICT input, size_t len,
                               XXH64_hash_t seed64,
                               const void* XXH_RESTRICT secret, size_t secretLen)
@@ -5279,6 +5288,15 @@ XXH3_128bits_withSeed(const void* input, size_t len, XXH64_hash_t seed)
                                  XXH3_hashLong_128b_withSeed);
 }
 
+/*! @ingroup xxh3_family */
+XXH_PUBLIC_API XXH128_hash_t
+XXH3_128bits_withSecretandSeed(const void* input, size_t len, const void* secret, size_t secretSize, XXH64_hash_t seed)
+{
+    if (len <= XXH3_MIDSIZE_MAX)
+        return XXH3_128bits_internal(input, len, seed, XXH3_kSecret, sizeof(XXH3_kSecret), NULL);
+    return XXH3_hashLong_128b_withSecret(input, len, seed, secret, secretSize);
+}
+
 /*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH128_hash_t
 XXH128(const void* input, size_t len, XXH64_hash_t seed)

From ad7824dc7f6ff8cee6bad5f79aea7ff7df9a6741 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 26 Aug 2021 12:00:20 -0700
Subject: [PATCH 131/187] fix minor c++ type casting issue

---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 3a544467..bbfb74f2 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4579,7 +4579,7 @@ XXH3_64bits_withSecretandSeed(const void* input, size_t len, const void* secret,
 {
     if (len <= XXH3_MIDSIZE_MAX)
         return XXH3_64bits_internal(input, len, seed, XXH3_kSecret, sizeof(XXH3_kSecret), NULL);
-    return XXH3_hashLong_64b_withSecret(input, len, seed, secret, secretSize);
+    return XXH3_hashLong_64b_withSecret(input, len, seed, (const xxh_u8*)secret, secretSize);
 }
 
 

From 7aca7804c3eeefd76d1ee2afa2bc19ee439b3f40 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 07:31:14 +0900
Subject: [PATCH 132/187] Add standard static_assert()

This change set supports standard static_assert() if available.

Also definition of non-standard static assertion is changed for MSVC.
As for MSVC, see #567 for details.
https://github.com/Cyan4973/xxHash/issues/567
---
 xxhash.h | 12 +++++++++++-
 1 file changed, 11 insertions(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index ac2e2408..2905d3e7 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1494,7 +1494,17 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 #endif
 
 /* note: use after variable declarations */
-#define XXH_STATIC_ASSERT(c)  do { enum { XXH_sa = 1/(int)(!!(c)) }; } while (0)
+#ifndef XXH_STATIC_ASSERT
+#  if defined(__STDC_VERSION__) && (__STDC_VERSION__ >= 201112L)    /* C11 */
+#    include <assert.h>
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) static_assert((c),m)
+#  elif defined(__cplusplus) && (__cplusplus >= 201103L)            /* C++11 */
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) static_assert((c),m)
+#  else
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { struct XXH_sa { char xxh_static_assert[(c) ? 1 : -1]; }; } while (0)
+#  endif
+#  define XXH_STATIC_ASSERT(c) XXH_STATIC_ASSERT_WITH_MESSAGE((c),"XXH_STATIC_ASSERT")
+#endif
 
 /*!
  * @internal

From 5537b9525bb9c31f0253d9fd06ff70466b954a5f Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 08:28:19 +0900
Subject: [PATCH 133/187] Fix for -Wdeclaration-after-statement

This change supports  C11 + -Wdeclaration-after-statement.
---
 xxhash.h | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 2905d3e7..76e76bfc 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1497,13 +1497,13 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 #ifndef XXH_STATIC_ASSERT
 #  if defined(__STDC_VERSION__) && (__STDC_VERSION__ >= 201112L)    /* C11 */
 #    include <assert.h>
-#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) static_assert((c),m)
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { static_assert((c),m); } while(0)
 #  elif defined(__cplusplus) && (__cplusplus >= 201103L)            /* C++11 */
-#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) static_assert((c),m)
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { static_assert((c),m); } while(0)
 #  else
-#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { struct XXH_sa { char xxh_static_assert[(c) ? 1 : -1]; }; } while (0)
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { char XXH_sa[(c) ? 1 : -1]; (void) XXH_sa; } while(0)
 #  endif
-#  define XXH_STATIC_ASSERT(c) XXH_STATIC_ASSERT_WITH_MESSAGE((c),"XXH_STATIC_ASSERT")
+#  define XXH_STATIC_ASSERT(c) XXH_STATIC_ASSERT_WITH_MESSAGE(c,#c)
 #endif
 
 /*!

From 96a2ca14cd900a8f3910e9fe810605885cb8adb0 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 09:06:55 +0900
Subject: [PATCH 134/187] Fix XXH_STATIC_ASSERT to avoid possible stack
 allocation

This change avoids possible stack allocation by array in local scope.
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 76e76bfc..e69d25aa 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1501,7 +1501,7 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 #  elif defined(__cplusplus) && (__cplusplus >= 201103L)            /* C++11 */
 #    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { static_assert((c),m); } while(0)
 #  else
-#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { char XXH_sa[(c) ? 1 : -1]; (void) XXH_sa; } while(0)
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { void xxh_sa(int x[1 - 2 * ((c) ? 0 : 1)]); (void) xxh_sa; } while(0)
 #  endif
 #  define XXH_STATIC_ASSERT(c) XXH_STATIC_ASSERT_WITH_MESSAGE(c,#c)
 #endif

From d5ce7ed289a6b0a7ec2c6edd5e9d523bce3f657e Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 09:44:30 +0900
Subject: [PATCH 135/187] Redefine XXH_STATIC_ASSERT to pass gcc's
 -Wredundant-decls

---
 xxhash.h | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index e69d25aa..ceaac0e2 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1501,9 +1501,9 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 #  elif defined(__cplusplus) && (__cplusplus >= 201103L)            /* C++11 */
 #    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { static_assert((c),m); } while(0)
 #  else
-#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { void xxh_sa(int x[1 - 2 * ((c) ? 0 : 1)]); (void) xxh_sa; } while(0)
+#    define XXH_STATIC_ASSERT_WITH_MESSAGE(c,m) do { struct xxh_sa { char x[(c) ? 1 : -1]; }; } while(0)
 #  endif
-#  define XXH_STATIC_ASSERT(c) XXH_STATIC_ASSERT_WITH_MESSAGE(c,#c)
+#  define XXH_STATIC_ASSERT(c) XXH_STATIC_ASSERT_WITH_MESSAGE((c),#c)
 #endif
 
 /*!

From 79c55d5bfb2a51ab3e15c3f4566010fe4a820acc Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 14:06:33 +0900
Subject: [PATCH 136/187] Add make usan to GA

---
 .github/workflows/ci.yml | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 37ffcf24..df0065d8 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -224,6 +224,10 @@ jobs:
       run: |
         make clean test-mem
 
+    - name: usan
+      run: |
+        make clean usan
+
 
   ubuntu-cmake-unofficial:
     name: Linux x64 cmake unofficial build test

From b3c51c736f5eae667753ea966920e6e401ac8d7c Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 14:38:17 +0900
Subject: [PATCH 137/187] Replace Unicode character U+2018 with 7bit-ASCII
 (U+0027)

---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index ceaac0e2..6e7ef053 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3769,7 +3769,7 @@ XXH3_initCustomSecret_avx512(void* XXH_RESTRICT customSecret, xxh_u64 seed64)
         XXH_ASSERT(((size_t)dest & 63) == 0);
         for (i=0; i < nbRounds; ++i) {
             /* GCC has a bug, _mm512_stream_load_si512 accepts 'void*', not 'void const*',
-             * this will warn "discards ‘const’ qualifier". */
+             * this will warn "discards 'const' qualifier". */
             union {
                 const __m512i* cp;
                 void* p;

From eeb1cde87fe9ccac79b54cf9654ac8324fc3262a Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 14:35:12 +0900
Subject: [PATCH 138/187] Import unicode_lint.sh from lz4/lz4#1020

unicode_lint.sh is originally written by @servusdei2018
---
 Makefile              |  3 +++
 tests/unicode_lint.sh | 43 +++++++++++++++++++++++++++++++++++++++++++
 2 files changed, 46 insertions(+)
 create mode 100644 tests/unicode_lint.sh

diff --git a/Makefile b/Makefile
index 5a37fbd3..da6e0818 100644
--- a/Makefile
+++ b/Makefile
@@ -397,6 +397,9 @@ listL120:  # extract lines >= 120 characters in *.{c,h}, by Takayuki Matsuoka (n
 trailingWhitespace:
 	! $(GREP) -E "`printf '[ \\t]$$'`" cli/*.{c,h,1} *.c *.h LICENSE Makefile cmake_unofficial/CMakeLists.txt
 
+.PHONY: lint-unicode
+lint-unicode:
+	./tests/unicode_lint.sh
 
 # =========================================================
 # make install is validated only for the following targets
diff --git a/tests/unicode_lint.sh b/tests/unicode_lint.sh
new file mode 100644
index 00000000..2e68f586
--- /dev/null
+++ b/tests/unicode_lint.sh
@@ -0,0 +1,43 @@
+#!/bin/bash
+
+# `unicode_lint.sh' determines whether source files under ${dirs} directories
+# contain Unicode characters, and fails if any do.
+#
+# We don't recommend to call this script directly.
+# Instead of it, use `make lint-unicode` via root directory Makefile.
+
+# ${dirs} : target directories
+dirs=(./ ./cli ./tests ./tests/bench ./tests/collisions)
+
+SCRIPT_DIR="`dirname "${BASH_SOURCE[0]}"`"
+cd ${SCRIPT_DIR}/..
+
+echo "Ensure no unicode character is present in source files *.{c,h}"
+pass=true
+
+# Scan each directory in ${dirs} for Unicode in source (*.c, *.h) files
+i=0
+while [ $i -lt ${#dirs[@]} ]
+do
+  dir=${dirs[$i]}
+  echo dir=$dir
+  result=$(
+    find ${dir} -regex '.*\.\(c\|h\)$' -exec grep -P -n "[^\x00-\x7F]" {} \; -exec echo "{}: FAIL" \;
+  )
+  if [[ $result ]]; then
+    echo "$result"
+    pass=false
+  fi
+  i=`expr $i + 1`
+done
+
+
+# Result
+if [ "$pass" = true ]; then
+  echo "All tests successful: no unicode character detected"
+  echo "Result: PASS"
+  exit 0
+else
+  echo "Result: FAIL"
+  exit 1
+fi

From 5616f1eb140eb135ea572ae3b441a05659f0fa10 Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Fri, 27 Aug 2021 15:27:48 +0900
Subject: [PATCH 139/187] Add executable permission

---
 tests/unicode_lint.sh | 0
 1 file changed, 0 insertions(+), 0 deletions(-)
 mode change 100644 => 100755 tests/unicode_lint.sh

diff --git a/tests/unicode_lint.sh b/tests/unicode_lint.sh
old mode 100644
new mode 100755

From c9312558513e0433680c60cb26cee5003924470a Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Sat, 28 Aug 2021 03:25:34 +0900
Subject: [PATCH 140/187] Add make lint-unicode test to GA

---
 .github/workflows/ci.yml | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index df0065d8..ed84dbd1 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -228,6 +228,10 @@ jobs:
       run: |
         make clean usan
 
+    - name: Lint Unicode in root-dir, cli/, tests/, tests/bench/, tests/collisions/.
+      run: |
+        make lint-unicode
+
 
   ubuntu-cmake-unofficial:
     name: Linux x64 cmake unofficial build test

From 11126e38d9bb00ee1de3b2a581c386f7a02da06b Mon Sep 17 00:00:00 2001
From: Takayuki Matsuoka <takayuki.matsuoka@gmail.com>
Date: Wed, 1 Sep 2021 09:31:22 +0900
Subject: [PATCH 141/187] Remove unnecessary XXH_ALIGN() from pointer
 declaration

We still have assert() for each pointer value.
See also #559 and #569.
---
 xxhash.h | 18 +++++++++---------
 1 file changed, 9 insertions(+), 9 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 6e7ef053..485d4662 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3684,7 +3684,7 @@ XXH3_accumulate_512_avx512(void* XXH_RESTRICT acc,
                      const void* XXH_RESTRICT input,
                      const void* XXH_RESTRICT secret)
 {
-    XXH_ALIGN(64) __m512i* const xacc = (__m512i *) acc;
+    __m512i* const xacc = (__m512i *) acc;
     XXH_ASSERT((((size_t)acc) & 63) == 0);
     XXH_STATIC_ASSERT(XXH_STRIPE_LEN == sizeof(__m512i));
 
@@ -3733,7 +3733,7 @@ XXH3_scrambleAcc_avx512(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
 {
     XXH_ASSERT((((size_t)acc) & 63) == 0);
     XXH_STATIC_ASSERT(XXH_STRIPE_LEN == sizeof(__m512i));
-    {   XXH_ALIGN(64) __m512i* const xacc = (__m512i*) acc;
+    {   __m512i* const xacc = (__m512i*) acc;
         const __m512i prime32 = _mm512_set1_epi32((int)XXH_PRIME32_1);
 
         /* xacc[0] ^= (xacc[0] >> 47) */
@@ -3794,7 +3794,7 @@ XXH3_accumulate_512_avx2( void* XXH_RESTRICT acc,
                     const void* XXH_RESTRICT secret)
 {
     XXH_ASSERT((((size_t)acc) & 31) == 0);
-    {   XXH_ALIGN(32) __m256i* const xacc    =       (__m256i *) acc;
+    {   __m256i* const xacc    =       (__m256i *) acc;
         /* Unaligned. This is mainly for pointer arithmetic, and because
          * _mm256_loadu_si256 requires  a const __m256i * pointer for some reason. */
         const         __m256i* const xinput  = (const __m256i *) input;
@@ -3826,7 +3826,7 @@ XXH_FORCE_INLINE XXH_TARGET_AVX2 void
 XXH3_scrambleAcc_avx2(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
 {
     XXH_ASSERT((((size_t)acc) & 31) == 0);
-    {   XXH_ALIGN(32) __m256i* const xacc = (__m256i*) acc;
+    {   __m256i* const xacc = (__m256i*) acc;
         /* Unaligned. This is mainly for pointer arithmetic, and because
          * _mm256_loadu_si256 requires a const __m256i * pointer for some reason. */
         const         __m256i* const xsecret = (const __m256i *) secret;
@@ -3900,7 +3900,7 @@ XXH3_accumulate_512_sse2( void* XXH_RESTRICT acc,
 {
     /* SSE2 is just a half-scale version of the AVX2 version. */
     XXH_ASSERT((((size_t)acc) & 15) == 0);
-    {   XXH_ALIGN(16) __m128i* const xacc    =       (__m128i *) acc;
+    {   __m128i* const xacc    =       (__m128i *) acc;
         /* Unaligned. This is mainly for pointer arithmetic, and because
          * _mm_loadu_si128 requires a const __m128i * pointer for some reason. */
         const         __m128i* const xinput  = (const __m128i *) input;
@@ -3932,7 +3932,7 @@ XXH_FORCE_INLINE XXH_TARGET_SSE2 void
 XXH3_scrambleAcc_sse2(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
 {
     XXH_ASSERT((((size_t)acc) & 15) == 0);
-    {   XXH_ALIGN(16) __m128i* const xacc = (__m128i*) acc;
+    {   __m128i* const xacc = (__m128i*) acc;
         /* Unaligned. This is mainly for pointer arithmetic, and because
          * _mm_loadu_si128 requires a const __m128i * pointer for some reason. */
         const         __m128i* const xsecret = (const __m128i *) secret;
@@ -4001,7 +4001,7 @@ XXH3_accumulate_512_neon( void* XXH_RESTRICT acc,
 {
     XXH_ASSERT((((size_t)acc) & 15) == 0);
     {
-        XXH_ALIGN(16) uint64x2_t* const xacc = (uint64x2_t *) acc;
+        uint64x2_t* const xacc = (uint64x2_t *) acc;
         /* We don't use a uint32x4_t pointer because it causes bus errors on ARMv7. */
         uint8_t const* const xinput = (const uint8_t *) input;
         uint8_t const* const xsecret  = (const uint8_t *) secret;
@@ -4158,7 +4158,7 @@ XXH3_accumulate_512_scalar(void* XXH_RESTRICT acc,
                      const void* XXH_RESTRICT input,
                      const void* XXH_RESTRICT secret)
 {
-    XXH_ALIGN(XXH_ACC_ALIGN) xxh_u64* const xacc = (xxh_u64*) acc; /* presumed aligned */
+    xxh_u64* const xacc = (xxh_u64*) acc; /* presumed aligned */
     const xxh_u8* const xinput  = (const xxh_u8*) input;  /* no alignment restriction */
     const xxh_u8* const xsecret = (const xxh_u8*) secret;   /* no alignment restriction */
     size_t i;
@@ -4174,7 +4174,7 @@ XXH3_accumulate_512_scalar(void* XXH_RESTRICT acc,
 XXH_FORCE_INLINE void
 XXH3_scrambleAcc_scalar(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
 {
-    XXH_ALIGN(XXH_ACC_ALIGN) xxh_u64* const xacc = (xxh_u64*) acc;   /* presumed aligned */
+    xxh_u64* const xacc = (xxh_u64*) acc;   /* presumed aligned */
     const xxh_u8* const xsecret = (const xxh_u8*) secret;   /* no alignment restriction */
     size_t i;
     XXH_ASSERT((((size_t)acc) & (XXH_ACC_ALIGN-1)) == 0);

From 92c4b4f2b1d2a4b0f6f467ae542d6ec6b26b52d1 Mon Sep 17 00:00:00 2001
From: Nick Terrell <terrelln@fb.com>
Date: Thu, 23 Sep 2021 10:42:21 -0700
Subject: [PATCH 142/187] Simplify XXH_FALLTHROUGH macro detection

Simplify macro detection using `__has_attribute()`, and clean up the
usage of `__has_c_attribute()` and `__has_cpp_attribute()`.
---
 xxhash.h | 51 +++++++++++++++++++++++++++------------------------
 1 file changed, 27 insertions(+), 24 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 485d4662..3bd125cb 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -594,36 +594,39 @@ XXH_PUBLIC_API void XXH32_canonicalFromHash(XXH32_canonical_t* dst, XXH32_hash_t
 XXH_PUBLIC_API XXH32_hash_t XXH32_hashFromCanonical(const XXH32_canonical_t* src);
 
 
+#ifdef __has_attribute
+# define XXH_HAS_ATTRIBUTE(x) __has_attribute(x)
+#else
+# define XXH_HAS_ATTRIBUTE(x) 0
+#endif
+
+/* C-language Attributes are added in C23. */
+#if defined(__STDC_VERSION__) && (__STDC_VERSION__ > 201710L) && defined(__has_c_attribute)
+# define XXH_HAS_C_ATTRIBUTE(x) __has_c_attribute(x)
+#else
+# define XXH_HAS_C_ATTRIBUTE(x) 0
+#endif
+
+#if defined(__cplusplus) && defined(__has_cpp_attribute)
+# define XXH_HAS_CPP_ATTRIBUTE(x) __has_cpp_attribute(x)
+#else
+# define XXH_HAS_CPP_ATTRIBUTE(x) 0
+#endif
+
 /*
 Define XXH_FALLTHROUGH macro for annotating switch case with the 'fallthrough' attribute
 introduced in CPP17 and C23.
 CPP17 : https://en.cppreference.com/w/cpp/language/attributes/fallthrough
 C23   : https://en.cppreference.com/w/c/language/attributes/fallthrough
 */
-
-#if defined (__has_c_attribute) && defined (__STDC_VERSION__) && (__STDC_VERSION__ > 201710L) /* C2x */
-#   if __has_c_attribute(fallthrough)
-#       define XXH_FALLTHROUGH [[fallthrough]]
-#   endif
-
-#elif defined(__cplusplus) && defined(__has_cpp_attribute)
-#   if __has_cpp_attribute(fallthrough)
-#       define XXH_FALLTHROUGH [[fallthrough]]
-#   endif
-#endif
-
-#ifndef XXH_FALLTHROUGH
-#   if defined(__GNUC__) && __GNUC__ >= 7
-#       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
-#   elif defined(__clang__) && (__clang_major__ >= 10) \
-     && (!defined(__APPLE__) || (__clang_major__ >= 12))
-     /* Apple clang 12 is effectively clang-10 ,
-      * see https://en.wikipedia.org/wiki/Xcode for details
-      */
-#       define XXH_FALLTHROUGH __attribute__ ((fallthrough))
-#   else
-#       define XXH_FALLTHROUGH
-#	endif
+#if XXH_HAS_C_ATTRIBUTE(x)
+# define XXH_FALLTHROUGH [[fallthrough]]
+#elif XXH_HAS_CPP_ATTRIBUTE(x)
+# define XXH_FALLTHROUGH [[fallthrough]]
+#elif XXH_HAS_ATTRIBUTE(__fallthrough__)
+# define XXH_FALLTHROUGH __attribute__ ((fallthrough))
+#else
+# define XXH_FALLTHROUGH
 #endif
 
 /*!

From 15abfc22c7d3c3c94e1da9858c8818531f038006 Mon Sep 17 00:00:00 2001
From: Leonard Hecker <leonard@hecker.io>
Date: Thu, 28 Oct 2021 16:34:34 +0200
Subject: [PATCH 143/187] Improve XXH_mult64to128 performance on MSVC ARM64

This commit specializes XXH_mult64to128 to use `__umulh`
on ARM64/aarch64 when compiled with MSVC.
---
 xxhash.h | 15 +++++++++++++++
 1 file changed, 15 insertions(+)

diff --git a/xxhash.h b/xxhash.h
index 3bd125cb..fa8fc14d 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -3265,6 +3265,21 @@ XXH_mult64to128(xxh_u64 lhs, xxh_u64 rhs)
     r128.high64 = product_high;
     return r128;
 
+    /*
+     * MSVC for ARM64's __umulh method.
+     *
+     * This compiles to the same MUL + UMULH as GCC/Clang's __uint128_t method.
+     */
+#elif defined(_M_ARM64)
+
+#ifndef _MSC_VER
+#   pragma intrinsic(__umulh)
+#endif
+    XXH128_hash_t r128;
+    r128.low64  = lhs * rhs;
+    r128.high64 = __umulh(lhs, rhs);
+    return r128;
+
 #else
     /*
      * Portable scalar method. Optimized for 32-bit and 64-bit ALUs.

From b802f53487653c9c1bb264adece3dbd2685bf5c9 Mon Sep 17 00:00:00 2001
From: Dimitris Apostolou <dimitris.apostolou@icloud.com>
Date: Sat, 13 Nov 2021 11:02:14 +0200
Subject: [PATCH 144/187] Fix typos

---
 README.md               | 4 ++--
 cli/xsum_sanity_check.c | 2 +-
 tests/collisions/main.c | 2 +-
 xxhash.h                | 2 +-
 4 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/README.md b/README.md
index 18704f53..9281ea31 100644
--- a/README.md
+++ b/README.md
@@ -111,7 +111,7 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                            This option is automatically disabled on `x86`, `x64` and `aarch64`, and enabled on all other platforms.
 - `XXH_VECTOR` : manually select a vector instruction set (default: auto-selected at compilation time). Available instruction sets are `XXH_SCALAR`, `XXH_SSE2`, `XXH_AVX2`, `XXH_AVX512`, `XXH_NEON` and `XXH_VSX`. Compiler may require additional flags to ensure proper support (for example, `gcc` on linux will require `-mavx2` for AVX2, and `-mavx512f` for AVX512).
 - `XXH_NO_PREFETCH` : disable prefetching. XXH3 only.
-- `XXH_PREFETCH_DIST` : select prefecting distance. XXH3 only.
+- `XXH_PREFETCH_DIST` : select prefetching distance. XXH3 only.
 - `XXH_NO_INLINE_HINTS`: By default, xxHash uses `__attribute__((always_inline))` and `__forceinline` to improve performance at the cost of code size.
                          Defining this macro to 1 will mark all internal functions as `static`, allowing the compiler to decide whether to inline a function or not.
                          This is very useful when optimizing for smallest binary size,
@@ -130,7 +130,7 @@ The following macros can be set at compilation time to modify libxxhash's behavi
 - `XXH_NO_LONG_LONG`: removes compilation of algorithms relying on 64-bit types (XXH3 and XXH64). Only XXH32 will be compiled.
                       Useful for targets (architectures and compilers) without 64-bit support.
 - `XXH_IMPORT`: MSVC specific: should only be defined for dynamic linking, as it prevents linkage errors.
-- `XXH_CPU_LITTLE_ENDIAN`: By default, endianess is determined by a runtime test resolved at compile time.
+- `XXH_CPU_LITTLE_ENDIAN`: By default, endianness is determined by a runtime test resolved at compile time.
                            If, for some reason, the compiler cannot simplify the runtime test, it can cost performance.
                            It's possible to skip auto-detection and simply state that the architecture is little-endian by setting this macro to 1.
                            Setting it to 0 states big-endian.
diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 347d1db5..92d937f1 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -60,7 +60,7 @@ XSUM_API void XSUM_fillTestBuffer(XSUM_U8* buffer, size_t len)
 
 /* ************************************************
  * Self-test:
- * ensure results consistency accross platforms
+ * ensure results consistency across platforms
  *********************************************** */
 #if XSUM_NO_TESTS
 XSUM_API void XSUM_sanityCheck(void)
diff --git a/tests/collisions/main.c b/tests/collisions/main.c
index 1f9318d8..08feb6ba 100644
--- a/tests/collisions/main.c
+++ b/tests/collisions/main.c
@@ -800,7 +800,7 @@ static size_t search_collisions(
         for (int nbHBits = 1; nbHBits < hashBits; nbHBits++) {
             uint64_t const nbSlots = (uint64_t)1 << nbHBits;
             double const expectedCollisions = estimateNbCollisions(nbCandidates, nbHBits);
-            if ( (nbSlots > nbCandidates * 100)  /* within range for meaningfull collision analysis results */
+            if ( (nbSlots > nbCandidates * 100)  /* within range for meaningful collision analysis results */
               && (expectedCollisions > 18.0) ) {
                 int const rShift = hashBits - nbHBits;
                 size_t HBits_collisions = 0;
diff --git a/xxhash.h b/xxhash.h
index fa8fc14d..247a8449 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -324,7 +324,7 @@ extern "C" {
  * This is only useful when xxHash is compiled as a shared library, as it is
  * independent of the version defined in the header.
  *
- * @return `XXH_VERSION_NUMBER` as of when the libray was compiled.
+ * @return `XXH_VERSION_NUMBER` as of when the library was compiled.
  */
 XXH_PUBLIC_API unsigned XXH_versionNumber (void);
 

From 388e66467c9bb9b4545efd708b48d99b0a8cd25b Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?=D7=A8=D7=98=D7=95=20=E2=80=A2=20Reto?=
 <retokromer@users.noreply.github.com>
Date: Sat, 13 Nov 2021 11:13:26 +0100
Subject: [PATCH 145/187] nit: use always US English

---
 doc/xxhash_spec.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/doc/xxhash_spec.md b/doc/xxhash_spec.md
index 1befd713..80576d6b 100644
--- a/doc/xxhash_spec.md
+++ b/doc/xxhash_spec.md
@@ -198,7 +198,7 @@ The algorithm uses 64-bit addition, multiplication, rotate, shift and xor operat
 
 These constants are prime numbers, and feature a good mix of bits 1 and 0, neither too regular, nor too dissymmetric. These properties help dispersion capabilities.
 
-### Step 1. Initialise internal accumulators
+### Step 1. Initialize internal accumulators
 
 Each accumulator gets an initial value based on optional `seed` input. Since the `seed` is optional, it can be `0`.
 

From 7631115f2d6115e95713f2a54d6b592d969945ba Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Fri, 12 Nov 2021 22:54:24 +1100
Subject: [PATCH 146/187] [VSX] Fix issues on older compilers

Fix some issues with older GCC and Clang versions, accept IBM XL

Needs some BE testing, but works with the following on a PPC64LE POWER8:
 - GCC 4.8
 - Clang 10
 - GCC 9
 - IBM XL 13
 - IBM XL 16

Note that IBM XL 13 is Yet Another Clang Fork.

Compile error fixes:
 - Remove builtins for vmuleuw and vmulouw (at least for now). They seem
   to have issues on older GCC and Clang/XL versions
Fixes for -O3 on old compilers:
 - Make the inline asm volatile
 - Remove most strict aliasing violations
   - Old GCC/Clang treat the cast to u64x2 as a strict aliasing
     violation and "optimize" it.
   - Access xacc with vec_xl and vec_xst, cast to unsigned long long and
     const unsigned char pointers
     - Switch to xxh_u64 typedefs since xxh_u64 might be unsigned long?
   - XXX: Should we do this for all SIMD paths, and get rid of void pointer
     hell?
     - Scalar works on normal types
     - NEON and VSX have loads and stores for specific types and can
       handle xxh_u64 and xxh_u8
     - SSE2 is literally impossible without casting pointers, but we can
       do a "only cast when calling intrinsics" thing.
       - Accessing them only through _mm_load/_mm_store is safe AFAICT.
     - Makes it more clear which types are native and which types are
       raw unaligned data.
     - Removes multiple casts

mpe: Rebase onto mainline, squash fixes from original pull request:
     https://github.com/Cyan4973/xxHash/pull/433

     This fixes test failures seen when compiling with GCC 10. Tested on
     Power8 & Power9, ppc64le, with GCC9 & 10. And Power8 & Power9,
     ppc64, with GCC 10.
---
 xxhash.h | 63 +++++++++++++++++++++++++++++---------------------------
 1 file changed, 33 insertions(+), 30 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 247a8449..78bb9157 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2855,7 +2855,7 @@ enum XXH_VECTOR_TYPE /* fake enum */ {
 #    define XXH_VECTOR XXH_NEON
 #  elif (defined(__PPC64__) && defined(__POWER8_VECTOR__)) \
      || (defined(__s390x__) && defined(__VEC__)) \
-     && defined(__GNUC__) /* TODO: IBM XL */
+     && defined(__GNUC__)
 #    define XXH_VECTOR XXH_VSX
 #  else
 #    define XXH_VECTOR XXH_SCALAR
@@ -3103,23 +3103,19 @@ XXH_FORCE_INLINE xxh_u64x2 XXH_vec_loadu(const void *ptr)
  /* s390x is always big endian, no issue on this platform */
 #  define XXH_vec_mulo vec_mulo
 #  define XXH_vec_mule vec_mule
-# elif defined(__clang__) && XXH_HAS_BUILTIN(__builtin_altivec_vmuleuw)
-/* Clang has a better way to control this, we can just use the builtin which doesn't swap. */
-#  define XXH_vec_mulo __builtin_altivec_vmulouw
-#  define XXH_vec_mule __builtin_altivec_vmuleuw
 # else
-/* gcc needs inline assembly */
+/* GCC needs inline assembly */
 /* Adapted from https://github.com/google/highwayhash/blob/master/highwayhash/hh_vsx.h. */
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_mulo(xxh_u32x4 a, xxh_u32x4 b)
 {
     xxh_u64x2 result;
-    __asm__("vmulouw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
+    __asm__ __volatile__("vmulouw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
     return result;
 }
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_mule(xxh_u32x4 a, xxh_u32x4 b)
 {
     xxh_u64x2 result;
-    __asm__("vmuleuw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
+    __asm__ __volatile__("vmuleuw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
     return result;
 }
 # endif /* XXH_vec_mulo, XXH_vec_mule */
@@ -4111,59 +4107,66 @@ XXH3_accumulate_512_vsx(  void* XXH_RESTRICT acc,
                     const void* XXH_RESTRICT input,
                     const void* XXH_RESTRICT secret)
 {
-          xxh_u64x2* const xacc     =       (xxh_u64x2*) acc;    /* presumed aligned */
-    xxh_u64x2 const* const xinput   = (xxh_u64x2 const*) input;   /* no alignment restriction */
-    xxh_u64x2 const* const xsecret  = (xxh_u64x2 const*) secret;    /* no alignment restriction */
-    xxh_u64x2 const v32 = { 32, 32 };
+    /* presumed aligned */
+    unsigned long long* const xacc = (unsigned long long*) acc;
+    /* presumed unaligned */
+    unsigned char const* const xinput  = (unsigned char const*) input;
+    unsigned char const* const xsecret = (unsigned char const*) secret;
+    xxh_u64x2 const v32 = vec_splats(32ULL);
     size_t i;
     for (i = 0; i < XXH_STRIPE_LEN / sizeof(xxh_u64x2); i++) {
         /* data_vec = xinput[i]; */
-        xxh_u64x2 const data_vec = XXH_vec_loadu(xinput + i);
+        xxh_u64x2 const data_vec = XXH_vec_loadu(xinput + 16 * i);
         /* key_vec = xsecret[i]; */
-        xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + i);
+        xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + 16 * i);
         xxh_u64x2 const data_key = data_vec ^ key_vec;
         /* shuffled = (data_key << 32) | (data_key >> 32); */
-        xxh_u32x4 const shuffled = (xxh_u32x4)vec_rl(data_key, v32);
+        xxh_u64x2 const shuffled = vec_rl(data_key, v32);
         /* product = ((xxh_u64x2)data_key & 0xFFFFFFFF) * ((xxh_u64x2)shuffled & 0xFFFFFFFF); */
-        xxh_u64x2 const product  = XXH_vec_mulo((xxh_u32x4)data_key, shuffled);
-        xacc[i] += product;
+        xxh_u64x2 const product  = XXH_vec_mulo((xxh_u32x4)data_key, (xxh_u32x4)shuffled);
+        /* acc_vec = xacc[i]; */
+        xxh_u64x2 acc_vec        = vec_xl(0, xacc + 2 * i);
+        acc_vec += product;
 
         /* swap high and low halves */
 #ifdef __s390x__
-        xacc[i] += vec_permi(data_vec, data_vec, 2);
+        acc_vec += vec_permi(data_vec, data_vec, 2);
 #else
-        xacc[i] += vec_xxpermdi(data_vec, data_vec, 2);
+        acc_vec += vec_xxpermdi(data_vec, data_vec, 2);
 #endif
+        /* xacc[i] = acc_vec; */
+        vec_xst(acc_vec, 0, xacc + 2 * i);
     }
 }
 
 XXH_FORCE_INLINE void
-XXH3_scrambleAcc_vsx(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
+XXH3_scrambleAcc_vsx(void* XXH_RESTRICT acc, void const* XXH_RESTRICT secret)
 {
     XXH_ASSERT((((size_t)acc) & 15) == 0);
 
-    {         xxh_u64x2* const xacc    =       (xxh_u64x2*) acc;
-        const xxh_u64x2* const xsecret = (const xxh_u64x2*) secret;
+    {   unsigned long long* const xacc = (unsigned long long*) acc;
+        unsigned char const* const xsecret = (unsigned char const*) secret;
         /* constants */
-        xxh_u64x2 const v32  = { 32, 32 };
-        xxh_u64x2 const v47 = { 47, 47 };
-        xxh_u32x4 const prime = { XXH_PRIME32_1, XXH_PRIME32_1, XXH_PRIME32_1, XXH_PRIME32_1 };
+        xxh_u64x2 const v32 = vec_splats(32ULL);
+        xxh_u64x2 const v47 = vec_splats(47ULL);
+        xxh_u32x4 const prime = vec_splats(XXH_PRIME32_1);
         size_t i;
         for (i = 0; i < XXH_STRIPE_LEN / sizeof(xxh_u64x2); i++) {
             /* xacc[i] ^= (xacc[i] >> 47); */
-            xxh_u64x2 const acc_vec  = xacc[i];
+            xxh_u64x2 const acc_vec  = vec_xl(0, xacc + 2 * i);
             xxh_u64x2 const data_vec = acc_vec ^ (acc_vec >> v47);
 
             /* xacc[i] ^= xsecret[i]; */
-            xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + i);
+            xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + 16 * i);
             xxh_u64x2 const data_key = data_vec ^ key_vec;
 
             /* xacc[i] *= XXH_PRIME32_1 */
             /* prod_lo = ((xxh_u64x2)data_key & 0xFFFFFFFF) * ((xxh_u64x2)prime & 0xFFFFFFFF);  */
-            xxh_u64x2 const prod_even  = XXH_vec_mule((xxh_u32x4)data_key, prime);
+            xxh_u64x2 const prod_lo = XXH_vec_mulo((xxh_u32x4)data_key, prime);
             /* prod_hi = ((xxh_u64x2)data_key >> 32) * ((xxh_u64x2)prime >> 32);  */
-            xxh_u64x2 const prod_odd  = XXH_vec_mulo((xxh_u32x4)data_key, prime);
-            xacc[i] = prod_odd + (prod_even << v32);
+            xxh_u64x2 const prod_hi = XXH_vec_mule((xxh_u32x4)data_key, prime);
+            xxh_u64x2 const product = prod_lo + (prod_hi << v32);
+            vec_xst(product, 0, xacc + 2 * i);
     }   }
 }
 

From d8ce409e6c5618542e2ef54e25327f0e4bf61803 Mon Sep 17 00:00:00 2001
From: Michael Ellerman <mpe@ellerman.id.au>
Date: Mon, 15 Nov 2021 21:48:36 +1100
Subject: [PATCH 147/187] Add ppc64le build with GCC 10 to github actions

---
 .github/workflows/ci.yml | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index ed84dbd1..8e7a0dec 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -267,7 +267,8 @@ jobs:
         include: [
           { name: ARM,      xcc_pkg: gcc-arm-linux-gnueabi,     xcc: arm-linux-gnueabi-gcc,     xemu_pkg: qemu-system-arm,    xemu: qemu-arm-static     },
           { name: ARM64,    xcc_pkg: gcc-aarch64-linux-gnu,     xcc: aarch64-linux-gnu-gcc,     xemu_pkg: qemu-system-arm,    xemu: qemu-aarch64-static },
-          { name: PPC64LE,  xcc_pkg: gcc-powerpc64le-linux-gnu, xcc: powerpc64le-linux-gnu-gcc, xemu_pkg: qemu-system-ppc,    xemu: qemu-ppc64le-static },
+          { name: PPC64LE-gcc9,  xcc_pkg: gcc-9-powerpc64le-linux-gnu,  xcc: powerpc64le-linux-gnu-gcc-9,  xemu_pkg: qemu-system-ppc, xemu: qemu-ppc64le-static },
+          { name: PPC64LE-gcc10, xcc_pkg: gcc-10-powerpc64le-linux-gnu, xcc: powerpc64le-linux-gnu-gcc-10, xemu_pkg: qemu-system-ppc, xemu: qemu-ppc64le-static },
           { name: S390X,    xcc_pkg: gcc-s390x-linux-gnu,       xcc: s390x-linux-gnu-gcc,       xemu_pkg: qemu-system-s390x,  xemu: qemu-s390x-static   },
           { name: MIPS,     xcc_pkg: gcc-mips-linux-gnu,        xcc: mips-linux-gnu-gcc,        xemu_pkg: qemu-system-mips,   xemu: qemu-mips-static    },
         ]
@@ -302,8 +303,8 @@ jobs:
         CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
         CPPFLAGS="-DXXH_VECTOR=XXH_NEON" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
 
-    - name: PPC64LE (XXH_VECTOR=[ scalar, VSX ])
-      if: ${{ matrix.name == 'PPC64LE' }}
+    - name: ${{ matrix.name }} (XXH_VECTOR=[ scalar, VSX ])
+      if: ${{ startsWith(matrix.name, 'PPC64LE') }}
       run: |
         CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
         CPPFLAGS="-DXXH_VECTOR=XXH_VSX" CFLAGS="-O3 -maltivec -mvsx -mpower8-vector -mcpu=power8" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check

From 790fdccd7fb1ac5f24d5141f63e7a3faa8de1afd Mon Sep 17 00:00:00 2001
From: Michael Ellerman <mpe@ellerman.id.au>
Date: Mon, 15 Nov 2021 21:52:40 +1100
Subject: [PATCH 148/187] Add ppc64 (big endian) build with GCC 9 & 10 to
 github actions

---
 .github/workflows/ci.yml | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 8e7a0dec..882925f8 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -255,7 +255,7 @@ jobs:
         CFLAGS=-Werror make
 
 
-  # Linux, { ARM, ARM64, PPC64LE, S390X }
+  # Linux, { ARM, ARM64, PPC64LE, PPC64, S390X }
   # All tests are using QEMU and gcc cross compiler.
 
   qemu-consistency:
@@ -269,6 +269,8 @@ jobs:
           { name: ARM64,    xcc_pkg: gcc-aarch64-linux-gnu,     xcc: aarch64-linux-gnu-gcc,     xemu_pkg: qemu-system-arm,    xemu: qemu-aarch64-static },
           { name: PPC64LE-gcc9,  xcc_pkg: gcc-9-powerpc64le-linux-gnu,  xcc: powerpc64le-linux-gnu-gcc-9,  xemu_pkg: qemu-system-ppc, xemu: qemu-ppc64le-static },
           { name: PPC64LE-gcc10, xcc_pkg: gcc-10-powerpc64le-linux-gnu, xcc: powerpc64le-linux-gnu-gcc-10, xemu_pkg: qemu-system-ppc, xemu: qemu-ppc64le-static },
+          { name: PPC64-gcc9,    xcc_pkg: gcc-9-powerpc64-linux-gnu,    xcc: powerpc64-linux-gnu-gcc-9,    xemu_pkg: qemu-system-ppc, xemu: qemu-ppc64-static },
+          { name: PPC64-gcc10,   xcc_pkg: gcc-10-powerpc64-linux-gnu,   xcc: powerpc64-linux-gnu-gcc-10,   xemu_pkg: qemu-system-ppc, xemu: qemu-ppc64-static },
           { name: S390X,    xcc_pkg: gcc-s390x-linux-gnu,       xcc: s390x-linux-gnu-gcc,       xemu_pkg: qemu-system-s390x,  xemu: qemu-s390x-static   },
           { name: MIPS,     xcc_pkg: gcc-mips-linux-gnu,        xcc: mips-linux-gnu-gcc,        xemu_pkg: qemu-system-mips,   xemu: qemu-mips-static    },
         ]
@@ -304,7 +306,7 @@ jobs:
         CPPFLAGS="-DXXH_VECTOR=XXH_NEON" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
 
     - name: ${{ matrix.name }} (XXH_VECTOR=[ scalar, VSX ])
-      if: ${{ startsWith(matrix.name, 'PPC64LE') }}
+      if: ${{ startsWith(matrix.name, 'PPC64') }}
       run: |
         CPPFLAGS="-DXXH_VECTOR=XXH_SCALAR" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check
         CPPFLAGS="-DXXH_VECTOR=XXH_VSX" CFLAGS="-O3 -maltivec -mvsx -mpower8-vector -mcpu=power8" LDFLAGS="-static" CC=$XCC RUN_ENV=$XEMU make clean check

From 398e4844c0b749632003e1250f5284ec63a2e0a5 Mon Sep 17 00:00:00 2001
From: Michael Ellerman <mpe@ellerman.id.au>
Date: Tue, 16 Nov 2021 13:52:06 +1100
Subject: [PATCH 149/187] Revert "[VSX] Fix issues on older compilers"

This reverts commit 7631115f2d6115e95713f2a54d6b592d969945ba.
---
 xxhash.h | 63 +++++++++++++++++++++++++++-----------------------------
 1 file changed, 30 insertions(+), 33 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 78bb9157..247a8449 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2855,7 +2855,7 @@ enum XXH_VECTOR_TYPE /* fake enum */ {
 #    define XXH_VECTOR XXH_NEON
 #  elif (defined(__PPC64__) && defined(__POWER8_VECTOR__)) \
      || (defined(__s390x__) && defined(__VEC__)) \
-     && defined(__GNUC__)
+     && defined(__GNUC__) /* TODO: IBM XL */
 #    define XXH_VECTOR XXH_VSX
 #  else
 #    define XXH_VECTOR XXH_SCALAR
@@ -3103,19 +3103,23 @@ XXH_FORCE_INLINE xxh_u64x2 XXH_vec_loadu(const void *ptr)
  /* s390x is always big endian, no issue on this platform */
 #  define XXH_vec_mulo vec_mulo
 #  define XXH_vec_mule vec_mule
+# elif defined(__clang__) && XXH_HAS_BUILTIN(__builtin_altivec_vmuleuw)
+/* Clang has a better way to control this, we can just use the builtin which doesn't swap. */
+#  define XXH_vec_mulo __builtin_altivec_vmulouw
+#  define XXH_vec_mule __builtin_altivec_vmuleuw
 # else
-/* GCC needs inline assembly */
+/* gcc needs inline assembly */
 /* Adapted from https://github.com/google/highwayhash/blob/master/highwayhash/hh_vsx.h. */
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_mulo(xxh_u32x4 a, xxh_u32x4 b)
 {
     xxh_u64x2 result;
-    __asm__ __volatile__("vmulouw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
+    __asm__("vmulouw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
     return result;
 }
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_mule(xxh_u32x4 a, xxh_u32x4 b)
 {
     xxh_u64x2 result;
-    __asm__ __volatile__("vmuleuw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
+    __asm__("vmuleuw %0, %1, %2" : "=v" (result) : "v" (a), "v" (b));
     return result;
 }
 # endif /* XXH_vec_mulo, XXH_vec_mule */
@@ -4107,66 +4111,59 @@ XXH3_accumulate_512_vsx(  void* XXH_RESTRICT acc,
                     const void* XXH_RESTRICT input,
                     const void* XXH_RESTRICT secret)
 {
-    /* presumed aligned */
-    unsigned long long* const xacc = (unsigned long long*) acc;
-    /* presumed unaligned */
-    unsigned char const* const xinput  = (unsigned char const*) input;
-    unsigned char const* const xsecret = (unsigned char const*) secret;
-    xxh_u64x2 const v32 = vec_splats(32ULL);
+          xxh_u64x2* const xacc     =       (xxh_u64x2*) acc;    /* presumed aligned */
+    xxh_u64x2 const* const xinput   = (xxh_u64x2 const*) input;   /* no alignment restriction */
+    xxh_u64x2 const* const xsecret  = (xxh_u64x2 const*) secret;    /* no alignment restriction */
+    xxh_u64x2 const v32 = { 32, 32 };
     size_t i;
     for (i = 0; i < XXH_STRIPE_LEN / sizeof(xxh_u64x2); i++) {
         /* data_vec = xinput[i]; */
-        xxh_u64x2 const data_vec = XXH_vec_loadu(xinput + 16 * i);
+        xxh_u64x2 const data_vec = XXH_vec_loadu(xinput + i);
         /* key_vec = xsecret[i]; */
-        xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + 16 * i);
+        xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + i);
         xxh_u64x2 const data_key = data_vec ^ key_vec;
         /* shuffled = (data_key << 32) | (data_key >> 32); */
-        xxh_u64x2 const shuffled = vec_rl(data_key, v32);
+        xxh_u32x4 const shuffled = (xxh_u32x4)vec_rl(data_key, v32);
         /* product = ((xxh_u64x2)data_key & 0xFFFFFFFF) * ((xxh_u64x2)shuffled & 0xFFFFFFFF); */
-        xxh_u64x2 const product  = XXH_vec_mulo((xxh_u32x4)data_key, (xxh_u32x4)shuffled);
-        /* acc_vec = xacc[i]; */
-        xxh_u64x2 acc_vec        = vec_xl(0, xacc + 2 * i);
-        acc_vec += product;
+        xxh_u64x2 const product  = XXH_vec_mulo((xxh_u32x4)data_key, shuffled);
+        xacc[i] += product;
 
         /* swap high and low halves */
 #ifdef __s390x__
-        acc_vec += vec_permi(data_vec, data_vec, 2);
+        xacc[i] += vec_permi(data_vec, data_vec, 2);
 #else
-        acc_vec += vec_xxpermdi(data_vec, data_vec, 2);
+        xacc[i] += vec_xxpermdi(data_vec, data_vec, 2);
 #endif
-        /* xacc[i] = acc_vec; */
-        vec_xst(acc_vec, 0, xacc + 2 * i);
     }
 }
 
 XXH_FORCE_INLINE void
-XXH3_scrambleAcc_vsx(void* XXH_RESTRICT acc, void const* XXH_RESTRICT secret)
+XXH3_scrambleAcc_vsx(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
 {
     XXH_ASSERT((((size_t)acc) & 15) == 0);
 
-    {   unsigned long long* const xacc = (unsigned long long*) acc;
-        unsigned char const* const xsecret = (unsigned char const*) secret;
+    {         xxh_u64x2* const xacc    =       (xxh_u64x2*) acc;
+        const xxh_u64x2* const xsecret = (const xxh_u64x2*) secret;
         /* constants */
-        xxh_u64x2 const v32 = vec_splats(32ULL);
-        xxh_u64x2 const v47 = vec_splats(47ULL);
-        xxh_u32x4 const prime = vec_splats(XXH_PRIME32_1);
+        xxh_u64x2 const v32  = { 32, 32 };
+        xxh_u64x2 const v47 = { 47, 47 };
+        xxh_u32x4 const prime = { XXH_PRIME32_1, XXH_PRIME32_1, XXH_PRIME32_1, XXH_PRIME32_1 };
         size_t i;
         for (i = 0; i < XXH_STRIPE_LEN / sizeof(xxh_u64x2); i++) {
             /* xacc[i] ^= (xacc[i] >> 47); */
-            xxh_u64x2 const acc_vec  = vec_xl(0, xacc + 2 * i);
+            xxh_u64x2 const acc_vec  = xacc[i];
             xxh_u64x2 const data_vec = acc_vec ^ (acc_vec >> v47);
 
             /* xacc[i] ^= xsecret[i]; */
-            xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + 16 * i);
+            xxh_u64x2 const key_vec  = XXH_vec_loadu(xsecret + i);
             xxh_u64x2 const data_key = data_vec ^ key_vec;
 
             /* xacc[i] *= XXH_PRIME32_1 */
             /* prod_lo = ((xxh_u64x2)data_key & 0xFFFFFFFF) * ((xxh_u64x2)prime & 0xFFFFFFFF);  */
-            xxh_u64x2 const prod_lo = XXH_vec_mulo((xxh_u32x4)data_key, prime);
+            xxh_u64x2 const prod_even  = XXH_vec_mule((xxh_u32x4)data_key, prime);
             /* prod_hi = ((xxh_u64x2)data_key >> 32) * ((xxh_u64x2)prime >> 32);  */
-            xxh_u64x2 const prod_hi = XXH_vec_mule((xxh_u32x4)data_key, prime);
-            xxh_u64x2 const product = prod_lo + (prod_hi << v32);
-            vec_xst(product, 0, xacc + 2 * i);
+            xxh_u64x2 const prod_odd  = XXH_vec_mulo((xxh_u32x4)data_key, prime);
+            xacc[i] = prod_odd + (prod_even << v32);
     }   }
 }
 

From b40bf8dc3002b6de3f4ec7ab38a45fbca2e5c15e Mon Sep 17 00:00:00 2001
From: Michael Ellerman <mpe@ellerman.id.au>
Date: Tue, 16 Nov 2021 14:20:32 +1100
Subject: [PATCH 150/187] Re-instate just the xacc changes in
 XXH3_accumulate_512_vsx()

---
 xxhash.h | 13 +++++++++----
 1 file changed, 9 insertions(+), 4 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 247a8449..4e4da935 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4111,7 +4111,8 @@ XXH3_accumulate_512_vsx(  void* XXH_RESTRICT acc,
                     const void* XXH_RESTRICT input,
                     const void* XXH_RESTRICT secret)
 {
-          xxh_u64x2* const xacc     =       (xxh_u64x2*) acc;    /* presumed aligned */
+    /* presumed aligned */
+    unsigned long long* const xacc = (unsigned long long*) acc;
     xxh_u64x2 const* const xinput   = (xxh_u64x2 const*) input;   /* no alignment restriction */
     xxh_u64x2 const* const xsecret  = (xxh_u64x2 const*) secret;    /* no alignment restriction */
     xxh_u64x2 const v32 = { 32, 32 };
@@ -4126,14 +4127,18 @@ XXH3_accumulate_512_vsx(  void* XXH_RESTRICT acc,
         xxh_u32x4 const shuffled = (xxh_u32x4)vec_rl(data_key, v32);
         /* product = ((xxh_u64x2)data_key & 0xFFFFFFFF) * ((xxh_u64x2)shuffled & 0xFFFFFFFF); */
         xxh_u64x2 const product  = XXH_vec_mulo((xxh_u32x4)data_key, shuffled);
-        xacc[i] += product;
+        /* acc_vec = xacc[i]; */
+        xxh_u64x2 acc_vec        = vec_xl(0, xacc + 2 * i);
+        acc_vec += product;
 
         /* swap high and low halves */
 #ifdef __s390x__
-        xacc[i] += vec_permi(data_vec, data_vec, 2);
+        acc_vec += vec_permi(data_vec, data_vec, 2);
 #else
-        xacc[i] += vec_xxpermdi(data_vec, data_vec, 2);
+        acc_vec += vec_xxpermdi(data_vec, data_vec, 2);
 #endif
+        /* xacc[i] = acc_vec; */
+        vec_xst(acc_vec, 0, xacc + 2 * i);
     }
 }
 

From 51184e2d7bcf14f0745414fbda07d9c5970a0eec Mon Sep 17 00:00:00 2001
From: "easyaspi314 (Devin)" <easyaspi314@users.noreply.github.com>
Date: Tue, 23 Nov 2021 10:39:02 -0500
Subject: [PATCH 151/187] [NEON] Enable XXH_NEON for MSVC ARM/ARM64

When MSVC is targeting ARMv7VE or ARM64, it will now use the NEON path.

It actually just works(TM) out of the box.

The code hasn't been tested, as I don't have access to an ARM64 Windows
device, but the code appears to be correct judging from the assembly
listings.
---
 xxhash.h | 17 ++++++++++-------
 1 file changed, 10 insertions(+), 7 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 247a8449..8e50df70 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2848,10 +2848,13 @@ enum XXH_VECTOR_TYPE /* fake enum */ {
 #    define XXH_VECTOR XXH_AVX2
 #  elif defined(__SSE2__) || defined(_M_AMD64) || defined(_M_X64) || (defined(_M_IX86_FP) && (_M_IX86_FP == 2))
 #    define XXH_VECTOR XXH_SSE2
-#  elif defined(__GNUC__) /* msvc support maybe later */ \
-  && (defined(__ARM_NEON__) || defined(__ARM_NEON)) \
-  && (defined(__LITTLE_ENDIAN__) /* We only support little endian NEON */ \
-    || (defined(__BYTE_ORDER__) && __BYTE_ORDER__ == __ORDER_LITTLE_ENDIAN__))
+#  elif ( \
+        defined(__ARM_NEON__) || defined(__ARM_NEON) /* gcc */ \
+     || defined(_M_ARM64) || defined(_M_ARM_ARMV7VE) /* msvc */ \
+   ) && ( \
+        defined(_WIN32) || defined(__LITTLE_ENDIAN__) /* little endian only */ \
+    || (defined(__BYTE_ORDER__) && __BYTE_ORDER__ == __ORDER_LITTLE_ENDIAN__) \
+   )
 #    define XXH_VECTOR XXH_NEON
 #  elif (defined(__PPC64__) && defined(__POWER8_VECTOR__)) \
      || (defined(__s390x__) && defined(__VEC__)) \
@@ -3003,7 +3006,7 @@ enum XXH_VECTOR_TYPE /* fake enum */ {
  */
 # if !defined(XXH_NO_VZIP_HACK) /* define to disable */ \
    && defined(__GNUC__) \
-   && !defined(__aarch64__) && !defined(__arm64__)
+   && !defined(__aarch64__) && !defined(__arm64__) && !defined(_M_ARM64)
 #  define XXH_SPLIT_IN_PLACE(in, outLo, outHi)                                              \
     do {                                                                                    \
       /* Undocumented GCC/Clang operand modifier: %e0 = lower D half, %f0 = upper D half */ \
@@ -4066,8 +4069,8 @@ XXH3_scrambleAcc_neon(void* XXH_RESTRICT acc, const void* XXH_RESTRICT secret)
             uint64x2_t data_vec = veorq_u64   (acc_vec, shifted);
 
             /* xacc[i] ^= xsecret[i]; */
-            uint8x16_t key_vec  = vld1q_u8(xsecret + (i * 16));
-            uint64x2_t data_key = veorq_u64(data_vec, vreinterpretq_u64_u8(key_vec));
+            uint8x16_t key_vec  = vld1q_u8    (xsecret + (i * 16));
+            uint64x2_t data_key = veorq_u64   (data_vec, vreinterpretq_u64_u8(key_vec));
 
             /* xacc[i] *= XXH_PRIME32_1 */
             uint32x2_t data_key_lo, data_key_hi;

From cfb05264c282414e6553a83f3aa245c6394d4437 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 10:56:21 -0800
Subject: [PATCH 152/187] removed XXH_ACCEPT_NULL_INPUT_POINTER

All variants are now directly compatible with input_ptr==NULL,
with no extra check nor indirection,
though it now *requires* that, in this case, len == 0.
---
 README.md    |  4 ----
 cli/xxhsum.c |  4 +++-
 xxhash.h     | 66 +++++++++++++++-------------------------------------
 3 files changed, 22 insertions(+), 52 deletions(-)

diff --git a/README.md b/README.md
index 9281ea31..2c9f19f3 100644
--- a/README.md
+++ b/README.md
@@ -119,10 +119,6 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                          This may also increase performance depending on compiler and architecture.
 - `XXH_REROLL`: Reduces the size of the generated code by not unrolling some loops.
                 Impact on performance may vary, depending on platform and algorithm.
-- `XXH_ACCEPT_NULL_INPUT_POINTER`: if set to `1`, when input is a `NULL` pointer,
-                                   xxHash'd result is the same as a zero-length input
-                                   (instead of a dereference segfault).
-                                   Adds one branch at the beginning of each hash.
 - `XXH_STATIC_LINKING_ONLY`: gives access to the state declaration for static allocation.
                              Incompatible with dynamic linking, due to risks of ABI changes.
 - `XXH_NO_XXH3` : removes symbols related to `XXH3` (both 64 & 128 bits) from generated binary.
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index cedb9355..a9015324 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -1414,7 +1414,9 @@ XSUM_API int XSUM_main(int argc, char* argv[])
             {
             /* Display version */
             case 'V':
-                XSUM_log(FULL_WELCOME_MESSAGE(exename)); return 0;
+                XSUM_log(FULL_WELCOME_MESSAGE(exename));
+                XSUM_sanityCheck(); 
+                return 0;
 
             /* Display help on XSUM_usage */
             case 'h':
diff --git a/xxhash.h b/xxhash.h
index 8e50df70..182211ce 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1256,17 +1256,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  * Prefer these methods in priority order (0 > 3 > 1 > 2)
  */
 #  define XXH_FORCE_MEMORY_ACCESS 0
-/*!
- * @def XXH_ACCEPT_NULL_INPUT_POINTER
- * @brief Whether to add explicit `NULL` checks.
- *
- * If the input pointer is `NULL` and the length is non-zero, xxHash's default
- * behavior is to dereference it, triggering a segfault.
- *
- * When this macro is enabled, xxHash actively checks the input for a null pointer.
- * If it is, the result for null input pointers is the same as a zero-length input.
- */
-#  define XXH_ACCEPT_NULL_INPUT_POINTER 0
+
 /*!
  * @def XXH_FORCE_ALIGN_CHECK
  * @brief If defined to non-zero, adds a special path for aligned inputs (XXH32()
@@ -1365,10 +1355,6 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
 #  endif
 #endif
 
-#ifndef XXH_ACCEPT_NULL_INPUT_POINTER   /* can be defined externally */
-#  define XXH_ACCEPT_NULL_INPUT_POINTER 0
-#endif
-
 #ifndef XXH_FORCE_ALIGN_CHECK  /* can be defined externally */
 #  if defined(__i386)  || defined(__x86_64__) || defined(__aarch64__) \
    || defined(_M_IX86) || defined(_M_X64)     || defined(_M_ARM64) /* visual */
@@ -1958,6 +1944,8 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
     h32  = XXH_rotl32(h32, 17) * XXH_PRIME32_4;     \
 } while (0)
 
+    if (ptr==NULL) XXH_ASSERT(len == 0);
+
     /* Compact rerolled version */
     if (XXH_REROLL) {
         len &= 15;
@@ -2034,17 +2022,12 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
 XXH_FORCE_INLINE xxh_u32
 XXH32_endian_align(const xxh_u8* input, size_t len, xxh_u32 seed, XXH_alignment align)
 {
-    const xxh_u8* bEnd = input ? input + len : NULL;
     xxh_u32 h32;
 
-#if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)
-    if (input==NULL) {
-        len=0;
-        bEnd=input=(const xxh_u8*)(size_t)16;
-    }
-#endif
+    if (input==NULL) XXH_ASSERT(len == 0);
 
     if (len>=16) {
+        const xxh_u8* const bEnd = input + len;
         const xxh_u8* const limit = bEnd - 15;
         xxh_u32 v1 = seed + XXH_PRIME32_1 + XXH_PRIME32_2;
         xxh_u32 v2 = seed + XXH_PRIME32_2;
@@ -2130,12 +2113,10 @@ XXH_PUBLIC_API XXH_errorcode XXH32_reset(XXH32_state_t* statePtr, XXH32_hash_t s
 XXH_PUBLIC_API XXH_errorcode
 XXH32_update(XXH32_state_t* state, const void* input, size_t len)
 {
-    if (input==NULL)
-#if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)
+    if (input==NULL) {
+        XXH_ASSERT(len == 0);
         return XXH_OK;
-#else
-        return XXH_ERROR;
-#endif
+    }
 
     {   const xxh_u8* p = (const xxh_u8*)input;
         const xxh_u8* const bEnd = p + len;
@@ -2427,6 +2408,7 @@ static xxh_u64 XXH64_avalanche(xxh_u64 h64)
 static xxh_u64
 XXH64_finalize(xxh_u64 h64, const xxh_u8* ptr, size_t len, XXH_alignment align)
 {
+    if (ptr==NULL) XXH_ASSERT(len == 0);
     len &= 31;
     while (len >= 8) {
         xxh_u64 const k1 = XXH64_round(0, XXH_get64bits(ptr));
@@ -2462,18 +2444,12 @@ XXH64_finalize(xxh_u64 h64, const xxh_u8* ptr, size_t len, XXH_alignment align)
 XXH_FORCE_INLINE xxh_u64
 XXH64_endian_align(const xxh_u8* input, size_t len, xxh_u64 seed, XXH_alignment align)
 {
-    const xxh_u8* bEnd = input ? input + len : NULL;
     xxh_u64 h64;
-
-#if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)
-    if (input==NULL) {
-        len=0;
-        bEnd=input=(const xxh_u8*)(size_t)32;
-    }
-#endif
+    if (input==NULL) XXH_ASSERT(len == 0);
 
     if (len>=32) {
-        const xxh_u8* const limit = bEnd - 32;
+        const xxh_u8* const bEnd = input + len;
+        const xxh_u8* const limit = bEnd - 31;
         xxh_u64 v1 = seed + XXH_PRIME64_1 + XXH_PRIME64_2;
         xxh_u64 v2 = seed + XXH_PRIME64_2;
         xxh_u64 v3 = seed + 0;
@@ -2484,7 +2460,7 @@ XXH64_endian_align(const xxh_u8* input, size_t len, xxh_u64 seed, XXH_alignment
             v2 = XXH64_round(v2, XXH_get64bits(input)); input+=8;
             v3 = XXH64_round(v3, XXH_get64bits(input)); input+=8;
             v4 = XXH64_round(v4, XXH_get64bits(input)); input+=8;
-        } while (input<=limit);
+        } while (input<limit);
 
         h64 = XXH_rotl64(v1, 1) + XXH_rotl64(v2, 7) + XXH_rotl64(v3, 12) + XXH_rotl64(v4, 18);
         h64 = XXH64_mergeRound(h64, v1);
@@ -2560,12 +2536,10 @@ XXH_PUBLIC_API XXH_errorcode XXH64_reset(XXH64_state_t* statePtr, XXH64_hash_t s
 XXH_PUBLIC_API XXH_errorcode
 XXH64_update (XXH64_state_t* state, const void* input, size_t len)
 {
-    if (input==NULL)
-#if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)
+    if (input==NULL) {
+        XXH_ASSERT(len == 0);
         return XXH_OK;
-#else
-        return XXH_ERROR;
-#endif
+    }
 
     {   const xxh_u8* p = (const xxh_u8*)input;
         const xxh_u8* const bEnd = p + len;
@@ -4742,12 +4716,10 @@ XXH3_update(XXH3_state_t* state,
             XXH3_f_accumulate_512 f_acc512,
             XXH3_f_scrambleAcc f_scramble)
 {
-    if (input==NULL)
-#if defined(XXH_ACCEPT_NULL_INPUT_POINTER) && (XXH_ACCEPT_NULL_INPUT_POINTER>=1)
+    if (input==NULL) {
+        XXH_ASSERT(len == 0);
         return XXH_OK;
-#else
-        return XXH_ERROR;
-#endif
+    }
 
     {   const xxh_u8* const bEnd = input + len;
         const unsigned char* const secret = (state->extSecret == NULL) ? state->customSecret : state->extSecret;

From 85e1ea2ab0d90463b804f4a65b0f67d99718c6f8 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 13:57:02 -0800
Subject: [PATCH 153/187] clarify license

fix request #548
---
 LICENSE       |  22 ----
 cli/COPYING   | 339 ++++++++++++++++++++++++++++++++++++++++++++++++++
 cli/README.md |   4 +
 3 files changed, 343 insertions(+), 22 deletions(-)
 create mode 100644 cli/COPYING
 create mode 100644 cli/README.md

diff --git a/LICENSE b/LICENSE
index fa20595d..6bc30a1b 100644
--- a/LICENSE
+++ b/LICENSE
@@ -24,25 +24,3 @@ LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
 ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
 (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
 SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
-
-----------------------------------------------------
-
-xxhsum command line interface
-Copyright (c) 2013-2020 Yann Collet
-All rights reserved.
-
-GPL v2 License
-
-This program is free software; you can redistribute it and/or modify
-it under the terms of the GNU General Public License as published by
-the Free Software Foundation; either version 2 of the License, or
-(at your option) any later version.
-
-This program is distributed in the hope that it will be useful,
-but WITHOUT ANY WARRANTY; without even the implied warranty of
-MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
-GNU General Public License for more details.
-
-You should have received a copy of the GNU General Public License along
-with this program; if not, write to the Free Software Foundation, Inc.,
-51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
diff --git a/cli/COPYING b/cli/COPYING
new file mode 100644
index 00000000..d159169d
--- /dev/null
+++ b/cli/COPYING
@@ -0,0 +1,339 @@
+                    GNU GENERAL PUBLIC LICENSE
+                       Version 2, June 1991
+
+ Copyright (C) 1989, 1991 Free Software Foundation, Inc.,
+ 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ Everyone is permitted to copy and distribute verbatim copies
+ of this license document, but changing it is not allowed.
+
+                            Preamble
+
+  The licenses for most software are designed to take away your
+freedom to share and change it.  By contrast, the GNU General Public
+License is intended to guarantee your freedom to share and change free
+software--to make sure the software is free for all its users.  This
+General Public License applies to most of the Free Software
+Foundation's software and to any other program whose authors commit to
+using it.  (Some other Free Software Foundation software is covered by
+the GNU Lesser General Public License instead.)  You can apply it to
+your programs, too.
+
+  When we speak of free software, we are referring to freedom, not
+price.  Our General Public Licenses are designed to make sure that you
+have the freedom to distribute copies of free software (and charge for
+this service if you wish), that you receive source code or can get it
+if you want it, that you can change the software or use pieces of it
+in new free programs; and that you know you can do these things.
+
+  To protect your rights, we need to make restrictions that forbid
+anyone to deny you these rights or to ask you to surrender the rights.
+These restrictions translate to certain responsibilities for you if you
+distribute copies of the software, or if you modify it.
+
+  For example, if you distribute copies of such a program, whether
+gratis or for a fee, you must give the recipients all the rights that
+you have.  You must make sure that they, too, receive or can get the
+source code.  And you must show them these terms so they know their
+rights.
+
+  We protect your rights with two steps: (1) copyright the software, and
+(2) offer you this license which gives you legal permission to copy,
+distribute and/or modify the software.
+
+  Also, for each author's protection and ours, we want to make certain
+that everyone understands that there is no warranty for this free
+software.  If the software is modified by someone else and passed on, we
+want its recipients to know that what they have is not the original, so
+that any problems introduced by others will not reflect on the original
+authors' reputations.
+
+  Finally, any free program is threatened constantly by software
+patents.  We wish to avoid the danger that redistributors of a free
+program will individually obtain patent licenses, in effect making the
+program proprietary.  To prevent this, we have made it clear that any
+patent must be licensed for everyone's free use or not licensed at all.
+
+  The precise terms and conditions for copying, distribution and
+modification follow.
+
+                    GNU GENERAL PUBLIC LICENSE
+   TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION
+
+  0. This License applies to any program or other work which contains
+a notice placed by the copyright holder saying it may be distributed
+under the terms of this General Public License.  The "Program", below,
+refers to any such program or work, and a "work based on the Program"
+means either the Program or any derivative work under copyright law:
+that is to say, a work containing the Program or a portion of it,
+either verbatim or with modifications and/or translated into another
+language.  (Hereinafter, translation is included without limitation in
+the term "modification".)  Each licensee is addressed as "you".
+
+Activities other than copying, distribution and modification are not
+covered by this License; they are outside its scope.  The act of
+running the Program is not restricted, and the output from the Program
+is covered only if its contents constitute a work based on the
+Program (independent of having been made by running the Program).
+Whether that is true depends on what the Program does.
+
+  1. You may copy and distribute verbatim copies of the Program's
+source code as you receive it, in any medium, provided that you
+conspicuously and appropriately publish on each copy an appropriate
+copyright notice and disclaimer of warranty; keep intact all the
+notices that refer to this License and to the absence of any warranty;
+and give any other recipients of the Program a copy of this License
+along with the Program.
+
+You may charge a fee for the physical act of transferring a copy, and
+you may at your option offer warranty protection in exchange for a fee.
+
+  2. You may modify your copy or copies of the Program or any portion
+of it, thus forming a work based on the Program, and copy and
+distribute such modifications or work under the terms of Section 1
+above, provided that you also meet all of these conditions:
+
+    a) You must cause the modified files to carry prominent notices
+    stating that you changed the files and the date of any change.
+
+    b) You must cause any work that you distribute or publish, that in
+    whole or in part contains or is derived from the Program or any
+    part thereof, to be licensed as a whole at no charge to all third
+    parties under the terms of this License.
+
+    c) If the modified program normally reads commands interactively
+    when run, you must cause it, when started running for such
+    interactive use in the most ordinary way, to print or display an
+    announcement including an appropriate copyright notice and a
+    notice that there is no warranty (or else, saying that you provide
+    a warranty) and that users may redistribute the program under
+    these conditions, and telling the user how to view a copy of this
+    License.  (Exception: if the Program itself is interactive but
+    does not normally print such an announcement, your work based on
+    the Program is not required to print an announcement.)
+
+These requirements apply to the modified work as a whole.  If
+identifiable sections of that work are not derived from the Program,
+and can be reasonably considered independent and separate works in
+themselves, then this License, and its terms, do not apply to those
+sections when you distribute them as separate works.  But when you
+distribute the same sections as part of a whole which is a work based
+on the Program, the distribution of the whole must be on the terms of
+this License, whose permissions for other licensees extend to the
+entire whole, and thus to each and every part regardless of who wrote it.
+
+Thus, it is not the intent of this section to claim rights or contest
+your rights to work written entirely by you; rather, the intent is to
+exercise the right to control the distribution of derivative or
+collective works based on the Program.
+
+In addition, mere aggregation of another work not based on the Program
+with the Program (or with a work based on the Program) on a volume of
+a storage or distribution medium does not bring the other work under
+the scope of this License.
+
+  3. You may copy and distribute the Program (or a work based on it,
+under Section 2) in object code or executable form under the terms of
+Sections 1 and 2 above provided that you also do one of the following:
+
+    a) Accompany it with the complete corresponding machine-readable
+    source code, which must be distributed under the terms of Sections
+    1 and 2 above on a medium customarily used for software interchange; or,
+
+    b) Accompany it with a written offer, valid for at least three
+    years, to give any third party, for a charge no more than your
+    cost of physically performing source distribution, a complete
+    machine-readable copy of the corresponding source code, to be
+    distributed under the terms of Sections 1 and 2 above on a medium
+    customarily used for software interchange; or,
+
+    c) Accompany it with the information you received as to the offer
+    to distribute corresponding source code.  (This alternative is
+    allowed only for noncommercial distribution and only if you
+    received the program in object code or executable form with such
+    an offer, in accord with Subsection b above.)
+
+The source code for a work means the preferred form of the work for
+making modifications to it.  For an executable work, complete source
+code means all the source code for all modules it contains, plus any
+associated interface definition files, plus the scripts used to
+control compilation and installation of the executable.  However, as a
+special exception, the source code distributed need not include
+anything that is normally distributed (in either source or binary
+form) with the major components (compiler, kernel, and so on) of the
+operating system on which the executable runs, unless that component
+itself accompanies the executable.
+
+If distribution of executable or object code is made by offering
+access to copy from a designated place, then offering equivalent
+access to copy the source code from the same place counts as
+distribution of the source code, even though third parties are not
+compelled to copy the source along with the object code.
+
+  4. You may not copy, modify, sublicense, or distribute the Program
+except as expressly provided under this License.  Any attempt
+otherwise to copy, modify, sublicense or distribute the Program is
+void, and will automatically terminate your rights under this License.
+However, parties who have received copies, or rights, from you under
+this License will not have their licenses terminated so long as such
+parties remain in full compliance.
+
+  5. You are not required to accept this License, since you have not
+signed it.  However, nothing else grants you permission to modify or
+distribute the Program or its derivative works.  These actions are
+prohibited by law if you do not accept this License.  Therefore, by
+modifying or distributing the Program (or any work based on the
+Program), you indicate your acceptance of this License to do so, and
+all its terms and conditions for copying, distributing or modifying
+the Program or works based on it.
+
+  6. Each time you redistribute the Program (or any work based on the
+Program), the recipient automatically receives a license from the
+original licensor to copy, distribute or modify the Program subject to
+these terms and conditions.  You may not impose any further
+restrictions on the recipients' exercise of the rights granted herein.
+You are not responsible for enforcing compliance by third parties to
+this License.
+
+  7. If, as a consequence of a court judgment or allegation of patent
+infringement or for any other reason (not limited to patent issues),
+conditions are imposed on you (whether by court order, agreement or
+otherwise) that contradict the conditions of this License, they do not
+excuse you from the conditions of this License.  If you cannot
+distribute so as to satisfy simultaneously your obligations under this
+License and any other pertinent obligations, then as a consequence you
+may not distribute the Program at all.  For example, if a patent
+license would not permit royalty-free redistribution of the Program by
+all those who receive copies directly or indirectly through you, then
+the only way you could satisfy both it and this License would be to
+refrain entirely from distribution of the Program.
+
+If any portion of this section is held invalid or unenforceable under
+any particular circumstance, the balance of the section is intended to
+apply and the section as a whole is intended to apply in other
+circumstances.
+
+It is not the purpose of this section to induce you to infringe any
+patents or other property right claims or to contest validity of any
+such claims; this section has the sole purpose of protecting the
+integrity of the free software distribution system, which is
+implemented by public license practices.  Many people have made
+generous contributions to the wide range of software distributed
+through that system in reliance on consistent application of that
+system; it is up to the author/donor to decide if he or she is willing
+to distribute software through any other system and a licensee cannot
+impose that choice.
+
+This section is intended to make thoroughly clear what is believed to
+be a consequence of the rest of this License.
+
+  8. If the distribution and/or use of the Program is restricted in
+certain countries either by patents or by copyrighted interfaces, the
+original copyright holder who places the Program under this License
+may add an explicit geographical distribution limitation excluding
+those countries, so that distribution is permitted only in or among
+countries not thus excluded.  In such case, this License incorporates
+the limitation as if written in the body of this License.
+
+  9. The Free Software Foundation may publish revised and/or new versions
+of the General Public License from time to time.  Such new versions will
+be similar in spirit to the present version, but may differ in detail to
+address new problems or concerns.
+
+Each version is given a distinguishing version number.  If the Program
+specifies a version number of this License which applies to it and "any
+later version", you have the option of following the terms and conditions
+either of that version or of any later version published by the Free
+Software Foundation.  If the Program does not specify a version number of
+this License, you may choose any version ever published by the Free Software
+Foundation.
+
+  10. If you wish to incorporate parts of the Program into other free
+programs whose distribution conditions are different, write to the author
+to ask for permission.  For software which is copyrighted by the Free
+Software Foundation, write to the Free Software Foundation; we sometimes
+make exceptions for this.  Our decision will be guided by the two goals
+of preserving the free status of all derivatives of our free software and
+of promoting the sharing and reuse of software generally.
+
+                            NO WARRANTY
+
+  11. BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY
+FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW.  EXCEPT WHEN
+OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES
+PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED
+OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
+MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.  THE ENTIRE RISK AS
+TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU.  SHOULD THE
+PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING,
+REPAIR OR CORRECTION.
+
+  12. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
+WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR
+REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES,
+INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING
+OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED
+TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY
+YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER
+PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE
+POSSIBILITY OF SUCH DAMAGES.
+
+                     END OF TERMS AND CONDITIONS
+
+            How to Apply These Terms to Your New Programs
+
+  If you develop a new program, and you want it to be of the greatest
+possible use to the public, the best way to achieve this is to make it
+free software which everyone can redistribute and change under these terms.
+
+  To do so, attach the following notices to the program.  It is safest
+to attach them to the start of each source file to most effectively
+convey the exclusion of warranty; and each file should have at least
+the "copyright" line and a pointer to where the full notice is found.
+
+    <one line to give the program's name and a brief idea of what it does.>
+    Copyright (C) <year>  <name of author>
+
+    This program is free software; you can redistribute it and/or modify
+    it under the terms of the GNU General Public License as published by
+    the Free Software Foundation; either version 2 of the License, or
+    (at your option) any later version.
+
+    This program is distributed in the hope that it will be useful,
+    but WITHOUT ANY WARRANTY; without even the implied warranty of
+    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+    GNU General Public License for more details.
+
+    You should have received a copy of the GNU General Public License along
+    with this program; if not, write to the Free Software Foundation, Inc.,
+    51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+
+Also add information on how to contact you by electronic and paper mail.
+
+If the program is interactive, make it output a short notice like this
+when it starts in an interactive mode:
+
+    Gnomovision version 69, Copyright (C) year name of author
+    Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'.
+    This is free software, and you are welcome to redistribute it
+    under certain conditions; type `show c' for details.
+
+The hypothetical commands `show w' and `show c' should show the appropriate
+parts of the General Public License.  Of course, the commands you use may
+be called something other than `show w' and `show c'; they could even be
+mouse-clicks or menu items--whatever suits your program.
+
+You should also get your employer (if you work as a programmer) or your
+school, if any, to sign a "copyright disclaimer" for the program, if
+necessary.  Here is a sample; alter the names:
+
+  Yoyodyne, Inc., hereby disclaims all copyright interest in the program
+  `Gnomovision' (which makes passes at compilers) written by James Hacker.
+
+  <signature of Ty Coon>, 1 April 1989
+  Ty Coon, President of Vice
+
+This General Public License does not permit incorporating your program into
+proprietary programs.  If your program is a subroutine library, you may
+consider it more useful to permit linking proprietary applications with the
+library.  If this is what you want to do, use the GNU Lesser General
+Public License instead of this License.
diff --git a/cli/README.md b/cli/README.md
new file mode 100644
index 00000000..a60a945f
--- /dev/null
+++ b/cli/README.md
@@ -0,0 +1,4 @@
+This directory contains source code dedicated to the `xxhsum` command line utility,
+which is a user program of `libxxhash`.
+
+Note that, in contrast with the library `libxxhash`, the command line utility `xxhsum` ships with GPLv2 license.

From 4381186f54aed3dcd896becce738a753ef1cb494 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 15:35:19 -0800
Subject: [PATCH 154/187] fixed -Wdocumentation

fix #590,
thanks to @t-mat for providing the solution !
---
 .github/workflows/ci.yml | 2 +-
 xxh_x86dispatch.c        | 2 +-
 xxhash.h                 | 6 +++---
 3 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index ed84dbd1..250a72d2 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -334,7 +334,7 @@ jobs:
 
     - name: make
       run: |
-        CFLAGS="-Werror" make clean default
+        CFLAGS="-Werror -Wdocumentation" make clean default
 
     - name: make test
       run: |
diff --git a/xxh_x86dispatch.c b/xxh_x86dispatch.c
index ab338329..399bad90 100644
--- a/xxh_x86dispatch.c
+++ b/xxh_x86dispatch.c
@@ -229,7 +229,7 @@ extern "C" {
  * @internal
  * @brief Runs CPUID.
  *
- * @param eax, ecx The parameters to pass to CPUID, %eax and %ecx respectively.
+ * @param eax , ecx The parameters to pass to CPUID, %eax and %ecx respectively.
  * @param abcd The array to store the result in, `{ eax, ebx, ecx, edx }`
  */
 static void XXH_cpuid(xxh_u32 eax, xxh_u32 ecx, xxh_u32* abcd)
diff --git a/xxhash.h b/xxhash.h
index 182211ce..7081468c 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -2015,7 +2015,7 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
  * @internal
  * @brief The implementation for @ref XXH32().
  *
- * @param input, len, seed Directly passed from @ref XXH32().
+ * @param input , len , seed Directly passed from @ref XXH32().
  * @param align Whether @p input is aligned.
  * @return The calculated hash.
  */
@@ -3192,7 +3192,7 @@ XXH_mult32to64(xxh_u64 x, xxh_u64 y)
  * Uses `__uint128_t` and `_umul128` if available, otherwise uses a scalar
  * version.
  *
- * @param lhs, rhs The 64-bit integers to be multiplied
+ * @param lhs , rhs The 64-bit integers to be multiplied
  * @return The 128-bit result represented in an @ref XXH128_hash_t.
  */
 static XXH128_hash_t
@@ -3325,7 +3325,7 @@ XXH_mult64to128(xxh_u64 lhs, xxh_u64 rhs)
  * The reason for the separate function is to prevent passing too many structs
  * around by value. This will hopefully inline the multiply, but we don't force it.
  *
- * @param lhs, rhs The 64-bit integers to multiply
+ * @param lhs , rhs The 64-bit integers to multiply
  * @return The low 64 bits of the product XOR'd by the high 64 bits.
  * @see XXH_mult64to128()
  */

From 658d2fef45a49cdcb5988218fa075fb767b63edd Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 16:05:44 -0800
Subject: [PATCH 155/187] removed -Wdocumentation test from CI

seems like a bug in the clang version used by GA.
---
 .github/workflows/ci.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
index 250a72d2..ed84dbd1 100644
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -334,7 +334,7 @@ jobs:
 
     - name: make
       run: |
-        CFLAGS="-Werror -Wdocumentation" make clean default
+        CFLAGS="-Werror" make clean default
 
     - name: make test
       run: |

From 7cc9a62a170c50066e29e973bce620db1cb80513 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 18:37:26 -0800
Subject: [PATCH 156/187] fixed "unused function" warning under MSVC + clang
 compilation

---
 xxhash.h | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 7081468c..a08e70fd 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1432,19 +1432,19 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
 #endif
 
 #if XXH_NO_INLINE_HINTS  /* disable inlining hints */
-#  if defined(__GNUC__)
+#  if defined(__GNUC__) || defined(__clang__)
 #    define XXH_FORCE_INLINE static __attribute__((unused))
 #  else
 #    define XXH_FORCE_INLINE static
 #  endif
 #  define XXH_NO_INLINE static
 /* enable inlining hints */
+#elif defined(__GNUC__) || defined(__clang__)
+#  define XXH_FORCE_INLINE static __inline__ __attribute__((always_inline, unused))
+#  define XXH_NO_INLINE static __attribute__((noinline))
 #elif defined(_MSC_VER)  /* Visual Studio */
 #  define XXH_FORCE_INLINE static __forceinline
 #  define XXH_NO_INLINE static __declspec(noinline)
-#elif defined(__GNUC__)
-#  define XXH_FORCE_INLINE static __inline__ __attribute__((always_inline, unused))
-#  define XXH_NO_INLINE static __attribute__((noinline))
 #elif defined (__cplusplus) \
   || (defined (__STDC_VERSION__) && (__STDC_VERSION__ >= 199901L))   /* C99 */
 #  define XXH_FORCE_INLINE static inline

From cdc182510d4aeae6d1ff08e8d4ab475d87317b0f Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 19:59:45 -0800
Subject: [PATCH 157/187] fix minor conversion warning in clang for windows

fix #588
---
 cli/xxhsum.c | 11 +++--------
 1 file changed, 3 insertions(+), 8 deletions(-)

diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index a9015324..84b5adb0 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -105,11 +105,6 @@ static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & X
 #define MAX_LINE_LENGTH (32 KB)
 
 
-/* ************************************
- *  Display macros
- **************************************/
-
-
 /* ************************************
  *  Local variables
  **************************************/
@@ -589,7 +584,7 @@ static void XSUM_printLine_BSD_internal(const char* filename,
                                         const char* algoString[],
                                         XSUM_displayHash_f f_displayHash)
 {
-    assert(0 <= hashType && hashType <= XSUM_TABLE_ELT_SIZE(XSUM_algoName));
+    assert(0 <= hashType && (size_t)hashType <= XSUM_TABLE_ELT_SIZE(XSUM_algoName));
     {   const char* const typeString = algoString[hashType];
         const size_t hashLength = XSUM_algoLength[hashType];
         XSUM_output("%s (%s) = ", typeString, filename);
@@ -611,7 +606,7 @@ static void XSUM_printLine_GNU_internal(const char* filename,
                                const void* canonicalHash, const AlgoSelected hashType,
                                XSUM_displayHash_f f_displayHash)
 {
-    assert(0 <= hashType && hashType <= XSUM_TABLE_ELT_SIZE(XSUM_algoName));
+    assert(0 <= hashType && (size_t)hashType <= XSUM_TABLE_ELT_SIZE(XSUM_algoName));
     {   const size_t hashLength = XSUM_algoLength[hashType];
         f_displayHash(canonicalHash, hashLength);
         XSUM_output("  %s\n", filename);
@@ -1415,7 +1410,7 @@ XSUM_API int XSUM_main(int argc, char* argv[])
             /* Display version */
             case 'V':
                 XSUM_log(FULL_WELCOME_MESSAGE(exename));
-                XSUM_sanityCheck(); 
+                XSUM_sanityCheck();
                 return 0;
 
             /* Display help on XSUM_usage */

From 259dcd4638d2f5328c3c515a3f0066b904e0b2e9 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 20:10:37 -0800
Subject: [PATCH 158/187] attempt to enable assert() for debug builds

---
 cmake_unofficial/CMakeLists.txt | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 01d62a8c..7acc8b1f 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -42,6 +42,10 @@ endif()
 if(NOT CMAKE_CONFIGURATION_TYPES)
   message(STATUS "xxHash build type: ${CMAKE_BUILD_TYPE}")
 endif()
+# Enable assert() statements in debug builds
+if("${CMAKE_BUILD_TYPE}" STREQUAL "Debug")
+  add_compile_definitions(XXH_DEBUGLEVEL=1)
+endif()
 
 option(BUILD_SHARED_LIBS "Build shared library" ON)
 option(XXHASH_BUILD_XXHSUM "Build the xxhsum binary" ON)

From d2602b9053ad3139a42703220708dcbd95e9ac99 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Thu, 25 Nov 2021 21:06:15 -0800
Subject: [PATCH 159/187] prevent clang's detrimental autovectorization of
 XXH32

when clang is used as the backend compiler of MSVC.
---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index a08e70fd..86fe5a1f 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1511,7 +1511,7 @@ static void* XXH_memcpy(void* dest, const void* src, size_t size)
  * We also use it to prevent unwanted constant folding for AArch64 in
  * XXH3_initCustomSecret_scalar().
  */
-#ifdef __GNUC__
+#if defined(__GNUC__) || defined(__clang__)
 #  define XXH_COMPILER_GUARD(var) __asm__ __volatile__("" : "+r" (var))
 #else
 #  define XXH_COMPILER_GUARD(var) ((void)0)

From a2d465dadf12f965d499bb0f9b8d79d3169d14b5 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Fri, 26 Nov 2021 16:24:59 -0800
Subject: [PATCH 160/187] added XXH32 + XXH64 streaming variants to benchmark

---
 cli/xxhsum.c | 26 ++++++++++++++++++++++----
 1 file changed, 22 insertions(+), 4 deletions(-)

diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index 84b5adb0..56bce1ba 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -186,10 +186,26 @@ static XSUM_U32 localXXH32(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     return XXH32(buffer, bufferSize, seed);
 }
+static XSUM_U32 localXXH32_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH32_state_t state;
+    (void)seed;
+    XXH32_reset(&state, seed);
+    XXH32_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH32_digest(&state);
+}
 static XSUM_U32 localXXH64(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     return (XSUM_U32)XXH64(buffer, bufferSize, seed);
 }
+static XSUM_U32 localXXH64_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH64_state_t state;
+    (void)seed;
+    XXH64_reset(&state, seed);
+    XXH64_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH64_digest(&state);
+}
 static XSUM_U32 localXXH3_64b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
 {
     (void)seed;
@@ -257,8 +273,7 @@ typedef struct {
     hashFunction func;
 } hashInfo;
 
-#define NB_HASHFUNC 12
-static const hashInfo g_hashesToBench[NB_HASHFUNC] = {
+static const hashInfo g_hashesToBench[] = {
     { "XXH32",             &localXXH32 },
     { "XXH64",             &localXXH64 },
     { "XXH3_64b",          &localXXH3_64b },
@@ -267,11 +282,14 @@ static const hashInfo g_hashesToBench[NB_HASHFUNC] = {
     { "XXH128",            &localXXH3_128b },
     { "XXH128 w/seed",     &localXXH3_128b_seeded },
     { "XXH128 w/secret",   &localXXH3_128b_secret },
+    { "XXH32_stream",      &localXXH32_stream },
+    { "XXH64_stream",      &localXXH64_stream },
     { "XXH3_stream",       &localXXH3_stream },
     { "XXH3_stream w/seed",&localXXH3_stream_seeded },
     { "XXH128_stream",     &localXXH128_stream },
     { "XXH128_stream w/seed",&localXXH128_stream_seeded },
 };
+#define NB_HASHFUNC (sizeof(g_hashesToBench) / sizeof(*g_hashesToBench))
 
 #define NB_TESTFUNC (1 + 2 * NB_HASHFUNC)
 static char g_testIDs[NB_TESTFUNC] = { 0 };
@@ -283,7 +301,7 @@ static const char k_testIDs_default[NB_TESTFUNC] = { 0,
 
 #define HASHNAME_MAX 29
 static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
-                          const void* buffer, size_t bufferSize)
+                           const void* buffer, size_t bufferSize)
 {
     XSUM_U32 nbh_perIteration = (XSUM_U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
     unsigned iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
@@ -384,7 +402,7 @@ static void XSUM_benchMem(const void* buffer, size_t bufferSize)
     assert((((size_t)buffer) & 15) == 0);  /* ensure alignment */
     XSUM_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
     {   int i;
-        for (i = 1; i < NB_TESTFUNC; i++) {
+        for (i = 1; i < (int)NB_TESTFUNC; i++) {
             int const hashFuncID = (i-1) / 2;
             assert(g_hashesToBench[hashFuncID].name != NULL);
             if (g_testIDs[i] == 0) continue;

From d14b2934c5391895e0c27c34aceb97202601fe7c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Fri, 26 Nov 2021 20:41:18 -0800
Subject: [PATCH 161/187] reduce stack usage of XXH32 streaming variant

no longer need intermediate stack variables.
Checked that performance remains identical
across a large series of compilers and versions.
---
 xxhash.h | 47 ++++++++++++++++++-----------------------------
 1 file changed, 18 insertions(+), 29 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 86fe5a1f..2cf82a03 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -965,10 +965,7 @@ XXH_PUBLIC_API XXH128_hash_t XXH128_hashFromCanonical(const XXH128_canonical_t*
 struct XXH32_state_s {
    XXH32_hash_t total_len_32; /*!< Total length hashed, modulo 2^32 */
    XXH32_hash_t large_len;    /*!< Whether the hash is >= 16 (handles @ref total_len_32 overflow) */
-   XXH32_hash_t v1;           /*!< First accumulator lane */
-   XXH32_hash_t v2;           /*!< Second accumulator lane */
-   XXH32_hash_t v3;           /*!< Third accumulator lane */
-   XXH32_hash_t v4;           /*!< Fourth accumulator lane */
+   XXH32_hash_t v[4];         /*!< Accumulator lanes */
    XXH32_hash_t mem32[4];     /*!< Internal buffer for partial reads. Treated as unsigned char[16]. */
    XXH32_hash_t memsize;      /*!< Amount of data in @ref mem32 */
    XXH32_hash_t reserved;     /*!< Reserved field. Do not read or write to it, it may be removed. */
@@ -2099,10 +2096,10 @@ XXH_PUBLIC_API XXH_errorcode XXH32_reset(XXH32_state_t* statePtr, XXH32_hash_t s
 {
     XXH32_state_t state;   /* using a local state to memcpy() in order to avoid strict-aliasing warnings */
     memset(&state, 0, sizeof(state));
-    state.v1 = seed + XXH_PRIME32_1 + XXH_PRIME32_2;
-    state.v2 = seed + XXH_PRIME32_2;
-    state.v3 = seed + 0;
-    state.v4 = seed - XXH_PRIME32_1;
+    state.v[0] = seed + XXH_PRIME32_1 + XXH_PRIME32_2;
+    state.v[1] = seed + XXH_PRIME32_2;
+    state.v[2] = seed + 0;
+    state.v[3] = seed - XXH_PRIME32_1;
     /* do not write into reserved, planned to be removed in a future version */
     memcpy(statePtr, &state, sizeof(state) - sizeof(state.reserved));
     return XXH_OK;
@@ -2133,10 +2130,10 @@ XXH32_update(XXH32_state_t* state, const void* input, size_t len)
         if (state->memsize) {   /* some data left from previous update */
             XXH_memcpy((xxh_u8*)(state->mem32) + state->memsize, input, 16-state->memsize);
             {   const xxh_u32* p32 = state->mem32;
-                state->v1 = XXH32_round(state->v1, XXH_readLE32(p32)); p32++;
-                state->v2 = XXH32_round(state->v2, XXH_readLE32(p32)); p32++;
-                state->v3 = XXH32_round(state->v3, XXH_readLE32(p32)); p32++;
-                state->v4 = XXH32_round(state->v4, XXH_readLE32(p32));
+                state->v[0] = XXH32_round(state->v[0], XXH_readLE32(p32)); p32++;
+                state->v[1] = XXH32_round(state->v[1], XXH_readLE32(p32)); p32++;
+                state->v[2] = XXH32_round(state->v[2], XXH_readLE32(p32)); p32++;
+                state->v[3] = XXH32_round(state->v[3], XXH_readLE32(p32));
             }
             p += 16-state->memsize;
             state->memsize = 0;
@@ -2144,22 +2141,14 @@ XXH32_update(XXH32_state_t* state, const void* input, size_t len)
 
         if (p <= bEnd-16) {
             const xxh_u8* const limit = bEnd - 16;
-            xxh_u32 v1 = state->v1;
-            xxh_u32 v2 = state->v2;
-            xxh_u32 v3 = state->v3;
-            xxh_u32 v4 = state->v4;
 
             do {
-                v1 = XXH32_round(v1, XXH_readLE32(p)); p+=4;
-                v2 = XXH32_round(v2, XXH_readLE32(p)); p+=4;
-                v3 = XXH32_round(v3, XXH_readLE32(p)); p+=4;
-                v4 = XXH32_round(v4, XXH_readLE32(p)); p+=4;
+                state->v[0] = XXH32_round(state->v[0], XXH_readLE32(p)); p+=4;
+                state->v[1] = XXH32_round(state->v[1], XXH_readLE32(p)); p+=4;
+                state->v[2] = XXH32_round(state->v[2], XXH_readLE32(p)); p+=4;
+                state->v[3] = XXH32_round(state->v[3], XXH_readLE32(p)); p+=4;
             } while (p<=limit);
 
-            state->v1 = v1;
-            state->v2 = v2;
-            state->v3 = v3;
-            state->v4 = v4;
         }
 
         if (p < bEnd) {
@@ -2178,12 +2167,12 @@ XXH_PUBLIC_API XXH32_hash_t XXH32_digest(const XXH32_state_t* state)
     xxh_u32 h32;
 
     if (state->large_len) {
-        h32 = XXH_rotl32(state->v1, 1)
-            + XXH_rotl32(state->v2, 7)
-            + XXH_rotl32(state->v3, 12)
-            + XXH_rotl32(state->v4, 18);
+        h32 = XXH_rotl32(state->v[0], 1)
+            + XXH_rotl32(state->v[1], 7)
+            + XXH_rotl32(state->v[2], 12)
+            + XXH_rotl32(state->v[3], 18);
     } else {
-        h32 = state->v3 /* == seed */ + XXH_PRIME32_5;
+        h32 = state->v[2] /* == seed */ + XXH_PRIME32_5;
     }
 
     h32 += state->total_len_32;

From 61041e5b22b0fffc3521c525df6934f8b91d7964 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 09:12:35 -0800
Subject: [PATCH 162/187] reduce stack usage of XXH64 streaming variant

---
 xxhash.h | 54 +++++++++++++++++++-----------------------------------
 1 file changed, 19 insertions(+), 35 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 2cf82a03..ebd1a006 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -988,10 +988,7 @@ struct XXH32_state_s {
  */
 struct XXH64_state_s {
    XXH64_hash_t total_len;    /*!< Total length hashed. This is always 64-bit. */
-   XXH64_hash_t v1;           /*!< First accumulator lane */
-   XXH64_hash_t v2;           /*!< Second accumulator lane */
-   XXH64_hash_t v3;           /*!< Third accumulator lane */
-   XXH64_hash_t v4;           /*!< Fourth accumulator lane */
+   XXH64_hash_t v[4];         /*!< Accumulator lanes */
    XXH64_hash_t mem64[4];     /*!< Internal buffer for partial reads. Treated as unsigned char[32]. */
    XXH32_hash_t memsize;      /*!< Amount of data in @ref mem64 */
    XXH32_hash_t reserved32;   /*!< Reserved field, needed for padding anyways*/
@@ -2512,10 +2509,10 @@ XXH_PUBLIC_API XXH_errorcode XXH64_reset(XXH64_state_t* statePtr, XXH64_hash_t s
 {
     XXH64_state_t state;   /* use a local state to memcpy() in order to avoid strict-aliasing warnings */
     memset(&state, 0, sizeof(state));
-    state.v1 = seed + XXH_PRIME64_1 + XXH_PRIME64_2;
-    state.v2 = seed + XXH_PRIME64_2;
-    state.v3 = seed + 0;
-    state.v4 = seed - XXH_PRIME64_1;
+    state.v[0] = seed + XXH_PRIME64_1 + XXH_PRIME64_2;
+    state.v[1] = seed + XXH_PRIME64_2;
+    state.v[2] = seed + 0;
+    state.v[3] = seed - XXH_PRIME64_1;
      /* do not write into reserved64, might be removed in a future version */
     memcpy(statePtr, &state, sizeof(state) - sizeof(state.reserved64));
     return XXH_OK;
@@ -2543,32 +2540,24 @@ XXH64_update (XXH64_state_t* state, const void* input, size_t len)
 
         if (state->memsize) {   /* tmp buffer is full */
             XXH_memcpy(((xxh_u8*)state->mem64) + state->memsize, input, 32-state->memsize);
-            state->v1 = XXH64_round(state->v1, XXH_readLE64(state->mem64+0));
-            state->v2 = XXH64_round(state->v2, XXH_readLE64(state->mem64+1));
-            state->v3 = XXH64_round(state->v3, XXH_readLE64(state->mem64+2));
-            state->v4 = XXH64_round(state->v4, XXH_readLE64(state->mem64+3));
+            state->v[0] = XXH64_round(state->v[0], XXH_readLE64(state->mem64+0));
+            state->v[1] = XXH64_round(state->v[1], XXH_readLE64(state->mem64+1));
+            state->v[2] = XXH64_round(state->v[2], XXH_readLE64(state->mem64+2));
+            state->v[3] = XXH64_round(state->v[3], XXH_readLE64(state->mem64+3));
             p += 32 - state->memsize;
             state->memsize = 0;
         }
 
         if (p+32 <= bEnd) {
             const xxh_u8* const limit = bEnd - 32;
-            xxh_u64 v1 = state->v1;
-            xxh_u64 v2 = state->v2;
-            xxh_u64 v3 = state->v3;
-            xxh_u64 v4 = state->v4;
 
             do {
-                v1 = XXH64_round(v1, XXH_readLE64(p)); p+=8;
-                v2 = XXH64_round(v2, XXH_readLE64(p)); p+=8;
-                v3 = XXH64_round(v3, XXH_readLE64(p)); p+=8;
-                v4 = XXH64_round(v4, XXH_readLE64(p)); p+=8;
+                state->v[0] = XXH64_round(state->v[0], XXH_readLE64(p)); p+=8;
+                state->v[1] = XXH64_round(state->v[1], XXH_readLE64(p)); p+=8;
+                state->v[2] = XXH64_round(state->v[2], XXH_readLE64(p)); p+=8;
+                state->v[3] = XXH64_round(state->v[3], XXH_readLE64(p)); p+=8;
             } while (p<=limit);
 
-            state->v1 = v1;
-            state->v2 = v2;
-            state->v3 = v3;
-            state->v4 = v4;
         }
 
         if (p < bEnd) {
@@ -2587,18 +2576,13 @@ XXH_PUBLIC_API XXH64_hash_t XXH64_digest(const XXH64_state_t* state)
     xxh_u64 h64;
 
     if (state->total_len >= 32) {
-        xxh_u64 const v1 = state->v1;
-        xxh_u64 const v2 = state->v2;
-        xxh_u64 const v3 = state->v3;
-        xxh_u64 const v4 = state->v4;
-
-        h64 = XXH_rotl64(v1, 1) + XXH_rotl64(v2, 7) + XXH_rotl64(v3, 12) + XXH_rotl64(v4, 18);
-        h64 = XXH64_mergeRound(h64, v1);
-        h64 = XXH64_mergeRound(h64, v2);
-        h64 = XXH64_mergeRound(h64, v3);
-        h64 = XXH64_mergeRound(h64, v4);
+        h64 = XXH_rotl64(state->v[0], 1) + XXH_rotl64(state->v[1], 7) + XXH_rotl64(state->v[2], 12) + XXH_rotl64(state->v[3], 18);
+        h64 = XXH64_mergeRound(h64, state->v[0]);
+        h64 = XXH64_mergeRound(h64, state->v[1]);
+        h64 = XXH64_mergeRound(h64, state->v[2]);
+        h64 = XXH64_mergeRound(h64, state->v[3]);
     } else {
-        h64  = state->v3 /*seed*/ + XXH_PRIME64_5;
+        h64  = state->v[2] /*seed*/ + XXH_PRIME64_5;
     }
 
     h64 += (xxh_u64) state->total_len;

From 911497efe10698ff6af3b610570e8f6cbd6ba603 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 10:03:21 -0800
Subject: [PATCH 163/187] updated code comments and documentation

notably the existence of build macro XXH_DEBUGLEVEL
to enable assert() debugging statements.
---
 README.md |  6 ++++--
 xxhash.h  | 21 ++++++++++-----------
 2 files changed, 14 insertions(+), 13 deletions(-)

diff --git a/README.md b/README.md
index 2c9f19f3..760199ee 100644
--- a/README.md
+++ b/README.md
@@ -119,7 +119,7 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                          This may also increase performance depending on compiler and architecture.
 - `XXH_REROLL`: Reduces the size of the generated code by not unrolling some loops.
                 Impact on performance may vary, depending on platform and algorithm.
-- `XXH_STATIC_LINKING_ONLY`: gives access to the state declaration for static allocation.
+- `XXH_STATIC_LINKING_ONLY`: gives access to internal state declaration, required for static allocation.
                              Incompatible with dynamic linking, due to risks of ABI changes.
 - `XXH_NO_XXH3` : removes symbols related to `XXH3` (both 64 & 128 bits) from generated binary.
                   Useful to reduce binary size, notably for applications which do not use `XXH3`.
@@ -130,8 +130,10 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                            If, for some reason, the compiler cannot simplify the runtime test, it can cost performance.
                            It's possible to skip auto-detection and simply state that the architecture is little-endian by setting this macro to 1.
                            Setting it to 0 states big-endian.
+- `XXH_DEBUGLEVEL` : When set to any value >= 1, enables `assert()` statements.
+                     This (slightly) slows down execution, but may help finding bugs during debugging sessions.
 
-For the Command Line Interface `xxhsum`, the following environment variables can also be set :
+When compiling the Command Line Interface `xxhsum` with `make`, the following environment variables can also be set :
 - `DISPATCH=1` : use `xxh_x86dispatch.c`, to automatically select between `scalar`, `sse2`, `avx2` or `avx512` instruction set at runtime, depending on local host. This option is only valid for `x86`/`x64` systems.
 
 
diff --git a/xxhash.h b/xxhash.h
index ebd1a006..11073736 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -321,16 +321,16 @@ extern "C" {
 /*!
  * @brief Obtains the xxHash version.
  *
- * This is only useful when xxHash is compiled as a shared library, as it is
- * independent of the version defined in the header.
+ * This is mostly useful when xxHash is compiled as a shared library,
+ * since the returned value comes from the library, as opposed to header file.
  *
- * @return `XXH_VERSION_NUMBER` as of when the library was compiled.
+ * @return `XXH_VERSION_NUMBER` of the invoked library.
  */
 XXH_PUBLIC_API unsigned XXH_versionNumber (void);
 
 
 /* ****************************
-*  Definitions
+*  Common basic types
 ******************************/
 #include <stddef.h>   /* size_t */
 typedef enum { XXH_OK=0, XXH_ERROR } XXH_errorcode;
@@ -374,10 +374,9 @@ typedef uint32_t XXH32_hash_t;
  * Contains functions used in the classic 32-bit xxHash algorithm.
  *
  * @note
- *   XXH32 is considered rather weak by today's standards.
- *   The @ref xxh3_family provides competitive speed for both 32-bit and 64-bit
- *   systems, and offers true 64/128 bit hash results. It provides a superior
- *   level of dispersion, and greatly reduces the risks of collisions.
+ *   XXH32 is useful for older platforms, with no or poor 64-bit performance.
+ *   Note that @ref xxh3_family provides competitive speed
+ *   for both 32-bit and 64-bit systems, and offers true 64/128 bit hash results.
  *
  * @see @ref xxh64_family, @ref xxh3_family : Other xxHash families
  * @see @ref xxh32_impl for implementation details
@@ -672,8 +671,8 @@ typedef uint64_t XXH64_hash_t;
  *
  * @note
  *   XXH3 provides competitive speed for both 32-bit and 64-bit systems,
- *   and offers true 64/128 bit hash results. It provides a superior level of
- *   dispersion, and greatly reduces the risks of collisions.
+ *   and offers true 64/128 bit hash results.
+ *   It provides better speed for systems with vector processing capabilities.
  */
 
 
@@ -805,7 +804,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSeed(const void* data, size_t len, X
  * Whenever unsure about the "randomness" of the blob of bytes,
  * consider relabelling it as a "custom seed" instead,
  * and employ "XXH3_generateSecret()" (see below)
- * to generate a high entropy secret derived from the custom seed.
+ * to generate a proper high entropy secret derived from the custom seed.
  */
 XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSecret(const void* data, size_t len, const void* secret, size_t secretSize);
 

From c43b8c79e5cd5046877593b26f567491ad26d91f Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 10:32:22 -0800
Subject: [PATCH 164/187] xxhash: all invocations of memcpy() via XXH_memcpy()
 shim

---
 xxhash.h | 49 +++++++++++++++++++++++++------------------------
 1 file changed, 25 insertions(+), 24 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 11073736..691f6cd7 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1620,7 +1620,7 @@ static xxh_u32 XXH_read32(const void* ptr)
 static xxh_u32 XXH_read32(const void* memPtr)
 {
     xxh_u32 val;
-    memcpy(&val, memPtr, sizeof(val));
+    XXH_memcpy(&val, memPtr, sizeof(val));
     return val;
 }
 
@@ -2084,7 +2084,7 @@ XXH_PUBLIC_API XXH_errorcode XXH32_freeState(XXH32_state_t* statePtr)
 /*! @ingroup xxh32_family */
 XXH_PUBLIC_API void XXH32_copyState(XXH32_state_t* dstState, const XXH32_state_t* srcState)
 {
-    memcpy(dstState, srcState, sizeof(*dstState));
+    XXH_memcpy(dstState, srcState, sizeof(*dstState));
 }
 
 /*! @ingroup xxh32_family */
@@ -2097,7 +2097,7 @@ XXH_PUBLIC_API XXH_errorcode XXH32_reset(XXH32_state_t* statePtr, XXH32_hash_t s
     state.v[2] = seed + 0;
     state.v[3] = seed - XXH_PRIME32_1;
     /* do not write into reserved, planned to be removed in a future version */
-    memcpy(statePtr, &state, sizeof(state) - sizeof(state.reserved));
+    XXH_memcpy(statePtr, &state, sizeof(state) - sizeof(state.reserved));
     return XXH_OK;
 }
 
@@ -2197,7 +2197,7 @@ XXH_PUBLIC_API void XXH32_canonicalFromHash(XXH32_canonical_t* dst, XXH32_hash_t
 {
     XXH_STATIC_ASSERT(sizeof(XXH32_canonical_t) == sizeof(XXH32_hash_t));
     if (XXH_CPU_LITTLE_ENDIAN) hash = XXH_swap32(hash);
-    memcpy(dst, &hash, sizeof(*dst));
+    XXH_memcpy(dst, &hash, sizeof(*dst));
 }
 /*! @ingroup xxh32_family */
 XXH_PUBLIC_API XXH32_hash_t XXH32_hashFromCanonical(const XXH32_canonical_t* src)
@@ -2263,7 +2263,7 @@ static xxh_u64 XXH_read64(const void* ptr)
 static xxh_u64 XXH_read64(const void* memPtr)
 {
     xxh_u64 val;
-    memcpy(&val, memPtr, sizeof(val));
+    XXH_memcpy(&val, memPtr, sizeof(val));
     return val;
 }
 
@@ -2500,7 +2500,7 @@ XXH_PUBLIC_API XXH_errorcode XXH64_freeState(XXH64_state_t* statePtr)
 /*! @ingroup xxh64_family */
 XXH_PUBLIC_API void XXH64_copyState(XXH64_state_t* dstState, const XXH64_state_t* srcState)
 {
-    memcpy(dstState, srcState, sizeof(*dstState));
+    XXH_memcpy(dstState, srcState, sizeof(*dstState));
 }
 
 /*! @ingroup xxh64_family */
@@ -2513,7 +2513,7 @@ XXH_PUBLIC_API XXH_errorcode XXH64_reset(XXH64_state_t* statePtr, XXH64_hash_t s
     state.v[2] = seed + 0;
     state.v[3] = seed - XXH_PRIME64_1;
      /* do not write into reserved64, might be removed in a future version */
-    memcpy(statePtr, &state, sizeof(state) - sizeof(state.reserved64));
+    XXH_memcpy(statePtr, &state, sizeof(state) - sizeof(state.reserved64));
     return XXH_OK;
 }
 
@@ -2597,7 +2597,7 @@ XXH_PUBLIC_API void XXH64_canonicalFromHash(XXH64_canonical_t* dst, XXH64_hash_t
 {
     XXH_STATIC_ASSERT(sizeof(XXH64_canonical_t) == sizeof(XXH64_hash_t));
     if (XXH_CPU_LITTLE_ENDIAN) hash = XXH_swap64(hash);
-    memcpy(dst, &hash, sizeof(*dst));
+    XXH_memcpy(dst, &hash, sizeof(*dst));
 }
 
 /*! @ingroup xxh64_family */
@@ -3035,7 +3035,7 @@ XXH_FORCE_INLINE xxh_u64x2 XXH_vec_revb(xxh_u64x2 val)
 XXH_FORCE_INLINE xxh_u64x2 XXH_vec_loadu(const void *ptr)
 {
     xxh_u64x2 ret;
-    memcpy(&ret, ptr, sizeof(xxh_u64x2));
+    XXH_memcpy(&ret, ptr, sizeof(xxh_u64x2));
 # if XXH_VSX_BE
     ret = XXH_vec_revb(ret);
 # endif
@@ -3599,7 +3599,7 @@ XXH3_len_129to240_64b(const xxh_u8* XXH_RESTRICT input, size_t len,
 XXH_FORCE_INLINE void XXH_writeLE64(void* dst, xxh_u64 v64)
 {
     if (!XXH_CPU_LITTLE_ENDIAN) v64 = XXH_swap64(v64);
-    memcpy(dst, &v64, sizeof(v64));
+    XXH_memcpy(dst, &v64, sizeof(v64));
 }
 
 /* Several intrinsic functions below are supposed to accept __int64 as argument,
@@ -4592,7 +4592,7 @@ XXH_PUBLIC_API XXH_errorcode XXH3_freeState(XXH3_state_t* statePtr)
 XXH_PUBLIC_API void
 XXH3_copyState(XXH3_state_t* dst_state, const XXH3_state_t* src_state)
 {
-    memcpy(dst_state, src_state, sizeof(*dst_state));
+    XXH_memcpy(dst_state, src_state, sizeof(*dst_state));
 }
 
 static void
@@ -4699,13 +4699,14 @@ XXH3_update(XXH3_state_t* state,
         state->totalLen += len;
         XXH_ASSERT(state->bufferedSize <= XXH3_INTERNALBUFFER_SIZE);
 
-        if (state->bufferedSize + len <= XXH3_INTERNALBUFFER_SIZE) {  /* fill in tmp buffer */
+        /* small input : just fill in tmp buffer */
+        if (state->bufferedSize + len <= XXH3_INTERNALBUFFER_SIZE) {
             XXH_memcpy(state->buffer + state->bufferedSize, input, len);
             state->bufferedSize += (XXH32_hash_t)len;
             return XXH_OK;
         }
-        /* total input is now > XXH3_INTERNALBUFFER_SIZE */
 
+        /* total input is now > XXH3_INTERNALBUFFER_SIZE */
         #define XXH3_INTERNALBUFFER_STRIPES (XXH3_INTERNALBUFFER_SIZE / XXH_STRIPE_LEN)
         XXH_STATIC_ASSERT(XXH3_INTERNALBUFFER_SIZE % XXH_STRIPE_LEN == 0);   /* clean multiple */
 
@@ -4738,7 +4739,7 @@ XXH3_update(XXH3_state_t* state,
                 input += XXH3_INTERNALBUFFER_SIZE;
             } while (input<limit);
             /* for last partial stripe */
-            memcpy(state->buffer + sizeof(state->buffer) - XXH_STRIPE_LEN, input - XXH_STRIPE_LEN, XXH_STRIPE_LEN);
+            XXH_memcpy(state->buffer + sizeof(state->buffer) - XXH_STRIPE_LEN, input - XXH_STRIPE_LEN, XXH_STRIPE_LEN);
         }
         XXH_ASSERT(input < bEnd);
 
@@ -4768,7 +4769,7 @@ XXH3_digest_long (XXH64_hash_t* acc,
      * Digest on a local copy. This way, the state remains unaltered, and it can
      * continue ingesting more input afterwards.
      */
-    memcpy(acc, state->acc, sizeof(state->acc));
+    XXH_memcpy(acc, state->acc, sizeof(state->acc));
     if (state->bufferedSize >= XXH_STRIPE_LEN) {
         size_t const nbStripes = (state->bufferedSize - 1) / XXH_STRIPE_LEN;
         size_t nbStripesSoFar = state->nbStripesSoFar;
@@ -4785,8 +4786,8 @@ XXH3_digest_long (XXH64_hash_t* acc,
         xxh_u8 lastStripe[XXH_STRIPE_LEN];
         size_t const catchupSize = XXH_STRIPE_LEN - state->bufferedSize;
         XXH_ASSERT(state->bufferedSize > 0);  /* there is always some input buffered */
-        memcpy(lastStripe, state->buffer + sizeof(state->buffer) - catchupSize, catchupSize);
-        memcpy(lastStripe + catchupSize, state->buffer, state->bufferedSize);
+        XXH_memcpy(lastStripe, state->buffer + sizeof(state->buffer) - catchupSize, catchupSize);
+        XXH_memcpy(lastStripe + catchupSize, state->buffer, state->bufferedSize);
         XXH3_accumulate_512(acc,
                             lastStripe,
                             secret + state->secretLimit - XXH_SECRET_LASTACC_START);
@@ -4820,7 +4821,7 @@ XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSee
 {
     XXH_ASSERT(secretBuffer != NULL);
     if (customSeedSize == 0) {
-        memcpy(secretBuffer, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
+        XXH_memcpy(secretBuffer, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
         return;
     }
     XXH_ASSERT(customSeed != NULL);
@@ -4839,21 +4840,21 @@ XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSee
         */
         {   size_t toFill = XXH_MIN(customSeedSize, sizeof(seeds));
             size_t filled = toFill;
-            memcpy(seeds, customSeed, toFill);
+            XXH_memcpy(seeds, customSeed, toFill);
             while (filled < sizeof(seeds)) {
                 toFill = XXH_MIN(filled, sizeof(seeds) - filled);
-                memcpy((char*)seeds + filled, seeds, toFill);
+                XXH_memcpy((char*)seeds + filled, seeds, toFill);
                 filled += toFill;
         }   }
 
         /* generate secret */
-        memcpy(secretBuffer, &scrambler, sizeof(scrambler));
+        XXH_memcpy(secretBuffer, &scrambler, sizeof(scrambler));
         for (segnb=1; segnb < nbSegments; segnb++) {
             size_t const segmentStart = segnb * segmentSize;
             XXH128_canonical_t segment;
             XXH128_canonicalFromHash(&segment,
                 XXH128(&scrambler, sizeof(scrambler), XXH_readLE64(seeds + segnb) + segnb) );
-            memcpy((char*)secretBuffer + segmentStart, &segment, sizeof(segment));
+            XXH_memcpy((char*)secretBuffer + segmentStart, &segment, sizeof(segment));
     }   }
 }
 
@@ -5371,8 +5372,8 @@ XXH128_canonicalFromHash(XXH128_canonical_t* dst, XXH128_hash_t hash)
         hash.high64 = XXH_swap64(hash.high64);
         hash.low64  = XXH_swap64(hash.low64);
     }
-    memcpy(dst, &hash.high64, sizeof(hash.high64));
-    memcpy((char*)dst + sizeof(hash.high64), &hash.low64, sizeof(hash.low64));
+    XXH_memcpy(dst, &hash.high64, sizeof(hash.high64));
+    XXH_memcpy((char*)dst + sizeof(hash.high64), &hash.low64, sizeof(hash.low64));
 }
 
 /*! @ingroup xxh3_family */

From 5bc519250df9325f3340c00c2b6917ba5a6b180c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 12:19:18 -0800
Subject: [PATCH 165/187] streaming consumes per block

---
 xxhash.h | 29 +++++++++++++++++++++++++++--
 1 file changed, 27 insertions(+), 2 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 691f6cd7..76af756d 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4683,8 +4683,8 @@ XXH3_consumeStripes(xxh_u64* XXH_RESTRICT acc,
  * Both XXH3_64bits_update and XXH3_128bits_update use this routine.
  */
 XXH_FORCE_INLINE XXH_errorcode
-XXH3_update(XXH3_state_t* state,
-            const xxh_u8* input, size_t len,
+XXH3_update(XXH3_state_t* XXH_RESTRICT const state,
+            const xxh_u8* XXH_RESTRICT input, size_t len,
             XXH3_f_accumulate_512 f_acc512,
             XXH3_f_scrambleAcc f_scramble)
 {
@@ -4727,6 +4727,31 @@ XXH3_update(XXH3_state_t* state,
         }
         XXH_ASSERT(input < bEnd);
 
+        /* Consume input per full block */
+        if ((size_t)(bEnd - input) > state->nbStripesPerBlock * XXH_STRIPE_LEN) {
+            size_t nbStripes = (size_t)(bEnd - input) / XXH_STRIPE_LEN;
+            XXH_ASSERT(state->nbStripesPerBlock >= state->nbStripesSoFar);
+            /* join to current block's end */
+            {   size_t const nbStripesToEnd = state->nbStripesPerBlock - state->nbStripesSoFar;
+                XXH_ASSERT(nbStripes <= nbStripes);
+                XXH3_accumulate(state->acc, input, secret + state->nbStripesSoFar * XXH_SECRET_CONSUME_RATE, nbStripesToEnd, f_acc512);
+                f_scramble(state->acc, secret + state->secretLimit);
+                state->nbStripesSoFar = 0;
+                input += nbStripesToEnd * XXH_STRIPE_LEN;
+                nbStripes -= nbStripesToEnd;
+            }
+            /* consume per entire blocks */
+            while(nbStripes > state->nbStripesPerBlock) {
+                XXH3_accumulate(state->acc, input, secret, state->nbStripesPerBlock, f_acc512);
+                f_scramble(state->acc, secret + state->secretLimit);
+                input += state->nbStripesPerBlock * XXH_STRIPE_LEN;
+                nbStripes -= state->nbStripesPerBlock;
+            }
+            /* pay attention to potentially last partial stripe */
+            if (bEnd - input < XXH_STRIPE_LEN) {
+                XXH_memcpy(state->buffer + sizeof(state->buffer) - XXH_STRIPE_LEN, input - XXH_STRIPE_LEN, XXH_STRIPE_LEN);
+        }   }
+
         /* Consume input by a multiple of internal buffer size */
         if (bEnd - input > XXH3_INTERNALBUFFER_SIZE) {
             const xxh_u8* const limit = bEnd - XXH3_INTERNALBUFFER_SIZE;

From 9c1a86e25989a0ba023443fd1fc3d0896c9343e1 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 15:11:33 -0800
Subject: [PATCH 166/187] use stack space for accumulators

gcc and msvc seem to suffer greatly
when operating accumulators directly into state.
---
 xxhash.h | 80 +++++++++++++++++++++++++++++++++++++-------------------
 1 file changed, 53 insertions(+), 27 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 76af756d..84344ea7 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4679,6 +4679,11 @@ XXH3_consumeStripes(xxh_u64* XXH_RESTRICT acc,
     }
 }
 
+#ifndef XXH3_STREAM_USE_STACK
+# ifndef __clang__ /* clang doesn't need additional stack space */
+#   define XXH3_STREAM_USE_STACK 1
+# endif
+#endif
 /*
  * Both XXH3_64bits_update and XXH3_128bits_update use this routine.
  */
@@ -4693,9 +4698,18 @@ XXH3_update(XXH3_state_t* XXH_RESTRICT const state,
         return XXH_OK;
     }
 
+    XXH_ASSERT(state != NULL);
     {   const xxh_u8* const bEnd = input + len;
         const unsigned char* const secret = (state->extSecret == NULL) ? state->customSecret : state->extSecret;
-
+#if defined(XXH3_STREAM_USE_STACK) && XXH3_STREAM_USE_STACK >= 1
+        /* For some reason, gcc and MSVC seem to suffer greatly
+         * when operating accumulators directly into state.
+         * Operating into stack space seems to enable proper optimization.
+         * clang, on the other hand, doesn't seem to need this trick */
+        xxh_u64 acc[8]; memcpy(acc, state->acc, sizeof(acc));
+#else
+        xxh_u64* XXH_RESTRICT const acc = state->acc;
+#endif
         state->totalLen += len;
         XXH_ASSERT(state->bufferedSize <= XXH3_INTERNALBUFFER_SIZE);
 
@@ -4718,7 +4732,7 @@ XXH3_update(XXH3_state_t* XXH_RESTRICT const state,
             size_t const loadSize = XXH3_INTERNALBUFFER_SIZE - state->bufferedSize;
             XXH_memcpy(state->buffer + state->bufferedSize, input, loadSize);
             input += loadSize;
-            XXH3_consumeStripes(state->acc,
+            XXH3_consumeStripes(acc,
                                &state->nbStripesSoFar, state->nbStripesPerBlock,
                                 state->buffer, XXH3_INTERNALBUFFER_STRIPES,
                                 secret, state->secretLimit,
@@ -4727,50 +4741,62 @@ XXH3_update(XXH3_state_t* XXH_RESTRICT const state,
         }
         XXH_ASSERT(input < bEnd);
 
-        /* Consume input per full block */
+        /* large input to consume : ingest per full block */
         if ((size_t)(bEnd - input) > state->nbStripesPerBlock * XXH_STRIPE_LEN) {
-            size_t nbStripes = (size_t)(bEnd - input) / XXH_STRIPE_LEN;
+            size_t nbStripes = (size_t)(bEnd - 1 - input) / XXH_STRIPE_LEN;
             XXH_ASSERT(state->nbStripesPerBlock >= state->nbStripesSoFar);
             /* join to current block's end */
             {   size_t const nbStripesToEnd = state->nbStripesPerBlock - state->nbStripesSoFar;
                 XXH_ASSERT(nbStripes <= nbStripes);
-                XXH3_accumulate(state->acc, input, secret + state->nbStripesSoFar * XXH_SECRET_CONSUME_RATE, nbStripesToEnd, f_acc512);
-                f_scramble(state->acc, secret + state->secretLimit);
+                XXH3_accumulate(acc, input, secret + state->nbStripesSoFar * XXH_SECRET_CONSUME_RATE, nbStripesToEnd, f_acc512);
+                f_scramble(acc, secret + state->secretLimit);
                 state->nbStripesSoFar = 0;
                 input += nbStripesToEnd * XXH_STRIPE_LEN;
                 nbStripes -= nbStripesToEnd;
             }
             /* consume per entire blocks */
-            while(nbStripes > state->nbStripesPerBlock) {
-                XXH3_accumulate(state->acc, input, secret, state->nbStripesPerBlock, f_acc512);
-                f_scramble(state->acc, secret + state->secretLimit);
+            while(nbStripes >= state->nbStripesPerBlock) {
+                XXH3_accumulate(acc, input, secret, state->nbStripesPerBlock, f_acc512);
+                f_scramble(acc, secret + state->secretLimit);
                 input += state->nbStripesPerBlock * XXH_STRIPE_LEN;
                 nbStripes -= state->nbStripesPerBlock;
             }
-            /* pay attention to potentially last partial stripe */
-            if (bEnd - input < XXH_STRIPE_LEN) {
-                XXH_memcpy(state->buffer + sizeof(state->buffer) - XXH_STRIPE_LEN, input - XXH_STRIPE_LEN, XXH_STRIPE_LEN);
-        }   }
-
-        /* Consume input by a multiple of internal buffer size */
-        if (bEnd - input > XXH3_INTERNALBUFFER_SIZE) {
-            const xxh_u8* const limit = bEnd - XXH3_INTERNALBUFFER_SIZE;
-            do {
-                XXH3_consumeStripes(state->acc,
-                                   &state->nbStripesSoFar, state->nbStripesPerBlock,
-                                    input, XXH3_INTERNALBUFFER_STRIPES,
-                                    secret, state->secretLimit,
-                                    f_acc512, f_scramble);
-                input += XXH3_INTERNALBUFFER_SIZE;
-            } while (input<limit);
-            /* for last partial stripe */
+            /* consume last partial block */
+            XXH3_accumulate(acc, input, secret, nbStripes, f_acc512);
+            input += nbStripes * XXH_STRIPE_LEN;
+            XXH_ASSERT(input < bEnd);  /* at least some bytes left */
+            state->nbStripesSoFar = nbStripes;
+            /* buffer predecessor of last partial stripe */
             XXH_memcpy(state->buffer + sizeof(state->buffer) - XXH_STRIPE_LEN, input - XXH_STRIPE_LEN, XXH_STRIPE_LEN);
+            XXH_ASSERT(bEnd - input <= XXH_STRIPE_LEN);
+        } else {
+            /* content to consume <= block size */
+            /* Consume input by a multiple of internal buffer size */
+            if (bEnd - input > XXH3_INTERNALBUFFER_SIZE) {
+                const xxh_u8* const limit = bEnd - XXH3_INTERNALBUFFER_SIZE;
+                do {
+                    XXH3_consumeStripes(acc,
+                                       &state->nbStripesSoFar, state->nbStripesPerBlock,
+                                        input, XXH3_INTERNALBUFFER_STRIPES,
+                                        secret, state->secretLimit,
+                                        f_acc512, f_scramble);
+                    input += XXH3_INTERNALBUFFER_SIZE;
+                } while (input<limit);
+                /* buffer predecessor of last partial stripe */
+                XXH_memcpy(state->buffer + sizeof(state->buffer) - XXH_STRIPE_LEN, input - XXH_STRIPE_LEN, XXH_STRIPE_LEN);
+            }
         }
-        XXH_ASSERT(input < bEnd);
 
         /* Some remaining input (always) : buffer it */
+        XXH_ASSERT(input < bEnd);
+        XXH_ASSERT(bEnd - input <= XXH3_INTERNALBUFFER_SIZE);
+        XXH_ASSERT(state->bufferedSize == 0);
         XXH_memcpy(state->buffer, input, (size_t)(bEnd-input));
         state->bufferedSize = (XXH32_hash_t)(bEnd-input);
+#if defined(XXH3_STREAM_USE_STACK) && XXH3_STREAM_USE_STACK >= 1
+        /* save stack accumulators into state */
+        memcpy(state->acc, acc, sizeof(acc));
+#endif
     }
 
     return XXH_OK;

From 3e41ababbf51f4fc2fcd653f49ccb5f7c761fe26 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 17:17:20 -0800
Subject: [PATCH 167/187] align stack accumulators

---
 xxhash.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/xxhash.h b/xxhash.h
index 62abb440..69c37be2 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -4711,7 +4711,7 @@ XXH3_update(XXH3_state_t* XXH_RESTRICT const state,
          * when operating accumulators directly into state.
          * Operating into stack space seems to enable proper optimization.
          * clang, on the other hand, doesn't seem to need this trick */
-        xxh_u64 acc[8]; memcpy(acc, state->acc, sizeof(acc));
+        XXH_ALIGN(XXH_ACC_ALIGN) xxh_u64 acc[8]; memcpy(acc, state->acc, sizeof(acc));
 #else
         xxh_u64* XXH_RESTRICT const acc = state->acc;
 #endif

From 8658b5e55f512ed38cdfff404e1cfd16601b167e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 17:50:13 -0800
Subject: [PATCH 168/187] improve doc about prefetching

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 760199ee..5ba73bb3 100644
--- a/README.md
+++ b/README.md
@@ -110,8 +110,8 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                            It is (slightly) detrimental on platform with good unaligned memory access performance (same instruction for both aligned and unaligned accesses).
                            This option is automatically disabled on `x86`, `x64` and `aarch64`, and enabled on all other platforms.
 - `XXH_VECTOR` : manually select a vector instruction set (default: auto-selected at compilation time). Available instruction sets are `XXH_SCALAR`, `XXH_SSE2`, `XXH_AVX2`, `XXH_AVX512`, `XXH_NEON` and `XXH_VSX`. Compiler may require additional flags to ensure proper support (for example, `gcc` on linux will require `-mavx2` for AVX2, and `-mavx512f` for AVX512).
-- `XXH_NO_PREFETCH` : disable prefetching. XXH3 only.
-- `XXH_PREFETCH_DIST` : select prefetching distance. XXH3 only.
+- `XXH_NO_PREFETCH` : disable prefetching. Some platforms or situations may perform better without prefetching. XXH3 only.
+- `XXH_PREFETCH_DIST` : select prefetching distance. For close-to-metal adaptation to specific hardware platforms. XXH3 only.
 - `XXH_NO_INLINE_HINTS`: By default, xxHash uses `__attribute__((always_inline))` and `__forceinline` to improve performance at the cost of code size.
                          Defining this macro to 1 will mark all internal functions as `static`, allowing the compiler to decide whether to inline a function or not.
                          This is very useful when optimizing for smallest binary size,

From 6a23376980f12cda2d627c1a213e0e24266f9856 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 20:14:13 -0800
Subject: [PATCH 169/187] added variant XXH3_64bits_reset_withSecretandSeed()

---
 cli/xsum_sanity_check.c | 14 ++++++++
 xxhash.h                | 77 +++++++++++++++++++++++++++++------------
 2 files changed, 69 insertions(+), 22 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 8f361687..1df25174 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -408,8 +408,22 @@ static void XSUM_testXXH3(const void* data, const XSUM_testdata64_t* testData)
                 (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
             XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
         }
+
+        /* check that streaming with a combination of
+         * XXH3_generateSecret_fromSeed() and XXH3_64bits_reset_withSecretandSeed()
+         * results in exactly the same hash generation as XXH3_64bits_reset_withSeed() */
+        {   char secretBuffer[XXH3_SECRET_DEFAULT_SIZE+1];
+            char* const secret = secretBuffer + 1;  /* intentional unalignment */
+            XXH3_generateSecret_fromSeed(secret, seed);
+            /* single ingestion */
+            (void)XXH3_64bits_reset_withSecretandSeed(state, secret, XXH3_SECRET_DEFAULT_SIZE, seed);
+            (void)XXH3_64bits_update(state, data, len);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+        }
+
         XXH3_freeState(state);
     }
+
 }
 
 static void XSUM_testXXH3_withSecret(const void* data, const void* secret,
diff --git a/xxhash.h b/xxhash.h
index 5e029f4f..1add4e0b 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -157,6 +157,7 @@ extern "C" {
 #  undef XXH3_64bits
 #  undef XXH3_64bits_withSecret
 #  undef XXH3_64bits_withSeed
+#  undef XXH3_64bits_withSecretandSeed
 #  undef XXH3_createState
 #  undef XXH3_freeState
 #  undef XXH3_copyState
@@ -291,6 +292,7 @@ extern "C" {
 #  define XXH3_64bits_reset XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_reset)
 #  define XXH3_64bits_reset_withSeed XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_reset_withSeed)
 #  define XXH3_64bits_reset_withSecret XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_reset_withSecret)
+#  define XXH3_64bits_reset_withSecretandSeed XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_reset_withSecretandSeed)
 #  define XXH3_64bits_update XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_update)
 #  define XXH3_64bits_digest XXH_NAME2(XXH_NAMESPACE, XXH3_64bits_digest)
 #  define XXH3_generateSecret XXH_NAME2(XXH_NAMESPACE, XXH3_generateSecret)
@@ -1069,7 +1071,7 @@ struct XXH3_state_s {
        /*!< The internal buffer. @see XXH32_state_s::mem32 */
    XXH32_hash_t bufferedSize;
        /*!< The amount of memory in @ref buffer, @see XXH32_state_s::memsize */
-   XXH32_hash_t reserved32;
+   XXH32_hash_t useSeed;
        /*!< Reserved field. Needed for padding on 64-bit. */
    size_t nbStripesSoFar;
        /*!< Number or stripes processed. */
@@ -1105,6 +1107,12 @@ struct XXH3_state_s {
 #define XXH3_INITSTATE(XXH3_state_ptr)   { (XXH3_state_ptr)->seed = 0; }
 
 
+/* XXH128() :
+ * simple alias to pre-selected XXH3_128bits variant
+ */
+XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t seed);
+
+
 /* ===   Experimental API   === */
 /* Symbols defined below must be considered tied to a specific library version. */
 
@@ -1143,33 +1151,44 @@ XXH_PUBLIC_API void XXH3_generateSecret(void* secretBuffer, const void* customSe
 /*
  * XXH3_generateSecret_fromSeed():
  *
- * Generate the same secret as the one built from @seed when using _withSeed() variants.
+ * Generate the same secret as the _withSeed() variants.
  *
- * The resulting secret has a length XXH3_SECRET_DEFAULT_SIZE (necessarily).
+ * The resulting secret has a length of XXH3_SECRET_DEFAULT_SIZE (necessarily).
  * @secretBuffer must be already allocated, of size at least XXH3_SECRET_DEFAULT_SIZE bytes.
  *
  * The generated secret can be used in combination with
  *`*_withSecret()` and `_withSecretandSeed()` variants.
- * This generator is notably useful for `_withSecretandSeed()`,
- * as it makes this variant generate the same values as corresponding `_withSeed()` variant.
+ * This generator is notably useful in combination with `_withSecretandSeed()`,
+ * as a way to emulate a faster `_withSeed()` variant.
  */
 XXH_PUBLIC_API void XXH3_generateSecret_fromSeed(void* secretBuffer, XXH64_hash_t seed);
 
 /*
  * *_withSecretandSeed() :
- * This variants generate hash values using either
+ * These variants generate hash values using either
  * @seed for "short" keys (< XXH3_MIDSIZE_MAX = 240 bytes)
  * or @secret for "large" keys (>= XXH3_MIDSIZE_MAX).
+ *
  * This generally benefits speed, compared to `_withSeed()` or `_withSecret()`.
  * `_withSeed()` has to generate the secret on the fly for "large" keys.
- * It's fast, but can be perceptible for "not so large" keys < 1 KB.
+ * It's fast, but can be perceptible for "not so large" keys (< 1 KB).
  * `_withSecret()` has to generate the masks on the fly for "small" keys,
- * which require more instructions than _withSeed() variants.
- * _withSecretandSeed variant therefore combines the best of both worlds.
+ * which requires more instructions than _withSeed() variants.
+ * Therefore, _withSecretandSeed variant combines the best of both worlds.
+ *
  * When @secret has been generated by XXH3_generateSecret_fromSeed(),
- * this variant produces exactly the same results as `_withSeed()` variant,
- * thus offering solely a speed benefit for "large" keys,
- * since there is no need to regenerate the secret for every large key.
+ * this variant produces *exactly* the same results as `_withSeed()` variant,
+ * hence offering only a pure speed benefit on "large" input,
+ * by skipping the need to regenerate the secret for every large input.
+ *
+ * Another usage scenario is to hash the secret to a 64-bit hash value,
+ * for example with XXH3_64bits(), which then becomes the seed,
+ * and then employ both the seed and the secret in _withSecretandSeed().
+ * On top of speed, an added benefit is that each bit in the secret
+ * has a 50% chance to swap each bit in the output,
+ * via its impact to the seed.
+ * This is not guaranteed when using the secret directly in "small data" scenarios,
+ * because only portions of the secret are employed for small data.
  */
 XXH_PUBLIC_API XXH64_hash_t
 XXH3_64bits_withSecretandSeed(const void* data, size_t len,
@@ -1178,13 +1197,13 @@ XXH3_64bits_withSecretandSeed(const void* data, size_t len,
 
 XXH_PUBLIC_API XXH128_hash_t
 XXH3_128bits_withSecretandSeed(const void* data, size_t len,
-                              const void* secret, size_t secretSize,
-                              XXH64_hash_t seed64);
+                               const void* secret, size_t secretSize,
+                               XXH64_hash_t seed64);
 
-/* XXH128() :
- * simple alias to pre-selected XXH3_128bits variant
- */
-XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t seed);
+XXH_PUBLIC_API XXH_errorcode
+XXH3_64bits_reset_withSecretandSeed(XXH3_state_t* statePtr,
+                                    const void* secret, size_t secretSize,
+                                    XXH64_hash_t seed64);
 
 
 #endif  /* XXH_NO_LONG_LONG */
@@ -4657,8 +4676,8 @@ XXH3_copyState(XXH3_state_t* dst_state, const XXH3_state_t* src_state)
 
 static void
 XXH3_reset_internal(XXH3_state_t* statePtr,
-                           XXH64_hash_t seed,
-                           const void* secret, size_t secretSize)
+                    XXH64_hash_t seed,
+                    const void* secret, size_t secretSize)
 {
     size_t const initStart = offsetof(XXH3_state_t, bufferedSize);
     size_t const initLength = offsetof(XXH3_state_t, nbStripesPerBlock) - initStart;
@@ -4675,6 +4694,7 @@ XXH3_reset_internal(XXH3_state_t* statePtr,
     statePtr->acc[6] = XXH_PRIME64_5;
     statePtr->acc[7] = XXH_PRIME32_1;
     statePtr->seed = seed;
+    statePtr->useSeed = (seed != 0);
     statePtr->extSecret = (const unsigned char*)secret;
     XXH_ASSERT(secretSize >= XXH3_SECRET_SIZE_MIN);
     statePtr->secretLimit = secretSize - XXH_STRIPE_LEN;
@@ -4707,11 +4727,24 @@ XXH3_64bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
 {
     if (statePtr == NULL) return XXH_ERROR;
     if (seed==0) return XXH3_64bits_reset(statePtr);
-    if (seed != statePtr->seed) XXH3_initCustomSecret(statePtr->customSecret, seed);
+    if ((seed != statePtr->seed) || (statePtr->extSecret != NULL))
+        XXH3_initCustomSecret(statePtr->customSecret, seed);
     XXH3_reset_internal(statePtr, seed, NULL, XXH_SECRET_DEFAULT_SIZE);
     return XXH_OK;
 }
 
+/*! @ingroup xxh3_family */
+XXH_PUBLIC_API XXH_errorcode
+XXH3_64bits_reset_withSecretandSeed(XXH3_state_t* statePtr, const void* secret, size_t secretSize, XXH64_hash_t seed64)
+{
+    if (statePtr == NULL) return XXH_ERROR;
+    if (secret == NULL) return XXH_ERROR;
+    if (secretSize < XXH3_SECRET_SIZE_MIN) return XXH_ERROR;
+    XXH3_reset_internal(statePtr, seed64, secret, secretSize);
+    statePtr->useSeed = 1; /* always, even if seed64==0 */
+    return XXH_OK;
+}
+
 /* Note : when XXH3_consumeStripes() is invoked,
  * there must be a guarantee that at least one more byte must be consumed from input
  * so that the function can blindly consume all stripes using the "normal" secret segment */
@@ -4917,7 +4950,7 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_digest (const XXH3_state_t* state)
                               (xxh_u64)state->totalLen * XXH_PRIME64_1);
     }
     /* totalLen <= XXH3_MIDSIZE_MAX: digesting a short input */
-    if (state->seed)
+    if (state->useSeed)
         return XXH3_64bits_withSeed(state->buffer, (size_t)state->totalLen, state->seed);
     return XXH3_64bits_withSecret(state->buffer, (size_t)(state->totalLen),
                                   secret, state->secretLimit + XXH_STRIPE_LEN);

From f87c51515dd4f1270654209727755e8ef59e4444 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 20:58:36 -0800
Subject: [PATCH 170/187] added variant XXH3_128bits_reset_withSecretandSeed()

---
 cli/xsum_sanity_check.c | 13 +++++++++++
 xxhash.h                | 48 ++++++++++++++++++++++++-----------------
 2 files changed, 41 insertions(+), 20 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 1df25174..1346bd88 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -521,6 +521,19 @@ static void XSUM_testXXH128(const void* data, const XSUM_testdata128_t* testData
                 (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
             XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
         }
+
+        /* check that streaming with a combination of
+         * XXH3_generateSecret_fromSeed() and XXH3_128bits_reset_withSecretandSeed()
+         * results in exactly the same hash generation as XXH3_128bits_reset_withSeed() */
+        {   char secretBuffer[XXH3_SECRET_DEFAULT_SIZE+1];
+            char* const secret = secretBuffer + 1;  /* intentional unalignment */
+            XXH3_generateSecret_fromSeed(secret, seed);
+            /* single ingestion */
+            (void)XXH3_128bits_reset_withSecretandSeed(state, secret, XXH3_SECRET_DEFAULT_SIZE, seed);
+            (void)XXH3_128bits_update(state, data, len);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+        }
+
         XXH3_freeState(state);
     }
 }
diff --git a/xxhash.h b/xxhash.h
index 1add4e0b..4f9c1644 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -175,6 +175,7 @@ extern "C" {
 #  undef XXH3_128bits_reset
 #  undef XXH3_128bits_reset_withSeed
 #  undef XXH3_128bits_reset_withSecret
+#  undef XXH3_128bits_reset_withSecretandSeed
 #  undef XXH3_128bits_update
 #  undef XXH3_128bits_digest
 #  undef XXH128_isEqual
@@ -306,6 +307,7 @@ extern "C" {
 #  define XXH3_128bits_reset XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset)
 #  define XXH3_128bits_reset_withSeed XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset_withSeed)
 #  define XXH3_128bits_reset_withSecret XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset_withSecret)
+#  define XXH3_128bits_reset_withSecretandSeed XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_reset_withSecretandSeed)
 #  define XXH3_128bits_update XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_update)
 #  define XXH3_128bits_digest XXH_NAME2(XXH_NAMESPACE, XXH3_128bits_digest)
 #  define XXH128_isEqual XXH_NAME2(XXH_NAMESPACE, XXH128_isEqual)
@@ -803,13 +805,17 @@ XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSeed(const void* data, size_t len, X
  * It's possible to provide any blob of bytes as a "secret" to generate the hash.
  * This makes it more difficult for an external actor to prepare an intentional collision.
  * The main condition is that secretSize *must* be large enough (>= XXH3_SECRET_SIZE_MIN).
- * However, the quality of produced hash values depends on secret's entropy.
- * Technically, the secret must look like a bunch of random bytes.
+ * However, the quality of the secret impacts the dispersion of the hash algorithm.
+ * Therefore, the secret _must_ look like a bunch of random bytes.
  * Avoid "trivial" or structured data such as repeated sequences or a text document.
- * Whenever unsure about the "randomness" of the blob of bytes,
- * consider relabelling it as a "custom seed" instead,
- * and employ "XXH3_generateSecret()" (see below)
- * to generate a proper high entropy secret derived from the custom seed.
+ * Whenever in doubt about the "randomness" of the blob of bytes,
+ * consider employing "XXH3_generateSecret()" instead (see below).
+ * It will generate a proper high entropy secret derived from the blob of bytes.
+ * Another advantage of using XXH3_generateSecret() is that
+ * it guarantees that all bits within the initial blob of bytes
+ * will impact every bit of the output.
+ * This is not necessarily the case when using the blob of bytes directly
+ * because, when hashing _small_ inputs, only a portion of the secret is employed.
  */
 XXH_PUBLIC_API XXH64_hash_t XXH3_64bits_withSecret(const void* data, size_t len, const void* secret, size_t secretSize);
 
@@ -1205,6 +1211,11 @@ XXH3_64bits_reset_withSecretandSeed(XXH3_state_t* statePtr,
                                     const void* secret, size_t secretSize,
                                     XXH64_hash_t seed64);
 
+XXH_PUBLIC_API XXH_errorcode
+XXH3_128bits_reset_withSecretandSeed(XXH3_state_t* statePtr,
+                                     const void* secret, size_t secretSize,
+                                     XXH64_hash_t seed64);
+
 
 #endif  /* XXH_NO_LONG_LONG */
 #if defined(XXH_INLINE_ALL) || defined(XXH_PRIVATE_API)
@@ -5374,7 +5385,7 @@ XXH128(const void* input, size_t len, XXH64_hash_t seed)
 /* ===   XXH3 128-bit streaming   === */
 
 /*
- * All the functions are actually the same as for 64-bit streaming variant.
+ * All initialization and update functions are identical to 64-bit streaming variant.
  * The only difference is the finalization routine.
  */
 
@@ -5382,31 +5393,28 @@ XXH128(const void* input, size_t len, XXH64_hash_t seed)
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset(XXH3_state_t* statePtr)
 {
-    if (statePtr == NULL) return XXH_ERROR;
-    XXH3_reset_internal(statePtr, 0, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
-    return XXH_OK;
+    return XXH3_64bits_reset(statePtr);
 }
 
 /*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset_withSecret(XXH3_state_t* statePtr, const void* secret, size_t secretSize)
 {
-    if (statePtr == NULL) return XXH_ERROR;
-    XXH3_reset_internal(statePtr, 0, secret, secretSize);
-    if (secret == NULL) return XXH_ERROR;
-    if (secretSize < XXH3_SECRET_SIZE_MIN) return XXH_ERROR;
-    return XXH_OK;
+    return XXH3_64bits_reset_withSecret(statePtr, secret, secretSize);
 }
 
 /*! @ingroup xxh3_family */
 XXH_PUBLIC_API XXH_errorcode
 XXH3_128bits_reset_withSeed(XXH3_state_t* statePtr, XXH64_hash_t seed)
 {
-    if (statePtr == NULL) return XXH_ERROR;
-    if (seed==0) return XXH3_128bits_reset(statePtr);
-    if (seed != statePtr->seed) XXH3_initCustomSecret(statePtr->customSecret, seed);
-    XXH3_reset_internal(statePtr, seed, NULL, XXH_SECRET_DEFAULT_SIZE);
-    return XXH_OK;
+    return XXH3_64bits_reset_withSeed(statePtr, seed);
+}
+
+/*! @ingroup xxh3_family */
+XXH_PUBLIC_API XXH_errorcode
+XXH3_128bits_reset_withSecretandSeed(XXH3_state_t* statePtr, const void* secret, size_t secretSize, XXH64_hash_t seed)
+{
+    return XXH3_64bits_reset_withSecretandSeed(statePtr, secret, secretSize, seed);
 }
 
 /*! @ingroup xxh3_family */

From 08ae0257300ddde62ccca5b0765d66bc7a2fc1b4 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sat, 27 Nov 2021 23:02:43 -0800
Subject: [PATCH 171/187] added test for correspondance with _withSecret()

---
 cli/xsum_sanity_check.c | 39 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 39 insertions(+)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 1346bd88..3f462b7f 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -426,6 +426,11 @@ static void XSUM_testXXH3(const void* data, const XSUM_testdata64_t* testData)
 
 }
 
+
+#ifndef XXH3_MIDSIZE_MAX
+# define XXH3_MIDSIZE_MAX 240
+#endif
+
 static void XSUM_testXXH3_withSecret(const void* data, const void* secret,
                                      size_t secretSize, const XSUM_testdata64_t* testData)
 {
@@ -441,6 +446,13 @@ static void XSUM_testXXH3_withSecret(const void* data, const void* secret,
         XSUM_checkResult64(Dresult, Nresult);
     }
 
+    /* check that XXH3_64bits_withSecretandSeed()
+     * results in exactly the same return value as XXH3_64bits_withSecret() */
+    if (len > XXH3_MIDSIZE_MAX)
+    {   XSUM_U64 const Dresult = XXH3_64bits_withSecretandSeed(data, len, secret, secretSize, 0);
+        XSUM_checkResult64(Dresult, Nresult);
+    }
+
     /* streaming API test */
     {   XXH3_state_t *state = XXH3_createState();
         assert(state != NULL);
@@ -460,6 +472,16 @@ static void XSUM_testXXH3_withSecret(const void* data, const void* secret,
                 (void)XXH3_64bits_update(state, ((const char*)data)+pos, 1);
             XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
         }
+
+        /* check that XXH3_64bits_reset_withSecretandSeed()
+         * results in exactly the same return value as XXH3_64bits_reset_withSecret() */
+         if (len > XXH3_MIDSIZE_MAX) {
+            /* single ingestion */
+            (void)XXH3_64bits_reset_withSecretandSeed(state, secret, secretSize, 0);
+            (void)XXH3_64bits_update(state, data, len);
+            XSUM_checkResult64(XXH3_64bits_digest(state), Nresult);
+        }
+
         XXH3_freeState(state);
     }
 }
@@ -551,6 +573,13 @@ static void XSUM_testXXH128_withSecret(const void* data, const void* secret, siz
         XSUM_checkResult128(Dresult, Nresult);
     }
 
+    /* check that XXH3_128bits_withSecretandSeed()
+     * results in exactly the same return value as XXH3_128bits_withSecret() */
+    if (len > XXH3_MIDSIZE_MAX)
+    {   XXH128_hash_t const Dresult = XXH3_128bits_withSecretandSeed(data, len, secret, secretSize, 0);
+        XSUM_checkResult128(Dresult, Nresult);
+    }
+
     /* streaming API test */
     {   XXH3_state_t* const state = XXH3_createState();
         assert(state != NULL);
@@ -570,6 +599,16 @@ static void XSUM_testXXH128_withSecret(const void* data, const void* secret, siz
                 (void)XXH3_128bits_update(state, ((const char*)data)+pos, 1);
             XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
         }
+
+        /* check that XXH3_128bits_reset_withSecretandSeed()
+         * results in exactly the same return value as XXH3_128bits_reset_withSecret() */
+         if (len > XXH3_MIDSIZE_MAX) {
+            /* single ingestion */
+            (void)XXH3_128bits_reset_withSecretandSeed(state, secret, secretSize, 0);
+            (void)XXH3_128bits_update(state, data, len);
+            XSUM_checkResult128(XXH3_128bits_digest(state), Nresult);
+        }
+
         XXH3_freeState(state);
     }
 }

From aeda0f9438591913299d0f873a9dd8ab6ef2ec81 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 10:41:26 -0800
Subject: [PATCH 172/187] separate benchmark functions into xsum_bench.c

---
 Makefile                        |   6 +-
 cli/xsum_bench.c                | 435 ++++++++++++++++++++++++++++++++
 cli/xsum_bench.h                |  51 ++++
 cli/xsum_config.h               |  11 +-
 cli/xsum_os_specific.c          |   2 +-
 cli/xsum_os_specific.h          |   2 +-
 cli/xxhsum.c                    | 435 ++------------------------------
 cmake_unofficial/CMakeLists.txt |   1 +
 8 files changed, 521 insertions(+), 422 deletions(-)
 create mode 100644 cli/xsum_bench.c
 create mode 100644 cli/xsum_bench.h

diff --git a/Makefile b/Makefile
index da6e0818..dc1db341 100644
--- a/Makefile
+++ b/Makefile
@@ -75,13 +75,15 @@ XXHSUM_SRC_DIR = cli
 XXHSUM_SPLIT_SRCS = $(XXHSUM_SRC_DIR)/xxhsum.c \
                     $(XXHSUM_SRC_DIR)/xsum_os_specific.c \
                     $(XXHSUM_SRC_DIR)/xsum_output.c \
-                    $(XXHSUM_SRC_DIR)/xsum_sanity_check.c
+                    $(XXHSUM_SRC_DIR)/xsum_sanity_check.c \
+                    $(XXHSUM_SRC_DIR)/xsum_bench.c
 XXHSUM_SPLIT_OBJS = $(XXHSUM_SPLIT_SRCS:.c=.o)
 XXHSUM_HEADERS = $(XXHSUM_SRC_DIR)/xsum_config.h \
                  $(XXHSUM_SRC_DIR)/xsum_arch.h \
                  $(XXHSUM_SRC_DIR)/xsum_os_specific.h \
                  $(XXHSUM_SRC_DIR)/xsum_output.h \
-                 $(XXHSUM_SRC_DIR)/xsum_sanity_check.h
+                 $(XXHSUM_SRC_DIR)/xsum_sanity_check.h \
+                 $(XXHSUM_SRC_DIR)/xsum_bench.h
 
 ## generate CLI and libraries in release mode (default for `make`)
 .PHONY: default
diff --git a/cli/xsum_bench.c b/cli/xsum_bench.c
new file mode 100644
index 00000000..f8af2adc
--- /dev/null
+++ b/cli/xsum_bench.c
@@ -0,0 +1,435 @@
+/*
+ * xum_bench - Benchmark functions for xxhsum
+ * Copyright (C) 2013-2021 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#include "xsum_config.h"
+#include "xsum_output.h"
+#include "xsum_bench.h"
+#include "xsum_sanity_check.h" /* XSUM_fillTestBuffer */
+#include "xsum_os_specific.h"  /* XSUM_getFileSize */
+#include <stdlib.h>
+#include <assert.h>
+#include <string.h>
+#ifndef XXH_STATIC_LINKING_ONLY
+#  define XXH_STATIC_LINKING_ONLY
+#endif
+#include "../xxhash.h"
+
+#include <stdio.h>  /* FILE */
+#include <time.h>   /* clock_t, clock, CLOCKS_PER_SEC */
+#include <errno.h>  /* errno */
+
+#define TIMELOOP_S 1
+#define TIMELOOP  (TIMELOOP_S * CLOCKS_PER_SEC)   /* target timing per iteration */
+#define TIMELOOP_MIN (TIMELOOP / 2)               /* minimum timing to validate a result */
+
+#define MAX_MEM    (2 GB - 64 MB)
+
+static clock_t XSUM_clockSpan( clock_t start )
+{
+    return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
+}
+
+static size_t XSUM_findMaxMem(XSUM_U64 requiredMem)
+{
+    size_t const step = 64 MB;
+    void* testmem = NULL;
+
+    requiredMem = (((requiredMem >> 26) + 1) << 26);
+    requiredMem += 2*step;
+    if (requiredMem > MAX_MEM) requiredMem = MAX_MEM;
+
+    while (!testmem) {
+        if (requiredMem > step) requiredMem -= step;
+        else requiredMem >>= 1;
+        testmem = malloc ((size_t)requiredMem);
+    }
+    free (testmem);
+
+    /* keep some space available */
+    if (requiredMem > step) requiredMem -= step;
+    else requiredMem >>= 1;
+
+    return (size_t)requiredMem;
+}
+
+/*
+ * A secret buffer used for benchmarking XXH3's withSecret variants.
+ *
+ * In order for the bench to be realistic, the secret buffer would need to be
+ * pre-generated.
+ *
+ * Adding a pointer to the parameter list would be messy.
+ */
+static XSUM_U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
+
+/*
+ * Wrappers for the benchmark.
+ *
+ * If you would like to add other hashes to the bench, create a wrapper and add
+ * it to the g_hashesToBench table. It will automatically be added.
+ */
+typedef XSUM_U32 (*hashFunction)(const void* buffer, size_t bufferSize, XSUM_U32 seed);
+
+static XSUM_U32 localXXH32(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return XXH32(buffer, bufferSize, seed);
+}
+static XSUM_U32 localXXH32_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH32_state_t state;
+    (void)seed;
+    XXH32_reset(&state, seed);
+    XXH32_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH32_digest(&state);
+}
+static XSUM_U32 localXXH64(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return (XSUM_U32)XXH64(buffer, bufferSize, seed);
+}
+static XSUM_U32 localXXH64_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH64_state_t state;
+    (void)seed;
+    XXH64_reset(&state, seed);
+    XXH64_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH64_digest(&state);
+}
+static XSUM_U32 localXXH3_64b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)XXH3_64bits(buffer, bufferSize);
+}
+static XSUM_U32 localXXH3_64b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return (XSUM_U32)XXH3_64bits_withSeed(buffer, bufferSize, seed);
+}
+static XSUM_U32 localXXH3_64b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)XXH3_64bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf));
+}
+static XSUM_U32 localXXH3_128b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)(XXH3_128bits(buffer, bufferSize).low64);
+}
+static XSUM_U32 localXXH3_128b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return (XSUM_U32)(XXH3_128bits_withSeed(buffer, bufferSize, seed).low64);
+}
+static XSUM_U32 localXXH3_128b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)(XXH3_128bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf)).low64);
+}
+static XSUM_U32 localXXH3_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    (void)seed;
+    XXH3_64bits_reset(&state);
+    XXH3_64bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH3_64bits_digest(&state);
+}
+static XSUM_U32 localXXH3_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    XXH3_INITSTATE(&state);
+    XXH3_64bits_reset_withSeed(&state, (XXH64_hash_t)seed);
+    XXH3_64bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH3_64bits_digest(&state);
+}
+static XSUM_U32 localXXH128_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    (void)seed;
+    XXH3_128bits_reset(&state);
+    XXH3_128bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
+}
+static XSUM_U32 localXXH128_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    XXH3_INITSTATE(&state);
+    XXH3_128bits_reset_withSeed(&state, (XXH64_hash_t)seed);
+    XXH3_128bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
+}
+
+
+typedef struct {
+    const char*  name;
+    hashFunction func;
+} hashInfo;
+
+static const hashInfo g_hashesToBench[] = {
+    { "XXH32",             &localXXH32 },
+    { "XXH64",             &localXXH64 },
+    { "XXH3_64b",          &localXXH3_64b },
+    { "XXH3_64b w/seed",   &localXXH3_64b_seeded },
+    { "XXH3_64b w/secret", &localXXH3_64b_secret },
+    { "XXH128",            &localXXH3_128b },
+    { "XXH128 w/seed",     &localXXH3_128b_seeded },
+    { "XXH128 w/secret",   &localXXH3_128b_secret },
+    { "XXH32_stream",      &localXXH32_stream },
+    { "XXH64_stream",      &localXXH64_stream },
+    { "XXH3_stream",       &localXXH3_stream },
+    { "XXH3_stream w/seed",&localXXH3_stream_seeded },
+    { "XXH128_stream",     &localXXH128_stream },
+    { "XXH128_stream w/seed",&localXXH128_stream_seeded },
+};
+#define NB_HASHFUNC (sizeof(g_hashesToBench) / sizeof(*g_hashesToBench))
+
+#define NB_TESTFUNC (1 + 2 * NB_HASHFUNC)
+int const g_nbTestFunctions = NB_TESTFUNC;
+char g_testIDs[NB_TESTFUNC] = { 0 };
+const char k_testIDs_default[NB_TESTFUNC] = { 0,
+        1 /*XXH32*/, 0,
+        1 /*XXH64*/, 0,
+        1 /*XXH3*/, 0, 0, 0, 0, 0,
+        1 /*XXH128*/ };
+
+int g_nbIterations = NBLOOPS_DEFAULT;
+#define HASHNAME_MAX 29
+static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
+                           const void* buffer, size_t bufferSize)
+{
+    XSUM_U32 nbh_perIteration = (XSUM_U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
+    int iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
+    double fastestH = 100000000.;
+    assert(HASHNAME_MAX > 2);
+    XSUM_logVerbose(2, "\r%80s\r", "");       /* Clean display line */
+
+    for (iterationNb = 1; iterationNb <= nbIterations; iterationNb++) {
+        XSUM_U32 r=0;
+        clock_t cStart;
+
+        XSUM_logVerbose(2, "%2i-%-*.*s : %10u ->\r",
+                        iterationNb,
+                        HASHNAME_MAX, HASHNAME_MAX, hName,
+                        (unsigned)bufferSize);
+        cStart = clock();
+        while (clock() == cStart);   /* starts clock() at its exact beginning */
+        cStart = clock();
+
+        {   XSUM_U32 u;
+            for (u=0; u<nbh_perIteration; u++)
+                r += h(buffer, bufferSize, u);
+        }
+        if (r==0) XSUM_logVerbose(3,".\r");  /* do something with r to defeat compiler "optimizing" hash away */
+
+        {   clock_t const nbTicks = XSUM_clockSpan(cStart);
+            double const ticksPerHash = ((double)nbTicks / TIMELOOP) / nbh_perIteration;
+            /*
+             * clock() is the only decent portable timer, but it isn't very
+             * precise.
+             *
+             * Sometimes, this lack of precision is enough that the benchmark
+             * finishes before there are enough ticks to get a meaningful result.
+             *
+             * For example, on a Core 2 Duo (without any sort of Turbo Boost),
+             * the imprecise timer caused peculiar results like so:
+             *
+             *    XXH3_64b                   4800.0 MB/s // conveniently even
+             *    XXH3_64b unaligned         4800.0 MB/s
+             *    XXH3_64b seeded            9600.0 MB/s // magical 2x speedup?!
+             *    XXH3_64b seeded unaligned  4800.0 MB/s
+             *
+             * If we sense a suspiciously low number of ticks, we increase the
+             * iterations until we can get something meaningful.
+             */
+            if (nbTicks < TIMELOOP_MIN) {
+                /* Not enough time spent in benchmarking, risk of rounding bias */
+                if (nbTicks == 0) { /* faster than resolution timer */
+                    nbh_perIteration *= 100;
+                } else {
+                    /*
+                     * update nbh_perIteration so that the next round lasts
+                     * approximately 1 second.
+                     */
+                    double nbh_perSecond = (1 / ticksPerHash) + 1;
+                    if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
+                    nbh_perIteration = (XSUM_U32)nbh_perSecond;
+                }
+                /* g_nbIterations==0 => quick evaluation, no claim of accuracy */
+                if (g_nbIterations>0) {
+                    iterationNb--;   /* new round for a more accurate speed evaluation */
+                    continue;
+                }
+            }
+            if (ticksPerHash < fastestH) fastestH = ticksPerHash;
+            if (fastestH>0.) { /* avoid div by zero */
+                XSUM_logVerbose(2, "%2i-%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \r",
+                            iterationNb,
+                            HASHNAME_MAX, HASHNAME_MAX, hName,
+                            (unsigned)bufferSize,
+                            (double)1 / fastestH,
+                            ((double)bufferSize / (1 MB)) / fastestH);
+        }   }
+        {   double nbh_perSecond = (1 / fastestH) + 1;
+            if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
+            nbh_perIteration = (XSUM_U32)nbh_perSecond;
+        }
+    }
+    XSUM_logVerbose(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
+                    testID,
+                    HASHNAME_MAX, HASHNAME_MAX, hName,
+                    (unsigned)bufferSize,
+                    (double)1 / fastestH,
+                    ((double)bufferSize / (1 MB)) / fastestH);
+    if (XSUM_logLevel<1)
+        XSUM_logVerbose(0, "%u, ", (unsigned)((double)1 / fastestH));
+}
+
+
+/*
+ * Allocates a string containing s1 and s2 concatenated. Acts like strdup.
+ * The result must be freed.
+ */
+static char* XSUM_strcatDup(const char* s1, const char* s2)
+{
+    assert(s1 != NULL);
+    assert(s2 != NULL);
+    {   size_t len1 = strlen(s1);
+        size_t len2 = strlen(s2);
+        char* buf = (char*)malloc(len1 + len2 + 1);
+        if (buf != NULL) {
+            /* strcpy(buf, s1) */
+            memcpy(buf, s1, len1);
+            /* strcat(buf, s2) */
+            memcpy(buf + len1, s2, len2 + 1);
+        }
+        return buf;
+    }
+}
+
+
+/*!
+ * XSUM_benchMem():
+ * buffer: Must be 16-byte aligned.
+ * The real allocated size of buffer is supposed to be >= (bufferSize+3).
+ * returns: 0 on success, 1 if error (invalid mode selected)
+ */
+static void XSUM_benchMem(const void* buffer, size_t bufferSize)
+{
+    assert((((size_t)buffer) & 15) == 0);  /* ensure alignment */
+    XSUM_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
+    {   int i;
+        for (i = 1; i < (int)NB_TESTFUNC; i++) {
+            int const hashFuncID = (i-1) / 2;
+            assert(g_hashesToBench[hashFuncID].name != NULL);
+            if (g_testIDs[i] == 0) continue;
+            /* aligned */
+            if ((i % 2) == 1) {
+                XSUM_benchHash(g_hashesToBench[hashFuncID].func, g_hashesToBench[hashFuncID].name, i, buffer, bufferSize);
+            }
+            /* unaligned */
+            if ((i % 2) == 0) {
+                /* Append "unaligned". */
+                char* const hashNameBuf = XSUM_strcatDup(g_hashesToBench[hashFuncID].name, " unaligned");
+                assert(hashNameBuf != NULL);
+                XSUM_benchHash(g_hashesToBench[hashFuncID].func, hashNameBuf, i, ((const char*)buffer)+3, bufferSize);
+                free(hashNameBuf);
+            }
+    }   }
+}
+
+static size_t XSUM_selectBenchedSize(const char* fileName)
+{
+    XSUM_U64 const inFileSize = XSUM_getFileSize(fileName);
+    size_t benchedSize = (size_t) XSUM_findMaxMem(inFileSize);
+    if ((XSUM_U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
+    if (benchedSize < inFileSize) {
+        XSUM_log("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
+    }
+    return benchedSize;
+}
+
+
+int XSUM_benchFiles(const char* fileNamesTable[], int nbFiles)
+{
+    int fileIdx;
+    for (fileIdx=0; fileIdx<nbFiles; fileIdx++) {
+        const char* const inFileName = fileNamesTable[fileIdx];
+        assert(inFileName != NULL);
+
+        {   FILE* const inFile = XSUM_fopen( inFileName, "rb" );
+            size_t const benchedSize = XSUM_selectBenchedSize(inFileName);
+            char* const buffer = (char*)calloc(benchedSize+16+3, 1);
+            void* const alignedBuffer = (buffer+15) - (((size_t)(buffer+15)) & 0xF);  /* align on next 16 bytes */
+
+            /* Checks */
+            if (inFile==NULL){
+                XSUM_log("Error: Could not open '%s': %s.\n", inFileName, strerror(errno));
+                free(buffer);
+                exit(11);
+            }
+            if(!buffer) {
+                XSUM_log("\nError: Out of memory.\n");
+                fclose(inFile);
+                exit(12);
+            }
+
+            /* Fill input buffer */
+            {   size_t const readSize = fread(alignedBuffer, 1, benchedSize, inFile);
+                fclose(inFile);
+                if(readSize != benchedSize) {
+                    XSUM_log("\nError: Could not read '%s': %s.\n", inFileName, strerror(errno));
+                    free(buffer);
+                    exit(13);
+            }   }
+
+            /* bench */
+            XSUM_benchMem(alignedBuffer, benchedSize);
+
+            free(buffer);
+    }   }
+    return 0;
+}
+
+
+int XSUM_benchInternal(size_t keySize)
+{
+    void* const buffer = calloc(keySize+16+3, 1);
+    if (buffer == NULL) {
+        XSUM_log("\nError: Out of memory.\n");
+        exit(12);
+    }
+
+    {   const void* const alignedBuffer = ((char*)buffer+15) - (((size_t)((char*)buffer+15)) & 0xF);  /* align on next 16 bytes */
+
+        /* bench */
+        XSUM_logVerbose(1, "Sample of ");
+        if (keySize > 10 KB) {
+            XSUM_logVerbose(1, "%u KB", (unsigned)(keySize >> 10));
+        } else {
+            XSUM_logVerbose(1, "%u bytes", (unsigned)keySize);
+        }
+        XSUM_logVerbose(1, "...        \n");
+
+        XSUM_benchMem(alignedBuffer, keySize);
+        free(buffer);
+    }
+    return 0;
+}
diff --git a/cli/xsum_bench.h b/cli/xsum_bench.h
new file mode 100644
index 00000000..6faaec8c
--- /dev/null
+++ b/cli/xsum_bench.h
@@ -0,0 +1,51 @@
+/*
+ * xsum_bench - Benchmark functions for xxhsum
+ * Copyright (C) 2013-2021 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#ifndef XSUM_BENCH_H
+#define XSUM_BENCH_H
+
+#include <stddef.h>  /* size_t */
+
+#define NBLOOPS_DEFAULT    3    /* Default number of benchmark iterations */
+
+extern int const g_nbTestFunctions;
+extern char g_testIDs[];  /* size : g_nbTestFunctions */
+extern const char k_testIDs_default[];
+extern int g_nbIterations;
+
+int XSUM_benchInternal(size_t keySize);
+int XSUM_benchFiles(const char* fileNamesTable[], int nbFiles);
+
+
+#ifdef __cplusplus
+extern "C" {
+#endif
+
+
+#ifdef __cplusplus
+}
+#endif
+
+#endif /* XSUM_BENCH_H */
diff --git a/cli/xsum_config.h b/cli/xsum_config.h
index 9222144d..eec5528d 100644
--- a/cli/xsum_config.h
+++ b/cli/xsum_config.h
@@ -1,6 +1,6 @@
 /*
  * xxhsum - Command line interface for xxhash algorithms
- * Copyright (C) 2013-2020 Yann Collet
+ * Copyright (C) 2013-2021 Yann Collet
  *
  * GPL v2 License
  *
@@ -202,4 +202,13 @@
     typedef unsigned long long XSUM_U64;
 #endif /* not C++/C99 */
 
+/* ***************************
+ * Common constants
+ * ***************************/
+
+#define KB *( 1<<10)
+#define MB *( 1<<20)
+#define GB *(1U<<30)
+
+
 #endif /* XSUM_CONFIG_H */
diff --git a/cli/xsum_os_specific.c b/cli/xsum_os_specific.c
index 8f48ce07..b34a359a 100644
--- a/cli/xsum_os_specific.c
+++ b/cli/xsum_os_specific.c
@@ -112,7 +112,7 @@ static int XSUM_stat(const char* infilename, XSUM_stat_t* statbuf)
 }
 
 #ifndef XSUM_NO_MAIN
-int main(int argc, char* argv[])
+int main(int argc, const char* argv[])
 {
     return XSUM_main(argc, argv);
 }
diff --git a/cli/xsum_os_specific.h b/cli/xsum_os_specific.h
index b3562b26..4251cf09 100644
--- a/cli/xsum_os_specific.h
+++ b/cli/xsum_os_specific.h
@@ -39,7 +39,7 @@ extern "C" {
  *
  * Functions like main(), but is passed UTF-8 arguments even on Windows.
  */
-XSUM_API int XSUM_main(int argc, char* argv[]);
+XSUM_API int XSUM_main(int argc, const char* argv[]);
 
 /*
  * Returns whether stream is a console.
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index 56bce1ba..864323a0 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -35,10 +35,12 @@
 #include "xsum_os_specific.h"
 #include "xsum_output.h"
 #include "xsum_sanity_check.h"
+#include "xsum_bench.h"
 #ifdef XXH_INLINE_ALL
 #  include "xsum_os_specific.c"
 #  include "xsum_output.c"
 #  include "xsum_sanity_check.c"
+#  include "xsum_bench.c"
 #endif
 
 /* ************************************
@@ -50,7 +52,6 @@
 #include <stdio.h>      /* fprintf, fopen, ftello64, fread, stdin, stdout, _fileno (when present) */
 #include <sys/types.h>  /* stat, stat64, _stat64 */
 #include <sys/stat.h>   /* stat, stat64, _stat64 */
-#include <time.h>       /* clock_t, clock, CLOCKS_PER_SEC */
 #include <assert.h>     /* assert */
 #include <errno.h>      /* errno */
 
@@ -78,19 +79,6 @@ static const char author[] = "Yann Collet";
                     exename, XSUM_PROGRAM_VERSION, author, \
                     g_nbBits, XSUM_ARCH, ENDIAN_NAME, XSUM_CC_VERSION
 
-#define KB *( 1<<10)
-#define MB *( 1<<20)
-#define GB *(1U<<30)
-
-static size_t XSUM_DEFAULT_SAMPLE_SIZE = 100 KB;
-#define NBLOOPS    3                              /* Default number of benchmark iterations */
-#define TIMELOOP_S 1
-#define TIMELOOP  (TIMELOOP_S * CLOCKS_PER_SEC)   /* target timing per iteration */
-#define TIMELOOP_MIN (TIMELOOP / 2)               /* minimum timing to validate a result */
-#define XXHSUM32_DEFAULT_SEED 0                   /* Default seed for algo_xxh32 */
-#define XXHSUM64_DEFAULT_SEED 0                   /* Default seed for algo_xxh64 */
-
-#define MAX_MEM    (2 GB - 64 MB)
 
 static const char stdinName[] = "-";
 static const char stdinFileName[] = "stdin";
@@ -104,406 +92,16 @@ static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & X
 /* Maximum acceptable line length. */
 #define MAX_LINE_LENGTH (32 KB)
 
+static size_t XSUM_DEFAULT_SAMPLE_SIZE = 100 KB;
 
-/* ************************************
- *  Local variables
- **************************************/
-static XSUM_U32 g_nbIterations = NBLOOPS;
-
-
-/* ************************************
- *  Benchmark Functions
- **************************************/
-static clock_t XSUM_clockSpan( clock_t start )
-{
-    return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
-}
-
-static size_t XSUM_findMaxMem(XSUM_U64 requiredMem)
-{
-    size_t const step = 64 MB;
-    void* testmem = NULL;
-
-    requiredMem = (((requiredMem >> 26) + 1) << 26);
-    requiredMem += 2*step;
-    if (requiredMem > MAX_MEM) requiredMem = MAX_MEM;
-
-    while (!testmem) {
-        if (requiredMem > step) requiredMem -= step;
-        else requiredMem >>= 1;
-        testmem = malloc ((size_t)requiredMem);
-    }
-    free (testmem);
-
-    /* keep some space available */
-    if (requiredMem > step) requiredMem -= step;
-    else requiredMem >>= 1;
-
-    return (size_t)requiredMem;
-}
-
-/*
- * Allocates a string containing s1 and s2 concatenated. Acts like strdup.
- * The result must be freed.
- */
-static char* XSUM_strcatDup(const char* s1, const char* s2)
-{
-    assert(s1 != NULL);
-    assert(s2 != NULL);
-    {   size_t len1 = strlen(s1);
-        size_t len2 = strlen(s2);
-        char* buf = (char*)malloc(len1 + len2 + 1);
-        if (buf != NULL) {
-            /* strcpy(buf, s1) */
-            memcpy(buf, s1, len1);
-            /* strcat(buf, s2) */
-            memcpy(buf + len1, s2, len2 + 1);
-        }
-        return buf;
-    }
-}
-
-
-/*
- * A secret buffer used for benchmarking XXH3's withSecret variants.
- *
- * In order for the bench to be realistic, the secret buffer would need to be
- * pre-generated.
- *
- * Adding a pointer to the parameter list would be messy.
- */
-static XSUM_U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
-
-/*
- * Wrappers for the benchmark.
- *
- * If you would like to add other hashes to the bench, create a wrapper and add
- * it to the g_hashesToBench table. It will automatically be added.
- */
-typedef XSUM_U32 (*hashFunction)(const void* buffer, size_t bufferSize, XSUM_U32 seed);
-
-static XSUM_U32 localXXH32(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return XXH32(buffer, bufferSize, seed);
-}
-static XSUM_U32 localXXH32_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH32_state_t state;
-    (void)seed;
-    XXH32_reset(&state, seed);
-    XXH32_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH32_digest(&state);
-}
-static XSUM_U32 localXXH64(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return (XSUM_U32)XXH64(buffer, bufferSize, seed);
-}
-static XSUM_U32 localXXH64_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH64_state_t state;
-    (void)seed;
-    XXH64_reset(&state, seed);
-    XXH64_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH64_digest(&state);
-}
-static XSUM_U32 localXXH3_64b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)XXH3_64bits(buffer, bufferSize);
-}
-static XSUM_U32 localXXH3_64b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return (XSUM_U32)XXH3_64bits_withSeed(buffer, bufferSize, seed);
-}
-static XSUM_U32 localXXH3_64b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)XXH3_64bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf));
-}
-static XSUM_U32 localXXH3_128b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)(XXH3_128bits(buffer, bufferSize).low64);
-}
-static XSUM_U32 localXXH3_128b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return (XSUM_U32)(XXH3_128bits_withSeed(buffer, bufferSize, seed).low64);
-}
-static XSUM_U32 localXXH3_128b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)(XXH3_128bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf)).low64);
-}
-static XSUM_U32 localXXH3_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    (void)seed;
-    XXH3_64bits_reset(&state);
-    XXH3_64bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH3_64bits_digest(&state);
-}
-static XSUM_U32 localXXH3_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    XXH3_INITSTATE(&state);
-    XXH3_64bits_reset_withSeed(&state, (XXH64_hash_t)seed);
-    XXH3_64bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH3_64bits_digest(&state);
-}
-static XSUM_U32 localXXH128_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    (void)seed;
-    XXH3_128bits_reset(&state);
-    XXH3_128bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
-}
-static XSUM_U32 localXXH128_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    XXH3_INITSTATE(&state);
-    XXH3_128bits_reset_withSeed(&state, (XXH64_hash_t)seed);
-    XXH3_128bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
-}
-
-
-typedef struct {
-    const char*  name;
-    hashFunction func;
-} hashInfo;
-
-static const hashInfo g_hashesToBench[] = {
-    { "XXH32",             &localXXH32 },
-    { "XXH64",             &localXXH64 },
-    { "XXH3_64b",          &localXXH3_64b },
-    { "XXH3_64b w/seed",   &localXXH3_64b_seeded },
-    { "XXH3_64b w/secret", &localXXH3_64b_secret },
-    { "XXH128",            &localXXH3_128b },
-    { "XXH128 w/seed",     &localXXH3_128b_seeded },
-    { "XXH128 w/secret",   &localXXH3_128b_secret },
-    { "XXH32_stream",      &localXXH32_stream },
-    { "XXH64_stream",      &localXXH64_stream },
-    { "XXH3_stream",       &localXXH3_stream },
-    { "XXH3_stream w/seed",&localXXH3_stream_seeded },
-    { "XXH128_stream",     &localXXH128_stream },
-    { "XXH128_stream w/seed",&localXXH128_stream_seeded },
-};
-#define NB_HASHFUNC (sizeof(g_hashesToBench) / sizeof(*g_hashesToBench))
-
-#define NB_TESTFUNC (1 + 2 * NB_HASHFUNC)
-static char g_testIDs[NB_TESTFUNC] = { 0 };
-static const char k_testIDs_default[NB_TESTFUNC] = { 0,
-        1 /*XXH32*/, 0,
-        1 /*XXH64*/, 0,
-        1 /*XXH3*/, 0, 0, 0, 0, 0,
-        1 /*XXH128*/ };
-
-#define HASHNAME_MAX 29
-static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
-                           const void* buffer, size_t bufferSize)
-{
-    XSUM_U32 nbh_perIteration = (XSUM_U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
-    unsigned iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
-    double fastestH = 100000000.;
-    assert(HASHNAME_MAX > 2);
-    XSUM_logVerbose(2, "\r%80s\r", "");       /* Clean display line */
-
-    for (iterationNb = 1; iterationNb <= nbIterations; iterationNb++) {
-        XSUM_U32 r=0;
-        clock_t cStart;
-
-        XSUM_logVerbose(2, "%2u-%-*.*s : %10u ->\r",
-                        iterationNb,
-                        HASHNAME_MAX, HASHNAME_MAX, hName,
-                        (unsigned)bufferSize);
-        cStart = clock();
-        while (clock() == cStart);   /* starts clock() at its exact beginning */
-        cStart = clock();
-
-        {   XSUM_U32 u;
-            for (u=0; u<nbh_perIteration; u++)
-                r += h(buffer, bufferSize, u);
-        }
-        if (r==0) XSUM_logVerbose(3,".\r");  /* do something with r to defeat compiler "optimizing" hash away */
-
-        {   clock_t const nbTicks = XSUM_clockSpan(cStart);
-            double const ticksPerHash = ((double)nbTicks / TIMELOOP) / nbh_perIteration;
-            /*
-             * clock() is the only decent portable timer, but it isn't very
-             * precise.
-             *
-             * Sometimes, this lack of precision is enough that the benchmark
-             * finishes before there are enough ticks to get a meaningful result.
-             *
-             * For example, on a Core 2 Duo (without any sort of Turbo Boost),
-             * the imprecise timer caused peculiar results like so:
-             *
-             *    XXH3_64b                   4800.0 MB/s // conveniently even
-             *    XXH3_64b unaligned         4800.0 MB/s
-             *    XXH3_64b seeded            9600.0 MB/s // magical 2x speedup?!
-             *    XXH3_64b seeded unaligned  4800.0 MB/s
-             *
-             * If we sense a suspiciously low number of ticks, we increase the
-             * iterations until we can get something meaningful.
-             */
-            if (nbTicks < TIMELOOP_MIN) {
-                /* Not enough time spent in benchmarking, risk of rounding bias */
-                if (nbTicks == 0) { /* faster than resolution timer */
-                    nbh_perIteration *= 100;
-                } else {
-                    /*
-                     * update nbh_perIteration so that the next round lasts
-                     * approximately 1 second.
-                     */
-                    double nbh_perSecond = (1 / ticksPerHash) + 1;
-                    if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
-                    nbh_perIteration = (XSUM_U32)nbh_perSecond;
-                }
-                /* g_nbIterations==0 => quick evaluation, no claim of accuracy */
-                if (g_nbIterations>0) {
-                    iterationNb--;   /* new round for a more accurate speed evaluation */
-                    continue;
-                }
-            }
-            if (ticksPerHash < fastestH) fastestH = ticksPerHash;
-            if (fastestH>0.) { /* avoid div by zero */
-                XSUM_logVerbose(2, "%2u-%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \r",
-                            iterationNb,
-                            HASHNAME_MAX, HASHNAME_MAX, hName,
-                            (unsigned)bufferSize,
-                            (double)1 / fastestH,
-                            ((double)bufferSize / (1 MB)) / fastestH);
-        }   }
-        {   double nbh_perSecond = (1 / fastestH) + 1;
-            if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
-            nbh_perIteration = (XSUM_U32)nbh_perSecond;
-        }
-    }
-    XSUM_logVerbose(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
-                    testID,
-                    HASHNAME_MAX, HASHNAME_MAX, hName,
-                    (unsigned)bufferSize,
-                    (double)1 / fastestH,
-                    ((double)bufferSize / (1 MB)) / fastestH);
-    if (XSUM_logLevel<1)
-        XSUM_logVerbose(0, "%u, ", (unsigned)((double)1 / fastestH));
-}
-
-
-/*!
- * XSUM_benchMem():
- * buffer: Must be 16-byte aligned.
- * The real allocated size of buffer is supposed to be >= (bufferSize+3).
- * returns: 0 on success, 1 if error (invalid mode selected)
- */
-static void XSUM_benchMem(const void* buffer, size_t bufferSize)
-{
-    assert((((size_t)buffer) & 15) == 0);  /* ensure alignment */
-    XSUM_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
-    {   int i;
-        for (i = 1; i < (int)NB_TESTFUNC; i++) {
-            int const hashFuncID = (i-1) / 2;
-            assert(g_hashesToBench[hashFuncID].name != NULL);
-            if (g_testIDs[i] == 0) continue;
-            /* aligned */
-            if ((i % 2) == 1) {
-                XSUM_benchHash(g_hashesToBench[hashFuncID].func, g_hashesToBench[hashFuncID].name, i, buffer, bufferSize);
-            }
-            /* unaligned */
-            if ((i % 2) == 0) {
-                /* Append "unaligned". */
-                char* const hashNameBuf = XSUM_strcatDup(g_hashesToBench[hashFuncID].name, " unaligned");
-                assert(hashNameBuf != NULL);
-                XSUM_benchHash(g_hashesToBench[hashFuncID].func, hashNameBuf, i, ((const char*)buffer)+3, bufferSize);
-                free(hashNameBuf);
-            }
-    }   }
-}
-
-static size_t XSUM_selectBenchedSize(const char* fileName)
-{
-    XSUM_U64 const inFileSize = XSUM_getFileSize(fileName);
-    size_t benchedSize = (size_t) XSUM_findMaxMem(inFileSize);
-    if ((XSUM_U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
-    if (benchedSize < inFileSize) {
-        XSUM_log("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
-    }
-    return benchedSize;
-}
-
-
-static int XSUM_benchFiles(char*const* fileNamesTable, int nbFiles)
-{
-    int fileIdx;
-    for (fileIdx=0; fileIdx<nbFiles; fileIdx++) {
-        const char* const inFileName = fileNamesTable[fileIdx];
-        assert(inFileName != NULL);
-
-        {   FILE* const inFile = XSUM_fopen( inFileName, "rb" );
-            size_t const benchedSize = XSUM_selectBenchedSize(inFileName);
-            char* const buffer = (char*)calloc(benchedSize+16+3, 1);
-            void* const alignedBuffer = (buffer+15) - (((size_t)(buffer+15)) & 0xF);  /* align on next 16 bytes */
-
-            /* Checks */
-            if (inFile==NULL){
-                XSUM_log("Error: Could not open '%s': %s.\n", inFileName, strerror(errno));
-                free(buffer);
-                exit(11);
-            }
-            if(!buffer) {
-                XSUM_log("\nError: Out of memory.\n");
-                fclose(inFile);
-                exit(12);
-            }
-
-            /* Fill input buffer */
-            {   size_t const readSize = fread(alignedBuffer, 1, benchedSize, inFile);
-                fclose(inFile);
-                if(readSize != benchedSize) {
-                    XSUM_log("\nError: Could not read '%s': %s.\n", inFileName, strerror(errno));
-                    free(buffer);
-                    exit(13);
-            }   }
-
-            /* bench */
-            XSUM_benchMem(alignedBuffer, benchedSize);
-
-            free(buffer);
-    }   }
-    return 0;
-}
-
-
-static int XSUM_benchInternal(size_t keySize)
-{
-    void* const buffer = calloc(keySize+16+3, 1);
-    if (buffer == NULL) {
-        XSUM_log("\nError: Out of memory.\n");
-        exit(12);
-    }
-
-    {   const void* const alignedBuffer = ((char*)buffer+15) - (((size_t)((char*)buffer+15)) & 0xF);  /* align on next 16 bytes */
-
-        /* bench */
-        XSUM_logVerbose(1, "Sample of ");
-        if (keySize > 10 KB) {
-            XSUM_logVerbose(1, "%u KB", (unsigned)(keySize >> 10));
-        } else {
-            XSUM_logVerbose(1, "%u bytes", (unsigned)keySize);
-        }
-        XSUM_logVerbose(1, "...        \n");
-
-        XSUM_benchMem(alignedBuffer, keySize);
-        free(buffer);
-    }
-    return 0;
-}
 
 /* ********************************************************
 *  File Hashing
 **********************************************************/
 
+#define XXHSUM32_DEFAULT_SEED 0                   /* Default seed for algo_xxh32 */
+#define XXHSUM64_DEFAULT_SEED 0                   /* Default seed for algo_xxh64 */
+
 /* for support of --little-endian display mode */
 static void XSUM_display_LittleEndian(const void* ptr, size_t length)
 {
@@ -729,7 +327,7 @@ static int XSUM_hashFile(const char* fileName,
  * XSUM_hashFiles:
  * If fnTotal==0, read from stdin instead.
  */
-static int XSUM_hashFiles(char*const * fnList, int fnTotal,
+static int XSUM_hashFiles(const char* fnList[], int fnTotal,
                           AlgoSelected hashType,
                           Display_endianess displayEndianess,
                           Display_convention convention)
@@ -1241,7 +839,7 @@ static int XSUM_checkFile(const char* inFileName,
 }
 
 
-static int XSUM_checkFiles(char*const* fnList, int fnTotal,
+static int XSUM_checkFiles(const char* fnList[], int fnTotal,
                            const Display_endianess displayEndianess,
                            XSUM_U32 strictMode,
                            XSUM_U32 statusOnly,
@@ -1290,7 +888,7 @@ static int XSUM_usage_advanced(const char* exename)
     XSUM_log( "      --little-endian  Checksum values use little endian convention (default: big endian) \n");
     XSUM_log( "  -b                   Run benchmark \n");
     XSUM_log( "  -b#                  Bench only algorithm variant # \n");
-    XSUM_log( "  -i#                  Number of times to run the benchmark (default: %u) \n", (unsigned)g_nbIterations);
+    XSUM_log( "  -i#                  Number of times to run the benchmark (default: %u) \n", NBLOOPS_DEFAULT);
     XSUM_log( "  -q, --quiet          Don't display version header in benchmark mode \n");
     XSUM_log( "\n");
     XSUM_log( "The following four options are useful only when verifying checksums (-c): \n");
@@ -1371,7 +969,7 @@ static XSUM_U32 XSUM_readU32FromChar(const char** stringPtr) {
     return result;
 }
 
-XSUM_API int XSUM_main(int argc, char* argv[])
+XSUM_API int XSUM_main(int argc, const char* argv[])
 {
     int i, filenamesStart = 0;
     const char* const exename = XSUM_lastNameFromPath(argv[0]);
@@ -1387,6 +985,7 @@ XSUM_API int XSUM_main(int argc, char* argv[])
     AlgoSelected algo     = g_defaultAlgo;
     Display_endianess displayEndianess = big_endian;
     Display_convention convention = display_gnu;
+    int nbIterations = NBLOOPS_DEFAULT;
 
     /* special case: xxhNNsum default to NN bits checksum */
     if (strstr(exename,  "xxh32sum") != NULL) algo = g_defaultAlgo = algo_xxh32;
@@ -1468,17 +1067,18 @@ XSUM_API int XSUM_main(int argc, char* argv[])
                 do {
                     if (*argument == ',') argument++;
                     selectBenchIDs = XSUM_readU32FromChar(&argument); /* select one specific test */
-                    if (selectBenchIDs < NB_TESTFUNC) {
+                    if ((int)selectBenchIDs < g_nbTestFunctions) {
                         g_testIDs[selectBenchIDs] = 1;
-                    } else
+                    } else {
                         selectBenchIDs = kBenchAll;
+                    }
                 } while (*argument == ',');
                 break;
 
             /* Modify Nb Iterations (benchmark only) */
             case 'i':
                 argument++;
-                g_nbIterations = XSUM_readU32FromChar(&argument);
+                nbIterations = (int)XSUM_readU32FromChar(&argument);
                 break;
 
             /* Modify Block size (benchmark only) */
@@ -1503,8 +1103,9 @@ XSUM_API int XSUM_main(int argc, char* argv[])
     if (benchmarkMode) {
         XSUM_logVerbose(2, FULL_WELCOME_MESSAGE(exename) );
         XSUM_sanityCheck();
-        if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, sizeof(g_testIDs));
-        if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, sizeof(g_testIDs));
+        g_nbIterations = nbIterations;
+        if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, g_nbTestFunctions);
+        if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, g_nbTestFunctions);
         if (filenamesStart==0) return XSUM_benchInternal(keySize);
         return XSUM_benchFiles(argv+filenamesStart, argc-filenamesStart);
     }
diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 7acc8b1f..d5456b01 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -92,6 +92,7 @@ if(XXHASH_BUILD_XXHSUM)
                         "${XXHSUM_DIR}/xsum_os_specific.c"
                         "${XXHSUM_DIR}/xsum_output.c"
                         "${XXHSUM_DIR}/xsum_sanity_check.c"
+                        "${XXHSUM_DIR}/xsum_bench.c"
                 )
   add_executable(${PROJECT_NAME}::xxhsum ALIAS xxhsum)
 

From f9ce9c5fd9a4192486f34fa365cb6fab1e9102ce Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 10:41:26 -0800
Subject: [PATCH 173/187] separate benchmark functions into xsum_bench.c

---
 Makefile                        |   6 +-
 cli/xsum_bench.c                | 435 ++++++++++++++++++++++++++++++++
 cli/xsum_bench.h                |  51 ++++
 cli/xsum_config.h               |  11 +-
 cli/xsum_os_specific.c          |   2 +-
 cli/xsum_os_specific.h          |   2 +-
 cli/xxhsum.c                    | 435 ++------------------------------
 cmake_unofficial/CMakeLists.txt |   1 +
 8 files changed, 521 insertions(+), 422 deletions(-)
 create mode 100644 cli/xsum_bench.c
 create mode 100644 cli/xsum_bench.h

diff --git a/Makefile b/Makefile
index da6e0818..dc1db341 100644
--- a/Makefile
+++ b/Makefile
@@ -75,13 +75,15 @@ XXHSUM_SRC_DIR = cli
 XXHSUM_SPLIT_SRCS = $(XXHSUM_SRC_DIR)/xxhsum.c \
                     $(XXHSUM_SRC_DIR)/xsum_os_specific.c \
                     $(XXHSUM_SRC_DIR)/xsum_output.c \
-                    $(XXHSUM_SRC_DIR)/xsum_sanity_check.c
+                    $(XXHSUM_SRC_DIR)/xsum_sanity_check.c \
+                    $(XXHSUM_SRC_DIR)/xsum_bench.c
 XXHSUM_SPLIT_OBJS = $(XXHSUM_SPLIT_SRCS:.c=.o)
 XXHSUM_HEADERS = $(XXHSUM_SRC_DIR)/xsum_config.h \
                  $(XXHSUM_SRC_DIR)/xsum_arch.h \
                  $(XXHSUM_SRC_DIR)/xsum_os_specific.h \
                  $(XXHSUM_SRC_DIR)/xsum_output.h \
-                 $(XXHSUM_SRC_DIR)/xsum_sanity_check.h
+                 $(XXHSUM_SRC_DIR)/xsum_sanity_check.h \
+                 $(XXHSUM_SRC_DIR)/xsum_bench.h
 
 ## generate CLI and libraries in release mode (default for `make`)
 .PHONY: default
diff --git a/cli/xsum_bench.c b/cli/xsum_bench.c
new file mode 100644
index 00000000..1a8988c3
--- /dev/null
+++ b/cli/xsum_bench.c
@@ -0,0 +1,435 @@
+/*
+ * xsum_bench - Benchmark functions for xxhsum
+ * Copyright (C) 2013-2021 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#include "xsum_config.h"
+#include "xsum_output.h"
+#include "xsum_bench.h"
+#include "xsum_sanity_check.h" /* XSUM_fillTestBuffer */
+#include "xsum_os_specific.h"  /* XSUM_getFileSize */
+#include <stdlib.h>
+#include <assert.h>
+#include <string.h>
+#ifndef XXH_STATIC_LINKING_ONLY
+#  define XXH_STATIC_LINKING_ONLY
+#endif
+#include "../xxhash.h"
+
+#include <stdio.h>  /* FILE */
+#include <time.h>   /* clock_t, clock, CLOCKS_PER_SEC */
+#include <errno.h>  /* errno */
+
+#define TIMELOOP_S 1
+#define TIMELOOP  (TIMELOOP_S * CLOCKS_PER_SEC)   /* target timing per iteration */
+#define TIMELOOP_MIN (TIMELOOP / 2)               /* minimum timing to validate a result */
+
+#define MAX_MEM    (2 GB - 64 MB)
+
+static clock_t XSUM_clockSpan( clock_t start )
+{
+    return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
+}
+
+static size_t XSUM_findMaxMem(XSUM_U64 requiredMem)
+{
+    size_t const step = 64 MB;
+    void* testmem = NULL;
+
+    requiredMem = (((requiredMem >> 26) + 1) << 26);
+    requiredMem += 2*step;
+    if (requiredMem > MAX_MEM) requiredMem = MAX_MEM;
+
+    while (!testmem) {
+        if (requiredMem > step) requiredMem -= step;
+        else requiredMem >>= 1;
+        testmem = malloc ((size_t)requiredMem);
+    }
+    free (testmem);
+
+    /* keep some space available */
+    if (requiredMem > step) requiredMem -= step;
+    else requiredMem >>= 1;
+
+    return (size_t)requiredMem;
+}
+
+/*
+ * A secret buffer used for benchmarking XXH3's withSecret variants.
+ *
+ * In order for the bench to be realistic, the secret buffer would need to be
+ * pre-generated.
+ *
+ * Adding a pointer to the parameter list would be messy.
+ */
+static XSUM_U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
+
+/*
+ * Wrappers for the benchmark.
+ *
+ * If you would like to add other hashes to the bench, create a wrapper and add
+ * it to the g_hashesToBench table. It will automatically be added.
+ */
+typedef XSUM_U32 (*hashFunction)(const void* buffer, size_t bufferSize, XSUM_U32 seed);
+
+static XSUM_U32 localXXH32(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return XXH32(buffer, bufferSize, seed);
+}
+static XSUM_U32 localXXH32_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH32_state_t state;
+    (void)seed;
+    XXH32_reset(&state, seed);
+    XXH32_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH32_digest(&state);
+}
+static XSUM_U32 localXXH64(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return (XSUM_U32)XXH64(buffer, bufferSize, seed);
+}
+static XSUM_U32 localXXH64_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH64_state_t state;
+    (void)seed;
+    XXH64_reset(&state, seed);
+    XXH64_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH64_digest(&state);
+}
+static XSUM_U32 localXXH3_64b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)XXH3_64bits(buffer, bufferSize);
+}
+static XSUM_U32 localXXH3_64b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return (XSUM_U32)XXH3_64bits_withSeed(buffer, bufferSize, seed);
+}
+static XSUM_U32 localXXH3_64b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)XXH3_64bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf));
+}
+static XSUM_U32 localXXH3_128b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)(XXH3_128bits(buffer, bufferSize).low64);
+}
+static XSUM_U32 localXXH3_128b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    return (XSUM_U32)(XXH3_128bits_withSeed(buffer, bufferSize, seed).low64);
+}
+static XSUM_U32 localXXH3_128b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    (void)seed;
+    return (XSUM_U32)(XXH3_128bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf)).low64);
+}
+static XSUM_U32 localXXH3_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    (void)seed;
+    XXH3_64bits_reset(&state);
+    XXH3_64bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH3_64bits_digest(&state);
+}
+static XSUM_U32 localXXH3_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    XXH3_INITSTATE(&state);
+    XXH3_64bits_reset_withSeed(&state, (XXH64_hash_t)seed);
+    XXH3_64bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)XXH3_64bits_digest(&state);
+}
+static XSUM_U32 localXXH128_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    (void)seed;
+    XXH3_128bits_reset(&state);
+    XXH3_128bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
+}
+static XSUM_U32 localXXH128_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
+{
+    XXH3_state_t state;
+    XXH3_INITSTATE(&state);
+    XXH3_128bits_reset_withSeed(&state, (XXH64_hash_t)seed);
+    XXH3_128bits_update(&state, buffer, bufferSize);
+    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
+}
+
+
+typedef struct {
+    const char*  name;
+    hashFunction func;
+} hashInfo;
+
+static const hashInfo g_hashesToBench[] = {
+    { "XXH32",             &localXXH32 },
+    { "XXH64",             &localXXH64 },
+    { "XXH3_64b",          &localXXH3_64b },
+    { "XXH3_64b w/seed",   &localXXH3_64b_seeded },
+    { "XXH3_64b w/secret", &localXXH3_64b_secret },
+    { "XXH128",            &localXXH3_128b },
+    { "XXH128 w/seed",     &localXXH3_128b_seeded },
+    { "XXH128 w/secret",   &localXXH3_128b_secret },
+    { "XXH32_stream",      &localXXH32_stream },
+    { "XXH64_stream",      &localXXH64_stream },
+    { "XXH3_stream",       &localXXH3_stream },
+    { "XXH3_stream w/seed",&localXXH3_stream_seeded },
+    { "XXH128_stream",     &localXXH128_stream },
+    { "XXH128_stream w/seed",&localXXH128_stream_seeded },
+};
+#define NB_HASHFUNC (sizeof(g_hashesToBench) / sizeof(*g_hashesToBench))
+
+#define NB_TESTFUNC (1 + 2 * NB_HASHFUNC)
+int const g_nbTestFunctions = NB_TESTFUNC;
+char g_testIDs[NB_TESTFUNC] = { 0 };
+const char k_testIDs_default[NB_TESTFUNC] = { 0,
+        1 /*XXH32*/, 0,
+        1 /*XXH64*/, 0,
+        1 /*XXH3*/, 0, 0, 0, 0, 0,
+        1 /*XXH128*/ };
+
+int g_nbIterations = NBLOOPS_DEFAULT;
+#define HASHNAME_MAX 29
+static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
+                           const void* buffer, size_t bufferSize)
+{
+    XSUM_U32 nbh_perIteration = (XSUM_U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
+    int iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
+    double fastestH = 100000000.;
+    assert(HASHNAME_MAX > 2);
+    XSUM_logVerbose(2, "\r%80s\r", "");       /* Clean display line */
+
+    for (iterationNb = 1; iterationNb <= nbIterations; iterationNb++) {
+        XSUM_U32 r=0;
+        clock_t cStart;
+
+        XSUM_logVerbose(2, "%2i-%-*.*s : %10u ->\r",
+                        iterationNb,
+                        HASHNAME_MAX, HASHNAME_MAX, hName,
+                        (unsigned)bufferSize);
+        cStart = clock();
+        while (clock() == cStart);   /* starts clock() at its exact beginning */
+        cStart = clock();
+
+        {   XSUM_U32 u;
+            for (u=0; u<nbh_perIteration; u++)
+                r += h(buffer, bufferSize, u);
+        }
+        if (r==0) XSUM_logVerbose(3,".\r");  /* do something with r to defeat compiler "optimizing" hash away */
+
+        {   clock_t const nbTicks = XSUM_clockSpan(cStart);
+            double const ticksPerHash = ((double)nbTicks / TIMELOOP) / nbh_perIteration;
+            /*
+             * clock() is the only decent portable timer, but it isn't very
+             * precise.
+             *
+             * Sometimes, this lack of precision is enough that the benchmark
+             * finishes before there are enough ticks to get a meaningful result.
+             *
+             * For example, on a Core 2 Duo (without any sort of Turbo Boost),
+             * the imprecise timer caused peculiar results like so:
+             *
+             *    XXH3_64b                   4800.0 MB/s // conveniently even
+             *    XXH3_64b unaligned         4800.0 MB/s
+             *    XXH3_64b seeded            9600.0 MB/s // magical 2x speedup?!
+             *    XXH3_64b seeded unaligned  4800.0 MB/s
+             *
+             * If we sense a suspiciously low number of ticks, we increase the
+             * iterations until we can get something meaningful.
+             */
+            if (nbTicks < TIMELOOP_MIN) {
+                /* Not enough time spent in benchmarking, risk of rounding bias */
+                if (nbTicks == 0) { /* faster than resolution timer */
+                    nbh_perIteration *= 100;
+                } else {
+                    /*
+                     * update nbh_perIteration so that the next round lasts
+                     * approximately 1 second.
+                     */
+                    double nbh_perSecond = (1 / ticksPerHash) + 1;
+                    if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
+                    nbh_perIteration = (XSUM_U32)nbh_perSecond;
+                }
+                /* g_nbIterations==0 => quick evaluation, no claim of accuracy */
+                if (g_nbIterations>0) {
+                    iterationNb--;   /* new round for a more accurate speed evaluation */
+                    continue;
+                }
+            }
+            if (ticksPerHash < fastestH) fastestH = ticksPerHash;
+            if (fastestH>0.) { /* avoid div by zero */
+                XSUM_logVerbose(2, "%2i-%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \r",
+                            iterationNb,
+                            HASHNAME_MAX, HASHNAME_MAX, hName,
+                            (unsigned)bufferSize,
+                            (double)1 / fastestH,
+                            ((double)bufferSize / (1 MB)) / fastestH);
+        }   }
+        {   double nbh_perSecond = (1 / fastestH) + 1;
+            if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
+            nbh_perIteration = (XSUM_U32)nbh_perSecond;
+        }
+    }
+    XSUM_logVerbose(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
+                    testID,
+                    HASHNAME_MAX, HASHNAME_MAX, hName,
+                    (unsigned)bufferSize,
+                    (double)1 / fastestH,
+                    ((double)bufferSize / (1 MB)) / fastestH);
+    if (XSUM_logLevel<1)
+        XSUM_logVerbose(0, "%u, ", (unsigned)((double)1 / fastestH));
+}
+
+
+/*
+ * Allocates a string containing s1 and s2 concatenated. Acts like strdup.
+ * The result must be freed.
+ */
+static char* XSUM_strcatDup(const char* s1, const char* s2)
+{
+    assert(s1 != NULL);
+    assert(s2 != NULL);
+    {   size_t len1 = strlen(s1);
+        size_t len2 = strlen(s2);
+        char* buf = (char*)malloc(len1 + len2 + 1);
+        if (buf != NULL) {
+            /* strcpy(buf, s1) */
+            memcpy(buf, s1, len1);
+            /* strcat(buf, s2) */
+            memcpy(buf + len1, s2, len2 + 1);
+        }
+        return buf;
+    }
+}
+
+
+/*!
+ * XSUM_benchMem():
+ * buffer: Must be 16-byte aligned.
+ * The real allocated size of buffer is supposed to be >= (bufferSize+3).
+ * returns: 0 on success, 1 if error (invalid mode selected)
+ */
+static void XSUM_benchMem(const void* buffer, size_t bufferSize)
+{
+    assert((((size_t)buffer) & 15) == 0);  /* ensure alignment */
+    XSUM_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
+    {   int i;
+        for (i = 1; i < (int)NB_TESTFUNC; i++) {
+            int const hashFuncID = (i-1) / 2;
+            assert(g_hashesToBench[hashFuncID].name != NULL);
+            if (g_testIDs[i] == 0) continue;
+            /* aligned */
+            if ((i % 2) == 1) {
+                XSUM_benchHash(g_hashesToBench[hashFuncID].func, g_hashesToBench[hashFuncID].name, i, buffer, bufferSize);
+            }
+            /* unaligned */
+            if ((i % 2) == 0) {
+                /* Append "unaligned". */
+                char* const hashNameBuf = XSUM_strcatDup(g_hashesToBench[hashFuncID].name, " unaligned");
+                assert(hashNameBuf != NULL);
+                XSUM_benchHash(g_hashesToBench[hashFuncID].func, hashNameBuf, i, ((const char*)buffer)+3, bufferSize);
+                free(hashNameBuf);
+            }
+    }   }
+}
+
+static size_t XSUM_selectBenchedSize(const char* fileName)
+{
+    XSUM_U64 const inFileSize = XSUM_getFileSize(fileName);
+    size_t benchedSize = (size_t) XSUM_findMaxMem(inFileSize);
+    if ((XSUM_U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
+    if (benchedSize < inFileSize) {
+        XSUM_log("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
+    }
+    return benchedSize;
+}
+
+
+int XSUM_benchFiles(const char* fileNamesTable[], int nbFiles)
+{
+    int fileIdx;
+    for (fileIdx=0; fileIdx<nbFiles; fileIdx++) {
+        const char* const inFileName = fileNamesTable[fileIdx];
+        assert(inFileName != NULL);
+
+        {   FILE* const inFile = XSUM_fopen( inFileName, "rb" );
+            size_t const benchedSize = XSUM_selectBenchedSize(inFileName);
+            char* const buffer = (char*)calloc(benchedSize+16+3, 1);
+            void* const alignedBuffer = (buffer+15) - (((size_t)(buffer+15)) & 0xF);  /* align on next 16 bytes */
+
+            /* Checks */
+            if (inFile==NULL){
+                XSUM_log("Error: Could not open '%s': %s.\n", inFileName, strerror(errno));
+                free(buffer);
+                exit(11);
+            }
+            if(!buffer) {
+                XSUM_log("\nError: Out of memory.\n");
+                fclose(inFile);
+                exit(12);
+            }
+
+            /* Fill input buffer */
+            {   size_t const readSize = fread(alignedBuffer, 1, benchedSize, inFile);
+                fclose(inFile);
+                if(readSize != benchedSize) {
+                    XSUM_log("\nError: Could not read '%s': %s.\n", inFileName, strerror(errno));
+                    free(buffer);
+                    exit(13);
+            }   }
+
+            /* bench */
+            XSUM_benchMem(alignedBuffer, benchedSize);
+
+            free(buffer);
+    }   }
+    return 0;
+}
+
+
+int XSUM_benchInternal(size_t keySize)
+{
+    void* const buffer = calloc(keySize+16+3, 1);
+    if (buffer == NULL) {
+        XSUM_log("\nError: Out of memory.\n");
+        exit(12);
+    }
+
+    {   const void* const alignedBuffer = ((char*)buffer+15) - (((size_t)((char*)buffer+15)) & 0xF);  /* align on next 16 bytes */
+
+        /* bench */
+        XSUM_logVerbose(1, "Sample of ");
+        if (keySize > 10 KB) {
+            XSUM_logVerbose(1, "%u KB", (unsigned)(keySize >> 10));
+        } else {
+            XSUM_logVerbose(1, "%u bytes", (unsigned)keySize);
+        }
+        XSUM_logVerbose(1, "...        \n");
+
+        XSUM_benchMem(alignedBuffer, keySize);
+        free(buffer);
+    }
+    return 0;
+}
diff --git a/cli/xsum_bench.h b/cli/xsum_bench.h
new file mode 100644
index 00000000..6faaec8c
--- /dev/null
+++ b/cli/xsum_bench.h
@@ -0,0 +1,51 @@
+/*
+ * xsum_bench - Benchmark functions for xxhsum
+ * Copyright (C) 2013-2021 Yann Collet
+ *
+ * GPL v2 License
+ *
+ * This program is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * This program is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with this program; if not, write to the Free Software Foundation, Inc.,
+ * 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.
+ *
+ * You can contact the author at:
+ *   - xxHash homepage: https://www.xxhash.com
+ *   - xxHash source repository: https://github.com/Cyan4973/xxHash
+ */
+
+#ifndef XSUM_BENCH_H
+#define XSUM_BENCH_H
+
+#include <stddef.h>  /* size_t */
+
+#define NBLOOPS_DEFAULT    3    /* Default number of benchmark iterations */
+
+extern int const g_nbTestFunctions;
+extern char g_testIDs[];  /* size : g_nbTestFunctions */
+extern const char k_testIDs_default[];
+extern int g_nbIterations;
+
+int XSUM_benchInternal(size_t keySize);
+int XSUM_benchFiles(const char* fileNamesTable[], int nbFiles);
+
+
+#ifdef __cplusplus
+extern "C" {
+#endif
+
+
+#ifdef __cplusplus
+}
+#endif
+
+#endif /* XSUM_BENCH_H */
diff --git a/cli/xsum_config.h b/cli/xsum_config.h
index 9222144d..eec5528d 100644
--- a/cli/xsum_config.h
+++ b/cli/xsum_config.h
@@ -1,6 +1,6 @@
 /*
  * xxhsum - Command line interface for xxhash algorithms
- * Copyright (C) 2013-2020 Yann Collet
+ * Copyright (C) 2013-2021 Yann Collet
  *
  * GPL v2 License
  *
@@ -202,4 +202,13 @@
     typedef unsigned long long XSUM_U64;
 #endif /* not C++/C99 */
 
+/* ***************************
+ * Common constants
+ * ***************************/
+
+#define KB *( 1<<10)
+#define MB *( 1<<20)
+#define GB *(1U<<30)
+
+
 #endif /* XSUM_CONFIG_H */
diff --git a/cli/xsum_os_specific.c b/cli/xsum_os_specific.c
index 8f48ce07..b34a359a 100644
--- a/cli/xsum_os_specific.c
+++ b/cli/xsum_os_specific.c
@@ -112,7 +112,7 @@ static int XSUM_stat(const char* infilename, XSUM_stat_t* statbuf)
 }
 
 #ifndef XSUM_NO_MAIN
-int main(int argc, char* argv[])
+int main(int argc, const char* argv[])
 {
     return XSUM_main(argc, argv);
 }
diff --git a/cli/xsum_os_specific.h b/cli/xsum_os_specific.h
index b3562b26..4251cf09 100644
--- a/cli/xsum_os_specific.h
+++ b/cli/xsum_os_specific.h
@@ -39,7 +39,7 @@ extern "C" {
  *
  * Functions like main(), but is passed UTF-8 arguments even on Windows.
  */
-XSUM_API int XSUM_main(int argc, char* argv[]);
+XSUM_API int XSUM_main(int argc, const char* argv[]);
 
 /*
  * Returns whether stream is a console.
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index 56bce1ba..864323a0 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -35,10 +35,12 @@
 #include "xsum_os_specific.h"
 #include "xsum_output.h"
 #include "xsum_sanity_check.h"
+#include "xsum_bench.h"
 #ifdef XXH_INLINE_ALL
 #  include "xsum_os_specific.c"
 #  include "xsum_output.c"
 #  include "xsum_sanity_check.c"
+#  include "xsum_bench.c"
 #endif
 
 /* ************************************
@@ -50,7 +52,6 @@
 #include <stdio.h>      /* fprintf, fopen, ftello64, fread, stdin, stdout, _fileno (when present) */
 #include <sys/types.h>  /* stat, stat64, _stat64 */
 #include <sys/stat.h>   /* stat, stat64, _stat64 */
-#include <time.h>       /* clock_t, clock, CLOCKS_PER_SEC */
 #include <assert.h>     /* assert */
 #include <errno.h>      /* errno */
 
@@ -78,19 +79,6 @@ static const char author[] = "Yann Collet";
                     exename, XSUM_PROGRAM_VERSION, author, \
                     g_nbBits, XSUM_ARCH, ENDIAN_NAME, XSUM_CC_VERSION
 
-#define KB *( 1<<10)
-#define MB *( 1<<20)
-#define GB *(1U<<30)
-
-static size_t XSUM_DEFAULT_SAMPLE_SIZE = 100 KB;
-#define NBLOOPS    3                              /* Default number of benchmark iterations */
-#define TIMELOOP_S 1
-#define TIMELOOP  (TIMELOOP_S * CLOCKS_PER_SEC)   /* target timing per iteration */
-#define TIMELOOP_MIN (TIMELOOP / 2)               /* minimum timing to validate a result */
-#define XXHSUM32_DEFAULT_SEED 0                   /* Default seed for algo_xxh32 */
-#define XXHSUM64_DEFAULT_SEED 0                   /* Default seed for algo_xxh64 */
-
-#define MAX_MEM    (2 GB - 64 MB)
 
 static const char stdinName[] = "-";
 static const char stdinFileName[] = "stdin";
@@ -104,406 +92,16 @@ static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & X
 /* Maximum acceptable line length. */
 #define MAX_LINE_LENGTH (32 KB)
 
+static size_t XSUM_DEFAULT_SAMPLE_SIZE = 100 KB;
 
-/* ************************************
- *  Local variables
- **************************************/
-static XSUM_U32 g_nbIterations = NBLOOPS;
-
-
-/* ************************************
- *  Benchmark Functions
- **************************************/
-static clock_t XSUM_clockSpan( clock_t start )
-{
-    return clock() - start;   /* works even if overflow; Typical max span ~ 30 mn */
-}
-
-static size_t XSUM_findMaxMem(XSUM_U64 requiredMem)
-{
-    size_t const step = 64 MB;
-    void* testmem = NULL;
-
-    requiredMem = (((requiredMem >> 26) + 1) << 26);
-    requiredMem += 2*step;
-    if (requiredMem > MAX_MEM) requiredMem = MAX_MEM;
-
-    while (!testmem) {
-        if (requiredMem > step) requiredMem -= step;
-        else requiredMem >>= 1;
-        testmem = malloc ((size_t)requiredMem);
-    }
-    free (testmem);
-
-    /* keep some space available */
-    if (requiredMem > step) requiredMem -= step;
-    else requiredMem >>= 1;
-
-    return (size_t)requiredMem;
-}
-
-/*
- * Allocates a string containing s1 and s2 concatenated. Acts like strdup.
- * The result must be freed.
- */
-static char* XSUM_strcatDup(const char* s1, const char* s2)
-{
-    assert(s1 != NULL);
-    assert(s2 != NULL);
-    {   size_t len1 = strlen(s1);
-        size_t len2 = strlen(s2);
-        char* buf = (char*)malloc(len1 + len2 + 1);
-        if (buf != NULL) {
-            /* strcpy(buf, s1) */
-            memcpy(buf, s1, len1);
-            /* strcat(buf, s2) */
-            memcpy(buf + len1, s2, len2 + 1);
-        }
-        return buf;
-    }
-}
-
-
-/*
- * A secret buffer used for benchmarking XXH3's withSecret variants.
- *
- * In order for the bench to be realistic, the secret buffer would need to be
- * pre-generated.
- *
- * Adding a pointer to the parameter list would be messy.
- */
-static XSUM_U8 g_benchSecretBuf[XXH3_SECRET_SIZE_MIN];
-
-/*
- * Wrappers for the benchmark.
- *
- * If you would like to add other hashes to the bench, create a wrapper and add
- * it to the g_hashesToBench table. It will automatically be added.
- */
-typedef XSUM_U32 (*hashFunction)(const void* buffer, size_t bufferSize, XSUM_U32 seed);
-
-static XSUM_U32 localXXH32(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return XXH32(buffer, bufferSize, seed);
-}
-static XSUM_U32 localXXH32_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH32_state_t state;
-    (void)seed;
-    XXH32_reset(&state, seed);
-    XXH32_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH32_digest(&state);
-}
-static XSUM_U32 localXXH64(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return (XSUM_U32)XXH64(buffer, bufferSize, seed);
-}
-static XSUM_U32 localXXH64_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH64_state_t state;
-    (void)seed;
-    XXH64_reset(&state, seed);
-    XXH64_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH64_digest(&state);
-}
-static XSUM_U32 localXXH3_64b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)XXH3_64bits(buffer, bufferSize);
-}
-static XSUM_U32 localXXH3_64b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return (XSUM_U32)XXH3_64bits_withSeed(buffer, bufferSize, seed);
-}
-static XSUM_U32 localXXH3_64b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)XXH3_64bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf));
-}
-static XSUM_U32 localXXH3_128b(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)(XXH3_128bits(buffer, bufferSize).low64);
-}
-static XSUM_U32 localXXH3_128b_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    return (XSUM_U32)(XXH3_128bits_withSeed(buffer, bufferSize, seed).low64);
-}
-static XSUM_U32 localXXH3_128b_secret(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    (void)seed;
-    return (XSUM_U32)(XXH3_128bits_withSecret(buffer, bufferSize, g_benchSecretBuf, sizeof(g_benchSecretBuf)).low64);
-}
-static XSUM_U32 localXXH3_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    (void)seed;
-    XXH3_64bits_reset(&state);
-    XXH3_64bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH3_64bits_digest(&state);
-}
-static XSUM_U32 localXXH3_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    XXH3_INITSTATE(&state);
-    XXH3_64bits_reset_withSeed(&state, (XXH64_hash_t)seed);
-    XXH3_64bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)XXH3_64bits_digest(&state);
-}
-static XSUM_U32 localXXH128_stream(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    (void)seed;
-    XXH3_128bits_reset(&state);
-    XXH3_128bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
-}
-static XSUM_U32 localXXH128_stream_seeded(const void* buffer, size_t bufferSize, XSUM_U32 seed)
-{
-    XXH3_state_t state;
-    XXH3_INITSTATE(&state);
-    XXH3_128bits_reset_withSeed(&state, (XXH64_hash_t)seed);
-    XXH3_128bits_update(&state, buffer, bufferSize);
-    return (XSUM_U32)(XXH3_128bits_digest(&state).low64);
-}
-
-
-typedef struct {
-    const char*  name;
-    hashFunction func;
-} hashInfo;
-
-static const hashInfo g_hashesToBench[] = {
-    { "XXH32",             &localXXH32 },
-    { "XXH64",             &localXXH64 },
-    { "XXH3_64b",          &localXXH3_64b },
-    { "XXH3_64b w/seed",   &localXXH3_64b_seeded },
-    { "XXH3_64b w/secret", &localXXH3_64b_secret },
-    { "XXH128",            &localXXH3_128b },
-    { "XXH128 w/seed",     &localXXH3_128b_seeded },
-    { "XXH128 w/secret",   &localXXH3_128b_secret },
-    { "XXH32_stream",      &localXXH32_stream },
-    { "XXH64_stream",      &localXXH64_stream },
-    { "XXH3_stream",       &localXXH3_stream },
-    { "XXH3_stream w/seed",&localXXH3_stream_seeded },
-    { "XXH128_stream",     &localXXH128_stream },
-    { "XXH128_stream w/seed",&localXXH128_stream_seeded },
-};
-#define NB_HASHFUNC (sizeof(g_hashesToBench) / sizeof(*g_hashesToBench))
-
-#define NB_TESTFUNC (1 + 2 * NB_HASHFUNC)
-static char g_testIDs[NB_TESTFUNC] = { 0 };
-static const char k_testIDs_default[NB_TESTFUNC] = { 0,
-        1 /*XXH32*/, 0,
-        1 /*XXH64*/, 0,
-        1 /*XXH3*/, 0, 0, 0, 0, 0,
-        1 /*XXH128*/ };
-
-#define HASHNAME_MAX 29
-static void XSUM_benchHash(hashFunction h, const char* hName, int testID,
-                           const void* buffer, size_t bufferSize)
-{
-    XSUM_U32 nbh_perIteration = (XSUM_U32)((300 MB) / (bufferSize+1)) + 1;  /* first iteration conservatively aims for 300 MB/s */
-    unsigned iterationNb, nbIterations = g_nbIterations + !g_nbIterations /* min 1 */;
-    double fastestH = 100000000.;
-    assert(HASHNAME_MAX > 2);
-    XSUM_logVerbose(2, "\r%80s\r", "");       /* Clean display line */
-
-    for (iterationNb = 1; iterationNb <= nbIterations; iterationNb++) {
-        XSUM_U32 r=0;
-        clock_t cStart;
-
-        XSUM_logVerbose(2, "%2u-%-*.*s : %10u ->\r",
-                        iterationNb,
-                        HASHNAME_MAX, HASHNAME_MAX, hName,
-                        (unsigned)bufferSize);
-        cStart = clock();
-        while (clock() == cStart);   /* starts clock() at its exact beginning */
-        cStart = clock();
-
-        {   XSUM_U32 u;
-            for (u=0; u<nbh_perIteration; u++)
-                r += h(buffer, bufferSize, u);
-        }
-        if (r==0) XSUM_logVerbose(3,".\r");  /* do something with r to defeat compiler "optimizing" hash away */
-
-        {   clock_t const nbTicks = XSUM_clockSpan(cStart);
-            double const ticksPerHash = ((double)nbTicks / TIMELOOP) / nbh_perIteration;
-            /*
-             * clock() is the only decent portable timer, but it isn't very
-             * precise.
-             *
-             * Sometimes, this lack of precision is enough that the benchmark
-             * finishes before there are enough ticks to get a meaningful result.
-             *
-             * For example, on a Core 2 Duo (without any sort of Turbo Boost),
-             * the imprecise timer caused peculiar results like so:
-             *
-             *    XXH3_64b                   4800.0 MB/s // conveniently even
-             *    XXH3_64b unaligned         4800.0 MB/s
-             *    XXH3_64b seeded            9600.0 MB/s // magical 2x speedup?!
-             *    XXH3_64b seeded unaligned  4800.0 MB/s
-             *
-             * If we sense a suspiciously low number of ticks, we increase the
-             * iterations until we can get something meaningful.
-             */
-            if (nbTicks < TIMELOOP_MIN) {
-                /* Not enough time spent in benchmarking, risk of rounding bias */
-                if (nbTicks == 0) { /* faster than resolution timer */
-                    nbh_perIteration *= 100;
-                } else {
-                    /*
-                     * update nbh_perIteration so that the next round lasts
-                     * approximately 1 second.
-                     */
-                    double nbh_perSecond = (1 / ticksPerHash) + 1;
-                    if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
-                    nbh_perIteration = (XSUM_U32)nbh_perSecond;
-                }
-                /* g_nbIterations==0 => quick evaluation, no claim of accuracy */
-                if (g_nbIterations>0) {
-                    iterationNb--;   /* new round for a more accurate speed evaluation */
-                    continue;
-                }
-            }
-            if (ticksPerHash < fastestH) fastestH = ticksPerHash;
-            if (fastestH>0.) { /* avoid div by zero */
-                XSUM_logVerbose(2, "%2u-%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \r",
-                            iterationNb,
-                            HASHNAME_MAX, HASHNAME_MAX, hName,
-                            (unsigned)bufferSize,
-                            (double)1 / fastestH,
-                            ((double)bufferSize / (1 MB)) / fastestH);
-        }   }
-        {   double nbh_perSecond = (1 / fastestH) + 1;
-            if (nbh_perSecond > (double)(4000U<<20)) nbh_perSecond = (double)(4000U<<20);   /* avoid overflow */
-            nbh_perIteration = (XSUM_U32)nbh_perSecond;
-        }
-    }
-    XSUM_logVerbose(1, "%2i#%-*.*s : %10u -> %8.0f it/s (%7.1f MB/s) \n",
-                    testID,
-                    HASHNAME_MAX, HASHNAME_MAX, hName,
-                    (unsigned)bufferSize,
-                    (double)1 / fastestH,
-                    ((double)bufferSize / (1 MB)) / fastestH);
-    if (XSUM_logLevel<1)
-        XSUM_logVerbose(0, "%u, ", (unsigned)((double)1 / fastestH));
-}
-
-
-/*!
- * XSUM_benchMem():
- * buffer: Must be 16-byte aligned.
- * The real allocated size of buffer is supposed to be >= (bufferSize+3).
- * returns: 0 on success, 1 if error (invalid mode selected)
- */
-static void XSUM_benchMem(const void* buffer, size_t bufferSize)
-{
-    assert((((size_t)buffer) & 15) == 0);  /* ensure alignment */
-    XSUM_fillTestBuffer(g_benchSecretBuf, sizeof(g_benchSecretBuf));
-    {   int i;
-        for (i = 1; i < (int)NB_TESTFUNC; i++) {
-            int const hashFuncID = (i-1) / 2;
-            assert(g_hashesToBench[hashFuncID].name != NULL);
-            if (g_testIDs[i] == 0) continue;
-            /* aligned */
-            if ((i % 2) == 1) {
-                XSUM_benchHash(g_hashesToBench[hashFuncID].func, g_hashesToBench[hashFuncID].name, i, buffer, bufferSize);
-            }
-            /* unaligned */
-            if ((i % 2) == 0) {
-                /* Append "unaligned". */
-                char* const hashNameBuf = XSUM_strcatDup(g_hashesToBench[hashFuncID].name, " unaligned");
-                assert(hashNameBuf != NULL);
-                XSUM_benchHash(g_hashesToBench[hashFuncID].func, hashNameBuf, i, ((const char*)buffer)+3, bufferSize);
-                free(hashNameBuf);
-            }
-    }   }
-}
-
-static size_t XSUM_selectBenchedSize(const char* fileName)
-{
-    XSUM_U64 const inFileSize = XSUM_getFileSize(fileName);
-    size_t benchedSize = (size_t) XSUM_findMaxMem(inFileSize);
-    if ((XSUM_U64)benchedSize > inFileSize) benchedSize = (size_t)inFileSize;
-    if (benchedSize < inFileSize) {
-        XSUM_log("Not enough memory for '%s' full size; testing %i MB only...\n", fileName, (int)(benchedSize>>20));
-    }
-    return benchedSize;
-}
-
-
-static int XSUM_benchFiles(char*const* fileNamesTable, int nbFiles)
-{
-    int fileIdx;
-    for (fileIdx=0; fileIdx<nbFiles; fileIdx++) {
-        const char* const inFileName = fileNamesTable[fileIdx];
-        assert(inFileName != NULL);
-
-        {   FILE* const inFile = XSUM_fopen( inFileName, "rb" );
-            size_t const benchedSize = XSUM_selectBenchedSize(inFileName);
-            char* const buffer = (char*)calloc(benchedSize+16+3, 1);
-            void* const alignedBuffer = (buffer+15) - (((size_t)(buffer+15)) & 0xF);  /* align on next 16 bytes */
-
-            /* Checks */
-            if (inFile==NULL){
-                XSUM_log("Error: Could not open '%s': %s.\n", inFileName, strerror(errno));
-                free(buffer);
-                exit(11);
-            }
-            if(!buffer) {
-                XSUM_log("\nError: Out of memory.\n");
-                fclose(inFile);
-                exit(12);
-            }
-
-            /* Fill input buffer */
-            {   size_t const readSize = fread(alignedBuffer, 1, benchedSize, inFile);
-                fclose(inFile);
-                if(readSize != benchedSize) {
-                    XSUM_log("\nError: Could not read '%s': %s.\n", inFileName, strerror(errno));
-                    free(buffer);
-                    exit(13);
-            }   }
-
-            /* bench */
-            XSUM_benchMem(alignedBuffer, benchedSize);
-
-            free(buffer);
-    }   }
-    return 0;
-}
-
-
-static int XSUM_benchInternal(size_t keySize)
-{
-    void* const buffer = calloc(keySize+16+3, 1);
-    if (buffer == NULL) {
-        XSUM_log("\nError: Out of memory.\n");
-        exit(12);
-    }
-
-    {   const void* const alignedBuffer = ((char*)buffer+15) - (((size_t)((char*)buffer+15)) & 0xF);  /* align on next 16 bytes */
-
-        /* bench */
-        XSUM_logVerbose(1, "Sample of ");
-        if (keySize > 10 KB) {
-            XSUM_logVerbose(1, "%u KB", (unsigned)(keySize >> 10));
-        } else {
-            XSUM_logVerbose(1, "%u bytes", (unsigned)keySize);
-        }
-        XSUM_logVerbose(1, "...        \n");
-
-        XSUM_benchMem(alignedBuffer, keySize);
-        free(buffer);
-    }
-    return 0;
-}
 
 /* ********************************************************
 *  File Hashing
 **********************************************************/
 
+#define XXHSUM32_DEFAULT_SEED 0                   /* Default seed for algo_xxh32 */
+#define XXHSUM64_DEFAULT_SEED 0                   /* Default seed for algo_xxh64 */
+
 /* for support of --little-endian display mode */
 static void XSUM_display_LittleEndian(const void* ptr, size_t length)
 {
@@ -729,7 +327,7 @@ static int XSUM_hashFile(const char* fileName,
  * XSUM_hashFiles:
  * If fnTotal==0, read from stdin instead.
  */
-static int XSUM_hashFiles(char*const * fnList, int fnTotal,
+static int XSUM_hashFiles(const char* fnList[], int fnTotal,
                           AlgoSelected hashType,
                           Display_endianess displayEndianess,
                           Display_convention convention)
@@ -1241,7 +839,7 @@ static int XSUM_checkFile(const char* inFileName,
 }
 
 
-static int XSUM_checkFiles(char*const* fnList, int fnTotal,
+static int XSUM_checkFiles(const char* fnList[], int fnTotal,
                            const Display_endianess displayEndianess,
                            XSUM_U32 strictMode,
                            XSUM_U32 statusOnly,
@@ -1290,7 +888,7 @@ static int XSUM_usage_advanced(const char* exename)
     XSUM_log( "      --little-endian  Checksum values use little endian convention (default: big endian) \n");
     XSUM_log( "  -b                   Run benchmark \n");
     XSUM_log( "  -b#                  Bench only algorithm variant # \n");
-    XSUM_log( "  -i#                  Number of times to run the benchmark (default: %u) \n", (unsigned)g_nbIterations);
+    XSUM_log( "  -i#                  Number of times to run the benchmark (default: %u) \n", NBLOOPS_DEFAULT);
     XSUM_log( "  -q, --quiet          Don't display version header in benchmark mode \n");
     XSUM_log( "\n");
     XSUM_log( "The following four options are useful only when verifying checksums (-c): \n");
@@ -1371,7 +969,7 @@ static XSUM_U32 XSUM_readU32FromChar(const char** stringPtr) {
     return result;
 }
 
-XSUM_API int XSUM_main(int argc, char* argv[])
+XSUM_API int XSUM_main(int argc, const char* argv[])
 {
     int i, filenamesStart = 0;
     const char* const exename = XSUM_lastNameFromPath(argv[0]);
@@ -1387,6 +985,7 @@ XSUM_API int XSUM_main(int argc, char* argv[])
     AlgoSelected algo     = g_defaultAlgo;
     Display_endianess displayEndianess = big_endian;
     Display_convention convention = display_gnu;
+    int nbIterations = NBLOOPS_DEFAULT;
 
     /* special case: xxhNNsum default to NN bits checksum */
     if (strstr(exename,  "xxh32sum") != NULL) algo = g_defaultAlgo = algo_xxh32;
@@ -1468,17 +1067,18 @@ XSUM_API int XSUM_main(int argc, char* argv[])
                 do {
                     if (*argument == ',') argument++;
                     selectBenchIDs = XSUM_readU32FromChar(&argument); /* select one specific test */
-                    if (selectBenchIDs < NB_TESTFUNC) {
+                    if ((int)selectBenchIDs < g_nbTestFunctions) {
                         g_testIDs[selectBenchIDs] = 1;
-                    } else
+                    } else {
                         selectBenchIDs = kBenchAll;
+                    }
                 } while (*argument == ',');
                 break;
 
             /* Modify Nb Iterations (benchmark only) */
             case 'i':
                 argument++;
-                g_nbIterations = XSUM_readU32FromChar(&argument);
+                nbIterations = (int)XSUM_readU32FromChar(&argument);
                 break;
 
             /* Modify Block size (benchmark only) */
@@ -1503,8 +1103,9 @@ XSUM_API int XSUM_main(int argc, char* argv[])
     if (benchmarkMode) {
         XSUM_logVerbose(2, FULL_WELCOME_MESSAGE(exename) );
         XSUM_sanityCheck();
-        if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, sizeof(g_testIDs));
-        if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, sizeof(g_testIDs));
+        g_nbIterations = nbIterations;
+        if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, g_nbTestFunctions);
+        if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, g_nbTestFunctions);
         if (filenamesStart==0) return XSUM_benchInternal(keySize);
         return XSUM_benchFiles(argv+filenamesStart, argc-filenamesStart);
     }
diff --git a/cmake_unofficial/CMakeLists.txt b/cmake_unofficial/CMakeLists.txt
index 7acc8b1f..d5456b01 100644
--- a/cmake_unofficial/CMakeLists.txt
+++ b/cmake_unofficial/CMakeLists.txt
@@ -92,6 +92,7 @@ if(XXHASH_BUILD_XXHSUM)
                         "${XXHSUM_DIR}/xsum_os_specific.c"
                         "${XXHSUM_DIR}/xsum_output.c"
                         "${XXHSUM_DIR}/xsum_sanity_check.c"
+                        "${XXHSUM_DIR}/xsum_bench.c"
                 )
   add_executable(${PROJECT_NAME}::xxhsum ALIAS xxhsum)
 

From 2b8fc59ccdec20261a4792f68c2585a1181ef5de Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 11:01:13 -0800
Subject: [PATCH 174/187] fix minor conversion warning

---
 .gitignore   | 2 +-
 cli/xxhsum.c | 4 ++--
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/.gitignore b/.gitignore
index d0ce9aac..e8645286 100644
--- a/.gitignore
+++ b/.gitignore
@@ -19,7 +19,7 @@ xxhsum_inlinedXXH
 dispatch
 tests/generate_unicode_test
 
-# compilation chain
+# local conf
 .clang_complete
 
 # Mac OS-X artefacts
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index 864323a0..21b80f1c 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -1104,8 +1104,8 @@ XSUM_API int XSUM_main(int argc, const char* argv[])
         XSUM_logVerbose(2, FULL_WELCOME_MESSAGE(exename) );
         XSUM_sanityCheck();
         g_nbIterations = nbIterations;
-        if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, g_nbTestFunctions);
-        if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, g_nbTestFunctions);
+        if (selectBenchIDs == 0) memcpy(g_testIDs, k_testIDs_default, (size_t)g_nbTestFunctions);
+        if (selectBenchIDs == kBenchAll) memset(g_testIDs, 1, (size_t)g_nbTestFunctions);
         if (filenamesStart==0) return XSUM_benchInternal(keySize);
         return XSUM_benchFiles(argv+filenamesStart, argc-filenamesStart);
     }

From 2257be33b487cb13c8cbd644958eab3f0c0ec7d0 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 11:22:20 -0800
Subject: [PATCH 175/187] fix annoying mingw conversion warnings

---
 cli/xsum_os_specific.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/cli/xsum_os_specific.c b/cli/xsum_os_specific.c
index b34a359a..a5d6897c 100644
--- a/cli/xsum_os_specific.c
+++ b/cli/xsum_os_specific.c
@@ -383,7 +383,7 @@ static int XSUM_wmain(int argc, wchar_t* utf16_argv[])
         setvbuf(stderr, NULL, _IONBF, 0);
 
         /* Call our real main function */
-        ret = XSUM_main(argc, utf8_argv);
+        ret = XSUM_main(argc, (void*)utf8_argv);
 
         /* Cleanup */
         XSUM_freeArgv(argc, utf8_argv);
@@ -439,7 +439,7 @@ int __cdecl __wgetmainargs(
     _startupinfo* StartInfo
 );
 
-int main(int ansi_argc, char* ansi_argv[])
+int main(int ansi_argc, const char* ansi_argv[])
 {
     int       utf16_argc;
     wchar_t** utf16_argv;

From 29353eb21f09a45c5c07b151232d9a73cbcc7132 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 11:42:51 -0800
Subject: [PATCH 176/187] reduce dependencies

---
 cli/xsum_bench.c        | 10 ++++------
 cli/xsum_os_specific.c  |  7 +------
 cli/xsum_output.c       |  4 +---
 cli/xsum_sanity_check.c | 10 +++++-----
 cli/xxhsum.c            | 17 ++++++-----------
 5 files changed, 17 insertions(+), 31 deletions(-)

diff --git a/cli/xsum_bench.c b/cli/xsum_bench.c
index 1a8988c3..25678177 100644
--- a/cli/xsum_bench.c
+++ b/cli/xsum_bench.c
@@ -23,20 +23,18 @@
  *   - xxHash source repository: https://github.com/Cyan4973/xxHash
  */
 
-#include "xsum_config.h"
-#include "xsum_output.h"
+#include "xsum_output.h"  /* XSUM_logLevel */
 #include "xsum_bench.h"
 #include "xsum_sanity_check.h" /* XSUM_fillTestBuffer */
 #include "xsum_os_specific.h"  /* XSUM_getFileSize */
-#include <stdlib.h>
-#include <assert.h>
-#include <string.h>
 #ifndef XXH_STATIC_LINKING_ONLY
 #  define XXH_STATIC_LINKING_ONLY
 #endif
 #include "../xxhash.h"
 
-#include <stdio.h>  /* FILE */
+#include <stdlib.h>  /* malloc, free */
+#include <assert.h>
+#include <string.h>  /* strlen, memcpy */
 #include <time.h>   /* clock_t, clock, CLOCKS_PER_SEC */
 #include <errno.h>  /* errno */
 
diff --git a/cli/xsum_os_specific.c b/cli/xsum_os_specific.c
index a5d6897c..c38bd39a 100644
--- a/cli/xsum_os_specific.c
+++ b/cli/xsum_os_specific.c
@@ -23,12 +23,7 @@
  *   - xxHash source repository: https://github.com/Cyan4973/xxHash
  */
 
-#include "xsum_config.h"
-#include "xsum_os_specific.h"
-#include <stdio.h>
-#include <stdarg.h>
-#include <stdlib.h>
-#include <sys/types.h>  /* struct stat / __wstat64 */
+#include "xsum_os_specific.h"  /* XSUM_API */
 #include <sys/stat.h>   /* stat() / _stat64() */
 
 /*
diff --git a/cli/xsum_output.c b/cli/xsum_output.c
index a4d74115..c0f15f53 100644
--- a/cli/xsum_output.c
+++ b/cli/xsum_output.c
@@ -23,9 +23,7 @@
  *   - xxHash source repository: https://github.com/Cyan4973/xxHash
  */
 
-#include "xsum_output.h"
-#include "xsum_os_specific.h"
-#include <stdio.h>
+#include "xsum_os_specific.h"  /* XSUM_API */
 
 int XSUM_logLevel = 2;
 
diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 3f462b7f..ee8821ee 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -23,17 +23,17 @@
  *   - xxHash source repository: https://github.com/Cyan4973/xxHash
  */
 
-#include "xsum_config.h"
 #include "xsum_sanity_check.h"
-#include "xsum_output.h"
-#include <stdlib.h>
-#include <assert.h>
-#include <string.h>
+#include "xsum_output.h"  /* XSUM_log */
 #ifndef XXH_STATIC_LINKING_ONLY
 #  define XXH_STATIC_LINKING_ONLY
 #endif
 #include "../xxhash.h"
 
+#include <stdlib.h>  /* exit */
+#include <assert.h>
+#include <string.h>  /* memcmp */
+
 /* use #define to make them constant, required for initialization */
 #define PRIME32 2654435761U
 #define PRIME64 11400714785074694797ULL
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index 21b80f1c..3d64e8e5 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -30,12 +30,11 @@
  */
 
 /* Transitional headers */
-#include "xsum_config.h"
-#include "xsum_arch.h"
-#include "xsum_os_specific.h"
-#include "xsum_output.h"
-#include "xsum_sanity_check.h"
-#include "xsum_bench.h"
+#include "xsum_arch.h"         /* XSUM_PROGRAM_VERSION */
+#include "xsum_os_specific.h"  /* XSUM_setBinaryMode */
+#include "xsum_output.h"       /* XSUM_output */
+#include "xsum_sanity_check.h" /* XSUM_sanityCheck */
+#include "xsum_bench.h"        /* NBLOOPS_DEFAULT */
 #ifdef XXH_INLINE_ALL
 #  include "xsum_os_specific.c"
 #  include "xsum_output.c"
@@ -46,12 +45,8 @@
 /* ************************************
  *  Includes
  **************************************/
-#include <limits.h>
 #include <stdlib.h>     /* malloc, calloc, free, exit */
-#include <string.h>     /* strcmp, memcpy */
-#include <stdio.h>      /* fprintf, fopen, ftello64, fread, stdin, stdout, _fileno (when present) */
-#include <sys/types.h>  /* stat, stat64, _stat64 */
-#include <sys/stat.h>   /* stat, stat64, _stat64 */
+#include <string.h>     /* strerror, strcmp, memcpy */
 #include <assert.h>     /* assert */
 #include <errno.h>      /* errno */
 

From ca3a9923c4f02d433a05782265f6a341d0dd6401 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 16:53:56 -0800
Subject: [PATCH 177/187] added XXH3 to the list of possible hashes

using command `-H3`.
Specifying `-H3` necessarily triggers BSD-style `--tag` mode,
in order to prevent any possibility of confusion with XXH64.
---
 Makefile     |  13 +++++-
 cli/xxhsum.c | 113 ++++++++++++++++++++++++++++-----------------------
 2 files changed, 75 insertions(+), 51 deletions(-)

diff --git a/Makefile b/Makefile
index dc1db341..2fbefbc6 100644
--- a/Makefile
+++ b/Makefile
@@ -206,6 +206,8 @@ check: xxhsum   ## basic tests for xxhsum CLI, set RUN_ENV for emulated environm
 	$(RUN_ENV) ./xxhsum$(EXT) -H0 xxhash.c
 	# 128-bit
 	$(RUN_ENV) ./xxhsum$(EXT) -H2 xxhash.c
+	# XXH3 (enforce BSD style)
+	$(RUN_ENV) ./xxhsum$(EXT) -H3 xxhash.c | grep "XXH3"
 	# request incorrect variant
 	$(RUN_ENV) ./xxhsum$(EXT) -H9 xxhash.c ; test $$? -eq 1
 	@printf "\n .......   checks completed successfully   ....... \n"
@@ -252,12 +254,18 @@ test-xxhsum-c: xxhsum
 	./xxhsum -H2 --tag $(TEST_FILES) > .test.xxh128_tag
 	./xxhsum -H2 --little-endian $(TEST_FILES) > .test.le_xxh128
 	./xxhsum -H2 --tag --little-endian $(TEST_FILES) > .test.le_xxh128_tag
+	./xxhsum -H3 $(TEST_FILES) > .test.xxh3
+	./xxhsum -H3 --tag $(TEST_FILES) > .test.xxh3_tag
+	./xxhsum -H3 --little-endian $(TEST_FILES) > .test.le_xxh3
+	./xxhsum -H3 --tag --little-endian $(TEST_FILES) > .test.le_xxh3_tag
 	./xxhsum -c .test.xxh*
 	./xxhsum -c --little-endian .test.le_xxh*
 	./xxhsum -c .test.*_tag
 	# read list of files from stdin
-	./xxhsum -c < .test.xxh64
 	./xxhsum -c < .test.xxh32
+	./xxhsum -c < .test.xxh64
+	./xxhsum -c < .test.xxh128
+	./xxhsum -c < .test.xxh3
 	cat .test.xxh* | ./xxhsum -c -
 	# check variant with '*' marker as second separator
 	$(SED) 's/  / \*/' .test.xxh32 | ./xxhsum -c
@@ -266,12 +274,15 @@ test-xxhsum-c: xxhsum
 	./xxhsum --tag -H0 xxhsum* | $(GREP) XXH32
 	./xxhsum --tag -H1 xxhsum* | $(GREP) XXH64
 	./xxhsum --tag -H2 xxhsum* | $(GREP) XXH128
+	./xxhsum --tag -H3 xxhsum* | $(GREP) XXH3
+	./xxhsum       -H3 xxhsum* | $(GREP) XXH3  # --tag is implicit for H3
 	./xxhsum --tag -H32 xxhsum* | $(GREP) XXH32
 	./xxhsum --tag -H64 xxhsum* | $(GREP) XXH64
 	./xxhsum --tag -H128 xxhsum* | $(GREP) XXH128
 	./xxhsum --tag -H0 --little-endian xxhsum* | $(GREP) XXH32_LE
 	./xxhsum --tag -H1 --little-endian xxhsum* | $(GREP) XXH64_LE
 	./xxhsum --tag -H2 --little-endian xxhsum* | $(GREP) XXH128_LE
+	./xxhsum       -H3 --little-endian xxhsum* | $(GREP) XXH3_LE
 	./xxhsum --tag -H32 --little-endian xxhsum* | $(GREP) XXH32_LE
 	./xxhsum --tag -H64 --little-endian xxhsum* | $(GREP) XXH64_LE
 	./xxhsum --tag -H128 --little-endian xxhsum* | $(GREP) XXH128_LE
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index 3d64e8e5..bca42eee 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -77,7 +77,7 @@ static const char author[] = "Yann Collet";
 
 static const char stdinName[] = "-";
 static const char stdinFileName[] = "stdin";
-typedef enum { algo_xxh32=0, algo_xxh64=1, algo_xxh128=2 } AlgoSelected;
+typedef enum { algo_xxh32=0, algo_xxh64=1, algo_xxh128=2, algo_xxh3=3 } AlgoSelected;
 static AlgoSelected g_defaultAlgo = algo_xxh64;    /* required within main() & XSUM_usage() */
 
 /* <16 hex char> <SPC> <SPC> <filename> <'\0'>
@@ -115,9 +115,9 @@ static void XSUM_display_BigEndian(const void* ptr, size_t length)
 }
 
 typedef union {
-    XXH32_hash_t   xxh32;
-    XXH64_hash_t   xxh64;
-    XXH128_hash_t xxh128;
+    XXH32_hash_t  hash32;
+    XXH64_hash_t  hash64;  /* also for xxh3_64bits */
+    XXH128_hash_t hash128;
 } Multihash;
 
 /*
@@ -132,12 +132,12 @@ XSUM_hashStream(FILE* inFile,
 {
     XXH32_state_t state32;
     XXH64_state_t state64;
-    XXH3_state_t state128;
+    XXH3_state_t  state3;
 
     /* Init */
     (void)XXH32_reset(&state32, XXHSUM32_DEFAULT_SEED);
     (void)XXH64_reset(&state64, XXHSUM64_DEFAULT_SEED);
-    (void)XXH3_128bits_reset(&state128);
+    (void)XXH3_128bits_reset(&state3);
 
     /* Load file & update hash */
     {   size_t readSize;
@@ -151,7 +151,10 @@ XSUM_hashStream(FILE* inFile,
                 (void)XXH64_update(&state64, buffer, readSize);
                 break;
             case algo_xxh128:
-                (void)XXH3_128bits_update(&state128, buffer, readSize);
+                (void)XXH3_128bits_update(&state3, buffer, readSize);
+                break;
+            case algo_xxh3:
+                (void)XXH3_64bits_update(&state3, buffer, readSize);
                 break;
             default:
                 assert(0);
@@ -166,13 +169,16 @@ XSUM_hashStream(FILE* inFile,
         switch(hashType)
         {
         case algo_xxh32:
-            finalHash.xxh32 = XXH32_digest(&state32);
+            finalHash.hash32 = XXH32_digest(&state32);
             break;
         case algo_xxh64:
-            finalHash.xxh64 = XXH64_digest(&state64);
+            finalHash.hash64 = XXH64_digest(&state64);
             break;
         case algo_xxh128:
-            finalHash.xxh128 = XXH3_128bits_digest(&state128);
+            finalHash.hash128 = XXH3_128bits_digest(&state3);
+            break;
+        case algo_xxh3:
+            finalHash.hash64 = XXH3_64bits_digest(&state3);
             break;
         default:
             assert(0);
@@ -182,9 +188,9 @@ XSUM_hashStream(FILE* inFile,
 }
 
                                        /* algo_xxh32, algo_xxh64, algo_xxh128 */
-static const char* XSUM_algoName[] =    { "XXH32",    "XXH64",    "XXH128" };
-static const char* XSUM_algoLE_name[] = { "XXH32_LE", "XXH64_LE", "XXH128_LE" };
-static const size_t XSUM_algoLength[] = { 4,          8,          16 };
+static const char* XSUM_algoName[] =    { "XXH32",    "XXH64",    "XXH128",    "XXH3" };
+static const char* XSUM_algoLE_name[] = { "XXH32_LE", "XXH64_LE", "XXH128_LE", "XXH3_LE" };
+static const size_t XSUM_algoLength[] = { 4,          8,          16,          8 };
 
 #define XSUM_TABLE_ELT_SIZE(table)   (sizeof(table) / sizeof(*table))
 
@@ -294,22 +300,28 @@ static int XSUM_hashFile(const char* fileName,
     {
     case algo_xxh32:
         {   XXH32_canonical_t hcbe32;
-            (void)XXH32_canonicalFromHash(&hcbe32, hashValue.xxh32);
+            (void)XXH32_canonicalFromHash(&hcbe32, hashValue.hash32);
             f_displayLine(fileName, &hcbe32, hashType);
             break;
         }
     case algo_xxh64:
         {   XXH64_canonical_t hcbe64;
-            (void)XXH64_canonicalFromHash(&hcbe64, hashValue.xxh64);
+            (void)XXH64_canonicalFromHash(&hcbe64, hashValue.hash64);
             f_displayLine(fileName, &hcbe64, hashType);
             break;
         }
     case algo_xxh128:
         {   XXH128_canonical_t hcbe128;
-            (void)XXH128_canonicalFromHash(&hcbe128, hashValue.xxh128);
+            (void)XXH128_canonicalFromHash(&hcbe128, hashValue.hash128);
             f_displayLine(fileName, &hcbe128, hashType);
             break;
         }
+    case algo_xxh3:
+        {   XXH64_canonical_t hcbe64;
+            (void)XXH64_canonicalFromHash(&hcbe64, hashValue.hash64);
+            f_displayLine(fileName, &hcbe64, hashType);
+            break;
+        }
     default:
         assert(0);  /* not possible */
     }
@@ -370,9 +382,9 @@ typedef union {
 } Canonical;
 
 typedef struct {
-    Canonical   canonical;
-    const char* filename;
-    int         xxhBits;    /* canonical type: 32:xxh32, 64:xxh64, 128:xxh128 */
+    Canonical    canonical;
+    const char*  filename;
+    AlgoSelected algo;
 } ParsedLine;
 
 typedef struct {
@@ -391,10 +403,10 @@ typedef struct {
     char*           lineBuf;
     size_t          blockSize;
     char*           blockBuf;
-    XSUM_U32             strictMode;
-    XSUM_U32             statusOnly;
-    XSUM_U32             warn;
-    XSUM_U32             quiet;
+    XSUM_U32        strictMode;
+    XSUM_U32        statusOnly;
+    XSUM_U32        warn;
+    XSUM_U32        quiet;
     ParseFileReport report;
 } ParseFileArg;
 
@@ -528,8 +540,7 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
     size_t hash_len;
 
     parsedLine->filename = NULL;
-    parsedLine->xxhBits = 0;
-
+    parsedLine->algo = algo_xxh64; /* default */
     if (firstSpace == NULL || !firstSpace[1]) return ParseLine_invalidFormat;
 
     if (firstSpace[1] == '(') {
@@ -541,9 +552,7 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
         rev = strstr(line, "_LE") != NULL; /* was output little-endian */
         hash_ptr = lastSpace + 1;
         hash_len = strlen(hash_ptr);
-        /* NOTE: This currently ignores the hash description at the start of the string.
-         * In the future we should parse it and verify that it matches the hash length.
-         * It could also be used to allow both XXH64 & XXH3_64bits to be differentiated. */
+        if (!memcmp(line, "XXH3", 4)) parsedLine->algo = algo_xxh3;
     } else {
         hash_ptr = line;
         hash_len = (size_t)(firstSpace - line);
@@ -557,7 +566,7 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
-            parsedLine->xxhBits = 32;
+            parsedLine->algo = algo_xxh32;
             break;
         }
 
@@ -567,7 +576,7 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
-            parsedLine->xxhBits = 64;
+            assert(parsedLine->algo == algo_xxh3 || parsedLine->algo == algo_xxh64);
             break;
         }
 
@@ -577,7 +586,7 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
-            parsedLine->xxhBits = 128;
+            parsedLine->algo = algo_xxh128;
             break;
         }
 
@@ -670,31 +679,31 @@ static void XSUM_parseFile1(ParseFileArg* XSUM_parseFileArg, int rev)
                 break;
             }
             lineStatus = LineStatus_hashFailed;
-            switch (parsedLine.xxhBits)
-            {
-            case 32:
-                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh32, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
-                    if (xxh.xxh32 == XXH32_hashFromCanonical(&parsedLine.canonical.xxh32)) {
+            {   Multihash const xxh = XSUM_hashStream(fp, parsedLine.algo, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
+                switch (parsedLine.algo)
+                {
+                case algo_xxh32:
+                    if (xxh.hash32 == XXH32_hashFromCanonical(&parsedLine.canonical.xxh32)) {
                         lineStatus = LineStatus_hashOk;
-                }   }
-                break;
+                    }
+                    break;
 
-            case 64:
-                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh64, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
-                    if (xxh.xxh64 == XXH64_hashFromCanonical(&parsedLine.canonical.xxh64)) {
+                case algo_xxh64:
+                case algo_xxh3:
+                    if (xxh.hash64 == XXH64_hashFromCanonical(&parsedLine.canonical.xxh64)) {
                         lineStatus = LineStatus_hashOk;
-                }   }
-                break;
+                    }
+                    break;
 
-            case 128:
-                {   Multihash const xxh = XSUM_hashStream(fp, algo_xxh128, XSUM_parseFileArg->blockBuf, XSUM_parseFileArg->blockSize);
-                    if (XXH128_isEqual(xxh.xxh128, XXH128_hashFromCanonical(&parsedLine.canonical.xxh128))) {
+                case algo_xxh128:
+                    if (XXH128_isEqual(xxh.hash128, XXH128_hashFromCanonical(&parsedLine.canonical.xxh128))) {
                         lineStatus = LineStatus_hashOk;
-                }   }
-                break;
+                    }
+                    break;
 
-            default:
-                break;
+                default:
+                    break;
+                }
             }
             if (fp != stdin) fclose(fp);
         } while (0);
@@ -1038,6 +1047,10 @@ XSUM_API int XSUM_main(int argc, const char* argv[])
                     case 64: algo = algo_xxh64; break;
                     case 2 :
                     case 128: algo = algo_xxh128; break;
+                    case 3 : /* xxh3 - necessarily uses BSD convention to avoid confusion with XXH64 */
+                        algo = algo_xxh3;
+                        convention = display_bsd;
+                        break;
                     default:
                         return XSUM_badusage(exename);
                 }

From 35cc12401cd9670c30b2d82384dbd0517c994cef Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 17:14:34 -0800
Subject: [PATCH 178/187] more thorough format validation with --check

in --tag mode, will detect mismatch between algo name and algo width.
---
 cli/xxhsum.c | 14 ++++++++++----
 1 file changed, 10 insertions(+), 4 deletions(-)

diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index bca42eee..ae702bf5 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -540,7 +540,7 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
     size_t hash_len;
 
     parsedLine->filename = NULL;
-    parsedLine->algo = algo_xxh64; /* default */
+    parsedLine->algo = algo_xxh64; /* default - will be overwritten */
     if (firstSpace == NULL || !firstSpace[1]) return ParseLine_invalidFormat;
 
     if (firstSpace[1] == '(') {
@@ -553,40 +553,46 @@ static ParseLineResult XSUM_parseLine(ParsedLine* parsedLine, char* line, int re
         hash_ptr = lastSpace + 1;
         hash_len = strlen(hash_ptr);
         if (!memcmp(line, "XXH3", 4)) parsedLine->algo = algo_xxh3;
+        if (!memcmp(line, "XXH32", 5)) parsedLine->algo = algo_xxh32;
+        if (!memcmp(line, "XXH64", 5)) parsedLine->algo = algo_xxh64;
+        if (!memcmp(line, "XXH128", 6)) parsedLine->algo = algo_xxh128;
     } else {
         hash_ptr = line;
         hash_len = (size_t)(firstSpace - line);
+        if (hash_len==8) parsedLine->algo = algo_xxh32;
+        if (hash_len==16) parsedLine->algo = algo_xxh64;
+        if (hash_len==32) parsedLine->algo = algo_xxh128;
     }
 
     switch (hash_len)
     {
     case 8:
+        if (parsedLine->algo != algo_xxh32) return ParseLine_invalidFormat;
         {   XXH32_canonical_t* xxh32c = &parsedLine->canonical.xxh32;
             if (XSUM_canonicalFromString(xxh32c->digest, sizeof(xxh32c->digest), hash_ptr, rev)
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
-            parsedLine->algo = algo_xxh32;
             break;
         }
 
     case 16:
+        if (parsedLine->algo != algo_xxh64 && parsedLine->algo != algo_xxh3) return ParseLine_invalidFormat;
         {   XXH64_canonical_t* xxh64c = &parsedLine->canonical.xxh64;
             if (XSUM_canonicalFromString(xxh64c->digest, sizeof(xxh64c->digest), hash_ptr, rev)
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
-            assert(parsedLine->algo == algo_xxh3 || parsedLine->algo == algo_xxh64);
             break;
         }
 
     case 32:
+        if (parsedLine->algo != algo_xxh128) return ParseLine_invalidFormat;
         {   XXH128_canonical_t* xxh128c = &parsedLine->canonical.xxh128;
             if (XSUM_canonicalFromString(xxh128c->digest, sizeof(xxh128c->digest), hash_ptr, rev)
                 != CanonicalFromString_ok) {
                 return ParseLine_invalidFormat;
             }
-            parsedLine->algo = algo_xxh128;
             break;
         }
 

From d14507f4adb5d4e5498a4cdf6a134fe40fa5ae1e Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 17:35:46 -0800
Subject: [PATCH 179/187] updated documentation for `-H3`

---
 cli/xxhsum.1.md | 4 ++--
 cli/xxhsum.c    | 2 +-
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/cli/xxhsum.1.md b/cli/xxhsum.1.md
index 85cd8fac..777d1cae 100644
--- a/cli/xxhsum.1.md
+++ b/cli/xxhsum.1.md
@@ -37,8 +37,8 @@ OPTIONS
   Displays xxhsum version and exits
 
 * `-H`<HASHTYPE>:
-  Hash selection. <HASHTYPE> means `0`=32bits, `1`=64bits, `2`=128bits.
-  Alternatively, <HASHTYPE> `32`=32bits, `64`=64bits, `128`=128bits.
+  Hash selection. <HASHTYPE> means `0`=XXH32, `1`=XXH64, `2`=XXH128, `3`=XXH3.
+  Alternatively, <HASHTYPE> `32`=XXH32, `64`=XXH64, `128`=XXH128.
   Default value is `1` (64bits)
 
 * `--tag`:
diff --git a/cli/xxhsum.c b/cli/xxhsum.c
index ae702bf5..9779aefb 100644
--- a/cli/xxhsum.c
+++ b/cli/xxhsum.c
@@ -882,7 +882,7 @@ static int XSUM_usage(const char* exename)
     XSUM_log( "Usage: %s [options] [files] \n\n", exename);
     XSUM_log( "When no filename provided or when '-' is provided, uses stdin as input. \n");
     XSUM_log( "Options: \n");
-    XSUM_log( "  -H#         algorithm selection: 0,1,2 or 32,64,128 (default: %i) \n", (int)g_defaultAlgo);
+    XSUM_log( "  -H#         algorithm selection: 0,1,2,3 or 32,64,128 (default: %i) \n", (int)g_defaultAlgo);
     XSUM_log( "  -c, --check read xxHash checksum from [files] and check them \n");
     XSUM_log( "  -h, --help  display a long help page about advanced options \n");
     return 0;

From 079d6bba3b037e316b4e899dc5e0dd20d0f93de6 Mon Sep 17 00:00:00 2001
From: Yann Collet <yann.collet.73@gmail.com>
Date: Sun, 28 Nov 2021 17:36:55 -0800
Subject: [PATCH 180/187] updated man page

---
 cli/xxhsum.1 | 65 +++-------------------------------------------------
 1 file changed, 3 insertions(+), 62 deletions(-)

diff --git a/cli/xxhsum.1 b/cli/xxhsum.1
index dd17108f..27e6808e 100644
--- a/cli/xxhsum.1
+++ b/cli/xxhsum.1
@@ -1,165 +1,106 @@
-.
-.TH "XXHSUM" "1" "July 2020" "xxhsum 0.7.4" "User Commands"
-.
+.TH "XXHSUM" "1" "November 2021" "xxhsum 0.8.1" "User Commands"
 .SH "NAME"
 \fBxxhsum\fR \- print or check xxHash non\-cryptographic checksums
-.
 .SH "SYNOPSIS"
-\fBxxhsum [<OPTION>] \.\.\. [<FILE>] \.\.\.\fR \fBxxhsum \-b [<OPTION>] \.\.\.\fR
-.
+\fBxxhsum [<OPTION>] \|\.\|\.\|\. [<FILE>] \|\.\|\.\|\.\fR \fBxxhsum \-b [<OPTION>] \|\.\|\.\|\.\fR
 .P
 \fBxxh32sum\fR is equivalent to \fBxxhsum \-H0\fR \fBxxh64sum\fR is equivalent to \fBxxhsum \-H1\fR \fBxxh128sum\fR is equivalent to \fBxxhsum \-H2\fR
-.
 .SH "DESCRIPTION"
 Print or check xxHash (32, 64 or 128 bits) checksums\. When no \fIFILE\fR, read standard input, except if it\'s the console\. When \fIFILE\fR is \fB\-\fR, read standard input even if it\'s the console\.
-.
 .P
 \fBxxhsum\fR supports a command line syntax similar but not identical to md5sum(1)\. Differences are: \fBxxhsum\fR doesn\'t have text/binary mode switch (\fB\-b\fR, \fB\-t\fR); \fBxxhsum\fR always treats files as binary file; \fBxxhsum\fR has a hash bit width switch (\fB\-H\fR);
-.
 .P
 As xxHash is a fast non\-cryptographic checksum algorithm, \fBxxhsum\fR should not be used for security related purposes\.
-.
 .P
 \fBxxhsum \-b\fR invokes benchmark mode\. See \fIOPTIONS\fR and \fIEXAMPLES\fR for details\.
-.
 .SH "OPTIONS"
-.
 .TP
 \fB\-V\fR, \fB\-\-version\fR
 Displays xxhsum version and exits
-.
 .TP
 \fB\-H\fR\fIHASHTYPE\fR
-Hash selection\. \fIHASHTYPE\fR means \fB0\fR=32bits, \fB1\fR=64bits, \fB2\fR=128bits\. Alternatively, \fIHASHTYPE\fR \fB32\fR=32bits, \fB64\fR=64bits, \fB128\fR=128bits\. Default value is \fB1\fR (64bits)
-.
+Hash selection\. \fIHASHTYPE\fR means \fB0\fR=XXH32, \fB1\fR=XXH64, \fB2\fR=XXH128, \fB3\fR=XXH3\. Alternatively, \fIHASHTYPE\fR \fB32\fR=XXH32, \fB64\fR=XXH64, \fB128\fR=XXH128\. Default value is \fB1\fR (64bits)
 .TP
 \fB\-\-tag\fR
 Output in the BSD style\.
-.
 .TP
 \fB\-\-little\-endian\fR
 Set output hexadecimal checksum value as little endian convention\. By default, value is displayed as big endian\.
-.
 .TP
 \fB\-h\fR, \fB\-\-help\fR
 Displays help and exits
-.
 .P
 \fBThe following four options are useful only when verifying checksums (\fB\-c\fR)\fR
-.
 .TP
 \fB\-c\fR, \fB\-\-check\fR \fIFILE\fR
 Read xxHash sums from \fIFILE\fR and check them
-.
 .TP
 \fB\-q\fR, \fB\-\-quiet\fR
 Don\'t print OK for each successfully verified file
-.
 .TP
 \fB\-\-strict\fR
 Return an error code if any line in the file is invalid, not just if some checksums are wrong\. This policy is disabled by default, though UI will prompt an informational message if any line in the file is detected invalid\.
-.
 .TP
 \fB\-\-status\fR
 Don\'t output anything\. Status code shows success\.
-.
 .TP
 \fB\-w\fR, \fB\-\-warn\fR
 Emit a warning message about each improperly formatted checksum line\.
-.
 .P
 \fBThe following options are useful only benchmark purpose\fR
-.
 .TP
 \fB\-b\fR
 Benchmark mode\. See \fIEXAMPLES\fR for details\.
-.
 .TP
 \fB\-b#\fR
 Specify ID of variant to be tested\. Multiple variants can be selected, separated by a \',\' comma\.
-.
 .TP
 \fB\-B\fR\fIBLOCKSIZE\fR
 Only useful for benchmark mode (\fB\-b\fR)\. See \fIEXAMPLES\fR for details\. \fIBLOCKSIZE\fR specifies benchmark mode\'s test data block size in bytes\. Default value is 102400
-.
 .TP
 \fB\-i\fR\fIITERATIONS\fR
 Only useful for benchmark mode (\fB\-b\fR)\. See \fIEXAMPLES\fR for details\. \fIITERATIONS\fR specifies number of iterations in benchmark\. Single iteration lasts approximately 1000 milliseconds\. Default value is 3
-.
 .SH "EXIT STATUS"
 \fBxxhsum\fR exit \fB0\fR on success, \fB1\fR if at least one file couldn\'t be read or doesn\'t have the same checksum as the \fB\-c\fR option\.
-.
 .SH "EXAMPLES"
 Output xxHash (64bit) checksum values of specific files to standard output
-.
 .IP "" 4
-.
 .nf
-
 $ xxhsum \-H1 foo bar baz
-.
 .fi
-.
 .IP "" 0
-.
 .P
 Output xxHash (32bit and 64bit) checksum values of specific files to standard output, and redirect it to \fBxyz\.xxh32\fR and \fBqux\.xxh64\fR
-.
 .IP "" 4
-.
 .nf
-
 $ xxhsum \-H0 foo bar baz > xyz\.xxh32
 $ xxhsum \-H1 foo bar baz > qux\.xxh64
-.
 .fi
-.
 .IP "" 0
-.
 .P
 Read xxHash sums from specific files and check them
-.
 .IP "" 4
-.
 .nf
-
 $ xxhsum \-c xyz\.xxh32 qux\.xxh64
-.
 .fi
-.
 .IP "" 0
-.
 .P
 Benchmark xxHash algorithm\. By default, \fBxxhsum\fR benchmarks xxHash main variants on a synthetic sample of 100 KB, and print results into standard output\. The first column is the algorithm, the second column is the source data size in bytes, the third column is the number of hashes generated per second (throughput), and finally the last column translates speed in megabytes per second\.
-.
 .IP "" 4
-.
 .nf
-
 $ xxhsum \-b
-.
 .fi
-.
 .IP "" 0
-.
 .P
 In the following example, the sample to hash is set to 16384 bytes, the variants to be benched are selected by their IDs, and each benchmark test is repeated 10 times, for increased accuracy\.
-.
 .IP "" 4
-.
 .nf
-
 $ xxhsum \-b1,2,3 \-i10 \-B16384
-.
 .fi
-.
 .IP "" 0
-.
 .SH "BUGS"
 Report bugs at: https://github\.com/Cyan4973/xxHash/issues/
-.
 .SH "AUTHOR"
 Yann Collet
-.
 .SH "SEE ALSO"
 md5sum(1)

From 57d44d5773e27f27c8aa221199d03c8f4b138d84 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 18:13:08 -0800
Subject: [PATCH 181/187] change XXH_REROLL into XXH32_ENDJMP

The multi-branches finalization of XXH32 is generally preferable,
so it's now selected by default.
The switch/case jump can be manually selected at compile time
by using the new build macro XXH32_ENDJMP.
---
 README.md |  5 +++--
 xxhash.h  | 31 ++++++++++++-------------------
 2 files changed, 15 insertions(+), 21 deletions(-)

diff --git a/README.md b/README.md
index 5ba73bb3..2406c8d2 100644
--- a/README.md
+++ b/README.md
@@ -117,8 +117,9 @@ The following macros can be set at compilation time to modify libxxhash's behavi
                          This is very useful when optimizing for smallest binary size,
                          and is automatically defined when compiling with `-O0`, `-Os`, `-Oz`, or `-fno-inline` on GCC and Clang.
                          This may also increase performance depending on compiler and architecture.
-- `XXH_REROLL`: Reduces the size of the generated code by not unrolling some loops.
-                Impact on performance may vary, depending on platform and algorithm.
+- `XXH32_ENDJMP`: Switch multi-branch finalization stage of XXH32 by a single jump.
+                  This is generally undesirable for performance, especially when hashing inputs of random sizes.
+                  But depending on exact architecture and compiler, a jump might provide slightly better performance on small inputs. Disabled by default.
 - `XXH_STATIC_LINKING_ONLY`: gives access to internal state declaration, required for static allocation.
                              Incompatible with dynamic linking, due to risks of ABI changes.
 - `XXH_NO_XXH3` : removes symbols related to `XXH3` (both 64 & 128 bits) from generated binary.
diff --git a/xxhash.h b/xxhash.h
index 4f9c1644..2f7986c0 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1377,18 +1377,16 @@ XXH3_128bits_reset_withSecretandSeed(XXH3_state_t* statePtr,
 #  define XXH_NO_INLINE_HINTS 0
 
 /*!
- * @def XXH_REROLL
- * @brief Whether to reroll `XXH32_finalize`.
+ * @def XXH32_ENDJMP
+ * @brief Whether to use a jump for `XXH32_finalize`.
  *
- * For performance, `XXH32_finalize` uses an unrolled loop
- * in the form of a switch statement.
+ * For performance, `XXH32_finalize` uses multiple branches in the finalizer.
+ * This is generally preferable for performance,
+ * but depending on exact architecture, a jmp may be preferable.
  *
- * This is not always desirable, as it generates larger code,
- * and depending on the architecture, may even be slower
- *
- * This is automatically defined with `-Os`/`-Oz` on GCC and Clang.
+ * This setting is only possibly making a difference for very small inputs.
  */
-#  define XXH_REROLL 0
+#  define XXH32_ENDJMP 0
 
 /*!
  * @internal
@@ -1442,14 +1440,9 @@ XXH3_128bits_reset_withSecretandSeed(XXH3_state_t* statePtr,
 #  endif
 #endif
 
-#ifndef XXH_REROLL
-#  if defined(__OPTIMIZE_SIZE__) /* -Os, -Oz */ || \
-     (defined(__GNUC__) && !defined(__clang__))
-     /* The if/then loop is preferable to switch/case on gcc (on x64) */
-#    define XXH_REROLL 1
-#  else
-#    define XXH_REROLL 0
-#  endif
+#ifndef XXH32_ENDJMP
+/* generally preferable for performance */
+#  define XXH32_ENDJMP 0
 #endif
 
 /*!
@@ -2015,8 +2008,8 @@ XXH32_finalize(xxh_u32 h32, const xxh_u8* ptr, size_t len, XXH_alignment align)
 
     if (ptr==NULL) XXH_ASSERT(len == 0);
 
-    /* Compact rerolled version */
-    if (XXH_REROLL) {
+    /* Compact rerolled version; generally faster */
+    if (!XXH32_ENDJMP) {
         len &= 15;
         while (len >= 4) {
             XXH_PROCESS4;

From 6eb4e03b3e074d15eb30ffe53a4ab7ad5b86c460 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 20:01:46 -0800
Subject: [PATCH 182/187] XXH3_generateSecret() can now generate secret of any
 size

as long as it's >= XX3_SECRET_SIZE_MIN .

Note that results produced by this version
are not comparable to results produced by earlier version.
XXH3_generateSecret() is still considered experimental,
aka, its result is not yet guaranteed to remain stable across versions.
---
 cli/xsum_sanity_check.c | 12 +++----
 xxhash.h                | 78 +++++++++++++++++++----------------------
 2 files changed, 43 insertions(+), 47 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index ee8821ee..5b268220 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -214,10 +214,10 @@ static const XSUM_testdata128_t XSUM_XXH128_withSecret_testdata[] = {
 };
 
 static const XSUM_testdata_sample_t XSUM_XXH3_generateSecret_testdata[] = {
-    {                              0, { 0xB8, 0x26, 0x83, 0x7E } },
-    {                              1, { 0xA6, 0x16, 0x06, 0x7B } },
-    {     XXH3_SECRET_SIZE_MIN -   1, { 0xDA, 0x2A, 0x12, 0x11 } },
-    { XXH3_SECRET_DEFAULT_SIZE + 500, { 0x7E, 0x48, 0x0C, 0xA7 } }
+    {                              0, { 0xE7, 0x8C, 0x77, 0x77 } },
+    {                              1, { 0x2B, 0x3E, 0xDE, 0x67 } },
+    {     XXH3_SECRET_SIZE_MIN -   1, { 0xE8, 0x39, 0x6C, 0x16 } },
+    { XXH3_SECRET_DEFAULT_SIZE + 500, { 0xD6, 0x1C, 0x41, 0x69 } }
 };
 
 static void XSUM_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
@@ -616,12 +616,12 @@ static void XSUM_testXXH128_withSecret(const void* data, const void* secret, siz
 static void XSUM_testSecretGenerator(const void* customSeed, const XSUM_testdata_sample_t* testData)
 {
     static int nbTests = 1;
-    const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};
+    const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};  /* position of sampled bytes */
     XSUM_U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
     XSUM_U8 samples[SECRET_SAMPLE_NBBYTES];
     int i;
 
-    XXH3_generateSecret(secretBuffer, customSeed, testData->len);
+    XXH3_generateSecret(secretBuffer, sizeof(secretBuffer), customSeed, testData->len);
     for (i=0; i<SECRET_SAMPLE_NBBYTES; i++) {
         samples[i] = secretBuffer[sampleIndex[i]];
     }
diff --git a/xxhash.h b/xxhash.h
index 2f7986c0..c80ce9ba 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1131,27 +1131,26 @@ XXH_PUBLIC_API XXH128_hash_t XXH128(const void* data, size_t len, XXH64_hash_t s
  * as it becomes much more difficult for an external actor to guess how to impact the calculation logic.
  *
  * The function accepts as input a custom seed of any length and any content,
- * and derives from it a high-entropy secret of length XXH3_SECRET_DEFAULT_SIZE
- * into an already allocated buffer secretBuffer.
- * The generated secret is _always_ XXH_SECRET_DEFAULT_SIZE bytes long.
+ * and derives from it a high-entropy secret of length @secretSize
+ * into an already allocated buffer @secretBuffer.
+ * @secretSize must be >= XXH3_SECRET_SIZE_MIN
  *
  * The generated secret can then be used with any `*_withSecret()` variant.
  * Functions `XXH3_128bits_withSecret()`, `XXH3_64bits_withSecret()`,
  * `XXH3_128bits_reset_withSecret()` and `XXH3_64bits_reset_withSecret()`
  * are part of this list. They all accept a `secret` parameter
- * which must be very long for implementation reasons (>= XXH3_SECRET_SIZE_MIN)
+ * which must be large enough for implementation reasons (>= XXH3_SECRET_SIZE_MIN)
  * _and_ feature very high entropy (consist of random-looking bytes).
  * These conditions can be a high bar to meet, so
- * this function can be used to generate a secret of proper quality.
+ * XXH3_generateSecret() can be employed to ensure proper quality.
  *
  * customSeed can be anything. It can have any size, even small ones,
- * and its content can be anything, even stupidly "low entropy" source such as a bunch of zeroes.
- * The resulting `secret` will nonetheless provide all expected qualities.
+ * and its content can be anything, even "poor entropy" sources such as a bunch of zeroes.
+ * The resulting `secret` will nonetheless provide all required qualities.
  *
- * Supplying NULL as the customSeed copies the default secret into `secretBuffer`.
  * When customSeedSize > 0, supplying NULL as customSeed is undefined behavior.
  */
-XXH_PUBLIC_API void XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSeedSize);
+XXH_PUBLIC_API XXH_errorcode XXH3_generateSecret(void* secretBuffer, size_t secretSize, const void* customSeed, size_t customSeedSize);
 
 
 /*
@@ -5504,47 +5503,44 @@ XXH128_hashFromCanonical(const XXH128_canonical_t* src)
  */
 #define XXH_MIN(x, y) (((x) > (y)) ? (y) : (x))
 
+static void XXH3_combine16(void* dst, XXH128_hash_t h128)
+{
+    XXH_writeLE64( dst, XXH_readLE64(dst) ^ h128.low64 );
+    XXH_writeLE64( (char*)dst+8, XXH_readLE64((char*)dst+8) ^ h128.high64 );
+}
+
 /*! @ingroup xxh3_family */
-XXH_PUBLIC_API void
-XXH3_generateSecret(void* secretBuffer, const void* customSeed, size_t customSeedSize)
+XXH_PUBLIC_API XXH_errorcode
+XXH3_generateSecret(void* secretBuffer, size_t secretSize, const void* customSeed, size_t customSeedSize)
 {
     XXH_ASSERT(secretBuffer != NULL);
+    XXH_ASSERT(secretSize >= XXH3_SECRET_SIZE_MIN);
     if (customSeedSize == 0) {
-        memcpy(secretBuffer, XXH3_kSecret, XXH_SECRET_DEFAULT_SIZE);
-        return;
+        customSeed = XXH3_kSecret;
+        customSeedSize = XXH_SECRET_DEFAULT_SIZE;
     }
     XXH_ASSERT(customSeed != NULL);
 
-    {   size_t const segmentSize = sizeof(XXH128_hash_t);
-        size_t const nbSegments = XXH_SECRET_DEFAULT_SIZE / segmentSize;
+    /* Fill secretBuffer with a copy of customSeed - repeat as needed */
+    {   size_t pos = 0;
+        while (pos < secretSize) {
+            size_t const toCopy = XXH_MIN((secretSize - pos), customSeedSize);
+            memcpy((char*)secretBuffer + pos, customSeed, toCopy);
+            pos += toCopy;
+    }   }
+
+    {   size_t const nbSeg16 = secretSize / 16;
+        size_t n;
         XXH128_canonical_t scrambler;
-        XXH64_hash_t seeds[12];
-        size_t segnb;
-        XXH_ASSERT(nbSegments == 12);
-        XXH_ASSERT(segmentSize * nbSegments == XXH_SECRET_DEFAULT_SIZE); /* exact multiple */
         XXH128_canonicalFromHash(&scrambler, XXH128(customSeed, customSeedSize, 0));
-
-        /*
-        * Copy customSeed to seeds[], truncating or repeating as necessary.
-        */
-        {   size_t toFill = XXH_MIN(customSeedSize, sizeof(seeds));
-            size_t filled = toFill;
-            memcpy(seeds, customSeed, toFill);
-            while (filled < sizeof(seeds)) {
-                toFill = XXH_MIN(filled, sizeof(seeds) - filled);
-                memcpy((char*)seeds + filled, seeds, toFill);
-                filled += toFill;
-        }   }
-
-        /* generate secret */
-        memcpy(secretBuffer, &scrambler, sizeof(scrambler));
-        for (segnb=1; segnb < nbSegments; segnb++) {
-            size_t const segmentStart = segnb * segmentSize;
-            XXH128_canonical_t segment;
-            XXH128_canonicalFromHash(&segment,
-                XXH128(&scrambler, sizeof(scrambler), XXH_readLE64(seeds + segnb) + segnb) );
-            memcpy((char*)secretBuffer + segmentStart, &segment, sizeof(segment));
-    }   }
+        for (n=0; n<nbSeg16; n++) {
+            XXH128_hash_t const h128 = XXH128(&scrambler, sizeof(scrambler), n);
+            XXH3_combine16((char*)secretBuffer + n*16, h128);
+        }
+        /* last segment */
+        XXH3_combine16((char*)secretBuffer + secretSize - 16, XXH128_hashFromCanonical(&scrambler));
+    }
+    return XXH_OK;
 }
 
 /*! @ingroup xxh3_family */

From a0e9159bdc01f6175d4290737a21e5f4471912ed Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 20:17:57 -0800
Subject: [PATCH 183/187] more generator tests

with multiple secret lengths
---
 cli/xsum_sanity_check.c | 28 ++++++++++++++++------------
 1 file changed, 16 insertions(+), 12 deletions(-)

diff --git a/cli/xsum_sanity_check.c b/cli/xsum_sanity_check.c
index 5b268220..737747d5 100644
--- a/cli/xsum_sanity_check.c
+++ b/cli/xsum_sanity_check.c
@@ -90,9 +90,10 @@ typedef struct {
     XXH128_hash_t Nresult;
 } XSUM_testdata128_t;
 
-#define SECRET_SAMPLE_NBBYTES 4
+#define SECRET_SAMPLE_NBBYTES 5
 typedef struct {
-    XSUM_U32 len;
+    XSUM_U32 seedLen;
+    XSUM_U32 secretLen;
     XSUM_U8 byte[SECRET_SAMPLE_NBBYTES];
 } XSUM_testdata_sample_t;
 
@@ -213,11 +214,12 @@ static const XSUM_testdata128_t XSUM_XXH128_withSecret_testdata[] = {
     { 12, 0, { 0xAF82F6EBA263D7D8ULL, 0x90A3C2D839F57D0FULL } }   /*  9 - 16 */
 };
 
+#define SECRET_SIZE_MAX 9867
 static const XSUM_testdata_sample_t XSUM_XXH3_generateSecret_testdata[] = {
-    {                              0, { 0xE7, 0x8C, 0x77, 0x77 } },
-    {                              1, { 0x2B, 0x3E, 0xDE, 0x67 } },
-    {     XXH3_SECRET_SIZE_MIN -   1, { 0xE8, 0x39, 0x6C, 0x16 } },
-    { XXH3_SECRET_DEFAULT_SIZE + 500, { 0xD6, 0x1C, 0x41, 0x69 } }
+    {                              0, 192, { 0xE7, 0x8C, 0x77, 0x77, 0x00 } },
+    {                              1, 240, { 0x2B, 0x3E, 0xDE, 0xC1, 0x00 } },
+    {     XXH3_SECRET_SIZE_MIN -   1, 277, { 0xE8, 0x39, 0x6C, 0xCC, 0x7B } },
+    { XXH3_SECRET_DEFAULT_SIZE + 500, SECRET_SIZE_MAX, { 0xD6, 0x1C, 0x41, 0x17, 0xB3 } }
 };
 
 static void XSUM_checkResult32(XXH32_hash_t r1, XXH32_hash_t r2)
@@ -616,20 +618,21 @@ static void XSUM_testXXH128_withSecret(const void* data, const void* secret, siz
 static void XSUM_testSecretGenerator(const void* customSeed, const XSUM_testdata_sample_t* testData)
 {
     static int nbTests = 1;
-    const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191};  /* position of sampled bytes */
-    XSUM_U8 secretBuffer[XXH3_SECRET_DEFAULT_SIZE] = {0};
+    const int sampleIndex[SECRET_SAMPLE_NBBYTES] = { 0, 62, 131, 191, 241 };  /* position of sampled bytes */
+    XSUM_U8 secretBuffer[SECRET_SIZE_MAX] = {0};
     XSUM_U8 samples[SECRET_SAMPLE_NBBYTES];
     int i;
 
-    XXH3_generateSecret(secretBuffer, sizeof(secretBuffer), customSeed, testData->len);
+    assert(testData->secretLen <= SECRET_SIZE_MAX);
+    XXH3_generateSecret(secretBuffer, testData->secretLen, customSeed, testData->seedLen);
     for (i=0; i<SECRET_SAMPLE_NBBYTES; i++) {
         samples[i] = secretBuffer[sampleIndex[i]];
     }
     if (memcmp(samples, testData->byte, sizeof(testData->byte))) {
         XSUM_log("\rError: Secret generation test %i: Internal sanity check failed. \n", nbTests);
-        XSUM_log("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
-                samples[0], samples[1], samples[2], samples[3],
-                testData->byte[0], testData->byte[1], testData->byte[2], testData->byte[3] );
+        XSUM_log("\rGot { 0x%02X, 0x%02X, 0x%02X, 0x%02X, 0x%02X }, expected { 0x%02X, 0x%02X, 0x%02X, 0x%02X, 0x%02X } \n",
+                samples[0], samples[1], samples[2], samples[3], samples[4],
+                testData->byte[0], testData->byte[1], testData->byte[2], testData->byte[3], testData->byte[4] );
         exit(1);
     }
     nbTests++;
@@ -678,6 +681,7 @@ XSUM_API void XSUM_sanityCheck(void)
     }
     /* secret generator */
     for (i = 0; i < (sizeof(XSUM_XXH3_generateSecret_testdata)/sizeof(XSUM_XXH3_generateSecret_testdata[0])); i++) {
+        assert(XSUM_XXH3_generateSecret_testdata[i].seedLen <= SANITY_BUFFER_SIZE);
         XSUM_testSecretGenerator(sanityBuffer, &XSUM_XXH3_generateSecret_testdata[i]);
     }
 

From f3245381f0436894a33fa27cde82855623e9fc6c Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 20:21:16 -0800
Subject: [PATCH 184/187] properly return error code in case of incorrect usage
 of the generator's interface

---
 xxhash.h | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/xxhash.h b/xxhash.h
index c80ce9ba..93d34e49 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -5514,12 +5514,15 @@ XXH_PUBLIC_API XXH_errorcode
 XXH3_generateSecret(void* secretBuffer, size_t secretSize, const void* customSeed, size_t customSeedSize)
 {
     XXH_ASSERT(secretBuffer != NULL);
+    if (secretBuffer == NULL) return XXH_ERROR;
     XXH_ASSERT(secretSize >= XXH3_SECRET_SIZE_MIN);
+    if (secretSize < XXH3_SECRET_SIZE_MIN) return XXH_ERROR;
     if (customSeedSize == 0) {
         customSeed = XXH3_kSecret;
         customSeedSize = XXH_SECRET_DEFAULT_SIZE;
     }
     XXH_ASSERT(customSeed != NULL);
+    if (customSeed == NULL) return XXH_ERROR;
 
     /* Fill secretBuffer with a copy of customSeed - repeat as needed */
     {   size_t pos = 0;

From cb696886cb7563e31ff826a608760c6632f87aa4 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 21:17:31 -0800
Subject: [PATCH 185/187] update changelog, in preparation for v0.8.1 release

---
 CHANGELOG | 18 ++++++++++++++++++
 1 file changed, 18 insertions(+)

diff --git a/CHANGELOG b/CHANGELOG
index 23870756..40bfcffe 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -1,3 +1,21 @@
+v0.8.1
+- perf : much improved performance for XXH3 streaming variants, notably on gcc and msvc
+- perf : improved XXH64 speed and latency on small inputs
+- perf : small XXH32 speed and latency improvement on small inputs of random size
+- perf : minor stack usage improvement for XXH32 and XXH64
+- api  : new experimental variants XXH3_*_withSecretandSeed()
+- api  : update XXH3_generateSecret(), can no generate secret of any size (>= XXH3_SECRET_SIZE_MIN)
+- cli  : xxhsum can now generate and check XXH3 checksums, using command `-H3`
+- build: can build xxhash without XXH3, with new build macro XXH_NO_XXH3
+- build: fix xxh_x86dispatch build with MSVC, by @apankrat
+- build: XXH_INLINE_ALL can always be used safely, even after XXH_NAMESPACE or a previous XXH_INLINE_ALL
+- install: fix pkgconfig, by @ellert
+- install: compatibility with Haiku, by @Begasus
+- doc  : code comments made compatible with doxygen, by @easyaspi314
+- misc : XXH_ACCEPT_NULL_INPUT_POINTER is no longer necessary, all functions can be accept NULL input pointers, as long as size == 0
+- misc : complete refactor of CI tests on Github Actions, offering much larger coverage, by @t-mat
+- misc : xxhsum code base split into multiple specialized units, within directory cli/, by @easyaspi314
+
 v0.8.0
 - api : stabilize XXH3
 - cli : xxhsum can parse BSD-style --check lines, by @WayneD

From be4bdb9ce2c97f670645eaad973974b5e42db4ad Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Sun, 28 Nov 2021 21:24:11 -0800
Subject: [PATCH 186/187] further minor changelog update

---
 CHANGELOG | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/CHANGELOG b/CHANGELOG
index 40bfcffe..ff59d8bb 100644
--- a/CHANGELOG
+++ b/CHANGELOG
@@ -9,10 +9,11 @@ v0.8.1
 - build: can build xxhash without XXH3, with new build macro XXH_NO_XXH3
 - build: fix xxh_x86dispatch build with MSVC, by @apankrat
 - build: XXH_INLINE_ALL can always be used safely, even after XXH_NAMESPACE or a previous XXH_INLINE_ALL
+- build: improved PPC64LE vector support, by @mpe
 - install: fix pkgconfig, by @ellert
 - install: compatibility with Haiku, by @Begasus
 - doc  : code comments made compatible with doxygen, by @easyaspi314
-- misc : XXH_ACCEPT_NULL_INPUT_POINTER is no longer necessary, all functions can be accept NULL input pointers, as long as size == 0
+- misc : XXH_ACCEPT_NULL_INPUT_POINTER is no longer necessary, all functions can accept NULL input pointers, as long as size == 0
 - misc : complete refactor of CI tests on Github Actions, offering much larger coverage, by @t-mat
 - misc : xxhsum code base split into multiple specialized units, within directory cli/, by @easyaspi314
 

From f2416f4d1cddfc8399dce31ebca14e8d7713a4e1 Mon Sep 17 00:00:00 2001
From: Yann Collet <cyan@fb.com>
Date: Mon, 29 Nov 2021 00:17:16 -0800
Subject: [PATCH 187/187] changed links

---
 xxhash.h | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/xxhash.h b/xxhash.h
index 93d34e49..08ab7945 100644
--- a/xxhash.h
+++ b/xxhash.h
@@ -1319,7 +1319,7 @@ XXH3_128bits_reset_withSecretandSeed(XXH3_state_t* statePtr,
  *   care, as what works on one compiler/platform/optimization level may cause
  *   another to read garbage data or even crash.
  *
- * See https://stackoverflow.com/a/32095106/646947 for details.
+ * See http://fastcompression.blogspot.com/2015/08/accessing-unaligned-memory.html for details.
  *
  * Prefer these methods in priority order (0 > 3 > 1 > 2)
  */
@@ -1683,7 +1683,7 @@ static xxh_u32 XXH_read32(const void* ptr)
 
 /*
  * Portable and safe solution. Generally efficient.
- * see: https://stackoverflow.com/a/32095106/646947
+ * see: http://fastcompression.blogspot.com/2015/08/accessing-unaligned-memory.html
  */
 static xxh_u32 XXH_read32(const void* memPtr)
 {
@@ -2326,7 +2326,7 @@ static xxh_u64 XXH_read64(const void* ptr)
 
 /*
  * Portable and safe solution. Generally efficient.
- * see: https://stackoverflow.com/a/32095106/646947
+ * see: http://fastcompression.blogspot.com/2015/08/accessing-unaligned-memory.html
  */
 static xxh_u64 XXH_read64(const void* memPtr)
 {