diff --git a/BUILD-FreeBSD.md b/BUILD-FreeBSD.md
index ec11395..530a673 100644
--- a/BUILD-FreeBSD.md
+++ b/BUILD-FreeBSD.md
@@ -1,6 +1,7 @@
-### FreeBSD 14.1
+# FreeBSD 14.1
+
+## Install Dependencies
 
-#### Install Dependencies
 ```bash
 # as root
 pkg install bash getopt cmake ccmake ccache git ruby ruby32-gems rubygem-rake curl libevent onetbb
diff --git a/BUILD-Windows.md b/BUILD-Windows.md
index 36d1f21..ec0c514 100644
--- a/BUILD-Windows.md
+++ b/BUILD-Windows.md
@@ -1,4 +1,4 @@
-# Build with MSYS2 / mingw 
+# Build with MSYS2 / mingw
 
 Download and install MSYS2 with all default settings
 https://www.msys2.org/
@@ -25,17 +25,16 @@ and continue with main README...
 ## Running executables / loading shared libraries
 
 By default this project builds with as many static libraries as
-possible, but you still need to ensure the remaining dynbamic ones can
+possible, but you still need to ensure the remaining dynamic ones can
 be found. Note that `PATH` is sensibly configured within the MSYS2
 environment, and executables should just run trivially there. But the
 same will not necessarily be true in the Windows `cmd` console, or
 when launching from File Explorer, etc.
 
 To be able to run the programs in the build directory from the `cmd`
-(windows shell) you need to put `C:\msys64\ucrt64\bin`into your `PATH`
+(windows shell) you need to put `C:\msys64\ucrt64\bin` into your `PATH`
 environment variable. You can do that in the usual place in the
 Windows GUI.
 
-Any problems and you can check what each program is loading with 
+If you have any problems, you can check what each program is loading with
 `ldd program_name` from within the MSYS2 console.
-
diff --git a/BUILD-linux.md b/BUILD-linux.md
index c4a6a13..f5ec9c3 100644
--- a/BUILD-linux.md
+++ b/BUILD-linux.md
@@ -1,15 +1,18 @@
-### Recent Linux systems from the last 5 years
+# Building on Linux
 
-Explcitly tested on:
+## Recent Linux systems from the last 5 years
+
+Explicitly tested on:
 Debian 11 & 12, Ubuntu 20.04LTS, 22.04LTS & 24.04LTS.
 
 On rpm system, the package names may vary slightly. The only runtime
 dependencies are libcurl and libevent (plus libtbb if you compile with
-`-DHIBP_WITH_PSTL`.
+`-DHIBP_WITH_PSTL`).
+
+## Install Dependencies
 
-#### Install Dependencies
 ```bash
-sudo apt install build-essential cmake ninja-build ccache git libcurl4-openssl-dev libevent-dev ruby libtbb-dev
+sudo apt install build-essential cmake curl ninja-build ccache git libcurl4-openssl-dev libevent-dev ruby libtbb-dev
 git clone https://github.com/oschonrock/hibp.git
 cd hibp
 git submodule update --init --recursive
@@ -18,8 +21,6 @@ sudo gem install Mxx_ru
 # install ruby gem required for restinio dependency
 in mxxruexternals # install those deps
 cd ../..
-# optional: for compilng with clang also:
+# optional: for compiling with clang also:
 sudo apt install clang gcc-14 g++-14 # need gcc-14 because clang tries to use its stdlibc++ version
-
 ```
-
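+
+With clang installed, you can then select it via the top-level
+`build.sh` script described in the main README (a sketch: `-c clang`
+mirrors the documented `-c gcc` usage and is an assumption):
+
+```bash
+./build.sh -c clang -b release
+```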
diff --git a/README.md b/README.md
index 3e28356..1107257 100644
--- a/README.md
+++ b/README.md
@@ -9,12 +9,12 @@
 
 ## Intro
 
->Have I Been Pwned? (HIBP) is a public resource that allows
->Internet users to check whether their personal data has been
->compromised by data breaches. The service collects and analyzes
->hundreds of database dumps and pastes containing information about
->billions of leaked accounts...
->
+> Have I Been Pwned? (HIBP) is a public resource that allows
+> Internet users to check whether their personal data has been
+> compromised by data breaches. The service collects and analyzes
+> hundreds of database dumps and pastes containing information about
+> billions of leaked accounts...
+>
 > It makes available a database of password which have been
 > compromised in previous data breaches from all over the globe. There
 > are currently (Dec 2024) just shy of 1 billion unique passwords in
@@ -24,12 +24,12 @@
 High performance *downloader, query tool, server and utilities*
 
 This very useful database is somewhat challenging to use locally
-because of its sheer size. These utiliities make it easy and fast to
-deal with the large data volume while being very effiicent on disk and
-memory resouces.
+because of its sheer size. These utilities make it easy and fast to
+deal with the large data volume while being very efficient on disk and
+memory resources.
 
 Here is `hibp-download` running on a 400Mbit/s connection, averaging
-~48MB/s which is very close to the theorectical maximum. At this
+~48MB/s which is very close to the theoretical maximum. At this
 network speed, a download of the entire HIBP database, including
 prefixing and joining the over 1 million files, converting to our
 binary format and writing to disk, takes *under 12 minutes* (the
@@ -37,18 +37,18 @@ official downloader takes over an hour on the same connection).
 
 ![](https://github.com/oschonrock/hibp/blob/main/media/download.gif)
 
-On a full 1Gbit/s connection this take *around 6 minutes*, as shown here under Windows:
+On a full 1Gbit/s connection this takes *around 6 minutes*, as shown here under Windows:
 
 ![](https://github.com/oschonrock/hibp/blob/main/media/windows_download.png)
 
 ## Quick start - Linux amd64 .deb systems (for others see below!)
 
-#### Install
+### Install
 
 [Download latest .deb and install](https://github.com/oschonrock/hibp/releases/latest)
 
 These .deb releases are compatible with recent'ish debian derived
-distrubutions on amd64 platforms. They are explicitly tested on:
+distributions on amd64 platforms. They are explicitly tested on:
 Debian 11 & 12, Ubuntu 20.04LTS, 22.04LTS & 24.04LTS.
 
 ```bash
@@ -56,7 +56,7 @@ wget -q https://github.com/oschonrock/hibp/releases/download/v0.6.1/hibp_0.6.1-1
 sudo apt install ./hibp_0.6.1-1_amd64.deb # will install minimal dependencies (eg `libevent`)
 ```
 
-#### Usage
+### Usage
 
 Download "Have I been pawned" database. 38GB download, uses 21GB of
 disk space and takes ~6/12 minutes on 1Gbit/400Mbit
@@ -82,11 +82,11 @@ curl http://localhost:8082/check/plain/password123
 curl http://localhost:8082/check/sha1/CBFDAC6008F9CAB4083784CBD1874F76618D2A97
 ```
 
-The output will be the number of times that pasword has appeared in
+The output will be the number of times that password has appeared in
 leaks. Integrate this into your signup and login processes to show
-warnings to the user that they using a compromised password.
+warnings to the users that they are using a compromised password.
 
-For production, make this server a proper "autostart service" on your distribution. 
+For production, make this server a proper "autostart service" on your distribution.
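+
+As an illustration of that integration, a signup handler can hash the
+candidate password itself and query the sha1 endpoint, so the
+plaintext never travels further than localhost (a minimal sketch; the
+bare-count response is as shown above, the rejection policy is yours):
+
+```bash
+pw='candidate-password'   # hypothetical user input
+hash=$(printf '%s' "$pw" | sha1sum | awk '{print toupper($1)}')
+count=$(curl -s "http://localhost:8082/check/sha1/${hash}")
+if [ "${count:-0}" -gt 0 ]; then
+  echo "warning: this password has appeared in ${count} known leaks"
+fi
+```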
 
 ### Binary fuse filters
 
@@ -97,23 +97,28 @@ then you should consider the binfuse formats.
 
 ```bash
 hibp-download --binfuse16-out hibp_binfuse16.bin
 ```
-This converts the 37GB download into an immutable 2GB [binary fuse16 filter](https://github.com/oschonrock/binfuse). 
+
+This converts the 37GB download into an immutable 2GB [binary fuse16 filter](https://github.com/oschonrock/binfuse).
+Alternatively `--binfuse8-out` produces a 1GB file with a higher false
+positive rate (see [format comparison](https://github.com/oschonrock/hibp?tab=readme-ov-file#design-high-performance-with-a-small-memory-disk-and-cpu-footprint)).
 
 and then run a server
+
 ```bash
 hibp-server --binfuse16-filter=hibp_binfuse16.bin
 ```
+
 and then query with plain or 64bit hashed passwords
+
 ```bash
 curl http://localhost:8082/check/plain/password123
 curl http://localhost:8082/check/binfuse16/CBFDAC6008F9CAB4
 ```
 
-#### Uninstall
+### Uninstall
 
 To remove the package:
+
 ```bash
 sudo apt remove hibp
 ```
@@ -126,7 +131,7 @@ the data.
 The orginal text records are about 45 bytes per password
 record, our binary format is 24 bytes, so storage requirements are
 almost halved (21GB currently).
 
-*If you don't like the binary format, you can always ouput the
+*If you don't like the binary format, you can always output the
 conventional text version as well.*
 
 In the binary format each record is a fixed width, and the records are
@@ -135,24 +140,24 @@ because we can use random access binary search.
 There is an additional "table of contents" feature (see `--toc`below)
 to reduce disk access further at the expense of only 4MB of memory.
 
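+As a quick sanity check of that layout you can hex-dump the first
+record (a sketch; reading the 24 bytes as a 20-byte sha1 followed by
+a 4-byte count is an assumption consistent with the sizes above, not
+a documented guarantee):
+
+```bash
+# dump the first 24-byte record, one byte per group
+xxd -l 24 -g 1 hibp_all.sha1.bin
+```
+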
-The system of uitlities supportes multiple storage formats for the
+The system of utilities supports multiple storage formats for the
 password db. These each have different advantages - figures based on
 an ~1billion record pawned password DB from Dec 2024.
 
-| format         | download | storage | false +ve rate   | search strategy                                              | queries/s     | count  |
-|----------------|----------|---------|------------------|--------------------------------------------------------------|---------------|--------|
-| text           | 37GB     | 37GB    | 1 / 2^160        | need external tool                                           | >25[^1]       | avail  |
-| binary sha1    | 37GB     | 21GB    | 1 / 2^160        | binary search                                                | >1,000[^2]    | avail  |
-| binary ntlm    | 32GB     | 18GB    | 1 / 2^128        | binary search                                                | >1,000[^2]    | avail  |
-| binary sha1t64 | 37GB     | 11GB    | 1 / 2^64         | binary search                                                | >1,000[^2]    | avail  |
-| binfuse16      | 37GB     | 2GB     | 1 / 2^16         | [binary fuse filter](https://github.com/oschonrock/binfuse)  | >100,000[^3]  | NA     |
-| binfuse8       | 37GB     | 1GB     | 1 / 2^8          | [binary fuse filter](https://github.com/oschonrock/binfuse)  | >100,000[^3]  | NA     |
+| format         | download | storage | false +ve rate   | search strategy                                              | queries/s     | count |
+|----------------|----------|---------|------------------|--------------------------------------------------------------|---------------|-------|
+| text           | 37GB     | 37GB    | 1 / 2^160        | need external tool                                           | >25[^1]       | avail |
+| binary sha1    | 37GB     | 21GB    | 1 / 2^160        | binary search                                                | >1,000[^2]    | avail |
+| binary ntlm    | 32GB     | 18GB    | 1 / 2^128        | binary search                                                | >1,000[^2]    | avail |
+| binary sha1t64 | 37GB     | 11GB    | 1 / 2^64         | binary search                                                | >1,000[^2]    | avail |
+| binfuse16      | 37GB     | 2GB     | 1 / 2^16         | [binary fuse filter](https://github.com/oschonrock/binfuse)  | >100,000[^3]  | NA    |
+| binfuse8       | 37GB     | 1GB     | 1 / 2^8          | [binary fuse filter](https://github.com/oschonrock/binfuse)  | >100,000[^3]  | NA    |
 
-[^1]: when inserted into MariaDB table using aria engine and primary index on the hash in binary
+[^1]: when inserted into a table in MariaDB using aria storage engine and primary index on the hash in binary
 [^2]: performance can be increased up to 3x with `--toc` and another 5x with `--threads`
 [^3]: uses `mmap` for access, so performance is heavily dependent on available RAM, relative to storage size
 
-The local http server component is both multi threaded and event loop
+The local http server component is both multi-threaded and event loop
 driven for high efficiency. Even in a minimal configuration it should
 be more than sufficient to back almost any site, at over 1,000req/s on
 a single core.
@@ -172,7 +177,7 @@ problems on many platforms.
 gcc >= 10 and clang >= 11 are tested on several `.deb` and `.rpm`
 based systems, under FreeBSD and under Windows (MSYS2/mingw).
 
-You will likely also suceed with minimal platforms like the
+You will likely also succeed with minimal platforms like the
 raspberry-pi.
 
 ### Installing build dependencies
@@ -184,6 +189,7 @@ refer to OS specific instructions
 - [Windows](BUILD-Windows.md)
 
 ### Compile for release
+
 ```bash
 ./build.sh -c gcc -b release
 ```
@@ -192,25 +198,26 @@
 ### Run full download: `hibp-download`
 
-Program will download the currently ~38GB of data, containing 1
-million 30-40kB text files from api.haveibeenpawned.com It does this
+Program will download the currently ~38GB of data, containing
+1 million 30-40kB text files from api.haveibeenpwned.com. It does this
 using `libcurl` with `curl_multi` and 300 parallel requests
 (adjustable) on a single thread. A second thread does the conversion
 to binary format and writing to disk.
 
 *Warning* this will (currently) take just around 6mins on a 1Gb/s
 connection and consume ~21GB of disk space during this time:
+
 - your network connection will be saturated with HTTP2 multiplexed
   requests
 - `top` in threads mode (key `H`) should show 2 `hibp-download` threads.
-- One "curl thread" with ~50-80% CPU and
-- The "main thread" with ~15-30% CPU, primarily converting data to
+- One "curl" thread with ~50-80% CPU and
+- The "main" thread with ~15-30% CPU, primarily converting data to
   binary and writing to disk
 
 ```bash
 ./build/gcc/release/hibp-download hibp_all.sha1.bin
 ```
 
-If any transfer fails, even after 5 retries, the programme will
+If any transfer fails, even after 5 retries, the program will
 abort. In this case, you can try rerunning with `--resume`.
 
 For all options run `hibp-download --help`.
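+
+For example, after an aborted run, pointing `--resume` at the same
+output file should continue rather than restart (a sketch of the
+rerun):
+
+```bash
+./build/gcc/release/hibp-download --resume hibp_all.sha1.bin
+```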
@@ -234,20 +241,24 @@ Performance will be mainly down to your disk and be 5-8ms per
 uncached query, and <0.3ms cached. See below for further improved
 performance with `--toc`
 
-### NT Hash (AKA NTLM)
+### NT Hash (aka NTLM)
 
 The compromised password database is also available using the NTLM
-hash, rather than sha1. This may be useful if auditing local
+hash, rather than SHA-1. This may be useful if auditing local
 Windows server authentication systems.
 
 ```bash
 ./build/gcc/release/hibp-download --ntlm hibp_all.ntlm.bin
 ```
+
 and then search
+
 ```bash
 ./build/gcc/release/hibp-search --ntlm hibp_all.ntlm.bin password
 ```
-or maybe "by hash" rather than plaintext password? 
+
+or maybe "by hash" rather than plaintext password?
+
 ```bash
 ./build/gcc/release/hibp-search --ntlm hibp_all.ntlm.bin --hash 00000011059407D743D40689940F858C
 ```
@@ -255,7 +266,7 @@
 ### Running a local server: `hibp-server`
 
 You can run a high performance server for "pawned password queries" as
-follows. This is a simple "REST" server using the "restinio" library. 
+follows. This is a simple "REST" server using the "restinio" library.
 The server process consumes <5MB of resident memory.
 
 ```bash
@@ -265,7 +276,7 @@ curl http://localhost:8082/check/plain/password
 # output should be:
 10434004
 
-#if you pass --json to the server you will get
+# if you pass --json to the server you will get
 {count:10434004}
 
 # if you feel more secure sha1 hashing the password in your client, you
@@ -273,28 +284,28 @@ curl http://localhost:8082/check/plain/password
 curl http://localhost:8082/check/sha1/5BAA61E4C9B93F3F0682250B6CF8331B7EE68FD8
 10434004
-
 ```
 
 For all options run `hibp-server --help`.
 
 #### Basic performance evaluation using apache bench
 
-Run server like this (--perf-test will uniquefy change the password for each request)
-```
+Run server like this (--perf-test will generate a unique password for each request)
+
+```bash
 ./build/gcc/release/hibp-server data/hibp_all.bin --perf-test
 ```
 
 And run apache bench like this (generating a somewhat random password to start with):
 
-```
+```bash
 hash=$(date | sha1sum); ab -c100 -n10000 "http://localhost:8082/check/plain/${hash:0:10}"
 ```
 
 These are the key figures from a short run on an old i5-3470 CPU @
 3.20GHz with 3 threads (one thread is consumed running `ab`).
 
-```
+```text
 Requests per second:    3166.96 [#/sec] (mean)
 Time per request:       31.576 [ms] (mean)
 Time per request:       0.316 [ms] (mean, across all concurrent requests)
@@ -303,11 +314,11 @@ Time per request:       0.316 [ms] (mean, across all concurrent requests)
 
 This should be more than enough for almost any site, in fact you may
 want to reduce the server to just one thread like so:
 
-```
+```bash
 ./build/gcc/release/hibp-server data/hibp_all.bin --perf-test --threads=1
 ```
 
-```
+```bash
 hash=$(date | sha1sum); ab -c25 -n10000 "http://localhost:8082/check/plain/${hash:0:10}"
 
 Requests per second:    1017.17 [#/sec] (mean)
 Time per request:       0.983 [ms] (mean, across all concurrent requests)
@@ -317,8 +328,8 @@
 
 #### Enhanced performance for constrained devices: `--toc`
 
-If you are runnning this database on a constrained device, with
-limited free RAM or a slow disk, You may want to try using the "table
+If you are running this database on a constrained device, with
+limited free RAM or a slow disk, you may want to try using the "table
 of contents" features, which builds an index into the "chapters" of
 the database and then holds this index in memory.
@@ -329,19 +340,19 @@ tunable):
 
 ![](https://github.com/oschonrock/hibp/blob/main/media/toc.png)
 
-`--toc` is available on the `hibp-search` test utility, and the `hibp-server`. 
+`--toc` is available on the `hibp-search` test utility, and the `hibp-server`.
 
-The first run with `--toc` builds the index, which takes about 1
-minute, depending on your sequential disk speed. `hibp-search` shows
+The first run with `--toc` builds the index, which takes about
+1 minute, depending on your sequential disk speed. `hibp-search` shows
 that completely uncached queries *reduce from 5-8ms to just 0.7ms*.
 
-### Saving further diskspace: sha1t64 
+### Saving further diskspace: sha1t64
 
 We can also store the sha1 database with the hashes truncated to
-64bits, AKA sha1t64. `hibp-download`, `hibp-search`, `hibp-server` and
+64bits, aka sha1t64. `hibp-download`, `hibp-search`, `hibp-server` and
 `hibp-topn` support this format.
 
-There are no "hash collisions", at 64bit truncated, in the current
+There are no "hash collisions" at 64bit truncation in the current
 dataset and the probability of a random password hitting a 64bit
 "hash collision", when actually its sha1 was different is about
 1/10^10, so basically negligible.
@@ -352,15 +363,21 @@
 performance is improved by 40-50%.
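+
+That ~1/10^10 figure is just the number of stored hashes divided by
+the size of the 64bit space, which you can reproduce with `bc`
+(assuming ~10^9 records):
+
+```bash
+echo 'scale=15; 10^9 / 2^64' | bc -l
+# prints .000000000054210, i.e. roughly 1 in 2x10^10
+```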
 
 ```bash
 hibp-download --sha1t64 hibp_all.sha1t64.bin
 ```
+
 and then search
+
 ```bash
 hibp-search --sha1t64 hibp_all.sha1t64.bin password
 ```
-or maybe "by hash" rather than plaintext password? 
+
+or maybe "by hash" rather than plaintext password?
+
 ```bash
 hibp-search --sha1t64 hibp_all.sha1t64.bin --hash 00001131628B741F
 ```
+
 or run a server
+
 ```bash
 hibp-server --sha1t64-db hibp_all.sha1t64.bin
 ```
@@ -383,11 +400,11 @@ It's just a convenience wrapper around `cmake`, mainly to select
 You can use `./build.sh --verbose` to see how `./build.sh` is invoking
 `cmake` (as well as making `cmake` verbose).
 
-#### The "ultimate" server
+### The "ultimate" server
 
 Maybe you want to serve plaintext, sha1 and ntlm at the same time,
-while taking advantage extra of `--toc` performance, and also offering
-an extra fast option for probbabilistic results with a binfuse16
+while taking advantage of the extra `--toc` performance, and also offering
+an extra fast option for probabilistic results with a binfuse16
 filter. Here is the full commands script for that, assuming the
 programs are on your `PATH` for brevity:
 
@@ -396,13 +413,14 @@ hibp-download --sha1 hibp_all.sha1.bin
 hibp-download --ntlm hibp_all.ntlm.bin
 
 hibp-server \
-    --sha1-db=hibp_all.sha1.bin \
-    --ntlm-db=hibp_all.ntlm.bin \
-    --toc --binfuse16-filter=hibp_binfuse16.bin
+  --sha1-db=hibp_all.sha1.bin \
+  --ntlm-db=hibp_all.ntlm.bin \
+  --toc --binfuse16-filter=hibp_binfuse16.bin
 ```
 
 Output:
+
 ```text
 Make a request to any of:
 http://localhost:8082/check/plain/password123 [using sha1 db]
 http://localhost:8082/check/sha1/CBFDAC6008F9CAB4083784CBD1874F76618D2A97
@@ -421,7 +439,7 @@ hibp-server --sha1-db=hibp_topn.sha1.bin --ntlm-db=hibp_topn.ntlm.bin --toc
 ```
 
 You can now remove the really big files, if the top 50million entries
-is enough for you.
+are enough for you.
 
 ## Running tests
 
@@ -429,15 +447,16 @@
 There is a significant set of unit, integration and system tests -
 although not 100% coverage at this point.
 
 You can run them with one of these options:
+
 - from the `./build.sh` convenience script with `--run-tests`
-- by using `ccmake` to set `HIBP_TEST=ON` 
+- by using `ccmake` to set `HIBP_TEST=ON`
 - by passing `-DHIBP_TEST=ON` to cmake directly
 
 ## Why are you using http (no TLS)?
 
 The main intention is for this be a local server, binding to
 `localhost` only, and thats the default behaviour. There is no request
-logging, so `http` is a secure and simple architecture. 
+logging, so `http` is a secure and simple architecture.
 
 Of course, if you want to serve beyond localhost, you **should
 definitely** either *use a reverse proxy* in front of hibp-server, or
@@ -445,34 +464,35 @@ modify `app/hibp-server.cpp` and *recompile with TLS support*.
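+
+If you do use a reverse proxy, TLS termination in front of the server
+might look like this (a sketch only: nginx, the hostname and the
+certificate paths are all assumptions; only port 8082 comes from this
+README):
+
+```bash
+sudo tee /etc/nginx/conf.d/hibp.conf >/dev/null <<'EOF'
+server {
+    listen 443 ssl;
+    server_name pwcheck.example.com;                  # hypothetical host
+    ssl_certificate     /etc/ssl/certs/pwcheck.pem;   # your certificate
+    ssl_certificate_key /etc/ssl/private/pwcheck.key; # your key
+    location / {
+        proxy_pass http://127.0.0.1:8082;             # local hibp-server
+    }
+}
+EOF
+sudo nginx -t && sudo nginx -s reload
+```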
 
 ## Under the hood
 
-These utilities are written in C++ and centre around a `flat_file` class to model the db. 
-- multi threaded concurrency and parallelism is used in
-  `hibp-server`, `hibp-sort` and `hibp-topn`.
+These utilities are written in C++ and centre around a `flat_file` class to model the db.
+
+- multi-threaded concurrency and parallelism is used in `hibp-download`,
+  `hibp-server`, `hibp-sort` and `hibp-topn`.
 - `libcurl`, `libevent` are used for the highly concurrent download
 - `restinio` is used for the local server, based on `ASIO` for
   efficient concurrency
 - [`arrcmp`](https://github.com/oschonrock/arrcmp) is used as a high
-  performance, compile time optimised replacement for `memcmp`, which
+  performance, compile-time optimised replacement for `memcmp`, which
   makes agressive use of your CPU's vector instructions
-- libtbb is used for local sorting in `hibp-sort` and
-  `hibp-topn`. Note that for the parallelism (ie PSTL using libtbb)
-  you currently have to compile from source, but this only has a
-  small effect on `hibp-sort` and `hibp-topn`. And due to
-  portability annoyances and a bug in libstd++, this is disabled by
-  default, and you need to turn `HIBP_WITH_PSTL=ON` to use it.
+- libtbb is used for local sorting in `hibp-sort` and `hibp-topn`.
+  Note that for the parallelism (i.e. PSTL using libtbb) you currently
+  have to compile from source, but this only has a small effect
+  on `hibp-sort` and `hibp-topn`. And due to portability annoyances
+  and a bug in libstdc++, this is disabled by default, and you need
+  to set `HIBP_WITH_PSTL=ON` to use it (see the sketch at the end).
 - the binary fuse filters are based on the [binfuse C++
-  library](https://github.com/oschonrock/binfuse). 
+  library](https://github.com/oschonrock/binfuse).
 
 ## Future plans
 
 - partial diff/patch algorithm, so after one full download, small
   patch files can keep you up to date.
-- More packaging: 
-  - Get the .deb accepted into Debian 
-  - publish a .rpm
+- More packaging:
+  - Get the .deb accepted into Debian
+  - publish a .rpm
   - get it into a FreeBSD port
-  - produce a windows installer 
+  - produce a windows installer
 
-- Consider adding a php/pyhton/javascript extension so that queries
+- Consider adding a php/python/javascript extension so that queries
   can be trivially made from within those scripting environments
-  without going through an http server
+  without going through an http server
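+
+As a footnote to the libtbb/PSTL point above: when building from
+source, the option can be passed straight to cmake in the same way as
+`HIBP_TEST` (an untested sketch; the generator and build type are
+assumptions):
+
+```bash
+cmake -S . -B build -G Ninja -DCMAKE_BUILD_TYPE=Release -DHIBP_WITH_PSTL=ON
+cmake --build build
+```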