Rust FastLanes #345

gatesn · 2024-06-10T08:27:28Z

Full Rust implementation of FastLanes BitPacking.

It can almost be done without macros. We need a macro in order to force an inlined loop. This essentially repeats the code within the macro block T times, meaning the compiler sees it as unrolled and performs auto-vectorisation.

This code is 1:1 with the FastLanes scalar auto-vectorised implementation, meaning it has no explicit SIMD dependencies.
Unlike the Zig code, it doesn't unroll all loops, meaning the code size should be more reasonable.

Vectorized ASM on ARM:

Vectorized ASM on x86:

gatesn · 2024-06-11T09:55:38Z

Closing in favour of https://github.com/spiraldb/fastlanes

gatesn added 14 commits June 9, 2024 14:14

fastlanes seq

753097d

Macro

e1be082

Macro

7516ea1

Seq

27e4b3d

Clean up

f47ce13

Clean up

1acea6a

Clean up

b2eb660

Unpack

5c80660

BitUnpacking

84db7a6

Test all round-trips

98f03f2

Unpack single

220a784

Unpack single

d88defc

BitPacking

fcaa93e

Split into Rust crate

914e88d

gatesn changed the title ~~[wip] Rust FastLanes~~ Rust FastLanes Jun 11, 2024

Split into Rust crate

f87c0eb

gatesn marked this pull request as ready for review June 11, 2024 08:37

gatesn enabled auto-merge (squash) June 11, 2024 08:41

gatesn closed this Jun 11, 2024

auto-merge was automatically disabled June 11, 2024 09:55
Pull request was closed

gatesn deleted the ngates/rust-fl branch June 11, 2024 09:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rust FastLanes #345

Rust FastLanes #345

gatesn commented Jun 10, 2024 •

edited

Loading

gatesn commented Jun 11, 2024

Rust FastLanes #345

Rust FastLanes #345

Conversation

gatesn commented Jun 10, 2024 • edited Loading

gatesn commented Jun 11, 2024

gatesn commented Jun 10, 2024 •

edited

Loading