Releases: RWKV/rwkv.cpp
Releases · RWKV/rwkv.cpp
master-74f50ae
Add wkv v5 custom operator (#148) * Add wkv v5 custom operator * Fix build * Comment out intrinsics
master-20a8549
Add RWKV v5.1 and v5.2 support (#145) * Reformat CMakeLists and docs * Add RWKV v5.1 and v5.2 support * Remove sequence_length-based heuristic for allocating ggml context * Set GGML_CUDA_MMV_Y to 2 by default * Clarify comment; update ggml * Make some operations inplace * Make some operations inplace * Add late_abort option for tests * Increase thresholds * Increase thresholds * Update README.md
master-22a2778
add hipBLAS for windows (#135) * add hipBLAS for windows * fix test build * do test and fix load dll on windows * add hipBLAS_on_Windows document * update hipBLAS_on_Windows doc * add benchmark
master-39ed572
Various improvements (#131) * Implement model head offloading * Guess the tokenizer from n_vocab * Make PyTorch optional for inference * Add function to offload layers * Add rwkv_eval_sequence_in_chunks
master-0df970a
Decrease memory padding for serial and sequential contexts (#132)
master-6caa45e
Python API restructurization & code style improvements (#130) * Replace tabs with 4 spaces * Refactor tests * Rename Python scripts directory to "python" * Create a separate package for the official Python API * Move Python inference example to a separate file * Add missing const * Refactor extras * Split rwkv.cpp into smaller files * Clean up cpp code * Rename rwkv package to rwkv_cpp * Add missing type hints * Rewrite automatic library lookup * Add compatibility warning * Fix MacOS build * Fix MacOS build
master-8db73b1
Update ggml (#128) * Fix quantize.py doc * Add Q5 format compatibility test * Update ggml * Add documentation about limitations of sequence mode * Fix most compiler warnings * Clean up CMakeLists.txt * Assert contiguity instead of assuming it * Update README.md * Fix warnings * Try to fix compilation error * Attempt to fix Ubuntu build * Attempt to fix Ubuntu build * Restore all build jobs * Allow sequence lengths of up to 64 out of the box by forking ggml
master-d6c691e
add other language bindings (#126) * add other language bindings * Update README.md --------- Co-authored-by: Alex <[email protected]>
master-2d3cdd7
only append to cpu string if not initialized (#125) * only append to cpu string if not initialized * Fix code style --------- Co-authored-by: Alex <[email protected]>
master-84f34c5
Implement basic CLBlast support (#110) * Get this thing building Unzip the OpenCL SDK and CLBlast distribution into the repo root, then enable RWKV_CLBLAST and regenerate makefiles to pick them up. Currently builds and runs. * Really offload tensors to OpenCL rather than cuBLAS * Fix CLBlast builds in CMake release mode Somehow the path handling is different here which requires me to be quite a bit more annoying about it. * Remove `brew update` * Try building without sanitizer (maybe it would work this time?) --------- Co-authored-by: saharNooby <[email protected]>