Skip to content

Releases: RWKV/rwkv.cpp

master-f685aa4

19 Jul 09:36
f685aa4
Compare
Choose a tag to compare
Fix "'NoneType' object has no attribute 'cast'" error when model is f…

master-25ee75e

18 Jul 09:39
25ee75e
Compare
Choose a tag to compare
Expose n_vocab, n_embed, n_layer to the Python interface (#118)

master-84634c0

27 Jun 09:29
84634c0
Compare
Choose a tag to compare
Elide logits if the logits pointer parameter is NULL (#107)

* Completely skip calculation of logits if nobody cares

This speeds up sequence mode evaluations by up to 20% if you ingest
a large prompt and then only retrieve the logits at the very end.

Note that you must pass a NULL pointer to the logits parameter in
order to take advantage of this optimization.

* logits_out=NULL documentation

master-ffc085c

26 Jun 11:24
ffc085c
Compare
Choose a tag to compare
Update GGML (#103)

* Update GGML

* Fix linux build

Of course we forgot why we did this, and broke the build again, in
the exact same way, a second time.

* Fix cuBLAS

Properly set the backend and then call ggml_cuda_transform_tensor

* Rename xx to x_prev

probably should slip this in now before we forget it's a thing.

* See how easy updates are now? (update GGML)

master-9cbb9d9

21 Jun 16:13
9cbb9d9
Compare
Choose a tag to compare
Various improvements (#104)

* Make rwkv_gpu_offload_layers return true only if layers were actually offloaded

* Validate device of tensors

* Offload all layers during test

* Consistently use FP16 and FP32 instead of float16/fp16/F16/etc.

* Use spaces for indentation

* Remove spaces between type name and []

* Add cuBLAS on Windows guide, refactor docs structure

* Insert replacement characters when decoding invalid UTF-8 sequences

* Fix compatibility

* Fix formatting

* Fix copy-pasted tensor validation

master-6b26e0d

15 Jun 11:17
6b26e0d
Compare
Choose a tag to compare
Add Python support for sequence mode (#101)

master-5316068

14 Jun 15:57
5316068
Compare
Choose a tag to compare
fix static linking for tests and extras, remove unneeded -static flag…

master-15b7c7b

14 Jun 15:59
15b7c7b
Compare
Choose a tag to compare
add standalone build option (#99)

* add standalone build option

* Update CMakeLists.txt for more clarity in comment

Co-authored-by: Alex <[email protected]>

* add endofline properly for right formating

---------

Co-authored-by: Alex <[email protected]>

master-c64009e

13 Jun 11:26
c64009e
Compare
Choose a tag to compare
Fix typo in rwkv.h docs for n_vocab (#96)

World models actually have 65536, not 65535, oops

master-bd65c97

13 Jun 14:20
bd65c97
Compare
Choose a tag to compare
Make sampling with bias numerically stable (#90)

* Update sampling.py

Remove a slow for loop on logit bias. Make the numpy re-softmax operation numerically stable.

* Update sampling.py