This makes RWKV very CPU-friendly on large context lenghts. If you use rwkv.cpp for anything serious, please test all available formats for perplexity and latency on a representative dataset, and ...
Some results have been hidden because they may be inaccessible to you