Skip to content

Commit 27b201f

Browse files
committed
update readme
1 parent 6a1182c commit 27b201f

File tree

1 file changed

+9
-1
lines changed

1 file changed

+9
-1
lines changed

README.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,9 @@
11
# SnapKV
2+
We introduce an innovative and out-of-box KV cache compression method, SnapKV.
3+
4+
![Comprehensive Experiment Results on LongBench](./figures/longbench.jpg)
5+
![Pressure Test Result on Needle-in-a-Haystack](./figures/LWM-Text-Chat-1M_SnapKV.pdf)
6+
27
## Quick Start
38
### Use SnapKV-optimized Models
49
SnapKV-optimized models are all under models file, which could be directly imported and used the same like baseline models.
@@ -20,4 +25,7 @@ tokenizer = transformers.AutoTokenizer.from_pretrained(
2025
```
2126

2227
### Customize Your SnapKV-optimized Models
23-
SnapKV can be easily integrate with other models. You can follow the comment marked with `[SnapKV]` in [existing models](./models) to constrcut your own models. The algorithm of SnapKV is in [snapkv_utils.py](./snapkv_utils.py)
28+
SnapKV can be easily integrate with other models. You can follow the comment marked with `[SnapKV]` in [existing models](./models) to constrcut your own models. The detailed algorithm of SnapKV is in [snapkv_utils.py](./snapkv_utils.py)
29+
30+
## Motivation
31+
The observations and motivations behind SnapKV could be found at [observation folder](./observations)

0 commit comments

Comments
 (0)