Skip to content

Commit 3070afd

Browse files
committed
update readme
1 parent 14152c3 commit 3070afd

File tree

1 file changed

+4
-5
lines changed

1 file changed

+4
-5
lines changed

README.md

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,6 @@
11
# SnapKV
22
We introduce an innovative and out-of-box KV cache compression method, SnapKV.
33

4-
![Comprehensive Experiment Results on LongBench](./figures/longbench.jpg)
5-
![Pressure Test Result on Needle-in-a-Haystack](./figures/LWM-Text-Chat-1M_SnapKV.jpg)
6-
74
## Quick Start
85
### Use SnapKV-optimized Models
96
SnapKV-optimized models are all under models file, which could be directly imported and used the same like baseline models.
@@ -27,5 +24,7 @@ tokenizer = transformers.AutoTokenizer.from_pretrained(
2724
### Customize Your SnapKV-optimized Models
2825
SnapKV can be easily integrate with other models. You can follow the comment marked with `[SnapKV]` in [existing models](./models) to constrcut your own models. The detailed algorithm of SnapKV is in [snapkv_utils.py](./snapkv_utils.py)
2926

30-
## Motivation
31-
The observations and motivations behind SnapKV could be found at [observation folder](./observations)
27+
28+
## Results
29+
![Comprehensive Experiment Results on LongBench](./figures/longbench.jpg)
30+
![Pressure Test Result on Needle-in-a-Haystack](./figures/LWM-Text-Chat-1M_SnapKV.jpg)

0 commit comments

Comments
 (0)