diff --git a/demos/embeddings/README.md b/demos/embeddings/README.md
index bcb81813ed..2fec549c7b 100644
--- a/demos/embeddings/README.md
+++ b/demos/embeddings/README.md
@@ -246,7 +246,13 @@ python export_model.py embeddings_ov --source_model sentence-transformers/all-mp
 **NPU**
 ::::{tab-set}
 :::{tab-item} Qwen/Qwen3-Embedding-0.6B
-:sync: Qwen3-Embedding-0.6B-fp16
+:sync: Qwen3-Embedding-0.6B-int8
+```console
+docker run --user $(id -u):$(id -g) --rm -v $(pwd)/models:/models:rw openvino/model_server:latest --pull --model_repository_path /models --source_model OpenVINO/Qwen3-Embedding-0.6B-int8-ov --pooling LAST --task embeddings
+```
+:::
+:::{tab-item} BAAI/bge-large-en-v1.5
+:sync: BAAI/bge-large-en-v1.5-fp16
 ```console
 python export_model.py embeddings_ov --source_model BAAI/bge-large-en-v1.5 --pooling CLS --weight-format fp16 --target_device NPU --config_file_path models/config.json --model_repository_path models
 ```