Skip to content

unable to generate response on multi gpu. #28

@ashunaveed

Description

@ashunaveed

while generating response, got following error: 2025-01-12 19:21:15,471 - INFO - main - Documents indexed successfully.
127.0.0.1 - - [12/Jan/2025 19:21:15] "POST /chat HTTP/1.1" 200 -
127.0.0.1 - - [12/Jan/2025 19:21:21] "GET /new_session HTTP/1.1" 302 -
127.0.0.1 - - [12/Jan/2025 19:21:21] "GET /chat HTTP/1.1" 200 -
127.0.0.1 - - [12/Jan/2025 19:21:21] "GET /static/css/styles.css HTTP/1.1" 304 -
127.0.0.1 - - [12/Jan/2025 19:21:23] "GET /switch_session/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b HTTP/1.1" 302 -
127.0.0.1 - - [12/Jan/2025 19:21:23] "GET /chat HTTP/1.1" 200 -
127.0.0.1 - - [12/Jan/2025 19:21:23] "GET /static/css/styles.css HTTP/1.1" 304 -
127.0.0.1 - - [12/Jan/2025 19:21:28] "GET /get_indexed_files/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b HTTP/1.1" 200 -
127.0.0.1 - - [12/Jan/2025 19:21:32] "GET /switch_session/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b HTTP/1.1" 302 -
127.0.0.1 - - [12/Jan/2025 19:21:32] "GET /chat HTTP/1.1" 200 -
127.0.0.1 - - [12/Jan/2025 19:21:32] "GET /static/css/styles.css HTTP/1.1" 304 -
127.0.0.1 - - [12/Jan/2025 19:21:47] "POST /rename_session HTTP/1.1" 200 -
2025-01-12 19:21:57,148 - INFO - models.retriever - Retrieving documents for query: what are the technical elegibility criteria for llp companies
2025-01-12 19:21:57,283 - INFO - models.retriever - Added image to list: images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_cc27c01785f684942697aea07701cfde.png
2025-01-12 19:21:57,388 - INFO - models.retriever - Added image to list: images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_ac356a1c131cf82f86e9c17138d8d27f.png
2025-01-12 19:21:57,498 - INFO - models.retriever - Added image to list: images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_0f80b130432a76157b9055a61a3411ad.png
2025-01-12 19:21:57,498 - INFO - models.retriever - Total 3 documents retrieved. Image paths: ['images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_cc27c01785f684942697aea07701cfde.png', 'images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_ac356a1c131cf82f86e9c17138d8d27f.png', 'images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_0f80b130432a76157b9055a61a3411ad.png']
2025-01-12 19:21:57,498 - INFO - main - Retrieved images: ['images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_cc27c01785f684942697aea07701cfde.png', 'images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_ac356a1c131cf82f86e9c17138d8d27f.png', 'images/2d76bc4f-cad3-42ee-b3b4-c269dbcba02b/retrieved_0f80b130432a76157b9055a61a3411ad.png']
2025-01-12 19:21:57,498 - INFO - models.responder - Generating response using model 'qwen'.
Downloading shards: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 5/5 [00:01<00:00, 2.59it/s]
Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 5/5 [00:08<00:00, 1.78s/it]
You shouldn't move a model that is dispatched using accelerate hooks.
2025-01-12 19:22:17,998 - INFO - models.model_loader - Qwen model loaded and cached.
2025-01-12 19:22:18,255 - ERROR - models.responder - Error generating response: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1! (when checking argument for argument mat2 in method wrapper_CUDA_bmm)
127.0.0.1 - - [12/Jan/2025 19:22:18] "POST /chat HTTP/1.1" 200 -
127.0.0.1 - - [12/Jan/2025 19:22:18] "POST /rename_session HTTP/1.1" 200 -

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions