[BUG] Inline loading of models at Symlinks causes models to unload and reload

### OS

Linux

### GPU Library

CUDA 12.x

### Python version

3.12

### Describe the bug

I set `inline_model_loading: true` so I could load models dynamically from the `models` directory by name.

I saved the model from huggingface.co, such as `ArtusDev_Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3`, then I created a symlink named `coder` to that directory.

The goal was to be able to use "coder" as my model name for the API call while keeping a descriptive directory name on disk. This also lets me re-symlink to a different model without changing the clients making the calls.

However, I think it resolves the symlink, properly loads the model but thinks the model is called `ArtusDev_Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3` instead of `coder`, then when the next API call comes in for model `coder` it unloads the model, then immediately reloads it (its the same model).

### Reproduction steps

1) Set `inline_model_loading: true`
2) Save a model to the models directory. I used `ArtusDev_Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3`
3) `ln -s ./ArtusDev_Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3 coder`
4) Make an API call with the model name "coder"
5) The model will unload, then reload

### Expected behavior

The model would stay loaded and be used. 

### Logs

_No response_

### Additional context

This is super low priority

### Acknowledgements

- [x] I have looked for similar issues before submitting this one.
- [x] I have read the disclaimer, and this issue is related to a code bug. If I have a question, I will use the Discord server.
- [x] I understand that the developers have lives and my issue will be answered when possible.
- [x] I understand the developers of this program are human, and I will ask my questions politely.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[BUG] Inline loading of models at Symlinks causes models to unload and reload #379

OS

GPU Library

Python version

Describe the bug

Reproduction steps

Expected behavior

Logs

Additional context

Acknowledgements

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[BUG] Inline loading of models at Symlinks causes models to unload and reload #379

Description

OS

GPU Library

Python version

Describe the bug

Reproduction steps

Expected behavior

Logs

Additional context

Acknowledgements

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions