feat: InferPage settings#124

Open
Madzionator wants to merge 9 commits into main from feat/infer-page-settings

Conversation


@Madzionator Madzionator commented Mar 6, 2026

1. Settings UI for InferPage

  • Added dedicated Settings page with backend selection, model configuration, and API keys
  • Per-backend profile storage (remembers model choice + capability flags per backend)

2. Custom Model Paths

  • Users can now store models anywhere via Settings (not locked to C:\Models)
  • App loads model from custom location when specified

3. Image & MMProj Support

  • Full support for multimodal projector files (.mmproj) for local vision models
  • Image uploads automatically handled and routed correctly
  • Settings shows expected mmproj file path for registered vision models

4. Code Refactoring

  • Centralized settings persistence logic
  • Unified capability checks across UI and service layers
  • Cleaner startup logic with backend validation

5. LLamaSharp Upgrade

  • Updated to 0.26.0 with MTMD API for better image handling
  • Improved vision model reliability

Introduce a browser-based settings experience and improve model path resolution:

- Add a Settings panel (Settings.razor) with styles and JS (settings.css, settings.js) and services to persist settings and API keys (SettingsService, InferPageSettings, SettingsStateService).
- Wire the settings UX into NavBar and Home to request, show, and save settings, and apply them at runtime via Utils.ApplySettings; Program.cs now distinguishes CLI-provided config from browser configuration (Utils.NeedsConfiguration).
- Update Utils to support manual capability overrides and sanitize model paths, and tweak app CSS and script includes.
- Improve LLMService local model resolution and path handling (GenericLocalModel fallback, ResolvePath) so unregistered or absolute model paths load correctly.
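The ResolvePath behavior described above could be sketched roughly like this (a minimal, illustrative sketch only; the real LLMService.ResolvePath signature and fallback order may differ):

```csharp
using System.IO;

static class ModelPathResolver
{
    // Absolute paths that exist on disk are used as-is; anything else is
    // treated as a model name resolved against the configured models directory.
    public static string ResolvePath(string modelNameOrPath, string modelsDirectory)
    {
        if (Path.IsPathFullyQualified(modelNameOrPath) && File.Exists(modelNameOrPath))
            return modelNameOrPath;

        return Path.Combine(modelsDirectory, modelNameOrPath);
    }
}
```

This is what lets users point the app at models outside the default directory: an absolute path short-circuits the lookup instead of being forced under a fixed models root.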
Several refactors and behavior fixes across the app:

- NavBar: call SettingsState.RequestSettings directly from the settings button, simplify theme toggle and reload helpers, remove unused flag/methods.
- Home: prevent adding image inputs when the selected model doesn't support vision and display an error.
- Settings page: build backend options with Prepend for simpler ordering.
- Program.cs: validate Self backend model at startup and exit if unsupported; adjust AddMaIN registration flow based on NeedsConfiguration and BackendType.
- SettingsService: convert to constructor-injected style, consolidate load/save into generic dict helpers, and add typed methods for saving/getting API keys and model history.
- SettingsStateService: shorten comment describing the event bus.
- Utils: unify capability checks with a generic GetCapability<T>, make Reason mutually exclusive with ImageGen, and streamline environment variable handling when setting backend API keys.

These changes simplify codepaths, centralize settings persistence logic, and enforce model capability checks earlier and at the UI level.
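The generic capability check mentioned above might look something like the following (names and storage shape are assumptions for illustration; the real Utils.GetCapability&lt;T&gt; may read from model metadata or settings):

```csharp
using System.Collections.Generic;

// Illustrative sketch: unify per-model capability lookups behind one
// generic helper instead of separate per-flag methods.
public static class CapabilityUtils
{
    // Hypothetical backing store mapping model name -> capability name -> value.
    private static readonly Dictionary<string, Dictionary<string, object>> Capabilities = new();

    public static T? GetCapability<T>(string model, string capability)
    {
        if (Capabilities.TryGetValue(model, out var caps) &&
            caps.TryGetValue(capability, out var value) &&
            value is T typed)
        {
            return typed;
        }
        return default;
    }
}
```

A single generic accessor keeps the UI and service layers consulting the same source of truth, which is what makes the "unified capability checks" claim above hold.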
Add per-backend profile storage (model + capability flags) and use it in the settings UI:

- SettingsService: introduce a BackendProfile record and SaveProfileForBackendAsync/GetProfileForBackendAsync using a new storage key.
- Settings.razor: load the profile on backend select, restore manual vision/reasoning/imagegen flags only for unregistered models, fall back to the legacy model history when no profile exists, and save the full profile instead of just the model.

This preserves capability information per backend while remaining compatible with older model-only entries.
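The per-backend profile could be sketched like this (property names and the in-memory store are illustrative; the real SettingsService persists through its generic dict helpers under a dedicated storage key):

```csharp
using System.Collections.Generic;
using System.Threading.Tasks;

// Hypothetical shape of the profile remembered per backend.
public record BackendProfile(
    string? Model,
    bool Vision,
    bool Reasoning,
    bool ImageGen,
    string? MmProjName);

public class BackendProfileStore
{
    // Stands in for the persisted dictionary keyed by backend name.
    private readonly Dictionary<string, BackendProfile> _profiles = new();

    public Task SaveProfileForBackendAsync(string backend, BackendProfile profile)
    {
        _profiles[backend] = profile;
        return Task.CompletedTask;
    }

    public Task<BackendProfile?> GetProfileForBackendAsync(string backend) =>
        Task.FromResult(_profiles.TryGetValue(backend, out var p) ? p : null);
}
```

Returning null when no profile exists is what lets the UI fall back to the legacy model-only history entries.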
Add support for multimodal projector (.mmproj) names across the stack and ensure uploaded image files are treated as images:

- Implement MMProjectName on several local vision models and add an MmProjName property to InferPage settings, backend profiles, Utils, and ServiceConstants.
- Settings UI now exposes an MMProj File input for local unregistered vision models and saves/loads it per backend.
- LLMService now extracts image files from message.Files early (ChatHelper.ExtractImageFromFiles), resolves the mmproj name from the model or chat properties to load LLava weights, and requires loaded weights before processing image messages.
- ChatHelper.ExtractImageFromFiles moves image files into message.Images and cleans up Files so they aren't misrouted to RAG/memory.
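The extraction step could look roughly like this (the Message/attachment types and extension list are assumptions for illustration; only the helper name comes from the PR):

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Linq;

// Minimal stand-ins for the real message types.
public record FileAttachment(string Name, byte[] Data);

public class Message
{
    public List<FileAttachment> Files { get; } = new();
    public List<FileAttachment> Images { get; } = new();
}

public static class ChatHelper
{
    private static readonly HashSet<string> ImageExtensions =
        new(StringComparer.OrdinalIgnoreCase) { ".png", ".jpg", ".jpeg", ".webp", ".bmp" };

    // Move image files out of Files into Images so downstream code doesn't
    // route them into RAG/memory handling by mistake.
    public static void ExtractImageFromFiles(Message message)
    {
        var images = message.Files
            .Where(f => ImageExtensions.Contains(Path.GetExtension(f.Name)))
            .ToList();

        foreach (var image in images)
        {
            message.Images.Add(image);
            message.Files.Remove(image);
        }
    }
}
```

Materializing the matches with ToList() before removing avoids mutating the Files collection while it is being enumerated.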
Home: when a custom Model Path is set for the Self backend, bake the resolved full file path into a generic local model instance so LLMService loads from the correct location (creates GenericLocal[Vision][Reasoning]Model variants with FileName set to the resolved path). This fixes cases where Chat.Properties don't reach the service and the model file path must be embedded in the model object itself.
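A sketch of that selection logic, assuming the class names named above and a simple FileName property (the real MaIN model hierarchy may differ):

```csharp
using System.IO;

// Illustrative stand-ins; names follow the PR description.
public abstract class ModelBase { public string? FileName { get; set; } }
public class GenericLocalModel : ModelBase { }
public class GenericLocalVisionModel : ModelBase { }
public class GenericLocalReasoningModel : ModelBase { }

public static class SelfModelFactory
{
    public static ModelBase BuildSelfModel(string modelPath, bool vision, bool reasoning)
    {
        ModelBase model = (vision, reasoning) switch
        {
            (true, _) => new GenericLocalVisionModel(),
            (_, true) => new GenericLocalReasoningModel(),
            _         => new GenericLocalModel()
        };

        // Bake the resolved path into the model object so the service loads
        // the right file even when Chat.Properties never reach it.
        model.FileName = Path.GetFullPath(modelPath);
        return model;
    }
}
```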

Settings: add RegisteredMmProjPathHint to display the expected MMProj file path for registered local vision models (shows a "MMProj file: ..." hint derived from ResolvedModelPathPreview or fallback model directory), keeping the hint in sync with the "Will load:" preview.

LLMService: when resolving an mmproj for image models, prefer the directory of a fully-qualified modelKey (custom model file path) and fall back to the configured models path; this ensures mmproj files are located next to custom model files.
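That lookup order can be sketched in a few lines (a hedged sketch; the real helper name and signature in LLMService are assumptions):

```csharp
using System.IO;

static class MmProjResolver
{
    // Prefer the directory of a fully-qualified model path, so the mmproj is
    // found next to a custom model file; otherwise fall back to the
    // configured models directory.
    public static string ResolveMmProjPath(
        string modelKey, string mmProjName, string configuredModelsPath)
    {
        var baseDir = Path.IsPathFullyQualified(modelKey)
            ? Path.GetDirectoryName(modelKey)!
            : configuredModelsPath;

        return Path.Combine(baseDir, mmProjName);
    }
}
```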
Update settings dialog sizing in src/MaIN.InferPage/wwwroot/settings.css to use viewport-based width and constraints: set width to 40vw with a min-width of 480px and max-width 90vw, and increase max-height from 85vh to 95vh. This improves layout and usability across different screen sizes.
