#

model-routing

Here are 133 public repositories matching this topic...

CommonstackAI / UncommonRoute

Automatic LLM router — 82% cost savings, 79.4% accuracy, 93.4% pass rate. Drop-in OpenAI proxy.

agent router ai openai cost-optimization llm anthropic model-routing

Updated Apr 30, 2026
Python

NadirRouter / NadirClaw

Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenClaw. Saves 40-70% on AI API costs. Self-hosted, no middleman.

Updated May 1, 2026
Python

greynewell / infermux

Route inference across LLM providers. Track cost per request.

Updated Feb 17, 2026
Go

3873225350 / like-code

A customized Claude Code fork for routed models, multi-agent orchestration, and dense terminal control.

tui multi-agent agent-orchestration claude-code model-routing llm-tooling customized-cc

Updated Apr 28, 2026
TypeScript

fengzhizi715 / OpenVitamin

OpenVitamin is a local-first AI execution platform that unifies Agents, Workflows, and multi-model inference into a single programmable system — designed for building real, production-grade AI applications.

agent workflow ai multi-model execution-engine ai-agents rag fastapi ai-platform local-first onnxruntime llm llama-cpp local-ai ai-platforms agent-orchestration openai-compatible agent-runtime model-routing

Updated Apr 14, 2026
Python

syrin-labs / syrin-python

Developer-first Python framework for AI agents with built-in budget control, context, memory and observability.

ai budgeting multi-agent memory-management observability rag ai-agent multimodal-agent agentic-rag context-engineering model-routing harness-engineering

Updated Apr 26, 2026
Python

codeking-ai / cligate

Multi-protocol AI proxy server for Claude Code, Codex CLI, Gemini CLI & OpenClaw. Account pooling, API key management, free model routing, and visual dashboard.

Updated May 3, 2026
JavaScript

Aaryan-Kapoor / ModelGate-Hackathon

🏆Winning Project | ModelGate is a contract-aware AI control plane that ingests customer contracts, extracts SLA/privacy/routing constraints, and generates an OpenAI-compatible endpoint that automatically routes every request to the optimal model. Simple queries go to cheap models. Complex queries go to premium ones.

reinforcement-learning ai hackathon model nextjs routing openai lora quantization cost-optimization fine-tuning fastapi openai-api llm llamacpp gguf grpo openai-compatible model-routing

Updated Mar 22, 2026
TypeScript

openfreerouter / freerouter

Free, self-hosted AI model router. OpenRouter / ClawRouter alternative using your own API keys. 14-dimension classifier routes to the right model (Anthropic/OpenAI/Kimi) automatically. No middleman, no markup. Built for OpenClaw.

self-hosted openai cost-optimization anthropic llm-proxy llm-router ai-proxy model-routing openclaw openrouter-alternative ai-model-router

Updated Feb 14, 2026
TypeScript

megeezy / Chameleon

Stateless LLM runtime that dynamically routes, loads, executes, and unloads models per request with bounded VRAM caching and intelligent model selection.

systems-programming llm generative-ai ai-infrastructure latency-optimization model-routing vram-optimization model-scheduling

Updated Apr 12, 2026
Rust

claude-router

0xrdan / claude-router

Intelligent model orchestration for Claude Code - routes queries to optimal Claude model (Haiku/Sonnet/Opus) based on complexity. It also includes many more features. If this project is working well for you and would like to support me, just help spread the word. Thanks!

claude cost-optimization llm anthropic claude-code model-routing

Updated Jan 26, 2026
Python

tzachbon / claude-model-router-hook

Claude Code hooks that auto-switch model tier based on task complexity

Updated Mar 13, 2026
Python

RagavRida / mmcp

Multi-Model Collaboration Pipeline — orchestrate AI models as a DAG. RL routing, multi-verifier voting, agent mesh, self-improving. Works with OpenAI, Anthropic, Gemini, DeepSeek. npm install mmcp-core | pip install mmcp-core

Updated Apr 12, 2026
TypeScript

kalibr-ai / kalibr-sdk-python

Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.

Updated May 3, 2026
Python

Ruthwik000 / tokenfirewall

Scalable LLM cost enforcement middleware for Node.js with budget protection and multi-provider support

nodejs middleware typescript gemini openai budget cost-control llm anthropic token-counter model-routing automatic-failover ai-cost-management

Updated Mar 24, 2026
TypeScript

Hyperion-HQ / Hyperion

Ultra-low-latency LLM gateway with microsecond caching, dynamic routing, budgets, analytics, and forecasting.

Updated Apr 2, 2026
Go

chandika / openclaw-model-router

Intelligent model routing for OpenClaw - Sonnet 4.6 vs Opus 4.6 based on task complexity

ai claude model-routing openclaw

Updated Feb 18, 2026

ClawAI

ihabkhaled / ClawAI

Claw is a local-first AI control plane that runs powerful open models on your machine and connects to top LLM providers. It intelligently routes every prompt to the best model, giving you one secure workspace for chat, memory, context, connectors, routing, and full AI orchestration.

Updated May 2, 2026
TypeScript

senda-labs / DQIII8

Works for you. Go outside and live. — AI orchestrator that auto-routes tasks to the cheapest model that solves them. 70% run free on local models. Self-auditing, self-improving, zero prompting skill needed. Built with vibe coding by a finance student. Your models, your data.

self-hosted multi-model claude cost-optimization groq ai-agent ai-automation ollama ai-orchestration vibe-coding claude-code ai-routing personal-ai-assistant model-routing

Updated May 3, 2026
Python

ApiliumCode / mayros

Production-ready AI agent framework — semantic memory, multi-agent mesh, MCP server, intelligent routing, governance, and 67+ platform integrations.

Updated Apr 9, 2026
TypeScript

Improve this page

Add a description, image, and links to the model-routing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the model-routing topic, visit your repo's landing page and select "manage topics."