Recursive Language Model that improves any LLM while reducing token usage up to 4X across 12 benchmarks.
-
Updated
Apr 3, 2026 - Python
Recursive Language Model that improves any LLM while reducing token usage up to 4X across 12 benchmarks.
🎛️ All-in-one control center for Windows system tweaks and optimizations.
40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment
Amazing Latency Performance Audit
Comprehensive guide for tuning Linux network stack buffers (socket, TCP, qdisc, NIC rings) on RHEL/OEL 8. Includes detailed documentation, RTT-based buffer calculations, tuning profiles for low-latency and high-throughput scenarios, and production-ready shell scripts for validation and monitoring.
**Arc-Rides MAX FPS Booster** is an advanced Windows performance optimization tool designed to increase FPS, reduce lag, and boost gaming performance on Windows 10 and 11. This FPS booster optimizes system settings, disables unnecessary background processes, lowers latency, and maximizes PC speed for smooth, stable, high-performance gameplay.
This repo focuses on latency-aware resource optimization for Kubernetes
Experimental weights optimization for high-speed inference on distributed clusters.
Ultimate Windows optimization toolkit for gamers and power users. PowerShell scripts, registry tweaks, driver configs, debloat tools, telemetry blockers, and performance boosters. Reduce input lag, boost FPS, clean bloatware, disable tracking, and fine-tune Windows 10/11 for maximum gaming performance. One-click automation with backup support.
AmazeAim is a lightweight system utility designed for competitive gamers to minimize mouse input lag and stabilize polling rate consistency on Windows.
SIMD-Accelerated RLNC Sidecar for Deterministic MEV & HFT Networking
Latency-optimized IoT automation system using dual-core ESP32 and ESP8266 with OTA updates
AmazeGaming is a lightweight C# utility designed for real-time automatic optimization of gaming processes. It dynamically allocates system resources, minimizes input lag, and prioritizes the active gaming window to ensure peak performance.
EDLIoT: Energy and Delay Load-balancing IoT scheduling framework using cooperative game theory and intelligent threshold detection in fog computing.
Memory-Block Protocol (MBP) for reducing latency in long-context GPT conversations
Real-time computer vision system using OpenCV/DLib for facial landmark tracking. Optimized for edge deployment with <30ms latency and 95%+ accuracy.
Request hedging for tail latency reduction in distributed systems
A production-grade benchmarking suite for RAG agents, tracking P99 latency and multi-agent reliability.
Hybrid CV processing pipeline combining deterministic pattern extraction with selective LLM refinement. 99.5% cost reduction, 1195x faster time-to-first-result.
UCB bandit-based model serving optimizer with automatic latency/accuracy/cost tradeoff. 86.7% P99 latency reduction with zero accuracy degradation in production simulation.
Add a description, image, and links to the latency-optimization topic page so that developers can more easily learn about it.
To associate your repository with the latency-optimization topic, visit your repo's landing page and select "manage topics."