NVIDIA Corporation
- 25.5k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Pinned Loading
Repositories
- context-aware-rag Public
Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions.
NVIDIA/context-aware-rag’s past year of commit activity - Model-Optimizer Public
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
NVIDIA/Model-Optimizer’s past year of commit activity - NeMo-Retriever Public
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
NVIDIA/NeMo-Retriever’s past year of commit activity - aicr Public
Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes
NVIDIA/aicr’s past year of commit activity - srt-slurm Public
NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across specific AI use cases across hardware and software combinations.
NVIDIA/srt-slurm’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…