
CUDA LLM Kernel Optimization


LLM-Speed is a CUDA kernel optimization project for LLM inference, covering FlashAttention, Tensor Core GEMM (HGEMM), pybind11 Python bindings, and correctness/benchmark verification workflows.

Repository Overview

  • CUDA kernels in src/ and reusable primitives in include/
  • Python bindings and packaging in python/, setup.py, and pyproject.toml
  • Tests and benchmarks in tests/ and benchmarks/
  • GitHub Pages site for documentation entry, reading paths, and project updates

Quick Start

# Install Python dependencies and the package in editable mode
pip install -r requirements.txt
pip install -e .

# Configure and build the CUDA kernels (requires CMake and the CUDA toolkit)
cmake --preset release
cmake --build build/release -j$(nproc)

# Run the test suite
pytest tests/ -v
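The verification workflow the tests follow (comparing kernel output against a reference implementation within a floating-point tolerance) can be sketched in plain Python. The function names below are illustrative only, not the project's actual API:

```python
def max_abs_error(actual, expected):
    """Largest element-wise absolute difference between two flat lists."""
    return max(abs(a - e) for a, e in zip(actual, expected))

def check_close(actual, expected, atol=1e-2):
    """Return True if every element agrees within atol, a typical
    tolerance when comparing fp16 kernel output to an fp32 reference."""
    return max_abs_error(actual, expected) <= atol

# Example: a hypothetical "kernel" result with small rounding error
# versus the reference values.
reference = [0.1, 0.2, 0.3]
kernel_out = [0.1001, 0.1999, 0.3002]
print(check_close(kernel_out, reference))  # True
```

In the real test suite this comparison would be done with framework tensor utilities rather than Python lists, but the tolerance-based pass/fail criterion is the same idea.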

Docs

  • Project docs: https://lessup.github.io/llm-speed/
  • Site home explains where to start, what to read next, and how the docs are organized
  • See CONTRIBUTING.md for contribution workflow

License

MIT License

About

CUDA kernel library for LLM inference acceleration: FlashAttention, HGEMM, and Tensor Core GEMM, with pybind11 Python bindings.
