This project contains my notes for the Nebius AI Performance Engineering course: a comprehensive set of materials, guides, and exercises. I'm updating it as I go through the course.
The course explores how to transition from raw machine learning models to functional AI-driven products.
-
Intro to AI & LLMs: An essential introduction to the landscape of Large Language Models. This module covers:
- What changed with LLMs and their core limitations.
- The GPT assistant training pipeline (Pretraining, SFT, RLHF).
- Tokenization strategies and token economics.
- Prompt and Context engineering techniques, including Zero-shot, Few-shot, and Chain-of-Thought prompting.
- Practical insights into tool use and function calling.
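The prompting techniques listed above can be illustrated without any model API, since a few-shot prompt is just assembled text. A minimal sketch (the sentiment task, labels, and reviews are made-up placeholders):

```python
# Few-shot prompting sketch: interleave worked (input, label) examples
# before the query so the model can infer the task from the pattern.

def build_few_shot_prompt(examples, query):
    """Assemble a few-shot classification prompt from (input, label) pairs."""
    lines = ["Classify the sentiment of each review as positive or negative.", ""]
    for text, label in examples:
        lines.append(f"Review: {text}")
        lines.append(f"Sentiment: {label}")
        lines.append("")
    lines.append(f"Review: {query}")
    lines.append("Sentiment:")
    return "\n".join(lines)

examples = [
    ("I loved every minute of it.", "positive"),
    ("Total waste of money.", "negative"),
]
prompt = build_few_shot_prompt(examples, "The plot dragged, but the acting was great.")
print(prompt)
```

Zero-shot is the same prompt with an empty `examples` list; Chain-of-Thought would additionally include reasoning steps in each example's answer.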
-
Evaluation & Benchmarks: An introduction to measuring and comparing the quality of LLM systems. This module covers:
- Why LLM evaluation is hard
- Evaluation metrics
- Evaluation-Driven Development (EDD) - the mindset
- Common metrics and where they break
- Common benchmarks and their expiration dates
- LLM-as-a-Judge and automated behavioral evals (Anthropic's Bloom)
- Human evaluation
- EDD in practice: turning metrics into decisions
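As a concrete instance of "common metrics and where they break", here is a hedged sketch of normalized exact match, one of the simplest automated metrics. The QA pairs are invented for illustration; note how a correct paraphrase still scores zero:

```python
# Normalized exact match: strict string equality after lowercasing,
# stripping punctuation, and collapsing whitespace.
import string

def normalize(text):
    """Lowercase, drop punctuation, collapse whitespace."""
    text = text.lower().translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())

def exact_match(predictions, references):
    """Fraction of predictions that match their reference after normalization."""
    hits = sum(normalize(p) == normalize(r) for p, r in zip(predictions, references))
    return hits / len(references)

preds = ["Paris.", "42", "The capital is Madrid"]
refs  = ["paris", "42", "Madrid"]
score = exact_match(preds, refs)
print(score)  # 2/3 -- the third answer is semantically right but scores 0
```

This brittleness is exactly what motivates LLM-as-a-Judge and human evaluation for open-ended outputs.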
-
AI Systems & Test-Time Compute: This module covers:
- Fine-Tuning vs Retrieval Augmented Generation (RAG)
- The RAG Pipeline Components
- Chunking and Embedding strategies
- Evaluating RAG Systems and the "RAG Triad"
- Evaluation Datasets and Benchmarks
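To make the chunking step concrete, here is a minimal sketch of fixed-size chunking with overlap, one of the simplest strategies in a RAG pipeline. The window and overlap sizes are illustrative; real pipelines often chunk by tokens or sentence boundaries instead of characters:

```python
# Fixed-size chunking: slide a character window over the document,
# keeping `overlap` characters shared between consecutive chunks so
# information at chunk boundaries is not lost.

def chunk_text(text, chunk_size=100, overlap=20):
    """Split text into windows of chunk_size chars with the given overlap."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step) if text[i:i + chunk_size]]

doc = "word " * 60  # 300-character stand-in document
chunks = chunk_text(doc, chunk_size=100, overlap=20)
print(len(chunks), len(chunks[0]))
```

Each chunk would then be embedded and indexed; the overlap trades index size for recall at chunk boundaries.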
-
LLM Architecture - AI and LLM Intro: A deep dive into the underlying architecture of modern LLMs.
- Intro and Generative AI Landscape
- Types of ML
- Supervised Tasks Evaluation
- Language Models
- N-Gram LM
- Language Models Evaluation
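The N-Gram LM topic above boils down to counting. A minimal bigram sketch over a toy corpus (the corpus is made up; real models also need smoothing for unseen bigrams):

```python
# Bigram language model: estimate P(next | prev) by maximum likelihood,
# i.e. count(prev, next) / count(prev, anything).
from collections import Counter, defaultdict

corpus = "the cat sat on the mat the cat ate".split()

bigram_counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigram_counts[prev][nxt] += 1

def prob(prev, nxt):
    """MLE estimate of P(nxt | prev); 0.0 if prev was never seen."""
    total = sum(bigram_counts[prev].values())
    return bigram_counts[prev][nxt] / total if total else 0.0

print(prob("the", "cat"))  # "the" is followed by "cat" in 2 of its 3 occurrences
```

Evaluation of such a model (the next bullet's topic) typically reports perplexity, the exponentiated average negative log-probability the model assigns to held-out text.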
-
- Core Terminology & Hierarchy
- Language Model Architectures
- Optimization & Regularization
- Evaluation & Benchmarks
-
Neural Networks and Learned Representations
- Neural Networks and Multi-Layer Perceptrons (MLPs)
- Activation functions and Backpropagation
- Learned representations and Word Embeddings (Word2Vec)
- Sentence Embeddings (Concatenation, Autoencoders, Pooling)
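Of the sentence-embedding strategies listed, pooling is the simplest: average the word vectors component-wise. A sketch with tiny made-up 3-d "embeddings":

```python
# Mean pooling: collapse a variable-length list of word vectors into one
# fixed-size sentence vector by averaging each dimension.

def mean_pool(word_vectors):
    """Average equal-length word vectors component-wise."""
    dim = len(word_vectors[0])
    return [sum(vec[d] for vec in word_vectors) / len(word_vectors) for d in range(dim)]

# Hypothetical 3-d embeddings for a two-word sentence.
vectors = [
    [1.0, 0.0, 2.0],   # "good"
    [3.0, 2.0, 0.0],   # "movie"
]
sentence_vec = mean_pool(vectors)
print(sentence_vec)  # [2.0, 1.0, 1.0]
```

The result has the same dimensionality regardless of sentence length, which is what makes pooled vectors usable as fixed-size inputs downstream; the cost is that word order is discarded, unlike with autoencoder-based sentence embeddings.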
Best practices for deploying, monitoring, and maintaining machine learning models in production.
Techniques for optimizing model inference, reducing latency, and managing compute resources efficiently.
Advanced topics in model refinement using Reinforcement Learning.