Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives".
An unofficial extended version of the ICLR 2021 paper "Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability".
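For context: the paper's central quantity is the sharpness, the largest eigenvalue of the loss Hessian, which full-batch gradient descent at learning rate η tends to drive up to roughly 2/η. A minimal sketch of how that eigenvalue is typically estimated with power iteration on Hessian-vector products (illustrative, not this repository's code):

```python
import torch


def sharpness(loss, params, iters=20):
    """Estimate the largest Hessian eigenvalue of `loss` w.r.t. `params`
    via power iteration on Hessian-vector products. Illustrative sketch,
    not this repository's implementation."""
    grads = torch.autograd.grad(loss, params, create_graph=True)
    v = [torch.randn_like(p) for p in params]
    for _ in range(iters):
        # Hessian-vector product: differentiate (grads . v) w.r.t. params
        gv = sum((g * x).sum() for g, x in zip(grads, v))
        hv = torch.autograd.grad(gv, params, retain_graph=True)
        norm = torch.sqrt(sum((h * h).sum() for h in hv))
        v = [h / norm for h in hv]
    # Rayleigh quotient of the final unit vector approximates lambda_max
    gv = sum((g * x).sum() for g, x in zip(grads, v))
    hv = torch.autograd.grad(gv, params, retain_graph=True)
    return sum((h * x).sum() for h, x in zip(hv, v)).item()
```

Training at learning rate η sits at the edge of stability when this estimate hovers near 2/η.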
PyTorch NaNs are silent killers. This hook catches them at the exact layer and batch — with ~3 ms overhead vs ~7 ms for set_detect_anomaly.
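The repository's exact hook isn't reproduced here, but a minimal sketch of the general technique is to register a forward hook on every submodule and fail fast at the first non-finite output; the `batch_idx` bookkeeping below is an illustrative assumption:

```python
import torch
import torch.nn as nn


def install_nan_hooks(model: nn.Module):
    """Raise at the first NaN/Inf output, naming the offending layer.
    Sketch of the general technique; names are illustrative."""
    state = {"batch_idx": 0}  # the training loop increments this per batch

    def make_hook(name):
        def hook(module, inputs, output):
            tensors = output if isinstance(output, tuple) else (output,)
            for t in tensors:
                if torch.is_tensor(t) and not torch.isfinite(t).all():
                    raise RuntimeError(
                        f"non-finite output in layer {name!r} "
                        f"at batch {state['batch_idx']}"
                    )
        return hook

    for name, module in model.named_modules():
        module.register_forward_hook(make_hook(name))
    return state
```

Incrementing `state["batch_idx"]` once per step in the training loop lets the error message pinpoint both the layer and the batch.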
RVAV: a physics-informed PyTorch optimizer for energy-stable, high-LR training—with a simple closure API, tests, CI, and quickstart.
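RVAV's update rule isn't shown here, but a closure-style optimizer is driven the same way as PyTorch's built-in LBFGS: the training loop hands `step()` a function that recomputes the loss, so the optimizer can evaluate it more than once per step. A hypothetical usage sketch with LBFGS standing in for RVAV:

```python
import torch

# Stand-in for the RVAV optimizer: any closure-based optimizer
# (here torch.optim.LBFGS) is driven with the same API.
model = torch.nn.Linear(10, 1)
optimizer = torch.optim.LBFGS(model.parameters(), lr=0.1)
x, y = torch.randn(32, 10), torch.randn(32, 1)

def closure():
    # Re-evaluates the loss so the optimizer may call it several times
    # within one step (e.g. for line searches or energy checks).
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()
    return loss

optimizer.step(closure)
```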
Benchmarking GAN optimizers (Adam, RMSprop, SGD, Lookahead) on CIFAR-10 using WGAN-GP and FID evaluation.
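For reference, WGAN-GP adds a gradient penalty λ·E[(‖∇x̂ D(x̂)‖₂ − 1)²] on points x̂ interpolated between real and fake samples. A minimal sketch of that penalty (standard formulation, not this repository's code), assuming NCHW image batches as in CIFAR-10:

```python
import torch


def gradient_penalty(critic, real, fake, lam=10.0):
    """WGAN-GP penalty: push the critic's gradient norm toward 1 on
    random interpolates of real and fake batches (illustrative sketch)."""
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    interp = (eps * real + (1 - eps) * fake).requires_grad_(True)
    scores = critic(interp)
    grads = torch.autograd.grad(
        outputs=scores, inputs=interp,
        grad_outputs=torch.ones_like(scores),
        create_graph=True,  # penalty itself must be differentiable
    )[0]
    grad_norm = grads.flatten(1).norm(2, dim=1)
    return lam * ((grad_norm - 1) ** 2).mean()
```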
A parameterized, drop-in activation function for stabilizing deep Batch-Normalization-free networks in micro-batch and Federated Learning settings.
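As an illustration of the "parameterized, drop-in" idea (the actual function in this repository may differ), a learnable activation packaged as an `nn.Module` can replace ReLU anywhere in a network:

```python
import torch
import torch.nn as nn


class LearnableSwish(nn.Module):
    """Illustrative parameterized activation: x * sigmoid(beta * x) with a
    trainable beta per layer. A drop-in ReLU replacement; not necessarily
    the function proposed in this repository."""

    def __init__(self, beta: float = 1.0):
        super().__init__()
        self.beta = nn.Parameter(torch.tensor(beta))

    def forward(self, x):
        return x * torch.sigmoid(self.beta * x)


# Drop-in usage: swap the activation without touching the rest of the net.
net = nn.Sequential(nn.Linear(64, 64), LearnableSwish(), nn.Linear(64, 10))
```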
Drift-Aware Adaptive Aggregation (DAA) for federated learning on CIFAR-10 under heterogeneous client partitions.
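The DAA rule itself isn't reproduced here; as a hedged sketch of the general idea, a server can down-weight clients whose models have drifted far from the global model before averaging. The inverse-drift weighting below is an illustrative assumption, not necessarily the repository's rule:

```python
import torch


def drift_aware_aggregate(global_params, client_params, eps=1e-8):
    """Illustrative drift-aware aggregation: weight each client inversely
    to the L2 distance between its parameters and the global model's,
    then take the weighted average. Assumption, not the exact DAA rule."""
    drifts = torch.stack([
        torch.sqrt(sum((cp - gp).pow(2).sum()
                       for cp, gp in zip(c, global_params)))
        for c in client_params
    ])
    weights = 1.0 / (drifts + eps)   # larger drift -> smaller weight
    weights = weights / weights.sum()
    return [
        sum(w * c[i] for w, c in zip(weights, client_params))
        for i in range(len(global_params))
    ]
```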