-
Notifications
You must be signed in to change notification settings - Fork 33k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix NameError in serving CLI due to conditional import asymmetry
#45641
opened Apr 24, 2026 by
abhiprd200
Loading…
[Trainer] default to FSDP2, simplify API around fsdp + fsdp_config
#45640
opened Apr 24, 2026 by
SunMarc
Member
Loading…
Add Multi-Token Prediction (MTP) support for Qwen3.5
#45638
opened Apr 24, 2026 by
curnane-lab
Loading…
qa: speed up dtype regex weight load + reduce dtype tests to 3 random
#45635
opened Apr 24, 2026 by
tarekziade
Collaborator
Loading…
DeepGEMM BF16, isolation, refactor
#45634
opened Apr 24, 2026 by
IlyasMoutawwakil
Member
•
Draft
6 tasks
[MistralCommonBackend] Soften validation mode and apply_chat_template arguments check
#45628
opened Apr 24, 2026 by
juliendenize
Contributor
Loading…
3 of 6 tasks
Processing Utils: honor pre-built sub-processor kwargs in from_pretrained
#45627
opened Apr 24, 2026 by
javierdejesusda
Contributor
Loading…
2 tasks done
[Model] Add PP-FormulaNet Model Support
#45626
opened Apr 24, 2026 by
zhang-prog
Contributor
Loading…
Add
supports_gradient_checkpointing to NemotronHPreTrainedModel
#45625
opened Apr 24, 2026 by
sergiopaniego
Member
Loading…
4 of 6 tasks
Add MTP speculative decoding via MTPCandidateGenerator
#45618
opened Apr 24, 2026 by
ArthurZucker
Collaborator
Loading…
fix(qianfan_ocr): add XPU expectations
#45615
opened Apr 24, 2026 by
kaixuanliu
Contributor
Loading…
Add missing requests dependency to transformers[serving]
#45614
opened Apr 24, 2026 by
Oneirag
Loading…
Add regression test for Gemma4 audio relative positional range
#45607
opened Apr 23, 2026 by
mathceo
Loading…
[gemma4] infer from config instead of hardcoding
#45606
opened Apr 23, 2026 by
eustlb
Contributor
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.