Pull requests: huggingface/transformers
Add rust to deepspeed image so it can build tokenizers
#35932
opened Jan 28, 2025 by
ivarflakstad
changed MllamaPreTrainedModel._prepare_4d_causal_attention_mask
#35920
opened Jan 27, 2025 by
AndriiZelenko
2 of 5 tasks
Missing weights not initialized properly #35437
#35913
opened Jan 27, 2025 by
sambhavnoobcoder
Fix how we compute the final non-padding token for ForSequenceClassification models
#35911
opened Jan 27, 2025 by
Rocketknight1
Add generation config validation using Pydantic
#35910
opened Jan 27, 2025 by
Manalelaidouni
Fix Mask2Former Weight Initialization Issues #35877
#35904
opened Jan 27, 2025 by
sambhavnoobcoder
Introduce modular files for speech models
#35902
opened Jan 27, 2025 by
nikosanto13
1 of 5 tasks
Several fixes related to rotary position embeddings
#35901
opened Jan 27, 2025 by
mseeger
4 of 5 tasks
Fix Gradient Checkpointing for Deberta & Deberta-V2 using PEFT / Adapters
#35898
opened Jan 26, 2025 by
lenglaender
1 of 5 tasks
add shared experts for upcoming Granite 4.0 language models
#35894
opened Jan 26, 2025 by
mayank31398
Fix synced multi-GPU generation with LLMs and VLMs
#35893
opened Jan 26, 2025 by
ManukyanD
1 of 5 tasks
Iterative generation using Input embeds and static cache
#35890
opened Jan 25, 2025 by
yaswanth19
1 of 5 tasks