forked from huggingface/trl
-
Notifications
You must be signed in to change notification settings - Fork 0
Pull requests: Future-House/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Created New feature or request
GRPOTrainerWithEval subclass for different evaluation reward functions
enhancement
#9
opened Mar 10, 2025 by
jamesbraza
Member
Loading…
Apply attention mask when computing logprobs
#2
opened Feb 2, 2025 by
sidnarayanan
Collaborator
Loading…
Decoupling generation and loss batch sizes
#1
opened Feb 1, 2025 by
sidnarayanan
Collaborator
Loading…
ProTip!
Updated in the last three days: updated:>2026-06-24.