Apply attention mask when computing logprobs #2

Open
sidnarayanan wants to merge 1 commit into decouple-batch-sizes from attn-fix

Conversation

@sidnarayanan

Collaborator

This essentially ports the logic from huggingface#2708, adapted for microbatching. It also cleans up repeated code from kwargs.
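For readers without the diff at hand, here is a minimal sketch of what applying an attention mask while computing per-token logprobs in microbatches could look like. It is not the code from this PR or from huggingface#2708: `toy_forward`, `per_token_logps`, and the NumPy setup are hypothetical stand-ins for a real model forward pass, chosen only to show that the mask has to be sliced alongside `input_ids` for every microbatch.

```python
import numpy as np

VOCAB_SIZE = 8
# Fixed random embedding table so the toy "model" is deterministic.
_EMB = np.random.default_rng(0).normal(size=(VOCAB_SIZE, VOCAB_SIZE))


def toy_forward(input_ids, attention_mask):
    """Hypothetical stand-in for a model forward pass.

    Logits at position t are a causal running sum of the embeddings of
    all non-masked tokens up to and including t, so padding tokens would
    leak into the logits if the mask were not applied.
    """
    emb = _EMB[input_ids] * attention_mask[..., None]  # (B, T, V)
    return np.cumsum(emb, axis=1)


def log_softmax(x):
    # Numerically stable log-softmax over the vocabulary axis.
    shifted = x - x.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))


def per_token_logps(input_ids, attention_mask, micro_batch_size):
    """Log-probability of each next token, computed one microbatch at a time."""
    chunks = []
    for start in range(0, input_ids.shape[0], micro_batch_size):
        # Slice the mask together with the ids so they stay aligned.
        ids = input_ids[start:start + micro_batch_size]
        mask = attention_mask[start:start + micro_batch_size]
        logits = toy_forward(ids, mask)[:, :-1]  # position t predicts token t+1
        targets = ids[:, 1:]
        logps = np.take_along_axis(
            log_softmax(logits), targets[..., None], axis=-1
        ).squeeze(-1)
        # Zero the entries whose target token is padding.
        chunks.append(logps * mask[:, 1:])
    return np.concatenate(chunks, axis=0)


# Left-padded batch of three sequences; token id 0 is used as padding here.
input_ids = np.array([[0, 0, 3, 5], [1, 2, 4, 6], [0, 7, 2, 1]])
attention_mask = np.array([[0, 0, 1, 1], [1, 1, 1, 1], [0, 1, 1, 1]])
logps = per_token_logps(input_ids, attention_mask, micro_batch_size=2)
```

In this sketch the result does not depend on `micro_batch_size` or on which token id fills the padded slots, which is the property the mask is meant to guarantee.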

Keeping open for now until an experiment finishes.

2 participants