Conversation

@muellerzr
Contributor

What does this PR do?

This PR adds num_tokens_seen to the TrainerState, allowing users to know how many tokens were passed in an individual batch. It uses gather to ensure that this count is also correct under DDP.

Fixes #27027
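A minimal sketch of the idea described above (not the exact Trainer code): count the tokens of the batch's main input on each process, then reduce across processes so the count is correct under DDP. The function name `count_batch_tokens` and the use of `all_reduce` rather than `gather` are illustrative assumptions.

```python
import torch
import torch.distributed as dist

def count_batch_tokens(batch, main_input_name="input_ids"):
    """Count tokens in one batch; sum across processes if DDP is active.

    Sketch only: the real Trainer reads ``model.main_input_name`` to find
    the token tensor, then combines per-process counts so
    ``num_tokens_seen`` is accurate in distributed training.
    """
    tokens = torch.tensor(batch[main_input_name].numel(), dtype=torch.long)
    if dist.is_available() and dist.is_initialized():
        # Sum the per-process counts onto every rank.
        dist.all_reduce(tokens, op=dist.ReduceOp.SUM)
    return tokens.item()

# Single-process example: a batch of 4 sequences of length 128.
batch = {"input_ids": torch.zeros(4, 128, dtype=torch.long)}
print(count_batch_tokens(batch))  # 512
```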

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@pacman100 @amyeroberts

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Nov 3, 2023

The documentation is not available anymore as the PR was closed or merged.

@muellerzr
Contributor Author

@amyeroberts question: these failures are all because they cannot find the main_input_name in the batch. Does this mean some of the models are wrong?

For instance, one of the failures in the examples script comes from SpeechEncoderDecoderModel, which declares main_input_name as "inputs"; however, when running the script, the batch actually uses `"input_values"`:

self = {'input_values': tensor([[-0.0177, -0.0188, -0.0202,  ..., -0.0032, -0.0068, -0.0039],
        [-0.0196, -0.0556, -0.0...-100, -100, -100, -100, -100, -100, -100, -100, -100,
         -100, -100, -100, -100, -100, -100, -100, -100, -100]])}
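The failure mode above can be illustrated with a tiny hypothetical example: the model declares one main_input_name, but the collated batch uses a different key, so a direct lookup fails (the key names below mirror the SpeechEncoderDecoderModel case; the surrounding code is illustrative).

```python
# The model declares "inputs", but the collator produced "input_values".
main_input_name = "inputs"
batch = {"input_values": [[-0.0177, -0.0188]], "labels": [[-100, -100]]}

try:
    tokens = batch[main_input_name]
except KeyError:
    # The Trainer cannot locate the main input, so it cannot
    # count tokens for this batch.
    tokens = None

print(tokens is None)  # True
```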

Contributor

@amyeroberts amyeroberts left a comment


Thanks for adding this!

My main comment is about whether this should run by default.

@amyeroberts
Contributor

@muellerzr I think we don't have to worry about that if it's behind a flag in the training args. This way, if someone wants to measure it, they have to make sure that main_input_name is properly set, but it won't break the Trainer for models which are currently compatible but don't have this correctly set.
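The opt-in pattern suggested here can be sketched as follows. The flag name `include_num_input_tokens_seen` and the helper `training_step` are illustrative stand-ins, not necessarily the exact names that landed in TrainingArguments:

```python
from dataclasses import dataclass, field

@dataclass
class SketchTrainingArgs:
    # Hypothetical stand-in for the real TrainingArguments flag.
    # Counting is off by default, so models with an incorrect
    # main_input_name keep working unless the user opts in.
    include_num_input_tokens_seen: bool = field(default=False)

def training_step(args, state, batch, main_input_name="input_ids"):
    """Update the token count only when the user has opted in."""
    if args.include_num_input_tokens_seen:
        if main_input_name not in batch:
            raise ValueError(f"{main_input_name!r} not found in batch")
        state["num_tokens_seen"] += sum(len(row) for row in batch[main_input_name])
    return state

state = {"num_tokens_seen": 0}
batch = {"input_ids": [[1, 2, 3]]}
state = training_step(SketchTrainingArgs(), state, batch)
print(state["num_tokens_seen"])  # 0: the flag is off by default
```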

@muellerzr muellerzr requested a review from amyeroberts November 7, 2023 20:30
Contributor

@amyeroberts amyeroberts left a comment


Thanks for adding and iterating!

Just some nits on the implementation

@muellerzr
Contributor Author

@amyeroberts the failing tests seem to be unrelated.

Contributor

@amyeroberts amyeroberts left a comment


Thanks!

Co-authored-by: amyeroberts <[email protected]>
@muellerzr muellerzr merged commit 2fc33eb into main Nov 14, 2023
@muellerzr muellerzr deleted the tokens-seen branch November 14, 2023 20:31
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request Nov 19, 2023
* Add tokens seen

* Address comments, add to TrainingArgs

* Update log

* Apply suggestions from code review

Co-authored-by: amyeroberts <[email protected]>

* Use self.args

* Fix docstring

Co-authored-by: amyeroberts <[email protected]>

---------

Co-authored-by: amyeroberts <[email protected]>
Successfully merging this pull request may close these issues.

Count of tokens seen during training in Trainer