Conversation

@isaac-chung
Contributor

@isaac-chung isaac-chung commented Oct 8, 2023

What does this PR do?

Fixes #26672

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@isaac-chung isaac-chung changed the title add early stopping logits processor Add early stopping logits processor Oct 8, 2023
@isaac-chung isaac-chung changed the title Add early stopping logits processor Add early stopping for Bark generation Oct 8, 2023
@isaac-chung isaac-chung changed the title Add early stopping for Bark generation Add early stopping for Bark generation via logits processor Oct 8, 2023
@isaac-chung isaac-chung marked this pull request as ready for review October 9, 2023 14:08
@isaac-chung
Contributor Author

@ylacombe maybe we can continue the conversation here.

Contributor

@ylacombe ylacombe left a comment

Hi @isaac-chung, thanks for the quick PR and the good work!

I left a few comments here, let me know if you still have questions!

Other than that, for testing, I would add a test_XXXX in LogitsProcessorTest which first checks that the new logits processor behaves as expected with a hand-made example.
Ideally, we'd have another test on BarkSemanticModelTest, but I'm not sure how to proceed yet.
Do you have any ideas?
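
For concreteness, a hand-made check of that kind could look roughly like the sketch below. It assumes the processor ends up with an (eos_token_id, min_eos_p) constructor, exposed as BarkEosPrioritizerLogitsProcessor in transformers.generation.logits_process (the name the merged code settled on, if I recall correctly), and that it forces EOS by masking every other token to -inf once the EOS softmax probability exceeds min_eos_p:

    # Sketch only -- the class name and exact behavior are assumptions, not the final test.
    import torch

    from transformers.generation.logits_process import BarkEosPrioritizerLogitsProcessor


    def test_early_stop_processor():
        eos_token_id = 0
        min_eos_p = 0.1
        processor = BarkEosPrioritizerLogitsProcessor(eos_token_id=eos_token_id, min_eos_p=min_eos_p)

        input_ids = torch.zeros((2, 1), dtype=torch.long)
        # hand-made scores: row 0 gives EOS a high probability, row 1 a negligible one
        scores = torch.tensor(
            [
                [5.0, 0.0, 0.0, 0.0],   # softmax(EOS) ~ 0.98 > min_eos_p -> force EOS
                [-5.0, 0.0, 0.0, 0.0],  # softmax(EOS) ~ 0.002 < min_eos_p -> leave untouched
            ]
        )

        out = processor(input_ids, scores)

        # row 0: every non-EOS logit should be masked, so decoding picks EOS next
        assert torch.isinf(out[0, 1:]).all() and (out[0, 1:] < 0).all()
        # row 1: scores should pass through unchanged
        assert torch.equal(out[1], scores[1])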

@isaac-chung
Contributor Author

isaac-chung commented Oct 10, 2023

Ideally, we'd have another test on BarkSemanticModelTest, but I'm not sure how to proceed yet.
Do you have any ideas?

I'm not entirely sure. Maybe we could assert outputs from self.model.generate with the new arg somehow?

could be possibly passed to BarkModel.generate kwargs without causing issues

To confirm that we support this, maybe we should add to BarkModelIntegrationTests.test_generate_end_to_end_with_sub_models_args as well?
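
As a rough illustration of what that end-to-end call might look like (the checkpoint, the bare min_eos_p kwarg, and how it is routed to the semantic sub-model are assumptions the integration test would need to confirm):

    import torch

    from transformers import AutoProcessor, BarkModel

    processor = AutoProcessor.from_pretrained("suno/bark-small")
    model = BarkModel.from_pretrained("suno/bark-small")

    inputs = processor("Hey, it's a test.", voice_preset="v2/en_speaker_6")

    with torch.no_grad():
        # min_eos_p should reach the semantic sub-model's generation config
        # and end the semantic step early once EOS becomes probable enough
        audio = model.generate(**inputs, min_eos_p=0.2)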

@ylacombe
Contributor

Let's try to do both!

@isaac-chung
Contributor Author

I think I managed to add to BarkModelIntegrationTests without issues. But I'd like to align on how to proceed with BarkSemanticModelTest. Specifically:

  1. Only a few tests assert the outputs. As I'm unsure what to expect, I might print the outputs and assert those
  2. I've been manually trying to fill in BarkSemanticGenerationConfig so that the generate() call does not fail. Not sure if there's a more efficient way.

Contributor

@ylacombe ylacombe left a comment

Hey @isaac-chung, I've addressed point 2 in the comments below! I'm not sure I understand point 1, though. Could you expand on it a bit?
Thanks!

@isaac-chung
Contributor Author

@ylacombe thanks! Regarding point 1, take BarkModelIntegrationTests.test_generate_end_to_end_with_sub_models_args as an example: the test does not assert any outputs and simply runs .generate(). Would that be fine here?

@ylacombe
Contributor

Let's try to find a case where the semantic model has to stop. You can take inspiration from this test:

    def test_generate_semantic(self):
        input_ids = self.inputs

        # fmt: off
        # check first ids
        expected_output_ids = [7363, 321, 41, 1461, 6915, 952, 326, 41, 41, 927,]
        # fmt: on

        # greedy decoding
        with torch.no_grad():
            output_ids = self.model.semantic.generate(
                **input_ids,
                do_sample=False,
                temperature=1.0,
                semantic_generation_config=self.semantic_generation_config,
            )
        self.assertListEqual(output_ids[0, : len(expected_output_ids)].tolist(), expected_output_ids)

So basically, an example where, with the same seed, the last output tokens are different. Do you think that's possible?

@isaac-chung
Contributor Author

If we set min_eos_p to anything that's non-zero, we only get the eos_token (set to 10000 for open-end generation). Here is what passed.

    @slow
    def test_generate_semantic_early_stop(self):
        input_ids = self.inputs

        # fmt: off
        # check first ids
        expected_output_ids = [10000]
        # fmt: on

        self.semantic_generation_config.min_eos_p = 0.05

        # greedy decoding
        with torch.no_grad():
            output_ids = self.model.semantic.generate(
                **input_ids,
                do_sample=False,
                temperature=1.0,
                semantic_generation_config=self.semantic_generation_config,
            )

        self.assertListEqual(output_ids[0, : len(expected_output_ids)].tolist(), expected_output_ids)

Is that what you have in mind?

@ylacombe
Contributor

Oh, that seems weird. Have you tried another generation strategy (i.e. do_sample=True, temperature=...)? If you get the same results, it's probably on the logits processor side!
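
For reference, the per-sample gating the processor is expected to implement would look roughly like the sketch below (an assumed outline of the intended logic, not this PR's code): EOS should only be forced for rows where its softmax probability actually exceeds min_eos_p, and all other rows should pass through untouched.

    import torch


    def force_eos_if_probable(scores: torch.FloatTensor, eos_token_id: int, min_eos_p: float) -> torch.FloatTensor:
        probs = torch.nn.functional.softmax(scores.float(), dim=-1)

        # candidate scores that keep only the EOS logit, everything else masked to -inf
        early_stop_scores = torch.full_like(scores, float("-inf"))
        early_stop_scores[:, eos_token_id] = scores[:, eos_token_id]

        # per-sample condition, so one sequence stopping early does not affect the others
        do_early_stop = (probs[:, eos_token_id] > min_eos_p).unsqueeze(-1)
        return torch.where(do_early_stop, early_stop_scores, scores)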

@ylacombe
Contributor

Regarding using a stopping criterion, I don't think it's possible at the moment. Quoting #26672:

@ArthurZucker
Collaborator

It receives None because output_scores and return_dict are not properly set

@ylacombe
Contributor

Yes, of course, but don't you think users should have the liberty to set output_scores and return_dict as they want?

@ArthurZucker
Collaborator

For sure. So the goal here is to always stop early by default? (Actually, not returning the scores might be better in terms of memory?)
What I mean is that the stopping criteria are meant to be used that way 😉

@ylacombe
Contributor

For sure. So the goal here is to always stop early by default? (Actually, not returning the scores might be better in terms of memory?) What I mean is that the stopping criteria are meant to be used that way 😉

Yes, this is the goal here. Totally agree on the stopping criteria usage! Actually, I haven't found a stopping criterion that uses scores yet, maybe because of the limitation of having to use return_dict_in_generate=True, output_scores=True. #23674 is a discussion on this, and I believe it is on @gante's radar! What do you recommend in the meantime?

@isaac-chung
Contributor Author

Hey @ylacombe / @ArthurZucker, please let me know if there's anything else I can do to move this PR forward.

Member

@gante gante left a comment

A few nits. Other than that, looks good to me! Thank you for working on it 💪

Member

@gante gante left a comment

Thank you for iterating 💛

Contributor

@ylacombe ylacombe left a comment

Thanks for iterating here, @isaac-chung!
@ArthurZucker, could you make a final review?

Two last requests on my side:

  1. Are all the Bark integration tests passing? Could you make sure they are?
  2. At the risk of repeating myself, we still need a test to make sure that generated ids with min_eos_p > 0 are shorter than generated ids without it; a rough sketch follows below.
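
A rough sketch of that length comparison, assuming the test has access to self.inputs, self.model and self.semantic_generation_config as in test_generate_semantic (this is my own phrasing, not the test that was eventually merged):

    with torch.no_grad():
        # baseline: no early stopping
        self.semantic_generation_config.min_eos_p = None
        output_ids_without = self.model.semantic.generate(
            **self.inputs,
            do_sample=False,
            temperature=1.0,
            semantic_generation_config=self.semantic_generation_config,
        )

        # early stopping enabled
        self.semantic_generation_config.min_eos_p = 0.01
        output_ids_with = self.model.semantic.generate(
            **self.inputs,
            do_sample=False,
            temperature=1.0,
            semantic_generation_config=self.semantic_generation_config,
        )

    # the early-stopped sequence should be strictly shorter
    self.assertLess(output_ids_with.shape[-1], output_ids_without.shape[-1])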

@gante
Member

gante commented Oct 25, 2023

btw, regarding it being a logits processor vs. a stopping criterion: it is my impression that we want to generate an EOS token under the conditions defined here. Since we want to generate a token, it has to be a logits processor.

(the main difference between them is that a stopping criterion stops generation right away and doesn't add any new token -- for batched generation, this can make a big difference)
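
To make that contrast concrete, a stopping-criteria-based version would look roughly like the hypothetical sketch below: it needs return_dict_in_generate=True and output_scores=True just to receive scores (the limitation discussed above), it halts generation without emitting an EOS token, and its single boolean return stops the whole batch at once rather than per sample.

    # Hypothetical, for comparison only -- not the approach taken in this PR.
    import torch

    from transformers import StoppingCriteria


    class MinEosPStoppingCriteria(StoppingCriteria):
        def __init__(self, eos_token_id: int, min_eos_p: float):
            self.eos_token_id = eos_token_id
            self.min_eos_p = min_eos_p

        def __call__(self, input_ids: torch.LongTensor, scores, **kwargs) -> bool:
            # depending on the decoding loop, `scores` may be a per-step tuple
            last_scores = scores[-1] if isinstance(scores, (tuple, list)) else scores
            probs = torch.nn.functional.softmax(last_scores.float(), dim=-1)
            # a single bool: stops every sequence in the batch at the same time
            return bool((probs[:, self.eos_token_id] >= self.min_eos_p).all())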

Collaborator

@ArthurZucker ArthurZucker left a comment

LGTM, let's just keep camel case and address the comments from @ylacombe!

@isaac-chung
Contributor Author

@ylacombe I've run this command and all tests are passing ✅

RUN_SLOW=yes python -m unittest tests.models.bark.test_modeling_bark.BarkModelIntegrationTests

@ylacombe
Contributor

LGTM! Let's wait for all the checks to pass and then merge! Thanks for the great work here and all the iterations!

@isaac-chung
Contributor Author

Thank you all again for your guidance and patience 🙏 much appreciated.

@gante gante merged commit e2bffcf into huggingface:main Oct 27, 2023
@isaac-chung isaac-chung deleted the improve-bark-generation branch October 27, 2023 10:10
EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request Nov 19, 2023
…ace#26675)

* add early stopping logits processor

* black formatted

* indent

* follow method signature

* actual logic

* check for None

* address comments on docstrings and method signature

* add unit test under `LogitsProcessorTest` wip

* unit test passing

* black formatted

* condition per sample

* add to BarkModelIntegrationTests

* wip BarkSemanticModelTest

* rename and add to kwargs handling

* not add to BarkSemanticModelTest

* correct logic and assert last outputs tokens different in test

* doc-builder style

* read from kwargs as well

* assert len of with less than that of without

* ruff

* add back seed and test case

* add original impl default suggestion

* doc-builder

* rename and use softmax

* switch back to LogitsProcessor and update docs wording

* camelCase and spelling and saving compute

* assert strictly less than

* assert less than

* expand test_generate_semantic_early_stop instead