Skip to content

Conversation

@sshleifer
Copy link
Contributor

@sshleifer sshleifer commented Mar 24, 2020

  • conversion script can now take a path, which is required since this model is not on torch.hub. Finetuning with fairseq and then converting to huggingface should work. I also cleaned it up a bit
  • Config in S3 is already updated with author-recommended generation parameters:
    (num_beams=6, length_penalty=1., min_length=11, max_length=62

Context:
These weights are from bart finetuned on the XSum abstractive summarization challenge, which encourages shorter (more abstractive) summaries. It achieves state of the art.

Discussion:

  • I propose changing the SummarizationPipeline default to this model in a separate PR, since the summarizations are shorter (and high quality)!

@codecov-io
Copy link

codecov-io commented Mar 24, 2020

Codecov Report

Merging #3422 into master will decrease coverage by 0.00%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #3422      +/-   ##
==========================================
- Coverage   77.80%   77.79%   -0.01%     
==========================================
  Files         100      100              
  Lines       17051    17051              
==========================================
- Hits        13266    13265       -1     
- Misses       3785     3786       +1     
Impacted Files Coverage Δ
src/transformers/configuration_bart.py 100.00% <ø> (ø)
src/transformers/modeling_bart.py 97.58% <ø> (ø)
src/transformers/tokenization_bart.py 100.00% <100.00%> (ø)
src/transformers/modeling_utils.py 91.81% <0.00%> (-0.14%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 17dceae...71fcbc9. Read the comment docs.

base_model_prefix = "model"

def __init__(self, config: BartConfig):
def __init__(self, config: BartConfig, base_model=None):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is base_model used for?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unused, should be deleted.

@sshleifer sshleifer merged commit f6a23d1 into huggingface:master Mar 29, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants