[BART] add bart-large-xsum weights #3422

sshleifer · 2020-03-24T22:32:45Z

The conversion script can now take a local checkpoint path, which is required because this model is not on torch.hub. Fine-tuning with fairseq and then converting to the Hugging Face format should work. I also cleaned the script up a bit.
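
A minimal sketch of the path-versus-hub decision described above, assuming the script tells the two cases apart by checking whether the argument exists on disk. The function name `checkpoint_source` is hypothetical and not taken from the conversion script; the actual loading calls are left out.

```python
import os
import tempfile


def checkpoint_source(name_or_path: str) -> str:
    """Classify the argument as a local checkpoint path or a hub model name.

    Hypothetical helper: anything that exists on disk is treated as a local
    checkpoint, everything else is assumed to be a torch.hub model name.
    """
    return "local" if os.path.exists(name_or_path) else "hub"


if __name__ == "__main__":
    # A file that exists is classified as a local checkpoint.
    with tempfile.NamedTemporaryFile(suffix=".pt") as handle:
        print(checkpoint_source(handle.name))
    # A name with no matching file falls through to the hub branch.
    print(checkpoint_source("bart.large.cnn"))
```
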
The config on S3 is already updated with the author-recommended generation parameters:
(num_beams=6, length_penalty=1.0, min_length=11, max_length=62)
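
To illustrate what storing these values in the config buys callers, here is a small sketch of a fallback pattern: values passed at call time win, and anything left as `None` falls back to the config. The names `XSUM_CONFIG_DEFAULTS` and `resolve_generation_kwargs` are hypothetical and only stand in for the library's own handling, which this PR does not show.

```python
from typing import Dict, Optional, Union

Number = Union[int, float]

# Generation parameters quoted in this PR description.
XSUM_CONFIG_DEFAULTS: Dict[str, Number] = {
    "num_beams": 6,
    "length_penalty": 1.0,
    "min_length": 11,
    "max_length": 62,
}


def resolve_generation_kwargs(
    config_defaults: Dict[str, Number], **overrides: Optional[Number]
) -> Dict[str, Number]:
    """Return the config defaults, replaced by every override that is not None."""
    unknown = set(overrides) - set(config_defaults)
    if unknown:
        raise KeyError(f"unknown generation parameters: {sorted(unknown)}")
    resolved = dict(config_defaults)  # copy so the defaults stay untouched
    for name, value in overrides.items():
        if value is not None:
            resolved[name] = value
    return resolved


if __name__ == "__main__":
    # No overrides: the config values are used as-is.
    print(resolve_generation_kwargs(XSUM_CONFIG_DEFAULTS))
    # num_beams is overridden; max_length=None falls back to the config value.
    print(resolve_generation_kwargs(XSUM_CONFIG_DEFAULTS, num_beams=2, max_length=None))
```

With this pattern, a caller who passes nothing gets the recommended settings, while an explicit argument still takes precedence.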

Context:
These weights are from BART fine-tuned on the XSum abstractive summarization dataset, which encourages shorter (more abstractive) summaries. The fine-tuned model is reported to achieve state-of-the-art results on XSum.

Discussion:

I propose changing the SummarizationPipeline default to this model in a separate PR, since the summarizations are shorter (and high quality)!

codecov-io · 2020-03-24T22:41:11Z

Codecov Report

Merging #3422 into master will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #3422      +/-   ##
==========================================
- Coverage   77.80%   77.79%   -0.01%     
==========================================
  Files         100      100              
  Lines       17051    17051              
==========================================
- Hits        13266    13265       -1     
- Misses       3785     3786       +1

| Impacted Files | Coverage Δ | |
|---|---|---|
| src/transformers/configuration_bart.py | `100.00% <ø> (ø)` | |
| src/transformers/modeling_bart.py | `97.58% <ø> (ø)` | |
| src/transformers/tokenization_bart.py | `100.00% <100.00%> (ø)` | |
| src/transformers/modeling_utils.py | `91.81% <0.00%> (-0.14%)` | ⬇️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 17dceae...71fcbc9.

thomwolf · 2020-03-26T15:59:35Z

src/transformers/modeling_bart.py

    base_model_prefix = "model"

-    def __init__(self, config: BartConfig):
+    def __init__(self, config: BartConfig, base_model=None):


What is base_model used for?

sshleifer · 2020-03-26

Unused, should be deleted.
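
A leftover parameter like this can be spotted mechanically. The sketch below uses the standard-library `ast` module to list parameters that a function never references. The helper `unused_parameters` and the class `ExampleModel` are hypothetical illustrations and are not part of this PR or of the library.

```python
import ast
from typing import List

# Hypothetical stand-in for a constructor with a leftover argument.
SAMPLE = '''
class ExampleModel:
    def __init__(self, config, base_model=None):
        self.config = config
'''


def unused_parameters(source: str, func_name: str) -> List[str]:
    """Return parameters of the first function named func_name that it never references."""
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and node.name == func_name:
            params = [a.arg for a in node.args.args + node.args.kwonlyargs]
            # Every bare name that appears anywhere inside the function.
            used = {n.id for n in ast.walk(node) if isinstance(n, ast.Name)}
            return [p for p in params if p not in used and p != "self"]
    raise ValueError(f"function {func_name!r} not found")


if __name__ == "__main__":
    print(unused_parameters(SAMPLE, "__init__"))
```

This is a rough check only: it does not follow `*args`, `**kwargs`, or names reached through `locals()`.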

sshleifer added 7 commits March 18, 2020 02:01

Very broken ckpt

761f9f8

Merge branch 'master' into bart-large-xsum

dacdae5

Merge branch 'master' into bart-large-xsum

1630fa6

style

e46a3f0

config passing

d3b8ba5

assert failing

4db1135

style

ee72ca5

sshleifer requested review from LysandreJik, julien-c and thomwolf March 24, 2020 22:33

thomwolf reviewed Mar 26, 2020

sshleifer added 2 commits March 26, 2020 12:30

Merge branch 'master' into bart-large-xsum

adaecc9

delete unused base_model

774246a

thomwolf approved these changes Mar 26, 2020

merged master

71fcbc9

sshleifer merged commit f6a23d1 into huggingface:master Mar 29, 2020
