🤠 Add Hifigan vocoder #387
Conversation
LGTM
looks good
@machineko @ZDisket can you guys try training around 2k steps to verify that it works :))) I don't have a GPU to test right now :))). There are so many differences between my private library and this open-source one :v
Same here, no GPU available at the moment, but I will test in 2-3 days 📦
@dathudeptrai Did you want to use config v2 or v1 for it?
I could train 4k steps and counting with the v1 config and mixed precision without problems. I even got eval samples at 5k.
v2, for faster training :D
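(For context, a rough comparison of the two generator configs, with values assumed from `config_v1.json` and `config_v2.json` in the official jik876/hifi-gan repo; this PR's YAMLs may differ.)

```python
# Assumed from the official HiFi-GAN configs, for illustration only.
config_v1 = {"upsample_initial_channel": 512, "resblock_kernel_sizes": [3, 7, 11]}
config_v2 = {"upsample_initial_channel": 128, "resblock_kernel_sizes": [3, 7, 11]}
# v2 starts the upsampling stack at 128 channels instead of 512, so the
# generator is much smaller and each training step is noticeably faster.
```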
Is the loss OK? Can you try to continue training both G and D for around 1k more steps :D
@dathudeptrai For some reason the loss exploded after 10k and the eval samples are either noise or silence, although I think it's just because of the small dataset. Going to restart training and train the discriminator from 0 steps.
Completed 2k steps of G+D starting from 0 steps. No problems.
@ZDisket Could you share your HiFi-GAN TensorBoard like this?
@EmreOzkose The TensorFlowTTS implementation is not faithful to the original when it comes to the optimizer. The official implementation uses the AdamW optimizer with ExponentialLR, while the one in this repo uses Adam with PiecewiseConstantDecay. Also, there is no generator pretraining in the original.
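For readers comparing the two setups, here is a minimal Python sketch of the difference described above. The learning rates, betas, decay factor, and schedule boundaries are illustrative assumptions, not values taken from either repository, and the tiny placeholder module only exists so the snippet runs.

```python
import tensorflow as tf
import torch

# Placeholder "generator" so the PyTorch optimizer has parameters to wrap.
torch_gen = torch.nn.Conv1d(80, 512, kernel_size=7, padding=3)

# Official HiFi-GAN style (PyTorch): AdamW with exponential LR decay.
optim_g = torch.optim.AdamW(torch_gen.parameters(), lr=2e-4, betas=(0.8, 0.99))
sched_g = torch.optim.lr_scheduler.ExponentialLR(optim_g, gamma=0.999)

# TensorFlowTTS style (per the comment above): Adam with a piecewise-constant
# LR schedule; boundaries and values here are purely illustrative.
lr_schedule = tf.keras.optimizers.schedules.PiecewiseConstantDecay(
    boundaries=[100_000, 200_000, 300_000],
    values=[5e-4, 2.5e-4, 1.25e-4, 6.25e-5],
)
optim_tf = tf.keras.optimizers.Adam(learning_rate=lr_schedule)
```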
I am checking it out, thank you @ZDisket.
I tried training with the same setup and got signals that are clearly different from noise. Thank you 😃
@EmreOzkose Any updates? Did it do well? |
This PR is an implementation of the HiFi-GAN vocoder (https://arxiv.org/abs/2010.05646). The training process follows melgan_stft. The model logic follows the original HiFi-GAN PyTorch code (https://github.com/jik876/hifi-gan).
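To make the architecture concrete, below is a minimal Keras sketch of a HiFi-GAN-style generator: transposed-convolution upsampling with multi-receptive-field fusion of dilated residual blocks. The hyper-parameters are the v1 values from the paper (upsample rates 8,8,2,2; kernels 16,16,4,4; 512 initial channels; ResBlock kernels 3,7,11), and the residual block is simplified; this is not the code added by this PR.

```python
import tensorflow as tf


def res_block(x, channels, kernel_size, dilations=(1, 3, 5)):
    """Simplified dilated residual block: one conv per dilation."""
    for d in dilations:
        y = tf.keras.layers.LeakyReLU(0.1)(x)
        y = tf.keras.layers.Conv1D(channels, kernel_size,
                                   dilation_rate=d, padding="same")(y)
        x = tf.keras.layers.Add()([x, y])
    return x


def build_generator(num_mels=80,
                    upsample_rates=(8, 8, 2, 2),
                    upsample_kernel_sizes=(16, 16, 4, 4),
                    initial_channels=512,
                    resblock_kernel_sizes=(3, 7, 11)):
    mel = tf.keras.Input(shape=(None, num_mels))  # (batch, frames, mels)
    x = tf.keras.layers.Conv1D(initial_channels, 7, padding="same")(mel)

    channels = initial_channels
    for rate, kernel in zip(upsample_rates, upsample_kernel_sizes):
        channels //= 2
        x = tf.keras.layers.LeakyReLU(0.1)(x)
        # Upsample in time with a transposed convolution.
        x = tf.keras.layers.Conv1DTranspose(channels, kernel,
                                            strides=rate, padding="same")(x)
        # Multi-receptive-field fusion: average parallel ResBlocks
        # with different kernel sizes.
        blocks = [res_block(x, channels, k) for k in resblock_kernel_sizes]
        x = tf.keras.layers.Average()(blocks)

    x = tf.keras.layers.LeakyReLU(0.1)(x)
    wav = tf.keras.layers.Conv1D(1, 7, padding="same", activation="tanh")(x)
    return tf.keras.Model(mel, wav, name="hifigan_generator_sketch")


model = build_generator()
# 80-dim mel frames -> waveform, 8*8*2*2 = 256 samples per frame.
print(model.output_shape)
```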