[QUANTIZE] Refactor quantization codebase and fix model accuracy #3543

ZihengJiang · 2019-07-14T00:10:17Z

Separate the quantization code base into different files: partition.cc, annotate.cc, realize.cc
Change rewrite_for_vta into an extra partition pass and enable it by default
Change annotation.force_cast(x) to annotation.cast_hint(x, dtype)
Remove the qconfig.store_lowbit_output option and make that behavior the default
Fixed accuracy of models like mobilenet:
- resnet18_v1(8-16bit): 69.29%
- resnet18_v1(8-32bit): 69.29%
- resnet34_v1: 73.33%
- resnet50_v1: 74.78%
- resnet101_v1: 75.66%
- mobilenetv2_1.0: 66.64%
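
For readers unfamiliar with the "8-16bit" / "8-32bit" labels above, here is a minimal pure-Python sketch. It assumes the labels mean 8-bit quantized inputs and weights with a 16-bit or 32-bit integer type for the accumulated output. The helper names (`quantize`, `saturate`, `quantized_dot`), the shared power-of-two scale, and the saturating accumulation are all illustrative assumptions, not the code in this PR.

```python
def quantize(values, scale, nbit=8):
    """Map floats to signed nbit integers using one shared scale (hypothetical helper)."""
    qmax = 2 ** (nbit - 1) - 1
    qmin = -qmax - 1
    return [max(qmin, min(qmax, round(v / scale))) for v in values]


def saturate(x, nbit):
    """Clamp an integer to the signed nbit range."""
    qmax = 2 ** (nbit - 1) - 1
    return max(-qmax - 1, min(qmax, x))


def quantized_dot(xs, ws, x_scale, w_scale, acc_bits):
    """Dot product on 8-bit operands, accumulated in a signed acc_bits integer."""
    qx = quantize(xs, x_scale)
    qw = quantize(ws, w_scale)
    acc = 0
    for a, b in zip(qx, qw):
        # Saturating accumulation is an assumption made for this sketch.
        acc = saturate(acc + a * b, acc_bits)
    # Rescale the integer result back to the float domain.
    return acc * x_scale * w_scale


scale = 1 / 64  # power-of-two scale keeps the arithmetic exact in this example
small = quantized_dot([1.0, 0.5, 0.25], [0.5, 0.5, 1.0], scale, scale, acc_bits=16)
wide = quantized_dot([1.5] * 20, [1.5] * 20, scale, scale, acc_bits=32)
narrow = quantized_dot([1.5] * 20, [1.5] * 20, scale, scale, acc_bits=16)
```

In this toy setting the 16-bit accumulator only diverges from the 32-bit one when the running sum exceeds the 16-bit range. The two resnet18_v1 rows above report the same accuracy.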

cc @tqchen @eqy @vinx13 @tmoreau89

ZihengJiang · 2019-07-19T04:32:35Z

After discussing offline with Tianqi, we decided to build the nightly regression tests in a separate repo.

ZihengJiang · 2019-07-19T04:41:49Z

@tqchen @eqy @vinx13 @tmoreau89 Could you please help review this change?

python/tvm/relay/quantize/quantize.py

python/tvm/relay/quantize/_annotate.py

python/tvm/relay/quantize/_partition.py

eqy · 2019-07-19T16:35:23Z

@ZihengJiang Do you think we could try to get the calibration PR in first? I have to port it over to the new pass infra, and I think this PR is likely easier to replay on top of calibration than vice versa.

ZihengJiang · 2019-07-19T17:07:57Z

@eqy Do you mean this one? #3294
Sure, if you have time for it soon.

tmoreau89

LGTM

src/relay/pass/quantize/quantize.h

tqchen · 2019-08-02T15:53:33Z

@ZihengJiang please follow up now that #3538 is merged

tmoreau89 · 2019-08-07T01:41:36Z

What is the status on this PR? @tqchen @ZihengJiang

ZihengJiang · 2019-08-12T23:42:38Z

python/tvm/relay/quantize/_partition.py

+def add_partition_function(ref_call, new_args, ctx):
+    """Rewrite function for ewise add for partition"""
+    if 'cuda' in _target.current_target().keys:
+        #TODO(wuwei/ziheng) cuda specific rules


@vinx13 Since general devices and VTA either tolerate or require stop_fusion being inserted on both sides, let's use different rewrite rules for specific targets here.
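
The idea of target-specific rewrite rules can be sketched roughly as below. This is a toy, self-contained illustration and not the code in `_partition.py`: expressions are modeled as nested tuples, and `default_add_rule`, `cuda_add_rule`, `TARGET_RULES`, and `rewrite_add` are hypothetical names. The CUDA rule is still a TODO in the diff above, so the placeholder here simply leaves the operands untouched.

```python
def default_add_rule(lhs, rhs):
    """General devices and VTA: wrap both operands in stop_fusion."""
    return ("add", ("stop_fusion", lhs), ("stop_fusion", rhs))


def cuda_add_rule(lhs, rhs):
    """Placeholder only: the real CUDA-specific rule is a TODO in the PR."""
    return ("add", lhs, rhs)


# Hypothetical registry mapping a target key to its rewrite rule.
TARGET_RULES = {"cuda": cuda_add_rule}


def rewrite_add(lhs, rhs, target_keys):
    """Pick the first rule registered for any of the target's keys, else the default."""
    for key in target_keys:
        rule = TARGET_RULES.get(key)
        if rule is not None:
            return rule(lhs, rhs)
    return default_add_rule(lhs, rhs)


on_cuda = rewrite_add("a", "b", ["cuda", "gpu"])
on_cpu = rewrite_add("a", "b", ["cpu"])
```

Keeping the default rule as the fallback means a target without a registered rule gets the both-sides stop_fusion behavior described in the comment.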

tests/python/nightly/quantization/test_quantization_accuracy.py

ZihengJiang · 2019-08-15T09:31:23Z

@mingwayzhang those links should be helpful:


ZihengJiang changed the title ~~[WIP] Add nightly quantization regression tests~~ [QUANTIZE] Refactor codebase, fix accuracy, add nightly regression tests Jul 19, 2019

ZihengJiang changed the title ~~[QUANTIZE] Refactor codebase, fix accuracy, add nightly regression tests~~ [QUANTIZE] Refactor codebase and fix accuracy Jul 19, 2019

ZihengJiang changed the title ~~[QUANTIZE] Refactor codebase and fix accuracy~~ [QUANTIZE] Refactor quantization codebase and fix model accuracy Jul 19, 2019

ZihengJiang added the status: need review label Jul 19, 2019

vinx13 requested changes Jul 19, 2019


vinx13 mentioned this pull request Jul 22, 2019

[Relay][Quantization] KL-divergence-based per-layer calibration #3538

Merged

tmoreau89 approved these changes Jul 23, 2019


src/relay/pass/quantize/quantize.h (outdated, resolved)

tmoreau89 mentioned this pull request Aug 9, 2019

[VTA][Relay] Extending Vision model coverage compilation for VTA #3740

Merged

ZihengJiang force-pushed the quantize-lr branch from c01618d to c685007 Compare August 12, 2019 21:36

Refactor.

46c9667

ZihengJiang force-pushed the quantize-lr branch from 481c4fa to 46c9667 Compare August 12, 2019 21:43

update

c68f087

ZihengJiang commented Aug 12, 2019


ZihengJiang mentioned this pull request Aug 13, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy #3762

Closed

ZihengJiang added 5 commits August 12, 2019 19:10

update

0c53a16

update

6dc23aa

update

15ee364

update

8bcfc40

Merge branch 'master' of github.com:dmlc/tvm into dev

d6b9381

vinx13 approved these changes Aug 15, 2019


tests/python/nightly/quantization/test_quantization_accuracy.py (outdated, resolved)

update

4cac1f9

ZihengJiang merged commit 7eb1f35 into apache:master Aug 15, 2019

ZihengJiang deleted the quantize-lr branch August 15, 2019 09:31

vinx13 mentioned this pull request Aug 16, 2019

[Relay][Quantization] Fix out-of-date realize #3790

Merged

wweic pushed a commit to neo-ai/tvm that referenced this pull request Aug 16, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy (apa…

d537774

…che#3543) * Refactor. * update * update * update * update * update * update

anijain2305 pushed a commit to anijain2305/tvm that referenced this pull request Aug 22, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy (apa…

addf4b4

…che#3543) * Refactor. * update * update * update * update * update * update

wweic pushed a commit to neo-ai/tvm that referenced this pull request Sep 6, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy (apa…

c4b9fbc

…che#3543) * Refactor. * update * update * update * update * update * update

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed
