[microNPU] Cascader performance model bugfixes #10510

jacobbohlin · 2022-03-07T11:46:59Z

This PR fixes some bugs related to the microNPU performance model in the cascader.

Fixed incorrect num_blocks calculations for both BufferModes.
Fixed similar issues with Read/Write byte calculations.
Fixed an issue where the 'partkernel' flag was not propagated to the performance estimation code.
Fixed the single buffering check, which incorrectly used the output shape and block rather than the input shape and block.
Fixed the block config not being aligned to the micro block for Elementwise.
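
To make the block-count and alignment fixes above concrete, here is a minimal sketch of the arithmetic involved. The helper names `round_up` and `num_blocks` and the example shapes are assumptions made for illustration; they are not taken from the cascader source, which may compute these values differently.

```python
import math


def round_up(value: int, multiple: int) -> int:
    """Round `value` up to the nearest multiple of `multiple`."""
    return ((value + multiple - 1) // multiple) * multiple


def num_blocks(shape, block) -> int:
    """Number of blocks needed to cover `shape`, using ceiling division per axis.

    Hypothetical helper: a partial block at the edge of a tensor still
    counts as one whole block.
    """
    return math.prod(-(-s // b) for s, b in zip(shape, block))


# Align a block height of 5 to an assumed micro block height of 2.
aligned_height = round_up(5, 2)

# Cover an (H, W, C) = (5, 20, 40) tensor with (6, 8, 16) blocks.
blocks = num_blocks((5, 20, 40), (aligned_height, 8, 16))
```

With floor division instead of ceiling division, the partial blocks along width and depth would be dropped and the estimate would undercount the work.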

This PR builds on and includes the changes in #10508 and will remain as a draft until that is merged.

jacobbohlin · 2022-03-07T11:49:01Z

@manupa-arm @ekalda @lhutton1 @dchauhan-arm @NicolaLancellotti

ekalda

Thanks @jacobbohlin, one comment, otherwise looks good! :)

ekalda · 2022-03-30T12:57:15Z

python/tvm/contrib/ethosu/cascader/device_config.py

-                min(output_shape[2] * output_shape[4], self._max_block_shape.depth),
-                min(output_shape[3], self._max_block_shape.width),
+                _round_up(min(output_shape[1], max_height), self._micro_block.height),
+                min(output_shape[2] * output_shape[4], max_width),


Probably shouldn't take min with max_width

* Fixed incorrect num_blocks calculations for both BufferModes.
* Fixed similar issues with Read/Write byte calculations.
* Fixed an issue where the 'partkernel' flag was not propagated to the performance estimation code.
* Fixed single buffering check incorrectly used output shape and block rather than the input shape and block.
* Fixed block config not aligned to micro block for Elementwise.

Change-Id: Ide6b231bc1a17c65bed20129d2179a215ada14b2

Changed incorrect usage of 'max_width' to 'max_depth'.
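
A minimal sketch of what the review comment points at, assuming the indexing used in the diff above: `output_shape[1]` is the height, `output_shape[3]` is the width, and `output_shape[2] * output_shape[4]` is the depth. The function name and its parameters are hypothetical and only mirror the two lines shown in the diff; this is not the actual `device_config.py` code.

```python
def _round_up(value: int, multiple: int) -> int:
    """Round `value` up to the nearest multiple of `multiple`."""
    return ((value + multiple - 1) // multiple) * multiple


def sketch_block_height_depth(output_shape, max_height, max_depth, micro_block_height):
    """Hypothetical helper returning (block_height, block_depth).

    Assumes a 5-element shape where index 1 is height and the product of
    indices 2 and 4 is depth, as in the diff under review.
    """
    # Height is clamped to the maximum, then aligned to the micro block.
    height = _round_up(min(output_shape[1], max_height), micro_block_height)
    # Depth is clamped against max_depth, not max_width: the quantity being
    # limited here is a depth.
    depth = min(output_shape[2] * output_shape[4], max_depth)
    return height, depth


# Example: height 5, depth 4 * 16 = 64, width 20.
result = sketch_block_height_depth((1, 5, 4, 20, 16), max_height=8, max_depth=32, micro_block_height=2)
```

Clamping the depth against a width limit would silently mix two unrelated axes, which is why the limit was changed to `max_depth`.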

ekalda

LGTM! :)

manupak

LGTM!

manupak · 2022-04-20T09:11:08Z

Thanks @jacobbohlin @ekalda !

jacobbohlin mentioned this pull request Mar 7, 2022

[microNPU] Cascader performance model bugfixes #10386

Closed

jacobbohlin force-pushed the cascader-perf-model-fixes branch from 3a54cf7 to 8745836 Compare March 18, 2022 14:58

jacobbohlin marked this pull request as ready for review March 18, 2022 14:59

ekalda reviewed Mar 30, 2022

jacobbohlin added 2 commits April 13, 2022 10:42

Address review comment

07e7603

Changed incorrect usage of 'max_width' to 'max_depth'.

jacobbohlin force-pushed the cascader-perf-model-fixes branch 2 times, most recently from 3a54cf7 to 07e7603 Compare April 13, 2022 08:50

ekalda approved these changes Apr 13, 2022

manupak approved these changes Apr 20, 2022

manupak merged commit 0b95780 into apache:main Apr 20, 2022
