Skip to content

Conversation

@apivovarov
Copy link
Contributor

if cfg.is_fallback:
N, F, Y, X = get_const_tuple(conv.shape)
_fallback_schedule(N, F, Y, X)
# Workaround to make CUDA compilation work. Issue #4470
Copy link
Member

@vinx13 vinx13 Dec 6, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we still use the fallback for the other cases by checking the input params here?

Copy link
Contributor Author

@apivovarov apivovarov Dec 6, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I checked more kernel and strides combinations and found that the error happens when kernel is equal to strides, e.g.

# kernel and strides when compilation for CUDA fails
2x2 and (2,2)
3x3 and (3,3)
4x4 and (4,4)
5x5 and (5,5)
2x3 and (2,3)
3x2 and (3,2)
1x2 and (1x2)
etc

I also found that the compilation fails if output channel is 1

Copy link
Contributor Author

@apivovarov apivovarov Dec 6, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added kernel / strides check and skip _fallback_schedule when output channel is 1.
In other case It will run _fallback_schedule for kernel 1x1 or when kernel != strides

@apivovarov apivovarov force-pushed the fix_conv2d_transpose branch from f339b26 to 89056d4 Compare December 6, 2019 19:59
@apivovarov apivovarov force-pushed the fix_conv2d_transpose branch from 89056d4 to 10f4b18 Compare December 6, 2019 21:42
@vinx13 vinx13 merged commit 3fa8097 into apache:master Dec 8, 2019
zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Dec 13, 2019
alexgl-github pushed a commit to alexgl-github/tvm that referenced this pull request Jan 21, 2020
- combine pad and dilate;
- fix for the issue https://discuss.tvm.ai/t/compile-error-for-cuda-target/4164
- fix for the issue apache#4472
alexgl-github pushed a commit to alexgl-github/tvm that referenced this pull request Jan 21, 2020
- combine pad and dilate;
- fix for the issue https://discuss.tvm.ai/t/compile-error-for-cuda-target/4164
- fix for the issue apache#4472
vinx13 pushed a commit that referenced this pull request Jan 22, 2020
alexwong pushed a commit to alexwong/tvm that referenced this pull request Feb 26, 2020
alexwong pushed a commit to alexwong/tvm that referenced this pull request Feb 28, 2020
zhiics pushed a commit to neo-ai/tvm that referenced this pull request Mar 2, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants