[Metal] Dispatch numerically stable tanh for metal #16438

CharlieFRuan · 2024-01-19T19:48:20Z

Prior to this PR, tanh(x)returns NaN on metal when x > 45.0.

Metal's built-in tanh is implemented as (t - 1.0) / (t + 1.0), where t = exp(2.0 * x). Hence for large x, t becomes inf, causing tanh(x) to be NaN.

A numerically stable tanh is implemented for llvm, this PR lifts it to src/target/intrin_rule.cc and apply the same rule for metal as well.

CharlieFRuan · 2024-01-19T19:48:34Z

cc @MasterJH5574 @tqchen

CharlieFRuan · 2024-01-19T23:50:53Z

Observed the same issue on webgpu as well; updated to fix.

junrushao

Thanks! This looks great!

This PR updates `debug_intermediate.py` that uses instrumenting to debug a compiled model library, adding some comments on the script and making it SLM-model compatible. To run: ``` python tests/legacy-python/dump_intermediate.py --model dist/phi-2-q4f16_1-MLC --model-lib-path dist/phi-2_q4f16_1-cuda.so ``` While we can use JIT to print intermediate values for debugging, as of now it only supports cpu. Some issues are platform-specific, and this script could still come in handy (e.g. helped solve apache/tvm#16438, #1638).

This PR updates `debug_intermediate.py` that uses instrumenting to debug a compiled model library, adding some comments on the script and making it SLM-model compatible. To run: ``` python tests/legacy-python/dump_intermediate.py --model dist/phi-2-q4f16_1-MLC --model-lib-path dist/phi-2_q4f16_1-cuda.so ``` While we can use JIT to print intermediate values for debugging, as of now it only supports cpu. Some issues are platform-specific, and this script could still come in handy (e.g. helped solve apache/tvm#16438, mlc-ai/mlc-llm#1638).

[Metal] Dispatch numerical stable tanh for metal

cc44df5

MasterJH5574 approved these changes Jan 19, 2024

View reviewed changes

Use stabl tanh for webgpu as well

c1fb010

MasterJH5574 self-assigned this Jan 20, 2024

junrushao approved these changes Jan 20, 2024

View reviewed changes

junrushao merged commit ffa404f into apache:main Jan 20, 2024

This was referenced Jan 20, 2024

Support Phi-2 on iOS mlc-ai/mlc-llm#1554

Closed

Update Phi metal and wasm mlc-ai/binary-mlc-llm-libs#79

Merged

[Debug] Update instrument debugging script mlc-ai/mlc-llm#1637

Merged

CharlieFRuan mentioned this pull request Feb 3, 2024

[Bug] gelu_new models on Metal with f32 - "Output probabilities are all NaNs" mlc-ai/mlc-llm#1505

Closed

ysh329 mentioned this pull request Apr 21, 2024

[Release] v0.16.0 Release Candidate Notes #16911

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Metal] Dispatch numerically stable tanh for metal #16438

[Metal] Dispatch numerically stable tanh for metal #16438

Uh oh!

CharlieFRuan commented Jan 19, 2024

Uh oh!

CharlieFRuan commented Jan 19, 2024

Uh oh!

CharlieFRuan commented Jan 19, 2024

Uh oh!

junrushao left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Metal] Dispatch numerically stable tanh for metal #16438

[Metal] Dispatch numerically stable tanh for metal #16438

Uh oh!

Conversation

CharlieFRuan commented Jan 19, 2024

Uh oh!

CharlieFRuan commented Jan 19, 2024

Uh oh!

CharlieFRuan commented Jan 19, 2024

Uh oh!

junrushao left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants