Skip to content

Commit 7f0e146

Browse files
Optimize gpu reductions (#27)
* Add reduction clause to target_teams_distribute * Add reductions tests for nested for under parallel * Optimize GPU reductions - Use a 2-level approach with atomics - Support DSA_REDUCTION_MUL for nested for directices * Clean up code
1 parent 05827a9 commit 7f0e146

File tree

5 files changed

+290
-250
lines changed

5 files changed

+290
-250
lines changed

src/numba/openmp/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7448,7 +7448,7 @@ def NUMBER(self, args):
74487448
| thread_limit_clause
74497449
| data_default_clause
74507450
| data_sharing_clause
7451-
// | reduction_default_only_clause
7451+
| reduction_clause
74527452
| lastprivate_clause
74537453
| collapse_clause
74547454
| dist_schedule_clause

0 commit comments

Comments
 (0)