| field | value | date |
|---|---|---|
| author | jsmall-nvidia <jsmall@nvidia.com> | 2020-03-02 16:18:20 -0500 |
| committer | GitHub <noreply@github.com> | 2020-03-02 16:18:20 -0500 |
| commit | 8899c149b05def1cce626ea649012c4c974861de | |
| tree | 77e97c2997a653ba9262b32f55e9e3f37e166653 /source/core/slang-io.cpp | |
| parent | b85ca6f86d46ee3c4d5784d0bd4ebc8509e2a9bd | |
Additional Wave Intrinsic Support (#1252)
* Test for some wave intrinsics.
More wave intrinsic support on CUDA.
* Use shfl_xor_sync.
* Improvements around wave intrinsics.
Fix built-in integer types so that they belong to __BuiltinIntegerType.
* Improvements and fixes around Wave intrinsics.
* Added WaveIsFirstLane test.
No longer use __wavemask_lt, as it appears not to be available as an intrinsic.
* Small fixes to CUDA prelude.
* Add wave-active-product test.
Handle the special case for arbitrary sums.
* Used macro to implement CUDA wave intrinsics.
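
For readers unfamiliar with the xor-shuffle pattern mentioned above, here is a minimal host-side sketch in C++ of a butterfly reduction. It is not the code from this commit. It models a full 32-lane warp as an array and assumes every lane is active. The names `Warp`, `butterflyReduce`, `waveActiveSum`, and `waveActiveProduct` are hypothetical and chosen only for illustration. On the device, the read of `lanes[lane ^ offset]` would correspond to a `__shfl_xor_sync` exchange.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical host-side model of one 32-lane warp (not Slang or CUDA API).
constexpr std::size_t kWarpSize = 32;
using Warp = std::array<std::int64_t, kWarpSize>;

// Butterfly reduction: at each step, lane i combines its value with the value
// held by lane (i ^ offset). After log2(32) = 5 steps, every lane holds the
// reduction over all 32 lanes. Assumes all lanes participate.
template <typename Op>
Warp butterflyReduce(Warp lanes, Op op)
{
    for (std::size_t offset = kWarpSize / 2; offset > 0; offset /= 2)
    {
        Warp next = lanes;  // read from the old values, write to a copy
        for (std::size_t lane = 0; lane < kWarpSize; ++lane)
        {
            next[lane] = op(lanes[lane], lanes[lane ^ offset]);
        }
        lanes = next;
    }
    return lanes;
}

// Sum across the modeled warp; every lane receives the total.
Warp waveActiveSum(const Warp& lanes)
{
    return butterflyReduce(lanes, [](std::int64_t a, std::int64_t b) { return a + b; });
}

// Product across the modeled warp; every lane receives the total.
Warp waveActiveProduct(const Warp& lanes)
{
    return butterflyReduce(lanes, [](std::int64_t a, std::int64_t b) { return a * b; });
}
```

The sketch only works for associative, commutative operations over a full warp. Partially active warps need different handling, which this model does not attempt.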
Diffstat (limited to 'source/core/slang-io.cpp')
0 files changed, 0 insertions, 0 deletions
