path: root/prelude
Commit message | Author | Age
...
* Generate lookup tables from cmake (#3461) | Ellie Hermaszewska | 2024-01-24
* WIP: CMake (#3326) | Ellie Hermaszewska | 2023-12-08
* CUDA: Fixes for NVRTC 12.x and warp mask ambiguity; adds CC 8.x warp reductio... | Neil Bickford | 2023-11-07
* Make the exponent return value from frexp int (#3284) | Ellie Hermaszewska | 2023-10-26
* More `slangpy` features + polishing (#3233) | Sai Praveen Bangaru | 2023-09-23
* Add check for contiguous tensors (#3199) | Sai Praveen Bangaru | 2023-09-08
* Remove unsupported torch types + add bool type. (#3197) | Sai Praveen Bangaru | 2023-09-08
* Misc. SPIRV Fixes, Part 2. (#3147) | Yong He | 2023-08-24
* Lower all ByteAddressBuffer uses for SPIRV. (#3143) | Yong He | 2023-08-23
* Only define atomics for `float2` and `float4` when CUDA arch<900 (#3041) | Sai Praveen Bangaru | 2023-08-02
* Avoid implicit casts or device transfers. (#2992) | Sai Praveen Bangaru | 2023-07-14
* Fix native string emit for CUDA/Cpp backend. (#2980) | Yong He | 2023-07-12
* Support for infinite literal of from 34.2432#INF (#2944) | jsmall-nvidia | 2023-06-27
* Various fixes for autodiff and slangpy. (#2876) | Yong He | 2023-05-09
* Set sharedMem argument to 0 when launching cuda kernel. (#2799) | Yong He | 2023-04-13
* Small fixes to TorchTensor. (#2790) | Yong He | 2023-04-11
* Fix linking issue in slangpy + no mask param for kernels. (#2778) | Yong He | 2023-04-05
* More builtin library support in torch backend. (#2760) | Yong He | 2023-03-30
* Convert tensor types in `make_tensor_view`. (#2755) | Yong He | 2023-03-29
* Add slangpy doc, fix cuda prelude. (#2748) | Yong He | 2023-03-28
* Update slang-llvm (#2735) | Yong He | 2023-03-26
* Add PyTorch C++ binding generation. (#2734) | Yong He | 2023-03-26
* Add support for emitting cuda kernel and host functions. (#2712) | Yong He | 2023-03-17
* Overhaul global inst deduplication and cpp/cuda backend. (#2654) | Yong He | 2023-02-16
* Preliminary debugBreak support (#2647) | jsmall-nvidia | 2023-02-14
* Fix code generation for matrix reshape. (#2568) | Yong He | 2022-12-14
* Fix inlining pass. (#2506) | Yong He | 2022-11-10
* f32tof16 and f16tof32 support for CPU targets (#2500) | jsmall-nvidia | 2022-11-09
* Make cpp-host prelude include scalar intrinsics. (#2478) | Yong He | 2022-10-31
* Run simple compute kernel in gfx-smoke test. (#2400) | Yong He | 2022-09-15
* Add gfx interface definition in Slang. (#2364) | Yong He | 2022-08-16
* Language server pointer type support + add `DLLImport` test (#2350) | Yong He | 2022-08-10
* Allow `class` to implement COM interface, [DLLExport] (#2338) | Yong He | 2022-07-25
* Improved bounds checking for C++/CUDA (#2263) | jsmall-nvidia | 2022-06-08
* Actual global support (#2262) | jsmall-nvidia | 2022-06-08
* COM interfaces with host callable (#2258) | jsmall-nvidia | 2022-06-02
* Support `[DllImport]` (#2181) | Yong He | 2022-04-12
* Allow slangc to generate exe from .slang file. (#2170) | Yong He | 2022-03-28
* Fixed naming conflicts in heterogeneous-hello-world (#2114) | David Siher | 2022-02-03
* Generalize heterogenous code emit (#1968) | David Siher | 2021-10-19
* Bring heterogeneous-hello-world back up to date. (#1935) | David Siher | 2021-09-14
* First Slang LLVM integration (#1934) | jsmall-nvidia | 2021-09-10
* CUDA layout corner cases/testing (#1881) | jsmall-nvidia | 2021-06-10
* Enable tracing rays with OptiX backend (#1871) | Nathan V. Morrical | 2021-06-04
* OptiX ray payload read/write support in raytracing pipeline shaders (#1853) | Nathan V. Morrical | 2021-05-25
* Read half->float RWTexture conversion (#1842) | jsmall-nvidia | 2021-05-15
* Surface access on CUDA is byte addressed in X (#1841) | jsmall-nvidia | 2021-05-15
* Support for HW format conversions for RWTexture on CUDA (#1840) | jsmall-nvidia | 2021-05-15
* CUDA half RWTexture write support/doc improvements (#1839) | jsmall-nvidia | 2021-05-14
* Support for reads from RWTexture<half> (#1837) | jsmall-nvidia | 2021-05-06