path: root/prelude
Commit message | Author | Date
* Add missing make_bool intrinsics in cuda prelude. (#4735) | Yong He | 2024-07-24
* Allow CPP/CUDA/Metal to lower/legalize buffer-elements to support column_majo... | ArielG-NV | 2024-07-18
* Move the file public header files to `include` dir (#4636) | kaizhangNV | 2024-07-17
* Add `float16` support to slang-torch (#4584) | Sai Praveen Bangaru | 2024-07-10
* Correct type for double log10 (#4550) | Ellie Hermaszewska | 2024-07-05
* Error out when constructing tensor views from tensors with 0 stride. (#4516) | Sai Praveen Bangaru | 2024-07-01
* Prevent pointer validation for zero-size arrays (#4021) | Sai Praveen Bangaru | 2024-04-24
* Avoid DXC warnings for missing bitwise op parantheses (#4004) | Jay Kwak | 2024-04-24
* Implement 8.14-8.19 of OpenGL-GLSL specification | ArielG-NV | 2024-04-03
* Improve cpp prelude. (#3725) | Yong He | 2024-03-08
* Enable SLANG_MAKE_VECTOR calls when using SLANG_CUDA_ENABLE_HALF without SLAN... | NBickford | 2024-02-24
* Generate lookup tables from cmake (#3461) | Ellie Hermaszewska | 2024-01-24
* WIP: CMake (#3326) | Ellie Hermaszewska | 2023-12-08
* CUDA: Fixes for NVRTC 12.x and warp mask ambiguity; adds CC 8.x warp reductio... | Neil Bickford | 2023-11-07
* Make the exponent return value from frexp int (#3284) | Ellie Hermaszewska | 2023-10-26
* More `slangpy` features + polishing (#3233) | Sai Praveen Bangaru | 2023-09-23
* Add check for contiguous tensors (#3199) | Sai Praveen Bangaru | 2023-09-08
* Remove unsupported torch types + add bool type. (#3197) | Sai Praveen Bangaru | 2023-09-08
* Misc. SPIRV Fixes, Part 2. (#3147) | Yong He | 2023-08-24
* Lower all ByteAddressBuffer uses for SPIRV. (#3143) | Yong He | 2023-08-23
* Only define atomics for `float2` and `float4` when CUDA arch<900 (#3041) | Sai Praveen Bangaru | 2023-08-02
* Avoid implicit casts or device transfers. (#2992) | Sai Praveen Bangaru | 2023-07-14
* Fix native string emit for CUDA/Cpp backend. (#2980) | Yong He | 2023-07-12
* Support for infinite literal of from 34.2432#INF (#2944) | jsmall-nvidia | 2023-06-27
* Various fixes for autodiff and slangpy. (#2876) | Yong He | 2023-05-09
* Set sharedMem argument to 0 when launching cuda kernel. (#2799) | Yong He | 2023-04-13
* Small fixes to TorchTensor. (#2790) | Yong He | 2023-04-11
* Fix linking issue in slangpy + no mask param for kernels. (#2778) | Yong He | 2023-04-05
* More builtin library support in torch backend. (#2760) | Yong He | 2023-03-30
* Convert tensor types in `make_tensor_view`. (#2755) | Yong He | 2023-03-29
* Add slangpy doc, fix cuda prelude. (#2748) | Yong He | 2023-03-28
* Update slang-llvm (#2735) | Yong He | 2023-03-26
* Add PyTorch C++ binding generation. (#2734) | Yong He | 2023-03-26
* Add support for emitting cuda kernel and host functions. (#2712) | Yong He | 2023-03-17
* Overhaul global inst deduplication and cpp/cuda backend. (#2654) | Yong He | 2023-02-16
* Preliminary debugBreak support (#2647) | jsmall-nvidia | 2023-02-14
* Fix code generation for matrix reshape. (#2568) | Yong He | 2022-12-14
* Fix inlining pass. (#2506) | Yong He | 2022-11-10
* f32tof16 and f16tof32 support for CPU targets (#2500) | jsmall-nvidia | 2022-11-09
* Make cpp-host prelude include scalar intrinsics. (#2478) | Yong He | 2022-10-31
* Run simple compute kernel in gfx-smoke test. (#2400) | Yong He | 2022-09-15
* Add gfx interface definition in Slang. (#2364) | Yong He | 2022-08-16
* Language server pointer type support + add `DLLImport` test (#2350) | Yong He | 2022-08-10
* Allow `class` to implement COM interface, [DLLExport] (#2338) | Yong He | 2022-07-25
* Improved bounds checking for C++/CUDA (#2263) | jsmall-nvidia | 2022-06-08
* Actual global support (#2262) | jsmall-nvidia | 2022-06-08
* COM interfaces with host callable (#2258) | jsmall-nvidia | 2022-06-02
* Support `[DllImport]` (#2181) | Yong He | 2022-04-12
* Allow slangc to generate exe from .slang file. (#2170) | Yong He | 2022-03-28
* Fixed naming conflicts in heterogeneous-hello-world (#2114) | David Siher | 2022-02-03