slang.git - Making it easier to work with shaders

Age	Commit message (Collapse)	Author
2023-02-21	Added support for simple while loops (#2667)	Sai Praveen Bangaru
	* Added support for simple while loops * Fix support for while loops by changing logic to grab the loop update block
2023-02-20	Miscellaneous backward autodiff fixes. (#2665)	Yong He
	* Fix differentiable type registration * Fix use of non-differentiable return value in a differentiable func. * Fix use of primal inst that does not dominate the diff block. * Fix primal inst hoisting, and add missing type legalization logic. * Make `detach` defined on all differentiable T. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-20	Add static for loop iteration inference. (#2659)	Yong He

2023-02-17	Allocate N+1 arrays instead of N to avoid out-of-bounds access when ↵	Sai Praveen Bangaru
	unzipping loops (#2663)
2023-02-17	AD: More legacy type handling cleanup + user-defined reverse-mode fix (#2662)	Sai Praveen Bangaru
	* WIP: Remove all legacy type checking * Fixed issue with user-defined backward derivatives not bypassing the AD process --------- Co-authored-by: Yong He <yonghe@outlook.com>
2023-02-17	AD: Remove the original loop condition upon inversion (#2661)	Sai Praveen Bangaru
	* Remove the original condition upon loop inversion (it's redundant, and causes out-of-bounds accesses) * minor fix (also removed the first loop check skip) * Cleanup unused insts * minor comment fix
2023-02-17	Fixed crash when lowering IR for no_diff struct member. (#2658)	Yong He
	* Fixed crash when lowering IR for no_diff struct member. * Improve `setInsertBeforeOrdinaryInst` and `setInsertAfterOrdinaryInst`. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-17	Cleaned up legacy differential type handling + type casting bugfixes (#2660)	Sai Praveen Bangaru

2023-02-17	Proper reverse-mode loop handling with splitting + inversion steps (#2656)	Sai Praveen Bangaru
	* Halfway to loop inversion * More progress towards proper loop inversion * More progress towards inverse insts. Only thing left is adding `counter>=0` at the right place * More fixes for inversion step. * Lots more fixes, added primal inst 'hoisting' mechanism as the central method that ensures primal values are placed in the right spot * Loop inversion is now functional * Cleaned up commented code * rename diffCounterVar -> diffCounterParam * minor update * removed some comments and commented code * Switch `IRBuilder(sharedIRBuilder)` to `IRBuilder(moduleInst)`
2023-02-16	Remove `SharedIRBuilder`. (#2657)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-16	Overhaul global inst deduplication and cpp/cuda backend. (#2654)	Yong He
	* Overhaul global inst deduplication and cpp/cuda backend. * Update IR documentation. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-15	Treat user defined backward derivative function as non differentiable. (#2650)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-15	Upgrade GLSLANG 12.0.0 (#2651)	jsmall-nvidia
	* #include an absolute path didn't work - because paths were taken to always be relative. * Update to glslang 12.0.0. Update SPIRV-Tools SPIRV-Headers.
2023-02-14	Preliminary debugBreak support (#2647)	jsmall-nvidia
	* #include an absolute path didn't work - because paths were taken to always be relative. * Preliminary support for debug break. * Add C++ debug break support. Add details about usage. * Improve debug break test details. * Make HLSL output a comment about no support. * Handle specialize for target assert, without a body if it has spv_instruction/target intrinsic
2023-02-13	Various auto-diff bug fixes. (#2646)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-13	Eliminate `continue` to allow unrolling any loops. (#2645)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-13	Add Loop Unrolling Pass. (#2644)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-11	Take into account existing initializer list type when performing coercions ↵	Ellie Hermaszewska
	(#2641) Fixes https://github.com/shader-slang/slang/issues/2189
2023-02-10	Fix several autodiff bugs. (#2643)	Yong He

2023-02-10	Fix checking of `[BackwardDerivativeOf]` attribute. (#2640)	Yong He
	* Fix checking of `[BackwardDerivativeOf]` attribute. * Fix crash in `canInstHaveSideEffectAtAddress`. * Fix. * Revert fix. * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-09	Reverse-mode Loop Support (#2635)	Sai Praveen Bangaru
	* Full loop support now working. MaxItersAttr in progress * Lookup table updates? * Fixed the max iters decoration * Minox fixes & remove superfluous code * fixup warnings * Revert "Lookup table updates?" This reverts commit 7d9b0793fb5239f31d1155776e846dcf1892d8d9. * Update 07-autodiff.md * Change maxiters to MaxIters * Added asserts * Update 07-autodiff.md
2023-02-09	Fixed derivatives for kIROp_Neg and kIROp_Div, added another test (#2639)	Sai Praveen Bangaru

2023-02-09	Use stable sort in generation of lookup tables (#2638)	Ellie Hermaszewska
	* Add Slang::List::stableSort * Use stable sort in generation of lookup tables * Disable newline translation when writing lookup tables
2023-02-07	Add backward derivatives for functions in diff.meta.slang (#2633)	winmad
	* WIP: start adding backward derivatives * Overhaul `transposeParameterBlock` to support `inout` params. * Small bug fixes. * Bug fix on differentiable intrinsic specialization. * Fixes. * Run autodiff tests on CPU. * Clean up. * Overhaul `transposeParameterBlock` to support `inout` params. * Small bug fixes. * Bug fix on differentiable intrinsic specialization. * Fixes. * Run autodiff tests on CPU. * Clean up. * More bug fixes., * WIP: working on detach * Arithmetic simplifications and more IR clean up logic. * WIP: adding detach and abs * Fix detach and abs * Fix. * Add IR transform pass for cleaner code emit. * Fix test cases. * Fix type system logic for reference type. * Add backward derivatives for functions that already have forward derivatives * Fix changes --------- Co-authored-by: Yong He <yhe@nvidia.com> Co-authored-by: Lifan Wu <lifanw@nvidia.com>
2023-02-07	Arithmetic simplifications and more IR clean up logic. (#2632)	Yong He

2023-02-06	Fix crash when processing nested switch. (#2624)	Yong He
	* Fix crash when processing nested switch. * Clean up. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-04	Patch transcription of `inout` non differentiable params. (#2623)	Yong He

2023-02-03	Overhaul `transposeParameterBlock` to support `inout` params. (#2621)	Yong He
	* Overhaul `transposeParameterBlock` to support `inout` params. * Small bug fixes. * Bug fix on differentiable intrinsic specialization. * Fixes. * Run autodiff tests on CPU. * Clean up. * More bug fixes., * Add test coverage on inout param. * Fix language server hinting for transcribed mutable params. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-02-03	Small fixes around repro (#2622)	jsmall-nvidia
	* #include an absolute path didn't work - because paths were taken to always be relative. * Fix issues in repo due to C++ expression evaluation ordering is undefined.
2023-02-03	Use SPIR-V opcode names rather than numbers (#2571)	Ellie Hermaszewska
	* s/emititng blobal/emitting global * Use SPIR-V opcode names rather than numbers * regenerate Visual Studio project files * Use names for extended SPIR-V GLSL instructions * Add missing operand for SPIR-V extended instruction * Add warning aginst modifying generated hashing files * Squash warnings on MSVC
2023-02-01	Support `out` parameters in backward differentiation. (#2619)	Yong He
	* Support `out` parameters in backward differentiation. * Fixes. * Fix cleanup. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-31	Patched support for multi-return and fallthrough if-else with break stmts ↵	Sai Praveen Bangaru
	(#2617)
2023-01-30	Add transposition logic for constructor opcodes. (#2618)	Yong He
	* Add transposition logic for constructor opcodes. * Fix. * Add language server regression test. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-30	Make ArrayExpressionType a DeclRefType and define its autodiff extension in ↵	Yong He
	stdlib. (#2615) * Allow array parameters in forward diff. * Use type canonicalization instead of coersion. * Reimplement array type. * Fix. * Update test case. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-30	Overhauled reverse-mode control flow handling (#2608)	Sai Praveen Bangaru
	* Added switch-case support; fixed non-diff parameter transposition * Made region propagation much more robust. Partial loop unzip implementation * WIP: Added most loop handling code, and a test. Still untested * Added CFG Normalization pass + CFG Reversal Pass + Loop Unzipping + most loop transcription * Add single-iter-loop test. * proj files * removed comments * Update reverse-loop.slang * Removed out-of-date code * Disabled IR validation during constructSSA phase of normalizeCFG. constructSSA now reuses sharedBuilder * Moved normalizeCFG() call to prepareFuncForBackwardDiff()
2023-01-27	Register allocation during phi elimination. (#2613)	Yong He
	* Register allocation during phi elimination. * Enhance the test case. * Cleanup line breaks in test case. * remove unncessary line break changes. * More cleanups. --------- Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-27	Add ASAN support + fixes (#2614)	skallweitNV
	* Add ASAN support to premake * Fix StringRepresentation when ASAN is enabled * Fix deep recursion in slang-generate * Fix hello-world example * Fix gpu-printing example * Linux fix * Try fixing linux * Add missing include
2023-01-25	Unify UpdateField and UpdateElement with access chain. (#2611)	Yong He
	* Unify UpdateField and UpdateElement with access chain. * Fix warnings. Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-25	Cleanup IR representation of interface member derivative. (#2610)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-24	Reimplement address elimination. (#2605)	Yong He
	* Reimplement address elimination pass. * Fix error. * Update test references. Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-24	Small fix for "static" in doc output (#2606)	jsmall-nvidia
	* #include an absolute path didn't work - because paths were taken to always be relative. * Upgrade to slang-llvm-13.x-33 * Kick - as build failed on download egress. * Output "static" on methods in doc output.
2023-01-23	Full address insts elimination for backward autodiff. (#2604)	Yong He
	Co-authored-by: Yong He <yhe@nvidia.com>
2023-01-19	Add diagnostic for calling non-bwd-diff func from bwd-diff func. (#2602)	Yong He

2023-01-17	First custom backward-derivative test case working. (#2598)	Yong He

2023-01-17	Add `set` to spirv_instruction (#2597)	jsmall-nvidia

2023-01-17	Added switch-case support; fixed non-diff parameter transposition (#2596)	Sai Praveen Bangaru

2023-01-15	Switched to a much simpler method to transpose control flow, nested control ↵	Sai Praveen Bangaru
	flow works now (#2595)
2023-01-14	Support custom backward derivative attribute. (#2594)	Yong He

2023-01-14	Fixes for crash when inlining at global scope (#2593)	Theresa Foley
	* Fixes for crash when inlining at global scope Recent changes to the way inlining is implemented in the Slang compiler have broken certain scenarios involving `static const` declarations. The basic problem is that the initial-value expression for a `static const` gets lowered into IR code at the global scope of a module, and if that code includes `call`s to stdlib operations marked `forceInlineEarly`, then we end up trying to apply inlining to code at module scope. The current inlining operation assumes that all `call`s are in basic blocks, and that the correct way to do inlining involves splitting those blocks. This change adds logic to detect when the callee at a call site to be inlined consists of a single basic block ending in a `return`, and in that case it invokes specialized inlining logic that doesn't split basic blocks and doesn't need to care if the original `call` is in a basic block. Thus we are able to inline calls to single-basic-block `forceInlineEarly` functions called as part of the initialization for global-scope `static const` variables. This logic does not solve the problem of calls to multi-block `forceInlineEarly` functions from the global scope. Such calls cannot really be inlined. A secondary problem that arises when inlining such calls is that the callee might include local temporaries (`var` instructions) that are read and written (`load`s and `store`s), and none of those instructions should be allowed at the global scope. In the case of the functions being inlined here, the `load`/`store` operations are superfluous, and should be cleaned up by our SSA pass. The only reason that they seem to not be getting cleaned up in the case that was been triggering crashes is that the callee is a generic. The current logic for the SSA pass was skipping the bodies of generic functions, so they would not be cleaned up. This change enables the SSA pass to apply to the bodies of generic functions, and also ensures that SSA cleanups are applied before any `forceInlineEarly` functions get inlined. * fixup: liveness test outputs
2023-01-13	Frontend work for `[BackwardDerivative]` and `[BackwardDerivativeOf]`. (#2589)	Yong He
	* Frontend work for `[BackwardDerivative]` and `[BackwardDerivativeOf]`. * Fix clang issue. * Fix. * fix gcc issue * fix formatting. Co-authored-by: Yong He <yhe@nvidia.com>