aboutsummaryrefslogtreecommitdiff
path: root/modules/sub_quadratic_attention.py
AgeCommit message (Collapse)Author
2023-05-11Autofix Ruff W (not W605) (mostly whitespace)Aarni Koskela
2023-05-11repair #10266AUTOMATIC
2023-05-10Update sub_quadratic_attention.pyLouis Del Valle
1. Determine the number of query chunks. 2. Calculate the final shape of the res tensor. 3. Initialize the tensor with the calculated shape and dtype, (same dtype as the input tensors, usually) Can initialize the tensor as a zero-filled tensor with the correct shape and dtype, then compute the attention scores for each query chunk and fill the corresponding slice of tensor.
2023-01-25Add UI setting for upcasting attention to float32brkirch
Adds "Upcast cross attention layer to float32" option in Stable Diffusion settings. This allows for generating images using SD 2.1 models without --no-half or xFormers. In order to make upcasting cross attention layer optimizations possible it is necessary to indent several sections of code in sd_hijack_optimizations.py so that a context manager can be used to disable autocast. Also, even though Stable Diffusion (and Diffusers) only upcast q and k, unfortunately my findings were that most of the cross attention layer optimizations could not function unless v is upcast also.
2023-01-09Remove fallback for Protocol import and remove Protocol import and remove ↵AUTOMATIC
instances of Protocol in code add some whitespace between functions to be in line with other code in the repo
2023-01-07Add fallback for Protocol importProGamerGov
2023-01-06Added licensebrkirch
2023-01-06Use narrow instead of dynamic_slicebrkirch
2023-01-06Add Birch-san's sub-quadratic attention implementationbrkirch