| 1cebccc72b fix: adds causal to attention params to check when using flash attn v1 | | |
|---|---|---|
| __init__.py | | |
| common.py | | |
| cuda.py | | |
| flash_attn_triton.py | | |
| flash_infer.py | | |
| ipex.py | | |
| rocm.py | | |