change attention logic to use tridao flash attention in some cases#1
Open
bmedishe wants to merge 1 commit into
Open
change attention logic to use tridao flash attention in some cases#1bmedishe wants to merge 1 commit into
bmedishe wants to merge 1 commit into