Skip to content
Snippets Groups Projects
Unverified Commit 5966f233 authored by leaves-zwx's avatar leaves-zwx Committed by GitHub
Browse files

Fused SelfAttention query multipy key and value op (#4660)

* op & kernel

* python api

* mv bgemm to self_att*.cu

* test case

* fix bugs

* update test

* add op to amp gray list

* mv to white list

* rename bgemm to BatchedGemm

* refine transpose

* oneflow_api is deprecated

* format
parent 26bf773b
No related branches found
No related tags found
No related merge requests found
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment