flash-attn: Python wheels for CUDA cu12 + torch2.1
cxx11abiFALSE
cxx11abiTRUE