master-1215ed7
1215ed7d · CUDA: Implemented row flattening for non-glm RoPE (#2468) · Jul 31, 2023