FireAttention V3: Enabling AMD as a Viable Alternative for GPU Inference

#85 · ✸ 20 · 💬 0 · 11 hours ago · fireworks.ai · swyx
FireAttention V3: Enabling AMD as a Viable Alternative for GPU Inference



Send Feedback | WebAssembly Version (beta)