Loading...
Loading...
Found 2 Skills
Calculate MFU (Machine FLOP Utilization) for operators such as matmul/GEMM, and provide clear formulas and derivation processes.
Build an operator-level compute template for an LLM and estimate FLOPs/MFU for a serving shape. Use when you need tensor shapes, per-op FLOPs, kernel-to-op MFU mapping, or parallelism what-if analysis.