Frontier Systems
This week on Office Hours, Anjney Midha and Mike Abbott are joined by Reiner Pope, CEO and co-founder of MatX. Reiner spent a decade at Google, first designing chips for neural nets on the TPU team and then writing the inference stack for PaLM, before leaving at the end of 2022 to bet that frontier-scale workloads deserved a chip designed from scratch around them. He walks through the architecture decisions behind MatX, including why intelligence per picojoule is the eval that matters, how to manage co-design risk when an error on the logic die costs hundreds of billions in CapEx, the trust boundary problem of working with frontier labs whose model architecture is their core IP, and why one well-balanced chip can serve pre-fill, decode, and training rather than splintering into specialized SKUs. He also gets into the parts of the job nobody talks about, such as scaling a supply chain from zero to gigawatts, fitting inside NVIDIA's de facto rack standard while Google's vertical integration runs ahead, and where SRAM stops scaling once context windows pass a million tokens.
5 episodes