The Alphaist - English Edition
Welcome to this episode of The Alphaist. The Alphaist is a deep-dive conversation series hosted by Peter Chen, Founder of Alphaist Partners, a fund focused on hard tech. Each episode explores the first principles of technology and entrepreneurship, featuring early-stage founders, engineers, and product innovators who are shaping the future. In this episode, we are joined by Lengyue, Chief Scientist of Fish Audio, and Rissa,CEO of Fish Audio — for a deep dive into the next era of AI voice. Founded from open-source roots, Fish Audio has grown from open-source roots into a leading platform for text-to-speech and voice cloning, powering over 1.1 million user-generated voice models worldwide. In this episode, Lengyue and Rissa share Fish Audio's thinking on AI Voice 2.0, including: why the team bet on a unified end-to-end architecture over cascaded pipelines — and what it unlocked for expressiveness and latency; how "noisy" data — overlapping speech, arguments, emotionally charged conversations — became their most valuable training asset rather than something to discard; and where the road toward full-duplex, real-time voice interaction is ultimately heading. We believe that only when voice truly carries emotion can AI begin to feel human. Listen in and hear what's coming next in AI voice. Contact Fish Audio Official Website: https://fish.audio/ [https://fish.audio/] Contact Alphaist Partners Official Website: https://alphaist.vc/ [https://alphaist.vc/] Partnership & Media: peter@alphaist.vc [peter@alphaist.vc] Disclaimer: This episode was originally recorded in Chinese. The English version was translated and voiced using Fish Audio's AI voice tool, with the consent of all participants. Visit https://fish.audio/ [https://fish.audio/] for more details.
5 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Alphaist - English Edition!