Just Now Possible
Guests * Ernesto Garcia, Front-end Product Engineer, Doist * Thomas Jost, Backend Software Engineer, Doist * Hugo Fauquenoi, Product Manager, Doist In this episode * How Doist's 2-3 month AI exploration phase led to Ramble — and why voice-to-task emerged as the top contender * The user research insight behind Ramble: people using pen and paper or ChatGPT voice to brainstorm tasks before committing them to Todoist * Why Ramble skips transcription entirely and processes raw audio directly with a Gemini live audio model * How the model makes tool calls (add task, edit task, delete task) in real time while the user is still speaking — no text output at all * Designing for the driving use case: sound effects as audio confirmation cues alongside visual task cards * The challenge of teaching an LLM to capture tasks literally without over-interpreting or doing them — and how temperature tuning played a role * Date handling complexity: injecting the current date, normalizing to days vs. months, and always outputting dates in English for the natural language parser * Building an LLM-judge eval system with 20+ language recordings from 100+ employees across 35 countries to catch prompt regressions * Why Doist chose to inject the full project/label list into the system prompt instead of building a RAG pipeline — and why it worked * How easy correction beats perfect first-time accuracy in natural language interfaces * What's next: multimodal task capture from images and text blobs, Apple Watch support, and automation integrations Resources & Links * Todoist [https://todoist.com/?ref=producttalk.org] * Doist [https://doist.com/?ref=producttalk.org] * Google Vertex AI (Gemini) [https://cloud.google.com/vertex-ai?ref=producttalk.org] Chapters: 00:00 Meet the Doist Team 01:40 What Doist Builds 02:27 Ramble Voice to Tasks 04:16 Why Voice Matters 07:42 Brain Dump Insight 09:46 Prototyping With LLMs 11:08 Live Audio Workflow 14:32 Driving Friendly UX 18:47 Tool Only Architecture 26:06 Evals and Multilingual Testing 28:41 Taming Dates and Time 33:28 Fixing Date Confusion 33:43 Defining Task Boundaries 34:34 Capture Versus Do 37:17 Tuning Creativity Levels 39:01 Evals Across Languages 41:23 Feedback and Regressions 44:09 Model Upgrades Over Time 46:33 Projects Labels Context 51:40 Handling Ambiguous Names 54:23 Whats Next Multimodal 58:48 From Capture to Execution 59:46 Closing Thoughts
26 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de Just Now Possible!