<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Smart Speakers]]></title><description><![CDATA[A podcast about Voice AI]]></description><link>https://www.smartspeakers.fm</link><image><url>https://substackcdn.com/image/fetch/$s_!MZLe!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F315aa011-dee4-4094-95a6-e28cef339ea2_1000x1000.png</url><title>Smart Speakers</title><link>https://www.smartspeakers.fm</link></image><generator>Substack</generator><lastBuildDate>Sun, 19 Apr 2026 01:07:00 GMT</lastBuildDate><atom:link href="https://www.smartspeakers.fm/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Unison Labs Inc.]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[dave@unison.fm]]></webMaster><itunes:owner><itunes:email><![CDATA[dave@unison.fm]]></itunes:email><itunes:name><![CDATA[Dave Zohrob]]></itunes:name></itunes:owner><itunes:author><![CDATA[Dave Zohrob]]></itunes:author><googleplay:owner><![CDATA[dave@unison.fm]]></googleplay:owner><googleplay:email><![CDATA[dave@unison.fm]]></googleplay:email><googleplay:author><![CDATA[Dave Zohrob]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The 500ms Dash—Nikhil Gupta, VAPI]]></title><description><![CDATA[A super-deep dive into ultra-low latency voice AI infrastructure, from the CTO of the voice platform powering over 44 million calls]]></description><link>https://www.smartspeakers.fm/p/the-500ms-dashnikhil-gupta-vapi</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/the-500ms-dashnikhil-gupta-vapi</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 
07 Apr 2025 12:02:33 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/160666471/c95c316e42c543f2e6cc39d86aa6f977.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Nikhil Gupta is the cofounder and CTO of VAPI. He&#8217;s been in the trenches building and scaling one of the biggest voice platforms in the world.</p><p>On this episode, he explains how VAPI aims to create a voice-default future where we talk to all our computers&#8212;and goes deep into every step of VAPI&#8217;s voice pipelines and the technical challenges along the way.</p><p>Some highlights:</p><p>&#9742;&#65039; <strong>Massive scale: </strong>VAPI has processed 44 million voice calls on its platform, growing from a COVID-era one-click Zoom meeting button to a full voice infrastructure company used by thousands of developers.</p><p>&#9889;&#65039; <strong>Latency matters:</strong> Voice AI needs to respond within 500 milliseconds to feel natural to humans. That means cleaning audio, detecting when users are done speaking, transcribing text, generating responses, converting to speech, and handling interruptions&#8212;all within a fraction of a second.</p><p>&#128483;&#65039; <strong>Voice-first future: </strong>Nikhil is betting on a future where voice becomes our default interface with all computing systems.</p><p>If you&#8217;ve ever wondered how a voice API actually works&#8212;this is the episode for you.
</p><p><strong>Chapters</strong></p><p>00:00 - Introducing VAPI</p><p>04:42 - Pivoting through COVID</p><p>05:42 - ChatGPT existential crisis</p><p>08:33 - Technical challenges of voice</p><p>12:42 - Anatomy of a voice call</p><p>14:46 - Knowing when someone is done speaking</p><p>18:37 - Routing to the fastest model</p><p>22:07 - Knowledge and context injection</p><p>26:47 - The text-to-speech bottleneck</p><p>31:14 - Handling interruptions gracefully</p><p>33:43 - The 500-millisecond barrier</p><p>36:56 - The DNS latency discovery</p><p>39:25 - Scaling the team and what's next<br></p><p><strong>Links</strong></p><ul><li><p><a href="https://vapi.ai/">VAPI</a> </p></li><li><p><a href="https://www.linkedin.com/in/nikhilro/">Nikhil Gupta on LinkedIn</a></p></li><li><p><a href="https://www.producthunt.com/products/vapi">VAPI on Product Hunt</a></p></li><li><p><a href="https://stratechery.com/">Stratechery by Ben Thompson</a> - Recommended reading from Nikhil</p></li></ul>]]></content:encoded></item><item><title><![CDATA[Your AI interviewer will see you now—Varun Khurana, Wayfaster]]></title><description><![CDATA[Why job candidates actually prefer talking to AI over getting ghosted by recruiters]]></description><link>https://www.smartspeakers.fm/p/your-ai-interviewer-will-see-you</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/your-ai-interviewer-will-see-you</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 31 Mar 2025 12:02:48 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/160216644/3483ce51dab8753e4014ef6a8e9e2b5e.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Varun Khurana wants every job candidate to have the best possible chance to prove themselves for every job they apply to. 
He&#8217;s been in the trenches building with voice AI for 2+ years, and has some great nuggets to share:</p><ul><li><p><strong>&#128172; How Wayfaster started: </strong>"Wayfaster started out as a way to interview your entire pipeline in a couple seconds... the only way you can really do that is using voice AI."</p></li><li><p><strong>&#9940;&#65039; Why AI video avatars are a no-go: </strong>"We tried avatars and every candidate hated it... I think it just makes candidates feel dumb. It's just like, why am I talking to an avatar? Like how stupid do you think I am?"</p></li><li><p><strong>&#128209; How AI interviews give candidates a chance: </strong>"Candidates have just gotten really accustomed to getting this default auto-reject email... At least here, I know that if I do a good job on the interview, I have a shot at this opportunity."</p></li><li><p><strong>&#128059; Why he&#8217;s bearish on consumer voice AI: </strong>"I'm actually a little more bearish on the consumer voice AI use cases. I like voice AI in constrained environments, like B2B, there's an amount of intent... I don't think people are going to want to be talking to their phones all the time."</p></li></ul><p>Hope you enjoy the conversation!
As always, you can subscribe at https://smartspeakers.fm.</p><h2>Chapters</h2><p>0:00 Welcome and intro</p><p>1:32 First big AI aha moment</p><p>5:08 Career background</p><p>7:40 Exploring startup ideas</p><p>10:34 How WayFaster was born</p><p>14:21 Initial target markets</p><p>16:45 Candidate response to AI interviews</p><p>19:07 Benefits for recruiters and candidates</p><p>23:30 How AI is transforming recruiting</p><p>26:51 The two-sided recruiting game</p><p>31:05 Why Varun is bearish on consumer voice AI</p><p>35:09 Future generations and AI adoption</p><p>38:09 Content recommendations and closing</p><h2>Links</h2><ul><li><p><a href="https://wayfaster.com">WayFaster Website</a></p></li><li><p><a href="https://www.linkedin.com/in/vkhurana2/">Varun Khurana on LinkedIn</a></p></li><li><p><a href="https://www.southparkcommons.com/">South Park Commons</a></p></li><li><p><a href="https://deepgram.com/">Deepgram</a></p></li><li><p><a href="https://charlesrubenfeld.substack.com/">Charles Rubenfeld's Newsletter</a></p></li></ul>]]></content:encoded></item><item><title><![CDATA[The pipeline for voice—Kwindla Hultman Kramer, CEO of Daily]]></title><description><![CDATA[From early network computing to founding Daily and creating PipeCat, the leading open-source voice agent framework]]></description><link>https://www.smartspeakers.fm/p/the-pipeline-for-voicekwindla-hultman</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/the-pipeline-for-voicekwindla-hultman</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 24 Mar 2025 12:03:40 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/159613061/04cefecc08e59e4782859ef9277b792f.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Much of my childhood was spent in my basement, dialing up other people&#8217;s computers to trade messages and play games.</p><p>Kwin Kramer from Daily remembers that time, too&#8212;and says today's voice AI moment feels just like those
early internet days. That sense of endless possibilities is back.</p><p>His open-source project PipeCat has become the standard toolkit for voice agents. What began as an experiment now powers voice AI for OpenAI, Google DeepMind, and countless startups, making conversations feel natural and responsive.</p><p>Some highlights:</p><ul><li><p>That early internet feeling is back: "1995 to 1999 felt a certain way. It never felt that way again until 2023 to 2025."</p></li></ul><ul><li><p>GPT-4 transformed Daily's business by removing a key bottleneck: "Previously you needed two humans for a conversation. Now you only need one, maybe not even that."</p></li><li><p>Voice AI's killer feature? Latency matters: "If response times are long, you're in that uncanny valley where people get uncomfortable."</p></li><li><p>Kwin's bold prediction: "We're all going to have friends in our group chats that aren't human because LLMs are actually really entertaining."</p></li></ul><p>Hope you enjoy it as much as we did.</p><p><strong>Links</strong></p><p>Daily: https://daily.co/<br>PipeCat: https://pipecat.ai/<br>Kwin on Twitter: https://twitter.com/kwindla</p><p><strong>Chapters</strong></p><p>0:00 Intro</p><p>2:02 First AI aha moment</p><p>5:43 MIT Media Lab beginnings</p><p>9:05 BBS and door games</p><p>15:26 The AllAfrica journey</p><p>18:54 Starting Daily</p><p>21:13 COVID's impact on WebRTC</p><p>22:36 GPT-4 transformation</p><p>31:26 Building voice for LLMs</p><p>35:17 PipeCat's key challenges</p><p>44:10 The future of speech-to-speech</p><p>47:10 Voice AI adoption trends</p><p>52:34 Vibe coding revolution</p><p>56:11 What's next for PipeCat</p>]]></content:encoded></item><item><title><![CDATA[The craft behind voice AI magic with Tom Shapland]]></title><description><![CDATA[Getting from zero to 80% is easy. 
It's that last 20% that separates the wow demo from something people actually trust.]]></description><link>https://www.smartspeakers.fm/p/the-craft-behind-voice-ai-magic-with</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/the-craft-behind-voice-ai-magic-with</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 17 Mar 2025 12:04:06 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/159211289/0c48a9406ee27b31a7500ef8e3c99262.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>This week, Tom Shapland shares his journey from agriculture tech to founding Canonical, a tool that helps voice AI developers understand and improve conversations by mapping call stages.</p><p>Tom has deep experience in the voice world&#8212;we loved this conversation and we&#8217;re sure you will too!<br></p><p><strong>Links</strong></p><p>https://canonical.chat<br>https://x.com/Tom_Shapland<br>https://www.linkedin.com/in/tom-shapland-b4494212/</p><p></p><p><strong>Chapters</strong></p><p>0:00 - Intro</p><p>1:05 - Welcome and first AI moment</p><p>3:03 - Computer vision for thirsty plants</p><p>5:50 - Explaining AI to farmers</p><p>7:52 - Hardware is hard</p><p>8:31 - Non-VC shaped business struggles</p><p>16:51 - Starting a new company</p><p>19:41 - From metrics to conversation stages</p><p>22:42 - Voice AI evolution</p><p>26:40 - Balancing determinism and freedom</p><p>29:53 - Who uses Canonical and when</p><p>32:39 - Don't make your agent spell anything</p><p>35:52 - Zero to 80% is easy, production is hard</p><p>39:28 - Future of voice AI adoption</p><p>42:21 - Book recommendations and closing</p>]]></content:encoded></item><item><title><![CDATA[Olivia Moore, a16z: Voice as a wedge]]></title><description><![CDATA[Voice AI's path from demo to business transformation, and why voice is like the Internet in 1999]]></description><link>https://www.smartspeakers.fm/p/olivia-moore-a16z-voice-as-a-wedge</link><guid 
isPermaLink="false">https://www.smartspeakers.fm/p/olivia-moore-a16z-voice-as-a-wedge</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 10 Mar 2025 12:03:37 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/158700816/bf2f8ad28f8dd6df4ddc204b99b53e38.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Voice AI is having its moment, but pretty soon every company will be a voice company.</p><p>Olivia Moore is the author of a16z&#8217;s <a href="https://a16z.com/ai-voice-agents-2025-update/">market report on voice AI agents</a>, which is a must-read if you&#8217;re interested in the space.</p><p>She&#8217;s an amazing thinker and we cover a lot in this conversation, including:</p><ul><li><p>Olivia's thesis that Voice AI is in its &#8220;internet in 1999&#8221; moment</p></li><li><p>The "wedge" strategy: using Voice AI to get in the door before transforming broader business operations</p></li><li><p>How recruiting became a surprising early adopter of Voice AI (candidates actually prefer it)</p></li><li><p>Voice AI's unique ability to handle compliance and stay on-script in regulated industries</p></li><li><p>Shifting from per-minute pricing to transaction-based fees or success metrics</p></li><li><p>Why big tech platforms (OpenAI, Google) won't dominate vertical-specific voice applications</p></li></ul><p><strong>Links</strong></p><ul><li><p>Olivia on X: <a href="https://x.com/omooretweets">https://x.com/omooretweets</a> (must-follow!)</p></li><li><p>Her <a href="https://a16z.com/ai-voice-agents-2025-update/">market report on voice AI agents</a></p></li><li><p><a href="https://classicreload.com/dr-sbaitso.html">Dr Sbaitso emulator</a> - this was packaged with new Sound Blaster sound cards!
Strange stuff.</p></li></ul><p><strong><br>Chapters<br></strong>0:00 Intro and welcome<br>1:24 Olivia's first AI "aha" moments <br>3:42 Coolest non-voice AI tools<br>6:29 How Olivia got into the Voice AI space<br>10:03 From consumer to enterprise<br>13:50 DMV-bot<br>16:22 Voice AI as a wedge<br>20:57 Traction in the voice market<br>24:24 Voice AI for recruiting<br>28:59 Why AI doesn't go off-script<br>31:03 Better experiences for candidates and customers<br>33:24 The future of call centers<br>37:02 Bear cases for Voice AI<br>41:25 Open problems<br>44:29 Pricing<br>47:12 Working with twin sister Justine<br>50:11 Three things Olivia's excited about </p><p><strong>Subscribe</strong> and listen: <br>https://smartspeakers.fm</p>]]></content:encoded></item><item><title><![CDATA[Voice AI for Healthcare: Phil Markunas, Standard Practice]]></title><description><![CDATA[Pivoting from healthcare payments to AI voice agents that handle frustrating insurance calls; & why teaching machines to have natural conversations might be as challenging as self-driving cars.]]></description><link>https://www.smartspeakers.fm/p/voice-ai-for-healthcare-calls-phil</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/voice-ai-for-healthcare-calls-phil</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 03 Mar 2025 13:02:27 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/158199505/9b0dd9afc9607daa99373a2510969a31.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>We chat with Phil Markunas about his wild journey from Army staff sergeant to voice AI founder.</p><p>Phil shares how Standard Practice evolved from a healthcare payment startup to building an AI assistant that handles complex insurance calls. Along the way, we learn about Phil's life in Japan, his Oreo obsession, and why he believes natural conversation is as hard as self-driving cars. 
</p><p>I loved Phil&#8217;s concept of &#8220;success, but doof&#8221; &#8212; it neatly encapsulates just how hard it is to measure the success of AI voice calls.</p><p>Stay tuned next week for our chat with Olivia Moore from a16z. </p><p>See you soon&#8212;<br>dave &amp; harish</p><p><strong>Follow Phil:</strong><br>https://www.linkedin.com/in/phil-markunas<br>https://philm.io/<br>https://x.com/philmarkunas</p><p><strong>Standard Practice:</strong><br>https://standardpractice.ai</p><p><strong>Chapters</strong><br>0:00 - Introduction to Phil Markunas, CTO of Standard Practice<br>1:34 - Emotional responses to voice AI<br>5:17 - How Army service shaped Phil's leadership<br>8:07 - Living in Japan<br>13:04 - The Nibble Health origin story and creating a medical bill payment card<br>17:59 - Facing challenges and developing the SimpleBill medical analysis tool<br>22:07 - Pivoting to voice AI<br>26:25 - Why voice conversation is an AGI-level problem<br>33:21 - Beyond "just prompt it up" to building sophisticated voice architecture<br>40:45 - "Success, but doof" moments; and the future of Standard Practice</p><p><strong>Subscribe &#8212; and let us know what you think!<br></strong>https://smartspeakers.fm</p>]]></content:encoded></item><item><title><![CDATA[Episode 1: ChatGPT lies to us]]></title><description><![CDATA[We interview ChatGPT, who lies to us multiple times (and does a killer podcast ad read)&#8212;and talk about what we've been working on for the last 6 months.]]></description><link>https://www.smartspeakers.fm/p/episode-1-chatgpt-lies-to-us</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/episode-1-chatgpt-lies-to-us</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Mon, 24 Feb 2025 14:17:43 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/157781746/7aeb68d708ec19effd7008e486752d8c.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Smart Speakers is a podcast about voice AI. 
Our first guest is, of course, an AI: ChatGPT in advanced voice mode.</p><p><strong>We talk about:</strong></p><ul><li><p>Our journey since selling Chartable to Spotify in 2022: life inside Spotify, and Harish&#8217;s explorations outside corporate life in construction and factory tech</p></li><li><p>Starting to work together again last July</p></li><li><p>Exploring projects: a Chartable-like analytics platform for audiobooks, a text-to-speech bookmarking tool called Earmark, and finally experiments with voice AI</p></li><li><p>We interview ChatGPT, which is surprisingly good at podcast ad reads!</p></li><li><p>Harish thinks ChatGPT lied to us at least twice. What do you think?</p></li></ul><p><strong>Stay tuned next week:</strong> Phil Markunas from <a href="http://standardpractice.ai">Standard Practice</a> discusses voice AI in healthcare billing.</p><p>Thanks for listening, reading, and watching. We&#8217;d love to hear your feedback! Subscribe and comment at <a href="https://smartspeakers.fm">https://smartspeakers.fm</a></p>]]></content:encoded></item><item><title><![CDATA[Smart Speakers coming Monday Feb 24th!]]></title><description><![CDATA[We're starting a podcast about Voice AI &#129302;]]></description><link>https://www.smartspeakers.fm/p/smart-speakers-coming-monday-feb</link><guid isPermaLink="false">https://www.smartspeakers.fm/p/smart-speakers-coming-monday-feb</guid><dc:creator><![CDATA[Dave Zohrob]]></dc:creator><pubDate>Wed, 05 Feb 2025 22:00:50 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/156560995/f35a7baf3dba285f3d16ff7688c1c73b.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>We&#8217;re super-excited to be launching Smart Speakers, coming to you Monday February 24th.</p><p>You can subscribe at smartspeakers.fm, on YouTube, or in your fave podcast app.</p><p>See you soon! </p><p>&#8212;Dave and Harish</p>]]></content:encoded></item></channel></rss>