I stitched together Twilio, LLMs, speech understanding, and speech generation tools so that they all run in parallel in a single system. The result is a low-latency, interruptible AI phone agent with near-human quality.<p>I wanted to share my learnings and code with the community. Hope this ends up being useful for someone out there :)<p>Detailed writeup: <a href="https://amirkiani.xyz/posts/ai-dialer/" rel="nofollow">https://amirkiani.xyz/posts/ai-dialer/</a>
Code: <a href="https://github.com/akiani/aidialer">https://github.com/akiani/aidialer</a>
Example Call: <a href="https://amirkiani.xyz/audio/ai-dialer.m4a" rel="nofollow">https://amirkiani.xyz/audio/ai-dialer.m4a</a>