Show HN: I built vector search for COSS podcasts & livestreams

2 points by zcesur about 1 year ago
Hey HN! I built COSSgpt using videos from the Open Source Founder Podcast [1] and livestreams from COSS Office Hours [2][3].

I transcribed the VODs using Whisper and vectorized fixed-size segments from the transcripts with MPNet on Replicate GPUs. I made these segments overlap a little to prevent semantic meaning from being lost between segments.

Then I indexed the vectors using the HNSWLib in-memory vectorstore [4] and persisted the entire vectorstore into Tigris object storage [5] to cache multimedia and vectors across all Fly.io regions.

I built the app in Elixir, almost entirely server-side rendered, with minimal diffs sent to the client over WebSockets using Phoenix LiveView. I also used Livebook [6] a ton when I was building the multimedia processing & ML pipeline. I'm super bullish on Elixir for building webapps and/or MLops!

Let me know what you think :) If you're curious you can find the code at https://github.com/algora-io/tv

[1]: https://algora.io/podcast
[2]: https://tv.algora.io/peerrich
[3]: https://tv.algora.io/rfc
[4]: https://github.com/nmslib/hnswlib
[5]: https://tigrisdata.com
[6]: https://github.com/algora-io/tv/blob/2586950/scripts/cossgpt.livemd
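
For anyone curious what the overlapping fixed-size segments might look like in practice, here is a minimal sketch in Python. It is not taken from the linked repo (the app itself is written in Elixir); the function name `overlapping_segments` and the `size` and `overlap` parameters are hypothetical, and it assumes the transcript has already been split into words.

```python
from typing import List


def overlapping_segments(words: List[str], size: int, overlap: int) -> List[List[str]]:
    """Split a word list into fixed-size windows that share `overlap` words.

    Hypothetical helper for illustration only; `size` and `overlap` are
    assumed parameters, not values from the original project.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")

    step = size - overlap  # how far each new window advances
    segments: List[List[str]] = []
    for start in range(0, len(words), step):
        segments.append(words[start:start + size])
        if start + size >= len(words):
            break  # this window already reaches the end of the transcript
    return segments


if __name__ == "__main__":
    transcript = "we transcribed the video and split it into overlapping chunks".split()
    for segment in overlapping_segments(transcript, size=4, overlap=1):
        print(" ".join(segment))
```

Each window repeats the last `overlap` words of the previous one, so a sentence that straddles a boundary still appears whole in at least one segment before it is embedded.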

no comments
