TechEcho
Show HN: Vector Embedding Server in under 100 lines of code

10 points by navaneethpk over 1 year ago

2 comments

mrjn over 1 year ago
Author here. I was looking for a Docker-based server that exposes a simple endpoint for generating vector embeddings of documents. The solution needs to handle lengthy documents that exceed the 512-token limit enforced by E5 models. Such documents need intelligent chunking, ideally at sentence boundaries, followed by taking the mean of the chunk vectors, to work effectively. Since I couldn't find a solution that met these criteria, I decided to create this setup myself.
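
For readers following along, the chunk-then-average idea described above can be sketched roughly as follows. This is not the author's code, only a minimal illustration under stated assumptions: `count_tokens` is a hypothetical whitespace word count standing in for the real E5 tokenizer, the regex sentence splitter is a crude placeholder, and `embed` is any caller-supplied function that maps a string to a vector.

```python
import re
from typing import Callable, List, Sequence

MAX_TOKENS = 512  # the E5 limit mentioned in the comment


def count_tokens(text: str) -> int:
    # Assumption: whitespace word count, NOT the real E5 tokenizer.
    return len(text.split())


def chunk_by_sentence(text: str, max_tokens: int = MAX_TOKENS) -> List[str]:
    """Greedily pack whole sentences into chunks of at most max_tokens."""
    # Crude splitter: break after ., ! or ? followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current: List[str] = []
    used = 0
    for sentence in sentences:
        n = count_tokens(sentence)
        if n > max_tokens:
            continue  # a single over-long sentence is skipped in this sketch
        if current and used + n > max_tokens:
            chunks.append(" ".join(current))
            current, used = [], 0
        current.append(sentence)
        used += n
    if current:
        chunks.append(" ".join(current))
    return chunks


def mean_vector(vectors: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise mean of equally sized vectors."""
    if not vectors:
        raise ValueError("no vectors to average")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def embed_document(
    text: str,
    embed: Callable[[str], Sequence[float]],
    max_tokens: int = MAX_TOKENS,
) -> List[float]:
    """Embed each chunk with the supplied function and average the results."""
    chunks = chunk_by_sentence(text, max_tokens)
    return mean_vector([embed(chunk) for chunk in chunks])
```

In a real setup `embed` would wrap the model call and `count_tokens` would use the model's own tokenizer, since word counts only approximate token counts.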
topicseed over 1 year ago
I see you're skipping sentences that are too long. Any thoughts on how you would handle them and chunk them further if they weren't skipped?

Good to see non-Go code from you, ha ;)
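
One possible answer to the question, offered only as a sketch and not as anything the author confirmed: fall back to fixed-size windows of words when a single sentence exceeds the limit, optionally overlapping the windows so context is not cut abruptly. The function name `split_long_sentence` and its `overlap` parameter are hypothetical, and whitespace-separated words again stand in for real model tokens.

```python
from typing import List


def split_long_sentence(sentence: str, max_tokens: int, overlap: int = 0) -> List[str]:
    """Split one over-long sentence into windows of at most max_tokens words.

    Consecutive windows share `overlap` words. Words approximate tokens here.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must satisfy 0 <= overlap < max_tokens")
    words = sentence.split()
    step = max_tokens - overlap
    pieces: List[str] = []
    for start in range(0, len(words), step):
        pieces.append(" ".join(words[start:start + max_tokens]))
        if start + max_tokens >= len(words):
            break  # the last window already reaches the end of the sentence
    return pieces
```

The resulting pieces could then be embedded and averaged like any other chunk, at the cost of losing the clean sentence boundary for that one sentence.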