TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Help with Hidden Markov Model for Topical Text Segmentation

1 pointsby wbarberalmost 2 years ago

1 comment

wbarberalmost 2 years ago
I&#x27;m attempting to write a function that splits a long document into shorter segments of text, splitting the text into the topics discussed as a step in a data processing pipeline prior to embedding the shorter segments of text for vector search.<p>I&#x27;m attempting to use v1.0 of the pomegranate python library as I get the impression it will be more performant than some of the other common options I looked at. Below is my code. I&#x27;m a self-taught developer just trying to solve a niche problem that&#x27;s of interest to me so I&#x27;ve not used any of these libraries before or attempted to build a hidden markov model before so be gentle and many thanks for the help.<p>You can see my current attempt and the error I&#x27;m getting at the link I provided. As well as here on stack overflow if you want some internet karma for your kindness: <a href="https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;76409619&#x2F;hidden-markov-model-for-topical-text-segmentation" rel="nofollow">https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;76409619&#x2F;hidden-markov-m...</a>