TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Help with Hidden Markov Model for Topical Text Segmentation

1 点作者 wbarber大约 2 年前

1 comment

wbarber大约 2 年前
I&#x27;m attempting to write a function that splits a long document into shorter segments of text, splitting the text into the topics discussed as a step in a data processing pipeline prior to embedding the shorter segments of text for vector search.<p>I&#x27;m attempting to use v1.0 of the pomegranate python library as I get the impression it will be more performant than some of the other common options I looked at. Below is my code. I&#x27;m a self-taught developer just trying to solve a niche problem that&#x27;s of interest to me so I&#x27;ve not used any of these libraries before or attempted to build a hidden markov model before so be gentle and many thanks for the help.<p>You can see my current attempt and the error I&#x27;m getting at the link I provided. As well as here on stack overflow if you want some internet karma for your kindness: <a href="https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;76409619&#x2F;hidden-markov-model-for-topical-text-segmentation" rel="nofollow">https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;76409619&#x2F;hidden-markov-m...</a>