TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Bistring – Bidirectionally Transformed Strings

72 点作者 varunagrawal将近 6 年前

3 条评论

zawerf将近 6 年前
I was confused about the intended use case but there&#x27;s more information in the docs folder: <a href="https:&#x2F;&#x2F;github.com&#x2F;microsoft&#x2F;bistring&#x2F;blob&#x2F;master&#x2F;docs&#x2F;Introduction.rst" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;microsoft&#x2F;bistring&#x2F;blob&#x2F;master&#x2F;docs&#x2F;Intro...</a><p>Apparently it&#x27;s for machine learning where you want to pick out a span&#x2F;substring in the original text but your model can only accept normalized text (I am guessing for stuff like transforming out-of-vocabulary words into UNK&#x2F;unknown tokens). This solves that problem by keeping track of the index mapping between the original text and transformed text.<p>(picking out spans is very common task in NLP, for example see the SQuAD dataset: <a href="https:&#x2F;&#x2F;rajpurkar.github.io&#x2F;SQuAD-explorer&#x2F;explore&#x2F;v2.0&#x2F;dev&#x2F;Normans.html" rel="nofollow">https:&#x2F;&#x2F;rajpurkar.github.io&#x2F;SQuAD-explorer&#x2F;explore&#x2F;v2.0&#x2F;dev&#x2F;...</a>)
评论 #20430054 未加载
评论 #20429134 未加载
andrewflnr将近 6 年前
Somewhat related: Boomerang <a href="https:&#x2F;&#x2F;www.seas.upenn.edu&#x2F;~harmony&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.seas.upenn.edu&#x2F;~harmony&#x2F;</a> Discussed here at least once: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=565874" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=565874</a><p>The title made me think of Boomerang, this looks like it has rather different use cases in mind.
blt将近 6 年前
This is interesting, but the readme doesn&#x27;t say much about use cases. What is a big application that could benefit from this?
评论 #20429281 未加载