Ghostwriter – use the reMarkable2 as an interface to vision-LLMs

211 points by wonger_ 3 months ago

17 comments

awwaiid 3 months ago
Project author here -- happy to elaborate on anything; this is a continuous WIP project. The biggest insight has been the limitations of vision models in spatial awareness -- see https://github.com/awwaiid/ghostwriter/blob/main/evaluation_results/2024-12-29_21-05-47/results.md for some sketchy examples of my rudimentary eval.

Next top things:

* Continue to build/extract into a yaml+shellscript agentic framework/tool
* Continue exploring pre-segmenting or other methods of spatial awareness
* Write a reSvg backend that sends actual pen-strokes instead of lots of dots
0xferruccio 3 months ago
This is so cool! I love to see people hacking together apps for the reMarkable tablet.

I made a little app for reMarkable too and I shared it here some time back: https://digest.ferrucc.io/
vendiddy 3 months ago
I wish the reMarkable tablets weren't so locked down.

It's one of my favorite pieces of hardware, and I wish there were more apps for it.
owulveryck 3 months ago
Awesome.

I've wanted to implement this for months. You did a really good job.
rpicard 3 months ago
This is so cool. I'm going to try it this weekend.

I've been playing with the idea of auto-creating tasks when I write todos by emailing the PDF and sending it to an LLM.

This just opened up a whole realm of better ways to accomplish that goal in real time.
t0bia_s 3 months ago
How about this on Android-driven Onyx Boox e-readers? Would it be possible?
memorydial 3 months ago
This is a brilliant use case: handwriting input combined with LLMs makes for a much more natural workflow. I wonder how well it handles messy handwriting and if fine-tuning on personal notes would improve recognition over time.
vessenes 3 months ago
Love this! There are some vector diffusion models out there; why not use tool calling to outsource to one of those if the model decides to draw something? Then it could specify a coordinate range and the prompt.
xtiansimon 3 months ago
For PDF paper readers, is the reMarkable's 11” size sufficient? I have the Sony DPT 2nd version at 13”, and it's a perfect viewing experience. But projects like this keep drawing me to the reMarkable product.
3abiton 3 months ago
I own a Boox tablet (a full-fledged Android tablet with an e-ink screen), and this sort of thing would be perfect for it. I wonder if in 5 years mobile hardware will support something like that locally!
complex1314 3 months ago
Really cool. Would this run on the reMarkable Paper Pro too?
chrismorgan 3 months ago
> Things that worked at least once:

I like it.
seethedeaduu 3 months ago
Kinda unrelated, but should I go for a Kobo or the reMarkable? I mostly want to read papers and maybe take notes. How do they compare in terms of hackability and freedom?
newman314 3 months ago
I wonder if this can be abstracted to accept interaction from a Daylight too.
cancelself 3 months ago
@apple.com add to iPadOS Notes?
tony_francis 3 months ago
Harry Potter Half-Blood Prince vibes. Interesting just how much the medium changes the feeling of interacting with a chat model.
8bithero 3 months ago
Not to distract from the project, but if anyone is interested in e-ink tablets with LLMs, the ViWoods tablet might be of interest to you.