Project author here -- happy to elaborate on anything; a continuous WIP project. The biggest insight has been limitations of vision models in spacial awareness -- see <a href="https://github.com/awwaiid/ghostwriter/blob/main/evaluation_results/2024-12-29_21-05-47/results.md">https://github.com/awwaiid/ghostwriter/blob/main/evaluation_...</a> for some sketchy examples of my rudimentary eval.<p>Next top things:<p>* Continue to build/extract into a yaml+shellscript agentic framework/tool<p>* Continue exploring pre-segmenting or other methods of spacial awareness<p>* Write a reSvg backend that sends actual pen-strokes instead of lots of dots
This is so cool! I love to see people hacking together apps for the reMarkable tablet<p>I made a little app for reMarkable too and I shared it here some time back: <a href="https://digest.ferrucc.io/" rel="nofollow">https://digest.ferrucc.io/</a>
This is so cool. I’m going to try it this weekend.<p>I’ve been playing with the idea of auto creating tasks when I write todos by emailing the PDF and sending it to an LLM.<p>This just opened up a whole realm of better ways to accomplish that goal in realtime.
This is a brilliant use case—handwriting input combined with LLMs makes for a much more natural workflow. I wonder how well it handles messy handwriting and if fine-tuning on personal notes would improve recognition over time.
Love this! There are some vector diffusion models out there; why not use tool calling to outsource to one of those if the model decides to draw something? Then it could specify coordinate range and the prompt.
For PDF paper readers, is the Remarkable’s 11” size sufficient? I have the Sony DPT 2nd version at 13”, and it’s perfect viewing experience. But projects like this keep drawing me to the Remarkable product.
I own a boox tablet (full fledge Android tablet with eink screen), and this sort of things would be perfect for it. I wonder if in 5 years the mobile hw would support something like that locally!
Kinda unrelated but should I go for kobo or the remarkable? I mostly want to read papers and maybe take notes. How do tthey compare in terms of hackability and freedom?