1 pointsby svcrunchabout 1 month ago

1 comment

svcrunchabout 1 month ago

Various frontier LLMs were evaluated on their ability to interpret handwritten proofreading marks in printed literary text, using a small benchmark based on Charles Dickens's "Little Dorrit". Results are modest at best, and surprisingly variable across repeated runs, even on the same pages, underscoring the challenge in building reliable, structured-document systems with current multimodal LLMs.<p>Curious to hear thoughts from others working on similar problems.

评论 #43641187 未加载

Can GPT-4o Accurately Read Handwritten Proofreading Marks?

1 comment

Can GPT-4o Accurately Read Handwritten Proofreading Marks?

1 comment