TechEcho

12 comments

bambax, 6 months ago

> The autosegmentation jumps frequently between adjacent sheets, so is not yet precise enough to reveal contiguous texts, but it coarsely follows the entire scroll.

Maybe a stupid idea, but has anyone tried making a new scroll with known content and markers at known coordinates, then cooking it to bring it to a state close to the ones we're trying to unroll? You could then scan it and use that to fine-tune the software.

There are probably simple insights that are extremely difficult to discover when looking at an entirely new problem, and that would become more obvious when one already knows the original inside out.
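The value of a scroll with known content can be sketched with a toy calibration: when the true ink positions are known, a detector can be tuned against them and scored. The sketch below is only an illustration under stated assumptions. The intensity model, the noise level, and the names `simulate_scan`, `accuracy`, and `best_threshold` are all hypothetical, and a plain threshold stands in for whatever model a real team would fine-tune.

```python
import random

def simulate_scan(labels, ink_signal=0.3, noise=0.2, seed=0):
    """Fake scan intensities (assumed model): papyrus reads about 1.0,
    ink adds a small offset, and Gaussian noise is added on top."""
    rng = random.Random(seed)
    return [1.0 + (ink_signal if lab else 0.0) + rng.gauss(0.0, noise)
            for lab in labels]

def accuracy(values, labels, threshold):
    """Fraction of positions where 'value > threshold' matches the known label."""
    hits = sum((v > threshold) == lab for v, lab in zip(values, labels))
    return hits / len(labels)

def best_threshold(values, labels, candidates):
    """Pick the candidate threshold that agrees best with the known labels."""
    return max(candidates, key=lambda t: accuracy(values, labels, t))

# Known ground truth: we "wrote" this scroll ourselves, so every label is certain.
label_rng = random.Random(42)
labels = [label_rng.random() < 0.3 for _ in range(2000)]
values = simulate_scan(labels)

candidates = [0.8 + 0.01 * i for i in range(80)]
threshold = best_threshold(values, labels, candidates)
print("chosen threshold:", round(threshold, 2))
print("accuracy on known scroll:", round(accuracy(values, labels, threshold), 3))
```

Without the known labels there would be nothing to score the threshold against, which is the point of the proposal.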

ggm, 6 months ago

Given this is a join over image analysis, text recognition, data science and a huge, complex 3D analytical model of scans which has to be mapped to the surface states, unrolled, and then subjected to edge and other discrimination, I think the application of ML and other novel techniques is great.

The potential for applying the lessons learned to other problems in complex surface/manifold scanning, and in "reading" states from disparate imaging systems, has big upsides.

I'm not particularly sure anyone is claiming this is an LLM demonstrator or proves AGI is coming, so if you permit me to float a strawman: it isn't.

It's great science. Very impressive work.

prashp, 6 months ago

Glad to see there are more developments in the Vesuvius challenge - it has been one of the most interesting things I've discovered on HN.

someothherguyy, 6 months ago

Related: https://news.ycombinator.com/item?id=39261861

kgeist, 6 months ago

> the sequence τυγχαν may be the beginning of the verb τυγχάνω: “to happen,” or perhaps “not to happen.”

It says "me tunkhan..", which would indeed mean "let it not happen". The particle "me" means imperative "not". Or it can be part of a condition: "in case it does not happen..."

> there might be the beginning of διατροπή, a word found in other Herculaneum papyri that would mean something like “confusion, agitation, or disgust.”

But it could also be just diatropos - "various, diverse". "Diatrope" can also mean "shame" and "attack (of a disease)" (for example, diatropai nautiodeis - "nausea").

NKosmatos, 6 months ago

Being Greek and having bad handwriting, I'm sure I could help with deciphering some letters. Heck, I can even see a capital lambda “Λ” in the low-res image :-) This would be a good case for a crowdsourced science project with Greek universities/high schools.

moregon, 6 months ago

This project is a gem. I invite everybody to read their landing page, especially the page announcing last year's Grand Prize winner, where they also briefly describe the project [1], and the Master Plan [2], where they talk about their goals.

As a recap:

- The real, narrative part of ancient Roman and Greek history comes from the tiny minority of texts that survived by being copied through the centuries by medieval monks. We know a lot through archaeology, epigraphy (engraved stones) etc., but the meat comes from the few ancient historians, philosophers, poets and so on that we can read because medieval clerics thought them worthwhile to preserve.
- An exception to this are papyri, ancient "paper", on which they wrote both high literature and grocery lists. They were used all over the ancient world, but most of them survived only in Egypt and other dry areas, for obvious reasons. They represent the one direct link to the texts as they were written at the time, apart from engraved stones (which, though, tend to be mostly gravestones, with some laws and political stuff thrown in). Unfortunately, the great majority of papyri are fragments, and most of them concern bureaucratic stuff like receipts, contracts and the like, with sometimes a private letter or half a page from a literary work. Precious for historians, but not the kind of thing that changes our knowledge of history.
- But here comes the Villa dei Papiri in Herculaneum, the town that shared the fate of Pompeii and was covered by volcanic ash from the eruption of Vesuvius in 79 A.D. The Villa was the home of a Greek philosopher, and there, in the mid-18th century, people found 300 carbonized scrolls from the guy's study. These scrolls are an absolute rarity: hundreds of complete works, most likely never seen before, from the heyday of the Roman Empire. They're probably mostly philosophical books in Greek, but they could also contain lost plays, unknown great poets, or histories of periods for which we have few or no sources (we know that there were whole histories of the career of Alexander the Great that are now lost, we have decades-wide holes in our knowledge of most of classical history, etc.).
- Unfortunately, these 300 scrolls are just lumps of coal. They've been cooked by the volcano's ashes and fused shut. Any attempt to open them in the past destroyed most of the scroll, and for hundreds of years they've been considered lost.
- Until today! A breakthrough in CT scanning technology (brought by one of the founding teams of this project) has made it possible to scan this kind of ancient scroll with X-rays, accessing the internal "pages" without destroying them.
- Having a scan of the internal volume of the scrolls was all well and good, but you still couldn't read anything! The scan doesn't pick up the ink, and it wasn't at all certain that there was a way to do it. That was the objective of last year's challenge: gathering a community of competitors and collaborators to use computer vision and machine learning to virtually unwrap the scan and detect the ink inside, using AI's ability to find patterns invisible to the human eye.
- In only 8-9 months last year's challenge was completed successfully, earning the winning team a big prize (almost a million, if I remember correctly?). We were able to read some pages from inside a sample scroll, proving once and for all that the task is possible!
- The goal of 2024 was to expand this PoC to read 5 whole scrolls and to improve the scanning process. At the moment we don't know if the model developed for last year's Grand Prize can be applied to the text of other scrolls, and anyway the whole scanning-and-virtual-unwrapping thing is incredibly time-consuming and expensive and requires extensive optimization. I don't think there's been any major breakthrough so far, but of course many teams could be waiting for the end-of-year deadline to publish, since it's still a competition with money involved.
- If the project is successful, the long-term gains could be astounding. It's not only the 300 scrolls we already possess, but the possibility that a whole library could exist, yet to be excavated, in the still-buried part of the Villa. You have to consider that its owner was a rich magnate hosting Greek philosophers for the heck of it. It's probable that he owned a big library, far bigger than the comparatively small one found in the philosopher's study. If we can develop a method to reliably read carbonized scrolls, the political impetus to dig out the rest of the site would be difficult to resist. I'm Italian, I'd personally go to Rome to protest against the government if they didn't allow it :D
- Finding this hypothetical library would be like finding a mini Library of Alexandria, a revolution in our knowledge of the ancient world. If you're even just a little bit interested in this kind of stuff, this is the Holy Grail!

As a programmer (boring CRUD stuff) with a master's degree in ancient history (but I've forgotten most of my Greek and Latin), this project tickles both sides of my life, my old academic aspirations and my current career. Unfortunately I'm not advanced enough in either of them to really contribute, since the tech part is super-advanced CV and ML stuff I can't even pronounce, and deciphering papyri is a whole new ball game compared with the tame texts I was translating at university. That's why I'm trying to evangelize about it, to at least contribute a little to its success!

[1] https://scrollprize.org/grandprize
[2] https://scrollprize.org/master_plan

brabel, 6 months ago

They say they have Python and C APIs that can be used to explore the scroll. I had a look and they have a "tutorial" in a Python notebook: https://colab.research.google.com/github/ScrollPrize/vesuvius/blob/main/notebooks/example1_data_access.ipynb

But I can't make any sense of it, unfortunately :( Can someone explain, in terms a programmer would understand, how I would go about using this API to find the text? As far as I can see the dataset just contains a bunch of vertical and horizontal slices of the scroll, and I have a hard time understanding how that can provide anything about what's written in them.
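One way to picture how a stack of slices can hold text: the slices together form a 3D volume, the rolled sheet is a curved surface inside it, and reading the voxels along that surface, slice by slice, gives a flat image. The toy sketch below uses plain Python and does not use the `vesuvius` library or its API. The spiral path, the voxel codes, and the names `spiral_points`, `make_volume`, and `unwrap` are hypothetical. It also assumes the sheet's path is already known and that ink voxels are directly visible, which are exactly the hard parts in the real scans (segmentation and ink detection).

```python
import math

def spiral_points(n_samples, size):
    """Unique integer (x, y) pixels along a two-turn spiral: the assumed sheet path."""
    cx = cy = size // 2
    path, seen = [], set()
    for i in range(n_samples):
        t = i / n_samples
        theta = 4 * math.pi * t              # two full turns
        r = 2 + (size / 2 - 4) * t           # radius grows as the sheet winds outward
        p = (round(cx + r * math.cos(theta)), round(cy + r * math.sin(theta)))
        if p not in seen:                    # keep each pixel once
            seen.add(p)
            path.append(p)
    return path

def make_volume(path, depth, size, ink):
    """Stack of `depth` 2D slices; 0 = air, 1 = papyrus, 2 = papyrus with ink."""
    volume = [[[0] * size for _ in range(size)] for _ in range(depth)]
    for z in range(depth):
        for s, (x, y) in enumerate(path):
            volume[z][y][x] = 2 if ink(s, z) else 1
    return volume

def unwrap(volume, path):
    """Read each slice along the sheet path, giving one flat row per slice."""
    return [[slice_[y][x] for (x, y) in path] for slice_ in volume]

def ink(s, z):
    """Toy 'writing': diagonal strokes across the flattened sheet."""
    return (s + z) % 7 == 0

SIZE, DEPTH = 32, 8
path = spiral_points(400, SIZE)
volume = make_volume(path, DEPTH, SIZE, ink)
flat = unwrap(volume, path)

# In a single slice the sheet is just a spiral; the strokes only appear once flattened.
for row in flat:
    print("".join("#" if v == 2 else "." for v in row))
```

In this toy the path is handed to `unwrap` for free; in the real data it has to be traced through tightly packed, damaged layers, and the ink has to be inferred from subtle texture rather than read off as a voxel value.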

akie, 6 months ago

This is amazing! I couldn't imagine such a thing possible, it's basically sci-fi if you think about it. However, I couldn't help but chortle and think of my GP when I read the words "yet more text tantalizingly close to legibility".

cerebra, 6 months ago

This is really interesting and a challenge for folks who love to solve puzzles like this. Can't wait to see what folks are able to uncover.

I wonder if any of the techniques used on other similarly decoded scrolls can work here.

jaythekiwi, 6 months ago

What an interesting technical challenge and puzzle. The fact these were traded for a few kangaroos is hilarious - I wonder who decided on that exchange rate!

Simon_ORourke, 6 months ago

I would defund many police forces belonging to "constitutional sheriffs" just to put together a fund to translate a few of these scrolls. Granted, it's probably "just" an Epicurean library, but all the same it's a good investment.

Vesuvius Challenge: First letters found in new scroll
