Interestingly, we've taken the same approach to process historical document (like 18th Venetian manuscripts).<p>We even use a Unet architecture with a pretrained resnet50 encoder, and some postprocessing to go from prob maps to polygons, like this project does. Of course, we are much more limited than what you propose, but it is reassuring our side project took the same course as what bugger entities do.<p><a href="https://dhlab-epfl.github.io/dhSegment/" rel="nofollow">https://dhlab-epfl.github.io/dhSegment/</a>