TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

In Defense of OpenStreetMap's Data Model

265 pointsby SteveCoastalmost 3 years ago

22 comments

zigzag312almost 3 years ago
Author states:<p>&gt;And that’s the point, rules and complexity have completely unknowable downsides. Downsides like the destruction of the whole project. With each rule and added complexity you make the system less human and less fun. You make it a Computer Scientists rube goldberg machine while sterilizing it of all the joy of life.<p>While too much rules and complexity can certainly be bad, some basic amount of standardization can actually reduce complexity and really doesn&#x27;t cause a &quot;destruction of the whole project&quot;.<p>As a counterpoint, too much flexibility can also increase complexity. For example, without defined rules, 5.6.2022 can mean 5. June 2022 or 6. May 2022. Nor user, nor parser can know for sure what it means, if standard isn&#x27;t defined. This kind of flexibility certainly isn&#x27;t fun.<p>Example from OSM wiki for &quot;Key:source:date&quot;:<p>&gt; There is no standing recommendation as to the date format to be used. However, the international standard ISO 8601 appears to be followed by 9 of the top 10 values for this tag. The ISO 8601 basic date format is YYYY-MM-DD. <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Key:source:date" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Key:source:date</a><p>Just define some essential standards. It won&#x27;t lead to destruction of the project!<p>And while you are making breaking changes, please fix the &#x27;way&#x27; element. Maps are big. Storing points in ways as 64bit node-ids, while coordinates in nodes are also 64-bit (32bit lon and 32bit lat), just leads to wasted space and wasted processing time. There are billions of these nodes and nearly all of these nodes don&#x27;t have tags, just coordinates. There is no upside for this level of indirection. And in case tags are needed for a point, this can already be solved with a separate node and a &#x27;relation&#x27; element.<p>OSM data format could certainly be improved and it would benefit end users, as better tools&#x2F;apps could be made more quickly and easily.
评论 #31639680 未加载
评论 #31639715 未加载
RicoElectricoalmost 3 years ago
The proposed improvements would obsolete a bunch of problems such as broken polygons [1] which happen regularly. They would also make processing OSM more accessible without needing to randomly seek over GBs of node locations just to assemble geometries which takes a significant runtime percentage of osm2pgsql.<p>For me Steve Coast lost his credibility when he joined the closed and proprietary what3words.<p>[1] <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;OSM_Inspector&#x2F;Views&#x2F;Multipolygons" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;OSM_Inspector&#x2F;Views&#x2F;Mult...</a>
评论 #31647197 未加载
评论 #31639285 未加载
stevagealmost 3 years ago
In the first part of the article I was thinking, oh, maybe Steve Coast isn&#x27;t such a jerk after all.<p>Then I got to the meat of it. Oh dear.<p>As one of the many many people who has had to deal with OSM data, I curse people with this attitude that the mess is somehow desirable or necessary. It&#x27;s not. There is a long spectrum between totally free form and completely constrained, and OSM&#x27;s data model is painfully down the wrong end, and causes enormous harm to all kinds of potential reuses of the data.<p>It also causes harm to the people creating data. Try adding bike paths and figuring out what tags are appropriate in your area. Try working out how to tag different kinds of parks, or which sorts of administrative boundaries should be added or how they should be maintained. It puts many people off, me included.<p>Bah.
评论 #31640677 未加载
Sujanalmost 3 years ago
I think those are the important bits:<p>&gt; The Engineering Working Group (EWG) of the OSMF has “commissioned” (I think that’s OSMF language for paid) a longstanding proponent of rules and complexity to, uh, investigate how to add rules and complexity to OSM.<p>&gt; [...]<p>&gt; Let us pray that the EWG is just throwing Jochen a bone to go play in the corner and stop annoying the grownups.<p>It&#x27;s a &quot;response&quot; to <a href="https:&#x2F;&#x2F;blog.openstreetmap.org&#x2F;2022&#x2F;06&#x2F;02&#x2F;announcement-data-model-study&#x2F;" rel="nofollow">https:&#x2F;&#x2F;blog.openstreetmap.org&#x2F;2022&#x2F;06&#x2F;02&#x2F;announcement-data-...</a>
评论 #31649513 未加载
matkonieczalmost 3 years ago
&gt; The harder you make it for them to edit, the less volunteers you’ll get.<p>And that is why dedicated area type (rather than representing areas with lines or special relations[0]) could help new mappers and new users of data.<p>There would be very significant transition costs, but maybe it would be overall beneficial.<p>It is possible to have objects that are both area and line at once. Or area according to one tool&#x2F;map&#x2F;edtor and line according to another.<p>And many multipolygon relations are in inconsistent state and require manual fixup.<p>Also, complexity of entire area baggage makes explaining things to newbies more complex. You can either try to hide complexity (used by iD in-browser-editor) leaving people hopelessly confused when things are getting complex or present full complexity (JOSM) causing people to be overwhelmed.<p>See <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Area#Tags_implying_area_status" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Area#Tags_implying_area_...</a> for a start of a complexity fractal.<p>[0] <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Area" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Area</a>
评论 #31641754 未加载
JackFralmost 3 years ago
I know it’s nothing to do with the main thrust of the article, but the author fundamentally misrepresents KYC. Know-your-customer is a facet of anti-money laundering and anti-corruption regulation. It has nothing to do with talking to users.
评论 #31640618 未加载
matkonieczalmost 3 years ago
&gt; Let us pray that the EWG is just throwing Jochen a bone to go play in the corner and stop annoying the grownups.<p>That is neither helpful, not useful, nor making me more likely to treat this diatribe more seriously.
评论 #31641801 未加载
kawsperalmost 3 years ago
I’ve started playing with data from OpenStreetMap. It started with me trying to fetch all the places where I could get water when moving around Copenhagen, which turned out not to be as easy as first envisioned, because OSM seems to have a lot of different ways to categorise available water, which makes sense, OSM and the tagging system isn&#x27;t there to support only my usecase, and describing my idea doesn&#x27;t fit 1:1 with the model.<p>I identified the following tags to look out for:<p>amenity=drinking_water, <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Tag:amenity%3Ddrinking_water" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Tag:amenity%3Ddrinking_w...</a><p>man_made=water_tap, <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Tag:man_made%3Dwater_tap" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Tag:man_made%3Dwater_tap</a><p>amenity=water_point, <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Tag:amenity%3Dwater_point" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Tag:amenity%3Dwater_poin...</a><p>drinking_water=*, <a href="https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Key:drinking_water" rel="nofollow">https:&#x2F;&#x2F;wiki.openstreetmap.org&#x2F;wiki&#x2F;Key:drinking_water</a><p>It&#x27;s a tough problem to map out the world and describe it, especially when everyone can add or modify the data, but anything that could improve the experience of importing like osm2pgsql would be welcome.
评论 #31639381 未加载
评论 #31645507 未加载
评论 #31647692 未加载
londons_explorealmost 3 years ago
The exact same argument that praises OSM&#x27;s super flexible tagged node data model should also praise MS Excel for the number of things that can be achieved in the world of business with just a grid of boxes.<p>Both have been hugely successful, and both have the same pile of downsides.
评论 #31639537 未加载
gennarroalmost 3 years ago
Skip about 1&#x2F;3 of the way down and the OSM article starts. “This is why bad design is everywhere...”
评论 #31639079 未加载
评论 #31639438 未加载
评论 #31639071 未加载
seoaeualmost 3 years ago
The claim that a dataset with billions of users has only <i>dozens</i> of people able&#x2F;interested in doing data processing on it is a damning admission that the format is too hard to deal with
评论 #31643416 未加载
评论 #31641912 未加载
xg15almost 3 years ago
The author is ranting a lot about OSMF&#x27;s recent decisions, gives all kinds of reasons why they will undoubtedly lead to horrible consequences and grants sage advice what should have been done instead.<p>The only thing I&#x27;m missing is any indication that OSMF&#x27;s course actually did cause any problems in reality.
teddyhalmost 3 years ago
What happened to the title? It used to be “In Defense of OpenStreetMap&#x27;s Data Model”, which is the literal blog post title. Someone has now <i>changed it</i> to the boring-sounding “OpenStreetMap&#x27;s Data Model”, probably resulting in fewer clicks.
westnordostalmost 3 years ago
Just for context, here is the current OSM data model in a nutshell:<p>There are three data types - nodes, ways and relations. All of the three can have any number of &quot;tags&quot; (i.e. a map&lt;string, string&gt;), which define their semantic meaning. For example, a way with `barrier=fence` is a fence. This stuff is documented in the openstreetmap wiki.<p>A node is a point with a longitude and a latitude.<p>A way is a sequence of nodes.<p>A relation is a collections of any number of nodes, ways or other relations. Each member of this collection can be assigned a &quot;role&quot; (string). Again, the semantics of what each role means is documented in the openstreetmap wiki.<p>To modify data, simply new versions of the edited data are uploaded via the API.<p>---<p>The most prominent point that stands out here is that only nodes have actual geometry.<p>This means that...<p>1. to get the geometry of a way (e.g a building, a road, a landuse, ...), data users first need to get the locations of all the nodes the way references. For relations, it is even one more step.<p>2. in order to edit the course of a way, editors actually edit the location of the nodes of which the way consists of, not the way itself. This means (amongst other things) that the VCS history of that way does not contain such changes
boredumbalmost 3 years ago
Street-Complete on android is a neat way to contribute to OSM.
uptimealmost 3 years ago
I have been looking into improving kerb and traffic_signals data for some bboxen. It is daunting, and I figure I need to work backwards - try to find out what the accessibility map apps look for and use those pairs. If I know what target I am shooting for I guess it will be alright. This is like my first week looking into this and I hope to find these targets soon.
评论 #31639901 未加载
anticristialmost 3 years ago
&gt; The harder you make it for them to edit, the less volunteers you’ll get.<p>I&#x27;m not sure I like OSMs obsession with the data <i>model</i>. I guess this has to do with its business model, but then let&#x27;s not pretend the product manager is tasked to optimize for end-users.<p>I&#x27;d like it to focus on the UI. The easier it is to input a geographic thingie and the easier it is to visualize the geographic thingie, the better OSM both for users and volunteers.<p>Two issues to strengthen my point:<p>1. The osm-tag mailing list regularly discusses how tags are visualized in various renderers when recommending which to choose.<p>2. Quick mobile-based correction are nearly impossible with OSMAnd. I&#x27;d love to take a picture and write a quick note like &quot;speed limit changed&quot;, so that someone (perhaps a bot) can pick this up and update the data model. Same with restaurant opening hours. Or various POIs.
评论 #31645804 未加载
tinus_hnalmost 3 years ago
What is stopping users who have a problem with the model from transforming the data into a form that is better for their use case?
评论 #31639335 未加载
评论 #31641519 未加载
mring33621almost 3 years ago
So, come up with an improved format that has&#x2F;enforces various &#x27;rules&#x27; and also provide a conversion program for moving between the new&#x2F;old format, as desired.<p>Slowly deprecate the nastiest parts of the old, as people get used to the new format.
an9nalmost 3 years ago
There&#x27;s an amusing paradox I&#x27;ve seen many times amongst GIS managers - their complaints about how dreadful OSM data is are pretty much the direct opposite of their enthusiasm to use it!
pete_nicalmost 3 years ago
&gt;If you won’t or can’t talk to customers then at least make the thing simple. The simpler something is, the more people will use it.<p>Great advice
chapsalmost 3 years ago
Heh, their data model is 99% of the reason why I don&#x27;t use OSM. It&#x27;s scattered all over the place with <i>so many tables</i>! It&#x27;s such a nice project, but <i>damn</i> is it impossible to work with programmatically, let alone poke around it to discover what&#x27;s all in there.
评论 #31640247 未加载
评论 #31651288 未加载