TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Overture Maps Foundation releases open map dataset

402 点作者 chippy将近 2 年前

16 条评论

tony_cannistra将近 2 年前
For context, Overture Maps is a project that intends to enable large players in the geospatial space (TomTom, Amazon, Microsoft, but notably _not_ Google) to leverage open data sets (OpenStreetMap among them) alongside proprietary data and processes that they own.<p>The consortium intends to enable a framework for enhancing geospatial data based in open data sets (e.g. OSM) with their own proprietary processes, and re-release it with a permissive license (the Community Database License Agreement - CDLAv2), but keep the data and processes required to <i>create</i> that dataset proprietary.<p>The project has created a lot of conversation in the OpenStreetMap community, but in general I think it&#x27;s good to see so many resources put into the OSM-adjacent world.
评论 #36883502 未加载
评论 #36882385 未加载
评论 #36887425 未加载
评论 #36881671 未加载
joshstrange将近 2 年前
If you are looking for maps themselves (a la Google Maps) I highly recommend checking out Protomaps [0]. It provides a single file (PMTile format) that contains all the map data and you can request ranges from the single file to get the required vector data you need for a given area&#x2F;zoom.<p>So you can setup something like the following:<p><pre><code> S3 (host the myfile.pmtiles) &lt;- Lambda (takes the x&#x2F;y&#x2F;z from the path and requests the correct range) &lt;- Cloudfront cache tile response </code></pre> Then you can setup tiles.mydomain.com (or use the cloudfront domain directly) and then use Leaflet or similar on the frontend to fetch&#x2F;render the tiles. For Leaflet you use the protomaps plugin&#x2F;lib and give it a url like &quot;<a href="https:&#x2F;&#x2F;tiles.yourdomain.com&#x2F;20230408&#x2F;{z}&#x2F;{x}&#x2F;{y}.mvt" rel="nofollow noreferrer">https:&#x2F;&#x2F;tiles.yourdomain.com&#x2F;20230408&#x2F;{z}&#x2F;{x}&#x2F;{y}.mvt</a>&quot; where &quot;20230408&quot; maps to &quot;20230408.pmtiles&quot; in your S3 bucket. Now I can drop new pmtiles files into that bucket and update my clients to use the new source. And since the tiles are in vector format you can theme them however you want in the client which is neat. Lastly you don&#x27;t have to use the 100+GB whole-earth tileset. You can use a tool [1] (provided by the same guy) to download a dataset for just a given geographical region.<p>The .pmtiles file is a little over 100GB but the whole setup took me only an hour or two max to get running and will cost way less than Google Maps to run.<p>[0] <a href="https:&#x2F;&#x2F;protomaps.com&#x2F;" rel="nofollow noreferrer">https:&#x2F;&#x2F;protomaps.com&#x2F;</a><p>[1] <a href="https:&#x2F;&#x2F;app.protomaps.com&#x2F;downloads&#x2F;small_map" rel="nofollow noreferrer">https:&#x2F;&#x2F;app.protomaps.com&#x2F;downloads&#x2F;small_map</a>
评论 #36881901 未加载
评论 #36897527 未加载
评论 #36882814 未加载
mike_d将近 2 年前
&gt; Administrative Boundaries: A global open dataset of national and regional administrative boundaries, this boundary data includes regional names which have been translated into over 40 different languages to support international use.<p>Oh that is going to be fun. If I recall correctly Google Maps alters the boundaries of places based on the views of the location the map is being requested from to avoid getting in the middle of disputes.<p>Not &quot;correctly&quot; showing boundaries is a crime in many countries.<p>Edit: here is a source <a href="https:&#x2F;&#x2F;qz.com&#x2F;224821&#x2F;see-how-borders-change-on-google-maps-depending-on-where-you-view-them" rel="nofollow noreferrer">https:&#x2F;&#x2F;qz.com&#x2F;224821&#x2F;see-how-borders-change-on-google-maps-...</a>
评论 #36890349 未加载
jijji将近 2 年前
I like that they provide this data, however when you try to actually retrieve it, It looks like they went out of their way to make it as convoluted as possible to try to retrieve any of the actual data.... so you have to use DuckDB and then do the import using that? why not just support mysql dump files? why require someone to have DuckDB? is DuckDB that popular? also the links they provide don&#x27;t work so it doesn&#x27;t look like any of it&#x27;s available.... how is somebody supposed to use this stuff? they require you to also have Amazon S3 with some query language that&#x27;s non-standard to be able to talk to it.... I don&#x27;t get it I&#x27;m sure I&#x27;m not the only one but it needs to be more generic instead of the way they&#x27;re doing it
评论 #36883654 未加载
dhx将近 2 年前
I have an interest in this topic as a contributor to AllThePlaces[1], an open source project collating Scrapy spiders (MIT license) that crawl websites of franchises&#x2F;retail chains that you&#x27;d find listed in name-suggestion-index[2] to retrieve location data (CC-0 license). The project is just short of collecting 3 million points of interest from almost 1700 spiders.<p>Overture Maps appears to be quite a closed and proprietary project, with claims of openness limited to being able to download a data set and accompanying schema specification. Some issues that immediately come to mind:<p>1. There is no published description for how the data was generated. End users thus are given no assurance of how accurate and complete the data is.<p>a. As an example, administrative boundaries are frightfully complex and include disputed boundaries, significant ambiguity in definition of boundaries, and trade-off between precision of boundaries versus performance of algorithms using administrative boundary data. Which definition of a boundary does Overture Maps adhere to, or can it support multiple definitions?<p>b. It&#x27;s probable that Microsoft have contributed ld+json&#x2F;microdata geographic data from BingBot crawls of the Internet. This data is notoriously incorrect, including fields mixed up and invalidly repurposed, &quot;CLOSED&quot; in field names to denote closure of a place 5 years ago but the web page remains online, and much ambiguity in opening hours specifications. For AllThePlaces, many of the spiders developed require human consideration, sometimes of considerable complexity, to piece together horribly messy data that is published by shop and restaurant franchises, and other organisations providing location data via their websites.<p>c. For location information where +&#x2F;- 1-5m accuracy and precision may be required (e.g. individual shops within a shopping centre[3]), source data is typically provided by the authoritative sources with 1mm precision and +&#x2F;- 10-100m accuracy. AllThePlaces, Overture Maps, Google Maps and similar still need human editors (OpenStreetMap editors) to do on-the-ground surveys to pinpoint precise locations and to standardise the definition of a location (e.g. for a point, should it be the centroid of the largest regular polygon which could be placed in the overall irregular polygon, the center of mass of a planar lamina, the location of the main entrance, or some other definition?).<p>d. If Overture Maps is dependent on BingBot for place data, they&#x27;ll miss an enormous number of points of interest that BingBot would never be able to find. For example, an undocumented REST&#x2F;JSON&#x2F;GraphQL API call or modification to parameters to an observed store locator API call may be necessary to return all locations and relevant fields of data. Website developers routinely do stupid things with robots.txt such as instruct a bot to crawl 10k pages (1GB+) from a sitemap last updated 5 years ago rather than make 10 fast API calls for up-to-date data (5MB). Overture Maps would be free to consume data from AllThePlaces as it is CC-0 licensed, and possibly correlate it with other data sources such as BingBot crawl data, a government database of licensed commercial premises or postal address geocoding data. However the messiness of data in various sources would be approaching impossible to reconcile, even for humans, and Overture Maps would possibly have to decide whether to err on the side of having duplicates, or lack completeness.<p>2. There is no published tooling for how someone else can reproduce the same data.<p>a. AllThePlaces users fairly frequently experience the wrath of Cloudflare, Imperva and other Internet-breaking third parties, as well as custom geographic blocking schemes and more rarely, overzealous rate limiting mechanisms. If Overture Maps is dependent on BingBot crawls, they&#x27;ll have a slight advantage over AllThePlaces due to deliberate whitelisting of BingBot from the likes of Cloudflare, Imperva, customer firewalls, etc. However, no matter whether you&#x27;re AllThePlaces or Overture Maps or anyone else, if you want to capture as many points of interest as possible across the world, use of residential ISP subnets and anti-bot-detection software is increasingly required. They&#x27;ll need people in dozens of countries each crawling websites targeted to the same country, using residential ISP address space. Otherwise they end up with an American view of the world, or a European view of the world, or something else that isn&#x27;t the full picture.<p>b. If Overture Maps has locations incorrect for a franchise&#x2F;brand due to a data cleansing problem or sourcing data from a bad source (perhaps non-authoritative), there are no software repositories for the franchise&#x2F;brand to raise an issue or submit a patch against.<p>[1] <a href="https:&#x2F;&#x2F;www.alltheplaces.xyz&#x2F;" rel="nofollow noreferrer">https:&#x2F;&#x2F;www.alltheplaces.xyz&#x2F;</a><p>[2] <a href="http:&#x2F;&#x2F;nsi.guide&#x2F;" rel="nofollow noreferrer">http:&#x2F;&#x2F;nsi.guide&#x2F;</a><p>[3] Example Australian shopping centre as captured by AllThePlaces: <a href="https:&#x2F;&#x2F;www.alltheplaces.xyz&#x2F;map&#x2F;#18.07&#x2F;-33.834646&#x2F;150.989528" rel="nofollow noreferrer">https:&#x2F;&#x2F;www.alltheplaces.xyz&#x2F;map&#x2F;#18.07&#x2F;-33.834646&#x2F;150.98952...</a>
评论 #36880424 未加载
评论 #36880312 未加载
mikeyouse将近 2 年前
The GitHub repo is here; <a href="https:&#x2F;&#x2F;github.com&#x2F;OvertureMaps&#x2F;data">https:&#x2F;&#x2F;github.com&#x2F;OvertureMaps&#x2F;data</a> -- Licenses look super permissive - I&#x27;m not that familiar with the state of Open Mapping but the GERS idea looks great too.<p><a href="https:&#x2F;&#x2F;docs.overturemaps.org&#x2F;gers&#x2F;" rel="nofollow noreferrer">https:&#x2F;&#x2F;docs.overturemaps.org&#x2F;gers&#x2F;</a>
评论 #36881960 未加载
polemic将近 2 年前
It&#x27;s unclear to me how Overture manages to &quot;license wash&quot; OSM data here.<p>&gt; <i>Transportation: The OMF’s Transportation layer represents a worldwide road network derived from data in the OpenStreetMap project. This community-built data has been recast into the Overture data format which provides consistent segmentation of the data and a linear reference system to support additions of data such as speed limits or real-time traffic. </i><p>The OSM ODbL is crystal clear that OSM contributors have to be credited. I don&#x27;t believe that CDLA Permissive v2.0 magically allows Overture to bypass it.<p>--EDIT: I missed that they&#x27;re using different licenses per dataset, the transport theme is OBDL, which I&#x27;m sure will trip up users who are not careful.
评论 #36884689 未加载
评论 #36884289 未加载
ChuckMcM将近 2 年前
Great stuff, at some point I feel like Google is going to have to try to extract even more revenue out of maps than they currently get and it will really empower these sorts of collaborative map systems.<p>I wish there was a way for people to fund satellite imagery that got pushed into these systems after purchase. Sunnyvale, for example, paid for a lot of imagery of the city that they use&#x2F;used in staff discussions about traffic, zoning, etc. It would be nice if they could then push those images into the open data set.
评论 #36883191 未加载
sdfghswe将近 2 年前
What&#x27;s the difference between this and Open Street Maps?
评论 #36881223 未加载
评论 #36880555 未加载
评论 #36881850 未加载
评论 #36885695 未加载
评论 #36883990 未加载
评论 #36890263 未加载
评论 #36880468 未加载
kozikow将近 2 年前
Anyone loaded it to Bigquery yet?<p>I know this is &quot;booo-google&quot;, but I just want to write some joins with other tables I happen to have in bigquery. I&#x27;m wondering if there is some &quot;community&quot; BQ rather than maintaining import of my own.
anukul将近 2 年前
Has anyone been able to visually assess this dataset&#x27;s accuracy against OSM &#x2F; Google Maps etc for any given region?<p>Looks like it will be a while before that can be done, seeing as it uses a custom schema.
divbzero将近 2 年前
Between this, Natural Earth, OpenStreetMap, USGS, and others, the availability of map data today would be stunning to early cartographers.
评论 #36888714 未加载
zmix将近 2 年前
Will OSM incorporate this information into their own map?
Mithridates1将近 2 年前
Does this add impedances to the OSM street segments?
pininja将近 2 年前
POIs, buildings, transportation network, and admin boundaries layers.. on my!
dfgssdfgsdfg将近 2 年前
Is there any quicksand on the map? What about Lil Terry, is he on the map?