DuckDB is probably the most important geospatial software of the last decade

485 points, posted by dbreunig 5 days ago

29 comments

wenc, 5 days ago
I'm a big fan of DuckDB and I do geospatial analysis, mostly around partitioning geographies (into Uber H3 hexagons), calculating Haversine distances, calculating areas of geometries, figuring out which geometry a point falls in, etc. Many of these features have existed in some form or other in geopandas or PostGIS, so DuckDB's spatial extensions bring nothing new.
But what DuckDB as an engine does is let me work directly on parquet/geoparquet files at scale (vectorized and parallelized) on my local desktop. It beats geopandas in that respect. It's a quality of life improvement to say the least.
DuckDB also has an extension architecture that admits more exotic geospatial features like Hilbert curves and Uber H3 support.
https://duckdb.org/docs/stable/extensions/spatial/functions.html#st_hilbert
https://duckdb.org/community_extensions/extensions/h3.html
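For readers unfamiliar with the Haversine distance mentioned in this comment, here is a minimal sketch in plain Python. It is not DuckDB's or geopandas' implementation. The function name `haversine_km` and the Earth radius constant are assumptions chosen for illustration.

```python
import math

# Mean Earth radius in kilometres (an assumed constant; other values are in use).
EARTH_RADIUS_KM = 6371.0088


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    # Haversine formula: a is the squared half-chord length between the points.
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot near antipodal points.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


if __name__ == "__main__":
    # Roughly Paris to London.
    print(round(haversine_km(48.8566, 2.3522, 51.5074, -0.1278), 1))
```

The formula treats the Earth as a sphere, so results can differ slightly from ellipsoidal distance calculations.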
Demiurge, 5 days ago
> Prior to this, getting up and running from a cold-start might’ve required installing or even compiling several OSS packages, carefully noting path locations, standing up a specialized database… Enough work that a data generalist might not have bothered, or their IT department might not have supported it.
I've been able to "CREATE EXTENSION postgis;" for more than a decade. There have been spatial extensions for PG, MySQL, Oracle, MS SQL Server, and SQLite for a long time. DuckDB doesn't make any material difference in how easy it is to install.
larsiusprime, 5 days ago
“import geopandas” also exists and has for some time. Snark aside, WHAT is special about DuckDB? I wish the author had actually shown some practical examples so I could understand their claims better.
jparishy, 5 days ago
I work on geospatial apps and the software I think I am most excited about is https://felt.com/. I want to see them expand their tooling such that maps and data source authentication/authorization were controllable by the developer, to enable tenant isolation with proprietary data access. They could really disrupt how geospatial tech gets integrated into consumer apps.
This article doesn't acknowledge how niche this stuff is, and it takes a lot of training to get people up to speed on coordinate systems, projections, transformations, etc. I would replace a lot of my custom built mapping tools with Felt if it were possible, so I could focus on our core geospatial processes and not the code to display and play with it in the browser, which is almost as big if not bigger in terms of LOC to maintain.
As mentioned by another commenter, this DuckDB DX as described is basically the same as PostGIS too.
perrygeo, 4 days ago
Why? The article is light on details. Yes, having spatial analysis combined with SQL is awesome and very natural. There's nothing special about 2D geometries that makes them significantly different from floats and strings from an RDBMS perspective: geometry is just another column type, albeit with some special operators and indexes. We've been doing it with PostGIS, Spatialite, etc. for two decades at this point.
What DuckDB brings to the table is cloud-native formats. This is standard geospatial functionality attached to an object store instead of a disk. As such, it doesn't require running a database process; data is always "at rest" and available over HTTP. I'm not downplaying the accomplishment, it's really convenient. But know that this is a repackaging of existing tech to work efficiently within a cloud IO environment. If anything, the major innovation of DuckDB is in data management, not geospatial per se.
wodenokoto, 5 days ago
I’m not sure I agree that “install geospatial” is a game changer in simplicity compared to “pip install geopandas”.
They are both one line.
WD-42, 5 days ago
Is it that much simpler than ‘load extension postgis’? I know GEOS and GDAL have always kinda been a pain, but I feel like Docker has abstracted it all away anyway. ‘docker pull postgis’ is pretty easy, granted I’m not familiar with what else DuckDB offers.
twelvechairs, 5 days ago
DuckDB is a great thing for geospatial, but the most important of the past decade? There are so many tools in different categories that it wouldn't come near the top for me. Some might be QGIS, PostGIS (still the standard), ArcGIS Online (still the standard), JS mapping tools like mapbox (I prefer deckgl), new data types like COG, geopackage and geoparquet, photogrammetry tools, 3D tiles, core libraries like gdal and now pdal, shapely, etc.
willtemperley, 4 days ago
I have some concerns regarding the licensing of DuckDB and GEOS, which DuckDB spatial depends on. The former is MIT licensed and the latter LGPL 2.1.
This leads to some complex situations where some builds would contravene LGPL 2.1, e.g. static linking with a closed source application.
oreilles, 4 days ago
Chiming in to promote a similar project, a geospatial extension for Polars [1] I'm working on. It's not stable yet (albeit pretty close), but is already pretty feature complete (it uses GEOS and PROJ as a backend, so has parity with GeoPandas).
[1] https://github.com/oreilles/polars-st/
serjester, 4 days ago
I think this is part of a broader trend of geospatial data just becoming easier to work with. DuckDB is great for quick ad hoc stuff, but I find polars to be easier to maintain. Personally, I'm really excited for polars to (eventually) add true geospatial support. In the meantime, creating a custom h3 plugin only took a couple of days and it simplified massive parts of our old geopandas / duckdb code. The faster we can completely get rid of geopandas, the better.
[1] https://github.com/Filimoa/polars-h3
fifilura, 4 days ago
I have been using Trino/AWS Athena for some geospatial work.
Its API is not very well covered, some parts are still missing, but it must just be a matter of time.
Where it shines is when you need to do an O(n²) or O(nm) type of calculation. Then those 100s of free CPU cores really come in handy! And the end result is pretty often a dollar for CPU-days worth of computation.
Examples of O(nm) calculations are things like finding the closest road segment inside a tile (or more likely a tile and its surrounding tiles), for each point in a list.
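To make the O(nm) shape of that last example concrete, here is a minimal brute-force sketch in plain Python. It is not Trino or Athena code. It assumes planar (already projected) coordinates, and the names `point_segment_distance` and `nearest_segments` are hypothetical.

```python
import math


def point_segment_distance(p, a, b):
    """Euclidean distance from point p to the segment a-b, in planar coordinates."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0.0:
        # Degenerate segment: both endpoints coincide.
        return math.hypot(px - ax, py - ay)
    # Project p onto the line through a and b, then clamp to the segment.
    t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def nearest_segments(points, segments):
    """For each of n points, scan all m segments: O(n*m) distance evaluations."""
    results = []
    for p in points:
        distances = [point_segment_distance(p, a, b) for a, b in segments]
        best = min(range(len(segments)), key=distances.__getitem__)
        results.append((best, distances[best]))
    return results


if __name__ == "__main__":
    roads = [((0, 0), (10, 0)), ((0, 5), (10, 5))]
    print(nearest_segments([(5, 1), (5, 4), (-3, 0)], roads))
```

Each point is independent of the others, which is why this kind of workload spreads well across many cores. Restricting the candidate segments to a tile and its neighbours, as the comment describes, shrinks m per point.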
feverzsj, 5 days ago
If you are doing lots of spatial index queries, it's actually much slower than SpatiaLite, because DuckDB uses column-wise storage.
elchief, 4 days ago
I wish to god they would allow 2 connections. One read-only for my BI tool, and one read-write for dbt/sqlmesh.
dmillar, 4 days ago
DuckDB > geopandas, certainly for anything out of core. Though, I recently gave up on importing 70GB worth of large multipolygons (from a CSV in hex WKB), and just used a PostGIS container. In concert with DuckDB's growth, I'd also mark the advent of geoparquet.
The big change, in my view, over the past decade in GIS software, is in compute and storage efficiency across the typical stack. DuckDB has become a part of this, but h/t to the advances from shapely, geopandas, geoparquet, and GDAL. There's a lot of overlap in that Venn diagram, and credit should be spread around. QGIS is great, too, though I feel there is market opportunity to apply 90/10 to its massive feature set and move it to the web.
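As background on the "hex WKB" encoding mentioned in this comment, here is a minimal sketch using only Python's standard library. It reads the byte-order flag and geometry type from a hex-encoded WKB value and decodes a plain 2D point. It assumes standard 2D WKB without the EWKB SRID flag or Z/M dimensions, and the function names are hypothetical.

```python
import struct

# Geometry type codes for plain 2D WKB.
WKB_TYPES = {
    1: "Point",
    2: "LineString",
    3: "Polygon",
    4: "MultiPoint",
    5: "MultiLineString",
    6: "MultiPolygon",
    7: "GeometryCollection",
}


def wkb_header(hex_wkb):
    """Return (raw bytes, struct endian prefix, geometry type code)."""
    raw = bytes.fromhex(hex_wkb)
    # First byte is the byte-order flag: 1 = little-endian, 0 = big-endian.
    endian = "<" if raw[0] == 1 else ">"
    (type_code,) = struct.unpack_from(endian + "I", raw, 1)
    return raw, endian, type_code


def wkb_geometry_type(hex_wkb):
    """Name of the geometry type, or 'Unknown' for codes this sketch does not cover."""
    _, _, type_code = wkb_header(hex_wkb)
    return WKB_TYPES.get(type_code, "Unknown")


def parse_wkb_point(hex_wkb):
    """Decode a 2D WKB Point into an (x, y) tuple of floats."""
    raw, endian, type_code = wkb_header(hex_wkb)
    if type_code != 1:
        raise ValueError("not a plain 2D WKB Point")
    # Two 8-byte doubles follow the 5-byte header.
    return struct.unpack_from(endian + "dd", raw, 5)


if __name__ == "__main__":
    point_hex = "0101000000000000000000F03F0000000000000040"
    print(wkb_geometry_type(point_hex), parse_wkb_point(point_hex))
```

A multipolygon nests polygons, rings, and coordinate arrays after this header, so a single CSV field can become very large.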
patja, 5 days ago
SQL Server has geospatial capabilities without any extensions or add-ons. I've been happily using geospatial datatypes on the free Express version for years, probably well over a decade.
kriro, 4 days ago
That's a pretty grandiose statement and frankly the kind of advertisement I'm not a fan of at all.
If you want to import something and work with it, GeoPandas exists. If you want something integrated with a SQL database, PostGIS exists.
On the application side of the spectrum, GRASS GIS, QGIS etc. say hi. They are being used in the agriculture industry and by government agencies (at least I know that's the case in Germany and Brazil).
vincnetas, 4 days ago
DBeaver (DB client) has built-in support for displaying geo data. In my case, PostGIS results. I see that DuckDB spatial functions are almost identical to the PostGIS ones.
https://dbeaver.com/docs/dbeaver/Working-with-Spatial-GIS-data/
fidotron, 5 days ago
Honestly, I think it's actually https://www.uber.com/en-CA/blog/h3/
aynyc, 4 days ago
How big are the data sets? I've been trying to get DuckDB to work in our company on financial transactions and reporting data. The dataset is around 500GB of CSV in S3 and DuckDB chokes on it.
quasarj, 4 days ago
I've been out of the loop on DuckDB, where can I get a real overview of what the excitement is about? It just looks like a new sqlite from the CLI...
badmonster, 5 days ago
How might embedding spatial capabilities directly into general-purpose data tools like DuckDB reshape who participates in geospatial analysis, and what kinds of problems they choose to solve?
bingaweek, 5 days ago
We need a "come on" clause for these absurd headlines. Come on.
isuckatcoding, 4 days ago
I was expecting example queries but all I got was how to install a package :(
jeffbee, 5 days ago
Ehh, I tried to do some spatial stuff but there just wasn't enough there, or I could not figure out how to use it. Loading spatial information into ipython and fiddling with it is well-traveled, and it doesn't seem to me that SQL is an inherently lower hurdle for the user.
WD-42, 5 days ago
Uh-oh, another pushover-licensed database. I wonder when it will begin its own redis saga.
jandrewrogers, 5 days ago
I think geospatial analytics is important (because of course I would), but to be frank geospatial software has been stagnant for a long time. Every new thing is just a fresh spin on the same stagnant things we already have. This more or less says exactly this?
For geospatial analysis, the most important thing that could happen in software would be no longer treating it, either explicitly or implicitly, as having anything to do with cartography. Many use cases are not remotely map-driven but the tools require users to force everything through the lens of map-making.
cyanydeez, 5 days ago
No. QGIS is.
Good god.
fithisux, 5 days ago
I agree