TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

DuckDB is probably the most important geospatial software of the last decade

485 pointsby dbreunig5 days ago

29 comments

wenc5 days ago
I&#x27;m a big fan of DuckDB and I do geospatial analysis, mostly around partitioning geographies (into Uber H3 hexagons), calculating Haversine distances, calculating areas of geometries, figuring out which geometry a point falls in, etc. Many of these features have existed in some form or other in geopandas or postgis, so DuckDB&#x27;s spatial extensions bring nothing new.<p>But what DuckDB as an engine does is it lets me work directly on parquet&#x2F;geoparquet files at scale (vectorized and parallelized) on my local desktop. It beats geopandas in that respect. It&#x27;s a quality of life improvement to say the least.<p>DuckDB also has an extension architecture that admits more exotic geospatial features like Hilbert curves, Uber H3 support.<p><a href="https:&#x2F;&#x2F;duckdb.org&#x2F;docs&#x2F;stable&#x2F;extensions&#x2F;spatial&#x2F;functions.html#st_hilbert" rel="nofollow">https:&#x2F;&#x2F;duckdb.org&#x2F;docs&#x2F;stable&#x2F;extensions&#x2F;spatial&#x2F;functions....</a><p><a href="https:&#x2F;&#x2F;duckdb.org&#x2F;community_extensions&#x2F;extensions&#x2F;h3.html" rel="nofollow">https:&#x2F;&#x2F;duckdb.org&#x2F;community_extensions&#x2F;extensions&#x2F;h3.html</a>
评论 #43887496 未加载
评论 #43884932 未加载
评论 #43889352 未加载
Demiurge5 days ago
&gt; Prior to this, getting up and running from a cold-start might’ve required installing or even compiling severall OSS packages, carefully noting path locations, standing up a specialized database… Enough work that a data generalist might not have bothered, or their IT department might not have supported it.<p>I&#x27;ve been able to &quot;CREATE EXTENSION postgis;&quot; for more than a decade. There have been spatial extensions for PG, MySQL, Oracle, MS SQL Server, and SQLite for a long time. DuckDB doesn&#x27;t make any material difference in how easy it is to install.
评论 #43883433 未加载
评论 #43888096 未加载
评论 #43886021 未加载
评论 #43886207 未加载
larsiusprime5 days ago
“import geopandas” also exists and has for some time. Snark aside, WHAT is special about duckDB? I wish the author had actually shown some practical examples so I could understand their claims better.
评论 #43882914 未加载
评论 #43882882 未加载
评论 #43882945 未加载
评论 #43882916 未加载
评论 #43882830 未加载
评论 #43882940 未加载
评论 #43882819 未加载
评论 #43883626 未加载
评论 #43887565 未加载
jparishy5 days ago
I work on geospatial apps and the software I think I am most excited about is <a href="https:&#x2F;&#x2F;felt.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;felt.com&#x2F;</a>. I want to see them expand their tooling such that maps and data source authentication&#x2F;authorization was controllable by the developer, to enable tenant isolation with proprietary data access. They could really disrupt how geospatial tech gets integrated into consumer apps.<p>This article doesn&#x27;t acknowledge how niche this stuff is and it&#x27;s a lot of training to get people to up to speed on coordinate systems, projections, transformations, etc. I would replace a lot of my custom built mapping tools with Felt if it were possible, so I could focus on our core geospatial processes and not the code to display and play with it in the browser, which is almost as big if not bigger in terms of LOC to maintain.<p>As mentioned by another commenter, this DuckDB DX as described is basically the same as PostGIS too.
评论 #43882813 未加载
评论 #43883460 未加载
评论 #43886016 未加载
评论 #43883475 未加载
评论 #43897638 未加载
perrygeo4 days ago
Why? The article is light on details. Yes, having spatial analysis combined with SQL is awesome and very natural. There&#x27;s nothing special about 2D geometries that makes them significantly different from floats and strings in an RDBMS perspective - geometry is just another column type, albeit with some special operators and indexes. We&#x27;ve been doing it with PostGIS, Spatialite, etc for two decades at this point.<p>What DuckDB brings to the table is cloud-native formats. This is standard geospatial functionality attached to an object store instead of a disk. As such, it doesn&#x27;t require running a database process - data is always &quot;at rest&quot; and available over HTTP. I&#x27;m not downplaying the accomplishment, it&#x27;s really convenient. But know that this is a repackaging of existing tech to work efficiently within a cloud IO environment. If anything, the major innovation of DuckDB is in data management not geospatial per se.
评论 #43887415 未加载
wodenokoto5 days ago
I’m not sure I agree that “install geospatial” is a game changer in simplicity compared to “pip install geopandas”.<p>They are both one line.
评论 #43882789 未加载
WD-425 days ago
Is it that much simpler than ‘load extension postgis’? I know geos and gdal have always kinda been a pain, but I feel like docker has abstracted it all away anyway. ‘docker pull postgis’ is pretty easy, granted I’m not familiar with what else duckdb offers.
评论 #43882844 未加载
评论 #43882896 未加载
twelvechairs5 days ago
DuckDB is a great thing for geospatial but most important of the past decade? There&#x27;s so many tools in different categories it wouldnt come near top for me. Some might be QGIS, postGIS (still the standard), ArcGIS online (still the standard), JS mapping tools like mapbox (i prefer deckgl), new data types like COG, geopackage and geoparquet, photogrammetry tools, 3d tiles, core libraries like gdal and now pdal, shapely, etc.
评论 #43882865 未加载
评论 #43882814 未加载
willtemperley4 days ago
I have some concerns regarding licensing of DuckDB and GEOS which DuckDB spatial depends on. The former is MIT licensed and the latter LGPL 2.1.<p>This leads to some complex situations where some builds would contravene LGPL 2.1 e.g static linking with a closed source application.
oreilles4 days ago
Chiming in to promote a similar project, a geospatial extension for Polars [1] I&#x27;m working on. It&#x27;s not stable yet (abeit pretty close to), but is already pretty feature complete (it uses GEOS and PROJ as a backend, so has parity with GeoPandas).<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;oreilles&#x2F;polars-st&#x2F;">https:&#x2F;&#x2F;github.com&#x2F;oreilles&#x2F;polars-st&#x2F;</a>
serjester4 days ago
I think this is part of a broader trend of geospatial data just becoming easier to work with. DuckDB is great for quick ad hoc stuff, but I find polars to be easier to maintain. Personally, I&#x27;m really excited for polars to (eventually) add true geospatial support. In the meantime, creating a custom h3 plugin only took a couple days and it simplified massive parts of our old geo pandas &#x2F; duckdb code. The faster we can completely get rid of geo pandas, the better.<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;Filimoa&#x2F;polars-h3">https:&#x2F;&#x2F;github.com&#x2F;Filimoa&#x2F;polars-h3</a>
评论 #43887693 未加载
fifilura4 days ago
I have been using Trino&#x2F;AWS Athena for some geospatial work.<p>It&#x27;s API is not very well covered, some parts are still missing, but it must just be a matter of time.<p>Where it shines is when you need to do an O(n2) or O(nm) type of calculation. Then those 100s of free CPU cores really come in handy! And the end result is pretty often a dollar for CPU-days worth of computation.<p>Example of O(nm) calculation are things like finding the closest road segment inside a tile (or more likely a tile and it&#x27;s surrounding tiles), for each point in a list.
feverzsj5 days ago
If you are doing lots of spatial index queries, it&#x27;s actually much slower than SpatiaLite. Because DuckDB uses column-wise storage.
elchief4 days ago
I wish to god they would allow 2 connections. One read-only for my BI tool, and one read-write for dbt&#x2F;sqlmesh
评论 #43888238 未加载
dmillar4 days ago
DuckDB &gt; geopandas, certainly for anything out of core. Though, I recently gave up on importing 70GB worth of large multipolygons (from a csv in hex wkb), and just used a postgis container. In concert with DuckDB&#x27;s growth, I&#x27;d also mark the advent of geoparquet.<p>The big change, in my view, over the past decade in GIS software, is in compute and storage efficiency across the typical stack. DuckDB has become a part of this, but h&#x2F;t to the advances from shapely, geopandas, geoparquet, and GDAL. There&#x27;s a lot of overlap in that venn diagram, and credit should be spread around. QGIS is great, too, though I feel there is market opportunity to apply 90&#x2F;10 to its massive feature set and move it to the web.
patja5 days ago
SQL Server has geospatial capabilities without any extensions or add-ons. I&#x27;ve been happily using geospatial datatypes on the free Express version for years, probably well over a decade.
kriro4 days ago
That&#x27;s a pretty grandiose statement and frankly the kind of advertisement I&#x27;m not a fan of at all.<p>If you want to import something and work with it GeoPandas exists. If you want something integrated with a SQL database, PostGIS exists.<p>On the application side of the spectrum, GRASS GIS, QGIS etc. say hi. They are being used in the agriculture industry and by government agencies (at least I know that&#x27;s the case in Germany and Brazil).
vincnetas4 days ago
DBeaver (db client) has built in support for displaying geo data. In my case postgis results. I see that duckdb spatial functions are almost identical to postgis ones.<p><a href="https:&#x2F;&#x2F;dbeaver.com&#x2F;docs&#x2F;dbeaver&#x2F;Working-with-Spatial-GIS-data&#x2F;" rel="nofollow">https:&#x2F;&#x2F;dbeaver.com&#x2F;docs&#x2F;dbeaver&#x2F;Working-with-Spatial-GIS-da...</a>
fidotron5 days ago
Honestly, I think it&#x27;s actually <a href="https:&#x2F;&#x2F;www.uber.com&#x2F;en-CA&#x2F;blog&#x2F;h3&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.uber.com&#x2F;en-CA&#x2F;blog&#x2F;h3&#x2F;</a>
评论 #43883589 未加载
评论 #43887366 未加载
aynyc4 days ago
How big are the data sets? I&#x27;ve been trying to get duckdb to work in our company on financial transactions and reporting data. The dataset is around 500GB CSV in S3 and duckdb chokes on it.
评论 #43890337 未加载
评论 #43886653 未加载
评论 #43886614 未加载
评论 #43886755 未加载
quasarj4 days ago
I&#x27;ve been out of the loop on DuckDB, where can I get a real overview of what the excitement is about? It just looks like a new sqlite from the CLI...
评论 #43887858 未加载
评论 #43887833 未加载
badmonster5 days ago
How might embedding spatial capabilities directly into general-purpose data tools like DuckDB reshape who participates in geospatial analysis—and what kinds of problems they choose to solve?
bingaweek5 days ago
We need a &quot;come on&quot; clause for these absurd headlines. Come on.
评论 #43882733 未加载
isuckatcoding4 days ago
I was expecting example queries but all I got was how to install a package :(
jeffbee5 days ago
Ehh I tried to do some spatial stuff but there just wasn&#x27;t enough there, or I could not figure out how to use it. Loading spatial information into ipython and fiddling with it is well-traveled and it doesn&#x27;t seem to me that SQL is an inherently lower hurdle for the user.
WD-425 days ago
Uhoh, another pushover-licensed database. I wonder when it will begin it’s own redis saga.
jandrewrogers5 days ago
I think geospatial analytics is important (because of course I would), but to be frank geospatial software has been stagnant for a long time. Every new thing is just a fresh spin on the same stagnant things we already have. This more or less says exactly this?<p>For geospatial analysis, the most important thing that could happen in software would be no longer treating it, either explicitly or implicitly, as having anything to do with cartography. Many use cases are not remotely map-driven but the tools require users to force everything through the lens of map-making.
评论 #43883612 未加载
评论 #43883546 未加载
cyanydeez5 days ago
No. QGIS is.<p>Good god.
评论 #43882854 未加载
fithisux5 days ago
I agree