How We Built Uber’s Highest Query per Second Service Using Go

187 点作者 dodders大约 9 年前

22 条评论

emrekzd大约 9 年前

I don't think I've learned anything from this article. It's interesting how quickly the keywords pick up on HN (Uber and Go).More than 3 years ago we've implemented a reverse geocoder web-service that indexed complete Census TIGER dataset. The service handled over 15K reqs/sec on a 2011 macbook, doing exact in polygon search(no approximations). We implemented an optimized (for in polygon search) R-tree data structure for the lookups.Assuming Uber's geofence lookups would rely on a much smaller dataset than TIGER, I think you could have come up with a much more efficient implementation requiring a fraction of the resources.And again, use of Go; irrelevant.

评论 #11208061 未加载

评论 #11207266 未加载

评论 #11207514 未加载

StreamBright大约 9 年前

This article is a little bit hand-wavy for me. Essentially they could use any languages to implement this, also there is no proof how channels could not be used. I bit more details would be essential to make this a good read, at least for me.

评论 #11206330 未加载

Animats大约 9 年前

It's interesting that they're doing a spatial database without using a 2D spatial data structure. MySQL and PostGres both have data structures for this. But maybe they don't have that many geofences per city.They have an N readers, one writer problem. It's possible to do that without blocking the readers. There was a YC article on a lockless solution to that yesterday. This is an easier problem. Each city's geofences can be updated by making a copy, updating the copy, and atomically replacing the single reference to the copy. Any searches still running on the old copy continue to run on the old copy. Eventually, the GC deletes the old copy.

评论 #11206831 未加载

评论 #11206969 未加载

评论 #11207527 未加载

chris_overseas大约 9 年前

"Instead of indexing the geofences using R-tree or the complicated S2, we chose a simpler route...""While the runtime complexity of the solution remains O(N), this simple technique reduced N from the order of 10,000s to the order of 100s."It seems odd to me that they're posting about the performance of Go, yet they deliberately chose a less optimal algorithm because the better ones were "complicated". Or, if their simpler approach did in fact run faster than R-trees or S2, it would have been nice to see some benchmarks and a clearer explanation for why. Choosing the best algorithm seems more important than the language in a case like this one.

评论 #11206416 未加载

评论 #11206428 未加载

评论 #11206784 未加载

评论 #11207531 未加载

NolMan大约 9 年前

I was surprised to not see PostGIS at least mentioned. I've been using PostGIS professionally for the last few years and only have good things to say, the toolchain is very mature.We make pretty extensive use of R-Tree indexes in PostGIS, tables with 10,000,000+ rows and shapes that roughly match streets, parcels, blocks, zip codes. While I don't have 99th percentile data offhand average query time is <1ms. Perhaps there are slow responses at the 99th percentile however I would be very surprised by this.

williamsharkey大约 9 年前

I would be interested in comparing the performance of Uber's first pass "city-fencing" vs a pre-computed 1D array.EG: Subdivide a flat projection of earth into n^2 squares. Create an array of length n^2. Set the value of each element in the array to a list of canidate geofences(which have area in that square).Scale lat and long between 0 and 1. Then you can index directly into it with PrecomputedArray[floor(lat*n+long)]This is trading space for time, may as well choose space here.

评论 #11207675 未加载

artursapek大约 9 年前

> While historically Uber has been mostly a Node.js and Python shop, the Go language is becoming the language of choice for building many of Uber Engineering’s new services.It feels like this type of narrative is becoming more common.

评论 #11206706 未加载

评论 #11206723 未加载

评论 #11206507 未加载

评论 #11207449 未加载

评论 #11206384 未加载

rosshemsley大约 9 年前

Ridiculous article.'Highest QPS?'This is a trivial geometric problem, any sane implementation should be several orders of magnitude faster than doing network IO. /rant

评论 #11207969 未加载

mbell大约 9 年前

I'm not terribly familiar with geo algorithms but I'm curious why it's worthwhile taking the two layer approach.Can't you precompute the X and Y minima and maxima for each geo fence and throw out 99% of candidates extremely quickly? In other words store the bounding rectangle for each geofence. Even with 100,000s of entries we're talking 4 primitive type comparisons per entry to exclude all non-possible candidates. I would have a hard time believing this is slower than ray casting to city geofences.

评论 #11207128 未加载

spacemanmatt大约 9 年前

Why not PostgreSQL (PostGIS)?

评论 #11206333 未加载

评论 #11206622 未加载

评论 #11206919 未加载

评论 #11206636 未加载

评论 #11206356 未加载

评论 #11206898 未加载

评论 #11206449 未加载

评论 #11206854 未加载

bufo大约 9 年前

CPU intensive workload. Geofence lookups require CPU-intensive point-in-polygon algorithms. While Node.js works great for our other services that are I/O intensive, it’s not optimal in this use case due to Node’s interpreted and dynamic-typed nature.That's misleading, math in V8 can be VERY fast (aside from the current absence of vectorization). See the benchmarks here: <a href="http://julialang.org" rel="nofollow">http://julialang.org</a>

评论 #11206508 未加载

评论 #11207457 未加载

评论 #11206932 未加载

benbjohnson大约 9 年前

How often does the fencing data change? If it's infrequent I wonder if you could load the data on start up and simply restart nodes to refresh the data. That would avoid a mutex lock entirely.However, a mutex lock/unlock should take <100ns. It doesn't seem like that should be a bottleneck. It'd be interesting to hear more about the data, requirements, and response times (median, 90%, etc).

iamleppert大约 9 年前

I've done something similar. I used the GPU (actually WebGL) and made several images of my geofences where each one was simply drawn as a bitmask (black and white). Then, I wrote a fragment shader to determine if a point is within a geofence by looking the pixel value. I rounded the lat/long pairs to a precision that matched the resolution of my image.Can't remember the exact results but throughput was much higher than what I was able to achieve using highly optimized PostGIS queries.

bsaul大约 9 年前

really interesting to see that go channels weren't used due to performance concerns. channels is the often advocated tech for concurency in this language so having to fall back to mutexes seems like a downpoint. I would have loved to see a benchmark in the article.

评论 #11206233 未加载

评论 #11206340 未加载

评论 #11208152 未加载

评论 #11207905 未加载

评论 #11206272 未加载

nevi-me大约 9 年前

I built a route finding algorithm in Node.JS for one of my projects. I also do some geofence checks, but I use a geo-index (Mongodb).Similar to other comments, the article is a bit lacking in detail. Regardless of language used, I would have liked to see a comparison of query times using a spatial index and not. From my limited experience with PostGIS, there are some inside polygons that perform poorly on large polygons, where Mongodb has done better, and vice versa with linestrings.

yueq大约 9 年前

"...service handled a peak load of 170k QPS with 40 machines running at 35% CPU usage on NYE 2015. The response time was < 5 ms at 95th percentile, and < 50 ms at the 99th percentile."It's ~4k per instance. I would like to know what the R/W ratio is. And how often the index is updated is the key, aka locking operations, which should be mentioned in the article.Also this article should give more comparison on other implementations, if possible. At least compare with Uber's monolithic implementation.The geo-datastructure part is very cool.

nailer大约 9 年前

> Because node.js is single threadedcluster's been around a while now, and is built in to node 5.x.

enitihas大约 9 年前

Am I the only one not able to access the site?

评论 #11206209 未加载

评论 #11206148 未加载

评论 #11206794 未加载

rodionos大约 9 年前

The article is about hiring a go developer that will write a better article

EwanG大约 9 年前

If you need an explanation of how Go might help your organization, this discusses how Uber does GeoFencing using the language.

w84t1me大约 9 年前

A reliable runtime is listed in the three main benefits of using Go. Are the runtimes for the alternatives really any less reliable?I can buy that the language is easy to learn since the feature set is very limited, but is productivity in the first week/month really what to optimise for?They also list performance as a benefit.Go is a reasonable choice, but I doubt that the result would have been much different if they'd used any other modern statically typed language such as Rust, Scala, D, or even Nim.

matthewaveryusa大约 9 年前

>Because Node.js is single threaded, background refreshing can tie up the CPU for an extended period of time (e.g., for CPU-intensive JSON parsing work), causing a spike in query response times.I'm not the biggest node fan, but that is utterly false. How do you think asynchronous system calls get processed in libuv (the event loop that node uses.) ? threads. Of course you'll park the background thread while you're waiting in case of IO, but if it's CPU intensive then you bet that you can use all cores with background work. You just need to know how to write C++ and read the documentation on the bindings

评论 #11206766 未加载

评论 #11208728 未加载