What is the cheapest and most efficient way to make a map from keys to values at very large scale, perhaps several petabytes?<p>When you query, the results are allowed to be a little stale.<p>It should be available to several boxes on a datacenter. Services that provide this are acceptable.
Well the theory tells us that lookups are O(1) anyway, so the implementation doesn't matter ;)<p>Will the dataset change over time, or is it immutable? I have some thoughts on this, but I have no idea how it would scale to terabyte-scale and even further.