The novel about this is Neal Stephenson's Anathem... where people strive to keep just the necessary information alive (like, "Where did we store the atomic waste a couple of 1000 years ago?").<p>No matter how your data is going to be stored, it will eventually have to be copied to another medium. And if you want to store it "forever", you <i>will</i> get data corruption, either because of transmission errors or because the storage medium fails. Keeping the data redundant is not enough for guaranteed data safety, as no matter how many copies you create, one day all of them can fail at the same time.<p>But you can take steps to ensure that your data is realistically safe for say a 1000 years (which is not by any means "forever", but then who but an archaeology AI will read your data at that time). The Long Now Foundation is thinking about stuff like this (<a href="http://www.longnow.org/" rel="nofollow">http://www.longnow.org/</a>).<p>An interesting question is, how do you ensure that your data will not be intentionally wiped out, because it is deemed (religiously, politically, legally) offensive? A metal band's CD cover is now considered dangerously close to child porn here abouts (Scorpions, Virgin Killer). That may be enough to make some people erase your data for decency's sake. A dead man can not fight a cease and desist order.<p>A different view on the matter would be: you are going to die, and your data is going to die eventually. And even if it is around, it's likely that no one will want to read it anyway. But your existence will have consequences for the rest of (human) history, as just by living you provide an input to the human race. This input may be in the form of biological data (children) or pure information (a blog article about "forever storage" which inspires the next great startup founder to do great things). Just go on, do something positive besides leaving a huge carbon footprint and atomic waste that lasts for a million years.
I wonder if there could be a distributed/open solution to this... not so easy probably. Some Bit Torrent style distributed storage comes to mind, where you have to provide x*the ammount of data you would like to store to others, with y% availability.<p>Like a data-storage ponzi scheme, but it might be sustainable because storage and bandwidth keep getting cheaper.
Technically I agree this is completely possible, the challenges are more likely to be economic. How do you make sure the company exists 1000 years in the future? That question poses itself especially for a potential startup providing this service, but would even be relevant if an established company were to do it.<p>If I were to trust anyone with doing this, it would probably be the catholic church. Can't think of anyone else with a sufficient track record.
If what you have is truly worth preserving for that long, transcode it into <a href="http://en.wikipedia.org/wiki/Junk_DNA" rel="nofollow">http://en.wikipedia.org/wiki/Junk_DNA</a> (several times over to allow for mutations) and splice it into sperm/egg/embryo cells at fertility clinics. Let your "progeny" multiply across the planet with Nature taking care of replicating your data through space and time. For retrieval: I suppose that in the distant future, DNA databases will probably be preserved for easy lookup by sequence and thus so will your data.<p>The human genome is ~6B base pairs = ~3B bits = ~357MB. Only 1.5% constitute protein-coding genes. Let's say some of that "junk DNA" is actually useful, we could still get away with ~200MB of payload per human cell. Further scaling can be obtained by targeting other organisms with large populations (bees, rabbits, cockroaches, mosquitoes, bacteria) or even synthesize new viruses or parasites that hitch along to humans for an evolutionary ride while preserving your data at the same time ...<p>(Disclaimer: I'm in the middle of an X-files marathon)
It's very easy to keep the <i>data</i> intact, if you know when, exactly, you want access to it again: as in the book referenced in the article, The Forever War, just put the <i>data</i> on an object in space, relativistically accelerate it, and set it for a return course in X years. From the object's frame of reference, much less time (and degradation) will pass.<p>Or, even simpler: find a reflective surface X/2 lightyears away, and shoot coherent light at it with sufficient power.<p>The real problem is preserving the <i>information</i>, which degrades even faster than the data. Even communicating a very simple message[1] across more than a 1000-year gap turns out to be nearly impossible, as it will be intentionally misinterpreted in the direction of whatever will make the finders happy.<p>[1] <a href="http://www.damninteresting.com/this-place-is-not-a-place-of-honor" rel="nofollow">http://www.damninteresting.com/this-place-is-not-a-place-of-...</a>
Preserving data for future historians is a good thing but why is it so important to preserve <i>your own</i> data? Do you really think that's what will be the most valuable to them? It seems better to contribute to projects like the Internet Archive.<p>Also, the value of data depends partially on its scarcity, and I expect data about us won't be particularly scarce. More data is better, but when doing a bulk analysis, 10 terabytes of family pictures will not be all that different from 10.1 terabytes of family pictures.
I think the best way to guarantee your data to live on is to get it in a form you can save for the future as easily as a photo album.<p>Some time ago I saw (maybe even here on Hacker News) something about "setting your data in stone"...<p><a href="http://primera.eu/millenniata/millenniata-en.html" rel="nofollow">http://primera.eu/millenniata/millenniata-en.html</a> is what I found quickly about it now, there is/was a bigger explaining page with a heat and ice test of this medium.
A few years ago I won a (very small) business plan competition for a company describing a consumer backup service like this one.<p>The kicker is figuring out what would happen if the company stops trading for whatever reason (e.g. bankrupcy). Although I didn't follow through with it, I think you can build a reasonably safe winding-down process that at the very least returns the data to its owners.