TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

DNA is maybe 60-750MB of data

57 pointsby MattSayar9 days ago

7 comments

biomcgary9 days ago
Your genome only needs to store the information necessary for the lineage leading to you to survive and compete in the range of environmental variation that actually happened.<p>Consequently, your DNA has less information about how to survive on Jupiter or in the absence of oxygen.<p>However, your genome contains a fair amount of data on how to identify a mate that will maximize your reproductive success in an environment similar to the one your lineage experienced, e.g., a preference for symmetrical faces.<p>Until we can measure the environment of humans accurately, all the algorithmic complexity measures applied to the genome are going to be missing the relevant context.
评论 #43931934 未加载
motrm9 days ago
DNA makes me think of ASN.1 in that a short sequence of bits can convey rather a lot of information - but it makes no sense without knowing which message those bits represent. Only with that knowledge can you turn those bits into something useful.
hulitu9 days ago
&gt; DNA is maybe 60-750MB of data<p>maybe
评论 #43934581 未加载
refurb8 days ago
This article only touches on it but the data stored is far more than just the base pairs.<p>DNA is like a computer program that when it runs, it provides feedback for the code and determines which parts of it should run. It can also modify the code (DNA methylation).<p>Then add on top the external environment - external molecules can interact with the machinery which then impact which code is executed.<p>If code is self-regulating, the amount of information it encodes is far higher than that defined by its base pairs.
评论 #43934022 未加载
评论 #43934361 未加载
jonn_smith8 days ago
This completely ignores methylation and other base modifications that can occur (which impact DNA transcription). You need a few more bits just for the former, so this estimate is incomplete. The real number is higher than this estimate.
评论 #43940962 未加载
评论 #43940192 未加载
jiggawatts9 days ago
For a long time now, a thought has been repeating in my mind: <i>How</i> exactly are high level behaviours like sexual attraction encoded in our genes!?<p>It such a subtle thing too! We&#x27;re attracted (or not) to the <i>tiniest</i> differences in physiology. If you doubt this, try this exercise: Pretend you&#x27;ve just met green aliens and have to explain to them how to reliably tell the difference between men and women from appearance alone! Now explain why <i>that</i> particular girl (or boy) is very pretty&#x2F;handsome, but not <i>that</i> one.<p>It&#x27;s one of those topics where the more you know, the more freaky it is.<p>DNA does not -- to our knowledge -- directly encode the &quot;weights&quot; of our neurons! It can&#x27;t <i>possibly</i> because there are far more synapses than there bits of information in our genes. Also, most of those genes are dedicated to non-brain parts of the body plan and to the low-level machinery of our cellular biochemistry.<p>Secondly, DNA has only an indirect effect of our development: it encodes for proteins, which then provide chemical signals such as concentration gradients that guide cell division. It&#x27;s a bit like playing SimCity, where the players&#x27; control is limited to zoning and road topology. The individual Sims are not directly controllable and behave stochastically.<p>Solving this problem is so freakishly difficult for even the incredible brute force of parallel search of evolution only managed to discover a solution a few times in a billion years.<p>Our attraction to our partners is a genetic heritage shared with <i>all</i> mammals, going back hundreds of millions of years. That&#x27;s why Furry is a thing, but not Featherry. Birds are a different <i>class</i> from us mammals and don&#x27;t share the same &quot;partner attraction wiring&quot; genes. (This is closely related to why all mammal babies are cute to humans, but baby bird chicks are generally repulsive.)<p>Because this is a hard problem to solve, the few solutions that were discovered had to be reused by entire classes of Animalia. I would hazard a guess that <i>this</i> is precisely what defines a “class” in taxonomy! If there were intelligent birds, their equivalent of Furry would be Featherry, and their crimes of bestiality would be with other non-intelligent birds, not mammals.<p>With LLMs, we got to see a glimpse into the possible mechanisms of intelligence, and what it might take to design or evolve one.<p>The LLM equivalent of this kind of encoding would be to design a model architecture that falls in love with a specific, narrowly selected, subset of its users. Keep in mind that I&#x27;m not talking about a learned or specifically tuned set of model weights! The <i>architecture</i> is where the attraction is encoded, such as selecting some complex variant or combination of Transformers, Mamba, or CNNs that just &quot;so happen&quot; to result in the model preferentially learning to be attracted to certain styles of conversation, <i>but not others</i>.<p>Worse still, the direct equivalent to what genes do is that you can&#x27;t even choose an architecture directly, instead you can only contribute to PyTorch. You have to design its API such that naive developers using it stochastically tend towards the desired architecture of their own accord by simply tab-completing often.<p>That&#x27;s essentially what evolution figured out, at least five or six times, but <i>tunable</i>, so that individual species can be attracted to each other but much less to even very closely related species.<p><i>And then</i>, evolution found a way to add a &quot;notch filter&quot; such that despite increased attraction to closely related individuals, most animals (including humans) are <i>repulsed</i> sexually by their parents and siblings.<p>That&#x27;s mind-blowing to me.
评论 #43936105 未加载
评论 #43932388 未加载
评论 #43935548 未加载
E_Evan9 days ago
<a href="https:&#x2F;&#x2F;github.com&#x2F;samtools&#x2F;htslib&#x2F;blob&#x2F;develop&#x2F;sam_internal.h">https:&#x2F;&#x2F;github.com&#x2F;samtools&#x2F;htslib&#x2F;blob&#x2F;develop&#x2F;sam_internal...</a>