TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Human genes renamed to stop Microsoft Excel from misreading them as dates (2020)

191 pointsby takiwatangaabout 3 years ago

20 comments

mcvabout 3 years ago
The real problem is of course using Excel to maintain data. That&#x27;s not what it&#x27;s for. That&#x27;s what databases are for. Excel is for complex calculations over rows and columns. And of course it should be better at data validation; you should be able to tell it whether a certain column is a date or not, and Excel shouldn&#x27;t be jumping to stupid conclusions about it, but it does, and that means you need to be careful and not use Excel for something like that.<p>Excel misuse is sadly rampant and one of my main frustrations at my previous project. Excel is popular for this sort of misuse because when you open it, it presents you with an empty table and you can immediately start typing. This invites tabular data. But Excel inevitably fucks up because it lacks proper data types, data validation, or foreign keys, and loves making assumptions about what you mean. This makes it a terrible and harmful choice for any sort of serious data. I even proposed a new project for an Excel-like frontend backed by a database exactly for these sort of situations. Because I do get what attracts people to Excel for this sort of thing. And most people don&#x27;t realise what a terrible choice it is.
评论 #30987638 未加载
评论 #30987403 未加载
评论 #30988017 未加载
评论 #30994894 未加载
评论 #30988164 未加载
评论 #30989354 未加载
评论 #30988989 未加载
评论 #30990393 未加载
mdb31about 3 years ago
PSA: if you use Excel as intended, you would just go File&#x2F;New&#x2F;Blank Worksheet, switch to the Data tab, then use the &#x27;From Text&#x2F;CSV&#x27; wizard to specify the data types for each column in your data source, neatly bypassing this entire issue.<p>But yes, just double-clicking the CSV from Explorer, using the legacy &#x27;open this as a sheet&#x27; functionality, experiencing data loss and then complaining about it (and the state of Excel, MSFT and The World in general) on social media is much, much more fun...
评论 #30986970 未加载
评论 #30986947 未加载
评论 #30987186 未加载
nulbyteabout 3 years ago
&gt; Microsoft Excel may be fleeting, but human genes will be around for as long as we are. It’s best to give them names that work.<p>This is the crux of the issue. Yes, Excel could do better with support for CSV. (OpenOffice and LibreOffice habdle this better, for example.) But no, genetics is not some niche use-case for this better support. It&#x27;s just one example of many. I do analytics of another sort for a bank, and we run into this problem all the time with those pesky 16-digit account numbers.<p>But, while Microsoft might do well to make changes to address their poor implementation, someone somewhere will implement the next best thing and screw it up, too.
评论 #30987225 未加载
jmchusterabout 3 years ago
Similarly, if your last name is &quot;Null&quot;, sometimes it&#x27;s just easier to just change your last name.
评论 #30987671 未加载
评论 #30991610 未加载
评论 #30989485 未加载
评论 #30987754 未加载
wil421about 3 years ago
I love it when excel decides a bunch of digits is a date or decides to remove leading 0000s from a very important number that needs 0000s. Or maybe it wants to make a bunch of digits into scientific notation. I would prefer if excel left my cells alone or just asked what I wanted to do.
chipsaabout 3 years ago
Part of the problem, highlighted in the article: Users don&#x27;t set the data type of the column off of &quot;General&quot;. So it tries to figure out the data type on entry. Setting the column to use a specific type (&quot;Text&quot;), keeps it from trying to use heuristics to figure out that something is a date.<p>Something more recent is the introduction into Excel of Power Query, which lets you import a CSV and apply arbitrary transformations (such as applying a type), before it hits the workbook, so if you need to pull in a CSV, you can do so, and it will always be imported the same way.
评论 #30995575 未加载
Semaphorabout 3 years ago
At the time (506 comments): <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=24070385" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=24070385</a>
评论 #30986851 未加载
adhesive_wombatabout 3 years ago
&gt; Be conservative in what you send, be liberal in what you accept.<p>This is why I think that&#x27;s wrong: if you allow people to be sloppy with how they do things, they&#x27;ll do it, and then make it part of their workflow, product, religion or whatever, and now everyone is stuck with it.<p>Be absolutely explicit with what you accept and refuse to deal with crap. Then you will only ever have to maintain a simple validator and the code that deals with good data, rather than having to have an incredibly hairly validator that leaks into your logic at every level, followed by cementing your bugs into everyone&#x27;s implementations.
评论 #30991028 未加载
loufeabout 3 years ago
Good initiative, but there are hundreds of used languages, I hope they put a bit more effort into it than just testing the English dates cases. I caught one bad example, for any French speakers, in the article itself:<p>and MARS tweaked to MARS1.
评论 #30988938 未加载
pelasacoabout 3 years ago
Looks more like the scientists are choosing the wrong tool to do their job. Sure you can use excel, to get a sense of the data, but for god sake, there are better tools for that - or even contribute to an existing open source one to do what you need. I&#x27;m quite sure scientists could get some grants to hire actual software engineers to come up with a better solution, based on the open source options existing there.
评论 #30989607 未加载
burntoutfireabout 3 years ago
Reality adapting to software (instead of the other way around) seems to be more and more common. Just today, my local library sent out email informing users that, while they agree that they shouldn&#x27;t charge late fees for non-work days, the software company they use doesn&#x27;t want to introduce such feature into the software, so they have no way but to charge for those days.
评论 #30995676 未加载
keithalewisabout 3 years ago
Suggested title: &quot;Human genes renamed to stop people who don&#x27;t know how to use Microsoft Excel from misreading them as dates.&quot;
washadjeffmadabout 3 years ago
Does anyone here use Excel in any capacity on production data in the biosciences? If so, did you go to college for the biosciences, or are you supplying work using a tool you learned under another discipline?<p>I&#x27;m always fascinated to learn how Excel makes its way into unusual places.
评论 #30989691 未加载
AviationAtomabout 3 years ago
Isn&#x27;t this solved by putting a single quote, or something of the sort, in front of the data?
评论 #30989716 未加载
nikonyrhabout 3 years ago
I work in the retail sector, GTINs &#x2F; EANs in the exponential form aren&#x27;t ideal but sadly fairly common.
hoosiereeabout 3 years ago
&quot;4 base pairs ought to be enough for anybody&quot; - Bill Gates
Shadonototraabout 3 years ago
Modern stupidity at its finest, a point of no return was achieved
Schroedingersatabout 3 years ago
Just don&#x27;t use excel.<p>Or anything with autocorrect and autoformat.
rngcntrabout 3 years ago
Who said Bill Gates wants to manipulate our genes? ;)
imtringuedabout 3 years ago
&gt;Even then, a scientist might fix their data but export it as a CSV file without saving the formatting.<p>Is this supposed to be a joke? Since when does excel support CSV files? Yes you can import and export CSV files but that is just there to check a box. That feature doesn&#x27;t actually work. Just import and export .xlsx files in your applications directly.<p>CSV is such a bad format because it&#x27;s not even a standard, there is RFC4180 but most people have never heard of it. CSV is complicated enough that anyone who thinks they can implement it will get it wrong on their first attempt but simple enough that people believe they can implement it on their first attempt.
评论 #30987183 未加载
评论 #30987021 未加载