TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Google Drive flags file only containing “1” for copyright infringement

1117 pointsby thanatosminover 3 years ago

59 comments

tyingqover 3 years ago
I&#x27;m curious if it was related to the file name. I created a few 1-byte files with just &quot;1&quot; in them, with different names, including &quot;output04.txt&quot;. No problems so far. Also uploaded variations with &quot;\n&quot; and &quot;\r\n&quot; after the &quot;1&quot;. And enabled sharing to anyone with the link. No issues so far.<p>Google drive does support metadata like a description and comments. I wonder if someone posted some copyrighted text in a comment?<p>Update: Recreated it. Most of them are now flagged. Took about an hour for that to happen. So far, all that have just one byte, being a &quot;1&quot;, and also the one that contains &quot;1\n&quot;.<p>The one with &quot;1\r\n&quot; hasn&#x27;t been flagged. The file names of the flagged files: &quot;one.txt&quot;, &quot;onev2.txt&quot;, &quot;output04.txt&quot; and &quot;output05.txt&quot;.<p>Screenshots of the email and Google drive: <a href="https:&#x2F;&#x2F;imgur.com&#x2F;a&#x2F;RHnEJcj" rel="nofollow">https:&#x2F;&#x2F;imgur.com&#x2F;a&#x2F;RHnEJcj</a> (note the little flags on the Google drive view, and the file sizes)<p>Just added some files with &quot;0&quot; and &quot;0\n&quot;, we&#x27;ll see if &quot;0&quot; is copyrighted :)
评论 #30064712 未加载
评论 #30063319 未加载
评论 #30065216 未加载
评论 #30064970 未加载
评论 #30065300 未加载
评论 #30061279 未加载
评论 #30063789 未加载
评论 #30065091 未加载
评论 #30065720 未加载
评论 #30065659 未加载
评论 #30064637 未加载
评论 #30061426 未加载
评论 #30068162 未加载
评论 #30065705 未加载
评论 #30062980 未加载
iameliover 3 years ago
All software has bugs; I&#x27;m not mad at all that this silly test case was flagged incorrectly. The truly infuriating part is &quot;A review cannot be requested for this restriction.&quot;<p>Translation: &quot;We have no idea if you actually own this content or not, but it would be _way too expensive_ for us to find out for sure! So you&#x27;re out of luck, but don&#x27;t worry — it&#x27;s all worth it so we can make sure children can&#x27;t stream Marvel movies from Google Drive! Thank you for your contributions to Disney+&#x27;s bottom line.&quot;
评论 #30065359 未加载
评论 #30064886 未加载
评论 #30065043 未加载
评论 #30066153 未加载
评论 #30066638 未加载
评论 #30066814 未加载
ChicagoBoy11over 3 years ago
I experienced something similar building an internal tool on GSuite. I had a large file with sequences of 9 digit numbers specific to our use-case, all tied to names of people (employees). Whelp, at one point the tool I was working on stopped working, and it was flagged as apparently containing social security numbers (which I suppose matched the character length).<p>Whelp, on the admin panel, you can get a report of those files, and then mark it as a false positive. Which I did. But then nothing happened, and nothing changed. It was no use.<p>The hilarious bit: It did, of course, allow me to make a copy of the file in question, and then just point the resource I was building to the new file, which was exactly the same. Weeks later... so far, so good.
评论 #30063538 未加载
评论 #30061828 未加载
评论 #30061254 未加载
评论 #30062711 未加载
评论 #30063146 未加载
version_fiveover 3 years ago
&quot;A review cannot be requested for this restriction&quot;<p>ML enforcing rules is bad enough, but not allowing false positives to be corrected is ridiculous. This is why I would never consider g-suite for any business application.<p>Otoh, I think there is a legitimate business to be made helping small businesses and individuals secure themselves against arbitrary behavior from big tech. This kind of thing can have serious consequences (imagine if it was something of real substance that got restricted without recourse) and people need to consider hardening their activities against google et al
评论 #30061184 未加载
评论 #30060910 未加载
评论 #30061224 未加载
评论 #30061145 未加载
评论 #30061589 未加载
评论 #30062012 未加载
评论 #30060735 未加载
评论 #30064199 未加载
评论 #30063409 未加载
评论 #30061385 未加载
评论 #30062549 未加载
评论 #30063499 未加载
评论 #30064266 未加载
blunteover 3 years ago
Ironically, it may end up being one of these &quot;tiny&quot; scenarios which finally does Google in.<p>When trying to illustrate a problem or bug, one of the typically time consuming challenges is reducing the scenario to the minimal case which illustrates the problem. So thank you, @emilyldolson!<p>Aside from an empty file, you cannot reduce this any further. It brings to light in simple terms that non-techies can understand how absurd the &quot;ML to solve everything&quot; promise is -- and even moreso how wilfully negligent companies are by providing NO human intervention or support when the machines break down.
评论 #30063943 未加载
评论 #30062174 未加载
chowardover 3 years ago
The fact that Google is scanning your files for &quot;copyright infringement&quot; is bad enough. They have no way of knowing that you don&#x27;t legitimately own something. Then pair that with this example and if that isn&#x27;t enough of a deal breaker for using Google drive I don&#x27;t know what is.
评论 #30067054 未加载
评论 #30066441 未加载
leokennisover 3 years ago
15 years ago the first word that came to mind when thinking of Google was “magic”.<p>10 years ago “useful”.<p>These days it’s just “dread”.
评论 #30062763 未加载
评论 #30066261 未加载
评论 #30062318 未加载
TT-392over 3 years ago
&quot;Thanks for helping google keep the web safe&quot;<p>Interesting thing to add in there, how on earth does copyright stuff have anything to do with safety?
评论 #30061461 未加载
评论 #30061637 未加载
评论 #30066328 未加载
评论 #30067958 未加载
评论 #30064992 未加载
lhorieover 3 years ago
I have a pet theory that all of these recent Google bloopers could be explained easily if you start from the assumption that Google internal incentives promote efforts to cut costs such as storage.<p>&quot;Garbage&quot; docs, inactive email accounts, less search results etc can all be reasonably explained by a desire to not spend money on storage for &quot;low value&quot; data (i.e. data that is unlikely to be accessed in a way that translates to profit for Google). Users, having been trained to rely on free services and the magic of search to summon stuff, have zero incentives to clean up their digital &quot;pollution&quot;, and at some point, something&#x27;s gotta give.
评论 #30061975 未加载
评论 #30066279 未加载
评论 #30064003 未加载
onion2kover 3 years ago
If Google has the copyright on &quot;1&quot;, they only need to get &quot;0&quot; as well and they&#x27;ll have everything.
评论 #30063387 未加载
评论 #30061851 未加载
评论 #30060921 未加载
评论 #30063373 未加载
评论 #30066997 未加载
评论 #30061004 未加载
nocturnialover 3 years ago
Just call google support... oh... wait... right...<p>I wonder how many ads we need to watch before google implements something even remotely similar to user support? How many billions are enough before we get support?<p>I know I&#x27;m overreacting but I&#x27;m getting tired of these articles. We all know that google is messed up (to put it lightly). Some people here don&#x27;t think that&#x27;s the case and that&#x27;s fine. Other people, including me, don&#x27;t find it surprising at all.<p>Post something about google killing cute kittens.<p>I wouldn&#x27;t be surprised but I would be interested in that story.
verytrivialover 3 years ago
A quick note to anyone working to reproduce this: the automated stupidity that caused this is of the same variety that will CANCEL YOUR GOOGLE ACCOUNT without recourse if your stats lean a certain way. Tread carefully.
mastaziover 3 years ago
I remember when it was announced that this was going to be possible and people here on HN were defending Google&#x27;s decision with comments along the lines of &quot;this is fine, they&#x27;re not reading your private files, they&#x27;re just going to stop people that use Google Docs for distributing pirated content&quot;<p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=27858032" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=27858032</a>
dmitrygrover 3 years ago
&quot;A review cannot be requested for this restriction&quot;<p>I always did say that Franz Kafka never died. He is semi-retired working in google’s PM org, occasionally consulting for the UX teams as well.
jacquesmover 3 years ago
Pretty weird that Google would be scanning files for copyright infringement in the first place, it&#x27;s supposed to be a <i>Drive</i> not the enforcement arm of the copyright mafia.
评论 #30066289 未加载
everyoneover 3 years ago
It&#x27;s so dystopian &#x2F; Kafkaesque it&#x27;s like a parody.<p>&quot;Thankyou for helping google keep the web safe&quot;<p>followed by...<p>&quot;A review cannot be requested for this restriction&quot;
评论 #30064828 未加载
Qub3dover 3 years ago
Always operate under the assumption that iCloud (Apple), Microsoft and Google will delete any&#x2F;all of your data, with no notice, and for no reason.<p>Because they explicitly reserve the right to do so in their TOSes.<p>Not your computer, not your data etc.<p>(<a href="https:&#x2F;&#x2F;www.quentb.com&#x2F;posts&#x2F;diy-cloud-backup&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.quentb.com&#x2F;posts&#x2F;diy-cloud-backup&#x2F;</a>)
评论 #30063919 未加载
评论 #30065783 未加载
评论 #30064726 未加载
unclekevover 3 years ago
Meanwhile my Mom uses Google Drive to share pirated movies with family members (despite my protests) and is yet to have a single file flagged.<p>Just need to name your file something like &quot;Output04.S01E01.NumberOne.1080p.HEVC.x265-MeGusta&quot; and you&#x27;ll be fine &#x2F;s<p>How can they get things so wrong?
Animatsover 3 years ago
File a DMCA counter-notice, of course.[1]<p>You may have to do this the hard way, via Google&#x27;s address for service of process.[2] Use registered mail or FedEx.<p>There&#x27;s also the option of taking Google to arbitration. Legal advice from one of those &quot;free quick consult&quot; services may be helpful.<p>[1] <a href="https:&#x2F;&#x2F;www.nolo.com&#x2F;legal-encyclopedia&#x2F;responding-dmca-takedown-notice.html" rel="nofollow">https:&#x2F;&#x2F;www.nolo.com&#x2F;legal-encyclopedia&#x2F;responding-dmca-take...</a><p>[2] <a href="https:&#x2F;&#x2F;support.google.com&#x2F;faqs&#x2F;answer&#x2F;6151275" rel="nofollow">https:&#x2F;&#x2F;support.google.com&#x2F;faqs&#x2F;answer&#x2F;6151275</a>
评论 #30064216 未加载
newhotelownerover 3 years ago
I am a small business owners. I pay for google one so that all my files are backed up and sync across devices. I also pay for backblaze to backup all my files (Just in the case google screws me).<p>Is there an alternative for encrypted backup &amp; sync between different computers?
评论 #30062656 未加载
评论 #30064866 未加载
评论 #30064774 未加载
评论 #30064845 未加载
mbrukmanover 3 years ago
<i>Disclosure: I work at Google, but not on the Google Drive team specifically.</i><p>Sorry about the issue, folks! The Google Drive team is aware of it and is working on remediating it.<p>And thank you all for the many test cases! :)
评论 #30068556 未加载
PaulHouleover 3 years ago
Must have infringed on Metallica.
评论 #30060964 未加载
itronitronover 3 years ago
I guess the moral of the story is, never do business with a company that doesn&#x27;t provide a mailing address to which you can mail a turd (at book rate.)
评论 #30066307 未加载
daneel_wover 3 years ago
And googling &quot;15.91&#x2F;4&quot; throws a SafeSearch alert letting us know that &quot;some results may be explicit&quot;.
Devastaover 3 years ago
No pity, if you are using Google products for anything important you only have yourself to blame.
1vuio0pswjnm7over 3 years ago
Maybe for those times when copyright infringers try to split an infringing file into separate files containing only one bit, represented as text, to avoid detection. No, I am not serious.<p>Try testing a file that contains more than a single 1 or 0, such as 01111000.
reaperducerover 3 years ago
I&#x27;ve said it before, Google is trying to be the new Microsoft.<p><a href="https:&#x2F;&#x2F;www.theonion.com&#x2F;microsoft-patents-ones-zeroes-1819564663" rel="nofollow">https:&#x2F;&#x2F;www.theonion.com&#x2F;microsoft-patents-ones-zeroes-18195...</a>
ahsima1over 3 years ago
I wonder if it&#x27;s a part of some sort of cyberattack. Someone knows that deleting a file, containing a &quot;1&quot; or &quot;0&quot; from target&#x27;s gdrive will break something they want, so they filed a false DMCA claim.
评论 #30064455 未加载
woliveirajrover 3 years ago
Google Drive answered:<p>&quot;Hi Dr. Emily Dolson, thank you for letting us know about this issue! The Drive team is very much aware of this now thanks to all of you we&#x27;re working on it!&quot;
hdermsover 3 years ago
This seems like a really great case for property based testing and&#x2F;or fuzzers. Randomly generated output should virtually never flag copyright (and would be rare enough that you could manually assess if it was accurate or not, likely). The core utility of system like this, which puts an enormous amount of leverage in the hands of automated decision making <i>must</i> be robust against things like this.
marivillaover 3 years ago
Google Drive also offers client side encryption, which would make this scanning ineffectual: <a href="https:&#x2F;&#x2F;flowcrypt.com&#x2F;blog&#x2F;article&#x2F;2021-06-14-google-workspace-encryption&#x2F;" rel="nofollow">https:&#x2F;&#x2F;flowcrypt.com&#x2F;blog&#x2F;article&#x2F;2021-06-14-google-workspa...</a><p>So as long as you have a ton of money and are a corporation your privacy should be just fine
评论 #30068443 未加载
mateo1over 3 years ago
This is your annual reminder that you don&#x27;t own your files if they&#x27;re stored in someone else&#x27;s computer (also known as &quot;the cloud&quot;). Keep offline backups, legislation has made it very easy to export literally everything from google.
raydevover 3 years ago
Looks like the Google Drive team is aware of it. Wonder what happens next.<p><a href="https:&#x2F;&#x2F;twitter.com&#x2F;MishaBrukman&#x2F;status&#x2F;1485804925561057291" rel="nofollow">https:&#x2F;&#x2F;twitter.com&#x2F;MishaBrukman&#x2F;status&#x2F;1485804925561057291</a>
diogenesjuniorover 3 years ago
I feel bad for the new hire who wasn&#x27;t entirely sure what he was doing. Something similar happened at reddit[0], wouldn&#x27;t put it past google.<p>0: <a href="https:&#x2F;&#x2F;redd.it&#x2F;m0rmux" rel="nofollow">https:&#x2F;&#x2F;redd.it&#x2F;m0rmux</a>
jacob019over 3 years ago
Another example of why it is time to dump google. With google you are the product, not the customer. There are decent alternatives for everything that google offers. It feels really good to do.
flykespiceover 3 years ago
Serious I&#x27;m upset that google drive can block files that you <i>own</i>, I feel my trust betrayed. We&#x27;re really moving to an dystopian age where companies can control your personal data.
pmontraover 3 years ago
Is this an unintended adversarial attack to some copyright classifier?
Grismarover 3 years ago
I wonder: is this a technical issue, or just a practical joke by someone who has managed to convince Google Drive that they have the copyright to files containing only &quot;1&quot;?
shmerlover 3 years ago
Copyright was always bizarre in the sense that any information can be expressed as numbers. So why are some numbers more copyrightable?<p>Also reminds me this (&quot;Microsoft Patents Ones, Zeroes&quot;): <a href="https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20100607151726&#x2F;http:&#x2F;&#x2F;www.theonion.com&#x2F;articles&#x2F;microsoft-patents-ones-zeroes,599&#x2F;" rel="nofollow">https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20100607151726&#x2F;http:&#x2F;&#x2F;www.theoni...</a>
评论 #30064878 未加载
ccbccccbbcccbbover 3 years ago
No surprise here. One World Corporation is going to reserve 1 for itself. This tweet is just a test drive.
luckystarrover 3 years ago
Could this be a rare case of a SHA-whatever collision? Or do they use MD5 to identify these files.
Ansil849over 3 years ago
Are there any updates on this? Like any accountability or an explanation why this happened?
userbinatorover 3 years ago
I wonder how quickly this would have been fixed if it happened to a Google employee.
hedoraover 3 years ago
This is a great example of why all cloud services should be end to end encrypted.
wanderingmindover 3 years ago
Maybe just use rclone or other tools to store encrypted files at rest in drive.
katsover 3 years ago
May want to back up your Google account if you see a message like that.
golem14over 3 years ago
That&#x27;s Number Wang!!!
jpambrunover 3 years ago
Google&#x27;s bots are crazy. Thank god they sold Boston Dynamics...
loudtieblahblahover 3 years ago
Protondrive is a thing.<p>Zero knowledge storage needs to be the default everywhere.
评论 #30070118 未加载
dwaiteover 3 years ago
So who is going to run out and get a tattoo a la DeCSS?
fortran77over 3 years ago
Ar least it wasn’t flagged for illegal porn, too.
davebaileyover 3 years ago
This could easily have been a test value.
mbfgover 3 years ago
i remember BlackDuck flagging a one pixel (white) image as a copyright infringement in our product.
spaniard_devover 3 years ago
Who said that “Don’t be evil” thing?
davebaileyover 3 years ago
This might have been a test case.
olliejover 3 years ago
1 is the loneliest IP :D
gumbyover 3 years ago
what about an empty audio file four minutes and 33 seconds long?
TedShillerover 3 years ago
Maybe but anyone can claim that on Twitter
评论 #30063428 未加载
superkuhover 3 years ago
Play stupid games, win stupid prizes. Putting your data in a megacorp basket means it&#x27;ll be treated primarily with consideration towards their legal liability first, other megacorps second, and you third or fourth.
评论 #30061006 未加载