TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Amazon has a secret workaround to scrape GitHub for model training

6 pointsby tardismechanic11 months ago

3 comments

tardismechanic11 months ago
<a href="https:&#x2F;&#x2F;archive.is&#x2F;2024.06.14-142849&#x2F;https:&#x2F;&#x2F;www.businessinsider.com&#x2F;amazon-secret-workaround-scrape-microsoft-github-ai-training-data-2024-6" rel="nofollow">https:&#x2F;&#x2F;archive.is&#x2F;2024.06.14-142849&#x2F;https:&#x2F;&#x2F;www.businessins...</a>
fikjusulta11 months ago
I would appreciate a formal mechanism to opt out of data collection for Amazon (as well as OpenAI and Microsoft).
smcin11 months ago
[Non-paywalled version]: <a href="https:&#x2F;&#x2F;dataconomy.com&#x2F;2024&#x2F;06&#x2F;14&#x2F;amazon-has-a-secret-way-to-scrape-microsofts-github-and-feed-its-ai-model&#x2F;" rel="nofollow">https:&#x2F;&#x2F;dataconomy.com&#x2F;2024&#x2F;06&#x2F;14&#x2F;amazon-has-a-secret-way-to...</a><p><i>According to an internal memo obtained by Business Insider, Amazon’s AGI Group worked around Github&#x27;s 5,000 request&#x2F;hr&#x2F;account limits by &#x27;encouraging&#x27; its employees to create multiple GitHub accounts and share their access credentials. By leveraging a network of accounts simultaneously, Amazon aims to condense what would have been a multi-year endeavor into a matter of weeks.</i><p>Dataconomy: <i>The ethical implications are significant. By soliciting employees to share personal GitHub accounts, Amazon is potentially accessing data without explicit consent from GitHub or the repository owners.</i>
评论 #40686724 未加载