TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Amazon has a secret workaround to scrape GitHub for model training

6 点作者 tardismechanic11 个月前

3 条评论

tardismechanic11 个月前
<a href="https:&#x2F;&#x2F;archive.is&#x2F;2024.06.14-142849&#x2F;https:&#x2F;&#x2F;www.businessinsider.com&#x2F;amazon-secret-workaround-scrape-microsoft-github-ai-training-data-2024-6" rel="nofollow">https:&#x2F;&#x2F;archive.is&#x2F;2024.06.14-142849&#x2F;https:&#x2F;&#x2F;www.businessins...</a>
fikjusulta11 个月前
I would appreciate a formal mechanism to opt out of data collection for Amazon (as well as OpenAI and Microsoft).
smcin11 个月前
[Non-paywalled version]: <a href="https:&#x2F;&#x2F;dataconomy.com&#x2F;2024&#x2F;06&#x2F;14&#x2F;amazon-has-a-secret-way-to-scrape-microsofts-github-and-feed-its-ai-model&#x2F;" rel="nofollow">https:&#x2F;&#x2F;dataconomy.com&#x2F;2024&#x2F;06&#x2F;14&#x2F;amazon-has-a-secret-way-to...</a><p><i>According to an internal memo obtained by Business Insider, Amazon’s AGI Group worked around Github&#x27;s 5,000 request&#x2F;hr&#x2F;account limits by &#x27;encouraging&#x27; its employees to create multiple GitHub accounts and share their access credentials. By leveraging a network of accounts simultaneously, Amazon aims to condense what would have been a multi-year endeavor into a matter of weeks.</i><p>Dataconomy: <i>The ethical implications are significant. By soliciting employees to share personal GitHub accounts, Amazon is potentially accessing data without explicit consent from GitHub or the repository owners.</i>
评论 #40686724 未加载