TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Where can I get a dump of modern emails for ML testing?

3 点作者 kilroy1238 个月前
I need a dataset of modern emails to test against.<p>Where can I get a huge sample of anonymized emails from? I&#x27;m really struggling to find this anywhere.

5 条评论

mindcrime8 个月前
I don&#x27;t know, but you might try asking one of these sub-reddits:<p><a href="https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;DHExchange&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;DHExchange&#x2F;</a><p><a href="https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;datasets" rel="nofollow">https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;datasets</a><p><a href="https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;opendata" rel="nofollow">https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;opendata</a>
geophph8 个月前
Is the Enron email dataset modern enough?
评论 #41633907 未加载
kilroy1238 个月前
The closest thing I&#x27;ve found to what I&#x27;m looking for is <a href="https:&#x2F;&#x2F;untroubled.org&#x2F;spam&#x2F;" rel="nofollow">https:&#x2F;&#x2F;untroubled.org&#x2F;spam&#x2F;</a><p>I need real HTML emails that an actual human would have in their inbox, not just spam.
评论 #41637697 未加载
aiaiaiaiaiai8 个月前
How ethical does this need to be?
whimsicalism8 个月前
depends how modern and whether you need many organizations to test against but wikileaks has lots