TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: Cyber-forensics for public PCs using ML to detect hate speech, threats

4 点作者 allisterb超过 6 年前
This is a digital forensics project I&#x27;ve been working on called OLAF which is designed to provide forensic evidence in investigations of using library or other public-facing PCs to send hate speech, violent threats, &quot;swatting&quot;, IP exfiltration or infiltration, or to commit any other type of computer-enabled crimes or malfeasance. The project uses OCR, NLP, and computer vision to create high level logs of the PC&#x27;s interactive usage that can be used in investigation or detection of dangerous or criminal activity.<p>Demo vid 1: <a href="https:&#x2F;&#x2F;youtu.be&#x2F;F6k2PC-7WQw" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;F6k2PC-7WQw</a><p>Demo vid 2: <a href="https:&#x2F;&#x2F;youtu.be&#x2F;-N2Qftl26Mw" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;-N2Qftl26Mw</a><p>Project repository: <a href="https:&#x2F;&#x2F;github.com&#x2F;allisterb&#x2F;OLAF" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;allisterb&#x2F;OLAF</a><p>The raw data OLAF uses is logged in disparate places in one form or another (like in your PC or ISP or web server logs) and OLAF just tries to do real-time analysis of the available data in one place. No PII is ever intentionally logged and it is up to the org to link PC user accounts and identities to people.<p>Of course privacy is a big concern here and the organizations using a tool like OLAF have to walk the line between protecting the privacy of their users and adhering to the relevant laws vs. being able to quickly detect and investigate potentially serious threats to people&#x27;s safety. Guidelines like those from NISO (<a href="https:&#x2F;&#x2F;www.niso.org&#x2F;publications&#x2F;privacy-principles" rel="nofollow">https:&#x2F;&#x2F;www.niso.org&#x2F;publications&#x2F;privacy-principles</a>) should be adhered to as much as possible.<p>Libraries currently used:<p>Tesseract.net: <a href="https:&#x2F;&#x2F;github.com&#x2F;tvncosine&#x2F;tesseract.net" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;tvncosine&#x2F;tesseract.net</a><p>VADERSharp: <a href="https:&#x2F;&#x2F;github.com&#x2F;codingupastorm&#x2F;vadersharp" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;codingupastorm&#x2F;vadersharp</a><p>Accord.NET: <a href="https:&#x2F;&#x2F;github.com&#x2F;Accord-net&#x2F;framework&#x2F;" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;Accord-net&#x2F;framework&#x2F;</a><p>Azure Cognitive Services Computer Vision API<p>Azure Cognitive Services Text Analytics API<p>Let me know what you guys think and any comments or suggestions (or criticism) you may have.

暂无评论

暂无评论