TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: We built an open source tool for journalists to inspect Wikipedia edits

131 点作者 psobot超过 10 年前

10 条评论

crazy2be超过 10 年前
This is awesome! I&#x27;ve wanted to build something like this for a while, but never gotten the chance :). Better tools like this are essential for helping users accept and reject the correct edits, and making it easier to investigate when reversions might be unjustified.<p>The only problem I noticed is that clicking the footnotes does not work, which makes it harder to evaluate the legitimacy of an edit (well, it just means I have to have the current article open as well).
评论 #8624892 未加载
maaaats超过 10 年前
A tip on the diffing:<p>We have done something similar when diffing HTML, e.g. replacing the HTML with single unicodes. And then we run the diff and get several diff-segments (EQUALS, INS, DEL). What we have done, is then to scan those for tags, and split them into a new type.<p>So an insert like INS(something \xE000 else) would become three <i>changes</i>. E.g. INS(something ) INS_TAG(\xE000) INS( else). So the INS_TAG shouldn&#x27;t be wrapped in &lt;ins&gt; when converting this back to HTML.
xxxyy超过 10 年前
I&#x27;m not sure if Wikipedia is OK with that, considering the load it generates [1]. I am currently doing some research on Wikipedia, and for my purposes I use the official dumps site at <a href="https://dumps.wikimedia.org/" rel="nofollow">https:&#x2F;&#x2F;dumps.wikimedia.org&#x2F;</a><p>[1] <a href="http://en.wikipedia.org/robots.txt" rel="nofollow">http:&#x2F;&#x2F;en.wikipedia.org&#x2F;robots.txt</a>
评论 #8625148 未加载
评论 #8627201 未加载
JonLim超过 10 年前
Oh boy, TWG on HN! ;) Great job with this Peter, playing around with it a bit more, but I&#x27;m intrigued.
aw3c2超过 10 年前
<i></i>You tried to access the address <a href="http://wikiwash.metronews.ca/" rel="nofollow">http:&#x2F;&#x2F;wikiwash.metronews.ca&#x2F;</a>, which is currently unavailable.<i></i><p>:(
评论 #8624134 未加载
tzm超过 10 年前
Thanks for releasing this. Any plans to support other sites? Perhaps an open source diffbot server?
allenguo超过 10 年前
Great write-up!
imaginenore超过 10 年前
wikiwash.metronews.ca doesn&#x27;t load
Edmontonian超过 10 年前
not a very useful project. Problem for journalists isn&#x27;t &quot;how to inspect edits&quot; but rather &quot;which edits on which entries should I inspect&quot;
评论 #8627208 未加载
ExpiredLink超过 10 年前
This is a surveillance tool so that &#x27;we&#x27; can monitor &#x27;their&#x27; activity and take appropriate measures. Wikipedia&#x27;s &quot;openly editable model&quot; in 2014.
评论 #8624414 未加载