TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: We built an open source tool for journalists to inspect Wikipedia edits

131 pointsby psobotover 10 years ago

10 comments

crazy2beover 10 years ago
This is awesome! I&#x27;ve wanted to build something like this for a while, but never gotten the chance :). Better tools like this are essential for helping users accept and reject the correct edits, and making it easier to investigate when reversions might be unjustified.<p>The only problem I noticed is that clicking the footnotes does not work, which makes it harder to evaluate the legitimacy of an edit (well, it just means I have to have the current article open as well).
评论 #8624892 未加载
maaaatsover 10 years ago
A tip on the diffing:<p>We have done something similar when diffing HTML, e.g. replacing the HTML with single unicodes. And then we run the diff and get several diff-segments (EQUALS, INS, DEL). What we have done, is then to scan those for tags, and split them into a new type.<p>So an insert like INS(something \xE000 else) would become three <i>changes</i>. E.g. INS(something ) INS_TAG(\xE000) INS( else). So the INS_TAG shouldn&#x27;t be wrapped in &lt;ins&gt; when converting this back to HTML.
xxxyyover 10 years ago
I&#x27;m not sure if Wikipedia is OK with that, considering the load it generates [1]. I am currently doing some research on Wikipedia, and for my purposes I use the official dumps site at <a href="https://dumps.wikimedia.org/" rel="nofollow">https:&#x2F;&#x2F;dumps.wikimedia.org&#x2F;</a><p>[1] <a href="http://en.wikipedia.org/robots.txt" rel="nofollow">http:&#x2F;&#x2F;en.wikipedia.org&#x2F;robots.txt</a>
评论 #8625148 未加载
评论 #8627201 未加载
JonLimover 10 years ago
Oh boy, TWG on HN! ;) Great job with this Peter, playing around with it a bit more, but I&#x27;m intrigued.
aw3c2over 10 years ago
<i></i>You tried to access the address <a href="http://wikiwash.metronews.ca/" rel="nofollow">http:&#x2F;&#x2F;wikiwash.metronews.ca&#x2F;</a>, which is currently unavailable.<i></i><p>:(
评论 #8624134 未加载
tzmover 10 years ago
Thanks for releasing this. Any plans to support other sites? Perhaps an open source diffbot server?
allenguoover 10 years ago
Great write-up!
imaginenoreover 10 years ago
wikiwash.metronews.ca doesn&#x27;t load
Edmontonianover 10 years ago
not a very useful project. Problem for journalists isn&#x27;t &quot;how to inspect edits&quot; but rather &quot;which edits on which entries should I inspect&quot;
评论 #8627208 未加载
ExpiredLinkover 10 years ago
This is a surveillance tool so that &#x27;we&#x27; can monitor &#x27;their&#x27; activity and take appropriate measures. Wikipedia&#x27;s &quot;openly editable model&quot; in 2014.
评论 #8624414 未加载