The second largest version of Wikipedia is written mostly by one bot

140 points by jxub, about 5 years ago

7 comments

4cao, about 5 years ago
This endeavor looks largely orthogonal to what the objectives of an online encyclopedia should be. Creating as many stub articles as possible and filling them with "formulaic, generic, and reusable templated sentences with spots for specific information" seems more like a recipe for an automated content farm than for "disseminating the sum of *human* knowledge."

It would be most interesting to know what the 148 active Cebuano Wikipedia users think of the 5,331,028 articles the bot created, ostensibly for them. Too bad nobody apparently cared to ask.

In particular, since Cebuano speakers are likely to be fluent in Tagalog and/or English as well, they can easily use one of the other Wikipedia editions too. Without the hyperactive bot, the much smaller Cebuano Wikipedia would arguably be more relevant, reflecting topics truly of interest to the community.

While the number of articles is a convenient way of comparing Wikipedia language editions, it only works as such to the extent that the articles are kept to a certain standard. It seems to me that what we are observing here is yet another example of the situation that when a measure becomes a target it ceases to be a good measure.
sings, about 5 years ago
I always thought it was a bit bizarre that different language editions of Wikipedia contain different information. It seems the focus should be more on translation than content creation. Maybe that isn’t practical with the current structure, but surely the aim should be a definitive knowledge graph rather than a disparate and unevenly duplicated set of articles. Just my two cents – I am sure many have put a lot of thought into how to best tackle this.
peterburkimsher, about 5 years ago
I discovered this in 2018, when comparing lists of languages supported by different software and the number of speakers.

https://peterburk.github.io/i2018n/#wikipedia

Having machine-translated content is powerful for SEO, but I don't know how practical that is for Cebuano. It would be nice for English to no longer be practically required for people to become computer literate.
tomrod, about 5 years ago
I like this because growth and progress of a knowledge base, regardless of language or hosting platform, is incremental and cumulative. Wikipedia shows this effectively in the English channel because it happened so quickly. But even the legacy encyclopedias did this through centuries. Whether a bot lays the groundwork from other reference points or dedicated humans do it is sort of immaterial, I think, because in the very long run this benefits the people who speak this language.

In an age where languages are dying with their last speakers, Visayan has done much to preserve their diversity -- although not a written/codified language, volunteers give radio broadcasts in the language, books are published in it (here the lack of codification shows by variance in spelling, verb conjugation, and sentence structure), and similar. Thank you to this Wikipedian for doing something to preserve a wonderful language (I mention in another comment I am fluent and miss the regular speaking of it).
tangoalpha, about 5 years ago
Clicking on random article on https://ceb.m.wikipedia.org/wiki/Espesyal:Random#/random, looks like every article is that of either a tree, or an animal, or an insect, or a place...
qwerty456127, about 5 years ago
So they mean to tell us "insignificant" facts and articles must be deleted?
brokensegue, about 5 years ago
Slightly pedantic but the largest "Wikipedia" (depending on how you define it) is http://wikidata.org/ and it's also primarily written by bots.