TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Detecting manuscripts and publications from paper mills

32 pointsby blopeurover 5 years ago

1 comment

DyslexicAtheistover 5 years ago
<i>&gt; Paper mills are believed to be fueled by unrealistic publication requirements or quotas, combined with monetary publication rewards to authors [2-5]. Where unrealistic publication requirements are widely applied over long periods of time, this could create a broad and growing base of paper mill clients. To both meet client demand and their capacity to pay, paper mills will likely aim to generate large numbers of manuscripts at minimum cost. This will likely require at least some falsified or fabricated data, as performing genuine experiments could render manuscripts unaffordable.<p>&gt; At the same time, externally supplied manuscripts should resemble genuine manuscripts if they are to be accepted for publication, so paper mills need to balance their requirements for efficiency and volume with the requirement that their manuscripts also appear to be genuine.</i><p>^^<p>the paper doesn&#x27;t say too much about the state of research around CS &#x2F; BigTech related topics. I have been browsing arxiv (and arxiv-sanity.com) for more than a decade and doing so on at least a weekly basis for work and to satisfy my private curiosity. My interest are mostly classic CompSci topics and Security. The genre seems increasingly polluted by papers which can only be described as BS research.<p>You can usually smell the bad with a few simple rules that help to know what to look for[1]. It increasingly feels like dumpster diving in the domains around AI&#x2F;ML. My partner who is in healthcare tells me that medicine is even worse whenever I complain (though I have no evidence for it other than anecdotal).<p>[1] <i>Hanson 1999: Efficient Reading of Papers in Science and Technology</i>: <a href="https:&#x2F;&#x2F;www.cs.columbia.edu&#x2F;~hgs&#x2F;netbib&#x2F;efficientReading.pdf" rel="nofollow">https:&#x2F;&#x2F;www.cs.columbia.edu&#x2F;~hgs&#x2F;netbib&#x2F;efficientReading.pdf</a><p>recent (published in the past month) random examples of what falls IMO into the category of BS papers (these aren&#x27;t outliers but are becoming the norm!):<p><i>P4-to-blockchain: A secure blockchain-enabled packet parser for software defined networking:</i> <a href="https:&#x2F;&#x2F;www.sciencedirect.com&#x2F;science&#x2F;article&#x2F;pii&#x2F;S0167404819301762" rel="nofollow">https:&#x2F;&#x2F;www.sciencedirect.com&#x2F;science&#x2F;article&#x2F;pii&#x2F;S016740481...</a><p><i>Security &amp; Privacy in IoT Using Machine Learning &amp; Blockchain: Threats &amp; Countermeasures</i> <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2002.03488v1" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2002.03488v1</a><p><i>REST: A thread embedding approach for identifying and classifying user-specified information in security forums</i> <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2001.02660v1" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2001.02660v1</a><p><i>Artificial Design: Modeling Artificial Super Intelligence with Extended General Relativity and Universal Darwinism via Geometrization for Universal Design Automation</i> <a href="https:&#x2F;&#x2F;openreview.net&#x2F;forum?id=SyxQ_TEFwS" rel="nofollow">https:&#x2F;&#x2F;openreview.net&#x2F;forum?id=SyxQ_TEFwS</a>
评论 #22388862 未加载