科技回声

5 comments

gnabgib, over 1 year ago

The report (Identifying and Eliminating CSAM in Generative ML Training Data and Models)[0] that this guy is very slowly summarizing (and seems to largely agree with, despite the title) was discussed 3 days ago (38 points, 30 comments)[1].

[0]: https://purl.stanford.edu/kh752sm9123

[1]: https://news.ycombinator.com/item?id=38711135

Palmik, over 1 year ago

Tangential, but why didn't the OpenAssistant team (led by the author of the video) release the OpenAssistant dataset? As far as I know, the project was shut down, and only an initial, highly filtered version of the data was released. This dataset could be very valuable for the community that created it.

artninja1988, over 1 year ago

It's honestly pretty sad that at no time did the authors of this paper bother contacting LAION to remove the links and work together to develop better filters. It's also pretty interesting that one of the authors, David Thiel, calls himself the "AI censorship death star". Yannic is probably right that they aren't particularly interested in bettering open source diffusion models and are more in the walled garden camp.

mistrial9, over 1 year ago

Then why does IBM spend money producing this one? https://www.youtube.com/watch?v=y9k-U9AuDeM

terminous, over 1 year ago

Open source advocates: "With enough eyes, all bugs are shallow."

These researchers: "I see your project includes a non-zero amount of CSAM."

Open source advocates: "How dare you point out an issue? This is a hit piece!"

Another Hit Piece on Open-Source AI [video]