科技回声

11 comments

infogulch, about 1 year ago

Have you tried to 'rediscover' classic (in)famous bugs? E.g. take an old version of OpenSSL vulnerable to Heartbleed and run Antithesis on it to 'discover' Heartbleed via fuzzing. It would be interesting to see how much fine-tuning would be needed to discover it.

mrkmarron, about 1 year ago

FYI, playing Super Mario with fuzzing (AFL) was done in a fun 2020 S&P paper. It also finds bugs and security issues. "IJON: Exploring Deep State Spaces via Fuzzing": https://casa.rub.de/fileadmin/img/Publikationen_PDFs/2020_IJON_Exploring_Deep_State_Spaces_via_Fuzzing_Publication_ClusterofExcellence_CASA_Bochum.pdf

wwilson, about 1 year ago

This is Will (I gave the talk linked in the post). Happy to answer any questions about this work, or how it generalizes to testing things that aren't Nintendo games.

infogulch, about 1 year ago

In Will's talk he defines two terms related to optimizing fuzzers [2]: Strategy and Tactics.

Strategy is the datum you choose to optimize for as the fuzzer randomly walks the states of the system. E.g. optimize to maximize Mario's X value, or optimize for reaching all tile positions etc. This generalizes the concept of "coverage guided" to include domain-specific details about your target program (e.g. that the program has the concept of a grid of possible positions).

Tactics is the choice of input distribution. Sometimes the frequency of the randomness should be tuned for the application. For example, randomly changing the state of the A button every frame is not a good frequency to properly test long jumps; maybe a normal distribution with an average hold/not-hold time of 1s would be better. Also, encoding the randomness within the program's valid domain can help avoid over-testing parsing/validation code at the expense of more interesting code further in the program. [1][2]

[0]: Barton P. Miller, Lars Fredriksen, and Bryan So. 1990. An empirical study of the reliability of UNIX utilities. Commun. ACM 33, 12 (Dec. 1990), 32–44. https://doi.org/10.1145/96267.96279

[1]: This reference appears to be related: Rohan Padhye, Caroline Lemieux, Koushik Sen, Laurent Simon, and Hayawardh Vijayakumar. 2019. FuzzFactory: domain-specific fuzzing with waypoints. Proc. ACM Program. Lang. 3, OOPSLA, Article 174 (October 2019), 29 pages. https://doi.org/10.1145/3360600

[2]: I introduce the concept of fuzzing in another comment: https://news.ycombinator.com/item?id=40068187#40071972
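For anyone who wants to poke at the strategy/tactics split described in the comment above, here is a minimal Python sketch. It is a toy under stated assumptions, not Antithesis's implementation and not code from the talk: the 1-D level, the pit positions, `MAX_JUMP`, and every function name are hypothetical. The strategy is "keep whichever input got the furthest x"; the two tactics are "flip the A button independently every frame" and "hold or release A for normally distributed run lengths" (mean 60 frames, standing in for roughly 1s at an assumed 60 fps).

```python
import random

# Hypothetical toy level: half-open [start, end) ranges of cells that are pits.
PITS = [(40, 55), (100, 120)]
MAX_JUMP = 30  # assumed cap on how many frames one jump can stay airborne


def in_pit(x):
    return any(start <= x < end for start, end in PITS)


def simulate(frames):
    """Walk right one cell per frame; return the x where the run ended."""
    x, air_time, prev = 0, 0, False
    for pressed in frames:
        if air_time > 0:
            # Stay airborne only while A is held and the jump is not exhausted.
            air_time = air_time + 1 if pressed and air_time < MAX_JUMP else 0
        elif pressed and not prev:
            air_time = 1  # a fresh press on the ground starts a jump
        prev = pressed
        x += 1
        if air_time == 0 and in_pit(x):
            return x  # on the ground inside a pit: the run ends here
    return x


def per_frame(rng, total_frames):
    """Tactic A: choose the A-button state independently on every frame."""
    return [rng.random() < 0.5 for _ in range(total_frames)]


def held_runs(rng, total_frames, mean_frames=60, stddev_frames=20):
    """Tactic B: alternate released/held runs with normally distributed lengths."""
    frames, pressed = [], False
    while len(frames) < total_frames:
        n = max(1, round(rng.gauss(mean_frames, stddev_frames)))
        frames.extend([pressed] * n)
        pressed = not pressed
    return frames[:total_frames]


def fuzz(rng, tactic, rounds=200, horizon=300):
    """Strategy: maximize x. Keep the best input, rewind a bit, re-randomize."""
    best = tactic(rng, horizon)
    best_x = simulate(best)
    for _ in range(rounds):
        # In this toy, x equals the number of frames played, so x is also
        # a frame index we can rewind to.
        cut = max(0, best_x - rng.randint(1, 40))
        candidate = best[:cut] + tactic(rng, horizon - cut)
        x = simulate(candidate)
        if x > best_x:
            best, best_x = candidate, x
    return best_x
```

Calling `fuzz(random.Random(0), per_frame)` and `fuzz(random.Random(0), held_runs)` side by side is one way to compare the two tactics under the same strategy. In this toy, clearing a 20-cell pit requires A to stay held for at least 20 consecutive frames, so the held-run tactic should have a much easier time, but the exact numbers depend on the seed and on the assumptions above.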

Taikonerd, about 1 year ago

Super Mario is such a fun example. Well chosen.

yanniszark, about 1 year ago

This is fascinating! I thought only Reinforcement Learning was doing things like this, but you're saying you can do this via fuzzing? What does this mean exactly? How is it able to learn to advance through all these levels? Is there an underlying learning mechanism at play?

t4ng0pwn3d, about 1 year ago

I see a lot of fuzzing tools for CLI apps, but are there any good alternatives for web applications/APIs? I've used Hypothesis for generating random data in requests, but maybe there's something better out there.

suprfnk, about 1 year ago

@wwilson How do you define the X/Y "distance" of a non-Mario application? I.e. any (distributed or not) system that doesn't have a relatively trivial "higher x/y is better" fitness function?

m3kw9, about 1 year ago

If you just read it, it sounds like a scam to some. Going through all states does not magically find you bugs. You need to know what a bug is first, or whether it's actually an intended feature. This article fails to explain that.

bbor, about 1 year ago

At what point can we start suing companies on behalf of the commons for taking words from the lexicon? I miss the days when this would be called “Wilson & Co.’s automated testing solution” instead of such a beautiful, philosophically meaningful word. Same thoughts on that Devin.AI scam taking the name “Cognition” and Vercel somehow bribing their way into claiming the “ai” name on NPM.

Technically awesome post tho! Love the heatmap esp. Maybe bring up changing your name to investors because some rando online doesn’t like it though, please.

m3kw9, about 1 year ago

Doesn’t explain how it finds bugs; it just had the AI play Mario Bros.

How Antithesis finds bugs
