TechEcho

11 comments

tptacekabout 12 years ago

This is more or less the greatest thing I've learned about in the last couple years.What's happening here is that they're getting computation without executing any instructions, simply through the process of using the MMU hardware to "resolve addresses". The page directory system has been set up in such a way that address resolution effects a virtual machine that they can code to.This works because when you attempt to resolve an invalid address, the CPU generates a trap (#PF), and the handling of that trap pushes information on the "stack". Each time you push data to the stack, you decrement the stack pointer. Eventually, the stack pointer underflows; when that happens, a different trap (#DF) fires. This mechanism put together gives you:<pre><code> if x < 4 { goto b } else { x = x - 4 ; goto a } </code></pre> also known as "subtract and branch if less than or equal to zero", also known as "an instruction adequate to construct a one-instruction computer".The virtual machine "runs" by generating an unending series of traps: in the "goto a" case, the result of translation is another address generating a trap. And so on.The details of how this computer has "memory" and addresses instructions is even headachier. They're using the x86 TSS as "memory" and for technical reasons they get 16 slots (and thus instructions) to work with, but they have a compiler that builds arbitrary programs into 16-colored graphs to use those slots to express generic programs. Every emulator they could find crashes when they abuse the hardware task switching system this way.Here's it running Conway's Life:<a href="http://youtubedoubler.com/?video1=E2VCwBzGdPM&start1=0&video2=eSRcvrVs5ug&start2=0&authorName=FAV" rel="nofollow">http://youtubedoubler.com/?video1=E2VCwBzGdPM&start1=0&#...</a>Here's their talk for a few months back:<a href="http://www.youtube.com/watch?v=NGXvJ1GKBKM" rel="nofollow">http://www.youtube.com/watch?v=NGXvJ1GKBKM</a>The talk is great, but if you're not super interested in X86/X64 memory corruption countermeasures, you might want to skip the first 30 minutes.

评论 #5262039 未加载

评论 #5261835 未加载

评论 #5263305 未加载

jbangertabout 12 years ago

Author here: While it is true that with the current implementation, memory access is extremely limited (essentially one DWORD per page, or about 0.1% of the available physical RAM) that limitation can certainly be avoided. For one, you could shift how the TSS is aligned (and align them differently for different instructions), multiplying your address space by a factor of 10 or so. Furthermore, you could also place another TSS somewhere in memory (only a few of the variables need to actually contain sane values) with an invalid EIP and use that as a 'load' instruction.The easiest way however would be to use the TrapCC mechanism to transfer control between bits of normal assembler code (perhaps repurposed from other functions already in your kernel), doing something similar to ROP. Of course, for additional fun, feel free to throw in BX's Brainfuck interpreter in ELF and James Oakley's DWARF exception handler. We might drop a demo of this soon, i.e. implementing a self-decrypting binary via page faults.

评论 #5263070 未加载

networkedabout 12 years ago

>Move, Branch if Zero, DecrementThis is basically the canonical instruction for OISCs (one instruction set computers). Wikipedia describes it pretty well: <a href="https://en.wikipedia.org/wiki/One_instruction_set_computer#Subtract_and_branch_if_less_than_or_equal_to_zero" rel="nofollow">https://en.wikipedia.org/wiki/One_instruction_set_computer#S...</a>.

majkeabout 12 years ago

There was a talk on 29c3 about this. Abstract:<a href="https://events.ccc.de/congress/2012/Fahrplan/events/5265.en.html" rel="nofollow">https://events.ccc.de/congress/2012/Fahrplan/events/5265.en....</a>video:<a href="https://www.youtube.com/watch?v=NGXvJ1GKBKM" rel="nofollow">https://www.youtube.com/watch?v=NGXvJ1GKBKM</a>

codexabout 12 years ago

Another place for root kits to hide.

评论 #5261959 未加载

评论 #5262253 未加载

评论 #5261808 未加载

arsabout 12 years ago

How fast (slow) is this relative to the host CPU?

评论 #5263709 未加载

rocky1138about 12 years ago

This is really interesting. In a way, it's a form of computer self-replication. Could the virtual machine created by the computer be considered offspring?Is there a way the virtual machine might spawn another virtual machine child of its own?

ithkuilabout 12 years ago

if you like this kind of things there is also:<a href="http://www.cs.dartmouth.edu/~bx/elf-bf-tools/slides/ELF-berlinsides-0x3.pdf" rel="nofollow">http://www.cs.dartmouth.edu/~bx/elf-bf-tools/slides/ELF-berl...</a>

traxtechabout 12 years ago

That the hardware version of the brainfuck philosophy.

评论 #5267204 未加载

conductorabout 12 years ago

Expect this technique in the future malwares and software protection DRM systems for making code analyzing harder.

general_failureabout 12 years ago

somebody checked in vim backup files :-)

11 comments

tptacekabout 12 years ago

评论 #5262039 未加载

评论 #5261835 未加载

评论 #5263305 未加载

jbangertabout 12 years ago

评论 #5263070 未加载

networkedabout 12 years ago

majkeabout 12 years ago

codexabout 12 years ago

Another place for root kits to hide.

评论 #5261959 未加载

评论 #5262253 未加载

评论 #5261808 未加载

arsabout 12 years ago

How fast (slow) is this relative to the host CPU?

评论 #5263709 未加载

rocky1138about 12 years ago

ithkuilabout 12 years ago

traxtechabout 12 years ago

That the hardware version of the brainfuck philosophy.

评论 #5267204 未加载

conductorabout 12 years ago

Expect this technique in the future malwares and software protection DRM systems for making code analyzing harder.

general_failureabout 12 years ago

somebody checked in vim backup files :-)

X86 MMU fault handling is turing complete

11 comments

X86 MMU fault handling is turing complete

11 comments