Implementing a Virtual Machine in C

159 pointsby freefouranabout 10 years ago

15 comments

neverartfulabout 10 years ago

Contrary to the naysayers, I like seeing stuff like this. Why? Because it's a simple, gentle introduction. It's easily digestible for the newcomer. And it might be easy enough to encourage a newcomer to start building their own VM that goes on to be something real.For those that criticize it and find faults with it -- I'm sure the author would consider pull requests. Or you could provide your own fork with all the improvements that you believe are necessary.

评论 #9518233 未加载

jcofflandabout 10 years ago

I'd like to see such an article on a register based VM. Pawn and Lua are nice examples. Most VMs are stack based but this is mainly because they are conceptually easier to understand. Register based machines have some real advantages, like requiring far fewer instructions inside tight loops.

评论 #9517808 未加载

评论 #9517609 未加载

评论 #9517588 未加载

评论 #9517604 未加载

评论 #9517863 未加载

aftbitabout 10 years ago

I'm a bit disappointed - this VM doesn't have instructions for looping or branching, nor does it really use the registers in any way. I was hoping to read a writeup that introduced some concepts that were used in real (non-toy) systems.

评论 #9517798 未加载

评论 #9517463 未加载

评论 #9517399 未加载

评论 #9517410 未加载

评论 #9517466 未加载

评论 #9517314 未加载

评论 #9517464 未加载

donpdonpabout 10 years ago

New concepts are introduced at a satisfying pace. Each bit of code is explained thoroughly. Nice writeup.

earlzabout 10 years ago

I'm pretty sure everyone has wrote their own toy VMs, but I'll go ahead and throw mine out there. (well, 1 of the 3 I've wrote that I like best). It's called LightVM and is intended to be capable of running on tiny microcontrollers.The most cool thing I like about it is the opcodes and registers are extremely general purpose. So, to do a branch, you do `mov IP, label`, or even a "push.mv" instruction which when used against IP is basically the same as the usual "call" instruction, but can also be used with data registers to save a register to the stack and then set it to a value.I've found the hardest thing about making a VM isn't making a VM, but rather making the infrastructure around it (assembler, debugger, compilers, etc)<a href="https://bitbucket.org/earlz/lightvm/overview" rel="nofollow">https://bitbucket.org/earlz/lightvm/overview</a>

评论 #9518429 未加载

vbezhenarabout 10 years ago

For those who want to implement a VM as an exercise, I recommend to implement a simple JIT-compiler after that. You'll probably be impressed at performance improvements and it's funny exercise to do. I used GNU lightning to generate machine code.

评论 #9519081 未加载

emmanueloga_about 10 years ago

I am starting to sound like a broken record, but here it goes. If you want a more complete tutorial on writing stack based virtual machines, check "The Elements of Computing Systems" and its accompanying course, <a href="http://www.nand2tetris.org/" rel="nofollow">http://www.nand2tetris.org/</a>.The book teaches you to build:1) A CPU from basic electronics elements2) An assembler to generate machine code3) A bytecode VM that can be simulated and an assembler generator from the bytecode4) A basic programming language that generates bytecode5) An operating system using that language.I'm midway through building the Assembler and VM myself :-).

ameliusabout 10 years ago

This project is nice for educational purposes, but I wouldn't call it a VM, but instead a "bytecode interpreter".I think nowadays it is kind of a minimum requirement to have the intermediate code JIT-compiled (or at least compiled).I'm also missing a garbage collector, although that is not necessarily part of a VM (but often is). See NaCl for a counterexample. By the way, a project that I'd like to see is an efficient garbage collector implemented inside the VM, instead of as being part of the VM.

评论 #9520156 未加载

评论 #9519606 未加载

tjscanlonabout 10 years ago

For everyone who enjoyed this or wants to take it a step further, I recommend writing a CHIP-8 emulator. I used the following source: <a href="http://www.multigesture.net/articles/how-to-write-an-emulator-chip-8-interpreter/" rel="nofollow">http://www.multigesture.net/articles/how-to-write-an-emulato...</a> and it was very helpful.

ggambettaabout 10 years ago

For people looking for less "toy" implementations, I've written two emulators, an 8086 one and a Z80 one.There's libz80 (<a href="https://github.com/ggambetta/libz80" rel="nofollow">https://github.com/ggambetta/libz80</a>) which is (AFAIK) quite complete and correct but just a library, and the 8086 one (<a href="https://github.com/ggambetta/emulator-backed-remakes" rel="nofollow">https://github.com/ggambetta/emulator-backed-remakes</a>) which is incomplete and buggy but serves a much more interesting purpose :)

phodoabout 10 years ago

While seemingly simple, the simple non-turing example is not too far off from the (simple) Forth-like stack-based programming language found and executed in bitcoin transactions.<a href="https://en.bitcoin.it/wiki/Script" rel="nofollow">https://en.bitcoin.it/wiki/Script</a>

pjonesdotcaabout 10 years ago

C is not my thing so a few years ago trying to sort out how a VM works, I created a VM in Ruby.Practical? Not in the least. But, it was a good weekend's worth of fun.<a href="https://github.com/patrickjonesdotca/carban" rel="nofollow">https://github.com/patrickjonesdotca/carban</a>

bvanslykeabout 10 years ago

For a project that goes a bit deeper (branching, i/o, etc) consider writing a Chip8 simulator. There's lots of games written in chip8 bytecode to test with!

ternaryoperatorabout 10 years ago

I find these kinds of very basic intro articles frustrating. They till the same ground over and over: a tiny instruction set implemented with a switch statement. None of the more difficult issues are addressed: exception handling, linking to libraries or other programs written for the same VM, portability of programs across architectures, accessing the OS for services like file I/O, time, etc.-- All the things that make a toy not a toy.Every CS student in the world has written a toy VM just like this one.

评论 #9517856 未加载

评论 #9517785 未加载

评论 #9518024 未加载

评论 #9519122 未加载

评论 #9521170 未加载

评论 #9517779 未加载

jCanvasabout 10 years ago

I think the title is very misleading. This is not a virtual machine but an interpreter for a made up assembly language. There is nothing wrong with that and I am sure a beginner would find it very useful. But reading the title I was expecting something quite different.

评论 #9518494 未加载