Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Thank you.

On the same subject can someone recommend a book or any other resource to learn about virtual machine internals? My goal is to try to build a toy clone of VirtualBox/VMWare.

So far I have found one -- Virtual Machines by James E. Smith and Ravi Nair.



I work on virtual machines at Google. I usually suggest "Hardware and Software Support for Virtualization" [1] to new team members without a virtualization background.

[1] https://www.amazon.com/Hardware-Software-Virtualization-Synt...


This looks like a good read, thanks. I'm curious what your background is. How does one go about getting into that specialty at an org like Google? I've understood that Borg and GKE containers at Google generally always run in a VM. Is this where your work is(platform) or are you more research oriented?


Generally "systems-y software" is my background. I joined Google for a semi-experimental operating system project and from there it was a small jump to virtualization when I decided I was interested in doing something else. I'm definitely on the platform side, but been doing some interesting stuff recently :)


Thanks. Do you have any other recommendations for "systems-y software" books or resources you think would be helpful for a people pursuing similar roles?


> semi-experimental operating system project

Fuchsia? :)


"KVM host in a few lines of code" (https://zserge.com/posts/kvm/) is a fun article to get started with.


The sibling's comment book recommendation "Hardware and Software Support for Virtualization" book is on point and it's written by one of the co-founders of VMware.

Another book on Libvirt will be handy since it is the de facto API for most virtualization including VMs and containers[1].

[1]https://www.amazon.com/Foundations-Libvirt-Development-Maint...


For a really simple emulator project (not quite the level of VirtualBox), check the "IntCode" challenges from AdventOfCode 2019.


I believe this is the direct link to the mentioned challenges: https://adventofcode.com/2019


Those were so fun! I loved my little VM as it progressed and played pong, and commanded robots and rendered the output etc.

It's a really great fun way to learn the key concepts.


Hardware and Software Support for Virtualization Synthesis Lectures on Computer Architecture (2017)

https://www.morganclaypool.com/doi/abs/10.2200/S00754ED1V01Y...

Bringing Virtualization to the x86 Architecture with the Original VMware Workstation (2012)

https://dl.acm.org/doi/abs/10.1145/2382553.2382554


Fuzz week shows how to make make a snapshot / resettable jitting hypervisor.

https://m.youtube.com/playlist?list=PLSkhUfcCXvqHsOy2VUxuoAf...


9 lectures for a total of 59+ hours! Wow. Commitment required.


Yea, it's streaming so there are some debugging sessions in there, but the information is gold.


I've always been intrigued by virtual machines and emulation as well. I've always wanted to try and make an emulator of some kind. I don't know much about the internals of VirtualBox, but my suggestion would be to start "easy" with one CPU/Computer System/Game Console and go from there. That's what I finally did with the 6502 and Commodore 64.


Conventionally, one starts from the CHIP-8, which is indeed a virtual machine rather than a system in a strict sense.

What I've found difficult is the step beyond that. NES and GameBoy are typical steps, however, I've been very frustrated by the confusing documentation of the GameBoy. There are 3/4 references, but one of them has significant mistakes, while another is incomplete. On the other hand, the Pan Docs should be complete and accurate.

I'm not sure if there is an easy middle ground, that, at the same time, is also well documented.

The Atary 2600 is architecturally simpler but less documented, and also requires very accurate timings. I've read somebody suggesting systems like Channel F, Astrocade and Odyssey2, but I'm not sure they're well documented.

I've personally lost my interest once I've found that building an emulator was essentially fighting specifications rather than actually building something.


I built about a third of a NES emulator. The nesdev wiki is mostly decent, although there's a fair number of things where it seems like the first people to figure things out got stuff kind of backwards, and if you flip it, it's a lot easier, that's the sort of fighting the specifications I think you're talking about.

All that said, emulating the CPU was pretty fun. There's a CPU test rom out there you can run with tracing and compare to the published results. I also got the background tiling from the PPU done, but the foreground processing has a lot of steps, so I indefinitely paused for now. Also, I had amazingly poor performance, so I wasn't super motivated to continue.

The 2600 has a very similar cpu, but the very limited Stella output chip means most games are very timing dependent, which means you have to be super accurate, which adds difficulty. I think you should try to be cycle accurate anyway, but it's easy to mess that up, and having some freedom would be nice.


I did a GameBoy and similarly found the CPU enjoyable and the PPU a huge pain. Perhaps if I understood graphics better, I would have enjoyed it more, but like you say it just felt like a lot of steps.


I don't know if the GameBoy PPU has the background vs foreground split. The background processing was pretty reasonable, and once you got it kind of working, it was fun to debug and get it actually working. My favorite thing was when I was processing everything in the wrong order so the menu of the rom I was using to test had all the words backwards.

But the foreground / object sprites have this huge pipeline. IIRC, the PPU determines which sprites to draw in line X + 1 during line X. After that, it has to load the data for each object, etc etc. It was just discouraging. Plus since my frame rate is so low, I have to sit at a blank screen for quite some time waiting for the game to show anything, and longer for the demo to start (I don't have controls)...

Anyhow, glad I'm not alone ;)


A subset of CP/M calls is a pretty simple "rest of the system" to implement on top of an 8080/Z80 CPU emulation. (It's a bit of a cheat - like qemu's "Linux user mode emulation" or early version of DOSBox, because you restrict software to interacting with a high-level software interface, there are no lower-level details to aim for fidelity with)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: