Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 1 | @node Implementation notes |
| 2 | @appendix Implementation notes |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 3 | |
| 4 | @menu |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 5 | * CPU emulation:: |
| 6 | * Translator Internals:: |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 7 | * QEMU compared to other emulators:: |
Igor Mammedov | 047f703 | 2018-05-11 19:24:43 +0200 | [diff] [blame] | 8 | * Managed start up options:: |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 9 | * Bibliography:: |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 10 | @end menu |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 11 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 12 | @node CPU emulation |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 13 | @section CPU emulation |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 14 | |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 15 | @menu |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 16 | * x86:: x86 and x86-64 emulation |
| 17 | * ARM:: ARM emulation |
| 18 | * MIPS:: MIPS emulation |
| 19 | * PPC:: PowerPC emulation |
| 20 | * SPARC:: Sparc32 and Sparc64 emulation |
| 21 | * Xtensa:: Xtensa emulation |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 22 | @end menu |
| 23 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 24 | @node x86 |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 25 | @subsection x86 and x86-64 emulation |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 26 | |
| 27 | QEMU x86 target features: |
| 28 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 29 | @itemize |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 30 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 31 | @item The virtual x86 CPU supports 16 bit and 32 bit addressing with segmentation. |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 32 | LDT/GDT and IDT are emulated. VM86 mode is also supported to run |
| 33 | DOSEMU. There is some support for MMX/3DNow!, SSE, SSE2, SSE3, SSSE3, |
| 34 | and SSE4 as well as x86-64 SVM. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 35 | |
| 36 | @item Support of host page sizes bigger than 4KB in user mode emulation. |
| 37 | |
| 38 | @item QEMU can emulate itself on x86. |
| 39 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 40 | @item An extensive Linux x86 CPU test program is included @file{tests/test-i386}. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 41 | It can be used to test other x86 virtual CPUs. |
| 42 | |
| 43 | @end itemize |
| 44 | |
| 45 | Current QEMU limitations: |
| 46 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 47 | @itemize |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 48 | |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 49 | @item Limited x86-64 support. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 50 | |
| 51 | @item IPC syscalls are missing. |
| 52 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 53 | @item The x86 segment limits and access rights are not tested at every |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 54 | memory access (yet). Hopefully, very few OSes seem to rely on that for |
| 55 | normal use. |
| 56 | |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 57 | @end itemize |
| 58 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 59 | @node ARM |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 60 | @subsection ARM emulation |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 61 | |
| 62 | @itemize |
| 63 | |
| 64 | @item Full ARM 7 user emulation. |
| 65 | |
| 66 | @item NWFPE FPU support included in user Linux emulation. |
| 67 | |
| 68 | @item Can run most ARM Linux binaries. |
| 69 | |
| 70 | @end itemize |
| 71 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 72 | @node MIPS |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 73 | @subsection MIPS emulation |
ths | 24d4de4 | 2007-07-11 10:24:28 +0000 | [diff] [blame] | 74 | |
| 75 | @itemize |
| 76 | |
| 77 | @item The system emulation allows full MIPS32/MIPS64 Release 2 emulation, |
| 78 | including privileged instructions, FPU and MMU, in both little and big |
| 79 | endian modes. |
| 80 | |
| 81 | @item The Linux userland emulation can run many 32 bit MIPS Linux binaries. |
| 82 | |
| 83 | @end itemize |
| 84 | |
| 85 | Current QEMU limitations: |
| 86 | |
| 87 | @itemize |
| 88 | |
| 89 | @item Self-modifying code is not always handled correctly. |
| 90 | |
| 91 | @item 64 bit userland emulation is not implemented. |
| 92 | |
| 93 | @item The system emulation is not complete enough to run real firmware. |
| 94 | |
ths | b1f4523 | 2007-07-12 09:03:30 +0000 | [diff] [blame] | 95 | @item The watchpoint debug facility is not implemented. |
| 96 | |
ths | 24d4de4 | 2007-07-11 10:24:28 +0000 | [diff] [blame] | 97 | @end itemize |
| 98 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 99 | @node PPC |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 100 | @subsection PowerPC emulation |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 101 | |
| 102 | @itemize |
| 103 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 104 | @item Full PowerPC 32 bit emulation, including privileged instructions, |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 105 | FPU and MMU. |
| 106 | |
| 107 | @item Can run most PowerPC Linux binaries. |
| 108 | |
| 109 | @end itemize |
| 110 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 111 | @node SPARC |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 112 | @subsection Sparc32 and Sparc64 emulation |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 113 | |
| 114 | @itemize |
| 115 | |
blueswir1 | f6b647c | 2007-04-05 18:40:23 +0000 | [diff] [blame] | 116 | @item Full SPARC V8 emulation, including privileged |
bellard | 3475187 | 2005-07-02 14:31:34 +0000 | [diff] [blame] | 117 | instructions, FPU and MMU. SPARC V9 emulation includes most privileged |
blueswir1 | a785e42 | 2007-10-20 08:09:05 +0000 | [diff] [blame] | 118 | and VIS instructions, FPU and I/D MMU. Alignment is fully enforced. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 119 | |
blueswir1 | a785e42 | 2007-10-20 08:09:05 +0000 | [diff] [blame] | 120 | @item Can run most 32-bit SPARC Linux binaries, SPARC32PLUS Linux binaries and |
| 121 | some 64-bit SPARC Linux binaries. |
bellard | 3475187 | 2005-07-02 14:31:34 +0000 | [diff] [blame] | 122 | |
| 123 | @end itemize |
| 124 | |
| 125 | Current QEMU limitations: |
| 126 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 127 | @itemize |
bellard | 3475187 | 2005-07-02 14:31:34 +0000 | [diff] [blame] | 128 | |
bellard | 3475187 | 2005-07-02 14:31:34 +0000 | [diff] [blame] | 129 | @item IPC syscalls are missing. |
| 130 | |
blueswir1 | 1f58732 | 2007-11-25 18:40:20 +0000 | [diff] [blame] | 131 | @item Floating point exception support is buggy. |
bellard | 3475187 | 2005-07-02 14:31:34 +0000 | [diff] [blame] | 132 | |
| 133 | @item Atomic instructions are not correctly implemented. |
| 134 | |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 135 | @item There are still some problems with Sparc64 emulators. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 136 | |
| 137 | @end itemize |
| 138 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 139 | @node Xtensa |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 140 | @subsection Xtensa emulation |
Max Filippov | 3aeaea6 | 2011-10-10 14:48:23 +0400 | [diff] [blame] | 141 | |
| 142 | @itemize |
| 143 | |
| 144 | @item Core Xtensa ISA emulation, including most options: code density, |
| 145 | loop, extended L32R, 16- and 32-bit multiplication, 32-bit division, |
Max Filippov | 044d003 | 2012-11-29 19:53:20 +0400 | [diff] [blame] | 146 | MAC16, miscellaneous operations, boolean, FP coprocessor, coprocessor |
| 147 | context, debug, multiprocessor synchronization, |
Max Filippov | 3aeaea6 | 2011-10-10 14:48:23 +0400 | [diff] [blame] | 148 | conditional store, exceptions, relocatable vectors, unaligned exception, |
| 149 | interrupts (including high priority and timer), hardware alignment, |
| 150 | region protection, region translation, MMU, windowed registers, thread |
| 151 | pointer, processor ID. |
| 152 | |
Max Filippov | 044d003 | 2012-11-29 19:53:20 +0400 | [diff] [blame] | 153 | @item Not implemented options: data/instruction cache (including cache |
| 154 | prefetch and locking), XLMI, processor interface. Also options not |
| 155 | covered by the core ISA (e.g. FLIX, wide branches) are not implemented. |
Max Filippov | 3aeaea6 | 2011-10-10 14:48:23 +0400 | [diff] [blame] | 156 | |
| 157 | @item Can run most Xtensa Linux binaries. |
| 158 | |
| 159 | @item New core configuration that requires no additional instructions |
| 160 | may be created from overlay with minimal amount of hand-written code. |
| 161 | |
| 162 | @end itemize |
| 163 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 164 | @node Translator Internals |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 165 | @section Translator Internals |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 166 | |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 167 | QEMU is a dynamic translator. When it first encounters a piece of code, |
| 168 | it converts it to the host instruction set. Usually dynamic translators |
| 169 | are very complicated and highly CPU dependent. QEMU uses some tricks |
| 170 | which make it relatively easily portable and simple while achieving good |
| 171 | performances. |
| 172 | |
Paolo Bonzini | bf28a69 | 2016-10-06 15:10:10 +0200 | [diff] [blame] | 173 | QEMU's dynamic translation backend is called TCG, for "Tiny Code |
| 174 | Generator". For more information, please take a look at @code{tcg/README}. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 175 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 176 | Some notable features of QEMU's dynamic translator are: |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 177 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 178 | @table @strong |
| 179 | |
| 180 | @item CPU state optimisations: |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 181 | The target CPUs have many internal states which change the way it |
| 182 | evaluates instructions. In order to achieve a good speed, the |
| 183 | translation phase considers that some state information of the virtual |
| 184 | CPU cannot change in it. The state is recorded in the Translation |
| 185 | Block (TB). If the state changes (e.g. privilege level), a new TB will |
| 186 | be generated and the previous TB won't be used anymore until the state |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 187 | matches the state recorded in the previous TB. The same idea can be applied |
| 188 | to other aspects of the CPU state. For example, on x86, if the SS, |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 189 | DS and ES segments have a zero base, then the translator does not even |
| 190 | generate an addition for the segment base. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 191 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 192 | @item Direct block chaining: |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 193 | After each translated basic block is executed, QEMU uses the simulated |
Gonglei | d274e07 | 2015-07-03 17:50:57 +0800 | [diff] [blame] | 194 | Program Counter (PC) and other cpu state information (such as the CS |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 195 | segment base value) to find the next basic block. |
| 196 | |
| 197 | In order to accelerate the most common cases where the new simulated PC |
| 198 | is known, QEMU can patch a basic block so that it jumps directly to the |
| 199 | next one. |
| 200 | |
| 201 | The most portable code uses an indirect jump. An indirect jump makes |
| 202 | it easier to make the jump target modification atomic. On some host |
| 203 | architectures (such as x86 or PowerPC), the @code{JUMP} opcode is |
| 204 | directly patched so that the block chaining has no overhead. |
| 205 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 206 | @item Self-modifying code and translated code invalidation: |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 207 | Self-modifying code is a special challenge in x86 emulation because no |
| 208 | instruction cache invalidation is signaled by the application when code |
| 209 | is modified. |
| 210 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 211 | User-mode emulation marks a host page as write-protected (if it is |
| 212 | not already read-only) every time translated code is generated for a |
| 213 | basic block. Then, if a write access is done to the page, Linux raises |
| 214 | a SEGV signal. QEMU then invalidates all the translated code in the page |
| 215 | and enables write accesses to the page. For system emulation, write |
| 216 | protection is achieved through the software MMU. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 217 | |
| 218 | Correct translated code invalidation is done efficiently by maintaining |
| 219 | a linked list of every translated block contained in a given page. Other |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 220 | linked lists are also maintained to undo direct block chaining. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 221 | |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 222 | On RISC targets, correctly written software uses memory barriers and |
| 223 | cache flushes, so some of the protection above would not be |
| 224 | necessary. However, QEMU still requires that the generated code always |
| 225 | matches the target instructions in memory in order to handle |
| 226 | exceptions correctly. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 227 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 228 | @item Exception support: |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 229 | longjmp() is used when an exception such as division by zero is |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 230 | encountered. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 231 | |
| 232 | The host SIGSEGV and SIGBUS signal handlers are used to get invalid |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 233 | memory accesses. QEMU keeps a map from host program counter to |
| 234 | target program counter, and looks up where the exception happened |
| 235 | based on the host program counter at the exception point. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 236 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 237 | On some targets, some bits of the virtual CPU's state are not flushed to the |
| 238 | memory until the end of the translation block. This is done for internal |
| 239 | emulation state that is rarely accessed directly by the program and/or changes |
| 240 | very often throughout the execution of a translation block---this includes |
| 241 | condition codes on x86, delay slots on SPARC, conditional execution on |
| 242 | ARM, and so on. This state is stored for each target instruction, and |
| 243 | looked up on exceptions. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 244 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 245 | @item MMU emulation: |
| 246 | For system emulation QEMU uses a software MMU. In that mode, the MMU |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 247 | virtual to physical address translation is done at every memory |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 248 | access. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 249 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 250 | QEMU uses an address translation cache (TLB) to speed up the translation. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 251 | In order to avoid flushing the translated code each time the MMU |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 252 | mappings change, all caches in QEMU are physically indexed. This |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 253 | means that each basic block is indexed with its physical address. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 254 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 255 | In order to avoid invalidating the basic block chain when MMU mappings |
| 256 | change, chaining is only performed when the destination of the jump |
| 257 | shares a page with the basic block that is performing the jump. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 258 | |
Paolo Bonzini | 36e4970 | 2016-10-06 16:25:12 +0200 | [diff] [blame] | 259 | The MMU can also distinguish RAM and ROM memory areas from MMIO memory |
| 260 | areas. Access is faster for RAM and ROM because the translation cache also |
| 261 | hosts the offset between guest address and host memory. Accessing MMIO |
| 262 | memory areas instead calls out to C code for device emulation. |
| 263 | Finally, the MMU helps tracking dirty pages and pages pointed to by |
| 264 | translation blocks. |
| 265 | @end table |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 266 | |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 267 | @node QEMU compared to other emulators |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 268 | @section QEMU compared to other emulators |
Paolo Bonzini | 77d47e1 | 2016-10-06 16:49:03 +0200 | [diff] [blame] | 269 | |
| 270 | Like bochs [1], QEMU emulates an x86 CPU. But QEMU is much faster than |
| 271 | bochs as it uses dynamic compilation. Bochs is closely tied to x86 PC |
| 272 | emulation while QEMU can emulate several processors. |
| 273 | |
| 274 | Like Valgrind [2], QEMU does user space emulation and dynamic |
| 275 | translation. Valgrind is mainly a memory debugger while QEMU has no |
| 276 | support for it (QEMU could be used to detect out of bound memory |
| 277 | accesses as Valgrind, but it has no support to track uninitialised data |
| 278 | as Valgrind does). The Valgrind dynamic translator generates better code |
| 279 | than QEMU (in particular it does register allocation) but it is closely |
| 280 | tied to an x86 host and target and has no support for precise exceptions |
| 281 | and system emulation. |
| 282 | |
| 283 | EM86 [3] is the closest project to user space QEMU (and QEMU still uses |
| 284 | some of its code, in particular the ELF file loader). EM86 was limited |
| 285 | to an alpha host and used a proprietary and slow interpreter (the |
| 286 | interpreter part of the FX!32 Digital Win32 code translator [4]). |
| 287 | |
| 288 | TWIN from Willows Software was a Windows API emulator like Wine. It is less |
| 289 | accurate than Wine but includes a protected mode x86 interpreter to launch |
| 290 | x86 Windows executables. Such an approach has greater potential because most |
| 291 | of the Windows API is executed natively but it is far more difficult to |
| 292 | develop because all the data structures and function parameters exchanged |
| 293 | between the API and the x86 code must be converted. |
| 294 | |
| 295 | User mode Linux [5] was the only solution before QEMU to launch a |
| 296 | Linux kernel as a process while not needing any host kernel |
| 297 | patches. However, user mode Linux requires heavy kernel patches while |
| 298 | QEMU accepts unpatched Linux kernels. The price to pay is that QEMU is |
| 299 | slower. |
| 300 | |
| 301 | The Plex86 [6] PC virtualizer is done in the same spirit as the now |
| 302 | obsolete qemu-fast system emulator. It requires a patched Linux kernel |
| 303 | to work (you cannot launch the same kernel on your PC), but the |
| 304 | patches are really small. As it is a PC virtualizer (no emulation is |
| 305 | done except for some privileged instructions), it has the potential of |
| 306 | being faster than QEMU. The downside is that a complicated (and |
| 307 | potentially unsafe) host kernel patch is needed. |
| 308 | |
| 309 | The commercial PC Virtualizers (VMWare [7], VirtualPC [8]) are faster |
| 310 | than QEMU (without virtualization), but they all need specific, proprietary |
| 311 | and potentially unsafe host drivers. Moreover, they are unable to |
| 312 | provide cycle exact simulation as an emulator can. |
| 313 | |
| 314 | VirtualBox [9], Xen [10] and KVM [11] are based on QEMU. QEMU-SystemC |
| 315 | [12] uses QEMU to simulate a system where some hardware devices are |
| 316 | developed in SystemC. |
| 317 | |
Igor Mammedov | 047f703 | 2018-05-11 19:24:43 +0200 | [diff] [blame] | 318 | @node Managed start up options |
| 319 | @section Managed start up options |
| 320 | |
| 321 | In system mode emulation, it's possible to create a VM in a paused state using |
| 322 | the -S command line option. In this state the machine is completely initialized |
| 323 | according to command line options and ready to execute VM code but VCPU threads |
| 324 | are not executing any code. The VM state in this paused state depends on the way |
| 325 | QEMU was started. It could be in: |
| 326 | @table @asis |
| 327 | @item initial state (after reset/power on state) |
| 328 | @item with direct kernel loading, the initial state could be amended to execute |
| 329 | code loaded by QEMU in the VM's RAM and with incoming migration |
| 330 | @item with incoming migration, initial state will by amended with the migrated |
| 331 | machine state after migration completes. |
| 332 | @end table |
| 333 | |
| 334 | This paused state is typically used by users to query machine state and/or |
| 335 | additionally configure the machine (by hotplugging devices) in runtime before |
| 336 | allowing VM code to run. |
| 337 | |
| 338 | However, at the -S pause point, it's impossible to configure options that affect |
Markus Armbruster | 361ac94 | 2018-07-05 11:14:02 +0200 | [diff] [blame] | 339 | initial VM creation (like: -smp/-m/-numa ...) or cold plug devices. The |
| 340 | experimental --preconfig command line option allows pausing QEMU |
| 341 | before the initial VM creation, in a ``preconfig'' state, where additional |
Igor Mammedov | 047f703 | 2018-05-11 19:24:43 +0200 | [diff] [blame] | 342 | queries and configuration can be performed via QMP before moving on to |
| 343 | the resulting configuration startup. In the preconfig state, QEMU only allows |
| 344 | a limited set of commands over the QMP monitor, where the commands do not |
| 345 | depend on an initialized machine, including but not limited to: |
| 346 | @table @asis |
| 347 | @item qmp_capabilities |
| 348 | @item query-qmp-schema |
| 349 | @item query-commands |
| 350 | @item query-status |
Markus Armbruster | 361ac94 | 2018-07-05 11:14:02 +0200 | [diff] [blame] | 351 | @item x-exit-preconfig |
Igor Mammedov | 047f703 | 2018-05-11 19:24:43 +0200 | [diff] [blame] | 352 | @end table |
Igor Mammedov | 047f703 | 2018-05-11 19:24:43 +0200 | [diff] [blame] | 353 | |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 354 | @node Bibliography |
Paolo Bonzini | 78e8779 | 2016-10-06 16:12:11 +0200 | [diff] [blame] | 355 | @section Bibliography |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 356 | |
| 357 | @table @asis |
| 358 | |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 359 | @item [1] |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 360 | @url{http://bochs.sourceforge.net/}, the Bochs IA-32 Emulator Project, |
| 361 | by Kevin Lawton et al. |
| 362 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 363 | @item [2] |
| 364 | @url{http://www.valgrind.org/}, Valgrind, an open-source memory debugger |
| 365 | for GNU/Linux. |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 366 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 367 | @item [3] |
| 368 | @url{http://ftp.dreamtime.org/pub/linux/Linux-Alpha/em86/v0.2/docs/em86.html}, |
| 369 | the EM86 x86 emulator on Alpha-Linux. |
| 370 | |
| 371 | @item [4] |
bellard | debc706 | 2006-04-30 21:58:41 +0000 | [diff] [blame] | 372 | @url{http://www.usenix.org/publications/library/proceedings/usenix-nt97/@/full_papers/chernoff/chernoff.pdf}, |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 373 | DIGITAL FX!32: Running 32-Bit x86 Applications on Alpha NT, by Anton |
| 374 | Chernoff and Ray Hookway. |
| 375 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 376 | @item [5] |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 377 | @url{http://user-mode-linux.sourceforge.net/}, |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 378 | The User-mode Linux Kernel. |
| 379 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 380 | @item [6] |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 381 | @url{http://www.plex86.org/}, |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 382 | The new Plex86 project. |
| 383 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 384 | @item [7] |
ths | 5fafdf2 | 2007-09-16 21:08:06 +0000 | [diff] [blame] | 385 | @url{http://www.vmware.com/}, |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 386 | The VMWare PC virtualizer. |
| 387 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 388 | @item [8] |
| 389 | @url{https://www.microsoft.com/download/details.aspx?id=3702}, |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 390 | The VirtualPC PC virtualizer. |
| 391 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 392 | @item [9] |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 393 | @url{http://virtualbox.org/}, |
| 394 | The VirtualBox PC virtualizer. |
| 395 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 396 | @item [10] |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 397 | @url{http://www.xen.org/}, |
| 398 | The Xen hypervisor. |
| 399 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 400 | @item [11] |
| 401 | @url{http://www.linux-kvm.org/}, |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 402 | Kernel Based Virtual Machine (KVM). |
| 403 | |
Thomas Huth | 8e9620a | 2015-09-25 11:38:36 +0200 | [diff] [blame] | 404 | @item [12] |
blueswir1 | 998a050 | 2008-10-09 18:52:04 +0000 | [diff] [blame] | 405 | @url{http://www.greensocs.com/projects/QEMUSystemC}, |
| 406 | QEMU-SystemC, a hardware co-simulator. |
| 407 | |
bellard | 1f67313 | 2004-04-04 15:21:17 +0000 | [diff] [blame] | 408 | @end table |