cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-10-28 07:33:09 +00:00

Author	SHA1	Message	Date
Bo Chen	2612a6df29	vmm: seccomp: Add seccomp filters for the vcpu worker thread Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-09-11 07:42:31 +02:00
Rob Bradford	15025d71b1	devices, vm-device: Move BusDevice and Bus into vm-device This removes the dependency of the pci crate on the devices crate which now only contains the device implementations themselves. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-09-10 09:35:38 +01:00
Samuel Ortiz	e5ce6dc43c	vmm: cpu: Warn if the guest is trying to access unregistered IO ranges Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-09-04 14:39:58 +02:00
Sebastien Boeuf	871138d5cc	vm-migration: Make snapshot() mutable There will be some cases where the implementation of the snapshot() function from the Snapshottable trait will require to modify some internal data, therefore we make this possible by updating the trait definition with snapshot(&mut self). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-08-25 16:43:10 +02:00
Michael Zhao	afc98a5ec9	vmm: Fix AArch64 clippy warnings of vmm and other crates Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-08-24 10:59:08 +02:00
Anatol Belski	eba42c392f	devices: acpi: Add UID to devices with common HID Some OS might check for duplicates and bail out, if it can't create a distinct mapping. According to ACPI 5.0 section 6.1.12, while _UID is optional, it becomes required when there are multiple devices with the same _HID. Signed-off-by: Anatol Belski <ab@php.net>	2020-08-14 08:52:02 +02:00
Wei Liu	d80e383dbb	arch: move test cases to vmm crate This saves us from adding a "kvm" feature to arch crate merely for the purpose of running tests. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-15 17:21:07 +02:00
Sebastien Boeuf	e10d9b13d4	arch, hypervisor, vmm: Patch CPUID subleaves to expose EPC sections The support for SGX is exposed to the guest through CPUID 0x12. KVM passes static subleaves 0 and 1 from the host to the guest, without needing any modification from the VMM itself. But SGX also relies on dynamic subleaves 2 through N, used for describing each EPC section. This is not handled by KVM, which means the VMM is in charge of setting each subleaf starting from index 2 up to index N, depending on the number of EPC sections. These subleaves 2 through N are not listed as part of the supported CPUID entries from KVM. But it's important to set them as long as index 0 and 1 are present and indicate that SGX is supported. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-07-15 15:08:56 +02:00
Sebastien Boeuf	1603786374	vmm: Pass MemoryManager through CpuManager creation Instead of passing the GuestMemoryMmap directly to the CpuManager upon its creation, it's better to pass a reference to the MemoryManager. This way we will be able to know if SGX EPC region along with one or multiple sections are present. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-07-15 15:08:56 +02:00
Wei Liu	a4f484bc5e	hypervisor: Define a VM-Exit abstraction In order to move the hypervisor specific parts of the VM exit handling path, we're defining a generic, hypervisor agnostic VM exit enum. This is what the hypervisor's Vcpu run() call should return when the VM exit can not be completely handled through the hypervisor specific bits. For KVM based hypervisors, this means directly forwarding the IO related exits back to the VMM itself. For other hypervisors that e.g. rely on the VMM to decode and emulate instructions, this means the decoding itself would happen in the hypervisor crate exclusively, and the rest of the VM exit handling would be handled through the VMM device model implementation. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com> Fix test_vm unit test by using the new abstraction and dropping some dead code. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-06 12:59:43 +01:00
Samuel Ortiz	3db4c003a3	vmm: cpu: Rename fd variable into something more meaningful The fd naming is quite KVM specific. Since we're now using the hypervisor crate abstractions, we can rename those into something more readable and meaningful. Like e.g. vcpu or vm. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-07-06 09:35:30 +01:00
Samuel Ortiz	618722cdca	hypervisor: cpu: Rename state getter and setter vcpu.{set_}cpu_state() is a stutter. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-07-06 09:35:30 +01:00
Sebastien Boeuf	f6eeba781b	vmm: Save and restore vCPU states during pause/resume operations We need consistency between pause/resume and snapshot/restore operations. The symmetrical behavior of pausing/snapshotting and restoring/resuming has been introduced recently, and we must now ensure that no matter if we're using pause/resume or snapshot/restore features, the resulting VM should be running in the exact same way. That's why the vCPU state is now stored upon VM pausing. The snapshot operation being a simple serialization of the previously saved state. The same way, the vCPU state is now restored upon VM resuming. The restore operation being a simple deserialization of the previously restored state. It's interesting to note that this patch ensures time consistency from a guest perspective, no matter which clocksource is being used. From a previous patch, the KVM clock was saved/restored upon VM pause/resume. We now have the same behavior for TSC, as the TSC from the vCPUs are saved/restored upon VM pause/resume too. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-25 12:01:34 +02:00
Sebastien Boeuf	18e7d7a1f7	vmm: cpu: Resume before shutdown in a specific way Instead of calling the resume() function from the CpuManager, which involves more than what is needed from the shutdown codepath, and potentially ends up with a deadlock, we replace it with a subset. The full resume operation is reserved for a VM that has been paused. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-25 12:01:34 +02:00
Sebastien Boeuf	65132fb99d	vmm: Implement Pausable trait for Vcpu We want each Vcpu to store the vCPU state upon VM pausing. This is the reason why we need to explicitly implement the Pausable trait for the Vcpu structure. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-25 12:01:34 +02:00
Sebastien Boeuf	4a81d65f79	vmm: Notify the guest about vCPUs being paused Through the newly added API notify_guest_clock_paused(), this patch improves the vCPU pause operation by letting the guest know that each vCPU is being paused. This is important to avoid soft lockups detection from the guest that could happen because the VM has been paused for more than 20 seconds. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-24 12:38:56 +02:00
Sebastien Boeuf	9fa8438063	vmm: Fill CpuManager's vCPU list on restore path It's important that on restore path, the CpuManager's vCPU list gets filled with each new vCPU that is being created. In order to cover both boot and restore paths, the list is being filled from the common function create_vcpu(). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-24 12:38:56 +02:00
Rob Bradford	4b64f2a027	vmm: cpu: Reuse already allocated vCPUs if available When a request is made to increase the number of vCPUs in the VM attempt to reuse any previously removed (and hence inactive) vCPUs before creating new ones. This ensures that the APIC ID is not reused for a different KVM vCPU (which is not allowed) and that the APIC IDs are also sequential. The two key changes to support this are: * Clearing the "kill" bit on the old vCPU state so that it does not immediately exit upon thread recreation. * Using the length of the vcpus vector (the number of allocated vcpus) rather than the number of active vCPUs (.present_vcpus()) to determine how many should be created. This change also introduced some new info!() debugging on the vCPU creation/removal path to aid further development in the future. TEST=Expanded test_cpu_hotplug test. Fixes: #1338 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-23 14:11:14 +01:00
Rob Bradford	9dcd0c37f3	vmm: cpu: Clear the "kill" flag on vCPU to support reuse After the vCPU has been ejected and the thread shutdown it is useful to clear the "kill" flag so that if the vCPU is reused it does not immediately exit upon thread recreation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-23 14:11:14 +01:00
Rob Bradford	b107bfcf2c	vmm: cpu: Add info!() level debugging to vCPU handling These messages are intended to be useful to support debugging related to vCPU hotplug/unplug issues. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-23 14:11:14 +01:00
Sebastien Boeuf	a16414dc87	vmm: Restore vCPUs in "paused" state To follow a symmetrical model, and avoid potential race conditions, it's important to restore a previously snapshot VM in a "paused" state. The snapshot operation being valid only if the VM has been previously paused. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-23 10:15:03 +02:00
Muminul Islam	cca59bc52f	hypervisor, arch: Fix warnings introduced in hypervisor crate This commit fixes some warnings introduced in the previous hypervisor crate PR. Removed some unused variables from arch/aarch64 module. Signed-off-by: Muminul Islam <muislam@microsoft.com>	2020-06-22 21:58:45 +01:00
Rob Bradford	d714efe6d4	vmm: cpu: Import CpuTopology conditionally on x86_64 only The aarch64 build has no use for this structure at the moment. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-22 15:00:27 +01:00
Muminul Islam	e4dee57e81	arch, pci, vmm: Initial switch to the hypervisor crate Start moving the vmm, arch and pci crates to being hypervisor agnostic by using the hypervisor trait and abstractions. This is not a complete switch and there are still some remaining KVM dependencies. Signed-off-by: Muminul Islam <muislam@microsoft.com> Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-06-22 15:03:15 +02:00
Rob Bradford	a74c6fc14f	vmm, arch: x86_64: Fill the CPUID leaves with the topology There are two CPUID leaves for handling CPU topology, 0xb and 0x1f. The difference between the two is that the 0x1f leaf (Extended Topology Leaf) supports exposing multiple die packages. Fixes: #1284 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-17 12:18:09 +02:00
Rob Bradford	e19079782d	vmm, arch: x86_64: Set the APIC ID on the 0x1f CPUID leaf The extended topology leaf (0x1f) also needs to have the APIC ID (which is the KVM cpu ID) set. This mirrors the APIC ID set on the 0xb topology leaf Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-17 12:18:09 +02:00
Rob Bradford	b81bc77390	vmm: cpu: Save CpusConfig into CpuManager Rather than saving the individual parts into the CpuManager save the full struct as it now also contains the topology data. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-17 12:18:09 +02:00
Michael Zhao	97a1e5e1d2	vmm: Exit VMM event loop after guest shutdown for AArch64 X86 and AArch64 work in different ways to shut down a VM. X86 exits the VMM event loop through the ACPI device; AArch64 needs to exit from the CPU loop of a SystemEvent. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Michael Zhao	5cd1730bc4	vmm: Configure VM on AArch64 Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Michael Zhao	917219fa92	vmm: Enable VCPU for AArch64 Added MPIDR which is needed in system configuration. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Michael Zhao	b5f1c912d6	vmm: Enable memory manager for AArch64 Screened IO space as it is not available on AArch64. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Michael Zhao	eeeb45bbb9	vmm: Enable device manager for AArch64 Screened IO bus because it is not for AArch64. Enabled Serial, RTC and Virtio devices with MMIO transport option. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Michael Zhao	8f7dc73562	vmm: Move Vcpu::configure() to arch crate Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-03 11:27:29 +02:00
Michael Zhao	20cf21cd9d	vmm: Change booting process to cover AArch64 requirements Between X86 and AArch64, there is some difference in booting a VM: - X86_64 can setup IOAPIC before creating any VCPU. - AArch64 have to create VCPU's before creating GIC. The old process is: 1. load_kernel() load kernel binary configure system 2. activate_vcpus() create & start VCPU's So we need to separate "activate_vcpus" into "create_vcpus" and "activate_vcpus" (to start vcpus only). Setup GIC and create FDT between the 2 steps. The new procedure is: 1. load_kernel() load kernel binary (X86_64) configure system 2. create VCPU's 3. (AArch64) setup GIC 4. (AArch64) configure system 5. start VCPU's Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-03 11:27:29 +02:00
Michael Zhao	b32d3025f3	devices: Refactor IOAPIC to cover other architectures IOAPIC, a X86 specific interrupt controller, is referenced by device manager and CPU manager. To work with more architectures, a common type for all architectures is needed. This commit introduces trait InterruptController to provide architecture agnostic functions. Device manager and CPU manager can use it without caring what the underlying device is. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-05-26 11:09:19 +02:00
Michael Zhao	1befae872d	build: Fixed build errors and warnings on AArch64 This is a preparing commit to build and test CH on AArch64. All building issues were fixed, but no functionality was introduced. For X86, the logic of code was not changed at all. For ARM, the architecture specific part is still empty. And we applied some tricks to workaround lint warnings. But such code will be replaced later by other commits with real functionality. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-05-21 11:56:26 +01:00
Rob Bradford	12e00c0f45	vmm: cpu: Retry sending signals if necessary To avoid a race condition where the signal might "miss" the KVM_RUN ioctl(), instead repeatedly try sending a signal until the vCPU run is interrupted (as indicated by setting a new per-vCPU atomic). It is important to also clear this atomic when coming out of a paused state. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-07 09:00:14 +02:00
Rob Bradford	801e72ac6d	vmm: cpu: Unpause vCPU threads After setting the kill signal flag for the vCPU thread release the pause flag and unpark the threads. This ensures that the vCPU thread will wake up and check the kill signal flag if the VM is paused. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-07 09:00:14 +02:00
Rob Bradford	91a4a2581e	vmm: cpu: When coming out of the pause event check for a kill signal Rather than immediately entering the vCPU run() code check if the kill signal is set. This allows paused VMs to be shutdown. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-07 09:00:14 +02:00
Samuel Ortiz	86fcd19b8a	build: Initial musl support Fix all build failures and add musl to the GitHub workflows. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-29 17:57:01 +01:00
Samuel Ortiz	1ed357cf34	vmm: vm: Implement the Snapshottable trait By aggregating snapshots from the CpuManager, the MemoryManager and the DeviceManager, Vm implements the snapshot() function from the Snapshottable trait. And by restoring snapshots from the CpuManager, the MemoryManager and the DeviceManager, Vm implements the restore() function from the Snapshottable trait. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com> Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>	2020-04-07 12:26:10 +02:00
Yi Sun	50b3f008d1	vmm: cpu: Implement the Snapshottable trait Implement the Snapshottable trait for Vcpu, and then implement it for CpuManager. Note that CpuManager goes through the Snapshottable implementation of Vcpu for every vCPU in order to implement the Snapshottable trait for itself. Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com> Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-07 12:26:10 +02:00
Sebastien Boeuf	f787c409c4	vmm: cpu: Factorize vcpu starting code Anticipating the need for a slightly different function for restoring vCPUs, this patch factorizes most of the vCPU creation, so that it can be reused for migration purposes. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-07 12:26:10 +02:00
Cathy Zhang	722f9b6628	vmm: cpu: Get and set KVM vCPU state These two new helpers will be useful to capture a vCPU state and being able to restore it at a later time. Signed-off-by: Cathy Zhang <cathy.zhang@intel.com> Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-07 12:26:10 +02:00
Cathy Zhang	13756490b5	vmm: cpu: Track all Vcpus through CpuManager In anticipation for the CpuManager to aggregate all Vcpu snapshots together, this change makes sure the CpuManager has a handle onto every vCPU. Signed-off-by: Cathy Zhang <cathy.zhang@intel.com> Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-07 12:26:10 +02:00
Samuel Ortiz	0646a90626	vmm: cpu: Pass CpusConfig to simplify the new() prototype Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-03 18:05:18 +01:00
Samuel Ortiz	164e810069	vmm: cpu: Move CPUID patching to CpuManager Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-03 18:05:18 +01:00
Samuel Ortiz	1b1a2175ca	vm-migration: Define the Snapshottable and Transportable traits A Snapshottable component can snapshot itself and provide a MigrationSnapshot payload as a result. A MigrationSnapshot payload is a map of component IDs to a list of migration sections (MigrationSection). As a component can be made of several Migratable sub-components (e.g. the DeviceManager and its device objects), a migration snapshot can itself be made of multiple snapshots. A snapshot is a list of migration sections, each section being a component state snapshot. Having multiple sections allows for easier and backward compatible migration payload extensions. Once created, a migratable component snapshot may be transported and this is what the Transportable trait defines, through 2 methods: send and recv. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com> Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>	2020-04-02 13:24:25 +01:00
Alejandro Jimenez	840a9a97ff	pvh: Initialize vCPU regs/sregs for PVH boot Set the initial values of the KVM vCPU registers as specified in the PVH boot ABI: https://xenbits.xen.org/docs/unstable/misc/pvh.html Signed-off-by: Alejandro Jimenez <alejandro.j.jimenez@oracle.com>	2020-03-13 18:29:44 +01:00
Alejandro Jimenez	24f0e42e6a	pvh: Introduce EntryPoint struct In order to properly initialize the kvm regs/sregs structs for the guest, the load_kernel() return type must specify which boot protocol to use with the entry point address it returns. Make load_kernel() return an EntryPoint struct containing the required information. This structure will later be used in the vCPU configuration methods to setup the appropriate initial conditions for the guest. Signed-off-by: Alejandro Jimenez <alejandro.j.jimenez@oracle.com>	2020-03-13 18:29:44 +01:00
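
The vCPU reuse behaviour described in commits 4b64f2a027 and 9dcd0c37f3 can be illustrated with a minimal sketch. The types and the resize() method below (VcpuSlot, CpuManagerSketch) are hypothetical stand-ins invented for illustration, not the actual cloud-hypervisor code. The slot index plays the role of the APIC ID, and the sketch assumes vCPUs are always ejected from the highest index down.

```rust
/// Hypothetical, simplified per-vCPU slot (not the real cloud-hypervisor type).
#[derive(Debug, Default)]
struct VcpuSlot {
    /// True while the vCPU thread is meant to be running.
    active: bool,
    /// Set to ask the vCPU thread to exit; must be cleared before reuse.
    kill: bool,
}

/// Hypothetical manager that owns every slot ever allocated.
#[derive(Debug, Default)]
struct CpuManagerSketch {
    /// Index in this vector stands in for the APIC ID.
    vcpus: Vec<VcpuSlot>,
}

impl CpuManagerSketch {
    /// Number of currently active vCPUs.
    fn present_vcpus(&self) -> usize {
        self.vcpus.iter().filter(|v| v.active).count()
    }

    /// Grow or shrink to `desired` active vCPUs.
    /// Returns how many brand-new slots had to be allocated.
    fn resize(&mut self, desired: usize) -> usize {
        let present = self.present_vcpus();
        let mut created = 0;
        if desired < present {
            // Eject the highest-numbered active vCPUs, keeping their slots.
            for slot in &mut self.vcpus[desired..present] {
                slot.kill = true;
                slot.active = false;
            }
        } else {
            for id in present..desired {
                if id < self.vcpus.len() {
                    // Reuse a previously removed slot: clear the "kill" flag
                    // so the recreated thread does not exit immediately.
                    let slot = &mut self.vcpus[id];
                    slot.kill = false;
                    slot.active = true;
                } else {
                    // Allocate only once the vector length (not the active
                    // count) has been exhausted, keeping IDs sequential.
                    self.vcpus.push(VcpuSlot { active: true, kill: false });
                    created += 1;
                }
            }
        }
        created
    }
}
```

Comparing against the vector length rather than present_vcpus() is what keeps an already-used ID from being handed to a different vCPU in this sketch.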

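The per-architecture boot ordering listed in commit 20cf21cd9d can be written down as a small sketch. The Arch and BootStep enums and the boot_steps() function are hypothetical names introduced here for illustration only; the step order itself is taken from the "new procedure" in that commit message.

```rust
/// Hypothetical architecture selector (illustration only).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Arch {
    X86_64,
    AArch64,
}

/// Hypothetical names for the boot steps listed in the commit message.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum BootStep {
    LoadKernel,
    ConfigureSystem,
    CreateVcpus,
    SetupGic,
    StartVcpus,
}

/// Returns the boot steps in the order the commit message describes.
fn boot_steps(arch: Arch) -> Vec<BootStep> {
    let mut steps = vec![BootStep::LoadKernel];
    if arch == Arch::X86_64 {
        // x86_64 configures the system as part of load_kernel().
        steps.push(BootStep::ConfigureSystem);
    }
    steps.push(BootStep::CreateVcpus);
    if arch == Arch::AArch64 {
        // AArch64 must create the vCPUs before the GIC, then configure the system.
        steps.push(BootStep::SetupGic);
        steps.push(BootStep::ConfigureSystem);
    }
    steps.push(BootStep::StartVcpus);
    steps
}
```

Splitting vCPU creation from vCPU start is what leaves room for the GIC setup and system configuration on AArch64 between the two.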

82 Commits