cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-12-29 00:55:18 +00:00

Author	SHA1	Message	Date
Wei Liu	6a8c0fc887	hypervisor: provide a generic FpuState structure Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-18 22:15:30 +01:00
Wei Liu	08135fa085	hypervisor: provide a generic CpudIdEntry structure Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-18 22:15:30 +01:00
Wei Liu	45fbf840db	hypervisor, vmm: move away from CpuId type CpuId is an alias type for the flexible array structure type over CpuIdEntry. The type itself and the type of the element in the array portion are tied to the underlying hypervisor. Switch to using CpuIdEntry slice or vector directly. The construction of CpuId type is left to hypervisors. This allows us to decouple CpuIdEntry from hypervisors more easily. No functional change intended. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-18 22:15:30 +01:00
Wei Liu	f1ab86fecb	hypervisor: x86: provide a generic SpecialRegisters structure Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-15 10:21:43 +01:00
Wei Liu	75797827d5	hypervisor: x86: provide a generic SegmentRegister structure And drop SegmentRegisterOps since it is no longer required. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-15 10:21:43 +01:00
Wei Liu	8b7781e267	hypervisor: x86: provide a generic StandardRegisters structure We only need to do this for x86 since MSHV does not have aarch64 support yet. This reduces unnecessary code churn. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-15 10:21:43 +01:00
Wei Liu	4201bf4011	hypervisor: provide a generic ClockData structure Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-14 22:09:04 +01:00
Wei Liu	beb4f86b82	hypervisor, vmm: drop VmState and code VmState was introduced to hold hypervisor specific VM state. KVM does not need it and MSHV does not really use it yet. Just drop the code. It can be easily revived once there is a need. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-14 22:09:04 +01:00
Alyssa Ross	a455917db5	vmm: fix missed API or debug events Previously, we were assuming that every time an eventfd notified us, there was only a single event waiting for us. This meant that if, while one API request was being processed, two more arrived, the second one would not be processed (until the next one arrived, when it would be processed instead of that event, and so on). To fix this, make sure we're processing the number of API and debug requests we've been told have arrived, rather than just one. This is easy to demonstrate by sending lots of API events and adding some sleeps to make sure multiple events can arrive while each is being processed. For other uses of eventfd, like the exit event, this doesn't matter — even if we've received multiple exit events in quick succession, we only need to exit once. So I've only made this change where receiving an event is non-idempotent, i.e. where it matters that we process the event the right number of times. Technically, reset requests are also non-idempotent — there's an observable difference between a VM resetting once, and a VM resetting once and then immediately resetting again. But I've left that alone for now because two resets in immediate succession doesn't sound like something anyone would ever want to me. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2022-07-14 17:44:11 +01:00
Michael Zhao	2d8635f04a	hypervisor: Refactor `system_registers` on AArch64 Function `system_registers` took mutable vector reference and modified the vector content. Now change the definition to `get/set` style. And rename to `get/set_sys_regs` to align with other functions. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-07-14 22:55:19 +08:00
Michael Zhao	c445513976	hypervisor: Refactor `core_registers` on AArch64 On AArch64, the function `core_registers` and `set_core_registers` are the same thing of `get/set_regs` on x86_64. Now the names are aligned. This will benefit supporting `gdb`. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-07-14 22:55:19 +08:00
Wei Liu	0e8769d76a	device_manager: assert passthrough_device has the correct type There is a lot of unsafe code in such a small function. Add an assert to help detect issues earlier. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-14 08:09:50 +01:00
Wei Liu	84bbaf06d1	hypervisor: turn boot_msr_entries into a trait method This allows dispatching to either KVM or MSHV automatically. No functional change. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-08 16:49:58 +01:00
Rob Bradford	121729a3b0	vmm: Split signal handling for VM and VMM signals The VM specific signal (currently only SIGWINCH) should only be handled when the VM is running. The generic VMM signals (SIGINT and SIGTERM) need handling at all times. Split the signal handling into two separate threads which have differing lifetimes. Tested by: 1.) Boot full VM and check resize handling (SIGWINCH) works & sending SIGTERM leads to cleanup (tested that API socket is removed.) 2.) Start without a VM and send SIGTERM/SIGINT and observe cleanup (API socket removed) 3.) Boot full VM, delete VM and observe 2.) holds. 4.) Boot full VM, delete VM, recreate VM and observe 1.) holds. Fixes: #4269 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-07-08 15:15:46 +01:00
Rob Bradford	93237f0106	vmm: Set MADT "Online Capable" flag The Linux kernel now checks for this before marking CPUs as hotpluggable: commit aa06e20f1be628186f0c2dcec09ea0009eb69778 Author: Mario Limonciello <mario.limonciello@amd.com> Date: Wed Sep 8 16:41:46 2021 -0500 x86/ACPI: Don't add CPUs that are not online capable A number of systems are showing "hotplug capable" CPUs when they are not really hotpluggable. This is because the MADT has extra CPU entries to support different CPUs that may be inserted into the socket with different numbers of cores. Starting with ACPI 6.3 the spec has an Online Capable bit in the MADT used to determine whether or not a CPU is hotplug capable when the enabled bit is not set. Link: https://uefi.org/htmlspecs/ACPI_Spec_6_4_html/05_ACPI_Software_Programming_Model/ACPI_Software_Programming_Model.html?#local-apic-flags Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-07-01 18:45:05 +01:00
Rob Bradford	adf5881757	build: #[allow(clippy::significant_drop_in_scrutinee) in some crates This check is new in the beta version of clippy and exists to avoid potential deadlocks by highlighting when the test in an if or for loop is something that holds a lock. In many cases we would need to make significant refactorings to be able to pass this check so disable in the affected crates. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-30 20:50:45 +01:00
Rob Bradford	b57d7b258d	build: Fix beta clippy issue (needless_return) warning: unneeded `return` statement --> pci/src/vfio_user.rs:627:13 \| 627 \| / return Err(std::io::Error::new( 628 \| \| std::io::ErrorKind::Other, 629 \| \| format!("Region not found for 0x{:x}", gpa), 630 \| \| )); \| \|_______________^ \| = note: `#[warn(clippy::needless_return)]` on by default = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_return help: remove `return` \| 627 ~ Err(std::io::Error::new( 628 + std::io::ErrorKind::Other, 629 + format!("Region not found for 0x{:x}", gpa), 630 + )) \| Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-30 20:50:45 +01:00
Rob Bradford	2716bc3311	build: Fix beta clippy issue (derive_partial_eq_without_eq) warning: you are deriving `PartialEq` and can implement `Eq` --> vmm/src/serial_manager.rs:59:30 \| 59 \| #[derive(Debug, Clone, Copy, PartialEq)] \| ^^^^^^^^^ help: consider deriving `Eq` as well: `PartialEq, Eq` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#derive_partial_eq_without_eq Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-30 20:50:45 +01:00
Rob Bradford	2e664dca64	vmm: Always reset the console mode on VMM exit Tested: 1. SIGTERM based 2. VM shutdown/poweroff 3. Injected VM boot failure after calling Vm::setup_tty() Fixes: #4248 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-28 16:45:27 +01:00
Rob Bradford	65ec6631fb	vmm: cpu: Store the vCPU snapshots in ascending order The snapshots are stored in a BTree which is ordered however as the ids are strings lexical ordering places "11" ahead of "2". So encode the vCPU id with zero padding so it is lexically sorted. This fixes issues with CPU restore on aarch64. See: #4239 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-27 16:20:57 +01:00
Wei Liu	bccd7c7e48	vmm: drop Sync+Send bounds for EndpointHandler Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-06-20 23:28:57 +01:00
Wei Liu	8fa1098629	vmm: switch from lazy_static to once_cell Once_cell does not require using macro and is slated to become part of Rust std at some point. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-06-20 16:03:07 +01:00
Sebastien Boeuf	335a4e1cc0	vmm: api: Expose kvm_hyperv parameter in OpenAPI description Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-17 15:11:53 +01:00
Sebastien Boeuf	81ba70a497	pci, vmm: Defer mapping VFIO MMIO regions on restore When restoring a VM, the restore codepath will take care of mapping the MMIO regions based on the information from the snapshot, rather than having the mapping being performed during device creation. When the device is created, information such as which BARs contain the MSI-X tables are missing, preventing to perform the mapping of the MMIO regions. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-09 09:19:58 +02:00
Sebastien Boeuf	7df7061610	pci, vmm: Add migratable support to vfio-user devices Based on recent changes to VfioUserPciDevice, the vfio-user devices can now be migrated. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-09 09:19:58 +02:00
Sebastien Boeuf	c021dda267	pci, vmm: Add migratable support to VFIO devices Based on recent changes to VfioPciDevice, the VFIO devices can now be migrated. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-09 09:19:58 +02:00
Rob Bradford	94fb9f817d	vmm: Fix clippy issues under "guest_debug" feature Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-08 11:40:56 +01:00
Michael Zhao	a7a15d56dd	aarch64: Move `setup_regs` to `hypervisor` `setup_regs` of AArch64 calls KVM sepecific code. Now move it to `hypervisor` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 11:07:46 +01:00
Sebastien Boeuf	65dc1c83a9	vmm: cpu: Save and restore CPU states during snapshot/restore Based on recent KVM host patches (merged in Linux 5.16), it's forbidden to call into KVM_SET_CPUID2 after the first successful KVM_RUN returned. That means saving CPU states during the pause sequence, and restoring these states during the resume sequence will not work with the current design starting with kernel version 5.16. In order to solve this problem, let's simply move the save/restore logic to the snapshot/restore sequences rather than the pause/resume ones. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-06 11:07:29 +01:00
Sebastien Boeuf	3edaa8adb6	vmm: Ensure restore matches boot sequence The vCPU is created and set after all the devices on a VM's boot. There's no reason to follow a different order on the restore codepath as this could cause some unexpected behaviors. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-06 11:07:17 +01:00
Michael Zhao	9260c3816e	vmm: Update unit test for GIC refactoring Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	5d45d6d0fb	vmm: Move GIC unit test to `hypervisor` crate Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	957d3a7443	aarch64: Simplify GIC related structs definition Combined the `GicDevice` struct in `arch` crate and the `Gic` struct in `devices` crate. After moving the KVM specific code for GIC in `arch`, a very thin wapper layer `GicDevice` was left in `arch` crate. It is easy to combine it with the `Gic` in `devices` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	04949755c0	arch: Switch to new GIC interface Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Rob Bradford	ade3a9c8f6	virtio-devices, vmm: Optimised async virtio device activation In order to ensure that the virtio device thread is spawned from the vmm thread we use an asynchronous activation mechanism for the virtio devices. This change optimises that code so that we do not need to iterate through all virtio devices on the platform in order to find the one that requires activation. We solve this by creating a separate short lived VirtioPciDeviceActivator that holds the required state for the activation (e.g. the clones of the queues) this can then be stored onto the device manager ready for asynchronous activation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-01 09:42:02 +02:00
Yi Wang	dbeb922882	doc: add vm coredump support Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	8b585b96c1	vmm: enable coredump Based on the newly added guest_debug feature, this patch adds http endpoint support. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	ccb604e1e1	vmm: add cpu segment note for coredump The crash tool use a special note segment which named 'QEMU' to analyze kaslr info and so on. If we don't add the 'QEMU' note segment, crash tool can't find linux version to move on. For now, the most convenient way is to add 'QEMU' note segment to make crash tool happy. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn>	2022-05-30 13:41:40 +02:00
Yi Wang	0e65ca4a6c	vmm: save guest memory for coredump Guest memory is needed for analysis in crash tool, so save it for coredump. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	7e280b6f70	vmm: save elf header for coredump The vmcore file of guest is an elf format, so the first step of coredump is to save the elf header. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn>	2022-05-30 13:41:40 +02:00
Yi Wang	90034fd6ba	vmm: add GuestDebuggable trait It's useful to dump the guest, which named coredump so that crash tool can be used to analysize it when guest hung up. Let's add GuestDebuggable trait and Coredumpxxx error to support coredump firstly. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Rob Bradford	465db7f08c	vmm: config: Remove mergeable option from PmemConfig Fixes: #3968 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:48:49 +02:00
Rob Bradford	55c5961f43	vmm: config: Remove dax & cache_size options from FsConfig Fixes: #3889 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Rob Bradford	7c3582b4a8	vmm: config: Fix error message regarding use of cache size without dax The error message incorrectly said that the user was trying to combine cache_size without dax whereas it is only usuable with dax. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Rob Bradford	979797786d	vmm: Remove DAX cache setup for virtio-fs devices Remove the code from the DeviceManager that prepares the DAX cache since the functionality has now been removed. Fixes: #3889 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Michael Zhao	0fd6521759	aarch64: Avoid depending on `layout` in GIC code Removing the dependency on `layout` helps moving GIC code into `hypervisor` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-05-27 10:57:50 +08:00
Michael Zhao	3fe20cc09a	aarch64: Remove `GicDevice` trait `GicDevice` trait was defined for the common part of GicV3 and ITS. Now that the standalone GicV3 do not exist, `GicDevice` is not needed. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-05-27 10:57:50 +08:00
Rob Bradford	fa07d83565	Revert "virtio-devices, vmm: Optimised async virtio device activation" This reverts commit `f160572f9d`. There has been increased flakiness around the live migration tests since this was merged. Speculatively reverting to see if there is increased stability. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-21 21:27:33 +01:00
Rob Bradford	f160572f9d	virtio-devices, vmm: Optimised async virtio device activation In order to ensure that the virtio device thread is spawned from the vmm thread we use an asynchronous activation mechanism for the virtio devices. This change optimises that code so that we do not need to iterate through all virtio devices on the platform in order to find the one that requires activation. We solve this by creating a separate short lived VirtioPciDeviceActivator that holds the required state for the activation (e.g. the clones of the queues) this can then be stored onto the device manager ready for asynchronous activation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-20 17:07:13 +01:00
Sebastien Boeuf	49db713124	virtio-devices, vmm: Remove unused macro rules Latest cargo beta version raises warnings about unused macro rules. Simply remove them to fix the beta build. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-20 09:59:43 +01:00

1 2 3 4 5 ...

1680 Commits