cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-10-18 19:09:15 +00:00

Author	SHA1	Message	Date
Rob Bradford	e37ec26ccf	vmm: Remove PCI PIO optimisation This optimisation provided some peformance improvement when measured by perf however when considered in terms of boot time peformance this optimisation doesn't have any impact measurable using our peformance-metrics tooling. Removing this optimisation helps simplify the VMM internals as it allows the reordering of the VM creation process permitting refactoring of the restore code path. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-11-22 19:47:53 +00:00
Wei Liu	d05586f520	vmm: modify or provide safety comments Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-11-18 12:50:01 +00:00
Praveen K Paladugu	09e79a5e9b	vmm: Add tpm device to mmio bus Add tpm device to mmio bus if appropriate cmdline arguments were passed. Signed-off-by: Praveen K Paladugu <prapal@linux.microsoft.com>	2022-11-15 16:42:21 +00:00
Praveen K Paladugu	af261f231c	vmm: Add required acpi entries for vtpm device Add an TPM2 entry to DSDT ACPI table. Add a TPM2 table to guest's ACPI. Signed-off-by: Praveen K Paladugu <prapal@linux.microsoft.com> Co-authored-by: Sean Yoo <t-seanyoo@microsoft.com>	2022-11-15 16:42:21 +00:00
Bo Chen	a9ec0f33c0	misc: Fix clippy issues Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-11-02 09:41:43 +01:00
Jinrong Liang	cb171d4a23	device_manager: Avoid checking io_uring support when it's not needed After testing, io_uring_is_supported() causes about 38ms of overhead when creating virtio-blk. By modifying the position of io_uring_is_supported(), the overhead of creating virtio-blk is reduced to less than 1ms when we close io_uring. Signed-off-by: Jinrong Liang <cloudliang@tencent.com>	2022-10-27 22:21:51 -07:00
Sebastien Boeuf	1f0e5eb66a	vmm: virtio-devices: Restore every VirtioDevice upon creation Following the new design proposal to improve the restore codepath when migrating a VM, all virtio devices are supplied with an optional state they can use to restore from. The restore() implementation every device was providing has been removed in order to prevent from going through the restoration twice. Here is the list of devices now following the new restore design: - Block (virtio-block) - Net (virtio-net) - Rng (virtio-rng) - Fs (vhost-user-fs) - Blk (vhost-user-block) - Net (vhost-user-net) - Pmem (virtio-pmem) - Vsock (virtio-vsock) - Mem (virtio-mem) - Balloon (virtio-balloon) - Watchdog (virtio-watchdog) - Vdpa (vDPA) - Console (virtio-console) - Iommu (virtio-iommu) Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-10-24 14:17:08 +02:00
Sebastien Boeuf	099cdd2af8	virtio-devices, vmm: vdpa: Implement live migration support Vdpa now implements the Migratable trait, which allows the device to be added to the DeviceTree and therefore allows live migrating any vDPA device that supports being suspended. Given a vDPA device can't be resumed from a suspended state without having to reset everything, we don't support pause/resume for a vDPA device, as well as snapshot/restore (which requires resume to be supported). In order for the migration to work locally, reusing the same device on the same host machine, the vhost-vdpa handler is dropped after the snapshot has been performed, which allows the destination VM to open the device without any conflict about the device being busy. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-10-13 10:03:23 +02:00
Rob Bradford	1bc63e7848	vmm: Remove legacy I/O ports for ACPI These addresses have been superseded and replaced with other I/O ports. Fixes: #4483 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-10-03 17:08:57 +01:00
Sebastien Boeuf	903c08f8a1	net: Don't override default TAP interface MTU Adjust MTU logic such that: 1. Apply an MTU to the TAP interface if the user supplies it 2. Always query the TAP interface for the MTU and expose that. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-09-27 10:37:35 +01:00
Rob Bradford	b2d1dd65f3	build: Remove "fwdebug" and "common" feature flags This simplifes the buld and checks with very little overhead and the fwdebug device is I/O port device on 0x402 that can be used by edk2 as a very simple character device. See: #4679 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-09-26 10:16:33 -07:00
Rob Bradford	1202b9a07a	vmm: Add some tracing of boot sequence Add tracing of the VM boot sequence from the point at which the request to create a VM is received to the hand-off to the vCPU threads running. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-09-22 18:09:31 +01:00
Sebastien Boeuf	76dbf85b79	net: Give the user the ability to set MTU Add a new "mtu" parameter to the NetConfig structure and therefore to the --net option. This allows Cloud Hypervisor's users to define the Maximum Transmission Unit (MTU) they want to use for the network interface that they create. In details, there are two main aspects. On the one hand, the TAP interface is created with the proper MTU if it is provided. And on the other hand the guest is made aware of the MTU through the VIRTIO configuration. That means the MTU is properly set on both the TAP on the host and the network interface in the guest. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-09-21 16:20:57 +02:00
Sebastien Boeuf	f38056fc9e	virtio-devices, vmm: Simplify virtio-mem resize operation There's no need to delegate the resize operation to the virtio-mem thread. This can come directly from the vmm thread which will use the Mem object to update the VIRTIO configuration and trigger the interrupt for the guest to be notified. In order to achieve what's described above, the VirtioMemZone structure now has a handle onto the Mem object directly. This avoids the need for intermediate Resize and ResizeSender structures. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-09-20 13:43:40 +02:00
Nuno Das Neves	784a3aaf3c	devices: gic: use VgicConfig everywhere Use VgicConfig to initialize Vgic. Use Gic::create_default_config everywhere so we don't always recompute redist/msi registers. Add a helper create_test_vgic_config for tests in hypervisor crate. Signed-off-by: Nuno Das Neves <nudasnev@microsoft.com>	2022-08-31 08:33:05 +01:00
Michael Zhao	b65639fad3	vmm:AArch64: move uefi_flash to memory manager uefi_flash is used when load firmware, that is load payload depends on device manager. move uefi_flash to memory manager can eliminate the dependency. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com> Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-08-31 08:32:08 +01:00
Sebastien Boeuf	8c02648ac9	vmm: device_manager: Update virtio-console for proper PTY support Given the virtio-console is now able to buffer its output when no PTY is connected on the other end, the device manager code is updated to enable this. Moving the endpoint type from FilePair to PtyPair enables the proper codepath in the virtio-console implementation, as well as updating the PTY resize code, and forcing the PTY to always be non-blocking. The non-blocking behavior is required to avoid blocking the guest that would be waiting on the virtio-console driver. When receiving an EWOULDBLOCK error, the output will simply be redirected to the temporary buffer so that it can be later flushed. The PTY resize logic has been slightly modified to ensure the PTY file descriptors are closed. It avoids the child process to keep a hold onto the PTY device, which would have caused the PTY to believe something is connected on the other end, which would have prevented the detection of any new connection on the PTY. Fixes #4521 Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-08-30 13:47:51 +02:00
Sebastien Boeuf	cdcd4d259e	vmm: serial: Wait for PTY to be available before writing to it The goal of this patch is to provide a reliable way to detect when the other end of the PTY is connected, and therefore be able to identify when we can write to the PTY device. This is needed because writing to the PTY device when the other end isn't connected causes the loss of the written bytes. The way to detect the connection on the other end of the PTY is by knowing the other end is disconnected at first with the presence of the EPOLLHUP event. Later on, when the connection happens, EPOLLHUP is not triggered anymore, and that's when we can assume it's okay to write to the PTY main device. It's important to note we had to ensure the file descriptor for the other end was closed, otherwise we would have never seen the EPOLLHUP event. And we did so by removing the "sub" field from the PtyPair structure as it was keeping the associated File opened. Fixes #3170 Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-08-19 14:39:06 +01:00
Sebastien Boeuf	98f949d35d	vmm: Add new I/O ports for ACPI shutdown and PM timer devices Adding new I/O ports for both the ACPI shutdown and the ACPI PM timer devices so they can be triggered from both addresses. The reason for this change is that TDX expects only certain I/O ports to be enabled based on what QEMU exposes. We follow this to avoid new ports from being opened exclusively for Cloud Hypervisor. We have to keep the former I/O ports available given all firmwares haven't been updated yet. Once we reach a point where we know both Rust Hypervisor Firmware, OVMF, TDVF and TDSHIM have been updated with the new port values, we'll be able to remove the former ports. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-08-11 11:46:09 +01:00
Rob Bradford	2e8eb96ef6	vmm: device_manager: Store ACPI platform addresses for later use These are ready for inclusion in the FACP table. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-07-25 16:16:06 +01:00
Wei Liu	ad33f7c5e6	vmm: return seccomp rules according to hypervisors That requires stashing the hypervisor type into various places. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-22 12:50:12 +01:00
Wei Liu	a96a5d7816	hypervisor, vmm: use new vfio-ioctls Use the new vfio-ioctls APIs. Drop Cloud Hypervisor's Device trait since it is no longer needed. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-21 23:37:53 +01:00
Wei Liu	0e8769d76a	device_manager: assert passthrough_device has the correct type There is a lot of unsafe code in such a small function. Add an assert to help detect issues earlier. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-14 08:09:50 +01:00
Sebastien Boeuf	81ba70a497	pci, vmm: Defer mapping VFIO MMIO regions on restore When restoring a VM, the restore codepath will take care of mapping the MMIO regions based on the information from the snapshot, rather than having the mapping being performed during device creation. When the device is created, information such as which BARs contain the MSI-X tables are missing, preventing to perform the mapping of the MMIO regions. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-09 09:19:58 +02:00
Sebastien Boeuf	7df7061610	pci, vmm: Add migratable support to vfio-user devices Based on recent changes to VfioUserPciDevice, the vfio-user devices can now be migrated. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-09 09:19:58 +02:00
Sebastien Boeuf	c021dda267	pci, vmm: Add migratable support to VFIO devices Based on recent changes to VfioPciDevice, the VFIO devices can now be migrated. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-06-09 09:19:58 +02:00
Michael Zhao	957d3a7443	aarch64: Simplify GIC related structs definition Combined the `GicDevice` struct in `arch` crate and the `Gic` struct in `devices` crate. After moving the KVM specific code for GIC in `arch`, a very thin wapper layer `GicDevice` was left in `arch` crate. It is easy to combine it with the `Gic` in `devices` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	04949755c0	arch: Switch to new GIC interface Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Rob Bradford	ade3a9c8f6	virtio-devices, vmm: Optimised async virtio device activation In order to ensure that the virtio device thread is spawned from the vmm thread we use an asynchronous activation mechanism for the virtio devices. This change optimises that code so that we do not need to iterate through all virtio devices on the platform in order to find the one that requires activation. We solve this by creating a separate short lived VirtioPciDeviceActivator that holds the required state for the activation (e.g. the clones of the queues) this can then be stored onto the device manager ready for asynchronous activation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-01 09:42:02 +02:00
Rob Bradford	979797786d	vmm: Remove DAX cache setup for virtio-fs devices Remove the code from the DeviceManager that prepares the DAX cache since the functionality has now been removed. Fixes: #3889 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Michael Zhao	3fe20cc09a	aarch64: Remove `GicDevice` trait `GicDevice` trait was defined for the common part of GicV3 and ITS. Now that the standalone GicV3 do not exist, `GicDevice` is not needed. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-05-27 10:57:50 +08:00
Rob Bradford	fa07d83565	Revert "virtio-devices, vmm: Optimised async virtio device activation" This reverts commit `f160572f9d`. There has been increased flakiness around the live migration tests since this was merged. Speculatively reverting to see if there is increased stability. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-21 21:27:33 +01:00
Rob Bradford	f160572f9d	virtio-devices, vmm: Optimised async virtio device activation In order to ensure that the virtio device thread is spawned from the vmm thread we use an asynchronous activation mechanism for the virtio devices. This change optimises that code so that we do not need to iterate through all virtio devices on the platform in order to find the one that requires activation. We solve this by creating a separate short lived VirtioPciDeviceActivator that holds the required state for the activation (e.g. the clones of the queues) this can then be stored onto the device manager ready for asynchronous activation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-20 17:07:13 +01:00
Maksym Pavlenko	3a0429c998	cargo: Clean up serde dependencies There is no need to include serde_derive separately, as it can be specified as serde feature instead. Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2022-05-18 08:21:19 +02:00
Rob Bradford	d3f66f8702	hypervisor: Make vm module private And thus only export what is necessary through a `pub use`. This is consistent with some of the other modules and makes it easier to understand what the external interface of the hypervisor crate is. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-13 15:39:22 +02:00
Rob Bradford	b1bd87df19	vmm: Simplify MsiInterruptManager generics By taking advantage of the fact that IrqRoutingEntry is exported by the hypervisor crate (that is typedef'ed to the hypervisor specific version) then the code for handling the MsiInterruptManager can be simplified. This is particularly useful if in this future it is not a typedef but rather a wrapper type. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-11 11:19:14 +01:00
Rob Bradford	c2c813599d	vmm: Don't use kvm_ioctls directly The IoEventAddress is re-exported through the crate at the top-level. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-10 15:57:43 +01:00
Sebastien Boeuf	058a61148c	vmm: Factorize net creation Since both Net and vhost_user::Net implement the Migratable trait, we can factorize the common part to simplify the code related to the net creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-05 13:08:41 +02:00
Sebastien Boeuf	425902b296	vmm: Factorize disk creation Since both Block and vhost_user::Blk implement the Migratable trait, we can factorize the common part to simplify the code related to the disk creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-05 13:08:41 +02:00
Rob Bradford	707cea2182	vmm, devices: Move logging of 0x80 timestamp to its own device This is a cleaner approach to handling the I/O port write to 0x80. Whilst doing this also use generate the timestamp at the start of the VM creation. For consistency use the same timestamp for the ARM equivalent. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-04 23:02:53 +01:00
Bo Chen	7fe399598d	vmm: device_manager: Map MMIO regions to the guest correctly To correctly map MMIO regions to the guest, we will need to wait for valid MMIO region information which is generated from 'PciDevice::allocate_bars()' (as a part of 'DeviceManager::add_pci_device()'). Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-05-04 13:53:47 +02:00
Rob Bradford	1dfe4eda5c	vmm: Prevent "internal" identifiers being used by user For devices that cannot be named by the user use the "__" prefix to identify them as internal devices. Check that any identifiers provided in the config do not clash with those internal names. This prevents the user from creating a disk such as "__serial" which would then cause a failure in unpredictable manner. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-04 12:34:11 +02:00
Sebastien Boeuf	6e101f479c	vmm: Ensure hotplugged device identifier is unique Whenever a device (virtio, vfio, vfio-user or vdpa) is hotplugged, we must verify the provided identifier is unique, otherwise we must return an error. Particularly, this will prevent issues with identifiers for serial, console, IOAPIC, balloon, rng, watchdog, iommu and gpio since all of these are hardcoded by the VMM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-03 18:34:24 +01:00
Rob Bradford	6d4862245d	vmm: Generate event when device is removed The new event contains the BDF and the device id: { "timestamp": { "secs": 2, "nanos": 731073396 }, "source": "vm", "event": "device-removed", "properties": { "bdf": "0000:00:02.0", "id": "test-disk" } } Fixes: #4038 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-03 17:10:36 +02:00
Sebastien Boeuf	677c8831af	vmm: Ensure uniqueness of generated identifiers The device identifiers generated from the DeviceManager were not guaranteed to be unique since they were not taking the list of identifiers provided through the configuration. By returning the list of unique identifiers from the configuration, and by providing it to the DeviceManager, the generation of new identifiers can rely both on the DeviceTree and the list of IDs from the configuration. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-02 13:26:15 +02:00
Rob Bradford	f1276c58d2	vmm: Commandline inject from devices is aarch64 specific This is not required for x86_64 and maintains a tight coupling between kernel loading and the DeviceManager. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	da33eb5e8c	vmm: device_manager: Remove extra whitespace lines These originated from the removal of the acpi feature gate. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Sebastien Boeuf	eb6daa2fc3	pci: Store MSI interrupt manager in VfioCommon Extend VfioCommon structure to own the MSI interrupt manager. This will be useful for implementing the restore code path. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-22 16:16:48 +02:00
Sebastien Boeuf	11e9f43305	vmm: Use new Resource type PciBar Instead of defining some very generic resources as PioAddressRange or MmioAddressRange for each PCI BAR, let's move to the new Resource type PciBar in order to make things clearer. This allows the code for being more readable, but also removes the need for hard assumptions about the MMIO and PIO ranges. PioAddressRange and MmioAddressRange types can be used to describe everything except PCI BARs. BARs are very special as they can be relocated and have special information we want to carry along with them. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-19 12:54:09 -07:00
Sebastien Boeuf	89218b6d1e	pci: Replace BAR tuple with PciBarConfiguration In order to make the code more consistent and easier to read, we remove the former tuple that was used to describe a BAR, replacing it with the existing structure PciBarConfiguration. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-19 12:54:09 -07:00

1 2 3 4 5 ...

627 Commits