cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-12-29 00:55:18 +00:00

Author	SHA1	Message	Date
Michael Zhao	5d45d6d0fb	vmm: Move GIC unit test to `hypervisor` crate Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	957d3a7443	aarch64: Simplify GIC related structs definition Combined the `GicDevice` struct in `arch` crate and the `Gic` struct in `devices` crate. After moving the KVM specific code for GIC in `arch`, a very thin wapper layer `GicDevice` was left in `arch` crate. It is easy to combine it with the `Gic` in `devices` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	04949755c0	arch: Switch to new GIC interface Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Rob Bradford	ade3a9c8f6	virtio-devices, vmm: Optimised async virtio device activation In order to ensure that the virtio device thread is spawned from the vmm thread we use an asynchronous activation mechanism for the virtio devices. This change optimises that code so that we do not need to iterate through all virtio devices on the platform in order to find the one that requires activation. We solve this by creating a separate short lived VirtioPciDeviceActivator that holds the required state for the activation (e.g. the clones of the queues) this can then be stored onto the device manager ready for asynchronous activation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-01 09:42:02 +02:00
Yi Wang	dbeb922882	doc: add vm coredump support Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	8b585b96c1	vmm: enable coredump Based on the newly added guest_debug feature, this patch adds http endpoint support. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	ccb604e1e1	vmm: add cpu segment note for coredump The crash tool use a special note segment which named 'QEMU' to analyze kaslr info and so on. If we don't add the 'QEMU' note segment, crash tool can't find linux version to move on. For now, the most convenient way is to add 'QEMU' note segment to make crash tool happy. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn>	2022-05-30 13:41:40 +02:00
Yi Wang	0e65ca4a6c	vmm: save guest memory for coredump Guest memory is needed for analysis in crash tool, so save it for coredump. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	7e280b6f70	vmm: save elf header for coredump The vmcore file of guest is an elf format, so the first step of coredump is to save the elf header. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn>	2022-05-30 13:41:40 +02:00
Yi Wang	90034fd6ba	vmm: add GuestDebuggable trait It's useful to dump the guest, which named coredump so that crash tool can be used to analysize it when guest hung up. Let's add GuestDebuggable trait and Coredumpxxx error to support coredump firstly. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Rob Bradford	465db7f08c	vmm: config: Remove mergeable option from PmemConfig Fixes: #3968 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:48:49 +02:00
Rob Bradford	55c5961f43	vmm: config: Remove dax & cache_size options from FsConfig Fixes: #3889 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Rob Bradford	7c3582b4a8	vmm: config: Fix error message regarding use of cache size without dax The error message incorrectly said that the user was trying to combine cache_size without dax whereas it is only usuable with dax. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Rob Bradford	979797786d	vmm: Remove DAX cache setup for virtio-fs devices Remove the code from the DeviceManager that prepares the DAX cache since the functionality has now been removed. Fixes: #3889 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-27 09:47:13 +02:00
Michael Zhao	0fd6521759	aarch64: Avoid depending on `layout` in GIC code Removing the dependency on `layout` helps moving GIC code into `hypervisor` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-05-27 10:57:50 +08:00
Michael Zhao	3fe20cc09a	aarch64: Remove `GicDevice` trait `GicDevice` trait was defined for the common part of GicV3 and ITS. Now that the standalone GicV3 do not exist, `GicDevice` is not needed. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-05-27 10:57:50 +08:00
Rob Bradford	fa07d83565	Revert "virtio-devices, vmm: Optimised async virtio device activation" This reverts commit `f160572f9d`. There has been increased flakiness around the live migration tests since this was merged. Speculatively reverting to see if there is increased stability. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-21 21:27:33 +01:00
Rob Bradford	f160572f9d	virtio-devices, vmm: Optimised async virtio device activation In order to ensure that the virtio device thread is spawned from the vmm thread we use an asynchronous activation mechanism for the virtio devices. This change optimises that code so that we do not need to iterate through all virtio devices on the platform in order to find the one that requires activation. We solve this by creating a separate short lived VirtioPciDeviceActivator that holds the required state for the activation (e.g. the clones of the queues) this can then be stored onto the device manager ready for asynchronous activation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-20 17:07:13 +01:00
Sebastien Boeuf	49db713124	virtio-devices, vmm: Remove unused macro rules Latest cargo beta version raises warnings about unused macro rules. Simply remove them to fix the beta build. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-20 09:59:43 +01:00
Maksym Pavlenko	3a0429c998	cargo: Clean up serde dependencies There is no need to include serde_derive separately, as it can be specified as serde feature instead. Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2022-05-18 08:21:19 +02:00
Rob Bradford	16a9882153	vmm: cpu: tdx: Don't use fd suffix for something not an FD The hypervisor::Vcpu is the abstraction over the fd. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-13 15:39:22 +02:00
Rob Bradford	218be2642e	hypervisor: Explicitly `pub use` at the hypervisor crate top-level Explicitly re-export types from the hypervisor specific modules. This makes it much clearer what the common functionality that is exposed is. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-13 15:39:22 +02:00
Rob Bradford	cd0df05808	vmm, arch: CpuId is x86_64 specific so import from the x86_64 module It will be removed as a top-level export from the hypervisor crate. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-13 15:39:22 +02:00
Rob Bradford	d3f66f8702	hypervisor: Make vm module private And thus only export what is necessary through a `pub use`. This is consistent with some of the other modules and makes it easier to understand what the external interface of the hypervisor crate is. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-13 15:39:22 +02:00
Rob Bradford	b1bd87df19	vmm: Simplify MsiInterruptManager generics By taking advantage of the fact that IrqRoutingEntry is exported by the hypervisor crate (that is typedef'ed to the hypervisor specific version) then the code for handling the MsiInterruptManager can be simplified. This is particularly useful if in this future it is not a typedef but rather a wrapper type. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-11 11:19:14 +01:00
Rob Bradford	3f9e8d676a	hypervisor: Move creation of irq routing struct to hypervisor crate This removes the requirement to leak as many datastructures from the hypervisor crate into the vmm crate. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-11 11:19:14 +01:00
Rob Bradford	c2c813599d	vmm: Don't use kvm_ioctls directly The IoEventAddress is re-exported through the crate at the top-level. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-10 15:57:43 +01:00
Rob Bradford	387d56879b	vmm, hypervisor: Clean up nomenclature around offloading VM operations The trait and functionality is about operations on the VM rather than the VMM so should be named appropriately. This clashed with with existing struct for the concrete implementation that was renamed appropriately. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-10 13:10:01 +01:00
Sebastien Boeuf	5f722d0d3f	vmm: Fix loading RAW firmware Whenever going through the codepath of loading a RAW firmware, we always add an extra RAM region to the guest memory through the memory manager. But we must be careful to use the updated guest memory rather than a previous reference that wasn't containing the new region, as this can lead to the following error: VmBoot(FirmwareLoad(InvalidGuestAddress(GuestAddress(4290772992)))) This is fixed by the current patch, getting the latest reference onto the guest memory from the memory manager right after the new region has been added. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-06 18:13:28 +02:00
Bo Chen	42c19e14c5	vmm: Add 'shutdown()' to vCPU seccomp filter This is required when hot-removing a vfio-user device. Details code path below: Thread 6 "vcpu0" received signal SIGSYS, Bad system call. [Switching to Thread 0x7f8196889700 (LWP 2358305)] 0x00007f8196dae7ab in shutdown () at ../sysdeps/unix/syscall-template.S:78 78 T_PSEUDO (SYSCALL_SYMBOL, SYSCALL_NAME, SYSCALL_NARGS) (gdb) bt 0x00007f8196dae7ab in shutdown () at ../sysdeps/unix/syscall-template.S:78 0x000056189240737d in std::sys::unix::net::Socket::shutdown () at library/std/src/sys/unix/net.rs:383 std::os::unix::net::stream::UnixStream::shutdown () at library/std/src/os/unix/net/stream.rs:479 0x000056189210e23d in vfio_user::Client::shutdown (self=0x7f8190014300) at vfio_user/src/lib.rs:787 0x00005618920b9d02 in <pci::vfio_user::VfioUserPciDevice as core::ops::drop::Drop>::drop ( self=0x7f819002d7c0) at pci/src/vfio_user.rs:551 0x00005618920b8787 in core::ptr::drop_in_place<pci::vfio_user::VfioUserPciDevice> () at /rustc/7737e0b5c4103216d6fd8cf941b7ab9bdbaace7c/library/core/src/ptr/mod.rs:188 0x00005618920b92e3 in core::ptr::drop_in_place<core::cell::UnsafeCell<dyn pci::device::PciDevice>> () at /rustc/7737e0b5c4103216d6fd8cf941b7ab9bdbaace7c/library/core/src/ptr/mod.rs:188 0x00005618920b9362 in core::ptr::drop_in_place<std::sync::mutex::Mutex<dyn pci::device::PciDevice>> () at /rustc/7737e0b5c4103216d6fd8cf941b7ab9bdbaace7c/library/core/src/ptr/mod.rs:188 0x00005618920d8a3e in alloc::sync::Arc<T>::drop_slow (self=0x7f81968852b8) at /rustc/7737e0b5c4103216d6fd8cf941b7ab9bdbaace7c/library/alloc/src/sync.rs:1092 0x00005618920ba273 in <alloc::sync::Arc<T> as core::ops::drop::Drop>::drop (self=0x7f81968852b8) at /rustc/7737e0b5c4103216d6fd8cf941b7ab9bdbaace7c/library/alloc/src/sync.rs:1688 0x00005618920b76fb in core::ptr::drop_in_place<alloc::sync::Arc<std::sync::mutex::Mutex<dyn pci::device::PciDevice>>> () at /rustc/7737e0b5c4103216d6fd8cf941b7ab9bdbaace7c/library/core/src/ptr/mod.rs:188 0x0000561891b5e47d in vmm::device_manager::DeviceManager::eject_device (self=0x7f8190009600, pci_segment_id=0, device_id=3) at vmm/src/device_manager.rs:4000 0x0000561891b674bc in <vmm::device_manager::DeviceManager as vm_device:🚌:BusDevice>::write ( self=0x7f8190009600, base=70368744108032, offset=8, data=&[u8](size=4) = {...}) at vmm/src/device_manager.rs:4625 0x00005618921927d5 in vm_device:🚌:Bus::write (self=0x7f8190006e00, addr=70368744108040, data=&[u8](size=4) = {...}) at vm-device/src/bus.rs:235 0x0000561891b72e10 in <vmm::vm::VmOps as hypervisor::vm::VmmOps>::mmio_write ( self=0x7f81900097b0, gpa=70368744108040, data=&[u8](size=4) = {...}) at vmm/src/vm.rs:378 0x0000561892133ae2 in <hypervisor::kvm::KvmVcpu as hypervisor::cpu::Vcpu>::run ( self=0x7f8190013c90) at hypervisor/src/kvm/mod.rs:1114 0x0000561891914e85 in vmm::cpu::Vcpu::run (self=0x7f819001b230) at vmm/src/cpu.rs:348 0x000056189189f2cb in vmm::cpu::CpuManager::start_vcpu::{{closure}}::{{closure}} () at vmm/src/cpu.rs:953 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-05-05 15:33:26 -07:00
Sebastien Boeuf	058a61148c	vmm: Factorize net creation Since both Net and vhost_user::Net implement the Migratable trait, we can factorize the common part to simplify the code related to the net creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-05 13:08:41 +02:00
Sebastien Boeuf	425902b296	vmm: Factorize disk creation Since both Block and vhost_user::Blk implement the Migratable trait, we can factorize the common part to simplify the code related to the disk creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-05 13:08:41 +02:00
Sebastien Boeuf	54f39aa8cb	vmm: Validate vhost-user-block/net are not configured with iommu=on Extend the validate() function for both DiskConfig and NetConfig so that we return an error if a vhost-user-block or vhost-user-net device is expected to be placed behind the virtual IOMMU. Since these devices don't support this feature, we can't allow iommu to be set to true in these cases. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-05 13:08:41 +02:00
Rob Bradford	707cea2182	vmm, devices: Move logging of 0x80 timestamp to its own device This is a cleaner approach to handling the I/O port write to 0x80. Whilst doing this also use generate the timestamp at the start of the VM creation. For consistency use the same timestamp for the ARM equivalent. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-04 23:02:53 +01:00
Rob Bradford	c47e3b8689	gdb: Do not use VmmOps for memory manipulation We don't use the VmmOps trait directly for manipulating memory in the core of the VMM as it's really designed for the MSHV crate to handle instruction decoding. As I plan to make this trait MSHV specific to allow reduced locking for MMIO and PIO handling when running on KVM this use should be removed. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-04 11:33:02 -07:00
Bo Chen	7fe399598d	vmm: device_manager: Map MMIO regions to the guest correctly To correctly map MMIO regions to the guest, we will need to wait for valid MMIO region information which is generated from 'PciDevice::allocate_bars()' (as a part of 'DeviceManager::add_pci_device()'). Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-05-04 13:53:47 +02:00
Rob Bradford	1dfe4eda5c	vmm: Prevent "internal" identifiers being used by user For devices that cannot be named by the user use the "__" prefix to identify them as internal devices. Check that any identifiers provided in the config do not clash with those internal names. This prevents the user from creating a disk such as "__serial" which would then cause a failure in unpredictable manner. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-04 12:34:11 +02:00
Sebastien Boeuf	6e101f479c	vmm: Ensure hotplugged device identifier is unique Whenever a device (virtio, vfio, vfio-user or vdpa) is hotplugged, we must verify the provided identifier is unique, otherwise we must return an error. Particularly, this will prevent issues with identifiers for serial, console, IOAPIC, balloon, rng, watchdog, iommu and gpio since all of these are hardcoded by the VMM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-03 18:34:24 +01:00
Rob Bradford	6d4862245d	vmm: Generate event when device is removed The new event contains the BDF and the device id: { "timestamp": { "secs": 2, "nanos": 731073396 }, "source": "vm", "event": "device-removed", "properties": { "bdf": "0000:00:02.0", "id": "test-disk" } } Fixes: #4038 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-03 17:10:36 +02:00
Sebastien Boeuf	a5a2e591c9	vmm: Remove FsConfig from VmConfig when unplugging fs device All hotpluggable devices were properly removed from the VmConfig when a remove-device command was issued, except for the "fs" type. Fix this lack of support as it is causing the integration tests to fail with the recent addition of verifying that identifiers are unique. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-02 13:26:15 +02:00
Sebastien Boeuf	677c8831af	vmm: Ensure uniqueness of generated identifiers The device identifiers generated from the DeviceManager were not guaranteed to be unique since they were not taking the list of identifiers provided through the configuration. By returning the list of unique identifiers from the configuration, and by providing it to the DeviceManager, the generation of new identifiers can rely both on the DeviceTree and the list of IDs from the configuration. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-02 13:26:15 +02:00
Sebastien Boeuf	634c53ea50	vmm: config: Validate provided identifiers are unique A valid configuration means we can only accept unique identifiers from the user. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-02 13:26:15 +02:00
LiHui	ec0c1b01c4	vmm: api: Do not delete the API socket on API server creation The socket will safely deleted on shutdown and so it is not necessary to delete the API socket when starting the HTTP server. Fixes: #4026 Signed-off-by: LiHui <andrewli@kubesphere.io> Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 18:40:49 +01:00
Rob Bradford	f17aa3755f	vmm: Add clarifying comment about Vm::entry_point() Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	744a049007	vmm: Parallelise functionality with kernel loading Move fuctionality earlier in the boot so as to run in parallel with the loading of the kernel. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	e70bd069b3	vmm: Load kernel asynchronously Start loading the kernel as possible in the VM in a separate thread. Whilst it is loading other work can be carried out such as initialising the devices. The biggest performance improvement is seen with a more complex set of devices. If using e.g. four virtio-net devices then the time to start the kernel improves by 20-30ms. With the simplest configuration the improvement was of the order of 2-3ms. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	bfeb3120f5	vmm: Refactor kernel loading to decouple from Vm struct This will allow the kernel to be loaded from another thread. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	ce6d88d187	vmm: Merge aarch64 use statements These were in their own block and not organised lexically. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	56fe4c61af	vmm: Duplicate Vm::entry_point() across architectures These will have very different implementations when asynchronously loading the kernel. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	1d1a087fc5	vmm: Refactor kernel command line generation This allows the same code for generating the kernel command line to be used on both aarch64 and x86_64 when the latter starts loading the kernel in asynchronously. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	f1276c58d2	vmm: Commandline inject from devices is aarch64 specific This is not required for x86_64 and maintains a tight coupling between kernel loading and the DeviceManager. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Rob Bradford	da33eb5e8c	vmm: device_manager: Remove extra whitespace lines These originated from the removal of the acpi feature gate. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-29 11:03:38 +01:00
Fabiano Fidêncio	fdeb4f7c46	Revert "vmm, openapi: Token Bucket fields should be uint64" This reverts commit `87eed369cd`. The reason we're reverting this is that OpenAPI Specification[0] doesn't know how to deal with unsigned types. :-/ Right now the best to do is keep it as it's, as an int64, and try to fix OpenAPI, or even switch to swagger, as the latter knows how to properly deal with those. However, switching to swagger is far from being an 1:1 transition and will require time to experiment, thus reverting this for now seems the best approach. [0]: https://github.com/OAI/OpenAPI-Specification/blob/main/versions/3.1.0.md#data-types Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-28 09:26:38 +02:00
Fabiano Fidêncio	87eed369cd	vmm, openapi: Token Bucket fields should be uint64 The Token Bucket fields are, on the Cloud Hypervisor side, u64. However, we expose those as int64 in the OpenAPI YAML file. With that in mind, let's adjust the yaml file to expose those as uint64. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-04-27 13:16:02 +02:00
Rob Bradford	79f4c2db01	vmm: Enable virtio-iommu in VmConfig::validate() This means that the automatic enabling of the virtio-iommu will also be applied to VMs creates via the API as well as the CLI. Fixes: #4016 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-26 12:27:00 +01:00
Rob Bradford	bf9f79081a	vmm: Only create ACPI memory manager DSDT when resizable If using the ACPI based hotplug only memory can be added so if the hotplug RAM size is the same as the boot RAM size then do not include the memory manager DSDT entries. Also: this change simplifies the code marginally by making the HotplugMethod enum Copyable. This was identified from the following perf output: 1.78% 0.00% vmm cloud-hypervisor [.] <vmm::memory_manager::MemorySlots as acpi_tables::aml::Aml>::append_aml_bytes \| ---<vmm::memory_manager::MemorySlots as acpi_tables::aml::Aml>::append_aml_bytes <vmm::memory_manager::MemorySlot as acpi_tables::aml::Aml>::append_aml_bytes acpi_tables::aml::Name::new <acpi_tables::aml::Path as acpi_tables::aml::Aml>::append_aml_bytes __libc_malloc Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-26 13:07:19 +02:00
Rob Bradford	62f17ccf8c	vmm: Improve error handling for vmm::vm::Error In particular implement thiserror::Error, cleanup wording and remove unused errors. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-22 17:46:41 +01:00
Rob Bradford	cb03540ffd	vmm: config: Derive thiserror::Error No further changes are necessary that adding a #[derive(Error)] as there is a manual implementation of Display. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-22 17:46:41 +01:00
Rob Bradford	0270d697ab	vmm: cpu: Improve Error reporting Remove unused enum members, improve error messages and implement thiserror::Error. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-22 17:46:41 +01:00
Rob Bradford	47529796d0	arch: Improve arch::Error Remove unused error enum entries, improve wording and derive thiserror::Error. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-22 17:46:41 +01:00
Rob Bradford	1c786610b7	vmm: api: Don't use clashing struct name for Error Import vmm::Error as VmmError to allow the use of thiserror::Error to avoid clashing names. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-22 17:46:41 +01:00
Sebastien Boeuf	eb6daa2fc3	pci: Store MSI interrupt manager in VfioCommon Extend VfioCommon structure to own the MSI interrupt manager. This will be useful for implementing the restore code path. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-22 16:16:48 +02:00
Rob Bradford	adb3dcdc13	vmm: openapi: Add serial_number to PlatformConfig Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-21 17:17:08 +02:00
Rob Bradford	e972eb7c74	arch, vmm: Expose platform serial_number via SMBIOS Fixes: #4002 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-21 17:17:08 +02:00
Rob Bradford	203dfdc156	vmm: config: Add "serial_number" option to "--platform" This carries a string that is exposed via DMI/SMBIOS and is particularly useful for cloud-init initialisation. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-21 17:17:08 +02:00
Rob Bradford	4a04d1f8f2	vmm: seccomp: Allow SYS_rseq as required by newer glibc glibc 2.35 as shipped by Fedora 36 now uses the rseq syscall. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-21 13:02:51 +01:00
Rob Bradford	4ca066f077	vmm: api: Simplify error reporting from HTTP to internal API calls Use a single enum member for representing errors from the internal API. This avoids the ugly duplication of the API call name in the error message: e.g. $ target/debug/ch-remote --api-socket /tmp/api resize --cpus 2 Error running command: Server responded with an error: InternalServerError: VmResize(VmResize(CpuManager(DesiredVCpuCountExceedsMax))) Becomes: $ target/debug/ch-remote --api-socket /tmp/api resize --cpus 2 Error running command: Server responded with an error: InternalServerError: ApiError(VmResize(CpuManager(DesiredVCpuCountExceedsMax))) Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-20 19:39:05 +01:00
Sebastien Boeuf	11e9f43305	vmm: Use new Resource type PciBar Instead of defining some very generic resources as PioAddressRange or MmioAddressRange for each PCI BAR, let's move to the new Resource type PciBar in order to make things clearer. This allows the code for being more readable, but also removes the need for hard assumptions about the MMIO and PIO ranges. PioAddressRange and MmioAddressRange types can be used to describe everything except PCI BARs. BARs are very special as they can be relocated and have special information we want to carry along with them. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-19 12:54:09 -07:00
Sebastien Boeuf	89218b6d1e	pci: Replace BAR tuple with PciBarConfiguration In order to make the code more consistent and easier to read, we remove the former tuple that was used to describe a BAR, replacing it with the existing structure PciBarConfiguration. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-19 12:54:09 -07:00
Sebastien Boeuf	1795afadb8	vmm: Factorize algorithm finding HOB memory resources By factorizing the algorithm untangling TDVF sections from guest RAM into a dedicated function, we can write some unit tests to validate it properly achieves what we expect. Adding the "tdx" feature to the unit tests, otherwise it wouldn't get tested. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-19 15:23:12 +02:00
Sebastien Boeuf	5264d545dd	pci, vmm: Extend PciDevice trait to support BAR relocation By adding a new method id() to the PciDevice trait, we allow the caller to retrieve a unique identifier. This is used in the context of BAR relocation to identify the device being relocated, so that we can update the DeviceTree resources for all PCI devices (and not only VirtioPciDevice). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-14 12:11:37 +02:00
Sebastien Boeuf	0c34846ef6	vmm: Return new PCI resources from add_pci_device() By returning the new PCI resources from add_pci_device(), we allow the factorization of the code translating the BARs into resources. This allows VIRTIO, VFIO and vfio-user to add the resources to the DeviceTree node. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-14 12:11:37 +02:00
Sebastien Boeuf	4f172ae4b6	vmm: Retrieve PCI resources for VFIO and vfio-user devices Relying on the function introduced recently to get the PCI resources and handle the restore case, both VFIO and vfio-user device creation paths now have access to PCI resources, which can be provided to the function add_pci_device(). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-14 12:11:37 +02:00
Sebastien Boeuf	0f12fe9b3b	vmm: Factorize retrieval of PCI resources Create a dedicated function for getting the PCI segment, b/d/f and optional resources. This is meant for handling the potential case of a restore. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-14 12:11:37 +02:00
Sebastien Boeuf	6e084572d4	pci, virtio: Make virtio-pci BAR restoration more generic Updating the way of restoring BAR addresses for virtio-pci by providing a more generic approach that will be reused for other PciDevice implementations (i.e VfioPcidevice and VfioUserPciDevice). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-14 12:11:37 +02:00
Rob Bradford	b212f2823d	vmm: Deprecate mergeable option from virtio-pmem KSM would never merge the file backed pages so this option has no effect. See: #3968 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-12 07:12:25 -07:00
Rob Bradford	ed87e42e6f	vm-device, pci, devices: Remove InterruptSourceGroup::{un}mask The calls to these functions are always preceded by a call to InterruptSourceGroup::update(). By adding a masked boolean to that function call it possible to remove 50% of the calls to the KVM_SET_GSI_ROUTING ioctl as the the update will correctly handle the masked or unmasked case. This causes the ioctl to disappear from the perf report for a boot of the VM. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-11 22:56:48 +01:00
Michael Zhao	d1b2a3fca9	aarch64: Add a memory-simulated flash for UEFI EDK2 execution requires a flash device at address 0. The new added device is not a fully functional flash. It doesn't implement any spec of a flash device. Instead, a piece of memory is used to simulate the flash simply. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-11 09:51:34 +01:00
Michael Zhao	298a5580a9	aarch64: Remove unnecessary function definitions This is a refactoring commit to simplify source code. Removed some functions that only return a layout const. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-08 11:08:43 -07:00
Michael Zhao	656425a328	aarch64: Align the data types in layout Some addresses defined in `layout.rs` were of type `GuestAddress`, and are `u64`. Now align the types of all the `*_START` definitions to `GuestAddress`. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-08 11:08:43 -07:00
Michael Zhao	848d88c122	aarch64: Reserve a hole in 32-bit space The reserved space is for devices. Some devices (like TPM) require arbitrary addresses close to 4GiB. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-05 11:04:52 +08:00
Michael Zhao	a3dbc3b415	aarch64: Change `RAM_START` type GuestAddress Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-05 11:04:52 +08:00
Michael Zhao	ef9f37cd5f	aarch64: Rename `RAM_64BIT_START` in layout `RAM_64BIT_START` was set to 1 GiB, not a real 64-bit address. Now rename it `RAM_START` to avoid confusion. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-05 11:04:52 +08:00
Sebastien Boeuf	e76a5969e8	vmm: Add iommu parameter to VdpaConfig Add a new iommu parameter to VdpaConfig in order to place the vDPA device behind a virtual IOMMU. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-05 00:09:52 +02:00
Sebastien Boeuf	00ce8277aa	vmm: tdx: Fix the logic for generating HOB memory resources The list of memory resources provided through the HOB wasn't accurate because of the broken logic. The fix provides correct ranges to the firmware. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-01 18:24:32 +01:00
Sebastien Boeuf	70222ffc1a	vmm: tdx: Only report TempMem as reserved memory Based on latest QEMU patches from branch tdx-qemu-2022.03.29-v7.0.0-rc1 we should only report as memory resources the TempMem sections from TDVF sections. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-04-01 18:24:32 +01:00
Rob Bradford	7fd76eff05	vmm: Don't error if live resizing is not possible The introduction of a error if live resizing is not possible is a regression compared to the original behaviour where the new size would be stored in the config and reflected in the next boot. This behaviour was also inconsistent with the effect of resizing with no VM booted. Instead of generating an error allow the code to go ahead and update the config so that the new size will be available upon the reboot. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-31 17:04:53 +01:00
Bo Chen	eed2a0d06b	vmm: Add 'libc::SYS_shutdown' to vmm 'seccomp' filter list Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-03-31 09:22:07 +01:00
Fabiano Fidêncio	f049867cd9	vmm,memory_manager: Deny resizing only if the ram amount has changed Similarly to the previous commit restricting the cpu resizing error only to the situations where the vcpu amount has changed, let's do the same with the memory and be consistent throughout our code base. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-30 21:29:08 +01:00
Fabiano Fidêncio	2c8045343c	vmm,cpu: Deny resizing only if the vcpu amount has changed `188078467d` made clear that resize should only happen when dealing with a "dynamic" CpuManager. Although this is very much correct, it causes a regression on Kata Containers (and on any other consumer of Cloud Hypervisor) in cases where a resize would be triggered but the vCPUs values wouldn't be changed. There's no doubt Kata Containers could do better and do not call a resize in such situations, and that's something that should also be solved there. However, we should also work this around on Cloud Hypervisor side as it introduces a regression with the current Kata Containers code. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2022-03-30 21:29:08 +01:00
Sebastien Boeuf	3c973fa7ce	virtio-devices: vhost-user: Add support for TDX By enabling the VIRTIO feature VIRTIO_F_IOMMU_PLATFORM for all vhost-user devices when needed, we force the guest to use the DMA API, making these devices compatible with TDX. By using DMA API, the guest triggers the TDX codepath to share some of the guest memory, in particular the virtqueues and associated buffers so that the VMM and vhost-user backends/processes can access this memory. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-03-30 10:32:23 +02:00
Rob Bradford	ca68b9e7a9	build: Remove "cmos" feature gate Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-29 15:20:58 +01:00
Rob Bradford	e0d3efec6e	devices: cmos: Implement CMOS based reset If EFI reset fails on the Linux kernel then it will fallthrough to CMOS reset. Implement this as one of our reset solutions. Fixes: #3912 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-29 15:20:58 +01:00
Rob Bradford	7c0cf8cc23	arch, devices, vmm: Remove "acpi" feature gate Compile this feature in by default as it's well supported on both aarch64 and x86_64 and we only officially support using it (no non-acpi binaries are available.) Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-28 09:18:29 -07:00
William Douglas	6b0df31e5d	vmm: Add support for enabling AMX in vm guests AMX is an x86 extension adding hardware units for matrix operations (int and float dot products). The goal of the extension is to provide performance enhancements for these common operations. On Linux, AMX requires requesting the permission from the kernel prior to use. Guests wanting to make use of the feature need to have the request made prior to starting the vm. This change then adds the first --cpus features option amx that when passed will enable AMX usage for guests (needs a 5.17+ kernel) or exits with failure. The activation is done in the CpuManager of the VMM thread as it allows migration and snapshot/restore to work fairly painlessly for AMX enabled workloads. Signed-off-by: William Douglas <william.douglas@intel.com>	2022-03-25 14:11:54 -07:00
Bo Chen	639a7dd73a	vmm: Improve 'test_config_validation' with precise Err assertions Fixed: #3879 Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-03-25 09:17:05 +00:00
Sebastien Boeuf	afd9f17b73	virtio-fs: Deprecate the DAX feature Disable the DAX feature from the virtio-fs implementation as the feature is still not stable. The feature is deprecated, meaning the 'dax' parameter will be removed in about 2 releases cycles. In the meantime, the parameter value is ignored and forced to be disabled. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-03-24 10:39:11 -07:00
Rob Bradford	7a8061818e	vmm: Don't expose MemoryManager ACPI functionality unless required When running non-dynamic or with virtio-mem for hotplug the ACPI functionality should not be included on the DSDT nor does the MemoryManager need to be placed on the MMIO bus. Fixes: #3883 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-24 13:17:51 +00:00
Rob Bradford	f6dfb42a64	vmm: cpu: Don't place CpuManager on MMIO bus when non-dynamic This is now consistent with not supplying the _CRS for the device when CpuManager is not dynamic. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-24 13:17:39 +00:00
Rob Bradford	bbf7fd5372	vmm: Reject memory resizing on TDX This is similar to the dynamic concept used in CpuManager. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-23 23:15:20 +00:00

1 2 3 4 5 ...

1699 Commits