cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-11-04 19:11:11 +00:00

Author	SHA1	Message	Date
Rob Bradford	9c5be6f660	build: Remove unnecessary Result<> returns If the function can never return an error this is now a clippy failure: error: this function's return value is unnecessarily wrapped by `Result` --> virtio-devices/src/watchdog.rs:215:5 \| 215 \| / fn set_state(&mut self, state: &WatchdogState) -> io::Result<()> { 216 \| \| self.common.avail_features = state.avail_features; 217 \| \| self.common.acked_features = state.acked_features; 218 \| \| // When restoring enable the watchdog if it was previously enabled. We reset the timer ... \| 223 \| \| Ok(()) 224 \| \| } \| \|_____^ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#unnecessary_wraps Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-02-11 18:18:44 +00:00
Sebastien Boeuf	19167e7647	pci: vfio: Implement INTx support With all the preliminary work done in the previous commits, we can update the VFIO implementation to support INTx along with MSI and MSI-X. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-10 17:34:56 +00:00
Sebastien Boeuf	2cdc3e3546	vmm: device_manager: Add PCI routing table to ACPI Here we are adding the PCI routing table, commonly called _PRT, to the ACPI DSDT. For simplification reasons, we chose not to implement PCI links as this involves dynamic decision from the guest OS, which result in lots of complexity both from an AML perspective and from a device manager perspective. That's why the _PRT creates a static list of 32 entries, each assigned with the IRQ number previously reserved by the device manager. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-10 17:34:56 +00:00
Sebastien Boeuf	de9471fc72	vmm: device_manager: Allocate IRQs for PCI devices In order to support INTx for PCI devices, each PCI device must be assigned an IRQ. This is preliminary work to reserve 8 IRQs which will be shared across the 32 PCI devices. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-10 17:34:56 +00:00
Sebastien Boeuf	b5450ca72c	vmm: device_manager: Store legacy interrupt manager In anticipation for accessing the legacy interrupt manager from the function creating a VFIO PCI device, we store it as part of the DeviceManager, to make it available for all methods. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-10 17:34:56 +00:00
Sebastien Boeuf	8008aad545	vmm: device_manager: Don't pass the MSI interrupt manager around The DeviceManager already has a hold onto the MSI interrupt manager, therefore there's no need to pass it through every function. Instead, let's simplify the code by using the attribute from DeviceManager's instance. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-10 17:34:56 +00:00
William Douglas	48963e322a	Enable pty console Add the ability for cloud-hypervisor to create, manage and monitor a pty for serial and/or console I/O from a user. The reasoning for having cloud-hypervisor create the ptys is so that clients, libvirt for example, could exit and later re-open the pty without causing I/O issues. If the clients were responsible for creating the pty, when they exit the main pty fd would close and cause cloud-hypervisor to get I/O errors on writes. Ideally the main and subordinate pty fds would be kept in the main vmm's Vm structure. However, because the device manager owns parsing the configuration for the serial and console devices, the information is instead stored in new fields under the DeviceManager structure directly. From there hooking up the main fd is intended to look as close to handling stdin and stdout on the tty as possible (there is some future work ahead for perhaps moving support for the pty into the vmm_sys_utils crate). The main fd is used for reading user input and writing to output of the Vm device. The subordinate fd is used to setup raw mode and it is kept open in order to avoid I/O errors when clients open and close the pty device. The ability to handle multiple inputs as part of this change is intentional. The current code allows serial and console ptys to be created and both be used as input. There was an implementation gap though with the queue_input_bytes needing to be modified so the pty handlers for serial and console could access the methods on the serial and console structures directly. Without this change only a single input source could be processed as the console would switch based on its input type (this is still valid for tty and isn't otherwise modified). Signed-off-by: William Douglas <william.r.douglas@gmail.com>	2021-02-09 10:03:28 +00:00
Wei Liu	1fc8c9165a	vmm: drop two unused errors Their last users were gone in `af3c6c34c3`. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-02-08 16:15:31 +00:00
Sebastien Boeuf	c397c9c95e	vmm, virtio-devices: mem: Don't use MADV_DONTNEED on hugepages This commit introduces a new information to the VirtioMemZone structure in order to know if the memory zone is backed by hugepages. Based on this new information, the virtio-mem device is now able to determine if madvise(MADV_DONTNEED) should be performed or not. The madvise documentation specifies that MADV_DONTNEED advice will fail if the memory range has been allocated with some hugepages. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Hui Zhu <teawater@antfin.com>	2021-02-04 17:52:30 +00:00
Sebastien Boeuf	f24094392e	virtio-devices: mem: Improve semantic around Resize object By introducing a ResizeSender object, we avoid having a Resize clone with a different content than the original Resize object. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-04 17:52:30 +00:00
Sebastien Boeuf	24c8cce012	block_util: Add synchronous support for fixed VHD disk files Relying on the simplified version of the synchronous support for RAW disk files, the new fixed_vhd_sync module in the block_util crate introduces the synchronous support for fixed VHD disk files. With this patch, the fixed VHD support is complete as it is implemented in both synchronous and asynchronous versions. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-01 13:45:08 +00:00
Sebastien Boeuf	c6854c5a97	block_util: Simplify RAW synchronous implementation Using directly preadv and pwritev, we can simply use a RawFd instead of a file, and we don't need to use the more complex implementation from the qcow crate. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-01 13:45:08 +00:00
Sebastien Boeuf	b2e5dbaecb	block_util, vmm: Add fixed VHD asynchronous implementation This commit adds the asynchronous support for fixed VHD disk files. It introduces FixedVhd as a new ImageType, moving the image type detection to the block_util crate (instead of qcow crate). It creates a new vhd module in the block_util crate in order to handle VHD footer, following the VHD specification. It creates a new fixed_vhd_async module in the block_util crate to implement the asynchronous version of fixed VHD disk file. It relies on io_uring. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-02-01 13:45:08 +00:00
Muminul Islam	5bbf2dca80	vmm: Remove unneeded `return` statement This unneeded return statement giving clippy warnings Signed-off-by: Muminul Islam <muislam@microsoft.com>	2021-01-30 08:58:09 +00:00
Bo Chen	6664e5a6e7	net_util, virtio-devices, vmm: Accept multiple TAP fds This patch enables multi-queue support for creating virtio-net devices by accepting multiple TAP fds, e.g. '--net fds=3:7'. Fixes: #2164 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-01-28 09:11:39 +00:00
Wei Liu	b959849149	vmm: drop unnecessary semicolon Building with 1.51 nightly produces the following warning: warning: unnecessary trailing semicolon --> vmm/src/device_manager.rs:396:6 \| 396 \| }; \| ^ help: remove this semicolon \| = note: `#[warn(redundant_semicolons)]` on by default warning: 1 warning emitted Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-01-27 14:43:20 +00:00
Rob Bradford	76e15a4240	vmm: acpi: Support compiling ACPI code on aarch64 This skeleton commit brings in the support for compiling aarch64 with the "acpi" feature ready to the ACPI enabling. It builds on the work to move the ACPI hotplug devices from I/O ports to MMIO and conditionalises any code that is x86_64 only (i.e. because it uses an I/O port.) Filling in the aarch64 specific details in tables such as the MADT it out of the scope. See: #2178 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-26 15:19:02 +08:00
Sebastien Boeuf	f6c8e4b045	vmm: device_manager: Add info!() message about disk file backend It might be useful debugging information for the user to know what kind of disk file implementation is in use. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-01-22 16:10:34 +00:00
Sebastien Boeuf	2824642e80	virtio-devices: Rename BlockIoUring to Block Now that BlockIoUring is the only implementation of virtio-block, handling both synchronous and asynchronous backends based on the AsyncIo trait, we can rename it to Block. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-01-22 16:10:34 +00:00
Sebastien Boeuf	12e20effd7	block_util: Port synchronous QCOW file to AsyncIo trait Based on the synchronous QCOW file implementation present in the qcow crate, we created a new qcow_sync module in block_util that ports this synchronous implementation to the AsyncIo trait. The point is to reuse virtio-blk asynchronous implementation for both synchronous and asynchronous backends. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-01-22 16:10:34 +00:00
Sebastien Boeuf	9fc86a91e2	block_util: Port synchronous RAW file to AsyncIo trait Based on the synchronous RAW file implementation present in the qcow crate, we created a new raw_sync module in block_util that ports this synchronous implementation to the AsyncIo trait. The point is to reuse virtio-blk asynchronous implementation for both synchronous and asynchronous backends. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-01-22 16:10:34 +00:00
Sebastien Boeuf	da8ce25abf	virtio-devices: Use asynchronous traits for virtio-blk io_uring Based on the new DiskFile and AsyncIo traits, the implementation of asynchronous block support does not have to be tied to io_uring anymore. Instead, the only thing the virtio-blk implementation knows is that it is using an asynchronous implementation of the underlying disk file. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-01-22 16:10:34 +00:00
Rob Bradford	d578c408b7	vmm: acpi: Move DeviceManager ACPI device to an MMIO address Migrate the DeviceManager from a fixed I/O port address to an allocated MMIO address. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-22 16:08:41 +01:00
Rob Bradford	6006068951	vmm: acpi: Move MemoryManager ACPI device to an MMIO address Migrate the MemoryManager from a fixed I/O port address to an allocated MMIO address. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-22 16:08:41 +01:00
Rob Bradford	28ab6cea0e	vmm: acpi: Move ACPI GED device to MMIO bus Currently the GED control is in a fixed I/O port address but instead use an MMIO address that has been chosen by the allocator. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-22 16:08:41 +01:00
Rob Bradford	4ebeeb1310	vmm: device_manager: Rename shutdown ACPI device Rename the variable for the shutdown device to clarify it's purpose in the function. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-22 16:08:41 +01:00
Rob Bradford	843a010163	vmm: vm, device_manager: Trigger a power button notification via ACPI Use the ACPI GED device to trigger a notitifcation of type POWER_BUTTON_CHANGED which will ultimately lead to the guest being notified. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-13 17:00:39 +00:00
Rob Bradford	ff4d378f1b	vmm: device_manager: acpi: Add a power button (PNP0C0C) Add a power button to the devices defined in the \_SB Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-13 17:00:39 +00:00
Rob Bradford	7b376fa8e0	devices: acpi: Generalise the HotPlugNotificationFlags Renamed this bitfield as it will also be used for non-hotplug purposes such as synthesising a power button. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-13 17:00:39 +00:00
Rob Bradford	7f9cdf1183	vmm: device_manager: Create a virtio-net device from TAP fd If a TAP fd is supplied then create an virtio-net device using that TAP fd. Fixes: #2052 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-12-17 22:51:30 +01:00
Rob Bradford	03db48306b	vmm: Activate virtio device from VMM thread When a device is ready to be activated signal to the VMM thread via an EventFd that there is a device to be activated. When the VMM receives a notification on the EventFd that there is a device to be activated notify the device manager to attempt to activate any devices that have not been activated. As a side effect the VMM thread will create the virtio device threads. Fixes: #1863 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-12-17 11:23:53 +00:00
Rob Bradford	1fc6d50f3e	misc: Make Bus::write() return an Option<Arc<Barrier>> This can be uses to indicate to the caller that it should wait on the barrier before returning as there is some asynchronous activity triggered by the write which requires the KVM exit to block until it's completed. This is useful for having vCPU thread wait for the VMM thread to proceed to activate the virtio devices. See #1863 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-12-17 11:23:53 +00:00
Muminul Islam	f4af668d76	hypervisor, vmm: Implement MsiInterruptOps for mshv Co-Developed-by: Wei Liu <liuwe@microsoft.com> Signed-off-by: Wei Liu <liuwe@microsoft.com> Signed-off-by: Muminul Islam <muislam@microsoft.com>	2020-12-09 14:55:20 +01:00
Muminul Islam	9ce6c3b75c	hypervisor, vmm: Feature guard KVM specific code There are some code base and function which are purely KVM specific for now and we don't have those supports in mshv at the moment but we have plan for the future. We are doing a feature guard with KVM. For example, KVM has mp_state, cpu clock support, which we don't have for mshv. In order to build those code we are making the code base for KVM specific compilation. Signed-off-by: Muminul Islam <muislam@microsoft.com>	2020-12-09 14:55:20 +01:00
Rob Bradford	280d4fb245	vmm: Include device tree in vm.info API The DeviceNode cannot be fully represented as it embeds a Rust style enum (i.e. with data) which is instead represented by a simple associative array. Fixes: #1167 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-12-01 16:44:25 +01:00
Rob Bradford	593a958fe5	pci, vmm: Include VFIO devices in device tree Fixes: #1687 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-12-01 10:49:04 +01:00
Rob Bradford	6f63cd00eb	vmm: device_manager: Remove ActivatedBackend struct This was used for vhost-user self spawning but no longer has any users. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-25 20:47:43 +01:00
Rob Bradford	c622278030	config, device_manager: Add support for disabling io_uring for testing Add config parameter to --disk called "_disable_io_uring" (the underscore prefix indicating it is not for public consumpion.) Use this option to disable io_uring if it would otherwise be used. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-18 11:47:54 +01:00
Rob Bradford	c62e409827	memory_manager: Generate a MemoryRangeTable for dirty ranges In order to do this we must extend the MemoryManager API to add the ability to specify the tracking of the dirty pages when creating the userspace mappings and also keep track of the userspace mappings that have been created for RAM regions. Currently the dirty pages are collected into ranges based on a block level of 64 pages. The algorithm could be tweaked to create smaller ranges but for now if any page in the block of 64 is dirty the whole block is added to the range. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-17 16:57:11 +00:00
Rob Bradford	8baa244ec1	hypervisor: Add control for dirty page logging When creating a userspace mapping provide a control for enabling the logging of dirty pages. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-17 16:57:11 +00:00
Rob Bradford	7b77f1ef90	vmm: Remove self-spawning functionality for vhost-user-{net,block} This also removes the need to lookup up the "exe" symlink for finding the VMM executable path. Fixes: #1925 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-09 00:16:15 +01:00
Sebastien Boeuf	3594685279	vmm: Move balloon code from MemoryManager to DeviceManager Now that we have a new dedicated way of asking for a balloon through the CLI and the REST API, we can move all the balloon code to the device manager. This allows us to simplify the memory manager, which is already quite complex. It also simplifies the behavior of the balloon resizing command. Instead of providing the expected size for the RAM, which is complex when memory zones are involved, it now expects the balloon size. This is a much more straightforward behavior as it really resizes the balloon to the desired size. Additionally to the simplication, the benefit of this approach is that it does not need to be tied to the memory manager at all. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-22 16:33:16 +02:00
Rob Bradford	885ee9567b	vmm: Add support for creating virtio-watchdog The watchdog device is created through the "--watchdog" parameter. At most a single watchdog can be created per VM. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-10-21 16:02:39 +01:00
Sebastien Boeuf	af3c6c34c3	vmm: Remove mmio and pci differentiation Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-19 14:58:48 +01:00
Sebastien Boeuf	7cbd47a71a	vmm: Prevent KVM device fd from being unusable from VfioContainer When shutting down a VM using VFIO, the following error has been detected: vfio-ioctls/src/vfio_device.rs:312 -- Could not delete VFIO group: KvmSetDeviceAttr(Error(9)) After some investigation, it appears the KVM device file descriptor used for removing a VFIO group was already closed. This is coming from the Rust sequence of Drop, from the DeviceManager all the way down to VfioDevice. Because the DeviceManager owns passthrough_device, which is effectively a KVM device file descriptor, when the DeviceManager is dropped, the passthrough_device follows, with the effect of closing the KVM device file descriptor. Problem is, VfioDevice has not been dropped yet and it still needs a valid KVM device file descriptor. That's why the simple way to fix this issue coming from Rust dropping all resources is to make Linux accountable for it by duplicating the file descriptor. This way, even when the passthrough_device is dropped, the KVM file descriptor is closed, but a duplicated instance is still valid and owned by the VfioContainer. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-15 19:15:28 +02:00
Sebastien Boeuf	1e3a6cb450	vmm: Simplify some of the io_uring code Small patch creating a dedicated `block_io_uring_is_supported()` function for the non-io_uring case, so that we can simplify the code in the DeviceManager. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-07 14:26:49 +02:00
Sebastien Boeuf	6aa5e21212	vmm: device_manager: Fix PCI device unplug issues Because of the PCI refactoring that happened in the previous commit `d793cc4da3`, the ability to fully remove a PCI device was altered. The refactoring was correct, but the usage of a generic function to pass the same reference for both BusDevice, PciDevice and Any + Send + Sync causes the Arc::ptr_eq() function to behave differently than expected, as it does not match the references later in the code. That means we were not able to remove the device reference from the MMIO and/or PIO buses, which was leading to some bus range overlapping error once we were trying to add a device again to the previous range that should have been removed. Fixes #1802 Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-06 12:56:17 +02:00
Rob Bradford	2d457ab974	vmm: device_manager: Make PMEM "discard_writes" mode true CoW The PMEM support has an option called "discard_writes" which when true will prevent changes to the device from hitting the backing file. This is trying to be the equivalent of "readonly" support of the block device. Previously the memory of the device was marked as KVM_READONLY. This resulted in a trap when the guest attempted to write to it resulting a VM exit (and recently a warning). This has a very detrimental effect on the performance so instead make "discard_writes" truly CoW by mapping the memory as `PROT_READ \| PROT_WRITE` and using `MAP_PRIVATE` to establish the CoW mapping. Fixes: #1795 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-10-02 14:26:15 +02:00
Rob Bradford	664c3ceda6	vmm: device_manager: Warn that vhost-user self spawning is deprecated See #1724 for details. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-09-30 18:32:50 +02:00
Henry Wang	381d0b4372	devices: remove the migration traits for the `Gic` struct Unlike x86_64, the "interrupt_controller" in the device manager for AArch64 is only a `Gic` object that implements the `InterruptController` to provide the interrupt delivery service. This is not the real GIC device so that we do not need to save its states. Also, we do not need to insert it to the device_tree. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2020-09-23 12:37:25 +01:00
Henry Wang	7ddcad1d8b	arch: AArch64: add a field `gicr_typers` for GIC implementations The value of GIC register `GICR_TYPER` is needed in restoring the GIC states. This commit adds a field in the GIC device struct and a method to construct its value. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2020-09-23 12:37:25 +01:00
Henry Wang	dcf6d9d731	device_manager: AArch64: add a field to set/get GIC device entity In AArch64 systems, the state of GIC device can only be retrieved from `KVM_GET_DEVICE_ATTR` ioctl. Therefore to implement saving/restoring the GIC states, we need to make sure that the GIC object (either the file descriptor or the device itself) can be extracted after the VM is started. This commit refactors the code of GIC creation by adding a new field `gic_device_entity` in device manager and methods to set/get this field. The GIC object can be therefore saved in the device manager after calling `arch::configure_system`. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2020-09-23 12:37:25 +01:00
Sebastien Boeuf	7c346c3844	vmm: Kill vhost-user self-spawned process on failure If after the creation of the self-spawned backend, the VMM cannot create the corresponding vhost-user frontend, the VMM must kill the freshly spawned process in order to ensure the error propagation can happen. In case the child process would still be around, the VMM cannot return the error as it waits onto the child to terminate. This should help us identify when self-spawned failures are caused by a connection being refused between the VMM and the backend. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-18 17:26:25 +01:00
Sebastien Boeuf	de2b917f55	vmm: Add hotplugged_size to VirtioMemZone Adding a new field to VirtioMemZone structure, as it lets us associate with a particular virtio-mem region the amount of memory that should be plugged in at boot. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Sebastien Boeuf	3faf8605f3	vmm: Group virtio-mem fields under a dedicated structure This patch simplifies the code as we have one single Option for the VirtioMemZone. This also prepares for storing additional information related to the virtio-mem region. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Hui Zhu	33a1e37c35	virtio-devices: mem: Allow for an initial size This commit gives the possibility to create a virtio-mem device with some memory already plugged into it. This is preliminary work to be able to reboot a VM with the virtio-mem region being already resized. Signed-off-by: Hui Zhu <teawater@antfin.com> Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Sebastien Boeuf	eb7b923e22	vmm: Create virtio-mem device with appropriate NUMA node Now that virtio-mem device accept a guest NUMA node as parameter, we retrieve this information from the list of NUMA nodes. Based on the memory zone associated with the virtio-mem device, we obtain the NUMA node identifier, which we provide to the virtio-mem device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Sebastien Boeuf	dcedd4cded	virtio-devices: virtio-mem: Add NUMA support Implement support for associating a virtio-mem device with a specific guest NUMA node, based on the ACPI proximity domain identifier. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Sebastien Boeuf	775f3346e3	vmm: Rename 'virtiomem' to 'virtio_mem' For more consistency and help reading the code better, this commit renames all 'virtiomem' variables into 'virtio_mem'. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Sebastien Boeuf	30ff7e108f	vmm: Prepare code to accept multiple virtio-mem devices Both MemoryManager and DeviceManager are updated through this commit to handle the creation of multiple virtio-mem devices if needed. For now, only the framework is in place, but the behavior remains the same, which means only the memory zone created from '--memory' generates a virtio-mem region that can be used for resize. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-09-16 19:20:04 +02:00
Rob Bradford	d793cc4da3	vmm: device_manager: Extract common PCI code Extract common code for adding devices to the PCI bus into its own function from the VFIO and VIRTIO code paths. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-09-11 07:33:18 +02:00
Rob Bradford	15025d71b1	devices, vm-device: Move BusDevice and Bus into vm-device This removes the dependency of the pci crate on the devices crate which now only contains the device implementations themselves. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-09-10 09:35:38 +01:00
Bo Chen	3c923f0727	virtio-devices: seccomp: Add seccomp filters for virtio_vsock thread This patch enables the seccomp filters for the virtio_vsock worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-09-09 17:04:39 +01:00
Bo Chen	1175fa2bc7	virtio-devices: seccomp: Add seccomp filters for blk_io_uring thread This patch enables the seccomp filters for the block_io_uring worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-09-09 17:04:39 +01:00
Sebastien Boeuf	871138d5cc	vm-migration: Make snapshot() mutable There will be some cases where the implementation of the snapshot() function from the Snapshottable trait will require to modify some internal data, therefore we make this possible by updating the trait definition with snapshot(&mut self). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-08-25 16:43:10 +02:00
Anatol Belski	12212d2966	pci: device_manager: Remove hardcoded I/O port assignment It is otherwise seems to be able to cause resource conflicts with Windows APCI_HAL. The OS might do a better job on assigning resources to this device, withouth them to be requested explicitly. 0xcf8 and 0xcfc are only what is certainly needed for the PCI device enumeration. Signed-off-by: Anatol Belski <anatol.belski@microsoft.com>	2020-08-25 09:00:06 +02:00
Michael Zhao	afc98a5ec9	vmm: Fix AArch64 clippy warnings of vmm and other crates Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-08-24 10:59:08 +02:00
Bo Chen	02d87833f0	virtio-devices: seccomp: Add seccomp filters for vhost_blk thread This patch enables the seccomp filters for the vhost_blk worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-19 08:33:58 +02:00
Bo Chen	896b9a1d4b	virtio-devices: seccomp: Add seccomp filter for vhost_net_ctl thread This patch enables the seccomp filters for the vhost_net_ctl worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-19 08:33:58 +02:00
Bo Chen	02d63149fe	virtio-devices: seccomp: Add seccomp filters for vhost_fs thread This patch enables the seccomp filters for the vhost_fs worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-19 08:33:58 +02:00
Bo Chen	c82ded8afa	virtio-devices: seccomp: Add seccomp filters for balloon thread This patch enables the seccomp filters for the balloon worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-19 08:33:58 +02:00
Bo Chen	c460178723	virtio-devices: seccomp: Add seccomp filters for mem thread This patch enables the seccomp filters for the mem worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-19 08:33:58 +02:00
Bo Chen	4539236690	virtio-devices: seccomp: Add seccomp filters for iommu thread This patch enables the seccomp filters for the iommu worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-17 21:08:49 +02:00
Anatol Belski	eba42c392f	devices: acpi: Add UID to devices with common HID Some OS might check for duplicates and bail out, if it can't create a distinct mapping. According to ACPI 5.0 section 6.1.12, while _UID is optional, it becomes required when there are multiple devices with the same _HID. Signed-off-by: Anatol Belski <ab@php.net>	2020-08-14 08:52:02 +02:00
Bo Chen	dc71d2765a	virtio-devices: seccomp: Add seccomp filters for pmem thread This patch enables the seccomp filters for the pmem worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-05 08:13:31 +01:00
Bo Chen	d77977536d	virtio-devices: seccomp: Add seccomp filters for net thread This patch enables the seccomp filters for the net worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-05 08:13:31 +01:00
Bo Chen	276df6b71c	virtio-devices: seccomp: Add seccomp filters for console thread This patch enables the seccomp filters for the console worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-05 08:13:31 +01:00
Bo Chen	a426221167	virtio-devices: seccomp: Add seccomp filters for rng thread This patch enables the seccomp filters for the rng worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-05 08:13:31 +01:00
Bo Chen	704edd544c	virtio-devices: seccomp: Add seccomp_filter module This patch added the seccomp_filter module to the virtio-devices crate by taking reference code from the vmm crate. This patch also adds allowed-list for the virtio-block worker thread. Partially fixes: #925 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-08-04 11:40:49 +02:00
Sebastien Boeuf	8f0bf82648	io_uring: Add new feature gate By adding a new io_uring feature gate, we let the user the possibility to choose if he wants to enable the io_uring improvements or not. Since the io_uring feature depends on the availability on recent host kernels, it's better if we leave it off for now. As soon as our CI will have support for a kernel 5.6 with all the features needed from io_uring, we'll enable this feature gate permanently. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-08-03 14:15:01 +01:00
Sebastien Boeuf	917027c55b	vmm: Rely on virtio-blk io_uring when possible In case the host supports io_uring and the specific io_uring options needed, the VMM will choose the asynchronous version of virtio-blk. This will enable better I/O performances compared to the default synchronous version. This is also important to note the VMM won't be able to use the asynchronous version if the backend image is in QCOW format. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-08-03 14:15:01 +01:00
Wei Liu	a52b614a61	vmm: device_manager: console input should be only consumed by one device Cloud Hypervisor allows either the serial or virtio console to output to TTY, but TTY input is pushed to both. This is not correct. When Linux guest is configured to spawn TTYs on both ttyS0 and hvc0, the user effectively issues the same commands twice in different TTYs. Fix this by only direct input to the one choice that is using host side TTY. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-30 18:05:01 +02:00
Wei Liu	5ed794a44c	vmm: device_manager: rename console_input to virtio_console_input Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-30 18:05:01 +02:00
Wei Liu	3e68867bb7	vmm: device_manager: eliminate KvmMsiInterruptManager from the new function The logic to create an MSI interrupt manager is applicable to Hyper-V as well. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-30 08:00:33 +02:00
Wei Liu	218ec563fc	vmm: fix warnings when KVM is not enabled Some imports are only used by KVM. Some variables and code become dead or unused when KVM is not enabled. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-28 21:08:39 +01:00
Rob Bradford	aae5d988e1	devices: vmm: Add ACPI PM timer This is a counter exposed via an I/O port that runs at 3.579545MHz. Here we use a hardcoded I/O and expose the details through the FADT table. TEST=Boot Linux kernel and see the following in dmesg: [ 0.506198] clocksource: acpi_pm: mask: 0xffffff max_cycles: 0xffffff, max_idle_ns: 2085701024 ns Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-07-23 13:10:21 +01:00
Wei Liu	f03afea0d6	device_manager: document unsafe block in add_vfio_device It is not immediately obvious why the conversion is safe. Document the safety guarantee. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-21 17:13:10 +01:00
Samuel Ortiz	be51ea250d	device_manager: Simplify the passthrough internal API We store the device passthrough handler, so we should use it through our internal API and only carry the passed through device configuration. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-07-21 17:20:25 +02:00
Michael Zhao	ddf1b76906	hypervisor: Refactor create_passthrough_device() for generic type Changed the return type of create_passthrough_device() to generic type hypervisor::Device. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-21 16:22:02 +02:00
Wei Liu	d484a3383c	vmm: device_manager: introduce add_passthrough_device It calls add_vfio_device on KVM or returns an error when not running on KVM. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-17 20:21:39 +02:00
Wei Liu	821892419c	vmm: device_manager: use generic names for passthrough device Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-17 20:21:39 +02:00
Wei Liu	ff8d7bfe83	hypervisor: add create_passthrough_device call to Vm trait That function is going to return a handle for passthrough related operations. Move create_kvm_device code there. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-17 20:21:39 +02:00
Wei Liu	c08d2b2c70	device_manager: avoid manipulating MemoryRegion fields directly Hyper-V may have different field names. Use make_user_memory_region instead. No functional change. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-16 15:56:03 +02:00
Michael Zhao	cce6237536	pci: Enable GSI routing (MSI type) for AArch64 In this commit we saved the BDF of a PCI device and set it to "devid" in GSI routing entry, because this field is mandatory for GICv3-ITS. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-14 14:34:54 +01:00
Michael Zhao	17057a0dd9	vmm: Fix build errors with "pci" feature on AArch64 Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-14 14:34:54 +01:00
Rob Bradford	4963e37dc8	qcow, virtio-devices: Break cyclic dependency Move the definition of RawFile from virtio-devices crate into qcow crate. All the code that consumes RawFile also already depends on the qcow crate for image file type detection so this change breaks the need for the qcow crate to depend on the very large virtio-devices crate. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-07-10 17:47:31 +02:00
Hui Zhu	800220acbb	virtio-balloon: Store the balloon size to support reboot This commit store balloon size to MemoryConfig. After reboot, virtio-balloon can use this size to inflate back to the size before reboot. Signed-off-by: Hui Zhu <teawater@antfin.com>	2020-07-07 17:25:13 +01:00
Hui Zhu	8b6b97b86f	vmm: Add virtio-balloon support This commit adds new option balloon to memory config. Set it to on will open the balloon function. Signed-off-by: Hui Zhu <teawater@antfin.com>	2020-07-07 17:25:13 +01:00
Wei Liu	cfa758fbb1	vmm, hypervisor: introduce and use make_user_memory_region This removes the last KVM-ism from memory_manager. Also make use of that method in other places. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-06 12:31:19 +02:00
Wei Liu	8d97d628c3	vmm: drop "kvm" from memory slot code The code is purely for maintaining an internal counter. It is not really tied to KVM. No functional change. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-06 12:31:19 +02:00
Samuel Ortiz	4cc8853fe4	vmm: device_manager: Rename vm_fd The _fd suffix is KVM specific. But since it now point to an hypervisor agnostic hypervisor::Vm implementation, we should just rename it vm. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-07-06 09:35:30 +01:00
Rob Bradford	2a6eb31d5b	vm-virtio, virtio-devices: Split device implementation from virt queues Split the generic virtio code (queues and device type) from the VirtioDevice trait, transport and device implementations. This also simplifies the feature handling in vhost_user_backend as the vm-virtio crate is no longer has any features. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-07-02 17:09:28 +01:00
Wei Liu	2b8accf49a	vmm: interrupt: put KVM code into a kvm module Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-30 12:09:42 +01:00
Wei Liu	637f58bcd9	vmm: interrupt: drop Kvm prefix from KvmLegacyUserspaceInterruptManager This data structure doesn't contain KVM specific stuff. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-30 12:09:42 +01:00
Wei Liu	574cab6990	vmm: interrupt: create GSI hashmap directly The observation is that the GSI hashmap remains untouched before getting passed into the MSI interrupt manager. We can create that hashmap directly in the interrupt manager's new function. The drops one import from the interrupt module. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-30 12:09:42 +01:00
Wei Liu	b00171e17d	vmm: use MemoryRegion where applicable That removes one more KVM-ism in VMM crate. Note that there are more KVM specific code in those files to be split out, but we're not at that stage yet. No functional change. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-25 10:25:13 +02:00
Rob Bradford	d983c0a680	vmm: Expose counters from virtio devices to API Collate the virtio device counters in DeviceManager for each device that exposes any and expose it through the recently added HTTP API. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-25 07:02:44 +02:00
Sebastien Boeuf	e382dc6657	vmm, vm-virtio: Restore DeviceManager's devices in a paused state The same way the VM and the vCPUs are restored in a paused state, all devices associated with the device manager must be restored in the same paused state. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-23 10:15:03 +02:00
Wei Liu	7552f4db61	vmm: device_manager: restore error handling When the hypervisor crate was introduced, a few places that handled errors were commented out in favor of unwrap, but that's bad practice. Restore proper error handling in those places in this patch. We cannot use from_raw_os_error anymore because it is wrapped deep under hypervisor crate. Create new custom errors instead. Fixes: `e4dee57e81` ("arch, pci, vmm: Initial switch to the hypervisor crate") Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-22 22:02:21 +01:00
Muminul Islam	e4dee57e81	arch, pci, vmm: Initial switch to the hypervisor crate Start moving the vmm, arch and pci crates to being hypervisor agnostic by using the hypervisor trait and abstractions. This is not a complete switch and there are still some remaining KVM dependencies. Signed-off-by: Muminul Islam <muislam@microsoft.com> Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-06-22 15:03:15 +02:00
Sebastien Boeuf	f08e9b6a73	vmm: device_manager: Return PciDeviceInfo from a hotplugged device In order to provide the device name and PCI b/d/f associated with a freshly hotplugged device, the hotplugging functions from the device manager return a new structure called PciDeviceInfo. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-12 13:37:18 +01:00
Michael Zhao	5cd1730bc4	vmm: Configure VM on AArch64 Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Michael Zhao	eeeb45bbb9	vmm: Enable device manager for AArch64 Screened IO bus because it is not for AArch64. Enabled Serial, RTC and Virtio devices with MMIO transport option. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-06-11 15:00:17 +01:00
Rob Bradford	9b71ba20ac	vmm, vm-virtio: Stop always autogenerating a host MAC address This removes the need to use CAP_NET_ADMIN privileges and instead the host MAC addres is either provided by the user or alternatively it is retrieved from the kernel. TEST=Run cloud-hypervisor without CAP_NET_ADMIN permission and a preconfigured tap device: sudo ip tuntap add name tap0 mode tap sudo ifconfig tap0 192.168.249.1 netmask 255.255.255.0 up cargo clean cargo build target/debug/cloud-hypervisor --serial tty --console off --kernel ~/src/rust-hypervisor-firmware/target/target/release/hypervisor-fw --disk path=~/workloads/clear-33190-kvm.img --net tap=tap0 VM was also rebooted to check that works correctly. Fixes: #1274 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-06-08 17:56:10 +02:00
Bo Chen	a8cdf2f070	tests,vm-virtio,vmm: Use 'socket' for all CLI/API parameters This patch unifies the inconsistent uses of 'socket' and 'sock' from our CLI/API parameters. Fixes: #1091 Signed-off-by: Bo Chen <chen.bo@intel.com>	2020-06-08 17:41:12 +02:00
Samuel Ortiz	3336e80192	vfio: Switch to the vfio-ioctls crate ch branch Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-06-04 08:48:55 +02:00
Samuel Ortiz	d24aa72d3e	vfio: Rename to vfio-ioctls Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-06-04 08:48:55 +02:00
Samuel Ortiz	53ce529875	vfio: Move the PCI implementation to the PCI crate There is a much stronger PCI dependency from vfio_pci.rs than a VFIO one from pci/src/vfio.rs. It seems more natural to have the PCI specific VFIO implementation in the PCI crate rather than the other way around. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-06-04 08:48:55 +02:00
Michael Zhao	8f1f9d9e6b	devices: Implement InterruptController on AArch64 This commit only implements the InterruptController crate on AArch64. The device specific part for GIC is to be added. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-05-26 11:09:19 +02:00
Michael Zhao	b32d3025f3	devices: Refactor IOAPIC to cover other architectures IOAPIC, a X86 specific interrupt controller, is referenced by device manager and CPU manager. To work with more architectures, a common type for all architectures is needed. This commit introduces trait InterruptController to provide architecture agnostic functions. Device manager and CPU manager can use it without caring what the underlying device is. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-05-26 11:09:19 +02:00
Michael Zhao	1befae872d	build: Fixed build errors and warnings on AArch64 This is a preparing commit to build and test CH on AArch64. All building issues were fixed, but no functionality was introduced. For X86, the logic of code was not changed at all. For ARM, the architecture specific part is still empty. And we applied some tricks to workaround lint warnings. But such code will be replaced later by other commits with real functionality. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-05-21 11:56:26 +01:00
Rob Bradford	af8292b623	vmm, config, vhost_user_blk: remove "wce" parameter This config option provided very little value and instead we now enable this feature (which then lets the guest control the cache mode) unconditionally. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-21 08:40:43 +02:00
Rob Bradford	1b8b5ac179	vhost-user_net, vm-virtio, vmm: Permit host MAC address setting Add a new "host_mac" parameter to "--net" and "--net-backend" and use this to set the MAC address on the tap interface. If no address is given one is randomly assigned and is stored in the config. Support for vhost-user-net self spawning was also included. Fixes: #1177 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-15 11:45:09 +01:00
Sebastien Boeuf	c37da600e8	vmm: Update DeviceTree upon PCI BAR reprogramming By passing a reference of the DeviceTree to the AddressManager, we can now update the DeviceTree whenever a PCI BAR is reprogrammed. This is mandatory to maintain the correct resources information related to each virtio-pci device, which will ensure correct information will be stored upon VM snapshot. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-12 17:37:31 +01:00
Sebastien Boeuf	d0ae9d7ce6	vmm: Share the DeviceTree across threads We want to be able to share the same DeviceTree across multiple threads, particularly to handle the use case where PCI BAR reprogramming might need to update the tree while from another thread a new device is being added to the tree. That's why this patch moves the DeviceTree instance into an Arc<Mutex<>> so that we can later share a reference of the same mutable tree with the AddressManager responsible for handling PCI BAR reprogramming. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-12 17:37:31 +01:00
Sebastien Boeuf	5e9d254564	vmm: Store and restore virtio-pci BAR resources By using the vector of resources provided by the DeviceNode, the device manager can store the information related to PCI BARs from a virtio-pci device. Based on this, and upon VM restoration, the device manager can restore the BARs in the expected location in the guest address space. One thing to note is that we only need to provide the VirtioPciDevice with the configuration BAR (BAR 0) since the SHaredMemory BAR info comes from the virtio device directly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-12 17:37:31 +01:00
Sebastien Boeuf	8a826ae24c	vmm: Store and restore virtio-pci device on right PCI slot Based on the new field "pci_bdf", a virtio-pci device can be restored at the same place on the PCI bus it was located before the VM snapshot. This ensures consistent placement on the PCI bus, based on the stored information related to each device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-12 17:37:31 +01:00
Rob Bradford	592de97fbd	vhost_user_net: Use config::OptionParser to simplify net backend parsing Switch to using the recently added OptionParser in the code that parses the network backend. Whilst doing this also update the net-backend syntax to use "sock" rather than socket. Fixes: #1092 Partially fixes: #1091 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-11 09:40:40 +02:00
Sebastien Boeuf	64e01684f9	vmm: Create new module device_tree This module will be dedicated to DeviceNode and DeviceTree definitions along with some dedicated unit tests. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	3b77be903d	vmm: Add device_node!() macro to improve code readability Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	83ec716ec4	vmm: Create breadth-first search iterator for the DeviceTree This iterator will let the VMM enumerate the resources associated with the DeviceManager, allowing for introspection. Moreover, by implementing a double ended iterator, we can get the hierarchy from the leaves to the root of the tree, which is very helpful in the context of restoring the devices in the right order. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	b91ab1e3a5	vmm: Remove the list of migratable devices Now that the device tree fully replaced the need for a dedicated list of migratable devices, this commit cleans up the codebase by removing it from the DeviceManager. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	1be7037229	vmm: Don't use migratable_devices for restore This commit switches from migratable_devices to device_tree in order to restore devices exclusively based on the device tree. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	bc6084390f	vmm: Add migratable field to the DeviceNode This commit adds an extra field to the DeviceNode so that the structure can hold a Migratable device. The long term plan is to be able to remove the dedicated table of migratable devices, but instead rely only on the device tree. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	7fec020f53	vmm: Create a dedicated DeviceTree structure In order to hide the complexity chosen for the device tree stored in the DeviceManager, we introduce a new DeviceTree structure. For now, this structure is a simple passthrough of a HashMap, but it can be extended to handle some DeviceTree specific operations. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	14b379dec5	vmm: Add an identifier field to DeviceNode structure Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	0805d458c4	vmm: Add support for multiple children per DeviceNode Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	daaeba5142	vmm: Change Node into DeviceNode Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	5c7df03efe	vmm: Store and restore virtio-pmem resources This device has a dedicated memory region in the guest address space, which means in case of snapshot/restore, it must be restored in the exact same location it was during the snapshot. That's through the resources that we can describe the location of this extra memory region, allowing the device for correct restoring. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	2e6895d911	vmm: Store and restore virtio-fs resources This device has a dedicated memory region in the guest address space, which means in case of snapshot/restore, it must be restored in the exact same location it was during the snapshot. That's through the resources that we can describe the location of this extra memory region, allowing the device for correct restoring. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	987f82152e	vmm: Store and restore virtio-mmio resources Based on the device tree, retrieve the resources associated with a virtio-mmio device to restore it at the right location in guest address space. Also, the IRQ number is correctly restored. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	9cb1e1cc6b	vmm: Perform MMIO allocation from virtio-mmio device creation Instead of splitting the MMIO allocation and the device creation into separate functions for virtio-mmio devices, it's is easier to move everything into the same function as we'll be able to gather resources in the same place for the same device. These resources will be stored in the device tree in a follow up patch. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	adf297066d	vmm: Create devices in different path if restoring the VM In case the VM is created from scratch, the devices should be created after the DeviceManager has been created. But this should not affect the restore codepath, as in this case the devices should be created as part of the restore() function. It's necessary to perform this differentiation as the restore must go through the following steps: - Create the DeviceManager - Restore the DeviceManager with the right state - Create the devices based on the restored DeviceManager's device tree - Restore each device based on the restored DeviceManager's device tree That's why this patch leverages the recent split of the DeviceManager's creation to achieve what's needed. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	d39f91de02	vmm: Reorganize DeviceManager creation This commit performs the split of the DeviceManager's creation into two separate functions by moving anything related to device's creation after the DeviceManager structure has been initialized. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	89c2a5868c	vmm: Restore devices following the device tree Based on the device tree, we now ensure the restore can be done in the right order, as it will respect the dependencies between nodes. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	52c80cfcf5	vmm: Snapshot and restore DeviceManager state The DeviceManager itself must be snapshotted in order to store the information regarding the devices associated with it, which effectively means we need to store the device tree. The mechanics to snapshot and restore the DeviceManagerState are added to the existing snapshot() and restore() implementations. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Sebastien Boeuf	5b408eec66	vmm: Create a device tree The DeviceManager now creates a tree of devices in order to store the resources associated with each device, but also to track dependencies between devices. This is a key part for proper introspection, but also to support snapshot and restore correctly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-05-05 16:08:42 +02:00
Rob Bradford	fec97e0586	vm-virtio, vmm: Delete unix socket on shutdown It's not possible to call UnixListener::Bind() on an existing file so unlink the created socket when shutting down the Vsock device. This will allow the VM to be rebooted with a vsock device. Fixes: #1083 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-05-05 13:01:38 +02:00
Rob Bradford	d5bfa2dfc8	vmm, vhost_user_block: Make parameter names match --disk Make the --block-backend parameters match the --disk parameters. Fixes: #898 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-30 15:20:55 +02:00
Sebastien Boeuf	2f0bc06bec	vmm: Update default devices names as "internal" Let's put an underscore "_" in front of each device name to identify when it has been set internally. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	aaba6e777f	vmm: Add virtio-console to the list of Migratable devices The virtio-console was not added to the list of Migratable devices, which is fixed from this patch. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	9ab4bb1ae2	devices: serial: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	06487131f9	vm-virtio: pci: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. It is based off the name from the virtio device attached to this transport layer. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	eeb7e10d1f	vm-virtio: mmio: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. It is based off the name from the virtio device attached to this transport layer. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	9d84ef5073	vmm: Make the virtio identifier mandatory Because we know we will need every virtio device to be identified with a unique id, we can simplify the code by making the identifier mandatory. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	14350f5de4	devices: ioapic: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	556871570e	vm-virtio: iommu: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	052eff1ca7	vm-virtio: console: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	354c2a4b3d	vm-virtio: vhost-user-net: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	46e0b3ff75	vm-virtio: vhost-user-blk: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	bb7fa71fcb	vm-virtio: vhost-user-fs: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	ec5ff395cf	vm-virtio: vsock: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	9b53044aae	vm-virtio: mem: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	1592a9292f	vm-virtio: pmem: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	2e91b73881	vm-virtio: rng: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	9eb7413fab	vm-virtio: net: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	be946caf4b	vm-virtio: blk: Expect an identifier upon device creation This identifier is chosen from the DeviceManager so that it will manage all identifiers across the VM, which will ensure uniqueness. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	ff9c8b847f	vmm: Always generate the next device name Even in the context of "mmio" feature, we need the next device name to be generated as we need to identify virtio-mmio devices to support snapshot and restore functionalities. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	8183141399	vmm: Add an identifier to the ioapic device This will be later used to identify each device used by the VM in order to perform introspection and snapshot/restore properly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	e4386c8bb7	vmm: Add an identifier to the virtio-iommu device This will be later used to identify each device used by the VM in order to perform introspection and snapshot/restore properly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	75ddd2a244	vmm: Add an identifier to the --console device This will be later used to identify each device used by the VM in order to perform introspection and snapshot/restore properly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	eac350c454	vmm: Add an identifier to the virtio-mem device This will be later used to identify each device used by the VM in order to perform introspection and snapshot/restore properly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	6802ef5406	vmm: Add an identifier to the --rng device This will be later used to identify each device used by the VM in order to perform introspection and snapshot/restore properly. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	d71d52e9b0	vmm: Fix virtio-console creation with virtual IOMMU If the virtio-console device is supposed to be placed behind the virtual IOMMU, this must be explicitly propagated through the code. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	b08fde5928	vmm: Fix virtio-rng creation with virtual IOMMU If the virtio-rng device is supposed to be placed behind the virtual IOMMU, this must be explicitly propagated through the code. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	8031ac33c3	vmm: Fix virtio-vsock creation with virtual IOMMU If the virtio-vsock device is supposed to be placed behind the virtual IOMMU, this must be explicitly propagated through the code. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 19:34:31 +01:00
Sebastien Boeuf	a5de49558e	vmm: Only allow removal of specific types of virtio device Now that all virtio devices are assigned with identifiers, they could all be removed from the VM. This is not something that we want to allow because it does not make sense for some devices. That's why based on the device type, we remove the device or we return an error to the user. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-29 13:33:19 +01:00
Sebastien Boeuf	9ed880d74e	vmm: Add an identifier to the --fs device By giving the devices ids this effectively enables the removal of the device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-29 13:33:19 +01:00
Sebastien Boeuf	7e0ab6b56d	vmm: Fix pmem device creation The parameters regarding the attachment to the virtio-iommu device was not propagated correclty, and any modification to the configuration was not stored back into it. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-29 13:33:19 +01:00
Rob Bradford	1d61c476a1	vmm: device_manager: Add support for hotplugging virtio-vsock devices Create a new VirtioVsock device and add it to the PCI bus upon hotplug. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-29 12:44:49 +01:00
Sebastien Boeuf	6e049e0da1	vmm: Add an identifier to the --vsock device It's possible to have multiple vsock devices so in preparation for hotplug/unplug it is important to be able to have a unique identifier for each device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-29 12:44:49 +01:00
Rob Bradford	10348f73e4	vmm, main: Support only zero or one vsock devices The Linux kernel does not support multiple virtio-vsock devices. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-28 20:07:18 +02:00
Rob Bradford	1df38daf74	vmm, tests: Make specifying a size optional for virtio-pmem If a size is specified use it (in particular this is required if the destination is a directory) otherwise seek in the file to get the size of the file. Add a new check that the size is a multiple of 2MiB otherwise the kernel will reject it. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-24 18:30:05 +01:00
Sebastien Boeuf	c22fd39170	vmm: Remove virtio device's userspace mapping on hot-unplug When a virtio device is dynamically removed from the VM through the hot-unplug mechanism, every mapping associated with it must be properly removed. Based on the previous patches letting a VirtioDevice expose the list of userspace mappings associated with it, this patch can now remove all the KVM userspace memory regions through the MemoryManager. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-21 10:02:21 +01:00
Sebastien Boeuf	fbcf3a7a7a	vm-virtio: Implement userspace_mappings() for virtio-pmem When hot-unplugging the virtio-pmem from the VM, we don't remove the associated userspace mapping. This patch will let us fix this in a following patch. For now, it simply adapts the code so that the Pmem device knows about the mapping associated with it. By knowing about it, it can expose it to the caller through the new userspace_mappings() function. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-21 10:02:21 +01:00
Sebastien Boeuf	18f7789a81	vmm: Add hotplugged virtio devices to the DeviceManager list The hotplugged virtio devices were not added to the list of virtio devices from the DeviceManager. This patch fixes it, as it was causing hotplugged virtio-fs devices from not supporting memory hotplug, since they were never getting the update as they were not part of the list of virtio devices held by the DeviceManager. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-20 20:36:26 +02:00
Dean Sheather	c2abadc293	vmm: Add ability to add virtio-fs device post-boot Adds DeviceManager method `make_virtio_fs_device` which creates a single device, and modifies `make_virtio_fs_devices` to use this method. Implements the new `vm.add-fs route`. Signed-off-by: Dean Sheather <dean@coder.com>	2020-04-20 20:36:26 +02:00
Sebastien Boeuf	d35e775ed9	vmm: Update KVM userspace mapping when PCI BAR remapping In the context of the shared memory region used by virtio-fs in order to support DAX feature, the shared region is exposed as a dedicated PCI BAR, and it is backed by a KVM userspace mapping. Upon BAR remapping, the BAR is moved to a different location in the guest address space, and the KVM mapping must be updated accordingly. Additionally, we need the VirtioDevice to report the updated guest address through the shared memory region returned by get_shm_regions(). That's why a new setter is added to the VirtioDevice trait, so that after the mapping has been updated for KVM, we can tell the VirtioDevice the new guest address the shared region is located at. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-20 16:01:25 +02:00
Sebastien Boeuf	ac7178ef2a	vmm: Keep migratable devices list as a Vec The order the elements are pushed into the list is important to restore them in the right order. This is particularly important for MmioDevice (or VirtioPciDevice) and their VirtioDevice counterpart. A device must be fully ready before its associated transport layer management can trigger its restoration, which will end up activating the device in most cases. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-04-17 19:29:41 +02:00
Rob Bradford	e7e0e8ac38	vmm, devices: Add firmware debug port device OVMF and other standard firmwares use I/O port 0x402 as a simple debug port by writing ASCII characters to it. This is gated under a feature that is not enabled by default. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-17 12:54:00 +02:00
Rob Bradford	444e5c2a04	vmm: device_manager: Generalise NoAvailableVfioDeviceName We now support assigning device ids for VFIO and virtio-pci devices so this error can be generalised. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-16 17:03:25 +02:00
Rob Bradford	5bab9c3894	vmm: device_manager: Assign ids to pmem/net/disk devices if absent If the id has not been provided by the user generate an incrementing id. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-16 17:03:25 +02:00
Rob Bradford	514491a051	vmm: device_manager: Support unplugging virtio-pci devices Extend the eject_device() method on DeviceManager to also support virtio-pci devices being unplugged. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-16 17:03:25 +02:00
Rob Bradford	476e4ce24f	vmm: device_manager: Add virtio-pci devices into id to BDF map In order to support hotplugging there is a map of human readable device id to PCI BDF map. As the device id is part of the specific device configuration (e.g. NetConfig) it is necessary to return the id through from the helper functions that create the devices through to the functions that add those devices to the bus. This necessitates changing a great deal of function prototypes but otherwise has little impact. Currently only if an id is supplied by the user as part of the device configuration is it populated into this map. A later commit will populate with an autogenerated name where none is supplied by the user. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-16 17:03:25 +02:00
Rob Bradford	72fdfff15d	vmm: device_manager: Remove unused "_mmap_regions" member Now that ownership of the memory regions used for the virtio-pmem and vhost-user-fs devices have been moved into those devices it is no longer necessary to track them inside DeviceManager. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-14 17:46:11 +01:00
Rob Bradford	70ecd6bab4	vmm, virtio: fs: Move freeing of mappped region into device Move the release of the managed memory region from the DeviceManager to the vhost-user-fs device. This ensures that the memory will be freed when the device is unplugged which will lead to it being Drop()ed. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-14 17:46:11 +01:00
Rob Bradford	0c6706a510	vmm, virtio: pmem: Move freeing of mappped region into device Move the release of the managed memory region from the DeviceManager to the virtio-pmem device. This ensures that the memory will be freed when the device is unplugged which will lead to it being Drop()ed. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-04-14 17:46:11 +01:00
Samuel Ortiz	1ed357cf34	vmm: vm: Implement the Snapshottable trait By aggregating snapshots from the CpuManager, the MemoryManager and the DeviceManager, Vm implements the snapshot() function from the Snapshottable trait. And by restoring snapshots from the CpuManager, the MemoryManager and the DeviceManager, Vm implements the restore() function from the Snapshottable trait. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com> Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>	2020-04-07 12:26:10 +02:00
Samuel Ortiz	a0d5dbce6c	vmm: device_manager: Implement the Snapshottable trait Based on the list of Migratable devices stored by the DeviceManager, the DeviceManager can implement the Snapshottable trait by aggregating all devices snapshots together. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-07 12:26:10 +02:00
Yi Sun	93d3abfd6e	vmm: device_manager: Make serial and ioapic devices migratable Serial and Ioapic both implement the Migratable trait, hence the DeviceManager can store them in the list of Migratable devices. Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>	2020-04-07 12:26:10 +02:00
Samuel Ortiz	b584ec3fb3	vmm: memory_manager: Own the system allocator Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-03 18:05:18 +01:00
Samuel Ortiz	7a50646c02	vmm: device_manager: Convert migratable_devices to a map We must be able to map a migratable component id to its device. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-04-03 18:05:18 +01:00
Samuel Ortiz	1b1a2175ca	vm-migration: Define the Snapshottable and Transportable traits A Snapshottable component can snapshot itself and provide a MigrationSnapshot payload as a result. A MigrationSnapshot payload is a map of component IDs to a list of migration sections (MigrationSection). As component can be made of several Migratable sub-components (e.g. the DeviceManager and its device objects), a migration snapshot can be made of multiple snapshot itself. A snapshot is a list of migration sections, each section being a component state snapshot. Having multiple sections allows for easier and backward compatible migration payload extensions. Once created, a migratable component snapshot may be transported and this is what the Transportable trait defines, through 2 methods: send and recv. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com> Signed-off-by: Yi Sun <yi.y.sun@linux.intel.com>	2020-04-02 13:24:25 +01:00
Sebastien Boeuf	9e18177654	vmm: Add memory hotplug support to VFIO PCI devices Extend the update_memory() method from DeviceManager so that VFIO PCI devices can update their DMA mappings to the physical IOMMU, after a memory hotplug has been performed. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-27 09:35:39 +01:00
Sebastien Boeuf	cc67131ecc	vmm: Retrieve new memory region when memory is extended Whenever the memory is resized, it's important to retrieve the new region to pass it down to the device manager, this way it can decide what to do with it. Also, there's no need to use a boolean as we can instead use an Option to carry the information about the region. In case of virtio-mem, there will be no region since the whole memory has been reserved up front by the VMM at boot. This means only the ACPI hotplug will return a region and is the only method that requires the memory to be updated from the device manager. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-27 09:35:39 +01:00
Rob Bradford	8f323e61d8	vmm: Add support to DeviceManager for hotplugging network devices Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-25 17:58:06 +01:00
Rob Bradford	42a9896fe4	vmm: device_manager: Refactor make_virtio_net_devices Split it into a method that creates a single device which is called by the multiple device version so this can be used when dynamically adding a device. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-25 17:58:06 +01:00
Hui Zhu	e63f98182a	vmm: device: Add make_virtio_mem_devices Add make_virtio_mem_devices to add virtio-mem to vmm. Signed-off-by: Hui Zhu <teawater@antfin.com>	2020-03-25 15:54:16 +01:00
Rob Bradford	f7def621dd	vmm: Add support to DeviceManager for hotplugging pmem devices Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-25 13:18:17 +01:00
Rob Bradford	8c3ea8cd76	vmm: device_manager: Refactor make_virtio_pmem_devices Split it into a method that creates a single device which is called by the multiple device version so this can be used when dynamically adding a device. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-25 13:18:17 +01:00
Rob Bradford	b3082c1984	vmm: Add support to DeviceManager for hotplugging disks Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-25 09:35:53 +00:00
Rob Bradford	2be703ca92	vmm: device_manager: Refactor make_virtio_block_devices Split it into a method that creates a single device which is called by the multiple device version so this can be used when dynamically adding a device. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-25 09:35:53 +00:00
Sebastien Boeuf	e54f8ec8a5	vmm: Update memory through DeviceManager Whenever the VM memory is resized, DeviceManager needs to be notified so that it can subsequently notify each virtio devices about it. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-24 19:01:15 +00:00
Rob Bradford	f7197e8415	vmm: Add a "discard_writes=" to --pmem This opens the backing file read-only, makes the pages in the mmap() read-only and also makes the KVM mapping read-only. The file is also mapped with MAP_PRIVATE to make the changes local to this process only. This is functional alternative to having support for making a virtio-pmem device readonly. Unfortunately there is no concept of readonly virtio-pmem (or any type of NVDIMM/PMEM) in the Linux kernel so to be able to have a block device that is appears readonly in the guest requires significant specification and kernel changes. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-20 14:46:34 +01:00
Rob Bradford	d11a67b0fe	vmm: Use more generic MmapRegion constructor Switch to MmapRegion::build() and fill in the fields appropriately. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-20 14:46:34 +01:00
Rob Bradford	7257e890ef	vmm: Add "readonly" parameter MemoryManager::create_userspace_mapping Use this boolean to turn on the KVM_MEM_READONLY flag to indicate that this memory mapping should not be writable by the VM. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-20 14:46:34 +01:00
Qiu Wenbo	c503118d16	vmm: fix a corrupted stack caused by get_win_size According to `asm-generic/termios.h`, the `struct winsize` should be: struct winsize { unsigned short ws_row; unsigned short ws_col; unsigned short ws_xpixel; unsigned short ws_ypixel; }; The ioctl of TIOCGWINSZ will trigger a segfault on aarch64. Signed-off-by: Qiu Wenbo <qiuwenbo@phytium.com.cn>	2020-03-20 07:30:06 +01:00
Rob Bradford	87990f9e67	vmm: Add virtio-pci device to B/D/F hash table This table currently contains only all the VFIO devices and it should really contain all the PCI devices. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-18 19:05:58 +00:00
Rob Bradford	fb185fa839	vmm: Always return PCI B/D/F from add_virtio_pci_device Previously this was only returned if the device had an IOMMU mapping and whether the device should be added to the virtio-iommu. This was already captured earlier as part of creating the device so use that information instead. Always returning the B/D/F is helpful as it facilitates virtio PCI device hotplug. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-18 19:05:58 +00:00
Rob Bradford	7e599b4450	vmm: Make disk path optional When using "--disk" with a vhost socket and not using self spawning then it is not necessary or helpful to specify the path. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-03-13 11:41:52 +00:00
Sebastien Boeuf	8d785bbd5f	pci: Fix the PciBus using HashMap instead of Vec By using a Vec to hold the list of devices on the PciBus, there's a problem when we use unplug. Indeed, the vector of devices gets reduced and if the unplugged device was not the last one from the list, every other device after this one is shifted on the bus. To solve this problem, a HashMap is used. This allows to keep track of the exact place where each device stands on the bus. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-13 10:54:34 +01:00
Sebastien Boeuf	efba48dddb	vmm: Don't put a VFIO device behind the vIOMMU by default With some of the factorization that happened to be able to support VFIO hotplug, one mistake was made. In case a vIOMMU is created through a virtio-iommu device, and no matter the "iommu" option value from the VFIO device parameter, the VFIO device was always placed behind the virtual IOMMU. This commit fixes this wrong behavior by making sure the device configuration is taken into account to decide if it should be attached or not to the virtual IOMMU. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-11 19:50:31 +01:00
Sebastien Boeuf	34412c9b41	vmm: Add id option to VFIO hotplug Add a new id option to the VFIO hotplug command so that it matches the VFIO coldplug semantic. This is done by refactoring the existing code for VFIO hotplug, where VmAddDeviceData structure is replaced by DeviceConfig. This structure is the one used whenever a VFIO device is coldplugged, which is why it makes sense to reuse it for the hotplug codepath. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-11 19:50:31 +01:00
Sebastien Boeuf	9023444ad3	vmm: Add id field to --device through CLI Add the ability to specify the "id" associated with a device, by adding an extra option to the parameter --device. This new option is not mandatory, and by default, the VMM will take care of finding a unique identifier. If the identifier provided by the user through this new option is not unique, an error will be thrown and the VM won't be started. Fixes #881 Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-11 13:10:57 +00:00
Sebastien Boeuf	f4a956a60a	vmm: Remove 32 bits MMIO range from correct address space The 32 bits MMIO address space is handled separately from the 64 bits one. For this reason, we need to invoke the appropriate freeing function to remove a range from this address space. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-11 13:10:30 +00:00
Sebastien Boeuf	432eb5b70a	vmm: Free PCI BARs when unplugging PCI device Now that PciDevice trait has a dedicated function to remove the bars, the DeviceManager can invoke this function whenever a PCI device is unplugged from the VM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-11 13:10:30 +00:00
Sebastien Boeuf	b50cbe5064	pci: Give PCI device ID back when removing a device Upon removal of a PCI device, make sure we don't hold onto the device ID as it could be reused for another device later. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	df71aaee3f	pci: Make the device ID allocation smarter In order to handle the case where devices are very often plugged and unplugged from a VM, we need to handle the PCI device ID allocation better. Any PCI device could be removed, which means we cannot simply rely on the vector size to give the next available PCI device ID. That's why this patch stores in memory the information about the 32 slots availability. Based on this information, whenever a new slot is needed, the code can correctly provide an available ID, or simply return an error because all slots are taken. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	81173bf4ab	vmm: Add id field to DeviceConfig structure Add a new field to the DeviceConfig, allowing the VMM to allocate a name to the VFIO devices. By identifying a VFIO device with a unique name, we can make sure a user can properly unplug it at any time. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	6cbdb9aa47	vmm: api: Introduce new "remove-device" HTTP endpoint This commit introduces the new command "remove-device" that will let a user hot-unplug a VFIO PCI device from an already running VM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	991f3bb5da	vmm: Remove VFIO device from everywhere it is referenced This commit implements the eject function so that a VFIO device will be removed from any bus it might sit on, and from any list it might be stored in. The idea is to reach a point where there is no reference of the device anywhere in the code, so that the Drop implementation will be invoked and so that the device will be fully removed from the VMM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	6adebbc6a0	vmm: Detect when guest notifies about ejecting PCI device When the guest OS is done removing a PCI device, it will invoke the _EJ0 method from ACPI, associated with the device. This will trigger a port IO write to a region known by the VMM. Upon this writing, the VMM will trap the VM exit and retrieve the written value. Based on the value, the VMM will invoke its eject_device() method to finalize the removal of the device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	08604ac6a8	vmm: Store PCI devices as Any devices from DeviceManager As we try to keep track of every PCI device related to the VM, we don't want to have separate lists depending on the concrete type associated with the PciDevice trait. Also, we want to be able to cast the actual type into any trait or concrete type. The most efficient way to solve all these issues is to store every device as an Arc<dyn Any + Send + Sync>. This gives the ability to downcast into the appropriate concrete type, and then to cast back into any trait that we might need. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	0f99d3f7cc	vmm: Store VFIO device's name and its PCI b/d/f Add a new list storing the device names across the entire codebase. VFIO devices are added to the list whenever a new one is created. By default, each VFIO device is given a name "vfioX" where X is the first available integer. Along with this new list of names, another list is created, grouping PCI device's name with its associated b/d/f. This will be useful to keep track of the created devices so that we can implement unplug functionality. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-10 17:05:06 +00:00
Sebastien Boeuf	09829c44b2	vmm: Remove IO bus strong reference from Vm The Vm structure was used to store a strong reference to the IO bus. This is not needed anymore since the AddressManager is logically the one holding this strong reference. This has been made possible by the introduction of Weak references on the Bus structure itself. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 18:46:44 +01:00
Sebastien Boeuf	2dbb376175	vmm: Remove all Weak references from DeviceManager Now that the BusDevice devices are stored as Weak references by the IO and MMIO buses, there's no need to use Weak references from the DeviceManager anymore. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 18:46:44 +01:00
Sebastien Boeuf	49268bff3b	pci: Remove all Weak references from PciBus Now that the BusDevice devices are stored as Weak references by the IO and MMIO buses, there's no need to use Weak references from the PciBus anymore. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 18:46:44 +01:00
Sebastien Boeuf	7773812f58	vmm: Store the list of BusDevice devices from DeviceManager The point is to make sure the DeviceManager holds a strong reference of each BusDevice inserted on the IO and MMIO buses. This will allow these buses to hold Weak references onto the BusDevice devices. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 18:46:44 +01:00
Sebastien Boeuf	d0820cc026	vmm: Make add_vfio_device mutable The method add_vfio_device() from the DeviceManager needs to be mutable if we want later to be able to update some internal fields from the DeviceManager from this same function. This commit simply takes care of making the necessary changes to change this function as mutable. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 18:46:44 +01:00
Sebastien Boeuf	d47f733e51	vmm: Break the cyclic dependency between DeviceManager and IO bus By inserting the DeviceManager on the IO bus, we introduced some cyclic dependency: DeviceManager ---> AddressManager ---> Bus ---> BusDevice ^ \| \| \| +---------------------------------------------+ This cycle needs to be broken by inserting a Weak reference instead of an Arc (considered as a strong reference). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	c1af13efeb	vmm: Update VmConfig when adding new device Ensures the configuration is updated after a new device has been hotplugged. In the event of a reboot, this means the new VM will be started with the new device that had been previously hotplugged. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	a86f4369a7	vmm: Add VFIO PCI device hotplug support This commit finalizes the VFIO PCI hotplug support, based on all the previous commits preparing for it. One thing to notice, this does not support vIOMMU yet. This means we can hotplug VFIO PCI devices, but we cannot attach them to an existing or a new virtio-iommu device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	320fea0eaf	vmm: Factorize VFIO PCI device creation This factorization is very important as it will allow both the standard codepath and the VFIO PCI hotplug codepath to rely on the same function to perform the addition of a new VFIO PCI device. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	00716f90a0	vmm: Store virtio-iommu device from DeviceManager Helps with future refactoring of VFIO device creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	5902dfa403	vmm: Store VFIO KVM device from DeviceManager Helps with future refactoring of VFIO device creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	d9c1b4396e	vmm: Store MSI InterruptManager from DeviceManager Helps with future refactoring of VFIO device creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	02adc4061a	vmm: Store PciBus from DeviceManager Helps with future refactoring of VFIO device creation. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	0e58741a09	vmm: api: Introduce new "add-device" HTTP endpoint This commit introduces the new command "add-device" that will let a user hotplug a VFIO PCI device to an already running VM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	0f1396acef	vmm: Insert PCI device hotplug operation region on IO bus Through the BusDevice implementation from the DeviceManager, and by inserting the DeviceManager on the IO bus for a specific IO port range, the VMM now has the ability to handle PCI device hotplug. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00
Sebastien Boeuf	65774e8a78	vmm: Implement BusDevice for DeviceManager In anticipation of inserting the DeviceManager on the IO/MMIO buses, the DeviceManager must implement the BusDevice trait. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-03-04 12:06:02 +00:00

... 3 4 5 6 7 ...

610 Commits