cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-11-05 03:21:13 +00:00

Author	SHA1	Message	Date
Ziye Yang	9075809494	virtio-devices: Update some comments in epoll_helper.rs Make some comments more clear. Signed-off-by: Ziye Yang <ziye.yang@intel.com>	2021-11-23 14:03:05 +01:00
Wei Liu	32af6f9723	virtio-device: add a safety comment for a dup(2) call Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-11-17 23:12:11 +00:00
Wei Liu	31b3871eee	virtio-devices: add or adjust comments for `impl ByteValued` Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-11-17 14:40:51 +00:00
Sebastien Boeuf	58d25b3ccc	virtio-devices: net: Improve throughput with virtio features By merging receive buffers through the VIRTIO_NET_F_MRG_RXBUF feature, as well as enabling the use of indirect descriptors through VIRTIO_RING_F_INDIRECT_DESC feature, we achieve better throughput for the virtio-net device without hurting its latency. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-11-03 17:11:59 +00:00
Rob Bradford	cd9d1cf8fc	pci, virtio-devices, vmm: Allocate PCI 64-bit bars per segment Since each segment must have a non-overlapping memory range associated with it the device memory must be equally divided amongst all segments. A new allocator is used for each segment to ensure that BARs are allocated from the correct address ranges. This requires changes to PciDevice::allocate/free_bars to take that allocator and when reallocating BARs the correct allocator must be identified from the ranges. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-11-02 16:55:42 +00:00
Rob Bradford	88378d17a2	vmm: Take PCI segment ID into BAR size allocation Move the decision on whether to use a 64-bit bar up to the DeviceManager so that it can use both the device type (e.g. block) and the PCI segment ID to decide what size bar should be used. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-11-02 16:55:42 +00:00
Sebastien Boeuf	0249e8641a	Move Cloud Hypervisor to virtio-queue crate Relying on the vm-virtio/virtio-queue crate from rust-vmm which has been copied inside the Cloud Hypervisor tree, the entire codebase is moved to the new definition of a Queue and other related structures. The reason for this move is to follow the upstream until we get some agreement for the patches that we need on top of that to make it properly work with Cloud Hypervisor. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-10-22 11:38:55 +02:00
Rob Bradford	84f0f332b3	virtio-devices: Use #[allow(dead_code)] for unread structs These structs are not read on the VMM side but are used in communication with the guest. As identified by the new beta clippy. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-10-19 19:42:36 +01:00
Rob Bradford	48d4ccbfeb	virtio-devices: Call closure directly rather than indirect As identified by the new beta clippy. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-10-19 19:42:36 +01:00
Rob Bradford	8a56720b0f	virtio-devices: Use assert!() rather than if+panic As identified by the new beta clippy. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-10-19 19:42:36 +01:00
Sebastien Boeuf	5ac013df8b	virtio-devices: vhost-user: Set reply_ack conditionally Setting the reply_ack should depend on the set of acknowledged features containing the REPLY_ACK flag. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 19:48:17 +01:00
Sebastien Boeuf	0fb24ea3ae	virtio-devices: mem: Discard unplugged ranges only on activate() In order to support correctly the snapshot/restore and migration use cases, we must be careful with the ranges that we discard by punching holes. On restore, there might be some ranges already plugged in, meaning they should not be discarded. That's why we loop over the list of blocks to discard only the ranges that are marked as unplugged. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 10:15:22 -07:00
Sebastien Boeuf	e390775bcb	vmm, virtio-devices: Move BlocksState creation to the MemoryManager By creating the BlocksState object in the MemoryManager, we can directly provide it to the virtio-mem device when being created. This will allow the MemoryManager through each VirtioMemZone to have a handle onto the blocks that are plugged at any point in time. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 10:15:22 -07:00
Sebastien Boeuf	4450c44fbc	virtio-devices: mem: Create a MemoryRangeTable from BlocksState This is going to be useful to let virtio-mem report the list of ranges that are currently plugged, so that both snapshot/restore and migration will copy only what is needed. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 10:15:22 -07:00
Sebastien Boeuf	a1caa6549a	vmm: Add page size as a parameter for MemoryRangeTable::from_bitmap() This will be helpful to support the creation of a MemoryRangeTable from virtio-mem, as it uses 2M pages. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 10:15:22 -07:00
Sebastien Boeuf	af3a59aa33	virtio-devices: mem: Add constructor for BlocksState Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 10:15:22 -07:00
Sebastien Boeuf	d7115ec656	virtio-devices: mem: Add snapshot/restore support Adding the snapshot/restore support along with migration as well, allowing a VM with virtio-mem devices attached to be properly migrated. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-28 10:15:22 -07:00
Rob Bradford	43365ade2e	vmm, pci: Implement virtio-mem support for vfio-user Implement the infrastructure that lets a virtio-mem device map the guest memory into the device. This is necessary since with virtio-mem zones memory can be added or removed and the vfio-user device must be informed. Fixes: #3025 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-21 15:42:49 +01:00
Rob Bradford	fd4f32fa69	virtio-mem: Support multiple mappings For vfio-user the mapping handler is per device and needs to be removed when the device is unplugged. For VFIO the mapping handler is for the default VFIO container (used when no vIOMMU is used - using a vIOMMU does not require mappings with virtio-mem). To represent these two use cases use an enum for the handlers that are stored. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-21 15:42:49 +01:00
Sebastien Boeuf	6fb88c3c5a	virtio-devices: balloon: Add snapshot/restore support Adding the snapshot/restore support along with migration as well, allowing a VM with a virtio-balloon device attached to be properly migrated. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-21 14:47:17 +02:00
Michael Zhao	b3fa56544c	virtio-devices: iommu: Support AArch64 The MSI IOVA address on X86 and AArch64 is different. This commit refactored the code to receive the MSI IOVA address and size from device_manager, which provides the actual IOVA space data for both architectures. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2021-09-17 12:19:46 +02:00
Sebastien Boeuf	a6040d7a30	vmm: Create a single VFIO container For most use cases, there is no need to create multiple VFIO containers as it causes unwanted behaviors. Especially when passing multiple devices from the same IOMMU group, we need to use the same container so that it can properly list the groups that have been already opened. The correct logic was already there in vfio-ioctls, but it was incorrectly used from our VMM implementation. For the special case where we put a VFIO device behind a vIOMMU, we must create one container per device, as we need to control the DMA mappings per device, which is performed at the container level. Because we must keep one container per device, the vIOMMU use case prevents multiple devices attached to the same IOMMU group to be passed through the VM. But this is a limitation that we are fine with, especially since the vIOMMU doesn't let us group multiple devices in the same group from a guest perspective. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-15 09:08:13 -07:00
Alyssa Ross	330b5ea3be	vmm: notify virtio-console of pty resizes When a pty is resized (using the TIOCSWINSZ ioctl -- see ioctl_tty(2)), the kernel will send a SIGWINCH signal to the pty's foreground process group to notify it of the resize. This is the only way to be notified by the kernel of a pty resize. We can't just make the cloud-hypervisor process's process group the foreground process group though, because a process can only set the foreground process group of its controlling terminal, and cloud-hypervisor's controlling terminal will often be the terminal the user is running it in. To work around this, we fork a subprocess in a new process group, and set its process group to be the foreground process group of the pty. The subprocess additionally must be running in a new session so that it can have a different controlling terminal. This subprocess writes a byte to a pipe every time the pty is resized, and the virtio-console device can listen for this in its epoll loop. Alternatives I considered were to have the subprocess just send SIGWINCH to its parent, and to use an eventfd instead of a pipe. I decided against the signal approach because re-purposing a signal that has a very specific meaning (even if this use was only slightly different to its normal meaning) felt unclean, and because it would have required using pidfds to avoid race conditions if cloud-hypervisor had terminated, which added complexity. I decided against using an eventfd because using a pipe instead allows the child to be notified (via poll(2)) when nothing is reading from the pipe any more, meaning it can be reliably notified of parent death and terminate itself immediately. I used clone3(2) instead of fork(2) because without CLONE_CLEAR_SIGHAND the subprocess would inherit signal-hook's signal handlers, and there's no other straightforward way to restore all signal handlers to their defaults in the child process. The only way to do it would be to iterate through all possible signals, or maintain a global list of monitored signals ourselves (vmm:vm::HANDLED_SIGNALS is insufficient because it doesn't take into account e.g. the SIGSYS signal handler that catches seccomp violations). Signed-off-by: Alyssa Ross <hi@alyssa.is>	2021-09-14 15:43:25 +01:00
Alyssa Ross	98bfd1e988	virtio-devices: get tty size from the right tty Previously, we were always getting the size from stdin, even when the console was hooked up to a pty. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2021-09-14 15:43:25 +01:00
Alyssa Ross	28382a1491	virtio-devices: determine tty size in console This prepares us to be able to handle console resizes in the console device's epoll loop, which we'll have to do if the output is a pty, since we won't get SIGWINCH from it. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2021-09-14 15:43:25 +01:00
Alyssa Ross	8abe8c679b	seccomp: allow mmap everywhere brk is allowed Musl often uses mmap to allocate memory where Glibc would use brk. This has caused seccomp violations for me on the API and signal handling threads. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2021-09-10 12:01:31 -07:00
Rob Bradford	33a55bac0f	virtio-devices: seccomp: Split out common seccomp rules As well as reducing the amount of code this also improves the binary size slightly: cargo bloat --release -n 2000 --bin cloud-hypervisor | grep virtio_devices::seccomp_filters::get_seccomp_rules Before: 0.1% 0.2% 7.8KiB virtio_devices virtio_devices::seccomp_filters::get_seccomp_rules After: 0.0% 0.1% 3.0KiB virtio_devices virtio_devices::seccomp_filters::get_seccomp_rules Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-10 10:11:12 -07:00
Rob Bradford	687d646c60	virtio-devices, vmm: Shutdown VMM on virtio thread panic Shut down the VMM if a virtio (or VMM side of vhost-user) thread panics. See: #3031 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-08 09:40:36 +01:00
Rob Bradford	54e523c302	virtio-devices: Use a common method for spawning virtio threads Introduce a common solution for spawning the virtio threads which will make it easier to add the panic handling. During this effort I discovered that there were no seccomp filters registered for the vhost-user-net thread nor the vhost-user-block thread. This change also incorporates basic seccomp filters for those as part of the refactoring. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-08 09:40:36 +01:00
Rob Bradford	e475b12cf7	virtio-devices, vmm: Upgrade restore related messages to info!() These happen only sporadically so can be included at the info!() level. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-03 09:30:55 -07:00
Rob Bradford	c2144b5690	vmm, virtio-console: Move input reading into virtio-console thread Move the processing of the input from stdin, PTY or file from the VMM thread to the existing virtio-console thread. The handling of the resize of a virtio-console has not changed but the name of the struct used to support that has been renamed to reflect its usage. Fixes: #3060 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-09-02 21:17:33 +01:00
Fazla Mehrab	5db4dede28	block_util, vhdx: vhdx crate integration with the cloud hypervisor vhdx_sync.rs in block_util implements traits to represent the vhdx crate as a supported block device in the cloud hypervisor. The vhdx is added to the block device list in device_manager.rs at the vmm crate so that it can automatically detect a vhdx disk and invoke the corresponding crate. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com> Signed-off-by: Fazla Mehrab <akm.fazla.mehrab@intel.com>	2021-08-19 11:43:19 +02:00
Bo Chen	9aba1fdee6	virtio-devices, vmm: Use syscall definitions from the libc crate Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-18 10:42:19 +02:00
Bo Chen	864a5e4fe0	virtio-devices, vmm: Simplify 'get_seccomp_rules' Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-18 10:42:19 +02:00
Bo Chen	7d38a1848b	virtio-devices, vmm: Fix the '--seccomp false' option We are relying on applying empty 'seccomp' filters to support the '--seccomp false' option, which will be treated as an error with the updated 'seccompiler' crate. This patch fixes this issue by explicitly checking whether the 'seccomp' filter is empty before applying the filter. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-18 10:42:19 +02:00
Bo Chen	08ac3405f5	virtio-devices, vmm: Move to the seccompiler crate Fixes: #2929 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-08-18 10:42:19 +02:00
Sebastien Boeuf	6d34ed03f7	virtio-devices: vhost_user: Refactor through VhostUserCommon Introducing a new structure VhostUserCommon allowing to factorize a lot of the code shared between the vhost-user devices (block, fs and net). Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-11 17:01:12 -07:00
Sebastien Boeuf	4735cb8563	vmm, virtio-devices: Restore vhost-user devices in a dedicated way We cannot let vhost-user devices connect to the backend when the Block, Fs or Net object is being created during a restore/migration. The reason is we can't have two VMs (source and destination) connected to the same backend at the same time. That's why we must delay the connection with the vhost-user backend until the restoration is performed. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	a636411522	virtio-devices: vhost_user: Factorize some part of the initialization Introducing a new function to factorize a small part of the initialization that is shared between a full reinitialization and a restoration. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	c85aa6dfae	virtio-devices: vhost_user: Kill threads upon migration completion In order to prevent the vhost-user devices from reconnecting to the backend after the migration has been successfully performed, we make sure to kill the thread in charge of handling the reconnection mechanism. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	152a3b98c9	virtio-devices: vhost_user: Shutdown communication after migration During a migration, the vhost-user device talks to the backend to retrieve the dirty pages. Once done with this, a snapshot will be taken, meaning there's no need to communicate with the backend anymore. Closing the communication is needed to let the destination VM connect to the same backend. That's why we shut down the communication with the backend in case a migration has been started and we're asked for a snapshot. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	a738808604	virtio-devices: vhost_user: Make vhost-user handle optional This anticipates the need for creating a new Blk, Fs or Net object without having performed the connection with the vhost-user backend yet. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	2c54c30435	virtio-devices: vhost_user: common: Fix memory access It was incorrect to call Vec::from_raw_parts() on the address pointing to the shared memory log region since Vec is a Rust specific structure that doesn't directly translate into bytes. That's why we use the same function from std::slice in order to create a proper slice out of the memory region, which is then copied into a Vec. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	adae986233	virtio-devices: vhost_user: Add LOG_SHMFD protocol feature Now that the common vhost-user code can handle logging dirty pages through shared memory, we need to advertise it to the vhost-user backends with the protocol feature VHOST_USER_PROTOCOL_F_LOG_SHMFD. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	1c3f8236e7	virtio-devices, vm-migration: Update MigratableError types Make sure the error types match the function from the Migratable trait. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Sebastien Boeuf	204be8611c	virtio-devices: vhost_user: net: Fix wrong error message Due to a previous copy and paste error. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-10 12:36:58 -07:00
Markus Theil	5b0d4bb398	virtio-devices: seccomp: allow unix socket connect in vsock thread Allow vsocks to connect to Unix sockets on the host running cloud-hypervisor with enabled seccomp. Reported-by: Philippe Schaaf <philippe.schaaf@secunet.com> Tested-by: Franz Girlich <franz.girlich@tu-ilmenau.de> Signed-off-by: Markus Theil <markus.theil@tu-ilmenau.de>	2021-08-06 08:44:47 -07:00
Sebastien Boeuf	9d88e0b417	virtio-devices: vhost_user: Fully implement Migratable trait All vhost-user devices are now equipped to support migration. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-05 06:07:00 -07:00
Sebastien Boeuf	b3f5630c27	virtio-devices: vhost_user: Add common migration logic Adding the common vhost-user code for starting logging dirty pages when the migration is started, and its counterpart for stopping, as well as the code in charge of retrieving the bitmap of the dirty pages that have been logged. All these functions are meant to be leveraged from vhost-user devices. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-05 06:07:00 -07:00
Sebastien Boeuf	61994cdb14	virtio-devices: vhost_user: Store ability to migrate Adding a simple field `migration_support` to VhostUserHandle in order to store the information about the device supporting migration or not. The value of this flag depends on the feature set negotiated with the backend. It's considered as supporting migration if VHOST_F_LOG_ALL is present in the virtio features and if VHOST_USER_PROTOCOL_F_LOG_SHMFD is present in the vhost-user protocol features. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-05 06:07:00 -07:00
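
Note on commit 2c54c30435 (vhost_user: common: Fix memory access): the message describes replacing Vec::from_raw_parts() with the equivalent function from std::slice, then copying the result into a Vec. The following is only a minimal sketch of that pattern, not the code from the commit; the function name copy_log_region and its parameters are hypothetical. Vec::from_raw_parts() takes ownership of the pointer and expects it to come from Rust's global allocator, which is not the case for a mapped shared memory region, whereas std::slice::from_raw_parts() only borrows the bytes.

```rust
use std::slice;

/// Copies `len` bytes starting at `addr` into a newly allocated Vec.
/// (Hypothetical helper, for illustration only.)
///
/// # Safety
/// `addr` must be non-null and valid for reads of `len` bytes for the
/// duration of the call, e.g. a region the caller has mapped itself.
pub unsafe fn copy_log_region(addr: *const u8, len: usize) -> Vec<u8> {
    // Borrow the region as a byte slice. Unlike Vec::from_raw_parts(),
    // this does not take ownership of the underlying memory.
    let region: &[u8] = unsafe { slice::from_raw_parts(addr, len) };
    // to_vec() allocates a fresh buffer and copies the bytes into it,
    // so the returned Vec is independent of the original mapping.
    region.to_vec()
}
```

Because the returned Vec owns its own copy, dropping it never frees or unmaps the original region.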


318 Commits