cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2025-01-05 04:15:20 +00:00

Author	SHA1	Message	Date
Ruoqing He	ab7b294688	misc: Replace map_or on false with is_some_and Replace `map_or()` on false condition with `is_some_and` to provide better readability, as suggestted by v1.84.0-beta.1 `cargo clippy`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2024-11-29 12:44:33 +00:00
Rob Bradford	df1d6eaaee	virtio-devices: Enable VIRTIO_RING_F_INDIRECT_DESC This improves sequential write performance using fio (2888MiB/s -> 3293MiB/s) VM config: cloud-hypervisor --disk path=~/workloads/jammy.raw,direct=on path=~/workloads/big-disk.img,direct=on --cpus boot=1 --memory size=2G,shared=on --serial tty --console off --seccomp log --kernel ~/workloads/hypervisor-fw Host: fio --filename=big-disk.img --direct=1 --rw=write --bs=256k --ioengine=libaio --iodepth=64 --runtime=120 --numjobs=1 --time_based --group_reporting --name=throughput-test-job --eta-newline=1 VM: fio --filename=/dev/vdb --direct=1 --rw=write --bs=256k --ioengine=libaio --iodepth=64 --runtime=120 --numjobs=1 --time_based --group_reporting --name=throughput-test-job --eta-newline=1 Baseline (file on filesystem on host used as backing store for block device): throughput-test-job: (groupid=0, jobs=1): err= 0: pid=10169: Tue Nov 5 09:31:55 2024 write: IOPS=13.5k, BW=3385MiB/s (3549MB/s)(397GiB/120008msec); 0 zone resets slat (usec): min=4, max=10222, avg=20.25, stdev=29.01 clat (usec): min=984, max=45599, avg=4706.01, stdev=2278.11 lat (usec): min=1002, max=45610, avg=4726.27, stdev=2278.77 clat percentiles (usec): \| 1.00th=[ 3195], 5.00th=[ 3228], 10.00th=[ 3261], 20.00th=[ 3261], \| 30.00th=[ 3261], 40.00th=[ 3261], 50.00th=[ 3294], 60.00th=[ 3916], \| 70.00th=[ 5014], 80.00th=[ 7308], 90.00th=[ 7635], 95.00th=[ 7898], \| 99.00th=[ 8586], 99.50th=[ 8979], 99.90th=[36439], 99.95th=[36963], \| 99.99th=[43779] bw ( MiB/s): min= 1934, max= 4821, per=100.00%, avg=3391.67, stdev=1266.42, samples=239 iops : min= 7738, max=19286, avg=13566.67, stdev=5065.65, samples=239 lat (usec) : 1000=0.01% lat (msec) : 2=0.03%, 4=61.10%, 10=38.62%, 20=0.11%, 50=0.15% cpu : usr=17.13%, sys=14.38%, ctx=1352501, majf=0, minf=11 IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=0.1%, 16=0.1%, 32=0.1%, >=64=100.0% submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.1%, >=64=0.0% issued rwts: total=0,1624829,0,0 short=0,0,0,0 dropped=0,0,0,0 latency : target=0, window=0, percentile=100.00%, depth=64 Run status group 0 (all jobs): WRITE: bw=3385MiB/s (3549MB/s), 3385MiB/s-3385MiB/s (3549MB/s-3549MB/s), io=397GiB (426GB), run=120008-120008msec Disk stats (read/write): dm-2: ios=129/1624787, sectors=1872/831364040, merge=0/0, ticks=185/6960387, in_queue=6960572, util=100.00%, aggrios=130/1626025, aggsectors=1880/831915888, aggrmerge=0/0, aggrticks=194/6967818, aggrin_queue=6968012, aggrutil=99.97% dm-0: ios=130/1626025, sectors=1880/831915888, merge=0/0, ticks=194/6967818, in_queue=6968012, util=99.97%, aggrios=130/1606095, aggsectors=1880/831915888, aggrmerge=0/19930, aggrticks=204/6634488, aggrin_queue=6635288, aggrutil=58.59% nvme0n1: ios=130/1606095, sectors=1880/831915888, merge=0/19930, ticks=204/6634488, in_queue=6635288, util=58.59% On block device in VM: throughput-test-job: (groupid=0, jobs=1): err= 0: pid=667: Tue Nov 5 09:53:19 2024 write: IOPS=13.2k, BW=3293MiB/s (3453MB/s)(386GiB/120008msec); 0 zone resets slat (usec): min=4, max=3518, avg=27.77, stdev=35.32 clat (usec): min=723, max=44252, avg=4829.82, stdev=2222.41 lat (usec): min=735, max=44270, avg=4857.85, stdev=2223.45 clat percentiles (usec): \| 1.00th=[ 3097], 5.00th=[ 3195], 10.00th=[ 3195], 20.00th=[ 3228], \| 30.00th=[ 3261], 40.00th=[ 3294], 50.00th=[ 3621], 60.00th=[ 4555], \| 70.00th=[ 5997], 80.00th=[ 7242], 90.00th=[ 7570], 95.00th=[ 7898], \| 99.00th=[ 8586], 99.50th=[ 8848], 99.90th=[36439], 99.95th=[36963], \| 99.99th=[40633] bw ( MiB/s): min= 1914, max= 4857, per=100.00%, avg=3299.46, stdev=1180.81, samples=239 iops : min= 7658, max=19430, avg=13197.77, stdev=4723.22, samples=239 lat (usec) : 750=0.01%, 1000=0.01% lat (msec) : 2=0.01%, 4=52.79%, 10=46.95%, 20=0.10%, 50=0.14% cpu : usr=25.95%, sys=16.71%, ctx=1111821, majf=0, minf=10 IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=0.1%, 16=0.1%, 32=0.1%, >=64=100.0% submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.1%, >=64=0.0% issued rwts: total=0,1580693,0,0 short=0,0,0,0 dropped=0,0,0,0 latency : target=0, window=0, percentile=100.00%, depth=64 Run status group 0 (all jobs): WRITE: bw=3293MiB/s (3453MB/s), 3293MiB/s-3293MiB/s (3453MB/s-3453MB/s), io=386GiB (414GB), run=120008-120008msec Disk stats (read/write): vdb: ios=60/1953213, merge=0/0, ticks=14/8229134, in_queue=8229149, util=100.00% Prior to change: throughput-test-job: (groupid=0, jobs=1): err= 0: pid=667: Tue Nov 5 09:37:45 2024 write: IOPS=11.6k, BW=2888MiB/s (3028MB/s)(338GiB/120008msec); 0 zone resets slat (usec): min=3, max=3200, avg=18.48, stdev=24.54 clat (usec): min=1237, max=46575, avg=5521.41, stdev=2641.99 lat (usec): min=1249, max=46591, avg=5540.06, stdev=2643.54 clat percentiles (usec): \| 1.00th=[ 2999], 5.00th=[ 3163], 10.00th=[ 3195], 20.00th=[ 3261], \| 30.00th=[ 3294], 40.00th=[ 3359], 50.00th=[ 6063], 60.00th=[ 7111], \| 70.00th=[ 7373], 80.00th=[ 7570], 90.00th=[ 7832], 95.00th=[ 8094], \| 99.00th=[ 8717], 99.50th=[ 9241], 99.90th=[36963], 99.95th=[37487], \| 99.99th=[41157] bw ( MiB/s): min= 1936, max= 4826, per=100.00%, avg=2892.43, stdev=1202.99, samples=239 iops : min= 7746, max=19306, avg=11569.68, stdev=4811.98, samples=239 lat (msec) : 2=0.01%, 4=46.26%, 10=53.38%, 20=0.09%, 50=0.26% cpu : usr=14.20%, sys=8.59%, ctx=1246257, majf=0, minf=12 IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=0.1%, 16=0.1%, 32=0.1%, >=64=100.0% submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.1%, >=64=0.0% issued rwts: total=0,1386102,0,0 short=0,0,0,0 dropped=0,0,0,0 latency : target=0, window=0, percentile=100.00%, depth=64 Run status group 0 (all jobs): WRITE: bw=2888MiB/s (3028MB/s), 2888MiB/s-2888MiB/s (3028MB/s-3028MB/s), io=338GiB (363GB), run=120008-120008msec Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-11-05 15:44:41 +00:00
Ruoqing He	0aab960bf1	misc: Elide needless lifetimes As clippy of rust-toolchain version 1.83.0-beta.1 suggests, elide needless lifetimes to `'_`. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2024-10-18 17:46:39 +00:00
Ruoqing He	297236a7c0	misc: Eliminate use of `assert!((...).is_ok())` Asserting on .is_ok()/.is_err() leads to hard to debug failures (as if the test fails, it will only say "assertion failed: false". We replace these with `.unwrap()`, which also prints the exact error variant that was unexpectedly encountered (we can to this these days thanks to efforts to implement Display and Debug for our error types). If the assert!((...).is_ok()) was followed by an .unwrap() anyway, we just drop the assert. Inspired by and quoted from @roypat. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2024-10-03 12:03:49 +00:00
Ruoqing He	61e57e1cb1	misc: Further improve imports styling By introducing `imports_granularity="Module"` format strategy, effectively groups imports from the same module into one line or block, improving maintainability and readability. Signed-off-by: Ruoqing He <heruoqing@iscas.ac.cn>	2024-09-29 16:13:48 +00:00
Rob Bradford	88a9f79944	misc: Adapt consistent import style formatting Historically the Cloud Hypervisor coding style has been to ensure that all imports are ordered and placed in a single group. Unfortunately cargo fmt has no support for ensuring that all imports are in a single group so if whitespace lines were added as part of the import statements then they would only be odered correctly in the group. By adopting "group_imports="StdExternalCrate" we can enforce a style where imports are placed in at most three groups for std, external crates and the crate itself. Choosing a style enforceable by the tooling reduces the reviewer burden. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-09-29 13:08:12 +01:00
Alyssa Ross	287887c99c	vmm: fix console IO safety Rebooting a VM fails with the following error when debug assertions are enabled: fatal runtime error: IO Safety violation: owned file descriptor already closed This happens because FromRawFd::from_raw_fd is used on RawFds stored in ConsoleInfo every time a VM begins to boot, so the second time (after a reboot, or if the first attempt to boot via the API failed), the fd will be closed. Until this assertion is hit, the code is operating on either closed file descriptors, or new file descriptors for something completely different. If debug assertions are disabled, it will just continue doing this with unpredictable results. To fix this, and prevent the problem reocurring, ownership of the console file descriptors needs to be properly tracked, using Rust's type system, so this commit refactors the console code to do that. The file descriptors are now passed around with reference counts, so they won't be closed prematurely. The obvious way to do this would be to just have each member of ConsoleInfo be an Arc<File>, but we need to accomodate that serial console file descriptors can also be sockets. We can't just store an OwnedFd and convert it when it's used, because we only get a reference from the Arc, so we need to store the descriptors as their concrete types in an enum. Since this basically duplicates the ConsoleOutputMode enum from the config, the ConsoleOutputMode enum is now not used past constructing the ConsoleInfo. So that ownership can be represented consistently, the debug console's tty mode now uses its own stdout descriptor. I'm still using .try_clone().unwrap() (i.e. dup()) to clone file descriptors for Endpoint::FilePair and Endpoint::TtyPair, because I assume there's a reason for them not just to hold a single file descriptor. I've also retained the existing behaviour of having serial manager ignore the tty file descriptor passed to it (which is stdout), and instead using stdin. It looks a lot weirder now, because it has to explicitly indicate it's ignoring the fd with an underscore binding. Fixes: `52eebaf6` ("vmm: refactor DeviceManager to use console_info") Signed-off-by: Alyssa Ross <hi@alyssa.is>	2024-09-25 22:34:43 +00:00
Rob Bradford	e810be62cd	virtio-devices: vhost_user: Remove unused backend support from virtio-fs Complete the removal of the DAX support by removing the use of non-standard messages. These messages have since been removed from the vhost_user crate (rust-vmm/vhost#246) and so need to be removed from our implementation since that would otherwise block updating to a newer version of the crate. The ability to enable DAX support in Cloud Hypervisor has been disabled some time ago but this code was residual with no way to enable it. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-09-25 19:41:35 +01:00
Yuhong Zhong	2ad8fac624	vmm: memory_manager: Fix bound checks for memory hotplug Bound checks for virtio-mem and ACPI memory hotplug are off by one and two, respectively. This prevents users to fully use the reserved memory hotplug size. For ACPI, if we specific `--memory size=2G,hotplug_size=4G` and run `ch-remote resize --memory 6G`, cloud-hypervisor will report the following error because of the incorrect bound check: `<vmm> ERROR:vmm/src/lib.rs:1631 -- Error when resizing VM: MemoryManager(InsufficientHotplugRam)` Similarly, for virtio-mem, cloud-hypervisor will fail the incorrect bound check and abort the resize. The VM will see the following error in dmesg: `virtio_mem virtio3: unknown error, marking device broken: -22` This patch has fixed both bound checks and ensure that users can hot add memory up to the reserved hotplug size. Signed-off-by: Yuhong Zhong <yz@cs.columbia.edu>	2024-09-19 18:02:20 +00:00
Purna Pavan Chandra	d10f6ca714	virtio-devices: Fix seccomp rules for SevSnp With `cd0cdac` ("virtio-devices: Fix seccomp rules for SevSnp guest"), the sys_ioctl rule that was applied in virtio_thread_common, would override previously specified sys_ioctl rules for individual thread type. This causes the SevSnp guest to crash with seccomp violation. Fixes: `cd0cdac` ("virtio-devices: Fix seccomp rules for SevSnp guest") Signed-off-by: Purna Pavan Chandra <paekkaladevi@microsoft.com>	2024-09-11 08:41:17 +00:00
Bo Chen	60c8a72e29	misc: Fix various warnings from clippy 0.1.82 An example warning output is: error: first doc comment paragraph is too long --> virtio-devices/src/lib.rs:158:1 \| 158 \| / /// Convert an absolute address into an address space (GuestMemory) 159 \| \| /// to a host pointer and verify that the provided size define a valid 160 \| \| /// range within a single memory region. 161 \| \| /// Return None if it is out of bounds or if addr+size overlaps a single region. \| \|_ \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#too_long_first_doc_paragraph = note: `-D clippy::too-long-first-doc-paragraph` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(clippy::too_long_first_doc_paragraph)]` Signed-off-by: Bo Chen <chen.bo@intel.com>	2024-09-07 09:40:20 +00:00
Jinank Jain	bbd72d6453	vm-virtio: Gain access to virtqueue before accessing them In case of SEV-SNP guests on MSHV, any host component including VMM trying to access any guest memory region, needs to call gain_access API to acquire access to that particular memory region. This applies to all the virtqueue buffers as well which the VMM is using to use to perform DMA into the guest and in order to facilitate device emulation. While creating various virtqueues we are already using access platform hook to translate the addresses but currently we are missing the size arguments which would be required by the SevSnpAccessPlatformProxy to gain access to the memory of these queues. Signed-off-by: Jinank Jain <jinankjain@microsoft.com>	2024-09-04 17:33:37 +00:00
Jinank Jain	8e5c7a37b9	vm-virtio: Add helper function to get size of virtqueue segments This would be used in case of SEV-SNP guest because we need to accquire access to these virtqueue segments before accessing them. Signed-off-by: Jinank Jain <jinankjain@microsoft.com>	2024-09-04 17:33:37 +00:00
wuxinyue	6956306604	virtio-devices: block: Reduce notification latency when rate limited When the rate limit was reached it was possible for the notification to the guest to be lost since the logic to handle the notification was tightly coupled with processing the queue. The notification would eventually be triggered when the rate limit pool was refilled but this could add significant latency. Address this by refactoring the code to separate processing queue and signalling - the processing of the queue is suspended when the rate limit is reached but the signalling will still be attempted if needed (i.e. VIRTIO_F_EVENT_IDX is still considered.) Signed-off-by: wuxinyue <wuxinyue.wxy@antgroup.com> Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-08-31 20:54:13 +00:00
Jinank Jain	cd0cdac0ed	virtio-devices: Fix seccomp rules for SevSnp guest With commit `1e967697c` ("vmm: pass AccessPlatform implementation for SEV-SNP guest"), we started performing one additional ioctl to gain access to the guest memory before accessing those regions inside virtio-device emulation code path. This additional ioctl is not part of the current seccomp filter, which is causing the SevSnp guest to crash in this scenario with seccomp violation. Fixes: `1e967697c` ("vmm: pass AccessPlatform implementation for SEV-SNP guest") Signed-off-by: Jinank Jain <jinankjain@microsoft.com>	2024-08-30 16:55:53 +00:00
Wenyu Huang	4299815a67	vmm: allow to call fcntl in debug This fixes a issue of running vm compiled in debug with Rust 1.80.0 or later, where this check was introduced. Signed-off-by: Wenyu Huang <huangwenyuu@outlook.com>	2024-08-27 18:13:21 +00:00
Alyssa Ross	cbb588c380	virtio-devices: allow vsock to call fcntl in debug This fixes the vsock::device::tests::test_virtio_device test with Rust 1.80.0 or later, where this check was introduced. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2024-08-12 20:13:13 +00:00
Wei Liu	f5b2eb5c76	virtio-devices: vsock: drop a useless line Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-07-31 16:36:10 +00:00
Wei Liu	82ac114b8a	virtio-devices: vsock: handle short read in muxer Use read_exact to make sure we really get the minimum number of bytes. Fixes: #6621 Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-07-31 16:36:10 +00:00
Wei Liu	b7512263be	virtio-devices: iommu: use inspect_err instead of map_err Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-07-23 21:07:17 +00:00
wuxinyue	a2438700e4	virtio-devices: support event idx for virtio-blk Support event idx feature for virtio-blk device. This feature could improve disk IO performance by suppressing notifications from guest to host and interrupts from host to guest, which has been already supported in virtio-net and vhost-user devices. To achieve this, virtqueue's event-idx-related API is leveraged for avail_event field update and needs_notification check. Fixes: #6580 Signed-off-by: wuxinyue <wuxinyue.wxy@antgroup.com>	2024-07-23 14:16:34 +00:00
Changyuan Lyu	bc6acb842f	block: fix `status` value size As per VirtIO spec 1.2 section 5.2.6, the `status` field is a byte, not u32. cloud-hypervisor writes an `u32` to guest memory, which accidentally zeros out the following 3 bytes, and may corrupt guest OS internal state. Signed-off-by: Changyuan Lyu <changyuanl@google.com>	2024-07-14 19:23:06 +00:00
Alyssa Ross	e7c7a304e8	virtio-devices: fix UB getting tty size TIOCGWINSZ modifies its argument, so it needs to mutably borrow it. Unfortunately, ioctl()'s signature is not able to enforce this, and the write happens in the kernel, so I don't think anything like miri, valgrind, UBSan, etc. would have been able to catch this. The UB passing an immutable reference caused resulted, for me, in get_win_size() returning (0, 0) since LLVM commit 9a09c737a052 ("[BasicAA] Make isNotCapturedBeforeOrAt() check for calls more precise (#69931)"). I've had a look through the other ioctl() calls in Cloud Hypervisor, and I don't think any others have the same problem. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2024-07-03 21:26:04 +00:00
Bo Chen	cdd3ff5e5a	virtio-devices: vdpa: Don't restore on paused state Since vdpa device does not support pause/resume [1], it does not make sense to restore on paused state. [1] `099cdd2af8` Signed-off-by: Bo Chen <chen.bo@intel.com>	2024-06-15 07:32:58 +00:00
Bo Chen	6cb76abbf1	virtio-devices: vdpa: Don't error out on resume if not paused Signed-off-by: Bo Chen <chen.bo@intel.com>	2024-06-15 07:32:58 +00:00
Wei Liu	b3a73d6634	virtio-devices: fix documentation formatting Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-06-12 16:59:20 +00:00
Josh Soref	42e9632c53	misc: Fix spelling issues Misspellings were identified by: https://github.com/marketplace/actions/check-spelling * Initial corrections based on forbidden patterns from the action * Additional corrections by Google Chrome auto-suggest * Some manual corrections * Adding markdown bullets to readme credits section Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2024-06-08 16:31:30 +00:00
Wei Liu	862a056105	virtio-devices: vsock: fix two clippy warnings The mem_size field is not needed in TestContext. Drop it. Make sure guest_evvq is read once. Clippy cannot figured out that it was used. While at it, add an extra assert for the spurious rxvq event test, too. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-04-30 07:32:08 +00:00
Rob Bradford	10ab87d6a3	misc: Migrate away from versionize Replace with serde instead. Fixes: #6370 Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-04-22 17:10:55 +00:00
Ruslan Mstoi	5e9886bba4	build: add REUSE Compliance Check In accordance with reuse requirements: - Place each license file in the LICENSES/ directory - Add missing SPDX-License-Identifier to files. - Add .reuse/dep5 to bulk-license files Fixes: #5887 Signed-off-by: Ruslan Mstoi <ruslan.mstoi@intel.com>	2024-04-19 17:35:45 +00:00
Wei Liu	101cfb9650	virtio-devices: fs: cap the tag copy length The caller shouldn't pass in an &str that's too long. This is a precaution if something goes wrong in the caller. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-04-04 20:42:36 +00:00
Wei Liu	11c593e3b9	virtio-devices: fs: avoid unnecessary string allocation Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-04-04 20:42:36 +00:00
Wei Liu	f3b0f59646	vmm: validate virtio-fs tag length Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-04-04 20:42:36 +00:00
Andrew Carp	3fa02b34ca	virtio-devices: Attach and detach endpoints from domain properly Properly detach a device from a domain if that device is already attached to another domain on an attach request (following section 5.13.6.3.2 of the virtio-iommu spec). Resolves nested virtualization reboot. Signed-off-by: Andrew Carp <acarp@crusoeenergy.com>	2024-04-01 09:19:04 +00:00
Andrew Carp	5668f02eb6	virtio-devices: Map previously attached endpoints Ensures that any endpoints already attached to the domain are properly mapped to a new endpoint on said endpoint's attach request. This is done by search for all previous mappings in the domain and then issuing map requests for the newly attached endpoint. Signed-off-by: Andrew Carp <acarp@crusoeenergy.com>	2024-04-01 09:19:04 +00:00
Rob Bradford	fd81a23fcc	virtio-devices: vsock: csm: Use thiserror to provide error messages This resolves a nightly compiler check for unused enum inner value. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-03-25 04:32:28 +00:00
Alexandru Matei	c3f1c3ee3d	virtio-devices: save pci configuration capability state in snapshot When restoring a VM, the VirtioPciCfgCapInfo struct is not properly initialized. All fields are 0, including the offset where the capabibility starts. Hence, when you read a PCI configuration register in the range [0..length(VirtioPciCfgCap)] you get the value 0 instead of the actual register contents. Linux rescans the whole PCI bus when adding a new device. It reads the values vendor_id and device_id for every device. Because these are stored at offset 0 in pci configuration space, their value is 0 for existing devices. As such, Linux considers that the devices have been unplugged and it removes them from the system. Fixes: #6265 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2024-03-24 17:18:51 +00:00
Andrew Carp	fbdc5d4487	virtio-devices: Removing all mappings found in an unmap request According to the virtio iommu spec (section 5.13.6.6), all mappings within the entire range from virt_start to virt_end in an unmap request must be removed. This change adds this functionality, iterating through all mappings that fall within an unmap request for that domain and removing them. Signed-off-by: Andrew Carp <acarp@crusoeenergy.com>	2024-03-22 20:25:52 +00:00
Rob Bradford	2529ffd593	virtio-devices: Fix clippy warning for use of .clone() warning: `devices` (lib) generated 1 warning (run `cargo clippy --fix --lib -p devices` to apply 1 suggestion) warning: assigning the result of `Clone::clone()` may be inefficient --> virtio-devices/src/transport/pci_device.rs:1073:9 \| 1073 \| self.bar_regions = bars.clone(); \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ help: use `clone_from()`: `self.bar_regions.clone_from(&bars)` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#assigning_clones Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-03-19 18:36:22 +00:00
Rob Bradford	adb318f4cd	misc: Remove redundant "use" imports With the nightly toolchain (2024-02-18) cargo check will flag up redundant imports either because they are pulled in by the prelude on earlier match. Remove those redundant imports. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-02-19 17:54:30 +00:00
Muminul Islam	3d5718bd87	virtio-devices: handle IO event for SevSnp properly For SevSnp guest IO events are handled by GHCB protocol. While we get the notification we have to notify via eventfd. Signed-off-by: Muminul Islam <muislam@microsoft.com>	2024-02-16 11:28:32 -08:00
acarp	035c4b20fb	block: Set an option to pin virtio block threads to host cpus Currently the only way to set the affinity for virtio block threads is to boot the VM, search for the tid of each of the virtio block threads, then set the affinity manually. This commit adds an option to pin virtio block queues to specific host cpus (similar to pinning vcpus to host cpus). A queue_affinity option has been added to the disk flag in the cli to specify a mapping of queue indices to host cpus. Signed-off-by: acarp <acarp@crusoeenergy.com>	2024-02-13 09:05:57 +00:00
Alyssa Ross	451d3fb2f0	vmm: limit VSOCK CIDs to 32 bits The VIRTIO specification[1] says: > The upper 32 bits of the CID are reserved and zeroed. We should therefore not allow the user to supply a VSOCK CID with those bits set. To accomplish this, limit the public API of the virtio-vsock device to only accept 32-bit CIDs, while still using 64-bit CIDs internally since that's how virtio-vsock works. [1]: https://docs.oasis-open.org/virtio/virtio/v1.2/csd01/virtio-v1.2-csd01.html#x1-4400004 Signed-off-by: Alyssa Ross <hi@alyssa.is>	2024-01-10 17:28:56 +00:00
Alyssa Ross	48de800756	virtio-devices: fix reading vsock connect command The socket is nonblocking, so it's not guaranteed that it will be possible to read the whole connect command in a single iteration of the event loop. To reproduce: (echo -n 'CONNECT '; sleep 1; echo 1234; cat) \| socat STDIO UNIX-CONNECT:vsock.sock This would produce the error: cloud-hypervisor: 5.509209s: <_vsock4> INFO:virtio-devices/src/vsock/unix/muxer.rs:446 -- vsock: error adding local-init connection: UnixRead(Os { code: 11, kind: WouldBlock, message: "Resource temporarily unavailable" }) To fix this, if we only get a partial command, we need to save it for future iterations of the event loop, and only proceed once we've read a complete command. Signed-off-by: Alyssa Ross <hi@alyssa.is>	2024-01-09 16:01:52 +00:00
Alyssa Ross	9d5dfa879b	fuzz: fix unused import warnings Signed-off-by: Alyssa Ross <hi@alyssa.is>	2024-01-08 17:39:05 +00:00
Thomas Barrett	c297d8d796	vmm: use RateLimiterGroup for virtio-blk devices Add a 'rate_limit_groups' field to VmConfig that defines a set of named RateLimiterGroups. When the 'rate_limit_group' field of DiskConfig is defined, all virtio-blk queues will be rate-limited by a shared RateLimiterGroup. The lifecycle of all RateLimiterGroups is tied to the Vm. A RateLimiterGroup may exist even if no Disks are configured to use the RateLimiterGroup. Disks may be hot-added or hot-removed from the RateLimiterGroup. When the 'rate_limiter' field of DiskConfig is defined, we construct an anonymous RateLimiterGroup whose lifecycle is tied to the Disk. This is primarily done for api backwards compatability. Importantly, the behavior is not the same! This implementation rate_limits the aggregate bandwidth / iops of an individual disk rather than the bandwidth / iops of an individual queue of a disk. When neither the 'rate_limit_group' or the 'rate_limiter' fields of DiskConfig is defined, the Disk is not rate-limited. Signed-off-by: Thomas Barrett <tbarrett@crusoeenergy.com>	2024-01-03 10:21:06 -08:00
Thomas Barrett	45b01d592a	vmm: assign each pci segment 32-bit mmio allocator Signed-off-by: Thomas Barrett <tbarrett@crusoeenergy.com>	2023-11-20 15:33:50 -08:00
Bo Chen	d4892f41b3	misc: Stop using deprecated functions from vm-memory crate See: https://github.com/rust-vmm/vm-memory/pull/247 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-11-14 09:17:42 +00:00
Bo Chen	4d7a4c598a	build: Upgrade vm-memory crates and its consumers Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-11-14 09:17:42 +00:00
Bo Chen	d4a163dd39	virtio-devices: Fix beta clippy issue error: use of a fallible conversion when an infallible one could be used Error: --> virtio-devices/src/vhost_user/vu_common_ctrl.rs:206:51 \| 206 \| let actual_size: usize = queue.size().try_into().unwrap(); \| ^^^^^^^^^^^^^^^^^^^ help: use: `into()` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#unnecessary_fallible_conversions = note: `-D clippy::unnecessary-fallible-conversions` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(clippy::unnecessary_fallible_conversions)]` error: could not compile `virtio-devices` (lib) due to previous error Error: warning: build failed, waiting for other jobs to finish... error: could not compile `virtio-devices` (lib test) due to previous error Error: The process '/home/runner/.cargo/bin/cargo' failed with exit code 101 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-11-14 09:15:45 +00:00

1 2 3 4 5 ...

557 Commits