cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2025-01-19 11:05:18 +00:00

Author	SHA1	Message	Date
Yuhong Zhong	2ad8fac624	vmm: memory_manager: Fix bound checks for memory hotplug Bound checks for virtio-mem and ACPI memory hotplug are off by one and two, respectively. This prevents users from fully using the reserved memory hotplug size. For ACPI, if we specify `--memory size=2G,hotplug_size=4G` and run `ch-remote resize --memory 6G`, cloud-hypervisor will report the following error because of the incorrect bound check: `<vmm> ERROR:vmm/src/lib.rs:1631 -- Error when resizing VM: MemoryManager(InsufficientHotplugRam)` Similarly, for virtio-mem, cloud-hypervisor will fail the incorrect bound check and abort the resize. The VM will see the following error in dmesg: `virtio_mem virtio3: unknown error, marking device broken: -22` This patch fixes both bound checks and ensures that users can hot add memory up to the reserved hotplug size. Signed-off-by: Yuhong Zhong <yz@cs.columbia.edu>	2024-09-19 18:02:20 +00:00
Wei Liu	254db7b96a	vmm: fix documentation formatting Signed-off-by: Wei Liu <liuwe@microsoft.com>	2024-06-12 16:59:20 +00:00
Josh Soref	42e9632c53	misc: Fix spelling issues Misspellings were identified by: https://github.com/marketplace/actions/check-spelling * Initial corrections based on forbidden patterns from the action * Additional corrections by Google Chrome auto-suggest * Some manual corrections * Adding markdown bullets to readme credits section Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2024-06-08 16:31:30 +00:00
Rob Bradford	10ab87d6a3	misc: Migrate away from versionize Replace with serde instead. Fixes: #6370 Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-04-22 17:10:55 +00:00
Alexandru Matei	fbe3e4d642	vmm: memory_manager: don't set backing_file for virtio_mem regions The memory region that is associated with the hotpluggable part of a virtio-mem zone isn't backed by the file specified in the MemoryZoneConfig. The file is used only for the fixed part of the zone. When you try to restore a snapshot with virtio-mem, the backing file is used for all its regions. This results in the following error: VmRestore(MemoryManager(GuestMemoryRegion(MappingPastEof))) This patch sets backing_file only for the fixed part of a virtio-mem zone. Fixes: #6337 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com>	2024-03-29 20:11:20 +00:00
Rob Bradford	adb318f4cd	misc: Remove redundant "use" imports With the nightly toolchain (2024-02-18) cargo check will flag up redundant imports either because they are pulled in by the prelude on earlier match. Remove those redundant imports. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2024-02-19 17:54:30 +00:00
Sean Banko	7633d47293	vmm: prefault memory in parallel to optimize boot time On guests with large amounts of memory, using the `prefault` option can lead to a very long boot time. This commit implements the strategy taken by QEMU to prefault memory in parallel using multiple threads, decreasing the time to allocate memory for large guests by an order of magnitude or more. For example, this commit reduces the time to allocate memory for a guest configured with 704 GiB of memory on 1 NUMA node using 1 GiB hugepages from 81.44134669s to just 6.865287881s. Signed-off-by: Sean Banko <sbanko@crusoeenergy.com>	2024-02-07 08:59:03 -08:00
Thomas Barrett	45b01d592a	vmm: assign each pci segment 32-bit mmio allocator Signed-off-by: Thomas Barrett <tbarrett@crusoeenergy.com>	2023-11-20 15:33:50 -08:00
Bo Chen	d4892f41b3	misc: Stop using deprecated functions from vm-memory crate See: https://github.com/rust-vmm/vm-memory/pull/247 Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-11-14 09:17:42 +00:00
Philipp Schuster	7bf0cc1ed5	misc: Fix various spelling errors using typos This fixes all typos found by the typos utility with respect to the config file. Signed-off-by: Philipp Schuster <philipp.schuster@cyberus-technology.de>	2023-09-09 10:46:21 +01:00
Rob Bradford	4548de194d	build: Bump acpi_tables version Fix newly added deprecation for misspelling of cacheable. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2023-09-07 13:58:33 +01:00
Changyuan Lyu	7f18d0a281	memory_manager: improve memory region creation Instead of making an owned `zones`, using an iterator is cheaper since `Vec::remove` may have the performance O(n) [1]. [1]: https://doc.rust-lang.org/std/vec/struct.Vec.html#method.remove Signed-off-by: Changyuan Lyu <changyuanl@google.com>	2023-07-10 11:54:05 -07:00
Bo Chen	de31b3fadc	vmm: Clarify memory regions are required to be page-size aligned Signed-off-by: Bo Chen <chen.bo@intel.com>	2023-06-16 14:15:03 -07:00
Yu Li	8d89736c68	vmm: memory_manager: align down the rest space of ram_region This commit renames `ram_region_sub_size` to `ram_region_available_size` and makes its value align down to the default page size or hugepage size of the current memory zone, which can prevent the memory zone from being split into misaligned parts. And if the available size of the ram region is zero, this region will be marked as consumed even if it has unused space. Note that there are two methods to use hugepages. 1. Specify `hugepages` for `memory` or `memory-zone`; if the `hugepage_size` is not specified, the value can be obtained by `statfs` for `/dev/hugepages`. 2. Specify a `file` in hugetlbfs for `memory-zone`; the hugepage size can also be obtained by `statfs` for the file. The value for alignment will be the hugepage size if this memory zone is using hugepages, otherwise the value will be the default page size of the system. Fixes: #5463 Signed-off-by: Yu Li <liyu.yukiteru@bytedance.com>	2023-06-16 14:15:03 -07:00
Yu Li	55ee8eb482	arch: let `arch_memory_regions` return all available regions The previous `arch_memory_regions` function provided some memory regions with the specified memory size and filled all the previous regions before using the next one, but sometimes there may be no need to fill up the previous one, e.g., the previous one should be aligned with the hugepage size. This commit makes the `arch_memory_regions` function not take any parameters and return the max available regions, so that the memory manager can use them on demand. Fixes: #5463 Signed-off-by: Yu Li <liyu.yukiteru@bytedance.com>	2023-06-16 14:15:03 -07:00
Yu Li	ce0f30bb54	vmm: use `unwrap_or` instead of `match` for `prefault` Signed-off-by: Yu Li <liyu.yukiteru@bytedance.com>	2023-06-16 14:15:03 -07:00
Rob Bradford	f485922b78	build: Bump acpi_tables from `cb5f06c` to `05a6091` Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2023-06-08 17:28:02 +00:00
Rob Bradford	89e658d9ff	misc: Update for beta clippy failures on x86-64 Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2023-05-30 07:18:17 -07:00
Ravi kumar Veeramally	a8d1849485	vmm: Remove directory support from MemoryZoneConfig::file Fixes: #5082 Signed-off-by: Ravi kumar Veeramally <ravikumar.veeramally@intel.com>	2023-04-04 06:49:18 -07:00
Rob Bradford	73c4156775	vmm, devices: Update to latest acpi_tables crate API Significant API changes have occurred, most significantly the switch to an approach which does not require vm-memory and can run no_std. Signed-off-by: Rob Bradford <rbradford@rivosinc.com>	2023-03-03 13:08:36 +00:00
Jinank Jain	b54ce6c3db	vmm: Defer address space allocation We can ideally defer the address space allocation till we start the vCPUs for the very first time. Because the VM will not access the memory until the CPUs start running. Thus there is no need to allocate the address space eagerly and wait till the time we are going to start the vCPUs for the first time. Signed-off-by: Jinank Jain <jinankjain@microsoft.com>	2023-02-10 11:52:20 +01:00
Rob Bradford	9ba06ce5f5	vmm: memory_manager: Deprecate directory backing for memory This functionality has been obsoleted by our native support for hugepages and shared memory. See: #5082 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2023-01-10 15:08:34 +00:00
Rob Bradford	795f2a5558	vmm: memory_manager: Mark guest memory mappings as non-dumpable Including the guest RAM (or other mapped memory) in a coredump is not useful. See: #5014 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-12-15 20:36:40 +01:00
Sebastien Boeuf	3931b99d4e	vm-migration: Introduce new constructor for Snapshot This simplifies the Snapshot creation as we expect a SnapshotData to be provided most of the time. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-12-09 10:26:06 +01:00
Sebastien Boeuf	4ae6b595d7	vm-migration: Rename add_data_section() into add_data() Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-12-09 10:26:06 +01:00
Sebastien Boeuf	748018ace3	vm-migration: Don't store the id as part of Snapshot structure The information about the identifier related to a Snapshot is only relevant from the BTreeMap perspective, which is why we can get rid of the duplicated identifier in every Snapshot structure. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-12-09 10:26:06 +01:00
Sebastien Boeuf	4517b76a23	vm-migration: Rename SnapshotDataSection into SnapshotData Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-12-09 10:26:06 +01:00
Sebastien Boeuf	5b3bcfa233	vm-migration: Snapshot should have a unique SnapshotDataSection There's no reason to carry a HashMap of SnapshotDataSection per Snapshot. And given we now provide at most one SnapshotDataSection per Snapshot, there's no need to keep the id part of the SnapshotDataSection structure. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-12-09 10:26:06 +01:00
Rob Bradford	cefbf6b4a3	vmm: guest_debug: Mark coredump functionality x86_64 only The coredump functionality is only implemented for x86_64 so it should only be compiled in there. Fixes: #4964 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-12-05 17:23:52 +00:00
Rob Bradford	3888f57600	aarch64: Remove unnecessary casts (beta clippy check) Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-12-01 17:02:30 +00:00
Wei Liu	d05586f520	vmm: modify or provide safety comments Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-11-18 12:50:01 +00:00
Rob Bradford	f603afc46e	vmm: Make Transparent Huge Pages controllable (default on) Add MemoryConfig::thp and `--memory thp=on\|off` to allow control of Transparent Huge Pages. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-11-09 16:51:21 +00:00
Rob Bradford	b68add2d0d	vmm: Enable THP when using anonymous memory If the memory is not backed by a file then it is possible to enable Transparent Huge Pages on the memory and take advantage of the benefits of huge pages without requiring the specific allocation of an appropriate number of huge pages. TEST=Boot and see that in /proc/`pidof cloud-hypervisor`/smaps that the region is now THPeligible (and that also pages are being used.) Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-11-09 16:51:21 +00:00
Bo Chen	a9ec0f33c0	misc: Fix clippy issues Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-11-02 09:41:43 +01:00
Rob Bradford	99d9a3d299	vmm: memory_manager: Avoid MAP_PRIVATE CoW with VFIO for hugepages too We can't use MAP_ANONYMOUS and still have huge pages so MAP_SHARED is effectively required when using huge pages. Unfortunately it is not as simple as always forcing MAP_SHARED if hugepages is on as this might be inappropriate in the backing file case hence why there is additional complexity of assigning to mmap_flags on each case and the MAP_SHARED is only turned on for the anonymous file huge page case as well as anonymous shared file case. See: #4805 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-10-31 22:28:29 +00:00
Rob Bradford	df7c728399	vmm: memory_manager: Only file back memory when required If we do not need an anonymous file backing the memory then do not create one. As a side effect this addresses an issue with CoW (mmap with MAP_PRIVATE but no MAP_ANONYMOUS) when the memory is pinned for VFIO. Fixes: #4805 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-10-31 22:28:29 +00:00
Rob Bradford	1e5a4e8d77	vmm: memory_manager: Split filesystem backed and anonymous RAM creation This simplifies the code somewhat making the code paths more readable. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-10-31 22:28:29 +00:00
Rob Bradford	ff3fb91ba6	vmm: Refactor creation of the FileOffset for GuestRegionMmap::new() Create this earlier so that it is possible to pass a None in for anonymous mappings. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-10-31 22:28:29 +00:00
Wei Liu	b99b2bc990	memory_manager: use MFD_CLOEXEC flag when creating memory fd Until there is a need for sharing the memory fd with a child process, we should err on the safe side to close it on exec. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-10-27 09:20:08 +02:00
Sebastien Boeuf	c52ccf3992	vmm: migration: Create destination VM right before to restore it This is preliminary work to ensure a migrated VM is created right before it is restored. This will be useful when moving to a design where the VM is both created and restored simultaneously from the Snapshot. In detail, that means the MemoryManager is the object that must be created upon receiving the config from the source VM, so that memory content can be later received and filled into the GuestMemory. Only after these steps have happened is the snapshot received from the source VM, and the actual Vm object can be created from both the snapshot and the MemoryManager previously created. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-10-18 17:14:29 +02:00
Bo Chen	37c3b0429a	vmm: Make MemoryManager::create_ram_region() public So that it can be reused externally, such as for fuzzing. Signed-off-by: Bo Chen <chen.bo@intel.com>	2022-10-12 16:09:27 +01:00
Rob Bradford	1202b9a07a	vmm: Add some tracing of boot sequence Add tracing of the VM boot sequence from the point at which the request to create a VM is received to the hand-off to the vCPU threads running. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-09-22 18:09:31 +01:00
Sebastien Boeuf	f38056fc9e	virtio-devices, vmm: Simplify virtio-mem resize operation There's no need to delegate the resize operation to the virtio-mem thread. This can come directly from the vmm thread which will use the Mem object to update the VIRTIO configuration and trigger the interrupt for the guest to be notified. In order to achieve what's described above, the VirtioMemZone structure now has a handle onto the Mem object directly. This avoids the need for intermediate Resize and ResizeSender structures. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-09-20 13:43:40 +02:00
Michael Zhao	b65639fad3	vmm:AArch64: move uefi_flash to memory manager uefi_flash is used when loading firmware, which means loading the payload depends on the device manager. Moving uefi_flash to the memory manager eliminates this dependency. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com> Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-08-31 08:32:08 +01:00
Yi Wang	0e65ca4a6c	vmm: save guest memory for coredump Guest memory is needed for analysis in crash tool, so save it for coredump. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Yi Wang	90034fd6ba	vmm: add GuestDebuggable trait It's useful to dump the guest, which is called a coredump, so that the crash tool can be used to analyze it when the guest hangs. Let's add the GuestDebuggable trait and Coredumpxxx error to support coredump first. Signed-off-by: Yi Wang <wang.yi59@zte.com.cn> Co-authored-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-05-30 13:41:40 +02:00
Maksym Pavlenko	3a0429c998	cargo: Clean up serde dependencies There is no need to include serde_derive separately, as it can be specified as serde feature instead. Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2022-05-18 08:21:19 +02:00
Rob Bradford	bf9f79081a	vmm: Only create ACPI memory manager DSDT when resizable If using the ACPI based hotplug only memory can be added so if the hotplug RAM size is the same as the boot RAM size then do not include the memory manager DSDT entries. Also: this change simplifies the code marginally by making the HotplugMethod enum Copyable. This was identified from the following perf output: 1.78% 0.00% vmm cloud-hypervisor [.] <vmm::memory_manager::MemorySlots as acpi_tables::aml::Aml>::append_aml_bytes \| ---<vmm::memory_manager::MemorySlots as acpi_tables::aml::Aml>::append_aml_bytes <vmm::memory_manager::MemorySlot as acpi_tables::aml::Aml>::append_aml_bytes acpi_tables::aml::Name::new <acpi_tables::aml::Path as acpi_tables::aml::Aml>::append_aml_bytes __libc_malloc Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-04-26 13:07:19 +02:00
Michael Zhao	848d88c122	aarch64: Reserve a hole in 32-bit space The reserved space is for devices. Some devices (like TPM) require arbitrary addresses close to 4GiB. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-04-05 11:04:52 +08:00
Rob Bradford	7fd76eff05	vmm: Don't error if live resizing is not possible The introduction of an error if live resizing is not possible is a regression compared to the original behaviour where the new size would be stored in the config and reflected in the next boot. This behaviour was also inconsistent with the effect of resizing with no VM booted. Instead of generating an error, allow the code to go ahead and update the config so that the new size will be available upon the reboot. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-03-31 17:04:53 +01:00
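
The hotplug bound check described in commit 2ad8fac624 above can be illustrated with a minimal sketch. This is not the code from `vmm/src/memory_manager.rs`: the function `check_hotplug_resize`, the `ResizeError` enum, and the `GIB` constant are hypothetical names chosen for illustration, and only the `InsufficientHotplugRam` variant name is taken from the error quoted in the commit message. The sketch assumes the intended rule is that the hot-added amount may equal, but not exceed, the reserved hotplug size.

```rust
/// One gibibyte in bytes (illustrative constant).
const GIB: u64 = 1 << 30;

/// Hypothetical error type; the variant name mirrors the error
/// quoted in the commit message.
#[derive(Debug, PartialEq)]
enum ResizeError {
    InsufficientHotplugRam,
}

/// Returns the number of bytes to hot add, or an error if the request
/// exceeds the reserved hotplug size.
///
/// The comparison is strictly "greater than", so a request that uses
/// exactly the reserved hotplug size is accepted. Using ">=" here would
/// reproduce an off-by-one rejection of the largest valid request.
/// Shrinking below the boot size is not modeled: it yields 0 added bytes.
fn check_hotplug_resize(
    boot_size: u64,
    hotplug_size: u64,
    desired_size: u64,
) -> Result<u64, ResizeError> {
    let added = desired_size.saturating_sub(boot_size);
    if added > hotplug_size {
        return Err(ResizeError::InsufficientHotplugRam);
    }
    Ok(added)
}
```

With the values from the commit message (`size=2G`, `hotplug_size=4G`), a resize to 6G adds exactly 4 GiB and is accepted under this rule, while any request above 6G is rejected.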


249 Commits