This allocator allocates 64-bit MMIO addresses for use with platform
devices, e.g. ACPI control devices, and ensures there is no overlap with
PCI address space ranges, which could otherwise cause issues with PCI
device remapping.
Use this allocator for the ACPI platform devices.
Signed-off-by: Rob Bradford <robert.bradford@intel.com>
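A minimal sketch of how such an allocator can stay clear of PCI address
space (names and window bounds are illustrative assumptions, not the
actual Cloud Hypervisor code): the allocation window simply starts above
the PCI 64-bit hole, so handed-out addresses can never overlap it.
```
struct PlatformMmioAllocator {
    next: u64, // next candidate address
    end: u64,  // exclusive end of the platform MMIO window
}

impl PlatformMmioAllocator {
    fn new(start: u64, end: u64) -> Self {
        Self { next: start, end }
    }

    // `align` must be a non-zero power of two.
    fn allocate(&mut self, size: u64, align: u64) -> Option<u64> {
        let base = self.next.checked_add(align - 1)? & !(align - 1);
        let top = base.checked_add(size)?;
        if top > self.end {
            return None;
        }
        self.next = top;
        Some(base)
    }
}

fn main() {
    // Window placed above the end of the PCI 64-bit hole (made-up bounds).
    let mut allocator = PlatformMmioAllocator::new(0xf_0000_0000, 0xf_1000_0000);
    let gpa = allocator.allocate(0x1000, 0x1000).unwrap();
    println!("platform device MMIO at 0x{gpa:x}");
}
```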
Rather than use the system MMIO allocator for RAM, use an allocator that
covers the full RAM range.
Signed-off-by: Rob Bradford <robert.bradford@intel.com>
This is because the SGX region will be placed between the end of RAM and
the start of the device area.
Signed-off-by: Rob Bradford <robert.bradford@intel.com>
Instead of creating a MemoryManager from scratch, let's reuse the same
code path used by snapshot/restore, so that memory regions are created
identically to what they were on the source VM.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Now that all the pieces are in place, we can restore a VM with the new
code path that properly restores all memory regions, allowing ACPI
memory hotplug to work properly with the snapshot/restore feature.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Extending the MemoryManager::new() function to be able to create a
MemoryManager from data that has been previously stored, instead of
always creating everything from scratch.
This change brings real added value as it allows a VM to be restored
with the proper memory layout, instead of hoping the regions will be
created the way they were before.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Storing multiple pieces of data coming from the MemoryManager in order
to be able to restore without creating everything from scratch.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
This new function will be able to restore memory regions and memory
zones based on the GuestMemoryMapping list that will be provided through
snapshot/restore and migration phases.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
This can help identify which zone relates to which memory range.
This is going to be useful when recreating GuestMemory regions from
the previous layout instead of having to recreate everything from
scratch.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Create a dedicated function to factorize the allocation of the memory
ranges, helping simplify the MemoryManager::new() function.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
By updating the list of GuestMemory regions with the virtio-mem ones
before the creation of the MemoryManager, we know the GuestMemory is up
to date and the allocation of memory ranges is simplified afterwards.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
In order to simplify the MemoryManager::new() function, let's move the
memory configuration validation to its own function.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
The argument `prefault` is provided in MemoryManager, but it can
only be used by SGX and restore.
With prefault (MAP_POPULATE) set, subsequent page faults decrease at
runtime, although it makes boot slower.
This commit adds `prefault` in MemoryConfig and MemoryZoneConfig.
To resolve the conflict between the memory configuration and restore,
the argument `prefault` has been changed from `bool` to `Option<bool>`:
when its value is `None`, the config from memory is used; otherwise the
value carried in the `Option` is used.
Signed-off-by: Yu Li <liyu.yukiteru@bytedance.com>
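A minimal sketch of the `Option<bool>` resolution described above
(function and parameter names are illustrative, not the actual code):
```
fn effective_prefault(config_prefault: bool, prefault: Option<bool>) -> bool {
    // `None` means "use the value from the memory config";
    // `Some(v)` (e.g. from restore or SGX) overrides it.
    prefault.unwrap_or(config_prefault)
}

fn main() {
    // Restore can force prefaulting regardless of the config...
    assert!(effective_prefault(false, Some(true)));
    // ...while a regular boot falls back to the config value.
    assert!(!effective_prefault(false, None));
}
```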
By using a single file for storing the memory ranges, we simplify the
way snapshot/restore works by avoiding multiple files, but the more
important point is that we now have a way to save only the ranges that
matter. In particular, the ranges related to virtio-mem regions are not
always fully hotplugged, meaning we don't want to save the entire
region. That's where the usage of memory ranges is interesting, as it
lets us optimize the snapshot/restore process when one or multiple
virtio-mem regions are involved.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
The function memory_range_table() will be reused by the MemoryManager in
a following patch to describe all the ranges that we should snapshot.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
By creating the BlocksState object in the MemoryManager, we can directly
provide it to the virtio-mem device when it is created. This will allow
the MemoryManager, through each VirtioMemZone, to have a handle onto the
blocks that are plugged at any point in time.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
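A sketch of the handle sharing described above (BlocksState simplified
to one bool per virtio-mem block; not the actual code):
```
use std::sync::{Arc, Mutex};

struct BlocksState(Vec<bool>);

fn main() {
    let num_blocks = 1024;
    let blocks_state = Arc::new(Mutex::new(BlocksState(vec![false; num_blocks])));

    // One clone is handed to the virtio-mem device at creation time...
    let for_device = Arc::clone(&blocks_state);
    // ...which marks blocks as the guest plugs them:
    for_device.lock().unwrap().0[0] = true;

    // ...while the MemoryManager keeps its own handle and can observe
    // which blocks are plugged at any point in time:
    let plugged = blocks_state.lock().unwrap().0.iter().filter(|b| **b).count();
    println!("{plugged} block(s) plugged");
}
```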
This will be helpful to support the creation of a MemoryRangeTable from
virtio-mem, as it uses 2M pages.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Adding snapshot/restore support, along with migration support, allowing
a VM with virtio-mem devices attached to be properly migrated.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
The amount of memory plugged in the virtio-mem region should always be
kept up to date in the hotplugged_size field from VirtioMemZone.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
There's no need to duplicate the GuestMemory for snapshot purposes, as
we always have a handle onto the GuestMemory through the guest_memory
field.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Fix the following clippy error:
```
error: all if blocks contain the same code at the end
   --> vmm/src/memory_manager.rs:884:9
    |
884 | /         Ok(mm)
885 | |     }
    | |_________^
```
Signed-off-by: Bo Chen <chen.bo@intel.com>
Now that Migratable provides the methods for starting, stopping and
retrieving the dirty pages, we move the existing code to these new
functions.
No functional change intended.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
In anticipation for supporting the merge of multiple dirty pages coming
from multiple devices, this patch factorizes the creation of a
MemoryRangeTable from a bitmap, as well as providing a simple method for
merging the dirty pages regions under a single MemoryRangeTable.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
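A minimal sketch of the bitmap-to-ranges conversion (struct, constant
and function names are illustrative, not the actual MemoryRangeTable
implementation): consecutive dirty pages are merged into one range.
```
const PAGE_SIZE: u64 = 4096;

#[derive(Debug)]
struct MemoryRange {
    gpa: u64,
    length: u64,
}

fn ranges_from_bitmap(base_gpa: u64, bitmap: &[u64]) -> Vec<MemoryRange> {
    let mut ranges: Vec<MemoryRange> = Vec::new();
    for (word_idx, word) in bitmap.iter().enumerate() {
        for bit in 0..64u64 {
            if *word & (1u64 << bit) == 0 {
                continue;
            }
            let gpa = base_gpa + ((word_idx as u64) * 64 + bit) * PAGE_SIZE;
            // Merge with the previous range when the new page is contiguous.
            let merged = match ranges.last_mut() {
                Some(last) if last.gpa + last.length == gpa => {
                    last.length += PAGE_SIZE;
                    true
                }
                _ => false,
            };
            if !merged {
                ranges.push(MemoryRange { gpa, length: PAGE_SIZE });
            }
        }
    }
    ranges
}

fn main() {
    // Pages 0, 1 and 3 dirty: two ranges, [0x0, 0x2000) and [0x3000, 0x4000).
    println!("{:?}", ranges_from_bitmap(0, &[0b1011]));
}
```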
Right now, the get_dirty_log API has two parameters,
slot and memory_size.
MSHV needs the GPA to retrieve the page states, as
MSHV returns the state based on the PFN.
Signed-off-by: Muminul Islam <muislam@microsoft.com>
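A plausible shape of the updated API (an assumption sketched from the
description above, not the exact upstream signature):
```
pub trait Vm {
    fn get_dirty_log(
        &self,
        slot: u32,        // memory slot identifier (what KVM keys on)
        base_gpa: u64,    // guest physical base of the region (what MSHV needs)
        memory_size: u64, // size of the region in bytes
    ) -> Result<Vec<u64>, std::io::Error>; // one bit per page
}
```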
As we are now using a global control to start/stop dirty pages logging
from the `hypervisor` crate, we need to explicitly tell the hypervisor
(KVM) whether a region needs dirty page tracking when it is created.
This reverts commit f063346de3.
Signed-off-by: Bo Chen <chen.bo@intel.com>
Following KVM interfaces, the `hypervisor` crate now provides interfaces
to start/stop the dirty pages logging on a per-region basis, and asks
its users (e.g. the `vmm` crate) to iterate over the regions that need
dirty pages logging. MSHV only has a global control to start/stop dirty
pages logging on all regions at once.
This patch refactors the related APIs from the `hypervisor` crate to
provide a global control to start/stop dirty pages logging (following
MSHV's behavior), and keeps tracking the regions that need dirty pages
logging for KVM. This avoids leaking hypervisor-specific behaviors out
of the `hypervisor` crate.
Signed-off-by: Bo Chen <chen.bo@intel.com>
With the support of dynamically turning on/off dirty-pages logging
during live migration (only for guest RAM regions), we can now create
guest memory regions without dirty-pages logging by default, both for
guest RAM regions and other regions backed by file/device.
Signed-off-by: Bo Chen <chen.bo@intel.com>
This patch slightly extends the current live-migration code path with
the ability to dynamically start and stop logging dirty pages, which
relies on two new methods added to the `hypervisor::vm::Vm` trait. This
patch also contains a complete implementation of the two new methods
based on `kvm` and placeholders for `mshv` in the `hypervisor` crate.
Fixes: #2858
Signed-off-by: Bo Chen <chen.bo@intel.com>
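A sketch of what the two new trait methods could look like (method names
and error type are assumptions based on the description above, not the
exact code):
```
pub trait Vm {
    /// Turn on dirty-pages logging for guest RAM regions.
    fn start_dirty_log(&self) -> Result<(), std::io::Error>;
    /// Turn it back off once the migration converges.
    fn stop_dirty_log(&self) -> Result<(), std::io::Error>;
}
```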
In order to uniquely identify each SGX EPC section, we introduce a
mandatory option `id` to the `--sgx-epc` parameter.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
The guest can see that SGX supports provisioning as it is exposed
through the CPUID. This patch enables the proper backing of this
feature by having the host open the provisioning device and enable
this capability through the hypervisor.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Previously the same function was used to both create and remove regions.
This worked on KVM because it uses size 0 to indicate removal.
MSHV has two calls -- one for creation and one for removal. It also
requires having the size field available because it is not slot based.
Split set_user_memory_region into {create/remove}_user_memory_region. For
KVM they still use set_user_memory_region underneath, but for MSHV they
map to different functions.
This fixes user memory region removal on MSHV.
Signed-off-by: Wei Liu <liuwe@microsoft.com>
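A sketch of the split interface (field and error types are illustrative
assumptions): both calls carry the full region description, so MSHV has
the size available on removal as well.
```
pub struct UserMemoryRegion {
    pub slot: u32,            // used by KVM, which is slot based
    pub guest_phys_addr: u64,
    pub memory_size: u64,     // required by MSHV even for removal
    pub userspace_addr: u64,
}

pub trait Vm {
    fn create_user_memory_region(&self, region: UserMemoryRegion) -> Result<(), std::io::Error>;
    fn remove_user_memory_region(&self, region: UserMemoryRegion) -> Result<(), std::io::Error>;
}
```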
The _PXM method always returns 0, which is wrong since the SRAT might
say otherwise. The point of the _PXM method is to be evaluated by the
guest OS when some new memory slot is being plugged, but this will never
happen for Cloud Hypervisor since using NUMA nodes along with memory
hotplug only works for virtio-mem.
Memory hotplug through ACPI will only happen when there's only one NUMA
node exposed to the guest, which means the _PXM method won't be needed
at all.
Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>
Live migration currently handles guest memory writes from the guest
through the KVM dirty page tracking and sends those dirty pages to the
destination. This patch augments the live migration support with dirty
page tracking of writes from the VMM to the guest memory (e.g. virtio
devices).
Fixes: #2458
Signed-off-by: Bo Chen <chen.bo@intel.com>
Function "GuestMemory::with_regions(_mut)" were mainly temporary methods
to access the regions in `GuestMemory` as the lack of iterator-based
access, and hence they are deprecated in the upstream vm-memory crate [1].
[1] https://github.com/rust-vmm/vm-memory/issues/133
Signed-off-by: Bo Chen <chen.bo@intel.com>
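A sketch of moving off the deprecated helpers, assuming the
iterator-based access that replaced them upstream (`GuestMemory::iter()`):
```
use vm_memory::{GuestMemory, GuestMemoryRegion};

fn total_guest_memory<M: GuestMemory>(mem: &M) -> u64 {
    // Before: mem.with_regions(|_idx, region| { ... })
    // After: plain iteration over the regions.
    mem.iter().map(|region| region.len()).sum()
}
```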
As the first step to complete live-migration with tracking dirty-pages
written by the VMM, this commit patches the dependent vm-memory crate to
the upstream version with the dirty-page-tracking capability. Most
changes are due to the updated `GuestMemoryMmap`, `GuestRegionMmap`, and
`MmapRegion` structs which are taking an additional generic type
parameter to specify what 'bitmap backend' is used.
The above changes should be transparent to the rest of the code base,
e.g. all unit/integration tests should pass without additional changes.
Signed-off-by: Bo Chen <chen.bo@intel.com>
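A sketch of how the new generic parameter can be kept transparent to the
rest of the code base (assuming vm-memory's `AtomicBitmap`, behind its
"backend-bitmap" feature, as the dirty-bitmap backend): a handful of
type aliases at the crate root.
```
use vm_memory::bitmap::AtomicBitmap;

pub type GuestMemoryMmap = vm_memory::GuestMemoryMmap<AtomicBitmap>;
pub type GuestRegionMmap = vm_memory::GuestRegionMmap<AtomicBitmap>;
pub type MmapRegion = vm_memory::MmapRegion<AtomicBitmap>;
```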
Now that all crates use edition = "2018", the majority of the "extern
crate" statements can be removed. Only those for importing macros need
to remain.
Signed-off-by: Rob Bradford <robert.bradford@intel.com>
The latest kvm-sgx code has renamed sgx_virt_epc device node
to sgx_vepc. Update cloud-hypervisor code and documentation to
follow this.
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
In order to support using Versionize for state structures it is
necessary to use simpler, primitive data types in the state definitions
used for snapshot restore.
Signed-off-by: Rob Bradford <robert.bradford@intel.com>
Instead of tracking at a block level of 64 pages, we are now collecting
dirty pages one by one. This improves the efficiency of dirty memory
tracking during live migration.
Signed-off-by: Bo Chen <chen.bo@intel.com>
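An illustrative contrast between the two granularities (not the actual
code): with block-level tracking, a single set bit marks a whole 64-page
block dirty, while page-level tracking counts only the bits actually set.
```
fn dirty_pages_block_level(bitmap: &[u64]) -> u64 {
    // Any set bit dirties the whole 64-page block.
    bitmap.iter().filter(|word| **word != 0).count() as u64 * 64
}

fn dirty_pages_page_level(bitmap: &[u64]) -> u64 {
    // Only pages whose bit is set are dirty.
    bitmap.iter().map(|word| word.count_ones() as u64).sum()
}

fn main() {
    let bitmap = [0b1u64, 0, 0b101];
    assert_eq!(dirty_pages_block_level(&bitmap), 128);
    assert_eq!(dirty_pages_page_level(&bitmap), 3);
}
```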
The MCRS method returns a 64-bit memory range descriptor. The
calculation is supposed to be done as follows:
```
max = min + len - 1
```
However, every operand is represented not as a QWORD but as a
combination of two DWORDs for the high and low parts. Until now, the
calculation was done this way, please see also the inline comments:
```
max.lo = min.lo + len.lo // this may overflow, need to carry over to high
max.hi = min.hi + len.hi
max.hi = max.hi - 1      // subtraction needs to happen on the low part
```
This calculation has been corrected in the following way:
```
max.lo = min.lo + len.lo
max.hi = min.hi + len.hi + (max.lo < min.lo) // check for overflow
max.lo = max.lo - 1                          // subtract from low part
```
The relevant part from the generated ASL for the MCRS method:
```
Method (MCRS, 1, Serialized)
{
    Acquire (MLCK, 0xFFFF)
    \_SB.MHPC.MSEL = Arg0
    Name (MR64, ResourceTemplate ()
    {
        QWordMemory (ResourceProducer, PosDecode, MinFixed, MaxFixed, Cacheable, ReadWrite,
            0x0000000000000000, // Granularity
            0x0000000000000000, // Range Minimum
            0xFFFFFFFFFFFFFFFE, // Range Maximum
            0x0000000000000000, // Translation Offset
            0xFFFFFFFFFFFFFFFF, // Length
            ,, _Y00, AddressRangeMemory, TypeStatic)
    })
    CreateQWordField (MR64, \_SB.MHPC.MCRS._Y00._MIN, MINL) // _MIN: Minimum Base Address
    CreateDWordField (MR64, 0x12, MINH)
    CreateQWordField (MR64, \_SB.MHPC.MCRS._Y00._MAX, MAXL) // _MAX: Maximum Base Address
    CreateDWordField (MR64, 0x1A, MAXH)
    CreateQWordField (MR64, \_SB.MHPC.MCRS._Y00._LEN, LENL) // _LEN: Length
    CreateDWordField (MR64, 0x2A, LENH)
    MINL = \_SB.MHPC.MHBL
    MINH = \_SB.MHPC.MHBH
    LENL = \_SB.MHPC.MHLL
    LENH = \_SB.MHPC.MHLH
    MAXL = (MINL + LENL) /* \_SB_.MHPC.MCRS.LENL */
    MAXH = (MINH + LENH) /* \_SB_.MHPC.MCRS.LENH */
    If ((MAXL < MINL))
    {
        MAXH += One /* \_SB_.MHPC.MCRS.MAXH */
    }
    MAXL -= One
    Release (MLCK)
    Return (MR64) /* \_SB_.MHPC.MCRS.MR64 */
}
```
Fixes #1800.
Signed-off-by: Anatol Belski <anbelski@linux.microsoft.com>