cloud-hypervisor

mirror of https://github.com/cloud-hypervisor/cloud-hypervisor.git synced 2024-11-05 19:41:27 +00:00

Author	SHA1	Message	Date
Wei Liu	e1cf889dbd	hypervisor: use UserMemoryRegion in the Vm trait Signed-off-by: Dev Rajput <t-devrajput@microsoft.com> Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-14 07:55:48 +01:00
Wei Liu	5894b5370c	hypervisor: transform between UserMemoryRegion and hypervisor structs Signed-off-by: Dev Rajput <t-devrajput@microsoft.com> Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-14 07:55:48 +01:00
Wei Liu	84bbaf06d1	hypervisor: turn boot_msr_entries into a trait method This allows dispatching to either KVM or MSHV automatically. No functional change. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2022-07-08 16:49:58 +01:00
Rob Bradford	2716bc3311	build: Fix beta clippy issue (derive_partial_eq_without_eq) warning: you are deriving `PartialEq` and can implement `Eq` --> vmm/src/serial_manager.rs:59:30 \| 59 \| #[derive(Debug, Clone, Copy, PartialEq)] \| ^^^^^^^^^ help: consider deriving `Eq` as well: `PartialEq, Eq` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#derive_partial_eq_without_eq Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-06-30 20:50:45 +01:00
Michael Zhao	a7a15d56dd	aarch64: Move `setup_regs` to `hypervisor` `setup_regs` of AArch64 calls KVM sepecific code. Now move it to `hypervisor` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 11:07:46 +01:00
Michael Zhao	5d45d6d0fb	vmm: Move GIC unit test to `hypervisor` crate Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	957d3a7443	aarch64: Simplify GIC related structs definition Combined the `GicDevice` struct in `arch` crate and the `Gic` struct in `devices` crate. After moving the KVM specific code for GIC in `arch`, a very thin wapper layer `GicDevice` was left in `arch` crate. It is easy to combine it with the `Gic` in `devices` crate. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	b8dbb26647	hypervisor: Refactor `save_pending_tables` of Vgic Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	3fe7d61a02	hypervisor: Remove some redundant parameters Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Michael Zhao	c2862b6947	hypervisor: Move GitV3Its code from `arch` Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2022-06-06 10:17:26 +08:00
Maksym Pavlenko	3a0429c998	cargo: Clean up serde dependencies There is no need to include serde_derive separately, as it can be specified as serde feature instead. Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2022-05-18 08:21:19 +02:00
Rob Bradford	218be2642e	hypervisor: Explicitly `pub use` at the hypervisor crate top-level Explicitly re-export types from the hypervisor specific modules. This makes it much clearer what the common functionality that is exposed is. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-13 15:39:22 +02:00
Rob Bradford	3ffc105f83	hypervisor, vm-device: Relocate InterruptSourceConfig Move this enum from vm-device to hypervisor crate so that hypervisor crate does not gain an extra dependency. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-11 11:19:14 +01:00
Rob Bradford	3f9e8d676a	hypervisor: Move creation of irq routing struct to hypervisor crate This removes the requirement to leak as many datastructures from the hypervisor crate into the vmm crate. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-11 11:19:14 +01:00
Rob Bradford	387d56879b	vmm, hypervisor: Clean up nomenclature around offloading VM operations The trait and functionality is about operations on the VM rather than the VMM so should be named appropriately. This clashed with with existing struct for the concrete implementation that was renamed appropriately. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-05-10 13:10:01 +01:00
Sebastien Boeuf	b0077f0b5e	hypervisor: Implement retrieval of TDX capabilities Extend the Hypervisor API in order to retrieve the TDX capabilities from the underlying hypervisor. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-03-30 14:57:23 +01:00
Sebastien Boeuf	f310dc0916	hypervisor: Don't enable TDX debug This might not be correctly supported, therefore best to keep it disabled by default. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-03-30 14:57:23 +01:00
Akira Moroo	9f111388c0	hypervisor: Add `VmExit::Debug` for x86/KVM This commit adds `VmExit::Debug` for x86/KVM. When the guest hits a hardware breakpoint, `VcpuExit::Debug` vm exit occurs. This vm exit will be handled with code implemented in the following commits. Signed-off-by: Akira Moroo <retrage01@gmail.com>	2022-02-23 11:16:09 +00:00
Akira Moroo	9f27954fbd	hypervisor: Add `set_guest_debug` for x86/KVM This commit adds `set_guest_debug` implementation for x86/KVM. This function sets hardware breakpoints and single step to debug registers. NOTE: The `set_guest_debug` implementation is based on the crosvm implementation [1]. [1] https://github.com/google/crosvm/blob/main/hypervisor/src/kvm/x86_64.rs Signed-off-by: Akira Moroo <retrage01@gmail.com>	2022-02-23 11:16:09 +00:00
Akira Moroo	603ca0e21b	hypervisor: Add `translate_gva` for x86/KVM This commit adds `translate_gva` for x86/KVM. The same name function is already implemented for MSHV, but the implementation differs as KVM_TRANSLATE does not take the flag argument and does not return status code. This change requires the newer version of kvm-ioctls [1]. [1] `97ff779b6e` Signed-off-by: Akira Moroo <retrage01@gmail.com>	2022-02-23 11:16:09 +00:00
Sebastien Boeuf	cb844ecd1d	hypervisor: Add support for TDX exit reason to KVM Relying on the recent additions to the kvm-ioctls crate, this commit implements the support for providing the exit reason details to the caller, which allows the identification of the type of hypercall that was issued. It also introduces a way for the consumer to set the status code that must be sent back to the guest. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2022-02-18 14:41:07 +01:00
Rob Bradford	507912385a	vmm: Ensure that PIO and MMIO exits complete before pausing As per this kernel documentation: For KVM_EXIT_IO, KVM_EXIT_MMIO, KVM_EXIT_OSI, KVM_EXIT_PAPR, KVM_EXIT_XEN, KVM_EXIT_EPR, KVM_EXIT_X86_RDMSR and KVM_EXIT_X86_WRMSR the corresponding operations are complete (and guest state is consistent) only after userspace has re-entered the kernel with KVM_RUN. The kernel side will first finish incomplete operations and then check for pending signals. The pending state of the operation is not preserved in state which is visible to userspace, thus userspace should ensure that the operation is completed before performing a live migration. Userspace can re-enter the guest with an unmasked signal pending or with the immediate_exit field set to complete pending operations without allowing any further instructions to be executed. Since we capture the state as part of the pause and override it as part of the resume we must ensure the state is consistent otherwise we will lose the results of the MMIO or PIO operation that caused the exit from which we paused. Fixes: #3658 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-02-07 15:26:22 +00:00
Jianyong Wu	9bcb984962	hypervisor: add has/set trait for vcpu Like devicefd, vcpufd also has ability to set/has attribute through kvm ioctl. These traits are used when enable PMU on arm64, so add it here. Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2022-01-21 17:59:36 +08:00
Rob Bradford	658658e76c	hypervisor: kvm: Ignore -EINVAL from KVM_KVMCLOCK_CTRL ioctl() If the guest hasn't initialised a PV clock then the KVM_KVMCLOCK_CTRL ioctl will return -EINVAL. Therefore if running in the firmware or an OS that doesn't use the PV clock then we should ignore that error Tested by migrating a VM that has not yet booted into the Linux kernel (just in firmware) by specifying no disk image: e.g. target/debug/cloud-hypervisor --kernel ~/workloads/hypervisor-fw --api-socket /tmp/api --serial tty --console off Fixes: #3586 Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2022-01-19 10:12:57 +01:00
Sebastien Boeuf	c452471c4e	hypervisor: Add support for setting KVM identity map Extending the Vm trait with set_identity_map_address() in order to expose this ioctl to the VMM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-12-04 19:33:34 +00:00
Rob Bradford	348def9dfb	arch, hypervisor, vmm: Explicitly place the TSS in the 32-bit space Place the 3 page TSS at an explicit location in the 32-bit address space to avoid conflicting with the loaded raw firmware. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-12-03 16:53:56 +01:00
Wei Liu	58d984f6b8	hypervisor: add a few safety comments Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-11-18 14:42:55 +00:00
Wei Liu	6221b6f8a1	hypervisor: aarch64: move a comment to where it should be No functional change. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-11-16 10:13:09 +08:00
Wei Liu	57cc8bc6fe	hypervisor: aarch64: remove undefined behaviour in offset__of The variable tmp was never initialized. Calling assume_init when the content is not yet initialized causes immediate undefined behaviour. We also cannot create any intermediate references because they will be subject to the same requirements for references -- the referred object must be valid. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-11-16 10:13:09 +08:00
Rob Bradford	70f9fea1c3	hypervisor: aarch64: Use assert!() rather than if+panic As identified by the new beta clippy. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-10-19 19:42:36 +01:00
Sebastien Boeuf	76a036e96d	hypervisor: kvm: Add missing MSR related to Hyper-V When the synthetic interrupt controller is enabled, an extra set of MSRs must be stored in case of migration. There was one MSR missing in the list, HV_X64_MSR_SINT14 corresponding to the 15th interrupt source from the synthetic interrupt controller. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-10-11 15:30:13 +02:00
Sebastien Boeuf	bcdac10149	deps: Bump kvm-bindings to v0.5.0 Update the kvm-bindings dependency so that Cloud Hypervisor now depends on the version 0.5.0, which is based on Linux kernel v5.13.0. We still have to rely on a forked version to be able to serialize all the KVM structures we need. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-09-15 16:20:17 +01:00
Henry Wang	d74a219add	hypervisor: Remove useless check when saving Arm SystemRegs Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2021-08-31 09:53:57 +02:00
Jiaqi Gao	a90260ffb6	hypervisor: kvm: Update TDX command INIT_VM Definition of kvm_tdx_init_vm used by INIT_VM has been updated in latest kernel, needing an update on the Cloud Hypervisor side as well. Update structure TdxInitVm to fit this change and avoid -EINVAL to be returned by the kernel. Signed-off-by: Jiaqi Gao <jiaqi.gao@intel.com> Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-08-30 10:24:37 -07:00
Muminul Islam	fdecba6958	hypervisor: MSHV needs gpa to retrieve dirty logs Right now, get_dirty_log API has two parameters, slot and memory_size. MSHV needs gpa to retrieve the page states. GPA is needed as MSHV returns the state base on PFN. Signed-off-by: Muminul Islam <muislam@microsoft.com>	2021-07-29 16:29:53 +01:00
Sebastien Boeuf	dcc646f5b1	clippy: Fix redundant allocations With the new beta version, clippy complains about redundant allocation when using Arc<Box<dyn T>>, and suggests replacing it simply with Arc<dyn T>. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-07-29 13:28:57 +02:00
Bo Chen	e7c9954dc1	hypervisor, vmm: Abstract the interfaces to start/stop dirty log Following KVM interfaces, the `hypervisor` crate now provides interfaces to start/stop the dirty pages logging on a per region basis, and asks its users (e.g. the `vmm` crate) to iterate over the regions that needs dirty pages log. MSHV only has a global control to start/stop dirty pages log on all regions at once. This patch refactors related APIs from the `hypervisor` crate to provide a global control to start/stop dirty pages log (following MSHV's behaviors), and keeps tracking the regions need dirty pages log for KVM. It avoids leaking hypervisor-specific behaviors out of the `hypervisor` crate. Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-07-28 09:08:32 -07:00
Bo Chen	5e0d498582	hypervisor, vmm: Add dynamic control of logging dirty pages This patch extends slightly the current live-migration code path with the ability to dynamically start and stop logging dirty-pages, which relies on two new methods added to the `hypervisor::vm::Vm` Trait. This patch also contains a complete implementation of the two new methods based on `kvm` and placeholders for `mshv` in the `hypervisor` crate. Fixes: #2858 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-07-26 09:19:35 -07:00
Sebastien Boeuf	9ec0c981f8	hypervisor: Add enable_sgx_attribute to the Vm API We need a dedicated function to enable the SGX attribute capability through the Hypervisor abstraction. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2021-07-07 14:56:38 +02:00
Wei Liu	1f2915bff0	vmm: hypervisor: split set_user_memory_region to two functions Previously the same function was used to both create and remove regions. This worked on KVM because it uses size 0 to indicate removal. MSHV has two calls -- one for creation and one for removal. It also requires having the size field available because it is not slot based. Split set_user_memory_region to {create/remove}_user_memory_region. For KVM they still use set_user_memory_region underneath, but for MSHV they map to different functions. This fixes user memory region removal on MSHV. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2021-07-05 09:45:45 +02:00
Bo Chen	5825ab2dd4	clippy: Address the issue 'needless-borrow' Issue from beta verion of clippy: Error: --> vm-virtio/src/queue.rs:700:59 \| 700 \| if let Some(used_event) = self.get_used_event(&mem) { \| ^^^^ help: change this to: `mem` \| = note: `-D clippy::needless-borrow` implied by `-D warnings` = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_borrow Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-06-24 08:55:43 +02:00
Henry Wang	2fe3586eba	hypervisor: support AArch64 `get_host_ipa_limit` Not all AArch64 platforms support IPAs up to 40 bits. Since the kvm-ioctl crate now supports `get_host_ipa_limit` for AArch64, when creating the VM, it is better to get the IPA size from the host and use that to create the VM. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2021-06-10 12:06:17 +02:00
Henry Wang	805cb303d5	hypervisor: Add `get_host_ipa_limit` for AArch64 This commit adds a helper `get_host_ipa_limit` to the AArch64 `KvmHypervisor` struct. This helper can be used to get the `Host_IPA_Limit`, which is the maximum possible value for IPA_Bits on the host and is dependent on the CPU capability and the kernel configuration. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2021-06-10 12:06:17 +02:00
Rob Bradford	9f5325fd52	hypervisor: tdx: Unconditionally enable TDX debug For now enable the TDX attribute for TDX debug. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-06-01 09:50:22 -07:00
Rob Bradford	84454f142d	hypervisor: Remove panic from Hypervisor::check_required_extensions() Remove the panic by replacing the .expect() with a cleaner error handling. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-05-19 17:11:30 +02:00
Rob Bradford	2439625785	hypervisor: Cleanup unused Hypervisor trait members Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-05-19 17:11:30 +02:00
Rob Bradford	28f383bae9	hypervisor: aarch64: Safer calculation of offset_of Use a safer method for calculating struct member offsets. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-05-07 07:16:09 +08:00
Rob Bradford	3c6dfd7709	tdx: Address Rust 1.51.0 clippy issue (upper_case_acroynms) error: name `FinalizeTDX` contains a capitalized acronym --> vmm/src/vm.rs:274:5 \| 274 \| FinalizeTDX(hypervisor::HypervisorVmError), \| ^^^^^^^^^^^ help: consider making the acronym lowercase, except the initial letter: `FinalizeTdx` \| = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#upper_case_acronyms Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-03-26 11:32:09 +00:00
Rob Bradford	0c27f69f1c	hypervisor: Address Rust 1.51.0 clippy issue (upper_case_acroynms) warning: name `TranslateGVA` contains a capitalized acronym --> hypervisor/src/arch/emulator/mod.rs:51:5 \| 51 \| TranslateGVA(#[source] anyhow::Error), \| ^^^^^^^^^^^^ help: consider making the acronym lowercase, except the initial letter: `TranslateGva` \| = note: `#[warn(clippy::upper_case_acronyms)]` on by default = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#upper_case_acronyms Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-03-26 11:32:09 +00:00
Vineeth Pillai	7fad74cb04	hypervisor: refactor vec_with_array_field function refactor vec_with_array_field to common hypervisor code so that mshv can also make use of it. Signed-off-by: Vineeth Pillai <viremana@linux.microsoft.com>	2021-03-23 11:06:13 +01:00
Michael Zhao	afc83582be	aarch64: Enable IRQ routing for legacy devices On AArch64, interrupt controller (GIC) is emulated by KVM. VMM need to set IRQ routing for devices, including legacy ones. Before this commit, IRQ routing was only set for MSI. Legacy routing entries of type KVM_IRQ_ROUTING_IRQCHIP were missing. That is way legacy devices (like serial device ttyS0) does not work. The setting of X86 IRQ routing entries are not impacted. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2021-03-15 20:59:50 +08:00
Rob Bradford	1c54fc3ab7	hypervisor: Support creating a VM of a specified KVM type This is necessary to support creating a TD VM. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-03-08 18:30:00 +00:00
Rob Bradford	f282cc001a	tdx: Add abstraction to call TDX ioctls to hypervisor Add API to the hypervisor interface and implement for KVM to allow the special TDX KVM ioctls on the VM and vCPU FDs. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-03-08 18:30:00 +00:00
Rob Bradford	f8875acec2	misc: Bulk upgrade dependencies In particular update for the vmm-sys-util upgrade and all the other dependent packages. This requires an updated forked version of kvm-bindings (due to updated vfio-ioctls) but allowed the removal of our forked version of kvm-ioctls. The changes to the API from kvm-ioctls and vmm-sys-util required some other minor changes to the code. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-02-26 11:31:08 +00:00
Rob Bradford	07a09eda27	hypervisor: kvm: Remove whitespace from use statements This allows cargo fmt to correctly order the statements. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-02-16 18:38:57 +01:00
Rob Bradford	a6b839b35c	build: Update to latest kvm-ioctls Update the version of the fork pointed to which has been rebased on the latest upstream. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-02-08 18:16:18 +00:00
Rob Bradford	184baff355	hypervisor: kvm: aarch64: Use struct initialisation error: field assignment outside of initializer for an instance created with Default::default() Error: --> hypervisor/src/kvm/mod.rs:1239:9 \| 1239 \| state.mp_state = self.get_mp_state()?; \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = note: `-D clippy::field-reassign-with-default` implied by `-D warnings` note: consider initializing the variable with `kvm::aarch64::VcpuKvmState { mp_state: self.get_mp_state()?, ..Default::default() }` and removing relevant reassignments --> hypervisor/src/kvm/mod.rs:1237:9 \| 1237 \| let mut state = CpuState::default(); \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#field_reassign_with_default Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-04 13:46:37 +01:00
Rob Bradford	f452fe7497	hypervisor: kvm: Use struct initialiser where possible error: field assignment outside of initializer for an instance created with Default::default() --> hypervisor/src/kvm/mod.rs:318:9 \| 318 \| cap.cap = KVM_CAP_SPLIT_IRQCHIP; \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = note: `-D clippy::field-reassign-with-default` implied by `-D warnings` note: consider initializing the variable with `kvm_bindings::kvm_enable_cap { cap: KVM_CAP_SPLIT_IRQCHIP, ..Default::default() }` and removing relevant reassignments --> hypervisor/src/kvm/mod.rs:317:9 \| 317 \| let mut cap: kvm_enable_cap = Default::default(); \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ = help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#field_reassign_with_default Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2021-01-04 13:46:37 +01:00
Muminul Islam	8c85dd32fa	hypervisor: Move msr and msr_data macro to arch/x86 Currently these two macros(msr, msr_data) reside both on kvm and mshv module. Definition is same for both module. Moving them to arch/x86 module eliminates redundancy and makes more sense. Signed-off-by: Muminul Islam <muislam@microsoft.com>	2020-12-11 00:59:46 +01:00
Rob Bradford	ffaab46934	misc: Use a more relaxed memory model when possible When a total ordering between multiple atomic variables is not required then use Ordering::Acquire with atomic loads and Ordering::Release with atomic stores. This will improve performance as this does not require a memory fence on x86_64 which Ordering::SeqCst will use. Add a comment to the code in the vCPU handling code where it operates on multiple atomics to explain why Ordering::SeqCst is required. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-12-02 19:04:30 +01:00
Samuel Ortiz	d419e30df1	hypervisor: x86: Add a SegmentRegistorOps trait In order to validate emulated memory accesses, we need to be able to get all the segments descriptor attributes. This is done by abstracting the SegmentRegister attributes through a trait that each hypervisor will have to implement. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-11-30 13:09:19 +00:00
Rob Bradford	0fec326582	hypervisor, vmm: Remove shared ownership of VmmOps This interface is used by the vCPU thread to delegate responsibility for handling MMIO/PIO operations and to support different approaches than a VM exit. During profiling I found that we were spending 13.75% of the boot CPU uage acquiring access to the object holding the VmmOps via ArcSwap::load_full() 13.75% 6.02% vcpu0 cloud-hypervisor [.] arc_swap::ArcSwapAny<T,S>::load_full \| ---arc_swap::ArcSwapAny<T,S>::load_full \| --13.43%--<hypervisor::kvm::KvmVcpu as hypervisor::cpu::Vcpu>::run std::sys_common::backtrace::__rust_begin_short_backtrace core::ops::function::FnOnce::call_once{{vtable-shim}} std::sys::unix:🧵:Thread:🆕:thread_start However since the object implementing VmmOps does not need to be mutable and it is only used from the vCPU side we can change the ownership to being a simple Arc<> that is passed in when calling create_vcpu(). This completely removes the above CPU usage from subsequent profiles. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-19 00:16:02 +01:00
Rob Bradford	041724a7cf	hypervisor: Add ability to get dirty logged pages Return a bitmap of pages that have been dirtied (written to) since it was last called. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-17 16:57:11 +00:00
Rob Bradford	8baa244ec1	hypervisor: Add control for dirty page logging When creating a userspace mapping provide a control for enabling the logging of dirty pages. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-17 16:57:11 +00:00
Rob Bradford	46e736973e	hypervisor: kvm: Correctly share VmmOps between Kvm{Vm,Vcpu} Cloning the ArcSwapOption (like the ArcSwap) does not act like a .clone() on an Arc, instead an entirely new ArcSwap is created with the same contents. To correctly share the ArcSwap needs to be placed inside an Arc. See: `2433d5719b (diff-6c6d94533c44c19bd1416ef17bad1a878e63dca6e98d59181228fbe8f967c62bR6)` Due to this being wrongly used ::clone() was removed from ArcSwap/ArcSwapOption in 1.0.0. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-11-16 14:10:09 +01:00
Michael Zhao	093a581ee1	vmm: Implement VM rebooting on AArch64 The logic to handle AArch64 system event was: SHUTDOWN and RESET were all treated as RESET. Now we handle them differently: - RESET event will trigger Vmm::vm_reboot(), - SHUTDOWN event will trigger Vmm::vm_shutdown(). Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-10-30 17:14:44 +00:00
Sebastien Boeuf	28e12e9f3a	vmm, hypervisor: Fix snapshot/restore for Windows guest The snasphot/restore feature is not working because some CPU states are not properly saved, which means they can't be restored later on. First thing, we ensure the CPUID is stored so that it can be properly restored later. The code is simplified and pushed down to the hypervisor crate. Second thing, we identify for each vCPU if the Hyper-V SynIC device is emulated or not. In case it is, that means some specific MSRs will be set by the guest. These MSRs must be saved in order to properly restore the VM. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-21 19:11:03 +01:00
Rob Bradford	c4dc25de09	hypervisor: kvm: aarch64: Trigger reset upon KVM_SYSTEM_EVENT_RESET This will trigger Vm::vm_reboot to make the VM reboot. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-10-20 12:46:35 +08:00
Rob Bradford	573a5c63cf	hypervisor: kvm: Use unstable_sort() to keep clippy happy "Using a stable sort consumes more memory and cpu cycles. Because values which compare equal are identical, preserving their relative order (the guarantee that a stable sort provides) means nothing, while the extra costs still apply." Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-10-09 10:49:54 +02:00
Wei Liu	ed1fdd1f7d	hypervisor, arch: rename "OneRegister" and relevant code The OneRegister literally means "one (arbitrary) register". Just call it "Register" instead. There is no need to inherit KVM's naming scheme in the hypervisor agnostic code. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-10-08 08:55:10 +02:00
Wei Liu	9ad14e6b3a	aarch64: Add OneReg to the list required extensions for KVM Without that capability save / restore for aarch64 won't work. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-10-08 08:54:38 +02:00
Praveen Paladugu	71c435ce91	hypervisor, vmm: Introduce VmmOps trait Run loop in hypervisor needs a callback mechanism to access resources like guest memory, mmio, pio etc. VmmOps trait is introduced here, which is implemented by vmm module. While handling vcpuexits in run loop, this trait allows hypervisor module access to the above mentioned resources via callbacks. Signed-off-by: Praveen Paladugu <prapal@microsoft.com> Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-10-02 16:42:55 +01:00
Praveen Paladugu	4b32252028	hypervisor, vmm: fix clippy warnings Signed-off-by: Praveen Paladugu <prapal@microsoft.com>	2020-09-26 14:07:12 +01:00
Henry Wang	89a6b63e6e	hypervisor: Implement `get_device_attr` method for AArch64 This commit implements the `get_device_attr` method for the `KVM_GET_DEVICE_ATTR` ioctl. This ioctl will be used in retrieving the GIC status. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2020-09-23 12:37:25 +01:00
Henry Wang	ffafeda4b6	AArch64: Implement AArch64 vCPU states save/restore This commit adds methods to save/restore AArch64 vCPU registers, including: 1. The AArch64 `VcpuKvmState` structure. 2. Some `Vcpu` trait methods of the `KvmVcpu` structure to enable the save/restore of the AArch64 vCPU states. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2020-09-23 12:37:25 +01:00
Henry Wang	e3d45be6f7	AArch64: Preparation for vCPU save/restore This commit ports code from firecracker and refactors the existing AArch64 code as the preparation for implementing save/restore AArch64 vCPU, including: 1. Modification of `arm64_core_reg` macro to retrive the index of arm64 core register and implemention of a helper to determine if a register is a system register. 2. Move some macros and helpers in `arch` crate to the `hypervisor` crate. 3. Added related unit tests for above functions and macros. Signed-off-by: Henry Wang <Henry.Wang@arm.com>	2020-09-23 12:37:25 +01:00
Rob Bradford	da642fcf7f	hypervisor: Add "HyperV" exit to list of KVM exits Currently we don't need to do anything to service these exits but when the synthetic interrupt controller is active an exit will be triggered to notify the VMM of details of the synthetic interrupt page. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-09-16 16:08:01 +01:00
Rob Bradford	9b48ee38cb	hypervisor: Support enabling HyperV synthetic interrupt controller This adds a KVM HyperV synthetic interrupt controller in place of the emulated PIC. Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-09-16 16:08:01 +01:00
Wei Liu	53f4fed516	hypervisor: drop get_api_version from Hypervisor trait The new function already checks if the API version is compatible. There is no need to expose the get_api_version function to code outside hypervisor crate. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-09-07 11:59:08 +01:00
Wei Liu	d73971e407	hypervisor: kvm: check API compatibility Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-09-07 11:59:08 +01:00
Michael Zhao	afc98a5ec9	vmm: Fix AArch64 clippy warnings of vmm and other crates Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-08-24 10:59:08 +02:00
Muminul Islam	92b4499c1e	vmm, hypervisor: Add vmstate to snapshot and restore path Signed-off-by: Muminul Islam <muislam@microsoft.com>	2020-08-24 08:48:15 +02:00
Muminul Islam	77e901a602	hypervisor: Introduce VM state to Vm hypervisor trait We may need to store hypervisor speciific data to the VM. This support is needed for Microsoft hyperv implementations. This patch introduces two new definitions to Vm trait and implements for KVM. Signed-off-by: Muminul Islam <muislam@microsoft.com>	2020-08-24 08:48:15 +02:00
Sebastien Boeuf	0f1ab38ded	hypervisor: kvm: Make MSRs set/get more flexible Based on the way KVM_GET_MSRS and KVM_SET_MSRS work, both function are very unlikely to fail, as they simply stop looping through the list of MSRs as soon as getting or setting one fails. This is causing some issues with the snapshot/restore feature, as on some platforms, we only save a subset of the list of MSRs, leading to unproper way of saving the VM. The way to address this issue is by checking the number of MSRs get/set matches the expected amount from the list. In case it does not match, we simply ignore the failing MSR and continue getting/setting the rest of the list. By doing this by iterations, we end up getting/setting as many MSRs as the platform can support. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-08-05 14:52:35 +01:00
Michael Zhao	ddf1b76906	hypervisor: Refactor create_passthrough_device() for generic type Changed the return type of create_passthrough_device() to generic type hypervisor::Device. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-21 16:22:02 +02:00
Michael Zhao	e7288888cf	hypervisor: Extend hypervisor crate with Device trait Added Device trait and KvmDevice struct for KVM-emulated devices. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-21 16:22:02 +02:00
Wei Liu	e1af251c9f	vmm, hypervisor: adjust set_gsi_routing / set_gsi_routes Make set_gsi_routing take a list of IrqRoutingEntry. The construction of hypervisor specific structure is left to set_gsi_routing. Now set_gsi_routes, which is part of the interrupt module, is only responsible for constructing a list of routing entries. This further splits hypervisor specific code from hypervisor agnostic code. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-20 07:32:32 +02:00
Wei Liu	ff8d7bfe83	hypervisor: add create_passthrough_device call to Vm trait That function is going to return a handle for passthrough related operations. Move create_kvm_device code there. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-17 20:21:39 +02:00
Sebastien Boeuf	e10d9b13d4	arch, hypervisor, vmm: Patch CPUID subleaves to expose EPC sections The support for SGX is exposed to the guest through CPUID 0x12. KVM passes static subleaves 0 and 1 from the host to the guest, without needing any modification from the VMM itself. But SGX also relies on dynamic subleaves 2 through N, used for describing each EPC section. This is not handled by KVM, which means the VMM is in charge of setting each subleaf starting from index 2 up to index N, depending on the number of EPC sections. These subleaves 2 through N are not listed as part of the supported CPUID entries from KVM. But it's important to set them as long as index 0 and 1 are present and indicate that SGX is supported. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-07-15 15:08:56 +02:00
Michael Zhao	cce6237536	pci: Enable GSI routing (MSI type) for AArch64 In this commit we saved the BDF of a PCI device and set it to "devid" in GSI routing entry, because this field is mandatory for GICv3-ITS. Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-14 14:34:54 +01:00
Michael Zhao	82a0e29c7a	hypervisor: Export check_extension() API from hypervisor::Vm Signed-off-by: Michael Zhao <michael.zhao@arm.com>	2020-07-14 14:34:54 +01:00
Wei Liu	a4f484bc5e	hypervisor: Define a VM-Exit abstraction In order to move the hypervisor specific parts of the VM exit handling path, we're defining a generic, hypervisor agnostic VM exit enum. This is what the hypervisor's Vcpu run() call should return when the VM exit can not be completely handled through the hypervisor specific bits. For KVM based hypervisors, this means directly forwarding the IO related exits back to the VMM itself. For other hypervisors that e.g. rely on the VMM to decode and emulate instructions, this means the decoding itself would happen in the hypervisor crate exclusively, and the rest of the VM exit handling would be handled through the VMM device model implementation. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com> Fix test_vm unit test by using the new abstraction and dropping some dead code. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-06 12:59:43 +01:00
Wei Liu	cfa758fbb1	vmm, hypervisor: introduce and use make_user_memory_region This removes the last KVM-ism from memory_manager. Also make use of that method in other places. Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-07-06 12:31:19 +02:00
Samuel Ortiz	618722cdca	hypervisor: cpu: Rename state getter and setter vcpu.{set_}cpu_state() is a stutter. Signed-off-by: Samuel Ortiz <sameo@linux.intel.com>	2020-07-06 09:35:30 +01:00
Rob Bradford	8e43f886e1	build: Bump kvm-ioctls dependency after rebase ch branch is now rebased on latest upstream master Signed-off-by: Rob Bradford <robert.bradford@intel.com>	2020-07-01 18:46:05 +02:00
Sebastien Boeuf	e35d4c5b28	hypervisor: Store all supported MSRs On x86 architecture, we need to save a list of MSRs as part of the vCPU state. By providing the full list of MSRs supported by KVM, this patch fixes the remaining snapshot/restore issues, as the vCPU is restored with all its previous states. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-30 14:03:03 +01:00
Sebastien Boeuf	49b4fba283	hypervisor: Retrieve list of supported MSRs Add a new function to the hypervisor trait so that the caller can retrieve the list of MSRs supported by this hypervisor. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-30 14:03:03 +01:00
Sebastien Boeuf	e2b5c78dc5	hypervisor: Re-order vCPU state for storing and restoring Some vCPU states such as MP_STATE can be modified while retrieving other states. For this reason, it's important to follow a specific order that will ensure a state won't be modified after it has been saved. Comments about ordering requirements have been copied over from Firecracker commit 57f4c7ca14a31c5536f188cacb669d2cad32b9ca. This patch also set the previously saved VCPU_EVENTS, as this was missing from the restore codepath. Signed-off-by: Sebastien Boeuf <sebastien.boeuf@intel.com>	2020-06-30 14:03:03 +01:00
Wei Liu	24c051c663	vmm: hypervisor: drop duplicate comment Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-29 21:51:59 +01:00
Wei Liu	2518b9e3cd	vmm: hypervisor: fix white space issues Signed-off-by: Wei Liu <liuwe@microsoft.com>	2020-06-29 21:51:59 +01:00

1 2 3 4

160 Commits