libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-10-05 22:05:47 +00:00

Author	SHA1	Message	Date
Jonathon Jongsma	e04373794c	qemu: log error output from nbdkit log stderr and stdout from nbdkit into its own log so that nbdkit-related issues can be debugged more easily. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	e498941476	qemu: move qemuProcessReadLog() to qemuLogContext This code can be used by the nbdkit implementation for reading back filtered log data for error reporting. Move it to qemuLogContext so that it can be shared. Renamed to qemuLogContextReadFiltered(). Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	b658b1a27e	qemu: Extract qemuDomainLogContext into a new file This will allow us to use it for nbdkit logging in upcoming commits. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	abdc4f2092	Generalize qemuDomainLogContextNew() Allow to specify a basename for the log file so that qemuDomainLogContextNew() can be used to create log contexts for secondary loggers. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	8d73929e04	qemu: add functions to start and stop nbdkit Add some helper functions to build a virCommand object and run the nbdkit process for a given virStorageSource. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	4a632de2ac	qemu: query nbdkit module dir from binary Rather than hard-coding the nbdkit module directory, query the nbdkit binary for the location to these directories. nbdkit provides a --dump-config optiont that outputs this information and can be easily parsed. We can also get the version from this output rather than executing `nbdkit --version` separately. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	1475c8c0d7	qemu: Add qemuNbdkitProcess An object for storing information about a nbdkit process that is serving a specific virStorageSource. At the moment, this information is just stored in the private data of virStorageSource and not used at all. Future commits will use this data to actually start a nbdkit process. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	0c00059a72	qemu: use file cache for nbdkit caps Add the virFileCache implementation for nbdkit capabilities to the qemu driver. This allows us to determine whether nbdkit is installed and which plugins are supported. it also has persistent caching and the capabilities are re-queried whenever something changes. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	d0fd04556c	qemu: implement persistent file cache for nbdkit caps Implement the loadFile and saveFile virFileCacheHandlers callbacks so that nbdkit capabilities are cached perstistently across daemon restarts. The format and implementation is modeled on the qemu capabilities, but simplified slightly. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:50 -05:00
Jonathon Jongsma	9c0b83a9ed	qemu: implement basic virFileCache for nbdkit caps Preparatory step for caching nbdkit capabilities. This patch implements the newData and isValid virFileCacheHandlers callback functions. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:49 -05:00
Jonathon Jongsma	03afd10cd3	qemu: expand nbdkit capabilities In order to add caching of the nbdkit capabilities, we will need to compare against file modification times, etc. So look up this information when creating the nbdkit caps. Add a nbdkit_moddir build option to allow the builder to specify the location to look for nbdkit plugins and filters. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:49 -05:00
Jonathon Jongsma	6b7e6b73a9	qemu: Add functions for determining nbdkit availability In future commits, we will optionally use nbdkit to serve some remote disk sources. This patch queries to see whether nbdkit is installed on the host and queries it for capabilities. The data will be used in later commits. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-19 14:28:49 -05:00
Michal Privoznik	732c5f4270	src: Avoid needless checks before calling g_strdup() There are few places where the following pattern occurs: if (var) other = g_strdup(var); where @other wasn't initialized before g_strdup(). Checking for var != NULL is useless in this case, as that's exactly what g_strdup() does (in which case it returns NULL). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-18 14:46:24 +02:00
Michal Privoznik	452695926d	virStorageNetHostDef: Convert 'transport' field to proper enum type Convert the field and adjust the XML parsers to use virXMLPropEnumDefault(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:20 +02:00
Michal Privoznik	2dd6369477	virDomainDeviceInfo: Convert 'type' field to proper enum type Convert the field, adjust the XML parsers to use virXMLPropEnum() and fill in missing cases to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:17 +02:00
Michal Privoznik	a6eadf00d2	virDomainWatchdogDef: Convert 'model' field to proper enum type Convert the field and adjust the XML parsers to use virXMLPropEnum(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:13 +02:00
Michal Privoznik	b691d249d1	virDomainRNGDef: Convert 'backend' field to proper enum type Convert the field and adjust the XML parser to use virXMLPropEnum(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:11 +02:00
Michal Privoznik	53edfa23f1	virDomainRNGDef: Convert 'model' field to proper enum type Convert the field and adjust the XML parser to use virXMLPropEnum(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:09 +02:00
Michal Privoznik	27a653b893	virDomainControllerDef: Convert 'type' field to proper enum type Convert the field and fill in missing cases to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:06 +02:00
Michal Privoznik	44f0371c8c	virDomainHostdevSubsys: Convert 'type' field to proper enum type Convert the field, and fill in missing cases to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:35:04 +02:00
Michal Privoznik	7bdab79e40	virDomainHostdevDef: Convert 'mode' field to proper enum type Convert the field, adjust the XML parser to use virXMLPropEnumDefault() and fill in missing cases to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:59 +02:00
Michal Privoznik	d537f203b9	virDomainHostdevSubsysSCSI: Convert 'protocol' field to proper enum type Convert the field and adjust the XML parser to use virXMLPropEnum(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:54 +02:00
Michal Privoznik	f7534c2573	src: Drop needless typecast to virDomainDiskBus The 'bus' member of _virDomainDiskDef is already declared of virDomainDiskModel type. Hence, there is no need to typecast the variable when passing to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:52 +02:00
Michal Privoznik	0041723233	qemu_domain_address: Drop needless typecast to virDomainDiskModel The 'mode' member of _virDomainDiskDef is already declared of virDomainDiskModel type. Hence, there is no need to typecast the variable when passing to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:50 +02:00
Michal Privoznik	75cdb664d0	src: Drop needless typecast to virDomainDeviceType The 'type' member of _virDomainDeviceDef is already declared of virDomainDeviceType type. Hence, there is no need to typecast the variable when passing to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:48 +02:00
Michal Privoznik	3926f1e688	qemu: Drop needless typecast to virDomainFSDriverType The 'fsdriver' member of _virDomainFSDef is already declared of virDomainFSDriverType type. Hence, there is no need to typecast the variable when passing to switch() statements. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:44 +02:00
Michal Privoznik	8435be3923	qemu_domain_address: Drop needless typecast in qemuDomainDeviceCalculatePCIConnectFlags() Inside of qemuDomainDeviceCalculatePCIConnectFlags() there's a switch() which typecasts a variable of virDomainHostdevSubsysSCSIVHostModelType type to the very same type. This is useless. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-15 10:34:41 +02:00
Jonathon Jongsma	4ef2bcfd3f	qemu: Implement support for vDPA block devices Requires recent qemu with support for the virtio-blk-vhost-vdpa device and the ability to pass a /dev/fdset/N path for the vdpa path (8.1.0) Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1900770 Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-12 11:06:41 -05:00
Jonathon Jongsma	2efa9ba66a	qemu: consider vdpa block devices for memlock limits vDPA block devices will also need the same consideration for memlock limits as other vdpa devices, so consider these devices when calculating memlock limits. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-12 11:06:41 -05:00
Jonathon Jongsma	0ebb416d7e	qemu: make vdpa connect function more generic qemuInterfaceVDPAConnect() was a helper function for connecting to the vdpa device file. But in order to support other vdpa devices besides network interfaces (e.g. vdpa block devices) make this function a bit more generic. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-12 11:06:41 -05:00
Jonathon Jongsma	6cf7dbeff8	qemu: add virtio-blk-vhost-vdpa capability Check whether the qemu binary supports the vdpa block driver. We can't rely simply on the existence of the virtio-blk-vhost-vdpa block driver since the first releases of qemu didn't support fd-passing for this driver. So we have to check for the 'fdset' feature on the driver object. This feature will be present in the qemu 8.1.0 release and was merged to qemu in commit 98b126f5. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-12 11:06:41 -05:00
Jonathon Jongsma	1df106cc20	conf: add ability to configure a vdpa block disk device vDPA block devices can be configured as follows: <disk type='vhostvdpa'> <source dev='/dev/vhost-vdpa-0'/> </disk> Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-12 11:06:41 -05:00
Peter Krempa	58d9fe7202	qemu: capabilities: Remove 'virQEMUCapsAddCPUDefinitions' The function was used only to fill the cpu models into fake capabilities, whic no longer exists. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:46 +02:00
Peter Krempa	33f27cc978	qemu: capabilities: Unexport 'virQEMUCapsAddMachine' It's no longer needed in tests as we are no longer adding fake machines. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:46 +02:00
Peter Krempa	ba3ee74987	virQEMUCapsAddCPUDefinitions: Fill in cpu type in data and node it's used only in tests 'virQEMUCapsAddCPUDefinitions' is used solely to populate fake cpu models for the fake-caps tests. Note that and also populate the 'type' field so that default cpu type can be propagated properly. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:44 +02:00
Peter Krempa	c9f3f578cd	qemu_capabilities: Retire QEMU_CAPS_DEVICE_VFIO_PCI Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:43 +02:00
Peter Krempa	cbb3adde61	qemu: Always assume support for QEMU_CAPS_DEVICE_VFIO_PCI The 'display' option for the 'vfio-pci' device was added in qemu-2.12 and can't be compiled out. Assume support for the flag. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:43 +02:00
Peter Krempa	aaec42dbdf	qemu: capabilities: Retire QEMU_CAPS_CPU_CACHE Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:43 +02:00
Peter Krempa	917cf3e164	qemu: Always assume support for QEMU_CAPS_CPU_CACHE All supported qemu versions have this feature and it can't be compiled out. The logic is a bit more complex in this instance as the flag is asserted if: if (ARCH_IS_X86(qemuCaps->arch) && virQEMUCapsGet(qemuCaps, QEMU_CAPS_QUERY_CPU_MODEL_EXPANSION)) { virQEMUCapsSet(qemuCaps, QEMU_CAPS_CPU_CACHE); } Now QEMU_CAPS_QUERY_CPU_MODEL_EXPANSION is available since qemu-2.8 but only on certain architectures, thus we need to keep the flag itself, but x86_64 is one of them. The flag can be also assumed as qemuValidateDomainDefCpu rejects any cache config on non-x86 arches. Remove any use of the capability and drop the impossible test cases. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:43 +02:00
Peter Krempa	748dc3b1d3	qemu: capabilities: Preserve location of the default machine type when stripping aliases for tests Internally the preferred machine which is 'pc' for x86_64 must be kept in the first place in the array of machines. This was not the case when stripping the machine aliases for use in tests (so that test output stays stable) where we've created a new entry for the alias. This means that the original name (e.g. pc-i440fx-8.1) stayed in the first place. To fix this we now swap the names around and create a new entry at the end for the specific type. Additionally the default flag is not propagated to the copy. This is also visible now in the output of 'qemuxml2xmltest' as the test cases which use the default machine type return to 'pc' instead of the more specific name. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:42 +02:00
Peter Krempa	b816b41a65	qemu: capabilities: Retire QEMU_CAPS_ICH9_DISABLE_S* and QEMU_CAPS_ICH9_DISABLE_S* capabilities Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:42 +02:00
Peter Krempa	c225398015	qemu: Always assume support for QEMU_CAPS_PIIX_DISABLE_S* and QEMU_CAPS_ICH9_DISABLE_S* The support for PIIX power management was added in qemu commit v1.0-3094-g459ae5ea5a and the suport for ICH9 power management was added in qemu commit v2.2.0-542-g6ac0d8d44c and both can't be compiled out. This means we can always assume support for these features. Remove the validation and impossible tests. Move relevant bits from 'q35-pm-disable' to 'q35' test case. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-09-06 14:09:42 +02:00
Peter Krempa	24b769a25b	qemu: capabilities: Remove unused 'virQEMUCapsFilterByMachineType' The filtering of qemu capabilities by machine type doesn't seem to be ever used, remove it and adjust callers. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-09-04 10:31:52 +02:00
Peter Krempa	d0601b4267	qemu: capabilities: Retire QEMU_CAPS_QUERY_HOTPLUGGABLE_CPUS All qemu versions have that command and cpu hotplug code now directly probes the machine type. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-09-04 10:31:52 +02:00
Peter Krempa	4d82bce836	qemuDomainSupportsVcpuHotplug: Base return value on virQEMUCapsGetMachineHotplugCpus The QEMU_CAPS_QUERY_HOTPLUGGABLE_CPUS flag is always asserted as all qemu versions support the command and selectively cleared when copying the capabilities for VM use if given machine type does not support cpu hotplug. Rework this to directly probe the machine as we now populate the data also when re-connecting to a qemu instance after daemon restart, so that the capability can be removed. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-09-04 10:31:52 +02:00
Peter Krempa	a9e71cb737	qemu: process: Probe machine type data on reconnect to qemu When reconnecting we populate only the capability flags from the XML as we need to know the exact flags that were present when starting the VM. On the other hand the machine type data is not stored as it wasn't really used after startup. While storing all of the data into the status XML would be theoretically possible, with machine-type specific data it makes no sense to do so, and thus the data can be re-probed from the current instance. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-09-04 10:31:52 +02:00
Peter Krempa	e0a5787bff	qemu: capabilities: Export functions necessary for probing machine types Upcoming patch will re-probe machines from the current qemu instance to populate the private copy of qemuCaps after reconnecting to a running instance. This is needed to be able to access the machine type data, while storing them in the status XML seems to be an overkill, for information which can be easily reprobed. Export 'virQEMUCapsInitQMPArch' needed to populate the 'arch' field and 'virQEMUCapsProbeQMPMachineTypes'. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-09-04 10:31:52 +02:00
Peter Krempa	c7a0a0fe11	qemu: Rename qemuDomainSupportsNewVcpuHotplug to qemuDomainSupportsVcpuHotplug Support for legacy cpu hotplug was removed a long time ago. At this point this function only checks whether the current machine type supports cpu hotplug. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-09-04 10:31:52 +02:00
Michal Privoznik	895525db81	qemu: Move error messages onto a single line Error messages are exempt from the 80 columns rule. Move them onto one line. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2023-09-04 09:35:36 +02:00
Pavel Hrdina	0873d63b09	Revert "capabilities: report full external snapshot support" Reverting external snapshot for running VM doesn't work correctly so we should not report this capability until it is fixed. This reverts commit `de71573bfe`. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-09-01 12:09:04 +02:00
Kristina Hanicova	18705b03c6	qemu: add support for discard_granularity This commit adds building of `discard_granularity` disk option for qemu commandline. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1849570 Signed-off-by: Kristina Hanicova <khanicov@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-25 15:05:22 +02:00
Kristina Hanicova	96d8ee2cff	conf: add support for discard_granularity This introduces the ability to set the discard granularity option for a disk. It defines the smallest amount of data that can be discarded in a single operation (useful for managing and optimizing storage). However, most hypervisors automatically set the proper discard granularity and users usually do not need to change the default setting. Signed-off-by: Kristina Hanicova <khanicov@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-25 15:05:13 +02:00
Laine Stump	10e8a518a0	qemu: turn two multiline log messages into single line Normally I wouldn't bother with a change like this, but I was touching the function anyway, and wanted to leave it looking nice and tidy. Signed-off-by: Laine Stump <laine@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-24 23:36:18 -04:00
Laine Stump	24beaffec3	node_device: support binding other drivers with virNodeDeviceDetachFlags() In the past, the only allowable values for the "driver" field of virNodeDeviceDetachFlags() were "kvm" or "vfio" for the QEMU driver, and "xen" for the libxl driver. Then "kvm" was deprecated and removed, so the driver name became essentially irrelevant (because it is always called via a particular hypervisor driver, and so the "xen" or "vfio" can be (and almost always is) implied. With the advent of VFIO variant drivers, the ability to explicitly specify a driver name once again becomes useful - it can be used to name the exact VFIO driver that we want bound to the device in place of vfio-pci, so this patch allows those other names to be passed down the call chain, where the code in virpci.c can make use of them. The names "vfio", "kvm", and "xen" retain their special meaning, though: 1) because there may be some application or configuration that still calls virNodeDeviceDetachFlags() with driverName="vfio", this single value is substituted with the synonym of NULL, which means "bind the default driver for this device and hypervisor". This will currently result in the vfio-pci driver being bound to the device. 2) in the case of the libxl driver, "xen" means to use the standard driver used in the case of Xen ("pciback"). 3) "kvm" as a driver name always results in an error, as legacy KVM device assignment was removed from the kernel around 10 years ago. Signed-off-by: Laine Stump <laine@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-24 23:36:18 -04:00
Pavel Hrdina	de71573bfe	capabilities: report full external snapshot support Now that deleting and reverting external snapshots is implemented we can report that in capabilities so management applications can use that information and start using external snapshots. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-24 19:08:20 +02:00
Peter Krempa	24310b2b7f	qemuValidateDomainVCpuTopology: Remove misconfiguration warning Since commit `baca59a538` the NUMA definition is automatically fixed if the vCPU count mismatches the NUMA cpu count so that this warning will never be triggered. Additionally VIR_WARN of a misconfiguration of a VM would not really be seen in most cases as it's only simply logged. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-24 14:18:45 +02:00
Michal Privoznik	6a0b645537	src: Rename some members of _virDomainMemoryDef struct As advertised earlier, now that the _virDomainMemoryDef struct is cleaned up, we can shorten some names as their placement within unions define their use. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-24 12:39:29 +02:00
Michal Privoznik	7d01b67323	src: Move _virDomainMemoryDef target nodes into an union The _virDomainMemoryDef struct is getting a bit messy. It has various members and only some of them are valid for given model. Worse, some are re-used for different models. We tried to make this more bearable by putting a comment next to each member describing what models the member is valid for, but that gets messy too. Therefore, do what we do elsewhere: introduce an union of structs and move individual members into their respective groups. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-24 12:39:26 +02:00
Michal Privoznik	f23a991bea	src: Move _virDomainMemoryDef source nodes into an union The _virDomainMemoryDef struct is getting a bit messy. It has various members and only some of them are valid for given model. Worse, some are re-used for different models. We tried to make this more bearable by putting a comment next to each member describing what models the member is valid for, but that gets messy too. Therefore, do what we do elsewhere: introduce an union of structs and move individual members into their respective groups. This allows us to shorten some names (e.g. nvdimmPath or sourceNodes) as their purpose is obvious due to their placement. But to make this commit as small as possible, that'll be addressed later. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-24 12:39:23 +02:00
Michal Privoznik	a03b6cedfd	qemu_driver: validate mem->model on MEMORY_DEVICE_SIZE_CHANGE event When guest acknowledges change in size of virtio-mem (portion that's exposed to the guest), QEMU emits MEMORY_DEVICE_SIZE_CHANGE event. We process it in processMemoryDeviceSizeChange(). So far, QEMU emits the even only for virtio-mem (as that's the only memory device model that allows live changes to its size). Nevertheless, if this ever changes, validate the memory model upon processing the event as the rest of the code blindly expects virtio-mem model. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-24 12:39:21 +02:00
Michal Privoznik	10d38f46c7	qemu_hotplug: Don't validate inaccessible fields in qemuDomainChangeMemoryLiveValidateChange() The qemuDomainChangeMemoryLiveValidateChange() function is called when a live memory device change is requested (via virDomainUpdateDeviceFlags()). Currently, the only model that is allowed to change is VIRTIO_MEM (and the only value that's allowed to change is requestedsize). The aim of the function is to check whether the change user requested follows this rule. And in accordance with defensive programming I made the function check all virDomainMemoryDef struct members. Even those which are unused for VIRTIO_MEM model. Drop these checks as the respective members will be inaccessible soon (as the struct is refined). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-24 12:39:15 +02:00
Michal Privoznik	0b7af96492	qemu_hotplug: validate address on memory device change As of v7.9.0-rc1~296 users have ability to adjust what portion of virtio-mem is exposed to the guest. Then, as of v9.4.0-rc2~5 they have ability to set address where the memory is mapped. But due to a missing check it was possible to feed virDomainUpdateDeviceFlags() API with memory device XML that changes the address. This is of course not possible and should be forbidden. Add the missing check. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-24 12:39:13 +02:00
Ján Tomko	2bad705ebb	qemu: remove pointless qemuDomainLogContextMode Since its introduction in `4d1b771fbb` it has only been used to differentiate between START and non-START. Last use of QEMU_DOMAIN_LOG_CONTEXT_MODE_ATTACH was removed by: commit `f709377301` qemu: Fix qemuDomainObjTaint with virtlogd QEMU_DOMAIN_LOG_CONTEXT_MODE_STOP is unused since: commit `cf3ea0769c` qemu: process: Append the "shutting down" message using the new APIs Now, the only caller passes QEMU_DOMAIN_LOG_CONTEXT_MODE_START. Assume that's always the case and remove the 'mode' argument. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-23 15:25:29 +02:00
Pavel Hrdina	cb3bc96e67	qemu_snapshot: allow snapshot revert for external snapshots Now that the support to revert external snapshots is implemented we can drop this check. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:43 +02:00
Pavel Hrdina	88d0e6748d	qemu_snapshot: add checks for external snapshot deletion With the introduction of external snapshot revert support we need to error out in some cases when trying to delete some snapshots. If users reverts to non-leaf snapshots and would try to delete it after the revert is done it would not work currently as this operation would require using block-stream which is not implemented for now as in this case the snapshot has two children so the disk files have multiple overlays. Similarly if user reverts to non-leaf snapshot and would try to delete snapshot that is non-leaf but not in currently active snapshot chain we would still need to use block-commit operation. The issue here is that in order to do that we would have to start new qemu process with different domain definition than what is currently used by the domain. If the current domain would be running it would complicate things even more so this operation is not yet supported. If user creates new snapshot after reverting to non-leaf snapshot it creates a new branch. Deleting snapshot with multiple children will require block-stream which is not implemented for now. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:42 +02:00
Pavel Hrdina	daf48171f1	qemu_snapshot: check only once if snapshot is external There will be more external snapshot checks introduced by following patch so group them together. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:40 +02:00
Pavel Hrdina	32cf4ed5cd	qemu_snapshot: update backing store after deleting external snapshot With introduction of external snapshot revert we will have to update backing store of qcow images not actively used be QEMU manually. The need for this patch comes from the fact that we stop and start QEMU process therefore after revert not all existing snapshots will be known to that QEMU process due to reverting to non-leaf snapshot or having multiple branches. We need to loop over all existing snapshots and check all disks to see if they happen to have the image we are deleting as backing store and update them to point to the new image except for images currently used by the running QEMU process doing the merge operation. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:36 +02:00
Pavel Hrdina	47821b2ff5	qemuDomainGetImageIds: pass domain definition directly We only need the domain definition from domain object. This will allow us to use it from snapshot code where we need to pass different domain definition. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:34 +02:00
Pavel Hrdina	1392be7092	qemu_snapshot: remove revertdisks when creating new snapshot When user creates a new snapshot after reverting to non-leaf snapshot we no longer need to store the temporary overlays as they will be part of the VM XMLs stored in the newly created snapshot. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:30 +02:00
Pavel Hrdina	a68b849c51	qemu_snapshot: delete: properly update parent snapshot with revert data When deleting external snapshot and parent snapshot is the currently active snapshot as user reverted to it we need to properly update the parent snapshot metadata. After the delete is done the new overlay files will be the currently used files created when snapshot revert was done, replacing the original overlay files. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:29 +02:00
Pavel Hrdina	7c402396ec	qemu_snapshot: add support to delete external snapshot without block commit When block commit is not needed we can just simply unlink the disk files. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:27 +02:00
Pavel Hrdina	d84538fbad	qemu_snapshot: prepare data for non-active leaf external snapshot deletion In this case there is no need to run block commit and using qemu process at all. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:26 +02:00
Pavel Hrdina	a3152a506b	qemu_snapshot: add merge to external snapshot delete prepare data Before external snapshot revert every delete operation did block commit in order to delete a snapshot. But now when user reverts to non-leaf snapshot deleting leaf snapshot will not have any overlay files so we can just simply delete the snapshot images. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:25 +02:00
Pavel Hrdina	b0876595a3	qemu_snapshot: extract external snapshot delete prepare to function This part of code is about to grow to make deletion work when user reverts to non-leaf snapshot. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:23 +02:00
Pavel Hrdina	443221bbf1	qemu_snapshot: rename qemuSnapshotDeleteExternalPrepare The new name reflects that we prepare data for external snapshot deletion and the old name will be used later for different part of code. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:22 +02:00
Pavel Hrdina	8e66473781	qemu_snapshot: introduce external snapshot revert support When reverting to external snapshot we need to create new overlay qcow2 files from the disk files the VM had when the snapshot was taken. There are some specifics and limitations when reverting to a snapshot: 1) When reverting to last snapshot we need to first create new overlay files before we can safely delete the old overlay files in case the creation fails so we have still recovery option when we error out. These new files will not have the suffix as when the snapshot was created as renaming the original files in order to use the same file names as when the snapshot was created would add unnecessary complexity to the code. 2) When reverting to any snapshot we will always create overlay files for every disk the VM had when the snapshot was done. Otherwise we would have to figure out if there is any other qcow2 image already using any of the VM disks as backing store and that itself might be extremely complex and in some cases impossible. 3) When reverting from any state the current overlay files will be always removed as that VM state is not meant to be saved. It's the same as with internal snapshots. If user want's to keep the current state before reverting they need to create a new snapshot. For now this will only work if the current snapshot is the last. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:20 +02:00
Pavel Hrdina	96fa494f88	qemu_snapshot: use VIR_ASYNC_JOB_SNAPSHOT when reverting snapshot Both creating and deleting snapshot are using VIR_ASYNC_JOB_SNAPSHOT but reverting is using VIR_ASYNC_JOB_START. Let's unify it to make it consistent for all snapshot operations. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:18 +02:00
Pavel Hrdina	537f9136b7	qemu_snapshot: move external disk prepare to single function We will need to reuse the functionality when reverting external snapshots. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:16 +02:00
Pavel Hrdina	a8f3e2f104	qemuSnapshotCreateQcow2Files: use domain definition directly To create new overlay files when external snapshot revert support is introduced we will be using different domain definition than what is currently used by the domain. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:15 +02:00
Pavel Hrdina	9b94a9e8ab	qemu_snapshot: introduce qemuSnapshotCreateQcow2Files Extract creation of qcow2 files for external snapshots to separate function as we will need it for external snapshot revert code. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:11 +02:00
Pavel Hrdina	db9e253fc4	qemu_snapshot: use virDomainDiskByName while updating domain def When creating external snapshot this function is called only when the VM is not running so there is only one definition to care about. However, it will be used by external snapshot revert code for active and inactive definition and they may be different if a disk was (un)plugged only for the active or inactive definition. The current code would crash so use virDomainDiskByName() to get the correct disk from the domain definition based on the disk name and make sure it exists. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:10 +02:00
Pavel Hrdina	4edd9e297a	qemu_snapshot: introduce qemuSnapshotDomainDefUpdateDisk Extract the code that updates disks in domain definition while creating external snapshots. We will use it later in the external snapshot revert code. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:09 +02:00
Pavel Hrdina	8f9d96199d	virDomainSnapshotAlignDisks: Allow overriding user-configured snapshot default This new option will be used by external snapshot revert code. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-08-22 16:06:08 +02:00
Peter Krempa	ba2b2ad1bb	qemu: capabilities: Retire unused QEMU_CAPS_DEVICE_IVSHMEM qemu removed the support for the old 'ivshmem' device in 4.0 release. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-22 14:04:41 +02:00
Peter Krempa	dc7ca40a13	qemu: Retire 'ivshmem' device The device was removed in qemu-4.0 and is superseded by 'ivshmem-plain' and 'ivshmem-doorbell'. Always report error when the old version is used and drop the irrelevant tests. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-22 14:04:41 +02:00
Peter Krempa	4bc796a82f	qemuValidateDomainVCpuTopology: Always validate vcpu count against topology Historically we've used QEMU_CAPS_QUERY_HOTPLUGGABLE_CPUS as witness that the topology must cover the maximum number ov vcpus. qemu started to enforce this in qemu-2.5, thus we can now always do the check. This change also requires aligning the topology in certain test files. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-22 14:04:41 +02:00
Andrea Bolognani	10a8997cbb	conf: Don't default to raw format for loader/NVRAM Due to the way the information is stored by the XML parser, we've had this quirk where specifying any information about the loader or NVRAM would implicitly set its format to raw. That is, <nvram>/path/to/guest_VARS.fd</nvram> would effectively be interpreted as <nvram format='raw'>/path/to/guest_VARS.fd</nvram> forcing the use of raw format firmware even when qcow2 format would normally be preferred based on the ordering of firmware descriptors. This behavior can be worked around in a number of ways, but it's fairly unintuitive. In order to remove this quirk, move the selection of the default firmware format from the parser down to the individual drivers. Most drivers only support raw firmware images, so they can unconditionally set the format early and be done with it; the QEMU driver, however, supports multiple formats and so in that case we want this default to be applied as late as possible, when we have already ruled out the possibility of using qcow2 formatted firmware images. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:34 +02:00
Andrea Bolognani	b845e376a4	qemu: Match NVRAM template extension for new domains Keep things consistent by using the same file extension for the generated NVRAM path as the NVRAM template. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:32 +02:00
Andrea Bolognani	e96e322725	qemu: Filter firmware based on loader.readonly If the user included loader.readonly=no in the domain XML, we should not pick a firmware build that expects to work with loader.readonly=yes. https://bugzilla.redhat.com/show_bug.cgi?id=2196178 Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:31 +02:00
Andrea Bolognani	ccbb987707	qemu: Generate NVRAM path in more cases Right now, we only generate it after finding a matching entry either among firmware descriptors or in the legacy firmware list. Even if the domain is configured to use a custom firmware build that we know nothing about, however, we should still automatically generate the NVRAM path instead of requiring the user to provide it manually. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:30 +02:00
Andrea Bolognani	4a49114ff4	qemu: Don't overwrite NVRAM template for legacy firmware Just because we have found a matching entry, it doesn't mean that we should discard the information explicitly provided in the domain XML. https://bugzilla.redhat.com/show_bug.cgi?id=2196178 https://gitlab.com/libvirt/libvirt/-/issues/500 Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:28 +02:00
Andrea Bolognani	ac76386eda	qemu: Fix lookup against stateless/combined pflash Just like the more common split builds, these are of type QEMU_FIRMWARE_DEVICE_FLASH; however, they have no associated NVRAM template, so we can't access the corresponding structure member unconditionally or we'll trigger a crash. https://bugzilla.redhat.com/show_bug.cgi?id=2196178 Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:20 +02:00
Andrea Bolognani	d917883b30	qemu: Fix return value for qemuFirmwareFillDomainLegacy() The documentation states that, just like the Modern() variant, this function should return 1 if a match wasn't found. It currently doesn't do that, and returns 0 instead. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-21 13:51:19 +02:00
Michal Privoznik	b07640bb43	qemu_domain: Drop unused variables from qemuDomainChrDefDropDefaultPath() In mu previous commits I've moved internals of qemuDomainChrDefDropDefaultPath() into a separate function (qemuDomainChrMatchDefaultPath()) but forgot to remove @buf and @regexp variables which are now unused. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2023-08-17 17:43:54 +02:00
Michal Privoznik	8abc979bb0	qemu: Move channelTargetDir into stateDir For historical reasons (i.e. unknown reason) we put channel sockets into a path derived from cfg->libDir which is a path that survives host reboots (e.g. /var/lib/libvirt/...). This is not necessary and in fact for session daemon creates a longer prefix: XDG_CONFIG_HOME -> /home/user/.config XDG_RUNTIME_DIR -> /run/user/1000 Worse, if host is rebooted suddenly (e.g. due to power loss) then we leave files behind and nobody will ever remove them. Therefore, place the channel target dir into state dir. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2173980 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2023-08-17 17:22:09 +02:00
Michal Privoznik	d3759d3674	qemu: Generate shorter channel target paths A <channel/> device is basically an UNIX socket into guest. Whatever is sent from the host, appears in the guest and vice versa. But because of that, the length of the path to the socket is important (underscored by fact that we derive the path from domain short name). But there are still cases where we might not fit into UNIX_PATH_MAX limit (usually 108 characters), because the path is derived also from other variables, e.g. XDG_CONFIG_HOME for session domains. There are two components though, that are needless: "/target/" and "domain-" prefix. Drop them. This is safe to do, because running domains have their path saved in status XML and even though paths are dropped on migration, they are not part of guest ABI and thus we are free to change them. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2023-08-17 17:19:01 +02:00
Peter Krempa	ba8ae0532b	qemuMigrationSrcBeginPhase: Require storage migration when 'migrate_disks' parameter is specified If a user passes a list of disks to migrate but don't actually use 'VIR_MIGRATE_NON_SHARED_DISK' or 'VIR_MIGRATE_NON_SHARED_INC' flags the parameter would be simply ignored without informing the user of the error. Add a proper error in such case. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-10 14:53:27 +02:00
Peter Krempa	deac6f017e	qemuMigrationSrcBeginPhase: Properly report error when non-shared storage migration is requested over tunnel When VIR_MIGRATE_TUNNELLED is used without VIR_MIGRATE_NON_SHARED_DISK/VIR_MIGRATE_NON_SHARED_INC an error was reported without actually returning failure. This was caused by a refactor which dropped many error paths. Fixes: `6111b23522` Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-08-10 14:53:26 +02:00
Daniel P. Berrangé	3d2f3fb72b	src: fix max file limits in systemd services This fixes commit `38abf9c34d` Author: Daniel P. Berrangé <berrange@redhat.com> Date: Wed Jun 21 13:22:40 2023 +0100 src: set max open file limit to match systemd >= 240 defaults The bug referenced in that commit had suggested to set LimitNOFile=512000:1024 on the basis that matches current systemd default behaviour and is compatible with old systemd. That was good except * The setting is LimitNOFILE and these are case sensitive * The hard and soft limits were inverted - soft must come first and so it would have been ignored even if the setting name was correct. * The default hard limit is 524288 not 512000 Reported-by: Olaf Hering <olaf@aepfle.de> Reviewed-by: Martin Kletzander <mkletzan@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-08-04 13:00:52 +01:00
Michal Privoznik	1ca3c339a1	lib: Prefer sizeof(variable) instead of sizeof(type) in memset If one of previous commits taught us something, it's that: sizeof(variable) and sizeof(type) are not the same. Especially because for live enough code the type might change (e.g. as we use autoptr more). And since we don't get any warnings when an incorrect length is passed to memset() it is easy to mess up. But with sizeof(variable) instead, it's not as easy. Therefore, switch to using memset(variable, 0, sizeof(*variable)), or its alternatives, depending on level of pointers. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Claudio Fontana <cfontana@suse.de>	2023-08-03 16:41:19 +02:00
Michal Privoznik	4f159d4269	lib: Finish using struct zero initializer manually There are some cases left after previous commit which were not picked up by coccinelle. Mostly, becuase the spatch was not generic enough. We are left with cases like: two variables declared on one line, a variable declared in #ifdef-s (there are notoriously difficult for coccinelle), arrays, macro definitions, etc. Finish what coccinelle started, by hand. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Claudio Fontana <cfontana@suse.de>	2023-08-03 16:41:19 +02:00
Michal Privoznik	b20a5e9a4d	lib: use struct zero initializer instead of memset This is a more concise approach and guarantees there is no time window where the struct is uninitialized. Generated using the following semantic patch: @@ type T; identifier X; @@ - T X; + T X = { 0 }; ... when exists ( - memset(&X, 0, sizeof(X)); \| - memset(&X, 0, sizeof(T)); ) Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Claudio Fontana <cfontana@suse.de>	2023-08-03 16:41:19 +02:00
Michal Privoznik	5aba198687	qemu: Don't reuse variable in processSerialChangedEvent() When a VSERPORT_CHANGE event is processed, we firstly do a little detour and try to detect whether the event is coming from guest agent. If so, we notify threads that are currently talking to the agent about this fact. Then we proceed with usual event processing (BeginJob(), update domain def, emit event, and so on). In both cases we use the same @dev variable to refer to domain device. While this works, it will make writing semantic patch unnecessary harder (see next commit(s)). Therefore, introduce a separate variable for the detour code. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Claudio Fontana <cfontana@suse.de>	2023-08-03 16:35:29 +02:00
Michal Privoznik	039b16e41e	Decrease scope of some variables There are couple of variables that are declared at function beginning but then used solely within a block (either for() loop or if() statement). And just before their use they are zeroed explicitly using memset(). Decrease their scope, use struct zero initializer and drop explicit memset(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Claudio Fontana <cfontana@suse.de>	2023-08-03 16:35:11 +02:00
Laine Stump	a8262cb331	qemu: don't add --mac-addr option to passt commandline When I implemented passt support in libvirt, I saw the --mac-addr option on the passt commandline, immediately assumed that this was used for setting the guest interface's mac address somewhere within passt, and read no further. As a result, "--mac-addr" is always added to the passt commandline, specifying the setting from <mac addr='blah'/> in the guest's interface config. But as pointed out in this bugzilla comment: https://bugzilla.redhat.com/2184967#c8 That is not at all what passt's --mac-addr option does. Instead, it is used to force the remote mac address for incoming traffic to a specific value. So setting --mac-addr results in all traffic on the interface having the same (the guest's) mac address for both source and destination in all traffic. Surprisingly, this still works, so nobody noticed it during testing. The proper thing is to not specify any mac address to passt - the remote MAC addresses can and should remain untouched, and the local MAC address will end up being known to passt just by the guest sending out packets with that MAC address. Reported-by: Laszlo Ersek <lersek@redhat.com> Signed-off-by: Laine Stump <laine@redhat.com> Reviewed-by: Laszlo Ersek <lersek@redhat.com> Reviewed-by: Stefano Brivio <sbrivio@redhat.com>	2023-08-02 19:33:02 -04:00
Michal Privoznik	99349ba18e	Revert "qemu_passt: Precreate passt logfile" This reverts commit `8511b96a31`. Turns out, we need to do a bit more than just plain qemuSecurityDomainSetPathLabel() which sets svirt_image_t. Passt has its own SELinux policy and as a part of that they invent passt_log_t for log files. Right now, I don't know how libvirt could query that and even if I did, passt SELinux policy would need to permit relabelling from svirt_t to passt_log_t, which it doesn't [1]. Until these problems are addressed we shouldn't be pre-creating the file as it puts users into way worse position - even scenarios that used to work don't work. But then again - using log file for passt is usually valuable for developers only and not regular users. 1: https://bugzilla.redhat.com/show_bug.cgi?id=2209191#c10 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-08-02 17:23:13 +02:00
Michal Privoznik	bc9a254dc7	Revert "qemu_passt: Actually use @logfd" This reverts commit `83686f1eea`. This is needed only so that the next revert is clean. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-08-02 17:23:05 +02:00
Michal Privoznik	baeefe0327	qemu_domain: Partially validate memory amounts when auto-adding NUMA node When automatically adding a NUMA node (qemuDomainDefNumaAutoAdd()) the memory size of the node is computed as: total_memory - sum(memory devices) And we have a nice helper for that: virDomainDefGetMemoryInitial() so it looks logical to just call it. Except, this code runs in post parse callback, i.e. memory sizes were not validated and it may happen that the sum is greater than the total memory. This would be caught by virDomainDefPostParseMemory() but that runs only after driver specific callbacks (i.e. after qemuDomainDefNumaAutoAdd()) and because the domain config was changed and memory was increased to this huge number no error is caught. So let's do what virDomainDefGetMemoryInitial() would do, but with error checking. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2216236 Fixes: `f5d4f5c8ee` Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-25 14:51:35 +02:00
Michal Privoznik	6c3ef66cc6	qemu_tpm: Try harder to create emulator state If a per-domain SWTPM state directory exists but is empty our code still considers it a valid state and skips running 'swtpm_setup' (handled in qemuTPMEmulatorRunSetup()). While we should not try to inspect individual files created by swtpm, we can still consider empty folder as non-existent state. Resolves: https://gitlab.com/libvirt/libvirt/-/issues/320 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-21 15:01:39 +02:00
Peter Krempa	033c4fcddf	qemu_monitor: Remove helpers for 'query-commands' Now that we don't use it for probing at all we can remove all the corresponding monitor code. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 17:28:21 +02:00
Peter Krempa	0af5a514ee	qemu: capabilities: Don't probe 'query-commands' The capability code now probes the presence of commands from the QMP schema instead of using 'query-commands'. Don't call the command and adjust the '.replies' files. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 17:27:19 +02:00
Peter Krempa	d8b6801095	qemu: capabilities: Probe presence of commands from QMP schema instead of 'query-commands' Move the probing code to extract the data from the QMP schema rather than invoking 'query-commands'. This patch doesn't yet remove the actual invocation of 'query-commands', just moves the actual probing. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 17:23:43 +02:00
Peter Krempa	294cf405d5	qemu: capabilities: Retire QEMU_CAPS_(IDE\|SCSI)_DRIVE_WWN Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:43 +02:00
Peter Krempa	37f20a9bdb	qemu: Always assume support for QEMU_CAPS_IDE_DRIVE_WWN The support for configuring the 'wwn' of a IDE disk was added in qemu commit 95ebda85e09 (v1.0-1869-g95ebda85e0) and can't be compiled out. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:33 +02:00
Peter Krempa	5eed97e313	qemu: Always assume support for QEMU_CAPS_SCSI_DISK_WWN The support for configuring the 'wwn' of a SCSI disk was added in qemu commit 27395add759ff4caeb0 (v1.0-3326-g27395add75) and can't be compiled out. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:21 +02:00
Peter Krempa	c90c97a734	Properly mark auto-added 'terminator' virStorageSource All backing chain members which were auto-added by image detection, including the terminating element, should have the 'detected' property set to true. This is needed to properly strip the detected elements in some cases, e.g. for the status XML where we could treat some images as manually terminated even when it was auto-detected. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 14:58:35 +02:00
Peter Krempa	49aca0a845	qemu: driver: Reformat helpers for saving VM state Rewrap argument definition of qemuDomainSaveInternal and align argument in the invocation of the aforementioned function in qemuDomainManagedSaveHelper. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 14:58:35 +02:00
Daniel P. Berrangé	6103584284	src: remove dep on systemd-logind.service from unit files After the previous commit we no longer require that logind is actually running, it merely has to be activatable. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 11:01:04 +01:00
Daniel P. Berrangé	38abf9c34d	src: set max open file limit to match systemd >= 240 defaults Since systemd 240, all services get an open file hard limit of 500k, and a soft limit of 1024. This limit means apps are safe to use select() by default which is limited to 1024 FDs. Apps which don't use select() are expected to simply set their soft limit to match the hard limit during startup. With our current unit file settings we've been effectively reducing the max open files we have on most modern systems. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:01 +01:00
Daniel P. Berrangé	a315070047	src: remove After=local-fs.target from systemd units All services are ordered after local-fs.target unless they have set DefaultDependencies=no, which we do not do. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:00 +01:00
Jonathon Jongsma	988f8443fb	qemu: require memfd memory for virtio 'blob' support The virtio-gpu 'blob' support was insufficiently validated. Qemu requires a memfd memory backing in order to use udmabuf and enable blob support. Example error: $ virsh start rhel9 error: Failed to start domain 'rhel9' error: internal error: qemu unexpectedly closed the monitor: 2023-07-18T02:33:57.083178Z qemu-kvm: -device {"driver":"virtio-vga","id":"video0","max_outputs":1,"blob":true,"bus":"pcie.0","addr":"0x1"}: cannot enable blob resources without udmabuf Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-19 09:46:30 -05:00
Michal Privoznik	ecbce92371	qemu: Adapt to new way of specifying PC speaker Historically, the way to set PC speaker for a guest was to pass: -soundhw pcspk but as of QEMU commit v5.1.0-rc0~28^2~3 this is deprecated and we should use: -machine pcspk-audiodev=$id instead. The old way was then removed in commit v7.1.0-rc0~99^2~3. Now, ideally we would have a capability selecting whether we talk to a QEMU that understands the new way or not. But it's not that simple - the machine attribute is just an alias to the .audiodev= attribute of 'isa-pcspk' object and both are created in pc_machine_initfn() function, i.e. not then the PC_MACHINE() class is initialized, but when it's instantiated. IOW, it's not possible for us to query whether we're dealing with older or newer QEMU. But given that the newer version is supported since v5.1.0 and the minimal version we require is v4.2.0 (i.e. there are two releases which don't understand the newer cmd line) and how frequently this feature is (un-)used (the issue was reported after ~1 year since it stopped working), I believe we can live without any capability and just use the newer cmd line unconditionally. Resolves: https://gitlab.com/libvirt/libvirt/-/issues/490 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-19 16:03:31 +02:00
Michal Privoznik	f9641d2c37	qemu_capabilities: Retire QEMU_CAPS_USB_STORAGE_REMOVABLE Now that the QEMU_CAPS_USB_STORAGE_REMOVABLE capability is no longer used we can stop querying it and retire it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-07-19 13:34:25 +02:00
Michal Privoznik	1a663b53c0	qemu: Always assume QEMU_CAPS_USB_STORAGE_REMOVABLE Introduced in QEMU commit of v0.14.0-rc0~83^2~1 and not being able to compile the .removable attribute of the "usb-storage" object out, renders our corresponding capability QEMU_CAPS_USB_STORAGE_REMOVABLE always set. Stop using it in command generation / domain validation. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-19 13:26:31 +02:00
Boris Fiuczynski	8417c1394c	qemu: S390 does not provide physical address size Commit `be1b7d5b18` introduced parsing /proc/cpuinfo for "address size" which is not including on S390 and therefore reports an internal error. Lets remove the parsing on S390. Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Marc Hartmayer <mhartmay@linux.ibm.com> Reviewed-by: Collin Walling <walling@linux.ibm.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-19 08:42:08 +02:00
Boris Fiuczynski	aece25f665	conf: domcaps: Add 'async-teardown' domain capability Add async-teardown to the features list in domain capabilities allowing high level management to introspect the availability of the asynchronous teardown feature. Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-18 08:49:32 +02:00
Michal Privoznik	f5d4f5c8ee	qemu: Add NUMA node automatically for memory hotplug Up until v2.11.0-rc2~19^2~3 QEMU used to require at least one NUMA node to be configured when memory hotplug was enabled. After that commit, QEMU automatically adds a NUMA node if none was specified on the cmd line. Reflect this in domain XML, i.e. explicitly add a NUMA node into our domain definition if needed. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2216236 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-18 08:42:55 +02:00
Michal Privoznik	851c5f075b	qemu_domain: Deduplicate targetNode check in qemuDomainDefValidateMemoryHotplugDevice() If a domain has NUMA configured, then all <memory/> devices (except for 'virtio-pmem') need to have targetNode set. There are two checks inside of qemuDomainDefValidateMemoryHotplugDevice() for this: one inside of big switch() statement, which only checks 'dimm' and 'nvdimm' cases, and the other at the end of the function that checks all models (except for 'virtio-pmem'). Let's keep the latter and remove the former as the latter covers the former too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-13 16:34:15 +02:00
Boris Fiuczynski	3bf02acdc5	qemu: allow use of async teardown in domain Asynchronous teardown can be specified if the QEMU binary supports it by adding in the domain XML <features> ... <async-teardown enabled='yes\|no'/> ... </features> By default this new feature is disabled. Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Thomas Huth <thuth@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-10 13:28:17 +02:00
Boris Fiuczynski	65c6513811	qemu: add run-with async-teardown capability QEMU capability is looking in query-command-line-options response for ... { "parameters": [ { "name": "async-teardown", "type": "boolean" } ], "option": "run-with" } ... allow to use the QEMU option -run-with async-teardown=on\|off Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Thomas Huth <thuth@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-10 13:28:13 +02:00
Han Han	1ed695bcff	qemu: Support removable for scsi disk Allow //disk/target@removable for scsi disk devices, since QEMU has support the removable attribute for scsi-hd device from v0.14.0[1]. [1]: 419e691f8e: scsi-disk: Allow overriding SCSI INQUIRY removable bit Signed-off-by: Han Han <hhan@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-09 07:39:57 +02:00
Nikolai Barybin	2d6659e778	qemu: prevent SIGSEGV in qemuProcessHandleDumpCompleted If VIR_ASYNC_JOB_NONE flag is present, job.current is equal to NULL, which leads to SIGSEGV. Thus, this check should be moved up. Fixes: v8.0.0-427-gf304de0df6 Signed-off-by: Nikolai Barybin <nikolai.barybin@virtuozzo.com> Reviewed-by: Jiri Denemark <jdenemar@redhat.com>	2023-06-27 12:39:50 +02:00
Michal Privoznik	83686f1eea	qemu_passt: Actually use @logfd In one of my previous commits I've introduced @logfd variable that was supposed to hold FD of passt logfile. But I've forgot to assign the qemuDomainOpenFile() retval to it. Fixes: `8511b96a31` Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-27 08:51:36 +02:00
Michal Privoznik	8511b96a31	qemu_passt: Precreate passt logfile There are a few situations where passt itself is unable to create a file because it runs under QEMU user (e.g. just like our example from formatdomain.rst suggests: /var/log/passt.log). If libvirtd runs with sufficient permissions (e.g. as root) it can create the file and set seclabels on it so that passt can then open it. Ideally, we would just pass pre-opened FD, but this wasn't viewed as secure enough [1]. So lets just create the file and set seclabels. For the case when both libvirtd and passt have the same permissions, well then we fail before even needing to fork() and exec(). 1: https://archives.passt.top/passt-dev/20230606225836.63aecebe@elisabeth/ Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2209191 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 15:52:54 +02:00
Peter Krempa	f4fea84f59	Explicitly reject unsupported storage types for legacy config generators New storage types are not implemented in generators for -drive and the xen config. Explicitly reject them in case of a programming error. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-26 14:53:36 +02:00
zuoboqun	b7f7f07542	qemuDomainWaitForDeviceRemoval: recheck the value of priv->unplug.alias when timeout When detaching a device, the following race condition may happen: Once qemuDomainSignalDeviceRemoval() marks the device for removal, it returns true, which means it is the caller that marked the device for removal is going to remove the device from domain definition. But qemuDomainWaitForDeviceRemoval() may still receive timeout from virDomainObjWaitUntil() which is implemented by pthread_cond_timedwait() due to an unavoidable race between the expiration of the timeout and the predicate state(priv->unplug.alias) change. And then qemuDomainWaitForDeviceRemoval() will return 0, thus the caller will not remove the device from domain definition. In this situation, the device is still present in the domain definition but doesn't exist in qemu anymore. Worse, there is no way to remove it from the domain definition. Solution is to recheck the value of priv->unplug.alias to determine who is going to remove the device from domain definition. Signed-off-by: zuo boqun <zuoboqun@baidu.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-26 13:40:58 +02:00
Jean-Louis Dupond	b855f8ea1e	Add discard_no_unref option for qcow2 images Qemu 8.1.0 will add discard_no_unref option for qcow2 images. When this option is enabled (default=false), then it will no longer unreference clusters when guest does a discard, but it will just free the blocks (useful for incremental backups for example) and pass the discard to the lower layer. This was implemented to avoid fragmentation within the qcow2 image. Signed-off-by: Jean-Louis Dupond <jean-louis@dupond.be> Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 13:06:00 +02:00
Peter Krempa	1092a88e13	qemu: capabilities: Introduce QEMU_CAPS_QCOW2_DISCARD_NO_UNREF The qcow2 driver allows passing discards to the storage while keeping the reference of the block, and just marking it as zeroed. This can decrease the levels of fragmentation of the qcow2 metadata when discards are enabled. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 13:00:54 +02:00
Peter Krempa	e3ce39195c	qemu_domain: Properly validate count of memory slots Memory slots are required only for DIMM-like devices, while other devices defined via <memory> such as virtio-mem may use the PCI bus and thus do not require/consume a memory slot. Fix the validation code to calculate the required count of memory devices only for DIMMs and NVDIMMs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 12:58:24 +02:00
Peter Krempa	a52c68443d	qemu_command: Always use modern syntax of '-m' Specify the memory size by using '-m size=2048k' instead of just '-m 2'. The new syntax is used when memory hotplug is enabled. To preserve memory sizing, if memory hotplug is disabled the size is rounded down to the nearest mebibyte. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 12:58:23 +02:00
Jiri Denemark	ce6d1dca6d	qemu: Include maximum physical address size in baseline CPU The current implementation of virConnectBaselineHypervisorCPU in QEMU driver can provide a CPU definition that will not work on all hosts in case they have different maximum physical address size. So when we get the info from domain capabilities, we need to choose the smallest physical address size for the computed baseline CPU definition. https://bugzilla.redhat.com/show_bug.cgi?id=2171860 Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-16 12:44:54 +02:00
Jiri Denemark	be1b7d5b18	qemu: Report physical address size in domain capabilities We already report the hosts physical address size in host capabilities, but computing a baseline CPU definition is done from domain capabilities. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-16 12:44:54 +02:00
Martin Kletzander	e940cac98e	util: Parse RSS into ullp It is used to fill an unsigned long long anyway and if it is negative than there is really an issue somewhere. Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-14 12:47:54 +02:00
Michal Privoznik	d09b73b560	qemu: Drop @unionMems argument from qemuProcessSetupPid() The @unionMems argument of qemuProcessSetupPid() function is not necessary really as all callers pass 'true'. Drop it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-08 09:39:20 +02:00
Michal Privoznik	83adba541a	qemu: Allow more generous cpuset.mems for vCPUs and IOThreads The unit that cpuset CGroups controller works with is a thread/process, not individual memory allocations. Therefore, after we've set cpuset.mems for emulator (after previous commit it's set to union of all host NUMA nodes allowed for given domain), and as we try to set up cpuset.mems for vCPUs/IOThreads, memory is migrated to selected NUMA node(s). We are effectively saying: "this thread (vCPU thread) can have memory only from these NUMA node(s)". That's not really what we want though. The cpuset controller doesn't differentiate memory "belonging" to the emulator thread and vCPU thread or IOThread even. Therefore, set union of all allowed host NUMA nodes, just like we're doing for the emulator thread. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2138150 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-08 09:39:20 +02:00
Michal Privoznik	fddbb2f12f	qemu: Don't try to 'fix up' cpuset.mems after QEMU's memory allocation In ideal world, my plan was perfect. We allow union of all host nodes in cpuset.mems and once QEMU has allocated its memory, we 'fix up' restriction of its emulator thread by writing the original value we wanted to set all along. But in fact, we can't do it because that triggers memory movement. For instance, consider the following <numatune/>: <numatune> <memory mode="strict" nodeset="0"/> <memnode cellid="1" mode="strict" nodeset="1"/> </numatune> <numa> <cell id="0" cpus="0-1" memory="1024000" unit="KiB" /> <cell id="1" cpus="2-3" memory="1048576" unit="KiB"/> </numa> This is meant to create 1:1 mapping between guest and host NUMA nodes. So we start QEMU with cpuset.mems set to "0-1" (so that it can allocate memory even for guest node #1 and have the memory come fro host node #1) and then, set cpuset.mems to "0" (because that's where we wanted emulator thread to live). But this in turn triggers movement of all memory (even the allocated one) to host NUMA node #0. Therefore, we have to just keep cpuset.mems untouched and rely on .host-nodes passed on the QEMU cmd line. The placement still suffers because of cpuset.mems set for vcpus or iothreads, but that's fixed in next commit. Fixes: `3ec6d586bc` Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-08 09:39:20 +02:00
Peter Krempa	9d6867198d	qemuMonitorSetBlockIoThrottle: Drop 'diskalias' argument Every caller will pass 'qdevid' as it's populated in the data mandatorily with qemu-4.2 and onwards due to mandatory -blockdev use. Thus we can drop compatibility with the old way of matching the disk via alias. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	f0296aa9a3	qemuMonitorGetBlockIoThrottle: Drop 'diskalias' argument Every caller will pass 'qdevid' as it's populated in the data mandatorily with qemu-4.2 and onwards due to mandatory -blockdev use. Thus we can drop compatibility with the old way of matching the disk via alias. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	4ba032a2a2	qemu: Refuse setting <iotune> for 'SD' disks Historically this didn't work with any supported qemu version as we don't set the alias of the device, and thus qemu uses a different alias resulting in a failure to startup the VM: internal error: unable to execute QEMU command 'block_set_io_throttle': Device 'drive-sd-disk0' not found Refuse setting throttling as this is unlikely to be needed and proper fix requires using -device instead of -drive if=sd. Note that this was broken when I moved the setup of throttling as a command at startup for blockdev integration quite a while ago. Until then throttling was passed as arguments for -drive. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	6d6a87f229	qemuDiskConfigBlkdeviotuneEnabled: Make 'disk' argument const The function doesn't modify it. Fix the argument declaration so that the function can be used in a context where we have a 'const' disk definition. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00

1 2 3 4 5 ...

13268 Commits