libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-10-05 22:05:47 +00:00

Author	SHA1	Message	Date
Peter Krempa	d8b6801095	qemu: capabilities: Probe presence of commands from QMP schema instead of 'query-commands' Move the probing code to extract the data from the QMP schema rather than invoking 'query-commands'. This patch doesn't yet remove the actual invocation of 'query-commands', just moves the actual probing. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 17:23:43 +02:00
Peter Krempa	7ca777cc09	node_device: Don't leak error message buffer from virMdevctlListDefined\|Active nodeDeviceUpdateMediatedDevices invokes virMdevctlListDefined and virMdevctlListActive both of which were passed the same 'errmsg' buffer. Since virCommandSetErrorBuffer() always allocates the error buffer one of them was leaked. Fix it by populating the 'errmsg' buffer only on failure of virMdevctlListActive\|Defined which invoke the command. Add a comment to nodeDeviceGetMdevctlListCommand reminding how virCommandSetErrorBuffer() works. Fixes: `44a0f2f0c8` Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.ibm.com>	2023-07-20 15:25:03 +02:00
Peter Krempa	294cf405d5	qemu: capabilities: Retire QEMU_CAPS_(IDE\|SCSI)_DRIVE_WWN Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:43 +02:00
Peter Krempa	37f20a9bdb	qemu: Always assume support for QEMU_CAPS_IDE_DRIVE_WWN The support for configuring the 'wwn' of a IDE disk was added in qemu commit 95ebda85e09 (v1.0-1869-g95ebda85e0) and can't be compiled out. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:33 +02:00
Peter Krempa	5eed97e313	qemu: Always assume support for QEMU_CAPS_SCSI_DISK_WWN The support for configuring the 'wwn' of a SCSI disk was added in qemu commit 27395add759ff4caeb0 (v1.0-3326-g27395add75) and can't be compiled out. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:21 +02:00
Peter Krempa	9a47442366	storage: Fix returning of locked objects from 'virStoragePoolObjListSearch' CVE-2023-3750 'virStoragePoolObjListSearch' explicitly documents that it's returning a pointer to a locked and ref'd pool that maches the lookup function. This was not the case as in commit `0c4b391e2a` (released in libvirt-8.3.0) the code was accidentally converted to use 'VIR_LOCK_GUARD' which auto-unlocked it when leaving the scope, even when the code was originally "leaking" the lock. Revert the corresponding conversion and add a comment that this function is intentionally leaking a locked object. Fixes: `0c4b391e2a` Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2221851 Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 15:24:02 +02:00
Peter Krempa	c90c97a734	Properly mark auto-added 'terminator' virStorageSource All backing chain members which were auto-added by image detection, including the terminating element, should have the 'detected' property set to true. This is needed to properly strip the detected elements in some cases, e.g. for the status XML where we could treat some images as manually terminated even when it was auto-detected. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 14:58:35 +02:00
Peter Krempa	49aca0a845	qemu: driver: Reformat helpers for saving VM state Rewrap argument definition of qemuDomainSaveInternal and align argument in the invocation of the aforementioned function in qemuDomainManagedSaveHelper. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-07-20 14:58:35 +02:00
Daniel P. Berrangé	d051e0623c	util: add logging about node suspend availability Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 11:01:08 +01:00
Daniel P. Berrangé	6103584284	src: remove dep on systemd-logind.service from unit files After the previous commit we no longer require that logind is actually running, it merely has to be activatable. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 11:01:04 +01:00
Daniel P. Berrangé	29f2222dd5	util: relax requirement for logind to be running Historically we wanted to check if logind was actually running, not merely activatable, because on systems where systemd is installed, but the OS is booted into non-systemd init, we want to fallback to pm-utils. Requiring logind to be running, however, forces us to serialize libvirtd startup on startup of logind which is undesirable. We can relax this dependancy if we check whether systemd itself is running, which implies that logind will activated when we need it. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 11:00:18 +01:00
Daniel P. Berrangé	38abf9c34d	src: set max open file limit to match systemd >= 240 defaults Since systemd 240, all services get an open file hard limit of 500k, and a soft limit of 1024. This limit means apps are safe to use select() by default which is limited to 1024 FDs. Apps which don't use select() are expected to simply set their soft limit to match the hard limit during startup. With our current unit file settings we've been effectively reducing the max open files we have on most modern systems. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:01 +01:00
Daniel P. Berrangé	04b82f961b	rpc: automatically raise max file limit in all daemons None of our daemons use select(), so it is safe to raise the max file limit to its maximum on startup. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:01 +01:00
Daniel P. Berrangé	c41cc852f6	util: add helper for raising the max files limit Historically the max files limit for processes has always been 1024, because going beyond this is incompatible with the select() function. None the less most apps these days will use poll() so should not be limited in this way. Since systemd >= 240, the hard limit will be 500k, while the soft limit remains at 1k. Applications which don't use select() should raise their soft limit to match the hard limit during their startup. This function provides a convenient helper to do this limit raising. Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:01 +01:00
Daniel P. Berrangé	3967174d03	util: remove pointless wrappers for setrlimit/getrlimit These wrappers added no semantic difference over calling the system function directly. Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:00 +01:00
Daniel P. Berrangé	427eef8959	src: remove deps on ip[6]tables/firewalld.service from systemd units The unit files both have After=network.target, and this in turn implies After=network-pre.target. Both iptables.service & ip6tables.service have Before=network-pre.target since Fedora >= 35 and RHEL >= 8.4. When we first added the deps on ip[6]tables.service in commit `0756415f14` Author: Laine Stump <laine@redhat.com> Date: Fri May 1 00:05:50 2020 -0400 systemd: start libvirtd after firewalld/iptables services the Before=network-pre.target didn't exist, but we can rely on it now given our supported platforms matrix. The firewalld.service has similarly has a Before=network-pre.target, even when we took that commit above, so this dep was in face never actually needed. This answers the question posed in that above commit message about firewalld ordering. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:00 +01:00
Daniel P. Berrangé	a315070047	src: remove After=local-fs.target from systemd units All services are ordered after local-fs.target unless they have set DefaultDependencies=no, which we do not do. https://gitlab.com/libvirt/libvirt/-/issues/489 Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-20 10:58:00 +01:00
Michal Privoznik	7ba20863a7	meson: Annotate each test() with 'suite' A test case can be part of a test suite (just like we already have 'syntax-check'). This then allows developers to run only a subset of tests. For instance - when using valgrind test setup (`meson test -C _build/ --setup valgrind`) it makes zero sense to run syntax-check tests or other script based tests (e.g. check-augeas-*, check-remote_protocol, etc.). What does makes sense is to run compiled binaries. Strictly speaking, reaching that goal is as trivial as annotating only those compiled tests (declared in tests/meson.build) and running them selectively: meson test -C _build/ --setup valgrind --suite $TAG But it may be also desirable to run test scripts separately. Therefore, introduce two new tags: 'bin' for compiled tests, and 'script' for script based tests and annotate each test() accordingly. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-20 10:58:01 +02:00
Matt Low	3cde509f1a	storage: zfs: Use 'zfs list' to check pool status The current virtStorageBackendZFSCheckPool checks for the existence of a path under /dev/zvol/ to determine if the pool is active. ZFS does not create a path under /dev/zvol/ if no ZFS volumes have been created under a particular dataset, thus, empty ZFS storage pools are deactivated whenever checkPool is called on them (as noted in referenced issue). This commit changes virStorageBackendZFSCheckPool so that the 'zfs list' command is used to explicitly check for the existence a dataset specified by the pool's def->source.name. Resolves: https://gitlab.com/libvirt/libvirt/-/issues/221 Signed-off-by: Matt Low <matt@mlow.ca> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-07-20 10:44:14 +02:00
Jonathon Jongsma	fa0d5f4ebc	nodedev: report mdev persistence properly Since commit `44a0f2f0`, we now query mdevctl for transient (active) mdevs in order to gather attributes for the mdev. Unfortunately, this commit introduced a regression because nodeDeviceUpdateMediatedDevice() assumed that all mdevs returned from mdevctl were actually persistent mdevs but we were using it to update transient mdevs. Refactor the function so that we can use it to update both persistent and transient mdevs. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.ibm.com>	2023-07-19 09:47:20 -05:00
Jonathon Jongsma	988f8443fb	qemu: require memfd memory for virtio 'blob' support The virtio-gpu 'blob' support was insufficiently validated. Qemu requires a memfd memory backing in order to use udmabuf and enable blob support. Example error: $ virsh start rhel9 error: Failed to start domain 'rhel9' error: internal error: qemu unexpectedly closed the monitor: 2023-07-18T02:33:57.083178Z qemu-kvm: -device {"driver":"virtio-vga","id":"video0","max_outputs":1,"blob":true,"bus":"pcie.0","addr":"0x1"}: cannot enable blob resources without udmabuf Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-19 09:46:30 -05:00
Michal Privoznik	ecbce92371	qemu: Adapt to new way of specifying PC speaker Historically, the way to set PC speaker for a guest was to pass: -soundhw pcspk but as of QEMU commit v5.1.0-rc0~28^2~3 this is deprecated and we should use: -machine pcspk-audiodev=$id instead. The old way was then removed in commit v7.1.0-rc0~99^2~3. Now, ideally we would have a capability selecting whether we talk to a QEMU that understands the new way or not. But it's not that simple - the machine attribute is just an alias to the .audiodev= attribute of 'isa-pcspk' object and both are created in pc_machine_initfn() function, i.e. not then the PC_MACHINE() class is initialized, but when it's instantiated. IOW, it's not possible for us to query whether we're dealing with older or newer QEMU. But given that the newer version is supported since v5.1.0 and the minimal version we require is v4.2.0 (i.e. there are two releases which don't understand the newer cmd line) and how frequently this feature is (un-)used (the issue was reported after ~1 year since it stopped working), I believe we can live without any capability and just use the newer cmd line unconditionally. Resolves: https://gitlab.com/libvirt/libvirt/-/issues/490 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-19 16:03:31 +02:00
Michal Privoznik	f9641d2c37	qemu_capabilities: Retire QEMU_CAPS_USB_STORAGE_REMOVABLE Now that the QEMU_CAPS_USB_STORAGE_REMOVABLE capability is no longer used we can stop querying it and retire it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-07-19 13:34:25 +02:00
Michal Privoznik	1a663b53c0	qemu: Always assume QEMU_CAPS_USB_STORAGE_REMOVABLE Introduced in QEMU commit of v0.14.0-rc0~83^2~1 and not being able to compile the .removable attribute of the "usb-storage" object out, renders our corresponding capability QEMU_CAPS_USB_STORAGE_REMOVABLE always set. Stop using it in command generation / domain validation. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-19 13:26:31 +02:00
Michal Privoznik	d90a34cf29	virrandommock: Drop virRandomGenerateWWN After previous commit, there's no functional difference between real virRandomGenerateWWN() and the mocked version. Drop the mock then. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-07-19 13:09:11 +02:00
Michal Privoznik	ae7df11377	test_driver: Pass virt_type to virNodeDeviceDefParse() in testNodeDeviceCreateXML() This brings the code closer to real implementation: nodeDeviceCreateXML(). For the unique OUI, let's take the value from tests/virrandommock.c: 100000. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-07-19 13:09:11 +02:00
Michal Privoznik	b857ad0696	virrandom: Fix printf format string in virRandomGenerateWWN() Firstly, drop needless concatenation of two static strings. Secondly, use proper (portable) formatter for uint64_t so that typecast to ULL can be dropped. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-07-19 13:09:11 +02:00
Boris Fiuczynski	8417c1394c	qemu: S390 does not provide physical address size Commit `be1b7d5b18` introduced parsing /proc/cpuinfo for "address size" which is not including on S390 and therefore reports an internal error. Lets remove the parsing on S390. Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Marc Hartmayer <mhartmay@linux.ibm.com> Reviewed-by: Collin Walling <walling@linux.ibm.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-19 08:42:08 +02:00
Boris Fiuczynski	aece25f665	conf: domcaps: Add 'async-teardown' domain capability Add async-teardown to the features list in domain capabilities allowing high level management to introspect the availability of the asynchronous teardown feature. Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-18 08:49:32 +02:00
Michal Privoznik	f5d4f5c8ee	qemu: Add NUMA node automatically for memory hotplug Up until v2.11.0-rc2~19^2~3 QEMU used to require at least one NUMA node to be configured when memory hotplug was enabled. After that commit, QEMU automatically adds a NUMA node if none was specified on the cmd line. Reflect this in domain XML, i.e. explicitly add a NUMA node into our domain definition if needed. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2216236 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-18 08:42:55 +02:00
Jonathon Jongsma	14026db9b0	nodedev: update mdevs from the mdevctl thread Rather than directly executing mdevctl from the udev event thread when we determine that we need to re-query, schedule the mdevctl thread to run. This also helps to coalesce multiple back-to-back updates into a single one when there are multiple updates in a row or at startup when a host has a very large number of mdevs. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.ibm.com>	2023-07-13 09:43:45 -05:00
Jonathon Jongsma	9b7fadc5dc	nodedev: refactor mdevctl thread functions Factor out a new scheduleMdevctlUpdate() function so that we can re-use it from other places. Now that other events can make it necessary to re-query mdevctl for mdev updates, this function will be useful for coalescing multiple updates in quick succession into a single mdevctl query. Also rename a couple functions. The names weren't very descriptive of their behavior. For example, the old scheduleMdevctlHandler() function didn't actually schedule anything, it just started a thread. So rename it to free up the 'schedule' name for the above refactored function. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.ibm.com>	2023-07-13 09:43:45 -05:00
Boris Fiuczynski	37481aa1f1	nodedev: transient mdev update on nodeDeviceCreateXML Update the optional mdev attributes by running an mdevctl update on a new created nodedev object representing an mdev. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2143158 Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Jonathon Jongsma <jjongsma@redhat.com>	2023-07-13 09:43:34 -05:00
Michal Privoznik	851c5f075b	qemu_domain: Deduplicate targetNode check in qemuDomainDefValidateMemoryHotplugDevice() If a domain has NUMA configured, then all <memory/> devices (except for 'virtio-pmem') need to have targetNode set. There are two checks inside of qemuDomainDefValidateMemoryHotplugDevice() for this: one inside of big switch() statement, which only checks 'dimm' and 'nvdimm' cases, and the other at the end of the function that checks all models (except for 'virtio-pmem'). Let's keep the latter and remove the former as the latter covers the former too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-13 16:34:15 +02:00
Jim Fehlig	b9eeeebddb	libxl: Advertise support for VIR_MIGRATE_CHANGE_PROTECTION The libxl driver has basic support for VIR_MIGRATE_CHANGE_PROTECTION by starting and stopping modify jobs in the begin/confirm and prepare/finish phases of migration, but it doesn't advertise that support. This can result in unterminated jobs because the migration logic skips phases of migration when the VIR_MIGRATE_CHANGE_PROTECTION feature is absent. Ensure jobs are terminated properly by advertising support for VIR_MIGRATE_CHANGE_PROTECTION. Signed-off-by: Jim Fehlig <jfehlig@suse.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-13 08:28:50 -06:00
Jim Fehlig	f3ed5c2713	libxl: Don't attempt to resume domain on canceled migration For unknown reasons, the libxl driver attempts to resume a domain in the confirm phase when a migration operation has been canceled. This has shown to be problematic when simulating scenarios that result in a canceled migration. In all scenarios, the domain was in a running state when entering libxlDomainMigrationSrcConfirm, causing the call to libxl_domain_resume to fail. Making matters worse, the domain state is changed to paused when in fact it's running. And finally, libxlDomainMigrationSrcConfirm incorrectly returns an error. Remove this incorrect logic from libxlDomainMigrationSrcConfirm. On a canceled migration it's sufficient to resume the lock process that was paused in the perform phase. Signed-off-by: Jim Fehlig <jfehlig@suse.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2023-07-13 08:28:20 -06:00
Michal Privoznik	ded44a0406	node_device: Enclose Linux specific code in ifdef __linux__ Our CI started to enable udev backend on FreeBSD. And while there is udev on FreeBSD some parts of our code are highly Linux specific, e.g. translating SCSI device type to string (from an integer obtained from the sysfs). Obviously, this doesn't work anywhere else. This is the reason why we need to include scsi/scsi.h header file (which actually comes from the Linux kernel source tree but for some reason glibc started to distribute it, followed by musl). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-07-11 18:11:26 +02:00
Boris Fiuczynski	3bf02acdc5	qemu: allow use of async teardown in domain Asynchronous teardown can be specified if the QEMU binary supports it by adding in the domain XML <features> ... <async-teardown enabled='yes\|no'/> ... </features> By default this new feature is disabled. Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Thomas Huth <thuth@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-10 13:28:17 +02:00
Boris Fiuczynski	65c6513811	qemu: add run-with async-teardown capability QEMU capability is looking in query-command-line-options response for ... { "parameters": [ { "name": "async-teardown", "type": "boolean" } ], "option": "run-with" } ... allow to use the QEMU option -run-with async-teardown=on\|off Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Thomas Huth <thuth@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-10 13:28:13 +02:00
Han Han	1ed695bcff	qemu: Support removable for scsi disk Allow //disk/target@removable for scsi disk devices, since QEMU has support the removable attribute for scsi-hd device from v0.14.0[1]. [1]: 419e691f8e: scsi-disk: Allow overriding SCSI INQUIRY removable bit Signed-off-by: Han Han <hhan@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-07-09 07:39:57 +02:00
Andrea Bolognani	4c6feb832f	apparmor: Make all profiles extensible Do for all other profiles what we already do for the virt-aa-helper one. In this case we limit the feature to AppArmor 3.x, as it was never implemented for 2.x. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2023-07-03 14:55:43 +02:00
Andrea Bolognani	21a84ec994	apparmor: Improve virt-aa-helper include For AppArmor 3.x we can use 'include if exists', which frees us from having to create a dummy override. For AppArmor 2.x we keep things as they are to avoid introducing regressions. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2023-07-03 14:55:41 +02:00
Andrea Bolognani	b6092de883	apparmor: Make abstractions extensible Implement the standard AppArmor 3.x abstraction extension approach. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2023-07-03 14:55:39 +02:00
Andrea Bolognani	84e01d182e	apparmor: Only support passt on 3.x The subprofile can only work by including the abstraction shipped in the passt package, which we can't assume is present, and 'include if exists' doesn't work well on 2.x. No distro that's stuck on AppArmor 2.x is likely to be shipping passt anyway. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2023-07-03 14:55:37 +02:00
Andrea Bolognani	63a312fa2d	apparmor: Allow version-specific bits in abstractions too Compared to profiles, we only need a single preprocessing step here, as there is no variable substitution happening. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2023-07-03 14:55:35 +02:00
Andrea Bolognani	19eb8abc9a	apparmor: Allow version-specific bits in profiles Perform an additional preprocessing step before the existing variable substitution. This is the same approach that we already use to customize systemd unit files based on whether the service supports TCP connections. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2023-07-03 14:55:34 +02:00
Michal Privoznik	63792a286d	Revert "lxc_fuse: Tell FUSE that /proc/meminfo is nonseekable" After v8.1.0-61-g030faee28d it is no longer necessary to make the /proc/meminfo file nonseekable as our code that fills the file with spoofed values can handle seeking just fine. Previously, `free(1)` was okay with failed lseek(), but this was ages ago and meanwhile the procps project moved to creating a library and moved the file parsing code under an exported function. In attempt to make the function callable multiple times, it can lseek() multiple times and failure to do so is fatal. This reverts commit `7664955086` Resolves: https://gitlab.com/libvirt/libvirt/-/issues/492 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-29 15:21:00 +02:00
Han Han	28141adfdc	scripts: Fix the flake8 syntax-check failures Fix the syntax-check failures (which can be seen after python3-flake8-import-order package is installed) with the help of isort[1]: 289/316 libvirt:syntax-check / flake8 FAIL 5.24s exit status 2 [1]: https://pycqa.github.io/isort/ Signed-off-by: Han Han <hhan@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-29 11:51:27 +02:00
Jim Fehlig	d7fb8deb6a	Revert "apparmor: Add support for local profile customizations" As it turns out, apparmor 2.x and 3.x behave differently or have differing levels of support for local customizations of profiles and profile abstractions. Additionally the apparmor 2.x tools do not cope well with 'include if exists'. Revert this commit until a more complete solution is developed that works with old and new apparmor. Reverts: `9b743ee190` Signed-off-by: Jim Fehlig <jfehlig@suse.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2023-06-28 07:43:44 -06:00
Nikolai Barybin	2d6659e778	qemu: prevent SIGSEGV in qemuProcessHandleDumpCompleted If VIR_ASYNC_JOB_NONE flag is present, job.current is equal to NULL, which leads to SIGSEGV. Thus, this check should be moved up. Fixes: v8.0.0-427-gf304de0df6 Signed-off-by: Nikolai Barybin <nikolai.barybin@virtuozzo.com> Reviewed-by: Jiri Denemark <jdenemar@redhat.com>	2023-06-27 12:39:50 +02:00
Michal Privoznik	83686f1eea	qemu_passt: Actually use @logfd In one of my previous commits I've introduced @logfd variable that was supposed to hold FD of passt logfile. But I've forgot to assign the qemuDomainOpenFile() retval to it. Fixes: `8511b96a31` Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-27 08:51:36 +02:00
Michal Privoznik	8511b96a31	qemu_passt: Precreate passt logfile There are a few situations where passt itself is unable to create a file because it runs under QEMU user (e.g. just like our example from formatdomain.rst suggests: /var/log/passt.log). If libvirtd runs with sufficient permissions (e.g. as root) it can create the file and set seclabels on it so that passt can then open it. Ideally, we would just pass pre-opened FD, but this wasn't viewed as secure enough [1]. So lets just create the file and set seclabels. For the case when both libvirtd and passt have the same permissions, well then we fail before even needing to fork() and exec(). 1: https://archives.passt.top/passt-dev/20230606225836.63aecebe@elisabeth/ Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2209191 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 15:52:54 +02:00
Peter Krempa	f4fea84f59	Explicitly reject unsupported storage types for legacy config generators New storage types are not implemented in generators for -drive and the xen config. Explicitly reject them in case of a programming error. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-26 14:53:36 +02:00
Oleg Vasilev	7b793a00bd	util: don't validate empty params If there are no parameters, there is nothing to validate. If params == NULL, memcpy below results in memcpy(sorted, NULL, 0), which is UB. Found by UBSAN. Example of this codepath: virDomainBlockCopy() (where nparams == 0 is valid) -> qemuDomainBlockCopy() Signed-off-by: Oleg Vasilev <oleg.vasilev@virtuozzo.com> Reviewed-by: Kristina Hanicova <khanicov@redhat.com>	2023-06-26 14:19:18 +02:00
zuoboqun	b7f7f07542	qemuDomainWaitForDeviceRemoval: recheck the value of priv->unplug.alias when timeout When detaching a device, the following race condition may happen: Once qemuDomainSignalDeviceRemoval() marks the device for removal, it returns true, which means it is the caller that marked the device for removal is going to remove the device from domain definition. But qemuDomainWaitForDeviceRemoval() may still receive timeout from virDomainObjWaitUntil() which is implemented by pthread_cond_timedwait() due to an unavoidable race between the expiration of the timeout and the predicate state(priv->unplug.alias) change. And then qemuDomainWaitForDeviceRemoval() will return 0, thus the caller will not remove the device from domain definition. In this situation, the device is still present in the domain definition but doesn't exist in qemu anymore. Worse, there is no way to remove it from the domain definition. Solution is to recheck the value of priv->unplug.alias to determine who is going to remove the device from domain definition. Signed-off-by: zuo boqun <zuoboqun@baidu.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-26 13:40:58 +02:00
Jean-Louis Dupond	b855f8ea1e	Add discard_no_unref option for qcow2 images Qemu 8.1.0 will add discard_no_unref option for qcow2 images. When this option is enabled (default=false), then it will no longer unreference clusters when guest does a discard, but it will just free the blocks (useful for incremental backups for example) and pass the discard to the lower layer. This was implemented to avoid fragmentation within the qcow2 image. Signed-off-by: Jean-Louis Dupond <jean-louis@dupond.be> Reviewed-by: Peter Krempa <pkrempa@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 13:06:00 +02:00
Peter Krempa	1092a88e13	qemu: capabilities: Introduce QEMU_CAPS_QCOW2_DISCARD_NO_UNREF The qcow2 driver allows passing discards to the storage while keeping the reference of the block, and just marking it as zeroed. This can decrease the levels of fragmentation of the qcow2 metadata when discards are enabled. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 13:00:54 +02:00
Peter Krempa	e4b4765f4f	conf: Allow omitting 'slots' attribute of <maxMemory> Memory slots are required only for DIMM-like devices, but the maximum memory address space is relevant also for other non-DIMM memory devices such as virtio-mem. Allow configurations where no slots are added. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 12:58:24 +02:00
Peter Krempa	e3ce39195c	qemu_domain: Properly validate count of memory slots Memory slots are required only for DIMM-like devices, while other devices defined via <memory> such as virtio-mem may use the PCI bus and thus do not require/consume a memory slot. Fix the validation code to calculate the required count of memory devices only for DIMMs and NVDIMMs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 12:58:24 +02:00
Peter Krempa	a52c68443d	qemu_command: Always use modern syntax of '-m' Specify the memory size by using '-m size=2048k' instead of just '-m 2'. The new syntax is used when memory hotplug is enabled. To preserve memory sizing, if memory hotplug is disabled the size is rounded down to the nearest mebibyte. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-26 12:58:23 +02:00
Michal Privoznik	1dddd44451	virGlobalInit: Make glib init its own global state This should not be needed, but here's what's happening: virStrToLong_() family of functions was switched from strtol() to g_ascii_strtol() in order to handle corner cases on Windows (most notably parsing hex numbers with base=0) - see v9.4.0-61-g2ed41d7cd9. But what we did not realize back then, is the fact that g_ascii_strtol() family has their own global lock rendering virStrToLong_() function unsafe between fork() + exec(). Worse, if one of the threads has to wait for the lock (or on its corresponding condition), then errno is mangled and g_ascii_strtol() signals an error, even though there's no error. Read more here: https://gitlab.gnome.org/GNOME/glib/-/issues/3034 Nevertheless, if we make glib init the g_ascii_strtol() global state (by calling one function from g_ascii_strtol() family), then there shouldn't be any congestion on the lock and thus no errno mangling. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2023-06-22 10:56:13 +02:00
Jiri Denemark	ce6d1dca6d	qemu: Include maximum physical address size in baseline CPU The current implementation of virConnectBaselineHypervisorCPU in QEMU driver can provide a CPU definition that will not work on all hosts in case they have different maximum physical address size. So when we get the info from domain capabilities, we need to choose the smallest physical address size for the computed baseline CPU definition. https://bugzilla.redhat.com/show_bug.cgi?id=2171860 Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-16 12:44:54 +02:00
Jiri Denemark	be1b7d5b18	qemu: Report physical address size in domain capabilities We already report the hosts physical address size in host capabilities, but computing a baseline CPU definition is done from domain capabilities. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-16 12:44:54 +02:00
Martin Kletzander	078e645cc3	conf/nwfilter: Initialize size_t attr to zero Newer GCC (13.1.1 in my case) wrongly reports "maybe uninitialized" warning for this variable inside the next condition. Even though this accusation is wrong (the condition is guarded by the same condition as the for cycle initializing it), initialize it during the declaration so compilation errors don't stop others and maybe also future proof the code for changes. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-14 12:47:55 +02:00
Martin Kletzander	2ed41d7cd9	util: Use g_ascii_strtoll This has two main advantages: - it parses the number with C locale explicitly - it behaves the same on Windows as on Linux and BSD both of which are wanted behaviours. Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-14 12:47:55 +02:00
Martin Kletzander	fd340227a7	util: Remove virStrToLong_l With the last user gone this function can be abolished. It is preferable to use _ll instead since that is not a subject to 32/64 bit scaling. Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-14 12:47:54 +02:00
Martin Kletzander	e940cac98e	util: Parse RSS into ullp It is used to fill an unsigned long long anyway and if it is negative than there is really an issue somewhere. Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-14 12:47:54 +02:00
Ján Tomko	a7ee9eac83	conf: virtiofs: validate that the target dir is unique even for hotplug https://bugzilla.redhat.com/show_bug.cgi?id=2171384 Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-12 13:06:31 +02:00
Michal Privoznik	d09b73b560	qemu: Drop @unionMems argument from qemuProcessSetupPid() The @unionMems argument of qemuProcessSetupPid() function is not necessary really as all callers pass 'true'. Drop it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-08 09:39:20 +02:00
Michal Privoznik	83adba541a	qemu: Allow more generous cpuset.mems for vCPUs and IOThreads The unit that cpuset CGroups controller works with is a thread/process, not individual memory allocations. Therefore, after we've set cpuset.mems for emulator (after previous commit it's set to union of all host NUMA nodes allowed for given domain), and as we try to set up cpuset.mems for vCPUs/IOThreads, memory is migrated to selected NUMA node(s). We are effectively saying: "this thread (vCPU thread) can have memory only from these NUMA node(s)". That's not really what we want though. The cpuset controller doesn't differentiate memory "belonging" to the emulator thread and vCPU thread or IOThread even. Therefore, set union of all allowed host NUMA nodes, just like we're doing for the emulator thread. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2138150 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-08 09:39:20 +02:00
Michal Privoznik	fddbb2f12f	qemu: Don't try to 'fix up' cpuset.mems after QEMU's memory allocation In ideal world, my plan was perfect. We allow union of all host nodes in cpuset.mems and once QEMU has allocated its memory, we 'fix up' restriction of its emulator thread by writing the original value we wanted to set all along. But in fact, we can't do it because that triggers memory movement. For instance, consider the following <numatune/>: <numatune> <memory mode="strict" nodeset="0"/> <memnode cellid="1" mode="strict" nodeset="1"/> </numatune> <numa> <cell id="0" cpus="0-1" memory="1024000" unit="KiB" /> <cell id="1" cpus="2-3" memory="1048576" unit="KiB"/> </numa> This is meant to create 1:1 mapping between guest and host NUMA nodes. So we start QEMU with cpuset.mems set to "0-1" (so that it can allocate memory even for guest node #1 and have the memory come fro host node #1) and then, set cpuset.mems to "0" (because that's where we wanted emulator thread to live). But this in turn triggers movement of all memory (even the allocated one) to host NUMA node #0. Therefore, we have to just keep cpuset.mems untouched and rely on .host-nodes passed on the QEMU cmd line. The placement still suffers because of cpuset.mems set for vcpus or iothreads, but that's fixed in next commit. Fixes: `3ec6d586bc` Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-08 09:39:20 +02:00
Jim Fehlig	9b743ee190	apparmor: Add support for local profile customizations Apparmor profiles in /etc/apparmor.d/ are config files that can and should be replaced on package upgrade, which introduces the potential to overwrite any local changes. Apparmor supports local profile customizations via /etc/apparmor.d/local/<service> [1]. This change makes the support explicit by adding libvirtd, virtqemud, and virtxend profile customization stubs to /etc/apparmor.d/local/. The stubs are conditionally included by the corresponding main profiles. [1] https://ubuntu.com/server/docs/security-apparmor See "Profile customization" section Signed-off-by: Jim Fehlig <jfehlig@suse.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-07 11:45:55 -06:00
Michal Privoznik	1b8c1ce704	virnetsshsession: Adapt to changed libssh2 API In one of its commits [1] libssh2 changed the 'text' member of LIBSSH2_USERAUTH_KBDINT_PROMPT struct from 'char' to 'unsigned char'. But we g_strdup() the member in order to fill 'prompt' member of virConnectCredential struct. Typecast the value to avoid warnings. Also, drop @prompt variable, as it's needless. 1: `83853f8aea` Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2023-06-05 16:06:46 +02:00
Peter Krempa	bc2c392bd1	conf: nwfilter: Refactor virNWFilterFormatParamAttributes Use virXMLFormatElement and simplify the formatter. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	3a5f9a4041	conf: nwfilter: Refactor virNWFilterIncludeParse Use automatic memory freeing and modern XML parsers to simplify the function. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	9cb4e78ffd	virNWFilterRuleDefDetailsFormat: Refactor formatter Format the rule attributes in two passes, first for positive 'match' and second pass for negative. This removes the crazy logic for switching between match modes inside the formatter. The refactor makes it also more clear in which cases we actually do format something. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	59a1455012	virNWFilterRuleParse: Refactor attribute parser Use virXMLNodeGetSubelementList to get the elements to process. The new approach documents the complexity of the parser, which is designed to ignore unknown attributes and parse only a single kind of them after finding the first valid one. Note that the XML schema doesn't actually allow having multiple sub-elements, but I'm not sure how that translates to actual configs present. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	3774fca88f	virNWFilterRuleParse: Parse 'priority' via 'virXMLPropInt' Use modern parsing. Invalid numbers are now rejected. Semantis for numbers out of range is preserved. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	f0c5da0f24	virNWFilterRuleDef: Turn 'action' and 'tt' into proper enum types Convert the fields to the proper types and use virXMLPropEnum for parsing. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	43f280cc65	conf: nwfilter: Refactor XML formatting in virNWFilterRuleDefFormat Use virXMLFormatElement to simplify the formatter. Drop return value of virNWFilterRuleDefFormat as there are no errors to report. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	d50eb74b9d	conf: network: Refactor XML parsing in virNetworkDHCPDefParseXML Use virXMLNodeGetSubelement(List) instead of the looped parser and simplify the code. Note that handling of the 'bootp' element now conforms to the schema where we allow just one and the 'file' attribute is mandatory. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	dcd49d2cd6	util: xml: Introduce virXMLNodeGetSubelementList The new helper is similar to virXPathNodeSet list but for cases where we want to get subelements directly rather than using XPath. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	4aafa900ce	virNetworkDHCPDefParseXML: Refactor cleanup There's nothing to clean up in the 'host' local variable on error as the function which fills it makes sure to fill it only on success. In such case it's also directly assigned to the array thus the 'host' variable is cleared. Remove the 'cleanup' label and 'ret' variable as we can now directly return -1 on error. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	355582e3df	virNetDevBandwidthParse: Use virXMLNodeGetSubelement instead of looped parser Extract the 'inbound'/'outbound' subelements using virXMLNodeGetSubelement to simplify the code. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	036e677886	virNetDevBandwidthParseRate: Refactor parsing Remove the unnecessary check for valid arguments and use virXMLPropULongLong instead of hand-written property parsers. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	ede5ee9eca	virNetDevBandwidthParse: Use 'virXMLPropUInt' to parse 'classID' Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	92332ade1f	virNetDevBandwidthParse: Don't validate element name Callers make sure to pass the correct element. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:23:06 +02:00
Peter Krempa	9d6867198d	qemuMonitorSetBlockIoThrottle: Drop 'diskalias' argument Every caller will pass 'qdevid' as it's populated in the data mandatorily with qemu-4.2 and onwards due to mandatory -blockdev use. Thus we can drop compatibility with the old way of matching the disk via alias. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	f0296aa9a3	qemuMonitorGetBlockIoThrottle: Drop 'diskalias' argument Every caller will pass 'qdevid' as it's populated in the data mandatorily with qemu-4.2 and onwards due to mandatory -blockdev use. Thus we can drop compatibility with the old way of matching the disk via alias. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	4ba032a2a2	qemu: Refuse setting <iotune> for 'SD' disks Historically this didn't work with any supported qemu version as we don't set the alias of the device, and thus qemu uses a different alias resulting in a failure to startup the VM: internal error: unable to execute QEMU command 'block_set_io_throttle': Device 'drive-sd-disk0' not found Refuse setting throttling as this is unlikely to be needed and proper fix requires using -device instead of -drive if=sd. Note that this was broken when I moved the setup of throttling as a command at startup for blockdev integration quite a while ago. Until then throttling was passed as arguments for -drive. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	6d6a87f229	qemuDiskConfigBlkdeviotuneEnabled: Make 'disk' argument const The function doesn't modify it. Fix the argument declaration so that the function can be used in a context where we have a 'const' disk definition. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:13 +02:00
Peter Krempa	f8625248ac	virRaiseErrorLog: Don't skip error printing when enabling debug logging env variable When a user requests debug logging by setting the environment variable: LIBVIRT_DEBUG=1 we should log any errors regardless of the setting of e.g. 'LIBVIRT_LOG_OUTPUTS' as the code will log every 'debug' and 'info' level message to stderr but will skip 'error' level messages. This obviously makes debugging things very complicated as you can get to a situation when the error itself is missing. This can happen e.g. in tests. Fix the issue by probing the default log level and calling the logger if it's set for VIR_LOG_DEBUG. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-05 13:20:12 +02:00
Michal Privoznik	8b9d2bda8a	qemu: Set proper PCI backend for <interface/>-s that are actually hostdevs When starting a domain, it's done so in two steps (actually more, but lets focus on just the following two): 1) qemuProcessPrepareDomain(), followed by 2) qemuProcessPrepareHost(). Now, in the first step (PrepareDomain()), PCI backends for all hostdevs is set (qemuProcessPrepareDomain() -> qemuProcessPrepareDomainHostdevs() -> qemuDomainPrepareHostdev() -> qemuDomainPrepareHostdevPCI()). Perfect. But then, additional hostdevs may appear, because in the host prepare phase we may insert some hostdevs into domain definition (qemuProcessPrepareHost() -> qemuProcessNetworkPrepareDevices()). Now, these additional hostdevs don't undergo the same prepare as hostdevs that were already present in the domain definition (i.e. in qemuProcessPrepareDomain() phase). Therefore, we have to call corresponding prepare function explicitly. NB, the interface hotplug code (qemuDomainAttachNetDevice()) does not suffer from this problem, because it calls top level qemuDomainAttachHostDevice() which is used to hotplug regular hostdevs too and as such calls qemuDomainPrepareHostdev(). Fixes: `3b87709c76` Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2209853 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-06-05 12:18:53 +02:00
Michal Privoznik	1c7335add9	qemu_passt: Format portForward device even without address It's almost like we've anticipated this. Our XML parser and formatter handles @address and @dev attributes of <portForward/> element completely independent of each other. And as of commit 2023_03_29.b10b983~3 passt allows handling these two separately too. All that's left is generate the cmd line according to this new fact. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2210287 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-01 14:25:08 +02:00
Michal Privoznik	a36318be9d	conf: Reject invalid device's <seclabel relabel='yes'/> with no <label/> We allow (some) domain devices to have a different <seclabel/> than the top level domain one (this is mostly to allow access to a resource for multiple domains). Now, we do couple of sanity checks for such <seclabel/>, e.g. when the <label/> is specified, but '@relabel' is set to no. But what we are missing is the opposite: when '@relabel' is set, but no <label/> was provided. Our schema already denies such combination. Make our parser behave the same. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2160356 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-06-01 13:53:09 +02:00
Ján Tomko	0033998e9a	conf: node_device: use separate variables for parsing integers In virNodeDeviceGetSCSIHostCaps, there is a pattern of reusing a tmp value and stealing the pointer. But in two case it is not stolen. Use separate variables for them to avoid mixing autofree with manual free() calls. Fixes: `8a0cb5f73a` Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-06-01 10:31:31 +02:00
Michal Privoznik	a1bdffdd96	qemu_command: Generate .memaddr for virtio-mem and virtio-pmem This is fairly trivial. Just set .memaddr attribute if a value was set in the XML. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2180679 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-26 16:44:45 +02:00
Michal Privoznik	2c15506254	qemu: Fill virtio-mem/virtio-pmem .memaddr at runtime After a QEMU domain is started, among other thing we query memory device information. And while memory address is returned by QEMU for all models, we store it only for DIMMs and NVDIMMs. Do store it for VIRTIO_MEM and VIRTIO_PMEM too. This effectively reports the address the virtio-mem/virtio-pmem is mapped to in live XML. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-26 16:44:45 +02:00
Michal Privoznik	677156f662	conf: Introduce <address/> for virtio-mem and virtio-pmem Both virtio-mem and virtio-pmem devices have '.memaddr' attribute which controls the address where they are mapped in the guest memory. Ideally, users do not need to specify this as QEMU does the right thing and computes addresses automatically on startup. But soon, we will need to record this address as it is part of guest ABI. And also, there might be some users that want to control this value. Now, we are in a bit of a pickle, because both these device types already have a PCI address, therefore we can't just use <address/> blindly. But what we can do, is introduce <address/> under the <target/> element. This is also more conceptual, as knobs under <target/> control guest visible config of memory device (and .memaddr surely falls into that category). NB, SgxEPCDeviceInfo struct in QMP definition also has .memaddr attribute, but because of the way we build cmd line there's no (easy) way to set the attribute. So ignore that for now. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-26 16:44:42 +02:00
Michal Privoznik	f08309d868	conf: Run virDomainInputDefPostParse() only for VIR_DOMAIN_DEVICE_INPUT Due to missed break; statement the virDomainInputDefPostParse() is called not only for VIR_DOMAIN_DEVICE_INPUT but also VIR_DOMAIN_DEVICE_LEASE and VIR_DOMAIN_DEVICE_NET, which can lead to all sort of unpredictable results. Fixes: `c4bc4d3b82` Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-05-26 16:28:54 +02:00
Tim Wiederhake	1a86c27edb	sync_qemu_models_i386.py: Add missing features This brings the tool's list of features in sync with qemu commit 886c0453cbf10eebd42a9ccf89c3e46eb389c357. Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:30 +02:00
Tim Wiederhake	af6b5c1edf	sync_qemu_models_i386.py: Fix complaint from flake8 Line was too long. Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:27 +02:00
Tim Wiederhake	b4560dc69b	cpu_map: Add missing feature "vnmi" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:24 +02:00
Tim Wiederhake	b038d52e30	cpu_map: Add missing feature "stibp-always-on" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:21 +02:00
Tim Wiederhake	5d1a8d0053	cpu_map: Add missing feature "prefetchiti" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:18 +02:00
Tim Wiederhake	e680f69f77	cpu_map: Add missing feature "null-sel-clr-base" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:15 +02:00
Tim Wiederhake	822c5df993	cpu_map: Add missing feature "no-nested-data-bp" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:12 +02:00
Tim Wiederhake	0876e2cd9f	cpu_map: Add missing feature "lfence-always-serializing" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:09 +02:00
Tim Wiederhake	877f609b8c	cpu_map: Add missing feature "flush-l1d" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:06 +02:00
Tim Wiederhake	bd93b13294	cpu_map: Add missing feature "fb-clear" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:03 +02:00
Tim Wiederhake	aff0caeae8	cpu_map: Add missing feature "cmpccxadd" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:43:00 +02:00
Tim Wiederhake	7d57f0f1e5	cpu_map: Add missing feature "avx-vnni-int8" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:42:57 +02:00
Tim Wiederhake	3836efdaaf	cpu_map: Add missing feature "avx-ne-convert" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:42:54 +02:00
Tim Wiederhake	c8c44a59fe	cpu_map: Add missing feature "avx-ifma" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:42:52 +02:00
Tim Wiederhake	01be1472f9	cpu_map: Add missing feature "auto-ibrs" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:42:49 +02:00
Tim Wiederhake	c0dfb6b953	cpu_map: Add missing feature "amx-fp16" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:42:45 +02:00
Tim Wiederhake	c8d67f644c	cpu_map: Add missing feature "amd-psfd" Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-26 08:42:29 +02:00
Lin Yang	96c8d39af0	cpu_map: Add SapphireRapids CPU model Introduced in QEMU by commit v8.0.0-7eb061b06e. Signed-off-by: Lin Yang <lin.a.yang@intel.com> Signed-off-by: Tim Wiederhake <twiederh@redhat.com> Reviewed-by: Tim Wiederhake <twiederh@redhat.com>	2023-05-24 12:48:09 +02:00
Boris Fiuczynski	44a0f2f0c8	nodedev: update transient mdevs Instead of updating defined mdevs only add another update for active devices as well to cover transient mdev devices as well. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2143158 Signed-off-by: Boris Fiuczynski <fiuczy@linux.ibm.com> Reviewed-by: Jonathon Jongsma <jjongsma@redhat.com>	2023-05-23 11:15:13 -05:00
Michal Privoznik	e53291514c	qemu_hotplug: Temporarily allow emulator thread to access other NUMA nodes during mem hotplug Again, this fixes the same problem as one of previous commits, but this time for memory hotplug. Long story short, if there's a domain running and the emulator thread is restricted to a subset of host NUMA nodes, but the memory that's about to be hotplugged requires memory from a host NUMA node that's not in the set we need to allow emulator thread to access the node, temporarily. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-23 17:21:16 +02:00
Michal Privoznik	3ec6d586bc	qemu: Start emulator thread with more generous cpuset.mems Consider a domain with two guest NUMA nodes and the following <numatune/> setting : <numatune> <memory mode="strict" nodeset="0"/> <memnode cellid="0" mode="strict" nodeset="1"/> </numatune> What this means is the emulator thread is pinned onto host NUMA node #0 (by setting corresponding cpuset.mems to "0"), and two memory-backend-* objects are created: -object '{"qom-type":"memory-backend-ram","id":"ram-node0", .., "host-nodes":[1],"policy":"bind"}' \ -numa node,nodeid=0,cpus=0-1,memdev=ram-node0 \ -object '{"qom-type":"memory-backend-ram","id":"ram-node1", .., "host-nodes":[0],"policy":"bind"}' \ -numa node,nodeid=1,cpus=2-3,memdev=ram-node1 \ Note, the emulator thread is pinned well before QEMU is even exec()-ed. Now, the way memory allocation works in QEMU is: the emulator thread calls mmap() followed by mbind() (which is sane, that's how everybody should do it). BUT, because the thread is already restricted by CGroups to just NUMA node #0, calling: mbind(host-nodes:[1]); /* made up syntax (TM) */ fails. This is expected though. Kernel was instructed to place the memory at NUMA node "0" and yet, process is trying to place it elsewhere. We used to solve this by not restricting emulator thread at all initially, and only after it's done initializing (i.e. we got the QMP greeting) we placed it onto desired nodes. But this had its own problems (e.g. QEMU might have locked pieces of its memory which were then unable to migrate onto different NUMA nodes). Therefore, in v5.1.0-rc1~282 we've changed this and set cgroups upfront (even before exec()-ing QEMU). And this used to work, but something has changed (I can't really put my finger on it). Therefore, for the initialization start the thread with union of all configured host NUMA nodes ("0-1" in our example) and fix the placement only after QEMU is started. NB, the memory hotplug suffers the same problem, but that will be fixed in the next commit. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2138150 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-23 17:21:16 +02:00
Michal Privoznik	c4a7f8007c	qemuProcessSetupPid: Use @numatune variable more Inside of qemuProcessSetupPid() there's @numatune variable which is set to vm->def->numa, but it lives only in one block. In the rest of places the expanded form (vm->def->numa) is used instead. Move the variable declaration at the beginning of the function and use it instead of the expanded form. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-23 17:21:16 +02:00
Martin Kletzander	1bb439e4b0	qemu: Use thread-context even with numatune's restrictive mode We cannot use host-nodes attribute for it, but there is no reason for us to skip the preallocation optimisation using thread-context in such case. Thankfully returning the proper nodemask from qemuBuildMemoryBackendProps is enough to trigger this. Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-05-23 17:04:08 +02:00
Andrea Bolognani	3b6d69237f	Revert "conf: Introduce MTE domain feature" The QEMU interface is still in a state of flux, and KVM support has been pulled shortly after having been merged. Let's not commit to a stable interface in libvirt just yet. Reverts: `720e8f13ff` Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Cornelia Huck <cohuck@redhat.com>	2023-05-22 15:13:19 +02:00
Andrea Bolognani	4fd5f0d660	Revert "qemu:: Introduce QEMU_CAPS_MACHINE_VIRT_MTE capability" The QEMU interface is still in a state of flux, and KVM support has been pulled shortly after having been merged. Let's not commit to a stable interface in libvirt just yet. Reverts: `1347a19f75` Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Cornelia Huck <cohuck@redhat.com>	2023-05-22 15:13:18 +02:00
Andrea Bolognani	178a66f9af	Revert "qemu: Validate MTE feature" The QEMU interface is still in a state of flux, and KVM support has been pulled shortly after having been merged. Let's not commit to a stable interface in libvirt just yet. Reverts: `c6c9b5d251` Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Cornelia Huck <cohuck@redhat.com>	2023-05-22 15:13:17 +02:00
Andrea Bolognani	167138a525	Revert "qemu: Generate command line for MTE feature" The QEMU interface is still in a state of flux, and KVM support has been pulled shortly after having been merged. Let's not commit to a stable interface in libvirt just yet. Reverts: `b10bc8f7ab` Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Cornelia Huck <cohuck@redhat.com>	2023-05-22 15:12:51 +02:00
Michal Privoznik	17c8a173b6	numa_conf: Deny other memory modes than 'restrictive' if a memnode is 'restrictive' We already do check that if there's <memory mode='restrictive'/> then all <memnode/> have to be of 'restrictive' mode too. But what we are missing the reverse: if there is <memnode/> with 'restrictive' mode, then the <memory/> has to be of the same mode too. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2208946 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-22 13:58:03 +02:00
Michal Privoznik	f6ba9fc12a	numa_conf: Move memnode mode validation into virDomainNumaDefValidate() When parsing a <memnode/> we also check whether the @mode argument fulfills some requirements wrt 'restrictive' mode. This is not the right place though. There's virDomainNumaDefValidate() which contains other checks. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-22 13:57:44 +02:00
Michal Privoznik	a152d856c3	virDomainNumatuneNodeSpecified: Fix const correctness The virDomainNumatuneNodeSpecified() function does not write into passed @numatune pointer, it just reads from it. Therefore, the argument should be const, which allows this function to be called from places where virDomainNuma is already const (e.g. domain validation code). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-22 13:49:34 +02:00
Jiang Jiacheng	ffa258a39d	qemu: support set parallel migration compression method Add new compress methods zlib and zstd for parallel migration, these method should be used with migration option --comp-methods and will be processed in 'qemuMigrationParamsSetCompression'. Note that only one compress method could be chosen for parallel migration and they cann't be used in compress migration. Signed-off-by: Jiang Jiacheng <jiangjiacheng@huawei.com> Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Jiri Denemark <jdenemar@redhat.com>	2023-05-18 15:47:30 +02:00
Peter Krempa	af621caa6b	conf: numa: Allow formatting 'none' values for 'associativity' and 'policy' of cache The parser makes the values mandatory and also the qemu code implements actions for those values. The formatter skips them though. Since format+parse is used to copy the XML at startup a definition with those values can't be started. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2203709 Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-17 10:07:18 +02:00
Peter Krempa	0d5fc7219a	virDomainNumaDefNodeCacheParseXML: Refactor parsing of cache XML Use virXMLProp* helpers to simplify the code. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-17 10:07:18 +02:00
Michal Privoznik	b10bc8f7ab	qemu: Generate command line for MTE feature This is pretty trivial, just append "mte=on/off" to -machine arguments. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 17:43:05 +02:00
Michal Privoznik	c6c9b5d251	qemu: Validate MTE feature The MTE feature is not supported by all QEMUs, only those with QEMU_CAPS_MACHINE_VIRT_MTE capability. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 17:43:03 +02:00
Michal Privoznik	1347a19f75	qemu:: Introduce QEMU_CAPS_MACHINE_VIRT_MTE capability The MTE feature (introduced in QEMU commit of v5.1.0-rc1~8^2~11) is detectable via 'qom-list-properties' for 'virt' machine type. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 17:43:00 +02:00
Michal Privoznik	720e8f13ff	conf: Introduce MTE domain feature The Memory Tagging Extensions are hardware acceleration present in some ARM processors that allow memory error detection [1]. Introduce a domain XML knob that turns them on or off. 1: https://www.arm.com/blogs/blueprint/memory-safety-arm-memory-tagging-extension Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 17:42:58 +02:00
Michal Privoznik	37e41b7f16	qemu: Drop @forceVFIO argument of qemuDomainGetMemLockLimitBytes() After previous cleanup, there's not a single caller that would call qemuDomainGetMemLockLimitBytes() with @forceVFIO set. All callers pass false. Drop the unneeded argument from the function. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 14:43:43 +02:00
Michal Privoznik	4f355fa5b7	qemu: Drop @forceVFIO argument of qemuDomainAdjustMaxMemLock() After previous cleanup, there's not a single caller that would call qemuDomainAdjustMaxMemLock() with @forceVFIO set. All callers pass false. Drop the unneeded argument from the function. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 14:43:43 +02:00
Michal Privoznik	c925bb9273	qemu_domin: Account for NVMe disks when calculating memlock limit on hotplug During hotplug of a NVMe disk we need to adjust the memlock limit. The computation of the limit is handled by qemuDomainGetMemLockLimitBytes() which looks at given domain definition and accounts for various device types (as different types require different amounts). But during disk hotplug the disk is not added to domain definition until the very last moment. Therefore, qemuDomainGetMemLockLimitBytes() has this @forceVFIO argument which tells it to assume VFIO even if there are no signs of VFIO in domain definition. And this kind of works, until the amount needed for NVMe disks changed (in v9.3.0-rc1~52). What's missing in the commit is making @forceVFIO behave the same as if there was an NVMe disk present in the domain definition. But, we can do even better - just mimic whatever we're doing for hostdevs. IOW - introduce qemuDomainAdjustMaxMemLockNVMe() that behaves the same as qemuDomainAdjustMaxMemLockHostdev(). There are subtle differences though: 1) qemuDomainAdjustMaxMemLockHostdev() can afford placing hostdev right at the end of vm->def->hostdevs, because the array was already reallocated (at the beginning of qemuDomainAttachHostPCIDevice()). But qemuDomainAdjustMaxMemLockNVMe() doesn't have that luxury. 2) qemuDomainAdjustMaxMemLockHostdev() places a virDomainHostdevDef pointer into domain definition, while qemuDomainStorageSourceAccessModifyNVMe() (which calls qemuDomainAdjustMaxMemLock()) sees a virStorageSource pointer but domain definition contains virDomainDiskDef. But that's okay, we can create a dummy disk definition and append it into the domain definition. After this, qemuDomainAdjustMaxMemLock() can be called with @forceVFIO = false, as the disk is now part of domain definition (when computing the new limit). Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2014030#c28 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-16 14:43:42 +02:00
Peter Krempa	9884e89236	schemas: backup: Allow missing 'type' attribute for backup disk One of our examples in the 'formatbackup.rst' page shows following config: <disk name='vda' backup='yes'/> The schema didn't allow it though. Fix the schema as the internals were supposed to support it (except for the bug fixed in previous patches). Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-05-16 10:26:33 +02:00
Peter Krempa	1222ada2af	conf: backup: Fix logic for generating default backup filenames If the 'disk->store' property is already allocated which happens e.g. when the disk is described by the backup XML but the optional filename is not filled in 'virDomainBackupDefAssignStore' would not fill in the default location. Fix the logic to do it also if a 'virStorageSource' categorizes as empty. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-05-16 10:26:33 +02:00
Peter Krempa	de48ade195	virDomainBackupDefAssignStore: Restructure control flow Return early for errors instead of using 'else' branches. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2023-05-16 10:26:33 +02:00
Andrea Bolognani	517d76466b	qemu: Update documentation for dbus_daemon qemu.conf key Reflect the new default value, and explain that a runtime lookup will be performed if the value is not an absolute path. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-11 15:04:56 +02:00
Andrea Bolognani	4400f63636	meson: Stop looking for dbus-daemon Now that we're performing the lookup at runtime, doing it at build time is no longer necessary. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-11 15:04:54 +02:00
Andrea Bolognani	769de39f50	qemu: Find dbus-daemon at runtime Don't bother looking at /usr/libexec, since every distro ships dbus-daemon in $PATH. Note that it's still possible for the administrator to prevent this lookup and use an arbitrary binary by setting the appropriate key in qemu.conf. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2023-05-11 15:04:50 +02:00
Andrea Bolognani	db91bf2ba3	qemu: Update documentation for qemu.conf keys Reflect the new default value, and explain that a runtime lookup will be performed if the value is not an absolute path. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-10 18:54:15 +02:00
Andrea Bolognani	b134a9bd2a	meson: Stop looking for QEMU helpers Now that we're performing the lookup at runtime, doing it at build time is no longer necessary. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-10 18:54:12 +02:00
Andrea Bolognani	934113d376	qemu: Find helpers at runtime Use the recently introduced virFindFileInPathFull() function to discover the path for qemu-bridge-helper and qemu-pr-helper at runtime. Note that it's still possible for the administrator to prevent this lookup and use arbitrary binaries by setting the appropriate keys in qemu.conf: this simply removes the need to perform the lookup at build time, and thus to have the helpers installed in the build environment. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-10 18:54:09 +02:00
Andrea Bolognani	ef91f9e52a	util: Introduce virFileFindInPathFull() Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2023-05-10 18:54:08 +02:00

1 2 3 4 5 ...

34759 Commits