libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2025-01-05 12:35:20 +00:00

Author	SHA1	Message	Date
Cole Robinson	270583ed98	qemu: conf: Cache domCaps in qemuCaps qemuCaps is tied to a binary on disk. domCaps is tied to a combo of binary+machine+arch+virttype values. For the qemu driver this almost entirely translates to a permutation of qemuCaps though Upcoming patches want to use the domCaps data store at XML validate time, but we need to cache the data so we aren't repeatedly regenerating it. Add a domCapsCache hash table to qemuCaps. This ensures that the domCaps cache is blown away whenever qemuCaps needs to be regenerated. Similarly when qemuCaps is invalidated, the next call to virQEMUCapsCacheLookup will unref qemuCaps and free our cache as well. Adjust virQEMUDriverGetDomainCapabilities to search the cache and add to it if we don't find a hit. Signed-off-by: Cole Robinson <crobinso@redhat.com> Reviewed-by: Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-08-06 19:45:49 -04:00
Cole Robinson	d05bdff79b	qemu: conf: add virQEMUDriverGetDomainCapabilities For now it's just a helper for building a qemu virDomainCapsPtr, used in qemuConnectGetDomainCapabilities Reviewed-by: Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-08-06 19:45:49 -04:00
Cole Robinson	928508f669	qemu: capabilities: fill in domcaps <rng> The model logic is taken from qemuDomainRNGDefValidate Reviewed-by: Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-08-06 19:45:49 -04:00
Michal Privoznik	0e66f0669a	qemuDomainGetResctrlMonData: Switch to switch() This way it is obvious when adding a new resource control type that stats helper func needs to be updated too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-08-06 14:37:30 +02:00
Michal Privoznik	9fc616cc10	qemuDomainGetResctrlMonData: Dereference resctrl monitor iff not NULL If the host doesn't have resctrl then the monitor is going to be NULL and we must avoid dereferencing it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-08-06 14:37:30 +02:00
Michal Privoznik	9801ee899f	qemuDomainGetResctrlMonData: Don't leak @caps The capabilities object must be unrefed when no longer needed. Use VIR_AUTOUNREF() for that. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-08-06 14:37:30 +02:00
Michal Privoznik	610858d282	qemu_domain: Separate VFIO code This piece of code will be re-used later. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> ACKed-by: Peter Krempa <pkrempa@redhat.com>	2019-08-06 11:21:43 +02:00
Michal Privoznik	22fc83df92	qemuDomainDeviceDefValidateDisk: Reorder some checks I find this function more readable if checks for passed storage source are done first and backing chain is done last. Mixing them together does not hurt, but is less readable. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> ACKed-by: Peter Krempa <pkrempa@redhat.com>	2019-08-06 11:20:35 +02:00
Michal Privoznik	f0c50bc1ce	lib: Unify PCI address formatting The format string for a PCI address is copied over and over again, often with slight adjustments. Introduce global VIR_PCI_DEVICE_ADDRESS_FMT macro that holds the formatting string and use it wherever possible. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-08-05 19:42:15 +02:00
Michal Privoznik	1737d11e1b	qemuBuildPCIHostdevDevStr: Always format PCI domain onto cmd line While it's true that older QEMUs were not able to deal with PCI domains, we don't support those versions anymore (see `4a42ece13a`). Therefore it is safe to always format fully expanded PCI address. Format PCI domain always as it will simplify next commits. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-08-05 19:42:15 +02:00
Wang Huaqiang	9549a8967a	util: Extend virresctl API to retrieve multiple monitor statistics Export virResctrlMonitorGetStats and make virResctrlMonitorGetCacheOccupancy obsoleted. Signed-off-by: Wang Huaqiang <huaqiang.wang@intel.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-08-05 19:41:12 +02:00
Wang Huaqiang	c09a14e5b4	util: Refactor 'virResctrlMonitorStats' Refactor 'virResctrlMonitorStats' to track multiple statistical records. Signed-off-by: Wang Huaqiang <huaqiang.wang@intel.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-08-05 19:41:12 +02:00
Wang Huaqiang	782dd229ac	util: Refactor and rename 'virResctrlMonitorFreeStats' Refactor and rename 'virResctrlMonitorFreeStats' to 'virResctrlMonitorStatsFree' to free one 'virResctrlMonitorStatsPtr' object. Signed-off-by: Wang Huaqiang <huaqiang.wang@intel.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-08-05 19:41:12 +02:00
Wang Huaqiang	816cef0783	util, conf: Handle default monitor group of an allocation properly 'default monitor of an allocation' is defined as the resctrl monitor group that created along with an resctrl allocation, which is created by resctrl file system. If the monitor group specified in domain configuration file is happened to be a default monitor group of an allocation, then it is not necessary to create monitor group since it is already created. But if an monitor group is not an allocation default group, you should create the group under folder '/sys/fs/resctrl/mon_groups' and fill the vcpu PIDs to 'tasks' file. Signed-off-by: Wang Huaqiang <huaqiang.wang@intel.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-08-05 19:41:11 +02:00
Jiri Denemark	ad9d5d3a6a	cpu: Drop CPUID definition for hv-spinlocks hv-spinlocks is not a CPUID feature and should not be checked as such. While starting a domain with hv-spinlocks enabled, we would report a warning about unsupported hyperv spinlocks feature even though it was set properly. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-30 17:09:53 +02:00
Jiri Denemark	4c62ed6068	qemu: Fix KVM features with QEMU 4.1 Originally the names of the KVM CPU features were only used internally for looking up their CPUID bits. So we used "__kvm_" prefix for them to make sure the names do not collide with normal CPU features stored in our CPU map. But with QEMU 4.1 we check which features were enabled or disabled by a freshly started QEMU process using their names rather than their CPUID bits (mostly because of MSR features). Thus we need to change our made up internal names into the actual names used by QEMU. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Tested-by: Vitaly Kuznetsov <vkuznets@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-29 15:41:51 +02:00
Jiri Denemark	1ddf014fef	cpu: Drop KVM_ from hyperv feature macros All the features are hyperv features even though they are provided by KVM with QEMU. The "KVM" part in the macro names does not make a lot of sense. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Tested-by: Vitaly Kuznetsov <vkuznets@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-29 15:41:50 +02:00
Jiri Denemark	d99e8f01c7	qemu: Prefer dashes for hyperv features Starting with QEMU 4.1, we're using the canonical feature names on the command line and avoid aliases to prepare for possible deprecation of all aliases in QEMU. But we do so only for features from our CPU map, hyperv features defined in the code were unchanged and this patch fixes it. Some features use "hv-" prefix unconditionally because they were introduced recently enough to always support spelling with a dash. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Tested-by: Vitaly Kuznetsov <vkuznets@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-29 15:41:50 +02:00
Jiri Denemark	0ccdd476bb	qemu: Fix hyperv features with QEMU 4.1 Originally the names of the hyperv CPU features were only used internally for looking up their CPUID bits. So we used "__kvm_hv_" prefix for them to make sure the names do not collide with normal CPU features stored in our CPU map. But with QEMU 4.1 we check which features were enabled or disabled by a freshly started QEMU process using their names rather than their CPUID bits (mostly because of MSR features). Thus we need to change our made up internal names into the actual names used by QEMU. Most of the names are only used with QEMU 4.1 and newer and the reset was introduced with QEMU recently enough to already support spelling with "-". Thus we don't need to define them as "hv_" with a translation to "hv-" for new QEMU. Without this patch libvirt would mistakenly report all hyperv features as unavailable and refuse to start any domain using them with QEMU 4.1. Reported-by: Vitaly Kuznetsov <vkuznets@redhat.com> Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Tested-by: Vitaly Kuznetsov <vkuznets@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-29 15:41:50 +02:00
Eric Blake	7efe930ec3	backup: Prevent snapshots and checkpoints at same time Earlier patches mentioned that the initial implementation will prevent snapshots and checkpoints from being used on the same domain at once. However, the actual restriction is done in this separate patch to make it easier to lift that restriction via a revert, when we are finally ready to tackle that integration in the future. Signed-off-by: Eric Blake <eblake@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-29 08:22:29 -05:00
Eric Blake	3a204b036f	backup: Wire up qemu checkpoint commands over QMP Time to actually issue the QMP transactions that create and delete persistent checkpoints, resolving TODOs intentionally left earlier in the series. For create, we only need one transaction: inside, we visit all disks affected by the checkpoint, and create a new enabled bitmap, as well as disabling the bitmap of the first ancestor checkpoint (if any) that also had a bitmap. For deletion, we need multiple QMP calls: for each disk, if there is an ancestor checkpoint with a bitmap, then the bitmap must be merged (including activating the ancestor bitmap if the leaf node changes), all before deleting the bitmap from the checkpoint being removed. Signed-off-by: Eric Blake <eblake@redhat.com>	2019-07-29 08:15:31 -05:00
Eric Blake	e3a4b8f461	backup: qemu: Add helper API for looking up node name Qemu bitmap operations require knowing the node name associated with the format layer (the qcow2 file); as upcoming patches will be grabbing that information frequently, make a helper function to access it. Another potential benefit of this function is that we have a single place where we could insert a QMP node-name scraping call if we don't currently know the node name, when -blockdev is not supported; however, the goal is that we hopefully don't ever have to do that because we instead scrape node names only at the point where they change. Signed-off-by: Eric Blake <eblake@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-29 08:15:11 -05:00
Eric Blake	5f4e079650	backup: qemu: Implement metadata tracking for checkpoint APIs A lot of this work heavily copies from the existing snapshot APIs. What's more, this patch is (intentionally) very similar to the checkpoint code just added in the test driver, to the point that qemu checkpoints are not fully usable in this patch, but it at least bisects and builds cleanly. The separation between patches is done because the grunt work of saving and restoring XML and tracking relations between checkpoints is common to the test driver, while the later patch adding integration with QMP is specific to qemu. Also note that the interlocking to prevent checkpoints and snapshots from existing at the same time will be a separate patch, to make it easier to revert that restriction when we finally round out the design for supporting interaction between the two concepts. Signed-off-by: Eric Blake <eblake@redhat.com>	2019-07-29 08:09:13 -05:00
Eric Blake	63b9c21dd2	backup: qemu: Add directory for tracking checkpoints This is similar to the existing directory for snapshots; the domain will save one xml file per checkpoint, for reloading on the next libvirtd restart. Fortunately, since checkpoints mandate RNG validation, we are assured that the checkpoint name will be usable as a file name (no abuse of '../escape' as a checkpoint name, for example). Signed-off-by: Eric Blake <eblake@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-29 07:54:03 -05:00
Peter Krempa	3f93884a4d	qemu: Add -blockdev support for block commit job Introduce the handler for finalizing a block commit and active bloc commit job which will allow to use it with blockdev. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-29 13:58:26 +02:00
Peter Krempa	1bf3808207	qemu: Add -blockdev support for block pull job Introduce the handler for finalizing a block pull job which will allow to use it with blockdev. This patch also contains some additional machinery which is required to store all the relevant job data in the status XML which will also be reused with other block job types. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-29 13:58:25 +02:00
Stefan Berger	72299db636	tpm: Run swtpm_setup with less parameters on incoming migration In case of an incoming migration we do not need to run swtpm_setup with all the parameters but only want to get the benefit of it creating a TPM state file for us that we can then label with an SELinux label. The actual state will be overwritten by the in- coming state. So we have to pass an indicator for incomingMigration all the way to the command line parameter generation for swtpm_setup. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Marc-André Lureau <marcandre.lureau@redhat.com>	2019-07-27 07:56:00 -04:00
Eric Blake	c82abfdea9	backup: qemu: Detect node names at domain startup If we are using -blockdev, then node names are always available (because we set them). But when not using it, we have to scrape node names from QMP, and want to do so as infrequently as possible. We were scraping node names after reconnecting a new libvirtd to an existing guest (see qemuProcessReconnect), and after any block job that may have changed the set of node names we care about (legacy block jobs), but forgot to scrape the names when first starting a guest. Do so now in order to allow the checkpoint code to always have access to a node name without having to repeat a node name scrape itself. Future patches may need to clean up qemuDomainSetBlockThreshold (if node names are always available, then it doesn't need to repeat a scrape) and/or hotplug and media changes (if the addition of new nodes can result in a null node name, then scraping at that point in time would be appropriate). But for now, this patch addresses only the most common instance of a missing node name. Signed-off-by: Eric Blake <eblake@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-26 16:48:58 -05:00
Stefan Berger	b8358f94e0	tpm: Fix memory leak and use existing variable instead Use the existing variables rather then calling virTPMSwtpmXYZ(). Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: John Ferlan <jferlan@redhat.com> Message-Id: <20190726205633.2041912-2-stefanb@linux.vnet.ibm.com>	2019-07-26 16:32:29 -05:00
Stefan Berger	e1ff8a95c6	tpm: Create empty log file if file was removed Create an empty log file if the log file was removed, otherwise the transaction to set the security labels on the file will fail. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com> Message-Id: <20190726210706.24440-3-stefanb@linux.ibm.com>	2019-07-26 16:32:25 -05:00
Stefan Berger	20b0fd6d21	tpm: Set transationStarted to false if commit failed Set the transactionStarted to false if the commit failed. If this is not done, then the failure path will report 'no transaction is set' and hide more useful error reports. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com> Message-Id: <20190726210706.24440-2-stefanb@linux.ibm.com>	2019-07-26 16:32:25 -05:00
Jiri Denemark	1fd28a2e79	qemu: Translate features in virQEMUCapsGetCPUFeatures Starting with QEMU 4.1 qemuMonitorCPUModelInfo structure in virQEMUCaps stores only canonical feature names which may differ from the name used by libvirt. We need translate these canonical names into libvirt names for further consumption. This fixes a bug in qemuConnectBaselineHypervisorCPU which would remove all features for which libvirt's spelling differs from the QEMU's preferred name. For example, the following result of qemuConnectBaselineHypervisorCPU on my host with QEMU 4.1 is wrong: <cpu mode='custom' match='exact'> <model fallback='forbid'>Skylake-Client</model> <vendor>Intel</vendor> <feature policy='require' name='ss'/> <feature policy='require' name='vmx'/> <feature policy='require' name='hypervisor'/> <feature policy='require' name='clflushopt'/> <feature policy='require' name='umip'/> <feature policy='require' name='arch-capabilities'/> <feature policy='require' name='xsaves'/> <feature policy='require' name='pdpe1gb'/> <feature policy='require' name='invtsc'/> <feature policy='disable' name='pclmuldq'/> <feature policy='disable' name='lahf_lm'/> </cpu> The 'pclmuldq' and 'lahf_lm' should not be disabled in the baseline CPU as they are supported by QEMU on this host. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-07-26 16:37:30 +02:00
Stefan Berger	94b3aa55f8	tpm: Check TPM XML device configuration changes after edit Since swtpm does not support getting started without password once it was created with encryption enabled, we don't allow encryption to be removed. Similarly, we do not allow encryption to be added once swtpm has run. We also prevent chaning the type of the TPM backend since the encrypted state is still around and the next time one was to switch back to the emulator backend and forgot the encryption the TPM would not work. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-26 10:30:59 +01:00
Stefan Berger	a9d6f1c054	tpm: Pass migration key passphrase via fd to swtpm This patch now passes the passphrase as a migration key to swtpm. This now encrypts the state of the TPM while a VM is migrated between hosts or when suspended into a file. Since the migration key secret is the same as the state encryption secret, this now requires that the migration destination host has the same secret value. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-26 10:30:59 +01:00
Stefan Berger	5eeff28585	tpm: Use fd to pass password to swtpm_setup and swtpm Allow vTPM state encryption when swtpm_setup and swtpm support passing a passphrase using a file descriptor. This patch enables the encryption of the vTPM state only. It does not encrypt the state during migration, so the destination secret does not need to have the same password at this point. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-26 10:30:58 +01:00
Stefan Berger	01cf7a1bb9	tpm: Check whether previously found executables were updated Check whether previously found executables were updated and if so look for them again. This helps to use updated features of swtpm and its tools upon updating them. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Marc-André Lureau <marcandre.lureau@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-26 10:30:18 +01:00
Stefan Berger	4777bbdd76	tpm: Move qemuTPMEmulatorInit to virTPMEmulatorInit in virtpm.c Move qemuTPMEmulatorInit to virTPMEmulatorInit in virtpm.c and introduce a few functions to query the executables needed for virCommands. Add locking to protect the tool paths and return a copy of the tool paths to callers wanting to access them so that we can run the initialization function multiples time later on and detect when the executable gets updated. Signed-off-by: Stefan Berger <stefanb@linux.ibm.com> Reviewed-by: Marc-André Lureau <marcandre.lureau@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-26 10:29:57 +01:00
Peter Krempa	56ad575dbe	qemu: blockjob: Ensure that config disk source is identical when modifying it qemuBlockJobRewriteConfigDiskSource rewrites the disk source only according to the 'target'. This means that if someone would change the inactive config of the VM to refer to a different disk a block job would rewrite it when finishing a job which modifies the disk source. Make sure that this does not happen by verifying that the source of the config disk is the same. Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Eric Blake <eblake@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	d3833b0799	qemu: blockjob: Clear out any irrelevant data in copied source Since we copy everything from the original storage source including some runtime data which are not relevant for the config we should clear them. Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Eric Blake <eblake@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	3b27b5f35a	qemu: blockjob: Split out update of persistent XML disk's source on mirror jobs Both active block commit and block copy modify the disk source of the active definition and thus also must modify the corresponding inactive definition source so that the VM starts up later. This is currently implemented in the legacy block job handler but the logic will be useful also for the new handlers. Split it out which also simplifies it. Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Eric Blake <eblake@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	dae7d01322	qemu: blockjob: Register disk->mirror with a job only when required The <mirror> subelement is used in two ways: in a commit job to point to existing storage, and in a block-copy job to point to additional storage. We need a way to track only the distinct storage. This patch introduces qemuBlockJobDiskRegisterMirror which registers the mirror chain separately only for jobs which require it. This also comes with remembering that in the status XML. Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Eric Blake <eblake@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	35a97e4532	qemu: blockjob: Document qemuBlockJobRegister Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Eric Blake <eblake@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	ed1c0aba34	qemu: blockjob: Use proper value when setting disk's READY state Commit `c412383796` used a value from wrong enum when setting the disk's mirrorState variable. This meant that a 'READY' job would show up as 'PIVOTING'. Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Eric Blake <eblake@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	93b77cba0a	qemu: blockjob: Reset 'synchronous' block job handling flag prior to flushing events When returning to asynchronous block job handling the flag which determines the handling method should be reset prior to flushing outstanding events. If there's an event to process the handler may invoke the monitor and another event may be received. We'd not process that one. Reset the flag earlier. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:33 +02:00
Peter Krempa	22a9f08572	qemu: snapshot: Initialize data for inactive config of snapshot earlier qemuDomainSnapshotDiskDataCollect copies the source of the disk from the live config into the inactive config. Move this operation earlier so that if we initialize it for use for the particular instance the run-time-only data is not copied. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	47a12f2752	qemu: block: Use simple backing stores string format if possible In case when the backing store can be represented with something simpler such as a URI we can use it rather than falling back to the json: pseudo-protocol. In cases when it's not worth it (e.g. with the old ugly NBD or RBD strings) let's switch to json. The function is exported as we'll need it when overwriting the ugly strings qemu would come up with during blockjobs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	4f11a1eaf2	qemu: Use virStorageSourceIsEmpty in qemuDomainBlockCommit The block commit API checked 'disk->src->path' to see whether there is a reasonable disk source to be committed. As the top image can be e.g. backed by NBD the check is not good enough. Replace it by virStorageSourceIsEmpty. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	08f23b8ffa	qemu: block: Add helper for generating snapshot transaction for -blockdev For the modern use cases we are going to use 'blockdev-snapshot' instead of 'blockdev-snapshot-sync'. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	dfc980ab97	qemu: Add possibility to prepare top image only for attachment via blockdev qemuBuildStorageSourceChainAttachPrepareBlockdev prepares the full backing chain for attachment via blockdev. For snapshots we'll need to prepare one image only as it needs to be plugged on top of the existing chain. This patch introduces qemuBuildStorageSourceChainAttachPrepareBlockdevTop which prepares only @top similarly to the original function by splitting out the functionality into an internal function so that the API does not change. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	6115b5e7d0	qemu: command: Fix function name in comment In commit `042c95bd19` qemuBuildStorageSourceChainAttachPrepareBlockdev was added but the comment for the function mentions qemuBuildStorageSourceChainAttachPrepareDrive. Fix the mistake. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	188de4f529	qemu: driver: Remove semi-stale comment about asynchronous job abort Now that we track the job separately we watch only for the abort of the one single block job so the comment is no longer accurate. Also describing asynchronous operation is not really necessary. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	45c37d648d	qemu: Remove stale comment outlining how to extend qemuDomainBlockPivot With -blockdev: - we track the job and check it after restart - have the ability to ask qemu to persist it to collect result - have the ability to report errors. This solves all points the comment outlined so remove it. Also all jobs handle the disk state modification along with the event so there's nothing special the comment says. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	4817b5ca1d	qemu: driver: Blockdevize qemuDomainBlockJobAbort/Pivot Use job-complete/job-abort instead of the blockjob-* variants for blockdev. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	4ed4e35772	qemu: driver: Report error if pivoting fails in qemuDomainBlockJobAbort As the error message is now available and we know whether the job failed we can report an error straight away rather than having the user check the event. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	7005779653	qemu: Use QEMU_BLOCKJOB_STATE_PIVOTING/ABORTING in qemuDomainBlockJobAbort Set the correct job states after the operation is requested in qemu. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	9e200a0f39	qemu: blockjob: Add block job states for abort and pivot operations When initiating a pivot or abort of a block job we need to track which one was initiated. Currently it was done via data stashed in virDomainDiskDef. Add possibility to track this also together with the job itself. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	dd9dc7bfe1	qemu: Make checks in qemuDomainBlockPivot depend on data of the job Do decisions based on the configuration of the job rather than the data stored with the disk. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Peter Krempa	e6f38fdbe5	qemu: driver: blockdevize qemuDomainGetBlockJobInfo Use the stored job name rather than passing in the disk alias when referring to the job which allows the same code to work also when -blockdev will be used. Note that this API does not require the change to use 'query-job' as it will ever only work with blockjobs bound to disks due to the arguments which allow only referring to a disk. For the disk-less jobs we'll need to add a separate API later. The change to qemuMonitorGetBlockJobInfo is required as the API was stripping the 'drive-' prefix when returning the data which is not desired any more. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-25 13:21:32 +02:00
Eric Blake	ceb1019257	snapshot: Don't leak moment obj list metaroot to callers virDomainSnapshotFindByName(list, NULL) should return NULL, rather than the internal-use-only metaroot. Most existing callers pass in a non-NULL name; the few external callers that don't are immediately calling virDomainMomentSetParent (which indeed needs the metaroot rather than NULL if the parent name is NULL); but as the leaky abstraction is ugly, it is worth instead making virDomainMomentSetParent static and adding a new function for resolving the parent link of a brand new moment within its list. The existing external uses of virDomainMomentSetParent always succeed (either the new moment has parent_name of NULL to become a new root, or has parent_name set to a strdup of the previous current moment); hence, our new function does not need a return value (but it still has a VIR_WARN in case future uses break our assumptions about failure being impossible). Missed when commit `02c4e24d` refactored things to attempt to remove direct metaroot manipulations out of the qemu and test drivers into internal-only details, and made more obvious when commit `dc8d3dc6` factored it out into a separate file. Signed-off-by: Eric Blake <eblake@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-24 17:03:34 -05:00
Jim Fehlig	d5572f62e3	qemu: Add support for overriding max threads per process limit Some VM configurations may result in a large number of threads created by the associated qemu process which can exceed the system default limit. The maximum number of threads allowed per process is controlled by the pids cgroup controller and is set to 16k when creating VMs with systemd's machined service. The maximum number of threads per process is recorded in the pids.max file under the machine's pids controller cgroup hierarchy, e.g. $cgrp-mnt/pids/machine.slice/machine-qemu\\x2d1\\x2dtest.scope/pids.max Maximum threads per process is controlled with the TasksMax property of the systemd scope for the machine. This patch adds an option to qemu.conf which can be used to override the maximum number of threads allowed per qemu process. If the value of option is greater than zero, it will be set in the TasksMax property of the machine's scope after creating the machine. Signed-off-by: Jim Fehlig <jfehlig@suse.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-24 15:59:49 -06:00
John Ferlan	e8bf136cff	qemu: Remove unnecessary check in qemuMonitorJSONGetJobInfoOne It's already dereffed in the initialization and shouldn't be NULL unless virJSONValueArraySize after a virJSONValueObjectGetArray could return a NULL data entry. Found by Coverity Signed-off-by: John Ferlan <jferlan@redhat.com> ACKed-by: Peter Krempa <pkrempa@redhat.com>	2019-07-23 10:55:56 -04:00
Peter Krempa	2651b04ba5	qemu: driver: Add debug message when we conjure block job data object Report in logs when we don't find existing block job data and create it just to handle the job. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-19 15:49:40 +02:00
Peter Krempa	d26f3cdedd	qemu: driver: Don't use qemuBlockJobStartupFinalize in processBlockJobEvent While this function does start a block job in case when we'd not be able to get our internal data for it, the handler sets the job state to QEMU_BLOCKJOB_STATE_RUNNING anyways, thus qemuBlockJobStartupFinalize would just unref the job. Since the other usage of qemuBlockJobStartupFinalize in the other part of the event handler was a bug replace this one anyways even if it would not cause problems. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-19 15:49:40 +02:00
Peter Krempa	00c4c971fd	qemu: process: Don't use qemuBlockJobStartupFinalize in qemuProcessHandleBlockJob The block job event handler qemuProcessHandleBlockJob looks at the block job data to see whether the job requires synchronous handling. Since the block job event may arrive before we continue the job handling (if the job has no data to copy) we could hit the state when the job is still set as QEMU_BLOCKJOB_STATE_NEW (as we move it to the QEMU_BLOCKJOB_STATE_RUNNING state only after returning from monitor). If the event handler uses qemuBlockJobStartupFinalize it would unregister and free the job. Thankfully this is not a big problem for legacy blockjobs as we don't need much data for them but since we'd re-instantiate the job data structure we'd report wrong job type for active commit as qemu reports it as a regular commit job. Fix it by not using qemuBlockJobStartupFinalize function in qemuProcessHandleBlockJob as it is not starting the job anyways. https://bugzilla.redhat.com/show_bug.cgi?id=1721375 Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-19 15:49:40 +02:00
Peter Krempa	b4a44ec272	qemu: blockjob: Adjust ATTRIBUTE_NONNULL statements for qemuBlockJobDiskNew Commit `5ff46aaa7f` added a new parameter but neglected to fix the NONNULL declarations. Reported-by: Julio Faracco <jcfaracco@gmail.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2019-07-19 08:47:39 +02:00
Peter Krempa	d524c9a893	qemu: hotplug: Transfer ownership of backing chain to block job on disk unplug When removing the disk fronted while any block job is still active we need to transfer the ownership of the backing chain to the job itself as the job still holds the reference to the chain members and thus attempts to remove them would fail. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	ae4b921f2a	qemu: blockjob: Unplug inherited storage chains when concluding blockjob In cases when the disk frontend was unplugged while a blockjob was running the blockjob inherits the backing chain. When the blockjob is then terminated we need to unplug the chain as it will not be used any more. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	0a9fd83240	qemu: Detect managed persistent reservations in block job orphan chains The PR manager is a property of the format layer in qemu so we need to be able to track it also in the chains of orphaned block jobs. Add a helper for qemu to look also into the blockjob state. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	b3f8ad07dd	qemu: blockjob: Track orphaned backing chains in blockjob status XML When the guest unplugs the disk frontend libvirt is responsible for deleting the backend. Since a blockjob may still have a reference to the backing chain when it is running we'll have to store the metadata for the unplugged disk for future reference. This patch adds 'chain' and 'mirrorChain' fields to 'qemuBlockJobData' to keep them around with the job along with status XML machinery and tests. Later patches will then add code to change the ownership of the chain when unplugging the disk backend. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	59a0306f07	qemu: process: Refresh -blockdev based blockjobs on reconnect to qemu Refresh the state of the jobs and process any events that might have happened while libvirt was not running. The job state processing requires some care to figure out if a job needs to be bumped. For any invalid job try doing our best to cancel it. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	c412383796	qemu: blockjob: Add modern block job event handler Add the infrastructure to handle block job events in the -blockdev era. Some complexity is required as qemu does not bother to notify whether the job was concluded successfully or failed. Thus it's necessary to re-query the monitor. To minimize the possibility of stuck jobs save the state into the XML prior to handling everything so that the reconnect code can potentially continue with the cleanup. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	cbf4e3af70	qemu: Add handler for job state change event Add support for handling the event either synchronously or asynchronously using the event thread. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	b2d6ae674e	qemu: blockjob: Add helper to convert monitor job status to internal state Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	8e2a5c3a4c	qemu: process: Don't trigger BLOCK_JOB* events with -blockdev With blockdev we'll need to use the JOB_STATUS_CHANGE so gate the old events by the blockdev capability. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	e55d64818d	qemu: blockjob: Add 'concluded' state for a block job This new state is entered when qemu finished the job but libvirt does not know whether it was successful or not. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	3b6161a5f2	qemu: driver: Remove unnecessary saving of status XML Now that the blockjob handling code deals with the status XML we don't need to save it explicitly when starting blockjobs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	4cc4357f3e	qemu: blockjob: Save status XML when modifying job state Now that block job data is stored in the status XML portion we need to make sure that everything which changes the state also saves the status XML. The job registering function is used while parsing the status XML so in that case we need to skip the XML saving. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	d3158852fa	qemu: domain: Store blockjob data in the status XML We need to store the block job state in the status XML so that we can properly recover any data when reconnecting after startup and also in the end to be able to do any transition of the backing chain that happened while libvirt was not connected to the monitor. First step is to note the name, type, state and corresponding disk into the status XML. We also need to make sure that a broken blockjob does not make libvirt lose the VM, thus many of the errors just mark the job as invalid. Later on we'll cancel all invalid jobs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	61e9066a69	qemu: blockjob: Add flag for invalid block job data The job data saved in the XML may be partially invalid e.g. if something is missing. To prevent losing a domain with such a job add a flag to the job data so that job APIs can ignore such a job and we can just cancel it. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	3dc496e098	qemu: blockjob: Export functions for allocating and registering job data When parsing the status XML we need to register all existing jobs. Export the functions so that they are usable in other modules. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	8d82e6d98a	qemu: blockjob: Add string convertors for blockjob type and state enums Later on we'll format these values into the status XML so the from/to string functions will come handy. The implementation also notes that these will be used in the status XML to avoid somebody changing the values. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	5ff46aaa7f	qemu: blockjob: Register new and running blockjobs in the global table Add the job structure to the table when instantiating a new job and remove it when it terminates/fails. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	acff582915	qemu: domain: Add global table of blockjobs Block jobs currently belong to disks only so we can look up the block job data for them in the corresponding disks. This won't be the case when using blockdev as certain jobs don't even correspond to a disk and most of them can run on a part of the backing chain. Add a global table of blockjobs which can be used to look up the data for the blockjobs when the job events need to be processed. The table is a hash table organized by job name and has a reference to the job. New and running jobs will later be added to this table. Reference counting will allow to reap job state for synchronous callers. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	9dd12d4ecf	qemu: blockjob: Update new job state earlier in qemuBlockJobEventProcessLegacy The legacy job handler does not look at the old job state so we can update it earlier. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	b6316435e4	qemu: blockjob: Save config only in qemuBlockJobEventProcessLegacyCompleted There's no need to do it if the job is not completed. The new helper allows to do this with much less hassle in the correct place. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	dbdda6aca0	qemu: blockjob: Separate and unify block job (un)registration Rename and move qemuBlockJobTerminate to qemuBlockJobUnregister and separate bits from qemuBlockJobDiskNew which register the job with the disk. This creates an unified interface for other APIs to use. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	0610aa51c4	qemu: blockjob: Use VIR_AUTOUNREF in qemuBlockJobDataNew Simplify error paths. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	a3b0e09242	qemu: domain: Add helper for saving config XML Similarly to qemuDomainSaveStatus add a helper to save the config XML named qemuDomainSaveConfig. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	997746b2df	qemu: domain: Repurpose and export helper for saving domain status XML Rename qemuDomainObjSaveJob and create a wrapper for it which does not require 'driver' to be passed and export it so that other palces can easily save the status XML without having to invoke virDomainSaveStatus which has unpleasing parameters. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	b84c09f41a	qemu: domain: Export qemuDomainPrepareStorageSourceBlockdev Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	ed812441b3	qemu: block: Add generator for creating storage with blockdev-create QEMU allows us to create storage on certain network protocols which allow image creation through their API. Wire up the generator for using it with libvirt as well as for local files. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	2d593705c8	qemu: block: Add generator for image format creation properties 'blockdev-add' allows us to use qemu to format images to our desired format. This patch implements helpers which convert a virStorageSourcePtr into JSON objects describing the required configuration. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	2c4c347c4b	qemu: block: Use 'auto-read-only' instead of 'read-only' for backing chain To allow using -blockdev with blockjobs QEMU needs to reopen files in read-write mode when modifying the backing chain. To achieve this we need to use 'auto-read-only' for the backing files rather than the normal 'read-only' property. That way qemu knows that the files need to be reopened. Note that the format drivers (e.g. qcow2) are still opened with the read-only property enabled when being a member of the backing chain since they are supposed to be immutable unless a block job is started. QEMU v4.0 (since commit 23dece19da4) allows also dynamic behaviour for auto-read-only which allows us to use sVirt as we only grant write permissions to files when doing a blockjob. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	8e0b1f8a9e	qemu: block: Extract formatting of 'driver' attribute from child formatters To allow reusing the formatters in the code for creating JSON properties for 'blockdev-create' we need to create everything except the 'driver' attribute. Use the new helper virJSONValueObjectPrependString to put the driver at the same place so that we don't change any output. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	423b4f6625	qemu: block: Allow skipping non-target related data when formating disk JSON When formatting new qcow2 images we need to provide the backing store string which should not contain any authentication or irrelevant data. Add a flag for qemuBlockStorageSourceGetBackendProps which allows to skip the irrelevant data. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:34 +02:00
Peter Krempa	aa7d73134f	qemu: monitor: Add APIs for 'blockdev-create' The 'blockdev-create' starts a job which creates a storage volume using the given protocol or formats an existing (added) volume with one of the supported storage formats. This patch adds the monitor interaction bits. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	587c0ed12a	qemu: monitor: Implement support for 'JOB_STATUS_CHANGE' event This new event is a superset of the BLOCK_JOB* events and also covers jobs which don't bind to a VM disk. In this patch the monitor part is implemented. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	ed56851f1b	qemu: monitor: Add infrastructure for 'query-jobs' Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	93de886b10	qemu: monitor: Add support for 'job-complete' command This belongs to the new job management API which can manage also non-block based jobs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	1d2e044302	qemu: monitor: Add support for 'job-cancel' command This belongs to the new job management API which can manage also non-block based jobs. Since we'll need to be able to attempt to cancel jobs which potentially were not started (during reconnect) the 'quiet' flag allows to suppress errors reported. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	190e66ea5d	qemu: monitor: Add support for 'job-dismiss' command This belongs to the new job management API for generic jobs. The dismiss command is meant to remove a concluded job after we were able to get the final status. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	e53adccebd	qemu: monitor: Add new fields for 'blockdev-mirror' command Allow using the delayed dismiss of the job so that we can reap the state even if libvirtd was not running when qemu emitted the job completion event. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	ac6c579af3	qemu: monitor: Add new fields for 'block-commit' command Allow using the node name to specify the base and top of the 'commit' operation, allow specifying explicit job name and add support for delayed dismiss of the job so that we can reap the state even if libvirtd was not running when qemu emitted the job completion event. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Peter Krempa	a4f10a6821	qemu: monitor: Add new fields for 'block-stream' command Allow using the node name to specify the base of the 'stream' operation, allow specifying explicit job name and add support for delayed dismiss of the job so that we can reap the state even if libvirtd was not running when qemu emitted the job completion event. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-18 17:59:33 +02:00
Cole Robinson	8911d843f3	conf: Add network xmlopt argument Pass an xmlopt argument through all the needed network conf functions, like is done for domain XML handling. No functional change for now Reviewed-by: Laine Stump <laine@laine.org> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-07-17 17:18:56 -04:00
Ján Tomko	898821cce8	qemu: command: remove qemuDomainFSDriver Having a translation enum full of empty strings seems excessive. Now that the validiation is performed in qemuDomainDeviceDefValidateFS, remove it completely and open-code the two allowed cases. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:37 +02:00
Ján Tomko	e6e7c41f84	qemu: command: use VIR_AUTOCLEAN in qemuBuildFS* Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:37 +02:00
Ján Tomko	acef350080	qemu: introduce qemuDomainDeviceDefValidateFS Move validation of the filesystem device out of qemu_command. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:37 +02:00
Ján Tomko	77570e2600	qemu: command: use VIR_AUTOFREE in qemuBuildFSDevCommandLine Introduce two separate variables instead of reusing the same one for clarity. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:37 +02:00
Ján Tomko	da0f5aab3e	qemu: command: re-introduce qemuBuildFSDevCommandLine This time it only builds one device. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:36 +02:00
Ján Tomko	d8f8f1d172	qemu: command: rename qemuBuildFSDevCommandLine This function iterates over all filesystems, not just -fsdevs. Rename it to free the name for a function that actually builds fsdevs. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:36 +02:00
Ján Tomko	628709245b	qemu: address: remove useless comment Commit `b27375a9b8` omitted one zero. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-16 17:00:36 +02:00
Daniel P. Berrangé	cb1938eb58	all: don't wait for driver lock during startup When the drivers acquire their pidfile lock we don't want to wait if the lock is already held. We need the driver to immediately report error, causing the daemon to exit. Reviewed-by: Erik Skultety <eskultet@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-15 13:36:45 +01:00
Michal Privoznik	7711a7346a	qemu: Relax os.loader->type check when validating domain When validating a domain among all the checks there are two that concern VIR_DOMAIN_LOADER_TYPE_PFLASH specifically. The first check ensures that on x86 ACPI is enabled when UEFI is requested, the second ensures that UEFI is used when ACPI is requested on aarch64. However, check for UEFI is done by plain comparison of def->os.loader->type which is insufficient because we have def->os.firmware too. NB, this wouldn't be a problem for active domain, because on startup process def->os.loader->type gets filled by qemuFirmwareEnableFeatures(), but that's not the case for inactive domains. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1729604 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-07-15 13:24:09 +02:00
Peter Krempa	35e3069ce4	qemu: block: Split up qemuBlockStorageSourceAttachApply Split up the addition of a storage source into the following sub-steps: 1) storage access dependencies (TLS transport, persistent reservation) 2) storage acccess node (file/gluster/nbd...) 3) format driver dependencies (encryption secret) 4) format driver node (qcow2, raw, ...) The functions split out will be later reused when implementing support for 'blockdev-create' as we'll need the dependencies plugged in first, then blockdev-create will be called and after that successfully finishes blockdev-add will be added. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-15 10:26:23 +02:00
Peter Krempa	16ca234b56	qemu: Refactor variables for extracting flags in qemuDomainBlockCopyCommon Add separate booleans for extracting VIR_DOMAIN_BLOCK_COPY_REUSE_EXT and VIR_DOMAIN_BLOCK_COPY_SHALLOW from '@flags' and also change 'reuse' into 'existing'. qemuMonitorDriveMirror requires the unmodified state of the flags to pass to qemu and also we use the value a few times internally. Extract it separately now. The 'reuse' flag did not indicate reusing of the file as much as the fact that the storage is existing and thus should not be created, so modify the name to reflect this. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-15 10:26:23 +02:00
Peter Krempa	0db5912617	qemu: blockjob: Don't emit traditional disk events for jobs without disk With -blockdev it will be possible that a block job loses the disk that was used to start it to a guest-initiated hot-unplug. Don't emit the block job events in that case as we can't report the top level source or disk target for an unplugged (and potentially replugged with different source) disk. Eventually when we add machinery for tracking jobs globally for a VM the event will be reinstated via the domain job event. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-15 10:26:23 +02:00
Peter Krempa	3149e00ab1	qemu: blockjob: Don't reset state when entering sync blockjob job->newstate is now used internally all the time so there's no need to clear it as it already has correct value. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-07-15 10:26:23 +02:00
Jonathon Jongsma	e579f5300b	qemu: add 'bochs' video display type Update schema and configuration to allow specifying new video type of 'bochs'. Add implementation and tests for qemu. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-07-15 10:21:21 +02:00
Jonathon Jongsma	086c19d699	qemu: Add bochs-display capability Check whether qemu supports the bochs-display device and set a capability. Update tests. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-07-15 10:21:21 +02:00
Jonathon Jongsma	062b4a4cd7	qemu: minor refactor of video device string handling In preparation for adding the bochs display device, refactor the logic so that each branch handles a single device type and checks its parameters within that branch. In this case VGA and VMVGA are still grouped into the same branch since they share device-specific parameter names. Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-07-12 16:54:32 +02:00
Ján Tomko	173a191cfe	qemu: stop formatting json='1' in status XML For quite some time now it is impossible to connect to a domain using a HMP monitor, so there is no point in formatting it in the status XML. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-11 16:19:12 +02:00
Daniel P. Berrangé	1af03e2714	qemu: acquire a pidfile in the driver root directory When we allow multiple instances of the driver for the same user account, using a separate root directory, we need to ensure mutual exclusion. Use a pidfile to guarantee this. In privileged libvirtd this ends up locking /var/run/libvirt/qemu/driver.pid In unprivileged libvirtd this ends up locking /run/user/$UID/libvirt/qemu/run/driver.pid NB, the latter can vary depending on $XDG_RUNTIME_DIR Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-11 12:46:20 +01:00
Eric Blake	95f8e3237e	snapshot: Add VIR_DOMAIN_SNAPSHOT_CREATE_VALIDATE flag We've been doing a terrible job of performing XML validation in our various API that parse XML with a corresponding schema (we started with domains back in commit `dd69a14f`, v1.2.12, but didn't catch all domain-related APIs, didn't document the use of the flag, and didn't cover other XML). New APIs (like checkpoints) should do the validation unconditionally, but it doesn't hurt to continue retrofitting existing APIs to at least allow the option. While there are many APIs that could be improved, this patch focuses on wiring up a new snapshot XML creation flag through all the hypervisors that support snapshots, as well as exposing it in 'virsh snapshot-create'. For 'virsh snapshot-create-as', we blindly set the flag without a command-line option, since the XML we create from the command line should generally always comply (note that validation might cause failures where it used to succeed, such as if we tighten the RNG to reject a name of '../\n'); but blindly passing the flag means we also have to add in fallback code to disable validation if the server is too old to understand the flag. Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-07-10 17:34:58 -05:00
Michal Privoznik	881686d4b1	qemu: Validate disk against domain def on coldplug https://bugzilla.redhat.com/show_bug.cgi?id=1692296#c7 This is a counterpart for `ddc72f9902` and implements the same check for coldplug. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Erik Skultety <eskultet@redhat.com>	2019-07-08 14:26:20 +02:00
Ilias Stamatis	13a7d75835	qemu: Remove a redundant function call from qemuDomainGetPerfEvents Calling virDomainObjUpdateModificationImpact directly inside the function body is redundant, since the same function call is embedded into virDomainObjGetOneDef. Signed-off-by: Ilias Stamatis <stamatis.iliass@gmail.com> Reviewed-by: Erik Skultety <eskultet@redhat.com>	2019-07-03 09:57:30 +02:00
Michal Privoznik	9f95552d17	qemu: De-duplicate some path definitions There are some paths (e.g. /dev/vfio/vfio or /dev/mapper/control) which are defined in qemu_domain.c and then in qemu_cgroup.c again. This is suboptimal. Let's move paths into qemu_domain.h and drop duplicate definitions. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-07-03 09:33:45 +02:00
Michal Privoznik	8695793d72	Revert "qemu: Temporary disable owner remembering" This reverts commit `fc3990c7e6`. Now that all the reported bugs are fixed let's turn the feature back on. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Cole Robinson <crobinso@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-03 08:36:04 +02:00
Michal Privoznik	3973d4dff1	qemu: Move image security metadata on snapshot activity Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Cole Robinson <crobinso@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-03 08:36:04 +02:00
Michal Privoznik	706e68237f	qemu_security: Implement qemuSecurityMoveImageMetadata Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Cole Robinson <crobinso@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-07-03 08:36:04 +02:00
Daniel P. Berrangé	464a41bc0d	qemu: delete methods which are no longer supported The public API entry points will report VIR_ERR_NO_SUPPORT to the caller when a driver does not provide an implementation of a particular method. When deleting methods, leaving the driver API entry point explicitly set to NULL with an version range comment, allows the hvsupport.html page to document when the AP was removed. Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-06-27 14:41:48 +01:00
Peter Krempa	773f923e74	qemu: blockjob: Don't leak 'cfg' from qemuBlockJobEventProcessLegacy Since `c257352797` a reference of 'cfg' would be leaked if the function does not need to process anything. Fix it by using VIR_AUTOUNREF. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-27 13:09:54 +02:00
Jie Wang	cf146eb042	qemu: distinguish pr disk before qemuHotplugRemoveManagedPR when a disk without PR perform attach or detach operation, need not call qemuHotplugRemoveManagedPR, otherwise, it will print err log about PR, let us fix it. Signed-off-by: Jie Wang <wangjie88@huawei.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-06-26 17:10:52 +02:00
Peter Krempa	7d5b283065	qemu: Supply correct default type for 'dir' based VIR_STORAGE_TYPE_VOLUME Our code would skip adding the default type in this cases, but since we know that the only reasonable option here is 'fat' we can add it while starting the VM. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 12:28:31 +02:00
Peter Krempa	99917ade0a	qemu: domain: Allow 'VIR_STORAGE_TYPE_VOLUME' disks with 'fat' format The storage volume may in fact convert into a directory when starting the VM so that it may be actually possible to use it. This is a regression caused by `c9b27af32d` as moving the check to validation time without adjustment causes problems as the volumes are not translated yet. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 12:28:31 +02:00
Peter Krempa	76e3de37bf	qemu: command: Use 'actualType' when deciding whether to use disk format qemuBuildDriveSourceStr omits the disk format string when we are emulating a 'fat' filesystem from a directory. The logic should decide based on the 'actualType' as a disk type=pool may be converted to a directory. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 12:28:31 +02:00
Marc-André Lureau	fe09cd72aa	qemu_firmware: only set nfeatures on success The field is set just before returning on success. Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 09:24:09 +02:00
Marc-André Lureau	6cff855c73	tpm: minor argument comment fix Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 09:24:09 +02:00
Peter Krempa	04c7fc0809	qemu: hotplug: Remove rest of source backend if hotplug fails When changing media using blockdev-add we need to remove the leftovers if we didn't succeed plugging in the full chain or closing the tray. Otherwise the data structures will be freed and thus the backing chain members will never be unplugged. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	9237a81c13	qemu: hotplug: Use storage chain helpers in qemuDomainChangeMediaBlockdev As this conversion removes the last use of qemuHotplugDiskSource* functions we can remove all of them now. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	513faf6ccc	qemu: hotplug: Use storage chain helpers in qemuDomainRemoveDiskDevice Use the new helpers for removing the backing chain in case when -blockdev is used. For -drive this function has a local implementation. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	407fd434bc	qemu: hotplug: Use storage chain helpers in qemuDomainAttachDiskGeneric Replace the use of qemuHotplugDiskSourceAttach* helpers with qemuBuildStorageSourceChainAttachPrepare(Blockdev\|Drive). Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	c64010d3e8	qemu: command: get rid of 'cleanup' in qemuBuildDiskSourceCommandLine Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	29e4368382	qemu: command: Use VIR_AUTO infrastructure in qemuBuildDiskSourceCommandLine Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	5fdb20d793	qemu: command: Use storage chain helpers in commandline generator Replace the open-coded local implementation with qemuBuildStorageSourceChainAttachPrepare(Drive\|Blockdev). Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	042c95bd19	qemu: Introduce new set of helpers for attaching and detaching storage chains These are meant to replace the ad-hoc helpers qemuHotplugDiskSourceAttach... and the open-coded version in qemu_command.c for use in command line generation. The functions for preparing for attach of chains unfortunately need to be in qemu_command.c as they use function defined by that file and inclusion hierarchy. In this patch new functions are introduced and subsequent patches then refactor individual parts to use them. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	2523999717	qemu: hotplug: Handle copy-on-read filter separate from rest of backing chain We use only one copy-on-read filter per disk, so we should handle it separately from the chain. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:37 +02:00
Peter Krempa	fc341eedea	qemu: block: Move and rename qemuHotplugRemoveStorageSourcePrepareData Move it to qemu_block.c and call it qemuBlockStorageSourceDetachPrepare. It will be reused in other parts as well. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:59:36 +02:00
Peter Krempa	6f532d7ffc	qemu: Use VIR_ERR_DEPRECATED in QemuAttach and DomainXMLFromNative stubs We've deprecated qemuConnectDomainXMLFromNative qemuDomainQemuAttach. Switch the error code from VIR_ERR_OPERATION_UNSUPPORTED to the new VIR_ERR_DEPRECATED. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-26 08:37:01 +02:00
John Ferlan	a190f86729	qemu: Adjust ATTRIBUTE_NONNULL Commit `7bf679ae` removed the @json argument from the qemuMonitorOpen prototype; however, it did not update the ATTRIBUTE_NONNULL value which causes a build failure for when checking is enabled such as when lv_cv_static_analysis is enabled. Signed-off-by: John Ferlan <jferlan@redhat.com>	2019-06-21 15:35:51 -04:00
Peter Krempa	2cb86fc260	qemu: Implement support for 'capability_filters' config option Filter out the given capabilities and set domain taint if we've done so. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	30ce8f3163	qemu: conf: Add debug option to allow disabling qemu capabilities In cases when e.g. a new feature breaks upstream behaviour it's useful to allow users to disable the new feature to verify the regression and possibly use it as a workaround until a fix is available. The new qemu.conf option named "capability_filters" allows to remove qemu capabilities from the detected bitmap. This patch introduces the configuration infrastructure to parse the option and pass it around. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	a7d3599a4e	qemu: Remove unused var 'corestr' from virQEMUDriverConfigLoadFile Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	3616ec3927	qemu: domain: Add support for modifying qemu capability list via qemu namespace For testing purposes it's sometimes desired to be able to control the presence of capabilities of qemu. This adds the possibility to do this via the qemu namespace. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	c6da5913d9	qemu: Add support for controling qemu capabilities via the qemu XML namespace Similarly how we allow adding arbitrary command line arguments and environment variables this patch introduces the ability to control libvirt's perception of the qemu process by tweaking the capability bits for testing purposes. The idea is to allow developers and users either test a new feature by enabling it early or disabling it to see whether it introduced regressions. This feature is not meant for production use though, so users should handle it with care. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	f1ded974c3	qemu: domain: Split out commandline namespace data formatting Separate it from qemuDomainDefNamespaceFormatXML. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	8b0a389d2b	qemu: Refactor qemuDomainDefNamespaceParse Rename 'cmd' to 'nsdef' and improve the control flow. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	a858b77754	qemu: Extract parsing of qemu namespace env vars into separate function Simplify the main function by splitting out how we parse the extra passthrough environment variables. Note that the validation function checks that the first letter must be a character or underscore which makes the check whether the name is redundant. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	3f76419a05	qemu: Extract parsing of qemu namespace arguments into separate function Simplify the main function by splitting out how we parse the extra passthrough commandline arguments. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	ad4b08fa50	qemu: domain: Use virStringListFreeCount in qemuDomainXmlNsDefFree Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	a967b2f0bd	qemu: Move qemuDomainXmlNsDef(Free) from qemu_conf.(ch) qemu_conf.c deals with the configuration file. Better fit for the structure and freeing function will be qemu_domain.c where the rest of the namespace parsing/formatting stuff resides. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	99759126f7	qemu: Rename qemuDomainCmdlineDefPtr to qemuDomainXmlNsDefPtr The data injected via the namespace may contain also other things than commandline passthrough definitions. Rename it to make it more universal. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-21 15:24:06 +02:00
Peter Krempa	d9536f5cff	qemu: process: Report better error when virtlogd connection fails When connecting to virtlogd fails e.g. due to wrong libvirtd selinux process label we'd report an utterly useless error message: $ virsh start upstream error: Failed to start domain upstream error: Cannot recv data: Connection reset by peer Use virLastErrorPrefixMessage in the correct place to give a better sense of what's going on: $ virsh start upstream error: Failed to start domain upstream error: can't connect to virtlogd: Cannot recv data: Connection reset by peer Signed-off-by: Peter Krempa <pkrempa@redhat.com> ACKed-by: Michal Privoznik <mprivozn@redhat.com>	2019-06-20 17:10:24 +02:00
Peter Krempa	d79ec3f33b	qemu: driver: Fix off-by-one in qemuDomainSnapshotDiskDataCollect Commit `f34397e51c` introduced a crash-inducing problem when collecting disk snapshot data, where the array would be filled starting from the second element. The code then dereferenced the first one. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 16:09:58 +02:00
Peter Krempa	2348c00f10	qemu: Remove qemuMonitorTextSetCPU It's not used any more. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 15:59:19 +02:00
Peter Krempa	d828b744ac	qemu: monitor: Remove text monitor support for cpu hot(un)plug The "cpu-add" command is supported in all supported qemu versions and cpu unplug did not work at all until the new cpu unplug approach (using device_add/del) was implemented. Remove the support for falling back to the text monitor. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 15:59:19 +02:00
Jiri Denemark	2674d00ed4	qemu: Drop MSR features from host-model with old QEMU With QEMU versions which lack "unavailable-features" we use CPUID based detection of features which were enabled or disabled once QEMU starts. Thus using MSR features with host-model would result in all of them being marked as disabled in the active domain definition even though QEMU did not actually disable them. Let's make sure we add MSR features to host-model only when "unavailable-features" property is supported by QEMU. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 14:02:36 +02:00
Jiri Denemark	8eb4a89f5f	qemu: Forbid MSR features with old QEMU Without "unavailable-features" CPU property we cannot properly detect whether a specific MSR feature we asked for (either explicitly or implicitly via a CPU model) was disabled by QEMU for some reason. Because this could break migration, snapshots, and save/restore operaions, it's better to just forbid any use of MSR features with QEMU which lacks "unavailable-features" CPU property. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 14:02:36 +02:00
Ján Tomko	aed6a032ce	api: disallow virDomainSaveImageGetXMLDesc on read-only connections The virDomainSaveImageGetXMLDesc API is taking a path parameter, which can point to any path on the system. This file will then be read and parsed by libvirtd running with root privileges. Forbid it on read-only connections. Fixes: CVE-2019-10161 Reported-by: Matthias Gerstner <mgerstner@suse.de> Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-06-20 13:50:56 +02:00
Ján Tomko	63427110b6	qemu: monitor: s/ret/rc/ in UpdateVideoSize functions Use 'rc' to temporarily store the subfunction return values, instead of ret. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	8eacdff4c8	qemu: monitor: use VIR_AUTOFREE in qemuMonitor*VideoSize Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	86d648f2c9	qemu: monitor: remove the json field Now that it is no longer used, remove it. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	4d5da03ce4	qemu: monitor: remove mon->json checks Remove all the mon->json checks in qemuMonitor functions. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	6799b52795	qemu: monitor: assume JSON in QEMU_CHECK_MONITOR macro In preparation to removing the json field from qemuMonitor, stop checking for it in QEMU_CHECK_MONITOR. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	7bf679aec6	qemu: remove json argument from qemuMonitorOpen Always assume JSON monitor was requested, since all the callers pass true anyway. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	466764346d	qemu: domain: remove monJSON field If we have a monitor, it is a JSON monitor. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Ján Tomko	011f4eb124	qemu: assume monJSON is always true Now that we no longer support the HMP monitor, remove some dead code. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 13:47:41 +02:00
Andrea Bolognani	54964f563d	qemu: Format spapr-vio addresses as 32-bit No reason not to be consistent with the user-visible value. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-06-20 12:50:05 +02:00
Andrea Bolognani	89afb9f594	qemu: Validate spapr-vio addresses According to sPAPR, addresses are 32-bit rather than 64-bit. Update qemuDomainDeviceDefValidateAddress() accordingly. https://bugzilla.redhat.com/show_bug.cgi?id=1598657 Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-06-20 12:49:59 +02:00
Andrea Bolognani	ad9b36efcd	qemu: Rework qemuDomainDeviceDefValidateAddress() Introduce a switch() statement and prepare for validating more address types than just PCI. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-06-20 12:49:58 +02:00
Ján Tomko	4d497566e6	qemu: also delete qemuProcessAttach Now that the virDomainQemuAttach API returns an error, we can remove the unused qemuProcessAttach function as well, deleting the only user that possibly could have requested to open a non-JSON monitor. Signed-off-by: Ján Tomko <jtomko@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-20 12:47:10 +02:00
Peter Krempa	e8b505c956	qemu: Move qemuParseKeywords(Free) to the monitor code The only user is now in qemu_monitor_json.c to re-parse the command line format into keyvalue pairs for use in QMP command construction. Move and rename the functions. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-20 12:15:05 +02:00
Peter Krempa	bd843409a4	qemu: Move QEMU_QXL_VGAMEM_DEFAULT macro qemu_domain.c is now the only place that uses it, so we can move it from qemu_parse_command.h Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-20 12:15:05 +02:00
Peter Krempa	613eeebb4b	qemu: parse: Drop unused qemu command line parsing infrastructure It's now unused and utterly obsolete. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-20 12:15:05 +02:00
Peter Krempa	5cc402a9b4	qemu: driver: Remove support for native->XML conversion This code is really neglected and does not at all work reliably. It can't even be used for converting our own commandline back. Since this was mostly useful for aiding migration from manually run qemu to libvirt and will not work for this puspose in many cases it's not worth having. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-20 12:15:05 +02:00
Peter Krempa	953b88fc88	qemu: parse: Drop qemuParseCommandLinePid and friends Now that we no longer support attaching to a live QEMU process not managed by libvirt we can drop the backend functions as well. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-20 12:15:05 +02:00
Peter Krempa	215d9393bb	qemu: driver: Drop support for qemu-attach Attaching to modern qemu will not work with all this code and attempting to ressurect it would be mostly pointless. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-20 12:15:05 +02:00
Michal Privoznik	7979066b69	qemuProcessLaunch: Return earlier if spawning qemu failed If spawning qemu fails then we report an error and proceed to writing status XML onto the disk. This is unnecessary as we are sure that the domain is not running. At the same time, if virPidFileReadPath() fails it returns -errno. Use it in the error message. It may explain what went wrong. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 10:29:54 +02:00
Peter Krempa	7684e54ee9	qemu: qapi: Implement support for 'features' Starting from version 4.1 qemu allows reporting 'features' for a given QAPI type object. This allows reporting support of fixes and additions which are otherwise invisible in the QAPI schema. Implement a possibility to query 'features' in the QAPI query strings. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Jiri Denemark <jdenemar@redhat.com>	2019-06-20 09:20:04 +02:00
Jiri Denemark	63acb7bfd5	qemu_process: Prefer generic qemuMonitorGetGuestCPU When updating guest CPU definition according to the vCPU actually created by QEMU, we want to use the generic qemuMonitorGetGuestCPU to get both CPUID and MSR features. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	cc6d6b3cb9	qemu: Introduce generic qemuMonitorGetGuestCPU Unlike the old version (which is now called qemuMonitorGetGuestCPUx86), this monitor API checks for individual features by their names rather than processing CPUID bits. Thus we can get the list of enabled and disabled features for both CPUID and MSR features. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	430023e5ee	qemu: Add type filter to qemuMonitorJSONParsePropsList The function converts a list of QOM properties into a NULL-terminated array of property names. The new type parameter may be used to limit the result to properties of a specific type. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	df73078c61	cpu: Introduce virCPUDataAddFeature This is a generic replacement for the former virCPUx86DataAddFeature, which worked on the generic virCPUDataPtr anyway. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	055f8f6bb9	qemu: Make qemuMonitorGetGuestCPU usable on x86 only It was never implemented or used for anything else anyway. Mainly because it uses CPUID features bits. The function is renamed as qemuMonitorGetGuestCPUx86 to make this explicit. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	a3f2c802d2	qemu: Don't use full CPU model expansion We used type=full expansion on the result of previous type=static expansion to get all possible spellings of CPU features. Since we can now translate the QEMU's canonical names to our names, we can drop this magic and do only type=static CPU model expansion. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	ec232c5ddc	qemu: Translate feature names from query-cpu-model-expansion By default query-cpu-model-expansion only reports canonical names of all CPU features. We do some magic and call the command twice to get all possible spellings of the features, but being able to consume canonical names will allow us to drop this magic. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	5030a7450b	qemu_command: Use canonical names of CPU features When building QEMU command line, we should use the preferred spelling of each CPU feature without relying on compatibility aliases (which may be removed at some point). The "unavailable-features" CPU property is used as a witness for the correct names of the features in our translation table. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:39 +02:00
Jiri Denemark	6f6401fbae	qemu: Probe host CPU after capabilities The way we call query-cpu-model-expansion will rely on some capabilities bits. Let's make sure all capabilities are set before probing host CPU. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:38 +02:00
Jiri Denemark	0d254bce4e	qemu: Probe for "unavailable-features" CPU property It is similar to "filtered-features" property, which reports CPUID bits corresponding to disabled features, but more general. The "unavailable-features" property supports both CPUID and MSR features by listing their names. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:38 +02:00
Jiri Denemark	2a4c232106	qemu: Probe for max-x86_64-cpu type We will use it to check whether QEMU supports a specific CPU property. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:38 +02:00
Jiri Denemark	61ee757e20	qemu: Add APIs for translating CPU features So far we always used libvirt's name of each CPU feature relying on backward compatible aliases in QEMU. The new translation table can be used whenever QEMU mandates or prefers canonical feature names. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:38 +02:00
Jiri Denemark	e1ba407396	qemu_command: Use consistent syntax for CPU features Normal CPU features use modern -cpu ...,feature=on\|off syntax when available, but kvm features kept using the old +feature or -feature. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:38 +02:00
Jiri Denemark	0b763774a5	qemu: Filter CPU features in active XML Properly filter features which should not be passed to QEMU because they were never supported by QEMU or they did nothing and QEMU dropped them. Currently they are just silently ignored by the command line generator. Let's make this process more visible and clean by dropping the features from the domain's active definition in qemuProcessUpdateGuestCPU. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:37 +02:00
Jiri Denemark	955fd6e7a2	qemu_process: Drop cleanup label from qemuProcessUpdateGuestCPU Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:37 +02:00
Jiri Denemark	b12865260a	qemu: Drop qemuFeatureNoEffect We already have virQEMUCapsCPUFilterFeatures for filtering features which QEMU does not know about. Let's move osxsave and ospke from qemuFeatureNoEffect there. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-20 00:22:37 +02:00
Jonathon Jongsma	5dad4b5d93	src/qemu: use #pragma once in headers Signed-off-by: Jonathon Jongsma <jjongsma@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-06-19 17:12:30 +02:00
Ján Tomko	c0dc0e8e23	qemu: delete unused QEMUD_CPUMASK_LEN macro Unused as of: commit `f136b83139` qemu: Rework setting process affinity Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-06-19 16:48:44 +02:00
Daniel P. Berrangé	f02e21cb33	network: remove the virDomainNetBandwidthChangeAllowed callback The current qemu driver code for changing bandwidth on a NIC first asks the network driver if the change is supported, then changes the bandwidth on the VIF, and then tells the network driver to update the bandwidth on the bridge. This is potentially racing if a parallel API call causes the network driver to allocate bandwidth on the bridge between the check and the update phases. Change the code to just try to apply the network bridge update immediately and rollback at the end if something failed. Reviewed-by: Laine Stump <laine@laine.org> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-06-17 15:19:54 +01:00
Jie Wang	7a232286b9	qemu: Try harder to remove pr-helper object and kill pr-helper process If libvirt receives DISCONNECTED event and prDaemonRunning is set to false, and qemuDomainRemoveDiskDevice() is performing in the meantime, then qemuDomainRemoveDiskDevice() will fail to remove pr-helper object because prDaemonRunning is false. But removing that check from qemuHotplugRemoveManagedPR() is not enough, because after removing the object through monitor the qemuProcessKillManagedPRDaemon() is called which contains the same check. Thus the pr-helper process might be left behind. Signed-off-by: Jie Wang <wangjie88@huawei.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2019-06-14 09:51:10 +02:00
Peter Krempa	e6635c626a	qemu: domain: Log some useful data in qemuDomainStorageSourceAccessModify Log the flags passed to the function in a exploded state so that it's easily visible what's happening to the image. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-13 09:43:02 +02:00
Peter Krempa	56c6893ff5	qemu: Use proper block job name when reconnecting to VM The hash table returned by qemuMonitorGetAllBlockJobInfo is organized by the frontend name (which skipps the 'drive-' prefix). While our code properly matches the jobs to the disk, qemu needs the full job name including the 'drive-' prefix to be able to identify jobs. Fix this by adding an argument to qemuMonitorGetAllBlockJobInfo which does not modify the job name before filling the hash. This fixes a regression where users would not be able to cancel/pivot block jobs after restarting libvirtd while a blockjob is running. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-12 09:40:02 +02:00
Peter Krempa	4c4953fb37	qemu: domain: Allow forcing images to read-write in qemuDomainStorageSourceAccessAllow In commit `76b9aba2ba` I refactored how the function treats the readonly flag which introduced a bug when we'd not allow to force read-write state for an image. This created problems with blockjobs where we need to temporarily force images to have read-write permissions. Rename QEMU_DOMAIN_STORAGE_SOURCE_ACCESS_READ_ONLY to QEMU_DOMAIN_STORAGE_SOURCE_ACCESS_FORCE_READ_ONLY and also introduce a complement QEMU_DOMAIN_STORAGE_SOURCE_ACCESS_FORCE_READ_WRITE which will allow to force write access. https://bugzilla.redhat.com/show_bug.cgi?id=1717768 Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-12 09:40:02 +02:00
Peter Krempa	9961e7799a	qemu: domain: Fix logic bug in qemuDomainStorageSourceAccessAllow In commit `76b9aba2ba` I tried to refactor qemuDomainStorageSourceAccessAllow but used wrong operators for adding bitwise flags. This way the flags would result in 0 if any of them would be applied. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-12 09:40:02 +02:00
Eric Blake	fbb5271c78	backup: Add new parameters to qemu monitor nbd-server-add The upcoming virDomainBackup() API needs to take advantage of the ability to expose a bitmap as part of nbd-server-add for a pull-mode backup (this is the recently-added QEMU_CAPS_NBD_BITMAP capability). Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-11 21:47:13 -05:00
Eric Blake	ad1c17c8d5	backup: Add new qemu monitor bitmap The upcoming virDomainBackup() API needs to take advantage of various qcow2 bitmap manipulations as the basis to virDomainCheckpoints and incremental backups. Add four functions to expose block-dirty-bitmap-{add,enable,disable,merge} (this is the recently-added QEMU_CAPS_BITMAP_MERGE capability). Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-11 21:47:10 -05:00
Eric Blake	6abda7a445	backup: Add two new qemu capabilities Add two capabilities for testing features required for the upcoming virDomainBackupBegin: use block-dirty-bitmap-merge as the generic witness of bitmap support needed for checkpoints (since all of the bitmap management functionalities were finalized in the same qemu 4.0 release), and the bitmap parameter to nbd-server-add for pull-mode backup support. Even though both capabilities are likely to be present or absent together (that is, it is unlikely to encounter a qemu that backports only one of the two), it still makes sense to keep two capabilities as the two uses are orthogonal (full backups don't require checkpoints, push mode backups don't require NBD bitmap support, and checkpoints can be used for more than just incremental backups). Existing code is not affected by the new capabilities. Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-11 21:42:57 -05:00
Eric Blake	73bf0a9c28	backup: Prepare for Unix sockets in QMP nbd-server-start Migration always uses a TCP socket for NBD servers, because we don't support same-host migration. But upcoming pull-mode incremental backup needs to also support a Unix socket, for retrieving the backup from the same host. Support this by plumbing virStorageNetHostDef through the monitor calls, since that is a nice reusable struct that can track both TCP and Unix sockets. Update qemumonitorjsontest to verify both forms of the QMP command. Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-06-11 21:41:42 -05:00
Peter Krempa	143c2de113	qemu: snapshot: Remove unnecessary 'do_transaction' logic in qemuDomainSnapshotCreateDiskActive Now that we never get to the actual snapshot code if there's nothing to do we can remove the variable and surrounding logic. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:09 +02:00
Peter Krempa	0491128f2a	qemu: snapshot: Return early if there's nothing to snapshot Skip actual snapshot creation code if we have 0 disks to snapshot. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:09 +02:00
Peter Krempa	0325d42668	qemu: snapshot: Unify 'cleanup' and 'error' in qemuDomainSnapshotCreateDiskActive All cases taking the 'cleanup' path can take the original 'error' path. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:09 +02:00
Peter Krempa	46da762669	qemu: snapshot: Don't overload 'ret' in qemuDomainSnapshotCreateDiskActive Introduce 'rc' for collecting state from monitor commands so that we can initialize 'ret' to -1. This also fixes few cases which could return 0 from the function despite an error condition. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:09 +02:00
Peter Krempa	a7087c929e	qemu: snapshot: Move all cleanup of snapshot disk data to qemuDomainSnapshotDiskDataFree qemuDomainSnapshotDiskDataFree also removes the resources associated with the disk data. Move the unlinking of the just-created file so that we can unify the cleanup paths. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:09 +02:00
Peter Krempa	acda184236	qemu: Rename qemuDomainSnapshotDiskDataFree to qemuDomainSnapshotDiskDataCleanup In commit `cbb4d229de` I named the function with 'free' suffix, but at that time it already did some non-freeing tasks. Rename it to make it obvious that it's not just memory managemet. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	f34397e51c	qemu: snapshot: Densely pack data in qemuDomainSnapshotDiskDataCollect The function skips disks which are not selected for snapshot. Rather than creating a sparse array and check whether the given field is filled compress the entries. Note that this does not allocate a smaller array, but the memory allocation is short-lived. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	9678503088	qemu: snapshot: Always save config XML after qemuDomainSnapshotCreateDiskActive If there's an offline config definition save it unconditionally even if it was not modified. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	999e450c26	qemu: snapshot: Always save status and config after qemuDomainSnapshotCreateDiskActive The error path is unlikely thus saving the status XML even if we didn't modify it does not add much burden. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	7b8c319d9c	qemu: snapshot: Remove unused cleanup section in qemuDomainSnapshotCreateSingleDiskActive After getting rid of pre-transaction qemu support the cleanup section is unused. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	7d67319cfb	qemu: Use VIR_AUTO* in qemuDomainSnapshotCreateActiveExternal Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	77f71c45ad	qemu: Use virErrorPreserveLast in qemuDomainSnapshotCreateDiskActive Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	0f9a2a665b	qemu: Use VIR_AUTOPTR in qemuDomainSnapshotCreateDiskActive Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Peter Krempa	ef6b88deef	qemu: snapshot: Pass 'cfg' to external snapshot functions The caller has it so there's no point in getting it again. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-10 14:01:08 +02:00
Andrea Bolognani	a84922c09e	qemu: Fix NULL pointer access in qemuProcessInitCpuAffinity() Commit `2f2254c7f4` attempted to fix a memory leak by ensuring cpumapToSet is always a freshly allocated bitmap, but regrettably introduced a NULL pointer access while doing so, because it called virBitmapCopy() without allocating the destination bitmap first. Solve the issue by using virBitmapNewCopy() instead. Reported-by: John Ferlan <jferlan@redhat.com> Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Erik Skultety <eskultet@redhat.com> Reviewed-by: John Ferlan <jferlan@redhat.com>	2019-06-06 16:50:11 +02:00
Andrea Bolognani	de563ebcf9	qemu: Drop cleanup label from qemuProcessInitCpuAffinity() We're using VIR_AUTOPTR() for everything now, plus the cleanup section was not doing anything useful anyway. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-04 15:54:04 +02:00
Andrea Bolognani	2f2254c7f4	qemu: Fix leak in qemuProcessInitCpuAffinity() In two out of three scenarios we are cleaning up properly after ourselves, but commit `5f2212c062` has changed the remaining one in a way that caused it to start leaking cpumapToSet. Refactor the logic so that cpumapToSet is always a freshly allocated bitmap that gets cleaned up automatically thanks to VIR_AUTOPTR(); this also allows us to remove the hostcpumap variable. Reported-by: John Ferlan <jferlan@redhat.com> Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-04 15:53:51 +02:00
Andrea Bolognani	5f2212c062	qemu: Fix qemuProcessInitCpuAffinity() Ever since the feature was introduced with commit `0f8e7ae33a`, it has contained a logic error in that it attempted to use a NUMA node map where a CPU map was expected. Because of that, guests using <numatune> might fail to start: # virsh start guest error: Failed to start domain guest error: cannot set CPU affinity on process 40055: Invalid argument This was particularly easy to trigger on POWER 8 machines, where secondary threads always show up as offline in the host: having <numatune> <memory mode='strict' placement='static' nodeset='1'/> </numatune> in the guest configuration, for example, would result in libvirt trying to set the process affinity so that it would prefer running on CPU 1, but since that's a secondary thread and thus shows up as offline, the operation would fail, and so would starting the guest. Use the newly introduced virNumaNodesetToCPUset() to convert the NUMA node map to a CPU map, which in the example above would be 48,56,64,72,80,88 - a valid input for virProcessSetAffinity(). https://bugzilla.redhat.com/show_bug.cgi?id=1703661 Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-04 09:29:35 +02:00
Jiri Denemark	7da62c91f0	qemu: Check TSC frequency before starting QEMU When migrating a domain with invtsc CPU feature enabled, the TSC frequency of the destination host must match the frequency used when the domain was started on the source host or the destination host has to support TSC scaling. If the frequencies do not match and the destination host does not support TSC scaling, QEMU will fail to set the right TSC frequency when starting vCPUs on the destination and thus migration will fail. However, this is quite late since both host might have spent significant time transferring memory and perhaps even storage data. By adding the check to libvirt we can let migration fail before any data starts to be sent over. If for some reason libvirt is unable to detect the host's TSC frequency or scaling support, we'll just let QEMU try and the migration will either succeed or fail later. Luckily, we mandate TSC frequency to be explicitly set in the domain XML to even allow migration of domains with invtsc. We can just check whether the requested frequency is compatible with the current host before starting QEMU. https://bugzilla.redhat.com/show_bug.cgi?id=1641702 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2019-06-03 18:07:16 +02:00
Jiri Denemark	dd3fc650de	qemu: Make virQEMUCapsProbeHostCPUForEmulator more generic The function is renamed as virQEMUCapsProbeHostCPU and it does not get the list of allowed CPU models from qemuCaps anymore. This is responsibility is moved to the caller. The result is just a very thin wrapper around virCPUGetHost mostly required mocking in tests. The generic function is used in place of a direct call to virCPUGetHost in virQEMUCapsInitHostCPUModel to make sure tests don't accidentally probe host CPU. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2019-06-03 18:07:16 +02:00
Andrea Bolognani	1462881f4e	qemu: Format SMMUv3 IOMMU https://bugzilla.redhat.com/show_bug.cgi?id=1575526 Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:57 +02:00
Andrea Bolognani	b645f0fcb4	qemu: Move capability checks for IOMMU features All current IOMMU features are specific to Intel IOMMU, so understandably we check for the corresponding capabilities inside the Intel-specific switch() branch; however, we want to make sure SMMUv3 IOMMU users get an error if they try to enable any of those features in their guest, and performing the capability checks unconditionally is both the easiest way to achieve that, as well as the one least likely to result in us inadvertently letting users enable some new Intel-specific IOMMU feature for ARM guests later on. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:54 +02:00
Andrea Bolognani	fc660ae315	qemu: Add validation for SMMUv3 IOMMU Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:52 +02:00
Andrea Bolognani	60f4c41377	conf: Parse and format SMMUv3 IOMMU SMMUv3 is an IOMMU implementation for ARM virt guests. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:48 +02:00
Andrea Bolognani	124eb803fc	qemu: Introduce QEMU_CAPS_MACHINE_VIRT_IOMMU This capability can be used to figure out whether the QEMU binary at hand supports the machine type property we need in order to enable SMMUv3 IOMMU support. Unfortunately we can't avoid probing the RISC-V binaries along with the ARM ones, since both architectures have their own 'virt' machine type. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:45 +02:00
Andrea Bolognani	21bb887abc	qemu: Move capability checks inside switch() statements Current capability checks are specific to Intel IOMMU, so we need to move them inside the switch() statement before we can introduce more virDomainIOMMUModel values. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:43 +02:00
Andrea Bolognani	70cdf1b52e	qemu: Move virBuffer inside switch() statement This doesn't make a whole lot of difference now, but once we introduce more virDomainIOMMUModel values the current structure will no longer work. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:41 +02:00
Andrea Bolognani	711f8c3627	qemu: Use VIR_AUTOCLEAN() in qemuBuildIOMMUCommandLine() Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:39 +02:00
Andrea Bolognani	9775f48f84	qemu: Drop 'ret' from qemuBuildIOMMUCommandLine() Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:37 +02:00
Andrea Bolognani	dfa631b55a	qemu: Fix switch() statements for virDomainIOMMUModel Ensure unexpected values are dealt with correctly, that is by invoking virReportEnumRangeError() and immediately returning a negative value to the caller. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-06-03 17:40:24 +02:00
Martin Kletzander	c67a3c0fc3	qemu: Set emulator thread scheduler only after QEMU starts If the scheduler is set before vCPU0 cannot be moved into its cpu,cpuacct cgroup. While it is not yet known whether this is a bug or not, it makes sense for us to do that later as otherwise the scheduler would be inherited by vCPU and I/O Threads even when they do not have any such setting specified. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2019-05-27 16:05:23 +02:00
Michal Privoznik	c46bdad576	qemu: Get default hugepage size only if needed Fixes: `6864d8f740` Hugepages don't work in session mode but when building memory part of command line we query for the default size anyway. This breaks creating domains under session daemon. Query the page size only if it's clear we need hugepages. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Erik Skultety <eskultet@redhat.com>	2019-05-27 14:51:39 +02:00
Andrea Bolognani	435330d084	qemu: Tweak Intel IOMMU command line generation Mostly add comments explaining why there are two capabilites for the same feature and how they interact. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-23 15:19:06 +02:00
Andrea Bolognani	a7a78c273e	qemu: Introduce qemuDomainDeviceDefValidateIOMMU() Device validation should not have to wait until command line generation time. Moving the code to a separate function also allows us to avoid some unnecessary repetition. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-23 15:19:01 +02:00
Peter Krempa	c74b898d4c	qemu: monitor: Use VIR_AUTOPTR in qemuMonitorJSON(Drive/Blockdev)Mirror Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:07 +02:00
Peter Krempa	e90d51c4d0	qemu: monitor: Don't pass full flags to qemuMonitorJSONDriveMirror Split out the 'shallow' and 'reuse' flags as booleans rather than passing in flags and constructing them in irrelevant APIs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	6b155c41e9	qemu: monitor: Don't pass full flags to qemuMonitorJSONBlockdevMirror Split out the 'shallow' flag as a boolean argument rather than passing in flags and constructing them in irrelevant APIs. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	c4043d1d6e	qemu: migration: Don't pass around flags for different API The NBD migration code uses drive/blockdev-mirror internally. In those APIs we pass around flags for the monitor commands which are based on the flags for the virDomainBlockRebase API. Since there's only one flag which changes, pass it around explicitly rather than obscuring it in a bitfield. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	47d610e960	qemu: blockcopy: sanitize permission handling for 'mirror' At the point when we want to modify the permissions for the 'mirror' we know whether it is supposed to have a backing chain or no. Given that mirror->backingStore is populated only when we'd need to touch it ayways we can use qemuDomainStorageSourceChainAccessAllow even in place of qemuDomainStorageSourceAccessAllow used for other cases to simplify the code. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	32ec5fee02	qemu: Simplify allowing access to storage file for block copy One code path open-coded qemuDomainStorageSourceChainAccessAllow badly and also did not integrate with the locking code. Replace the separate calls with qemuDomainStorageSourceChainAccessAllow which does everything internally. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	56fe0d6d29	qemu: Validate backing store of 'mirror' for block copy Since `4e797f1a` we parse backingStore of mirror which will later be used with blockdev. Add some validation for the user passed mirror at the current point to make sure it's not used improperly. Validate that it's not used without blockdev and also that it's not passed when not requesting a shallow copy. Also add a chain terminator for a deep copy since we know the resulting mirror will not have chain. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	83c579d0ae	qemu: Remove unnecessary calls to qemuDomainStorageSourceAccessRevoke Since `3decae00e9` qemuDomainStorageSourceAccessAllow revokes the permissions it granted if it fails halfway, thus we can remove some calls to qemuDomainStorageSourceAccessRevoke which tried to undo this situation. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	8787032c5c	qemu: Remove unecessary error keeping in qemuDomainBlockCopyCommon Since `3decae00e9` qemuDomainStorageSourceAccessRevoke keeps the libvirt error which was set prior to the call around even after the call, thus we don't need to do the same when reverting access in the block copy code. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	e05d211f5b	qemu: Modernize memory cleaning in qemuDomainBlockCommit Use VIR_AUTOFREE and VIR_AUTOUNREF. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	82b3f470c6	qemu: Modernize memory cleaning in qemuDomainBlockPullCommon Use VIR_AUTOFREE and VIR_AUTOUNREF. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:06 +02:00
Peter Krempa	ddafae7a39	qemu: Modernize memory cleaning in qemuDomainBlockCopyCommon Use VIR_AUTOFREE, VIR_AUTOUNREF, and VIR_STEAL_PTR. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:05 +02:00
Peter Krempa	019461facb	qemu: driver: Set mirror state after successful command When aborting or pivoting a block job we record which operation we do for the mirror in the virDomainDiskDef structure. As everything is synchronized by a job it's not necessary to modify the state prior to calling the monitor and resetting the state on failure. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:05 +02:00
Peter Krempa	d41e1aa169	qemu: driver: Don't try to update blockjob status in qemuDomainGetBlockJobInfo All blockjobs get their status updated by events from qemu, so this code no longer makes sense. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:05 +02:00
Peter Krempa	acd71408b2	qemu: blockjob: Fix documentation for 'newstate' of _qemuBlockJobData When used with the new job handler the values will also include some of the non-public values from qemuBlockjobState. Modify the comment to clarify this. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:05 +02:00
Peter Krempa	2234354f9e	qemu: blockjob: Remove 'started' from struct _qemuBlockJobData As of commit `d1a44634ac` this field is unused. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-21 14:22:05 +02:00
Michal Privoznik	da04eab953	Revert "qemu: Do not override config XML in case of snapshot revert" This reverts commit `dfd70ca1eb`. Pushed by a mistake, sorry. There's still some discussion going on upstream. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-05-20 14:19:44 +02:00
Han Han	a699b19f6c	qemu: Add entry for balloon stats stat-htlb-pgalloc and stat-htlb-pgfail Qemu added reporting of virtio balloon new statistics stat-htlb-pgalloc and stat-htlb-pgfail since qemu-3.0 commit b7b12644297. The value of stat-htlb-pgalloc represents the number of successful hugetlb page allocations while stat-htlb-pgfail represents the number of failed ones. Add this statistics reporting to libvirt. To enable this feature for vm, guest kenel >= 4.17 is required because the exporting hugetlb page allocation for virtio balloon is introduced since 6c64fe7f. Signed-off-by: Han Han <hhan@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-05-20 11:18:25 +02:00
Maxiwell S. Garcia	dfd70ca1eb	qemu: Do not override config XML in case of snapshot revert Snapshot create operation saves the live XML and uses it to replace the domain definition in case of revert. But the VM config XML is not saved and the revert operation does not address this issue. This commit prevents the config XML from being overridden by snapshot definition. An active domain stores both current and new definitions. The current definition (vm->def) stores the live XML and the new definition (vm->newDef) stores the config XML. In an inactive domain, only the config XML is persistent, and it's saved in vm->def. The revert operation uses the virDomainObjAssignDef() to set the snapshot definition in vm->newDef, if domain is active, or in vm->def otherwise. But before that, it saves the old value to return to caller. This return is used here to restore the config XML after all snapshot startup process finish. Signed-off-by: Maxiwell S. Garcia <maxiwell@linux.ibm.com>	2019-05-20 09:08:54 +02:00
Michal Privoznik	5cdd5d380b	lib: Avoid double close when passing FDs with virCommandPassFD() If an FD is passed into a child using: virCommandPassFD(cmd, fd, VIR_COMMAND_PASS_FD_CLOSE_PARENT); then the parent should refrain from touching @fd thereafter. This is even documented in virCommandPassFD() comment. The reason is that either at virCommandRun()/virCommandRunAsync() or virCommandFree() time the @fd will be closed. Closing it earlier, e.g. right after virCommandPassFD() call might result in undesired results. Another thread might open a file and receive the same FD which is then unexpectedly closed by virCommandFree() or virCommandRun(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-17 16:01:11 +02:00
Andrea Bolognani	a251095e13	qemu: Only probe available machine types Since we know the full list of machine types supported by the QEMU binary when probing machine type properties, we can save some work (and eventually test suite churn, as more architecture-specific machine types need to be probed) by only probing machines that we know exist. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2019-05-17 14:59:40 +02:00
Andrea Bolognani	d22c6221fc	qemu: Probe canonicalized machine type Now that we have the list of machine types available when probing machine type properties, we can list properties for the canonicalized version of the "pseries" machine type instead of having to go through "spapr-machine", which we know to be the parent type for all "pseries-*-machine" types. By doing this, we'll be able to find even properties that are only available from a certain versioned machine type forward, and can't thus be obtained when looking at the parent type only. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2019-05-17 14:59:34 +02:00
Andrea Bolognani	f3f9d8e376	qemu: Add -machine suffix automatically The QOM type for machine types is the machine type name followed by the -machine suffix. Since this is always the case, we can make virQEMUCapsMachineProps more readable and avoid repetition by not including the suffix there and adding it automatically while processing the data; moreover, when later on we will start figuring out which specific versioned machine type to probe at runtime instead of doing so statically, adding the suffix dynamically will become necessary. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2019-05-17 14:59:31 +02:00
Andrea Bolognani	295a42e19f	qemu: Move call to virQEMUCapsProbeQMPMachineProps() We're going to need information about available machine types when probing machine type properties soon, and that means we have to change the order we call QMP commands. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2019-05-17 14:59:29 +02:00
Andrea Bolognani	4ad8d620cc	qemu: Introduce virQEMUCapsProbeQMPMachineProps() Up until now we've probed machine type properties, along with properties for other types, in virQEMUCapsProbeQMPDevices(), but soon we're going to need some logic that is specific to machine types and as such wouldn't quite fit into that function. Signed-off-by: Andrea Bolognani <abologna@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com>	2019-05-17 14:59:23 +02:00
Peter Krempa	4d8cc5a07a	qemu: blockjob: Fix saving of inactive XML after completed legacy blockjob Commit `c257352797` introduced a logic bug where we will never save the inactive XML after a blockjob as the variable which was determining whether to do so is cleared right before. Thus even if we correctly modify the inactive state it will be rolled back when libvirtd is restarted. Reported-by: Thomas Stein <hello@himbee.re> Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-17 13:25:06 +02:00
Christian Ehrhardt	2900575db8	qemu: do not define known no-op features Qemu dropped cpu features for osxsave and ospke [1][2]. The reason for the instant removal is that those features were never configurable as discussed in [3]. Fortunately the use cases adding those flags in the past are rare, but they exist. One that I identified are e.g. older virt-install when used with --cpu=host-model and there always could be the case of a user adding it to the guest xml. This triggers an issue like: qemu-system-x86_64: can't apply global Broadwell-noTSX-x86_64- cpu.osxsave=on: Property '.osxsave' not found Ensure that this does no more break spawning newer qemu versions by not rendering those features into the qemu command line. Fixes: https://bugs.launchpad.net/fedora/+source/qemu/+bug/1825195 Resolves: https://bugzilla.redhat.com/1644848 [1]: https://git.qemu.org/?p=qemu.git;a=commit;h=f1a2352 [2]: https://git.qemu.org/?p=qemu.git;a=commit;h=9ccb978 [3]: https://www.mail-archive.com/qemu-devel@nongnu.org/msg561877.html Signed-off-by: Christian Ehrhardt <christian.ehrhardt@canonical.com> Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com> Tested-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-05-15 09:32:52 +02:00
Michal Privoznik	b58a6b050e	qemuDomainSnapshotCreateXML: Don't leak parsed snapshot definition This function gets snapshot XML (provided by used) as an argument. It parses it into a local variable @def and then sets some more members (e.g. it creates a copy of live domain XML). Then it proceeds to checking if snapshot XML is valid (e.g. it contains as many disks as currently in the domain). If this fails then the control jumps to endjob label and subsequently return from the function. This is where AUTOFREE function for @def is ran. Well, because the code says to run plain VIR_FREE() we leak some memory because @def is actually an object and therefore it should have been declared as AUTOUNREF. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Erik Skultety <eskultet@redhat.com>	2019-05-14 16:42:59 +02:00
Eric Blake	9dd5bc151c	qemu: Fix regression with undefine --snapshots-metadata In refactoring the snapshot code to prepare for checkpoints, I changed qemuDomainMomentDiscardAll to take a callback that would handle the cleanup of either a snapshot or a checkpoint, but failed to set the callback on one of the two snapshot callers. As a result, 'virsh undefine $dom --snapshots-metadata' crashed on a NULL function dereference. Fixes: `a487890d37` Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1707708 Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-05-10 10:50:16 -05:00
Eric Blake	57387ff54b	snapshot: Make virDomainSnapshotDef a virObject This brings about a couple of benefits: - use of VIR_AUTOUNREF() simplifies several callers - Fixes a todo about virDomainMomentObjList not being polymorphic enough Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-05-09 10:02:53 -05:00
Eric Blake	098043eddd	snapshot: s/current/parent/ as prep for virObject VIR_CLASS_NEW insists that descendents of virObject have 'parent' as the name of their inherited base class member at offset 0. While it would be possible to write a new class-creation macro that takes the actual field name 'current', and rewrite VIR_CLASS_NEW to call the new macro with the hard-coded name 'parent', it seems less confusing if all object code uses similar naming. Thus, this is a mechanical rename in preparation of making virDomainSnapshotDef a descendent of virObject. Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-05-09 09:48:07 -05:00
Eric Blake	36603bc568	snapshot: s/parent/parent_name/ as prep for virObject VIR_CLASS_NEW insists that descendents of virObject have 'parent' as the name of their inherited base class member at offset 0. While it would be possible to write a new class-creation macro that takes the actual field name, and rewrite VIR_CLASS_NEW to call the new macro with the hard-coded name 'parent', so that we could make virDomainMomentDef use a custom name for its base class, it seems less confusing if all object code uses similar naming. Thus, this is a mechanical rename in preparation of making virDomainSnapshotDef a descendent of virObject, when we can no longer use 'parent' for a different purpose than the base class. Signed-off-by: Eric Blake <eblake@redhat.com> Acked-by: Peter Krempa <pkrempa@redhat.com>	2019-05-09 09:43:41 -05:00
Peter Krempa	76b9aba2ba	qemu: Refactor/simplify qemuDomainStorageSourceAccessAllow Use qemuDomainStorageSourceAccessModify with correct flags to do the job. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:36 +02:00
Peter Krempa	b1fe51c4ba	qemu: Mark when modifying access to existing source in qemuDomainStorageSourceAccessModify Some operations e.g. namespace setup are not necessary when modifying access to a file which the VM can already access. Add a flag which allows to skip them. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:36 +02:00
Peter Krempa	f50d1b7f49	qemu: Allow skipping the revoke step in qemuDomainStorageSourceAccessModify In some cases when we need to modify access permissions for a storage source which is already used by the VM we should not revoke all permissions on a failure. Allow this in qemuDomainStorageSourceAccessModify by adding a new flag. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:36 +02:00
Peter Krempa	657216b60d	qemu: Use bools rather than labels in qemuDomainStorageSourceAccessModify Rather than jumping to the correct label use a set of booleans to determine which operation needs to be rolled back. This will allow more flexibility when e.g. rollback after a failed operation will not be necessary/desired. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:36 +02:00
Peter Krempa	3bb1423883	qemu: Allow forcing read-only mode in qemuDomainStorageSourceAccessModify Add a new flag which will set the image as read-only even if the image data allows writing. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:36 +02:00
Peter Krempa	3decae00e9	qemu: Refactor/simplify qemuDomainStorageSourceAccessRevoke Use qemuDomainStorageSourceAccessModify instead of the individual calls. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Peter Krempa	0304fa2fee	qemu: Allow using qemuDomainStorageSourceAccessModify on singe images Add a new flag QEMU_DOMAIN_STORAGE_SOURCE_ACCESS_CHAIN to select whether to work on single image or full chain. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Peter Krempa	6d4136da2a	qemu: Convert boolean flags to enum flags in qemuDomainStorageSourceAccessModify Upcoming patches will add a few more flags. Add an enum to collect them so that we don't end up with multiple bools. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Peter Krempa	0ae504d375	qemu: domain: Rename qemuDomainStorageSourceChainAccessPrepare The function will be able to deal with non-chains too so drop 'Chain' and also change the suffix to 'Modify' as it's used both for setup and teardown. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Peter Krempa	45b9ec5b09	qemu: Split entry points to qemuDomainStorageSourceChainAccessPrepare Introduce qemuDomainStorageSourceChainAccess(Allow\|Revoke) as entry points to qemuDomainStorageSourceChainAccessPrepare for symmetry with the functions for single backing chain elements. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Peter Krempa	3d36d666f8	qemu: Move and rename qemuHotplugPrepareDiskSourceAccess Move it to qemu_domain.c and call it qemuDomainStorageSourceChainAccessPrepare. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Peter Krempa	1a828a578e	qemu: Rename qemuDomainDiskChainElement(Revoke\|Prepare) Use qemuDomainStorageSourceAccess(Allow\|Revoke) instead. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-09 15:55:35 +02:00
Eric Blake	1ec3e39742	conf: Add parameter to virDomainDiskSourceFormat Commits `4bc42986` and `218c81ea` removed virDomainStorageSourceFormat on the grounds that there were no external callers; however, the upcoming backup code wants to output a <target> (push mode) or <scratch> (pull mode) element that is in all other respects identical to a domain's <source> element, where the previous virDomainStorageSourceFormat fit the bill nicely. But rather than reverting the commits, it's easier to just add an additional parameter for the element name to use, and update all callers. Signed-off-by: Eric Blake <eblake@redhat.com>	2019-05-06 18:05:17 -05:00
Peter Krempa	67c2ddf8a6	qemu: qapi: Implement worker for introspecting alternate types Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	b82f2d837a	qemu: qapi: Implement worker for introspecting builtin types Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	8bfb615b4b	qemu: qapi: Implement worker for introspecting enums Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	0ea6cd6209	qemu: qapi: Prepare for extension of virQEMUQAPISchemaPathGet docs Prepare section for boolean queries and make the typed query section more clear. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	9c2f48de68	qemu: qapi: Report schema and user errors for QAPI queries We treated broken schema as failure to look up given query. Treat it as a separate error instead. It is unlikely to happen though. Also prepare for possibility of user errors if query components which can't be queired deeper have following components. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	c3944f8fe7	qemu: qapi: Use declarative approach for meta-type parsers in virQEMUQAPISchemaTraverse Introduce an array of callbacks for given 'meta-type' of the QAPI schema structure rather than using code to select it. This will simplify extension for the other meta-types which are not handled yet. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	0b49f891be	qemu: qapi: Add helpers for virQEMUQAPISchemaTraverseContext Rather than modifying the context struct add a helpers that does this. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	cea3ed3bb1	qemu: qapi: Rename local vars in virQEMUQAPISchemaTraverseObject Now that 'query' is no longer an argument we can reuse it. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	06283debe3	qemu: qapi: Convert arguments of QAPI traversal helpers to a struct Create a context data type for the QAPI path rather than passing an increasing number of arguments. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	115e677a52	qemu: qapi: Optimize out some helper functions virQEMUQAPISchemaTypeFromObject and virQEMUQAPISchemaTypeFromObject can be very easily folded into virQEMUQAPISchemaTraverseObject removing the need for the helpers. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	8af5d6bd7c	qemu: qapi: Separate virQEMUQAPISchemaTraverse into functions by object type Simplify virQEMUQAPISchemaTraverse by separating out the necessary operations for given 'meta-type' into separate functions. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	5ded7590d1	qemu: qapi: Convert virQEMUQAPISchemaTraverse to recursive lookup Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	641c60a17d	qemu: qapi: Modify values returned by virQEMUQAPISchemaPathGet Return 1 if the schema entry was found optionally returning it rather than depending on the returned object. Some callers don't care which schema object belongs to the query, but rather only want to know whether it exists. Additionally this will allow introducing boolean queries for checking if enum values exist. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	9da894dea7	qemu: qapi: Return schema entry via argument in virQEMUQAPISchemaTraverse To allow for boolean query string, let's return the queried schema entry via argument rather than a return value. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	eed544e131	qemu: qapi: Fix return value of impossible case in virQEMUQAPISchemaTraverse The return statement after the infinite loop without a break is there to appease the compiler. Make it return NULL as it would be a failure if control flow reaches that point. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Peter Krempa	432452eb0d	qemu: qapi: Use automatic memory cleanup Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-05-06 09:46:06 +02:00
Michal Privoznik	f0f1e79bf5	qemuConnectOpen: Drop unused @cfg and simplify After `65a372d6e0` the @cfg variable is no longer used. This means we can drop it and therefore drop 'cleanup' label with it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-05-04 23:39:35 +02:00
Michal Privoznik	f308f71d83	qemu.conf: Make nvram list obsolete Now that libvirt has firmware auto selection feature the nvram config knob is more or less obsolete. It still makes sense in cases where distro users are using does not provide FW descriptor files, therefore I'm not removing it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-05-02 15:09:45 +02:00
Michal Privoznik	2a1ae8fba7	lib: Preserve error around virDomainNetReleaseActualDevice() This function is calling public API virNetworkLookupByName() which resets the error. Therefore, if virDomainNetReleaseActualDevice() is used in cleanup path it actually resets the original error that got us jump into 'cleanup' label. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-30 17:00:46 +02:00
Daniel P. Berrangé	04e4307d34	Revert "network: use 'bridge' as actual type instead of 'network'" This caused the live XML to report the 'bridge' type instead of the 'network' type, which is a behavioural regression. It also breaks 'virsh domif-setlink', 'virsh update-device' and 'virsh domiftune' This reverts commit `518026e159`. Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-30 14:42:34 +01:00
Daniel P. Berrangé	e007e8ba3a	Revert "virt drivers: don't handle type=network after resolving actual network type" This reverts commit `2f5e6502e3`. Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-30 14:42:22 +01:00
Jie Wang	5d5e7875cd	qemu_command: fix double_close vhostfd in qemuBuildHostdevCommandLine vhostfd passed to cmd->passfd in virCommandPassFD, virCommandFree will always close cmd->passfd when qemuBuildSCSIVHostHostdevDevStr failed. Signed-off-by: Jie Wang <wangjie88@huawei.com>	2019-04-30 11:10:36 +02:00
Michal Privoznik	572c50849c	qemu: Check for user alias collisions in coldplug https://bugzilla.redhat.com/show_bug.cgi?id=1697676 If an user tries to attach a device with colliding user alias then we attach it happily and thus leave domain unable to start. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-26 15:47:40 +02:00
Michal Privoznik	57eb2936f3	qemu: On attach to live XML check for user alias collision only live XML When attaching a device to live XML we don't care (well, shouldn't care) that there's already a device in inactive XML that has the same user alias. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-26 15:47:36 +02:00
Michal Privoznik	c4c44b535b	qemuDomainAttachDeviceLiveAndConfig: Don't overwrite @ret If we're attaching a device to both inactive and live XML then @ret is overwritten which may result in incorrect return value. For instance, if attaching to inactive XML succeeds, @ret is assigned value of zero and control proceeds to attaching the device to live XML. Here, if say virDomainDeviceValidateAliasForHotplug() fails the control jumps over to 'cleanup' label and zero is returned indicating success. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-26 15:16:17 +02:00
Michal Privoznik	08193fdbf5	src: Check for virDomainDiskInsert() retval properly Our coding style specifies that only negative values are considered as error. Check for return value of virDomainDiskInsert() properly, following the style. Not that the function can now return anything other than 0 or -1, but it just triggers my OCD. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-26 15:14:17 +02:00
Daniel Henrique Barboza	cc1d1dbbd5	qemuDomainPMSuspendForDuration: check for wake-up support If the current QEMU guest can't wake up from suspend properly, and we are able to determine that, avoid suspending the guest at all. To be able to determine this support, QEMU needs to implement the 'query-current-machine' QMP call. This is reflected by the QEMU_CAPS_QUERY_CURRENT_MACHINE cap. If the cap is enabled, a new function qemuDomainProbeQMPCurrentMachine is called. This is wrapper for qemuMonitorGetCurrentMachineInfo, where the 'wakeup-suspend-support' flag is retrieved from 'query-current-machine'. If wakeupSuspendSupport is true, proceed with the regular flow of qemuDomainPMSuspendForDuration. The absence of QEMU_CAPS_QUERY_CURRENT_MACHINE indicates that we're dealing with a QEMU version older than 4.0 (which implements the required QMP API). In this case, proceed as usual with the suspend logic of qemuDomainPMSuspendForDuration, since we can't assume whether the guest has support or not. Fixes: https://bugs.launchpad.net/ubuntu/+source/qemu/+bug/1759509 Reported-by: Balamuruhan S <bala24@linux.vnet.ibm.com> Signed-off-by: Daniel Henrique Barboza <danielhb413@gmail.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-25 11:43:53 +02:00
Michal Privoznik	70a4e3ee07	qemu_monitor: Introduce handler for 'query-current-machine' command So far, this command returns a structure with only one member: 'wakeup-suspend-support'. But that's okay. It's what we are after anyway. Based-on-work-of: Daniel Henrique Barboza <danielhb413@gmail.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-25 11:43:53 +02:00
Daniel Henrique Barboza	dca1b1d007	qemu_capabilities: Add QEMU_CAPS_QUERY_CURRENT_MACHINE QEMU commit 46ea94ca9cf ("qmp: query-current-machine with wakeup-suspend-support") added a new QMP command called 'query-current-machine' that retrieves guest parameters that can vary in the same machine model (e.g. ACPI support for x86 VMs depends on the '--no-acpi' option). Currently, this API has a single flag, 'wakeup-suspend-support', that indicates whether the guest has the capability of waking up from suspended state. Introduce a libvirt capability that reflects whether qemu has the monitor command. Signed-off-by: Daniel Henrique Barboza <danielhb413@gmail.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-04-25 11:43:53 +02:00
Cole Robinson	77bca8b730	qemu: monitor: check for common 'Error: ' string qemu 4.0.0 will prefix most errors with 'Error: ', so consider any string instance of that an error. This fixes savevm failure detection when migration is blocked due to usage of nested VMX https://bugzilla.redhat.com/show_bug.cgi?id=1697997 Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-23 11:05:44 -04:00
Cole Robinson	d9ed7bb1dd	qemu: monitor cleanup delvm error handling Drop redundant NULL checks, and add an error string prefix Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-23 11:05:44 -04:00
Cole Robinson	a82c182171	qemu: monitor: cleanup loadvm error handling Drop redundant NULL checks, add error string prefixes, consolidate a few indentical reports. Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-23 11:05:43 -04:00
Michal Privoznik	918e8d6867	qemu_cgroup: Remove unused qemuSetupCpusetMems This function is not used anymore. Let's remove it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2019-04-18 17:59:19 +02:00
Michal Privoznik	0eaa4716e1	qemu: Set up EMULATOR thread and cpuset.mems before exec()-ing qemu It's funny how this went unnoticed for such a long time. Long story short, if a domain is configured with VIR_DOMAIN_NUMATUNE_MEM_STRICT libvirt doesn't really honour that. This is because of `7e72ac7878` after which libvirt allowed qemu to allocate memory just anywhere and only after that it used some magic involving cpuset.memory_migrate and cpuset.mems to move the memory to desired NUMA nodes. This was done in order to work around some KVM bug where KVM would fail if there wasn't a DMA zone available on the NUMA node. Well, while the work around might stopped libvirt tickling the KVM bug it also caused a bug on libvirt side: if there is not enough memory on configured NUMA node(s) then any attempt to start a domain must fail. Because of the way we play with guest memory domains can start just happily. The solution is to move the child we've just forked into emulator cgroup, set up cpuset.mems and exec() qemu only after that. This basically reverts `7e72ac7878` which was a workaround for kernel bug. This bug was apparently fixed because I've tested this successfully with recent kernel. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Martin Kletzander <mkletzan@redhat.com>	2019-04-18 17:53:42 +02:00
Michal Privoznik	ddc72f9902	qemu_hotplug: Check for duplicate drive addresses This tries to fix the same problem as `f1d6585300` but it's doing so in a less invasive way. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Tested-by: Daniel Henrique Barboza <danielhb413@gmail.com> Reviewed-by: Jim Fehlig <jfehlig@suse.com>	2019-04-18 17:09:02 +02:00
Daniel P. Berrangé	2f5e6502e3	virt drivers: don't handle type=network after resolving actual network type The call to resolve the actual network type will turn any NICs with type=network into one of the other types. Thus there should be no need to handle type=network in later switch() statements jumping off the actual type. Reviewed-by: Cole Robinson <crobinso@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-18 13:10:06 +01:00
Daniel P. Berrangé	518026e159	network: use 'bridge' as actual type instead of 'network' Ports allocated on virtual networks with type=nat\|route\|open all get given an actual type of 'network'. Only ports in networks with type=bridge use an actual type of 'bridge'. This distinction makes little sense since the virtualization drivers will treat both actual types in exactly the same way, as they're all just bridge devices a VM needs to be connected to. This doesn't affect user visible XML since the "actual" device XML is internal only, but we need code to convert the data upgrades. Reviewed-by: Laine Stump <laine@laine.org> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-18 13:10:00 +01:00
Michal Privoznik	4bdce1219f	qemu: Simplify interface handling in qemuConnectDomainXMLToNative() Firstly, VIR_STRDUP() accepts NULL, so there is no need to check if the string we want to duplicate is not-NULL. Secondly, virDomainNetSetModelString() also accepts NULL. Thirdly, we have VIR_AUTOFREE(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-04-17 10:47:23 +02:00
Andrea Bolognani	3c32f9ec42	qemu: Fix uninitialized variable It has made Clang very unhappy ever since `6bf7c67699`. Signed-off-by: Andrea Bolognani <abologna@redhat.com>	2019-04-17 09:33:28 +02:00
Cole Robinson	84a5e89b31	conf: Add VIR_DOMAIN_DEF_FEATURE_NET_MODEL_STRING This requires drivers to opt in to handle the raw modelstr network model, all others will error if a passed in XML value is not in the model enum. Enable this feature for libxl/xen/xm and qemu drivers Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-16 13:11:08 -04:00
Cole Robinson	41b002f934	qemu: Partially convert to net model enum This converts the qemu driver to the net model enum, for all the model values that we have hardcoded for various checks, which adds e1000e, virtio-transitional, virtio-non-transitional, usb-net, spapr-vlan, lan9118, smc91c111 Because the qemu driver has historically also allowed the raw model string onto the qemu command line, this isn't a full conversion. Unwinding that will require more thought. However for all new driver code we should be adding explicit enum values for any model name we have special handling for. Remove the now unused virDomainNetStreqModelString Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-16 13:11:08 -04:00
Cole Robinson	d79a2c079c	conf: net: Add model enum, and netfront value This adds a network model enum. The virDomainNetDef property is named 'model' like most other devices. When the XML parser or a driver calls NetSetModelString, if the passed string is in the enum, we will set net->model, otherwise we copy the string into net->modelstr Add a single example for the 'netfront' xen model, and wire that up, just to verify it's all working Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-16 13:11:08 -04:00
Cole Robinson	6bf7c67699	conf: net: Add wrapper functions for <model> value To ease converting the net->model value to an enum, add the wrapper functions: virDomainNetGetModelString virDomainNetSetModelString virDomainNetStreqModelString virDomainNetStrcaseeqModelString Acked-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Cole Robinson <crobinso@redhat.com>	2019-04-16 13:11:08 -04:00
Daniel P. Berrangé	bbe2aa627f	conf: simplify link from hostdev back to network device hostdevs have a link back to the original network device. This is fairly generic accepting any type of device, however, we don't intend to make use of this approach in future. It can thus be specialized to network devices. Reviewed-by: Cole Robinson <crobinso@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-16 14:44:53 +01:00
Daniel P. Berrangé	e1d10f8ef2	network: pass a virNetworkPtr to port management APIs The APIs for allocating/notifying/removing network ports just take an internal domain interface struct right now. As a step towards turning these into public facing APIs, add a virNetworkPtr argument to all of them. Reviewed-by: Cole Robinson <crobinso@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-16 14:44:53 +01:00
Daniel P. Berrangé	dd52444f23	network: restrict usage of port management APIs The port allocation APIs are currently called unconditionally for all types of NIC, but (mostly) only do anything for NICs with type=network. The exception is the port allocate API which does some validation even for NICs with type!=network. Relying on this validation is flawed, however, since the network driver may not even be installed. IOW virt drivers must not delegate validation to the network driver for NICs with type != network. This change allows us to report errors when the virtual network driver is not registered. Reviewed-by: Cole Robinson <crobinso@redhat.com> Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-16 14:44:53 +01:00
Martin Kletzander	2b342cda72	qemu: Add support for emulatorsched This helps in a scenarios where vCPUs run with a priority that is so high they might starve the emulator thread. And it also fits with the rest of the settings. Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-04-16 13:46:17 +02:00
Marc-André Lureau	ad32d76165	qemu: do not set wait:false for client sockets Qemu commit 767abe7 ("chardev: forbid 'wait' option with client sockets") effectively deprecates usage of "wait" with client sockets starting with qemu 4.0, and earlier versions ignored the value. Cc: Daniel P. Berrangé <berrange@redhat.com> Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2019-04-16 12:52:03 +02:00
Jiri Denemark	673c62a3b7	qemu: Don't cache microcode version My earlier commit `be46f61326` was incomplete. It removed caching of microcode version in the CPU driver, which means the capabilities XML will see the correct microcode version. But it is also cached in the QEMU capabilities cache where it is used to detect whether we need to reprobe QEMU. By missing the second place, the original commit `be46f61326` made the situation even worse since libvirt would report correct microcode version while still using the old host CPU model (visible in domain capabilities XML). Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-04-15 14:34:49 +02:00
Ján Tomko	5dd6e7f949	Delete QEMU_CAPS_KQEMU and QEMU_CAPS_ENABLE_KQEMU Support for kqemu was dropped in libvirt by commit `8e91a400c` and even back then we never set these capabilities when doing QMP probing. Since no QEMU we aim to support has these, drop them completely. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Andrea Bolognani <abologna@redhat.com>	2019-04-15 14:06:39 +02:00
Daniel Henrique Barboza	1a922648f6	PPC64 support for NVIDIA V100 GPU with NVLink2 passthrough The NVIDIA V100 GPU has an onboard RAM that is mapped into the host memory and accessible as normal RAM via an NVLink2 bridge. When passed through in a guest, QEMU puts the NVIDIA RAM window in a non-contiguous area, above the PCI MMIO area that starts at 32TiB. This means that the NVIDIA RAM window starts at 64TiB and go all the way to 128TiB. This means that the guest might request a 64-bit window, for each PCI Host Bridge, that goes all the way to 128TiB. However, the NVIDIA RAM window isn't counted as regular RAM, thus this window is considered only for the allocation of the Translation and Control Entry (TCE). For more information about how NVLink2 support works in QEMU, refer to the accepted implementation [1]. This memory layout differs from the existing VFIO case, requiring its own formula. This patch changes the PPC64 code of @qemuDomainGetMemLockLimitBytes to: - detect if we have a NVLink2 bridge being passed through to the guest. This is done by using the @ppc64VFIODeviceIsNV2Bridge function added in the previous patch. The existence of the NVLink2 bridge in the guest means that we are dealing with the NVLink2 memory layout; - if an IBM NVLink2 bridge exists, passthroughLimit is calculated in a different way to account for the extra memory the TCE table can alloc. The 64TiB..128TiB window is more than enough to fit all possible GPUs, thus the memLimit is the same regardless of passing through 1 or multiple V100 GPUs. Further reading explaining the background [1] https://lists.gnu.org/archive/html/qemu-devel/2019-03/msg03700.html [2] https://www.redhat.com/archives/libvir-list/2019-March/msg00660.html [3] https://www.redhat.com/archives/libvir-list/2019-April/msg00527.html Signed-off-by: Daniel Henrique Barboza <danielhb413@gmail.com> Reviewed-by: Erik Skultety <eskultet@redhat.com>	2019-04-15 07:41:43 +02:00
Daniel Henrique Barboza	cc9f03801c	qemu_domain: NVLink2 bridge detection function for PPC64 The NVLink2 support in QEMU implements the detection of NVLink2 capable devices by verifying the attributes of the VFIO mem region QEMU allocates for the NVIDIA GPUs. To properly allocate an adequate amount of memLock, Libvirt needs this information before a QEMU instance is even created, thus querying QEMU is not possible and opening a VFIO window is too much. An alternative is presented in this patch. Making the following assumptions: - if we want GPU RAM to be available in the guest, an NVLink2 bridge must be passed through; - an unknown PCI device can be classified as a NVLink2 bridge if its device tree node has 'ibm,gpu', 'ibm,nvlink', 'ibm,nvlink-speed' and 'memory-region'. This patch introduces a helper called @ppc64VFIODeviceIsNV2Bridge that checks the device tree node of a given PCI device and check if it meets the criteria to be a NVLink2 bridge. This new function will be used in a follow-up patch that, using the first assumption, will set up the rlimits of the guest accordingly. Signed-off-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-04-15 07:06:52 +02:00
Martin Kletzander	673f805d4d	qemu: Label uniqDir when probing capabilities This does not cause a problem in usual scenarios thanks to us allowing CAP_DAC_OVERRIDE for the qemu process, however in some scenarios this might be an issue because the directory is created with mkdtemp(3) which explicitly creates that with 0700 permissions and qemu running as non-root cannot access that. The scenarios include: - Builds without CAPNG - Running libvirtd in certain container configurations [1] - and possibly others. [1] https://github.com/kubevirt/kubevirt/pull/2181#issuecomment-481840304 Signed-off-by: Martin Kletzander <mkletzan@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-04-13 00:56:45 +02:00
Jiri Denemark	370177e2f6	cpu_x86: Store virCPUx86DataItem content in union The structure can only be used for CPUID data now. Adding a type indicator and moving the data into a union will let us store alternative data types. Signed-off-by: Jiri Denemark <jdenemar@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-04-12 22:53:39 +02:00

... 5 6 7 8 9 ...

9028 Commits