libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2025-01-08 14:05:19 +00:00

Author	SHA1	Message	Date
Michal Privoznik	8fca346c97	qemu: s/qemuDomainChangeDiskMediaLive/qemuDomainChangeDiskLive/ While we currently only allow changing a media in a disk, this is going to change in a while, so the function name would be invalid. Moreover, the old name does not match the pattern laid out by other update functions. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-09-18 15:51:18 +02:00
Michal Privoznik	9af8447720	qemuDomainUpdateDeviceConfig: Allow startupPolicy update, yet again https://bugzilla.redhat.com/show_bug.cgi?id=1159219 So, in `11e058ca58` I've tried to make UpdateDevice update startupPolicy too. And it worked well until somebody came around and pushed `d0dc6c0369` which accidentally removed my contribution. Redo my commit. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-09-18 15:51:18 +02:00
Jiri Denemark	79ccfec803	qemu: Fix some corner cases in persistent migration When persistently migrating a domain to a destination host where the same domain already exists (i.e., it is persistent and shutdown at the destination), we would happily throw away the original persistent definition without properly freeing it. And when updating the definition fails for some reason we don't properly revert to the original state leaving the domain broken. In addition to fixing these issues, the patch also makes sure the domain definition parsed from a migration cookie is either used or freed. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 14:40:55 +02:00
Jiri Denemark	c641d55083	qemu: Queue events in migration Finish phase ASAP For quite a long time we don't need to postpone queueing events until the end of the function since we no longer have the big driver lock. Let's make the code of qemuMigrationFinish simpler by queuing events at the time we generate them. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 13:50:04 +02:00
Jiri Denemark	cda2afac79	qemuDomainEventQueue: Check if event is non-NULL Every single call to qemuDomainEventQueue() uses the following pattern: if (event) qemuDomainEventQueue(driver, event); Let's move the check for valid event to qemuDomainEventQueue and simplify all callers. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 13:50:03 +02:00
Jiri Denemark	5f7ad32778	qemu: Don't report false errors in migration protocol v2 Finish is the final state in v2 of our migration protocol. If something fails, we have no option to abort the migration and resume the original domain. Non-fatal errors (such as failure to start guest CPUs or make the domain persistent) have to be treated as success. Keeping the domain running while reporting the failure was just asking for trouble. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 13:50:03 +02:00
Jiri Denemark	cc40c28410	qemu: Kill domain when migration finish fails Whenever something fails during incoming migration in Finish phase before we started guest CPUs, we need to kill the domain in addition to reporting the failure. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 13:50:03 +02:00
Jiri Denemark	f5c509623f	qemu: Don't fail migration on save status failure When we save status XML at the point during migration where we have already started the domain on destination, we can't really go back and abort migration. Thus the only thing we can do is to log a warning and report success. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 13:50:02 +02:00
Jiri Denemark	8874d37f94	qemu: Simplify qemuMigrationFinish Offline migration is quite special because we don't really need to do anything but make the domain persistent. Let's do it separately from normal migration to avoid cluttering the code with !(flags & VIR_MIGRATE_OFFLINE). Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 13:49:49 +02:00
Jiri Denemark	a86b188567	qemu: Split qemuMigrationFinish Separate code which makes incoming domain persistent into qemuMigrationPersist. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-09-18 10:52:39 +02:00
John Ferlan	b421a70811	virfile: Check for existence of dir in virFileDeleteTree Commit id 'f1f68ca33' added code to remove the directory paths for auto-generated sockets, but that code could be called before the paths were created resulting in generating error messages from virFileDeleteTree indicating that the file doesn't exist. Rather than "enforce" all callers to make the non-NULL and existence checks, modify the virFileDeleteTree API to silently ignore NULL on input and non-existent directory trees.	2015-09-16 11:23:16 -04:00
Andrea Bolognani	eb36666d22	qemu: Fix using guest architecture as lookup key When looking for a QEMU binary suitable for running ppc64le guests we have to take into account the fact that we use the QEMU target as key for the hash, so direct comparison is not good enough. Factor out the logic from virQEMUCapsFindBinaryForArch() to a new virQEMUCapsFindTarget() function and use that both when looking for QEMU binaries available on the system and when looking up QEMU capabilities later. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1260753	2015-09-16 10:31:11 +02:00
Pavel Fedin	d526e37bad	Ignore virtio-mmio disks in qemuAssignDevicePCISlots() Fixes the following error when attempting to add a disk with bus='virtio' to a machine which actually supports virtio-mmio (caught with ARM virt): virtio disk cannot have an address of type 'virtio-mmio' The problem has been likely introduced by `e8d5517254`. Before that qemuAssignDevicePCISlots() was never called for ARM "virt" machine. Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-09-15 11:35:50 +02:00
Michal Privoznik	307fb9044c	virSecurityManager: Track if running as privileged We may want to do some decisions in drivers based on fact if we are running as privileged user or not. Propagate this info there. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-09-15 11:30:14 +02:00
Martin Kletzander	192a139489	qemu: Do not allow others into per-VM subdirectories Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-09-14 10:06:00 +02:00
Peter Krempa	64c6695f1a	qemu: hotplug: Properly clean up drive backend if frontend hotplug fails Commit `8125113c` added code that should remove the disk backend if the frontend hotplug failed for any reason. The code had a bug though as it used the disk string for unplug rather than the backend alias. Fix the code by pre-creating an alias string and using it instead of the disk string. In cases where qemu does not support QEMU_CAPS_DEVICE, we ignore the unplug of the backend since we can't really create an alias in that case. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1262399	2015-09-14 09:41:43 +02:00
Cole Robinson	db35beaa1d	qemu: command: Report stderr from qemu-bridge-helper There's a couple reports of things failing in this area (bug 1259070), but it's tough to tell what's going wrong without stderr from qemu-bridge-helper. So let's report stderr in the error message Couple new examples: virbr0 is inactive: internal error: /usr/libexec/qemu-bridge-helper --use-vnet --br=virbr0 --fd=21: failed to communicate with bridge helper: Transport endpoint is not connected stderr=failed to get mtu of bridge `virbr0': No such device bridge isn't on the ACL: internal error: /usr/libexec/qemu-bridge-helper --use-vnet --br=br0 --fd=21: failed to communicate with bridge helper: Transport endpoint is not connected stderr=access denied by acl file	2015-09-11 12:57:42 -04:00
Andrea Bolognani	75036c69df	qemu: Try several network devices when looking for a default Up until now, the default has been rtl8139, but no check was in place to make sure that device was actually available. Now we try rtl8139, e1000 and virtio-net in turn, checking for availability before using any of them: this means we have a much better chance for the guest to be able to boot.	2015-09-10 11:15:38 +02:00
Andrea Bolognani	fedbb015a9	qemu: Introduce QEMU_CAPS_DEVICE_VIRTIO_NET This capability can be used to detect whether or not the QEMU binary supports the virtio-net-* network device.	2015-09-10 09:25:02 +02:00
Andrea Bolognani	fb58318d7f	qemu: Introduce QEMU_CAPS_DEVICE_E1000 This capability can be used to detect whether or not the QEMU binary supports the e1000 network device.	2015-09-10 09:25:01 +02:00
Andrea Bolognani	b8d545a8b8	qemu: Introduce QEMU_CAPS_DEVICE_RTL8139 This capability can be used to detect whether or not the QEMU binary supports the rtl8139 network device.	2015-09-10 09:25:01 +02:00
Martin Kletzander	8370023730	qemu: Report error if per-VM directory cannot be created Commit `f1f68ca334` did not report an error if virFileMakePath() returned -1. Well, who would've guessed function with name starting with 'vir' sets an errno instead of reporting an error the libvirt way. Anyway, let's fix it, so the output changes from: $ virsh start arm error: Failed to start domain arm error: An error occurred, but the cause is unknown to: $ virsh start arm error: Failed to start domain arm error: Cannot create directory '/var/lib/libvirt/qemu/domain-arm': Not a directory Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1146886 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-09-09 13:38:18 +02:00
Peter Krempa	a98e5a7815	qemu: migration: Relax enforcement of memory hotplug support If the current live definition does not have memory hotplug enabled, but the persistent one does libvirt would reject migration if the destination does not support memory hotplug even if the user didn't want to persist the VM at the destination and thus the XML containing the memory hotplug definition would not be used. To fix this corner case the code will check for memory hotplug in the newDef only if VIR_MIGRATE_PERSIST_DEST was used.	2015-09-09 09:39:55 +02:00
John Ferlan	ea3c5f25eb	qemu: Check virGetLastError return value for migration finish failure Commit id '2e7cea243' added a check for an error from Finish instead of 'unexpected error'; however, if for some reason there wasn't an error, then virGetLastError could return NULL resulting in the NULL pointer deref to err->domain.	2015-09-04 15:19:04 -04:00
John Ferlan	a39ab90908	qemu: Need to check for machine.os when using ADDRESS_TYPE_CCW https://bugzilla.redhat.com/show_bug.cgi?id=1258361 When attaching a disk, controller, or rng using an address type ccw or s390, we need to ensure the support is provided by both the machine.os and the emulator capabilities (corollary to unconditional setting when address was not provided for the correct machine.os and emulator). For an inactive guest, an addition followed by a start would cause the startup to fail after qemu_command builds the command line and attempts to start the guest. For an active guest, libvirtd would crash.	2015-09-04 08:47:33 -04:00
John Ferlan	d334c91751	qemu: Introduce qemuDomainMachineIsS390CCW Rather than have different usages of STR function in order to determine whether the domain is s390-ccw or s390-ccw-virtio, make a single API which will check the machine.os prefix. Then use the function.	2015-09-04 08:47:33 -04:00
Jonathan Toppins	5c668a78d8	qemu: add udp interface support Adds a new interface type using UDP sockets, this seems only applicable to QEMU but have edited tree-wide to support the new interface type. The interface type required the addition of a "localaddr" (local address), this then maps into the following xml and qemu call. <interface type='udp'> <mac address='52:54:00:5c:67:56'/> <source address='127.0.0.1' port='11112'> <local address='127.0.0.1' port='22222'/> </source> <model type='virtio'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x07' function='0x0'/> </interface> QEMU call: -net socket,udp=127.0.0.1:11112,localaddr=127.0.0.1:22222 Notice the xml "local" entry becomes the "localaddr" for the qemu call. reference: http://lists.gnu.org/archive/html/qemu-devel/2011-11/msg00629.html Signed-off-by: Jonathan Toppins <jtoppins@cumulusnetworks.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-09-02 10:17:50 +02:00
Jim Fehlig	60acb38abb	Revert "LXC: show used memory as 0 when domain is not active" This reverts commit `1ce7c1d20c`, which introduced a significant semantic change to the virDomainGetInfo() API. Additionally, the change was only made to 2 of the 15 virt drivers. Conflicts: src/qemu/qemu_driver.c Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2015-08-28 09:05:26 -06:00
Luyao Huang	8f8031df19	qemu: Emit correct audit message for memory hot unplug https://bugzilla.redhat.com/show_bug.cgi?id=1226234#c3 If the qemu monitor fails to remove the memory from the guest for any reason, the auditlog message will incorrectly use the current actual memory (via virDomainDefGetMemoryActual) instead of the value we were attempting to reduce to. The result is the 'new-mem' and 'old-mem' values for the auditlog message would be identical. This patch creates a local 'newmem' which accounts for the current memory size minus the memory which is being removed. NB, for the success case this results in the same value that would be returned by virDomainDefGetMemoryActual without the need to do the math. This follows the existing code which would subtract the size for cur_balloon. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-26 17:47:49 -04:00
Luyao Huang	cb1fbda4a1	qemu: Emit correct audit message for memory hot plug https://bugzilla.redhat.com/show_bug.cgi?id=1226234#c3 Prior to this patch, after successfully hot plugging memory the audit log indicated that the update failed, e.g.: type=VIRT_RESOURCE ... old-mem=1024000 new-mem=1548288 \ exe="/usr/sbin/libvirtd" hostname=? addr=? terminal=pts/2 res=failed This patch will adjust where virDomainAuditMemory is called to ensure the proper 'ret' value is used based on success or failure. Additionally, the audit message should include the size of the memory we were attempting to change to rather than the current actual size. On failure to add, the message showed the same value for old-mem and new-mem. In order to do this, introduce a 'newmem' local which will compute the new size based on the oldmem size plus the size of memory we are about to add. NB: This would be the same as calling the virDomainDefGetMemoryActual again on success, but avoids the overhead of recalculating. Plus cur_balloon is already adjusted by the same value, so this follows that. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-26 17:47:49 -04:00
Martin Kletzander	f674dc6794	qemu: Label correct per-VM path when starting Commit `f1f68ca334` overused mdir_name() even though it was not needed in the latest version, hence labelling the directory one level up in the tree and not the one it should. If anyone with SELinux managed to try running a domain with guest agent set up, it's highly possible that they will need to run 'restorecon -F /var/lib/libvirt/qemu/channel/target' to fix what was done. Reported-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-26 10:44:14 +02:00
Luyao Huang	bf2788218a	qemu: Update blkio.weight value after successful set https://bugzilla.redhat.com/show_bug.cgi?id=1253107 Make a call virCgroupGetBlkioWeight to re-read blkio.weight right after it is set in order to keep internal data up-to-date. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-25 16:20:39 -04:00
Vasiliy Tolstov	bbc705d130	Eliminate incorrect and unnecessary check for changed IP address Commit aa2cc7 modified a previously unnecessary but innocuous check for interface IP address during interface update incorrectly, causing all attempted updates (e.g. changing link state) to interfaces of type='ethernet' for QEMU to fail. This patch fixes the issue by completely removing the check for IP address, which is pointless since QEMU doesn't support setting interface IP addresses from the domain interface XML anyway. Signed-off-by: Vasiliy Tolstov <v.tolstov@selfip.ru> Signed-off-by: Laine Stump <laine@laine.org>	2015-08-24 10:23:40 -04:00
Luyao Huang	c8e5177303	qemu: add a check for nodeset in qemuDomainSetNumaParamsLive We try to set the node in cpuset.mems without checking whether it is available, even though we already have a helper to check this. Call virNumaNodesetIsAvailable to check if the node is available, then try to change it in the cgroup. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-24 15:20:35 +02:00
Martin Kletzander	f1f68ca334	qemu: Fix access to auto-generated socket paths We are automatically generating some socket paths for domains, but all those paths end up in a directory that's the same for multiple domains. The problem is that multiple domains can each run with different seclabels (users, selinux contexts, etc.). The idea here is to create a per-domain directory labelled in a way that each domain can access its own unix sockets. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1146886 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-24 11:53:17 +02:00
Martin Kletzander	c7596cf0e0	qemu: Report better error message when renaming to existing domain name Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-19 14:13:06 -07:00
Martin Kletzander	7b5acf9461	qemu: Sync BlkioDevice values when setting them in cgroups The problem here is that there are some values that kernel accepts, but does not set them, for example 18446744073709551615 which acts the same way as zero. Let's do the same thing we do with other tuning options and re-read them right after they are set in order to keep our internal structures up-to-date. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1165580 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-18 16:27:43 -07:00
Michal Privoznik	ed4da64657	qemuDomainRename: Don't leave a domain locked upon fail Well, yet again one case of 'goto cleanup' while 'goto endjob' was needed. Sorry. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-18 20:42:44 +02:00
John Ferlan	32c6b1908b	qemu: Add check for invalid iothread_id in qemuDomainChgIOThread https://bugzilla.redhat.com/show_bug.cgi?id=1251886 Since iothread_id == 0 is an invalid value for QEMU let's point that out specifically. For the IOThreadDel code, the failure would have ended up being a failure to find the IOThread ID; however, for the IOThreadAdd code - an IOThread 0 was added and that isn't good. It seems during many reviews/edits to the code the check for iothread_id = 0 being invalid was lost - it could have originally been in the API code, but requested to be moved - I cannot remember.	2015-08-18 14:36:21 -04:00
Luyao Huang	4153a74105	qemuDomainAddCgroupForThread: Don't overwrite the error Just like in commit `704cf06`, if virCgroup*() fails, the error is already reported. There's no need to overwrite the error with a generic one and possibly hiding the true root cause of the error. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-18 19:53:58 +02:00
Michal Privoznik	0ace4d985a	qemuDomainRename: Explicitly check if domain is renaming to itself It may happen that user (mistakenly) wants to rename a domain to itself. Which is no renaming at all. We should reject that with some meaningful error message. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-17 23:41:17 +02:00
John Ferlan	c4cfc0d037	qemu: Resolve Coverity UNINIT Coverity complained that 'vm' wasn't initialized before jumping to cleanup: and calling virDomainObjEndAPI if the VIR_STRDUP fails. So I initialized vm = NULL and also moved the VIR_STRDUP closer to usage and used endjob for goto. Lots of other reasons for failures.	2015-08-15 15:53:26 -04:00
Tomas Meszaros	b5d63e997b	qemu: Implement virDomainRename Currently supports only renaming inactive domains without snapshots. Signed-off-by: Tomas Meszaros <exo@tty.sk> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-14 10:50:20 +02:00
Guido Günther	0e4972fe48	Detect location of qemu-bridge-helper RedHat and Debian based distros use different locations Reference: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=790935	2015-08-13 21:31:55 +02:00
Guido Günther	151ba02293	Check if qemu-bridge-helper exists and is executable Otherwise the error is just error: Failed to create domain from test1.xml error: failed to retrieve file descriptor for interface: Transport endpoint is not connected since we don't get a sensible error after the fork.	2015-08-13 21:31:54 +02:00
Martin Kletzander	776924e376	qemu: Use numad information when getting pin information Pinning information returned for emulatorpin and vcpupin calls is being returned from our data without querying cgroups for some time. However, not all the data were utilized. When automatic placement is used the information is not returned for the calls mentioned above. Since the numad hint in private data is properly saved/restored, we can safely use it to return true information. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1162947 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-13 15:06:06 +02:00
Martin Kletzander	8ce86722d7	qemu: Keep numad hint after daemon restart The numad hint stored in priv->autoNodeset is information that gets lost during daemon restart. And because we would like to use that information in the future, we also need to save it in the status XML. For the sake of tests, we need to initialize nnumaCell_max to some value, so that the restoration doesn't fail in our test suite. There is no need to fill in the actual numa cell data since the recalculating function virCapabilitiesGetCpusForNodemask() will not fail, it will just skip filling the data in the bitmap which we don't use in tests anyway. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-13 15:06:06 +02:00
Martin Kletzander	7c8028cda9	conf: Pass private data to Parse function of XML options This needs a reorder of XML option definitions. It might come in handy one day. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-13 15:06:06 +02:00
Martin Kletzander	92ddffdbd3	qemu: Fix segfault when parsing private domain data When parsing private domain data, there are two paths that are flawed. They are both error paths, just from different parts of the function. One of them can call free() on an uninitialized pointer. Initialization to NULL is enough here. The other one is a bit trickier to explain, but as easy as the first one to fix. We create capabilities, parse them and then assign them into the private data pointer inside the domain object. If, however, we get to fail from now on, the error path unrefs the capabilities and then, when the domain object is being cleaned, qemuDomainObjPrivateFree() tries to unref them as well. That causes a segfault. Setting the pointer to NULL upon successful addition to the private data is enough. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-13 15:06:06 +02:00
John Ferlan	1b08cc170a	conf: Check for hostdev conflicts when assign default disk address https://bugzilla.redhat.com/show_bug.cgi?id=1210587 (completed) When generating the default drive address for a SCSI <disk> device, check the generated address to ensure it doesn't conflict with a SCSI <hostdev> address. The <disk> address generation algorithm uses the <target> "dev" name in order to determine which controller and unit in order to place the device. Since a SCSI <hostdev> device doesn't require a target device name, its placement on the guest SCSI address "could" conflict. For instance, if a SCSI <hostdev> exists at controller=0 unit=0 and an attempt to hotplug 'sda' into the guest made, there would be a conflict if the <hostdev> is already using /dev/sda.	2015-08-12 16:09:05 -04:00
Frank Schreuder	69a3b0df2d	Drive hot-unplug: reliable parsing of HMP results Hot-unplugging a disk from a guest that supports hot-unplugging generates an error in the libvirt log when running QEMU with the "-msg timestamp=on" flag. 2015-08-06 10:48:59.945+0000: 11662: error : qemuMonitorTextDriveDel:2594 : operation failed: deleting drive-virtio-disk4 drive failed: 2015-08-06T10:48:59.945058Z Device 'drive-virtio-disk4' not found This error is caused because the HMP results are getting prefixed with a timestamp. Parsing the output is not reliable with STRPREFIX as the results can be prefixed with a timestamp. Using strstr ensures that parsing the output works whether the results are prefixed or not. Cc: Stefan Hajnoczi <stefanha@redhat.com> Cc: Daniel P. Berrange <berrange@redhat.com> Signed-off-by: Frank Schreuder <fschreuder@transip.nl>	2015-08-12 18:30:28 +02:00
Laine Stump	d5e6d1cfc7	Revert "qemu: Allow to plug virtio-net-pci into PCIe slot" This reverts commit `ede34470fd`, which was apparently written based on testing performed before commits `1e15be1` and 9a12b6 were pushed upstream. Once those two patches are in place, commit `ede34470` is redundant, and can even cause incorrect/unexpected behavior when auto-assigning addresses for virtio-net devices.	2015-08-12 11:23:29 -04:00
Laine Stump	9bd16ad3b4	qemu: fix qemuDomainSupportsPCI() for ARM machines of "virt" machinetype Commit `e8d5517` updated the domain post-parse to automatically add pcie-root et al for certain ARM "virt" machinetypes, but didn't update the function qemuDomainSupportsPCI() which is called later on when we are auto-assigning PCI addresses and default settings for the PCI controller <model> and <target> attributes. The result was that PCI addresses weren't assigned, and the controllers didn't have their attribute default values set, leading to an error when the domain was started, e.g.: internal error: autogenerated dmi-to-pci-bridge options not set This patch adds the same check made in the earlier patch to qemuDomainSupportsPCI(), so that PCI address auto-assignment and target/model default values will be set.	2015-08-11 16:11:05 -04:00
Michal Privoznik	b044e3257f	qemu: Implement VIR_DOMAIN_BANDWIDTH_IN_FLOOR Well, there are just two places that need adjustment: qemuDomainGetInterfaceParameters - to report the @floor; qemuDomainSetInterfaceParameters - now that the function has been fixed, we can allow updating @floor too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Michal Privoznik	5ee6d243fc	qemuDomainSetInterfaceParameters: Use new functions to update bandwidth As sketched in previous commits, imagine the following scenario: virsh # domiftune gentoo vnet0 inbound.average: 100 inbound.peak : 0 inbound.burst : 0 outbound.average: 100 outbound.peak : 0 outbound.burst : 0 virsh # domiftune gentoo vnet0 --inbound 0 virsh # shutdown gentoo Domain gentoo is being shutdown virsh # list --all error: Failed to list domains error: Cannot recv data: Connection reset by peer Program received signal SIGSEGV, Segmentation fault. 0x00007fffe80ea221 in networkUnplugBandwidth (net=0x7fff9400c1a0, iface=0x7fff940ea3e0) at network/bridge_driver.c:4881 4881 net->floor_sum -= ifaceBand->in->floor; This is rather unfortunate. We should not SIGSEGV here. The problem is, that while in the second step the inbound QoS was cleared out, the network part of it was not updated (moreover, we don't report that vnet0 had inbound.floor set). Internal structure therefore still had some fragments left (e.g. class_id). So when qemuProcessStop() started to clean up the environment it got to networkUnplugBandwidth(). Here, class_id is set therefore function assumes that there is an inbound QoS. This actually is a fair assumption to make, there's no need for a special QoS box in network's QoS when there's no QoS to set. Anyway, the problem is not the networkUnplugBandwidth() rather than qemuDomainSetInterfaceParameters() which completely forgot about QoS being disperse (some parts are set directly on interface itself, some on bridge the interface is plugged into). Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Laine Stump	f4f1d18dc4	qemu: fail on attempts to use <filterref> for non-tap network connections nwfilter uses iptables and ebtables, which only work properly on tap-based network connections (not on macvtap, for example), but we just ignore any <filterref> elements for other types of networks, potentially giving users a false sense of security. This patch checks the network type and fails/logs an error if any domain <interface> has a <filterref> when the connection isn't using a tap device. This resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1180011	2015-08-10 13:08:41 -04:00
Martin Kletzander	cf0404455c	qemu: Enable ioeventfd usage for virtio-scsi controllers Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1150484 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-10 15:05:34 +02:00
Cao jin	17cba9fb51	qemuMonitorOpenInternal: remove redundant code There's no need to set mon->fd to a dummy value since it's initialized to proper value just a few lines below. Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-08-10 13:47:33 +02:00
Laine Stump	7d69387cd6	qemu: support new pci controller model "pcie-switch-downstream-port" This is backed by the qemu device xio3130-downstream. It can only be connected to a pcie-switch-upstream-port (x3130-upstream) on the upstream side.	2015-08-09 22:32:00 -04:00
Laine Stump	76379a6ec1	conf: new pcie-controller model "pcie-switch-downstream-port" This controller can be connected only to a port on a pcie-switch-upstream-port. It provides a single hotpluggable port that will accept any PCI or PCIe device, as well as any device requiring a pcie-*-port (the only current example of such a device is the pcie-switch-upstream-port).	2015-08-09 22:30:47 -04:00
Laine Stump	ad1748a1aa	qemu: add capabilities bit for device xio3130-downstream The downstream ports of an x3130-upstream switch can each have one of these plugged into them (and that is the only place they can be connected). Each xio3130-downstream provides a single PCIe port that can have PCI or PCIe devices hotplugged into it. Apparently an entire set of x3130-upstream + several xio3130-downstreams can be hotplugged as a unit, but it's not clear to me yet how that would be done, since qemu only allows attaching a single device at a time. This device will be used to implement the "pcie-switch-downstream-port" model of pci controller.	2015-08-09 22:29:25 -04:00
Laine Stump	cb99086d1b	qemu: support new pci controller model "pcie-switch-upstream-port" this is backed by the qemu device x3130-upstream. It can only plug into a pcie-root-port or pcie-switch-downstream-port.	2015-08-09 22:16:10 -04:00
Laine Stump	38ea9515af	conf: new pci controller model "pcie-switch-upstream-port" This controller can be connected only to a pcie-root-port or a pcie-switch-downstream-port (which will be added in a later patch), which is the reason for the new connect type VIR_PCI_CONNECT_TYPE_PCIE_PORT. A pcie-switch-upstream-port provides 32 ports (slot=0 to slot=31) on the downstream side, which can only have pci controllers of model "pcie-switch-downstream-port" plugged into them, which is the reason for the other new connect type VIR_PCI_CONNECT_TYPE_PCIE_SWITCH.	2015-08-09 22:12:29 -04:00
Laine Stump	4cde758808	qemu: add capabilities bit for device x3130-upstream This is the upstream part of a PCIe switch. It connects to a PCIe port (but not PCI) on the upstream side, and can have up to 31 xio3130-downstream controllers (but no other types of devices) connected to its downstream side. This device will be used to implement the "pcie-switch-upstream-port" model of pci controller.	2015-08-09 22:02:16 -04:00
Laine Stump	16328520f6	qemu: support new pci controller model "pcie-root-port" This is backed by the qemu device ioh3420. chassis and port from the <target> subelement are used to store/set the respective qemu device options for the ioh3420. Currently, chassis is set to be the index of the controller, and port is set to "(slot << 3) + function" (per suggestion from Alex Williamson).	2015-08-09 21:58:55 -04:00
Laine Stump	dce3b8beb3	conf: new pci controller model "pcie-root-port" This controller can be connected (at domain startup time only - not hotpluggable) only to a port on the pcie root complex ("pcie-root" in libvirt config), hence the new connect type VIR_PCI_CONNECT_TYPE_PCIE_ROOT. It provides a hotpluggable port that will accept any PCI or PCIe device. New attributes must be added to the controller <target> subelement for this - chassis and port are guest-visible option values that will be set by libvirt with values derived from the controller's index and pci address information.	2015-08-09 21:52:52 -04:00
Laine Stump	408b100a06	qemu: add capabilities bit for device ioh3420 This is a PCIE "root port". It connects only to a port of the integrated pcie.0 bus of a Q35 machine (can't be hotplugged), and provides a single PCIe port that can have PCI or PCIe devices hotplugged into it. This device will be used to implement the "pcie-root-port" model of pci controller.	2015-08-09 21:44:11 -04:00
Laine Stump	18c104516e	qemu: implement <target chassisNr='n'/> subelement/attribute of <controller> This uses the new subelement/attribute in two ways: 1) If a "pci-bridge" pci controller has no chassisNr attribute, it will automatically be set to the controller's index as soon as the controller's PCI address is known (during qemuDomainAssignPCIAddresses()). 2) when creating the commandline for a pci-bridge device, chassisNr will be used to set qemu's chassis_nr option (rather than the previous practice of hard-coding it to the controller's index).	2015-08-09 21:40:40 -04:00
Laine Stump	572ebdbce7	qemu: implement <model> subelement to <controller> This patch provides qemu support for the contents of <model> in <controller> for the two existing PCI controller types that need it (i.e. the two controller types that are backed by a device that must be specified on the qemu commandline): 1) pci-bridge - sets <model> name attribute default as "pci-bridge" 2) dmi-to-pci-bridge - sets <model> name attribute default as "i82801b11-bridge". These both match current hardcoded practice. The defaults are set at the end of qemuDomainAssignPCIAddresses(). This can't be done earlier because some of the options that will be autogenerated need full PCI address info for the controller, and because qemuDomainAssignPCIAddresses() might create extra controllers which would need default settings added, and that hasn't yet been done at the time the PostParse callbacks are being run. qemuDomainAssignPCIAddresses() is still called prior to the XML being written to disk, though, so the autogenerated defaults are persistent. qemu capabilities bits aren't checked when the domain is defined, but rather when the commandline is actually created (so the domain can possibly be defined on a host that doesn't yet have support for the given device, or a host different from the one where it will eventually be run). When the commandline is being generated we compare the modelName to known qemu device names implementing the given type of controller, and check the capabilities bit for that device.	2015-08-09 21:33:58 -04:00
Peter Krempa	8dc2725925	qemu: Fix reporting of physical capacity for block devices Qemu reports physical size 0 for block devices. As `15fa84acbb` changed the behavior of qemuDomainGetBlockInfo to just query the monitor this created a regression since we didn't report the size correctly any more. This patch adds code to refresh the physical size of a block device by opening it and seeking to the end and uses it both in qemuDomainGetBlockInfo and also in qemuDomainGetStatsOneBlock that was broken since it was introduced in this respect. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1250982	2015-08-07 13:28:50 +02:00
Michal Privoznik	c646814438	qemuDomainDefPostParse: Adjust indent While reviewing `e8d551725` I've noticed a few unaligned lines. Fix this. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-06 15:33:01 +02:00
Pavel Fedin	ede34470fd	qemu: Allow to plug virtio-net-pci into PCIe slot virtio-net-pci adapter is capable to use irqfd with vhost-net only in MSI-X mode, which appears to be available only on PCIe bus, at least on ARM Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-08-06 14:28:05 +02:00
Pavel Fedin	8b78ec011c	qemu: Build correct command line for PCI NICs on ARM Legacy -net option works correctly only with embedded device models, which do not require any bus specification. Therefore, we should use -device for PCI hardware Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-08-06 14:25:02 +02:00
Pavel Fedin	e8d5517254	qemu: Add PCI-Express root to ARM virt machine Here we assume that if qemu supports generic PCI host controller, it is a part of virt machine and can be used for adding PCI devices. In qemu this is actually a PCIe bus, so we also declare multibus capability so that 0'th bus is specified to qemu correctly as 'pcie.0' Signed-off-by: Pavel Fedin <p.fedin@samsung.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-06 14:24:51 +02:00
Pavel Fedin	8a482abf75	qemu: Introduce QEMU_CAPS_OBJECT_GPEX This capability specifies that qemu can implement generic PCI host controller. It is often used for virtual environments, including ARM. Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-08-06 13:59:22 +02:00
Peter Krempa	6da3b694cc	qemu: Forbid image pre-creation for non-shared storage migration Libvirt doesn't reliably know the location of the backing chain when pre-creating images for non-shared migration. This isn't a problem for full copy, but incremental copy requires the information. Forbid pre-creating the image in cases where incremental migration is required. This limitation can perhaps be lifted once libvirt will fully support loading of backing chain information from the XML. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1249587	2015-08-05 17:24:59 +02:00
John Ferlan	36025c552c	conf: Allow error reporting in virDomainDiskSourceIsBlockType Rather than provide a somewhat generic error message when the API returns false, allow the caller to supply a "report = true" option in order to cause virReportError's to describe which of the 3 paths that can cause failure. Some callers don't care about what caused the failure, they just want to have a true/false - for those, calling with report = false should be sufficient.	2015-08-04 07:19:25 -04:00
Kothapally Madhu Pavan	d9557572ae	Avoid starting a PowerPC VM with floppy disk PowerPC pseries based VMs do not support a floppy disk controller. This prohibits libvirt from creating qemu command with floppy device. Signed-off-by: Kothapally Madhu Pavan <kmp@linux.vnet.ibm.com> https://bugzilla.redhat.com/show_bug.cgi?id=1180486 Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-08-04 10:17:07 +02:00
Kothapally Madhu Pavan	020a178318	Caps: Disable floppy disk for PowerPC VM PowerPC pseries based VMs do not support a floppy disk controller. This prohibits libvirt from adding floppy disk for a PowerPC pseries VM. Signed-off-by: Kothapally Madhu Pavan <kmp@linux.vnet.ibm.com>	2015-08-04 10:16:20 +02:00
Martin Kletzander	c43c661fe4	qemu: Remove double unlock for domains The virDomainObjListRemove() function unlocks a domain that it's given due to legacy code. And because of that code, which should be refactored, that last virObjectUnlock() cannot be just removed. So instead, lock it right back for qemu for now. All calls to qemuDomainRemoveInactive() are followed by code that unlocks the domain again, plus the domain should be locked during qemuDomainObjEndJob(), so the right place to lock it is right after virDomainObjListRemove(). The only place where this would cause a problem is the autodestroy callback, so we need to get another reference there and unref+unlock it afterwards. Luckily, returning NULL from that function doesn't mean an error, and only means that it doesn't need to be unlocked anymore. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-03 16:59:20 +02:00
Cao jin	e7fef6d00e	There is no virDomainFindBy{ID, Name, UUID} anymore s/virDomainFindBy/virDomainObjListFindBy/ Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-08-03 13:08:00 +02:00
Luyao Huang	1439eb32af	qemu: fix some api cannot work when disable cpuset in conf If cpuset is disabled or not available, libvirt must not use it. This matters mainly for actions that do not need it and can use sched_setaffinity() or numa_membind() instead, because otherwise they fail without good reason. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1244664 Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-03 13:08:00 +02:00
Jiri Denemark	e8d0166e1d	qemu: Do not reset labels when migration fails When stopping a domain on the destination host after a failed migration, we need to avoid resetting security labels since the domain is still running on the source host. While we were correctly doing so in some cases, there were still some paths which did this wrong. https://bugzilla.redhat.com/show_bug.cgi?id=1242904 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-31 15:15:12 +02:00
Jiri Denemark	40a6dd9c16	qemu: Properly check for incoming migration job In addition to checking the current asynchronous job qemuMigrationJobIsActive reports an error if the current job does not match the one we asked for. Let's just check the job directly since we are not interested in the error in qemuProcessHandleMonitorEOF. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-31 15:15:12 +02:00
Peter Krempa	136f3de411	qemu: Reject migration with memory-hotplug if destination doesn't support it If destination libvirt doesn't support memory hotplug since all the support was introduced by adding new elements the destination would attempt to start qemu with an invalid configuration. The worse part is that qemu might hang in such situation. Fix this by sending a required migration feature called 'memory-hotplug' to the destination. If the destination doesn't recognize it it will fail the migration. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1248350	2015-07-30 16:44:02 +02:00
Erik Skultety	b2960501c7	qemu: Adjust VM id allocation Our atomic increment (virAtomicIntInc) uses (if available) gcc __sync_add_and_fetch builtin. In qemu driver though, we'd profit more from __sync_fetch_and_add builtin. To keep it simplistic, this patch adjusts qemu driver initialization rather than adding a new atomic increment macro.	2015-07-29 09:15:44 +02:00
Martin Kletzander	ba167186cf	qemu: Check for iotune_max support properly Commit `d506a51aeb` meant to check for QEMU_CAPS_DRIVE_IOTUNE_MAX, but checked for QEMU_CAPS_DRIVE_IOTUNE instead. That's clearly visible from the diff, but it got in. Because of that, we were supplying information unknown for QEMU if it wasn't new enough and we couldn't even properly handle the error, leading to "Unexpected error". Also iops_size came at the same time with all the other "_max" options, so check whether we're not setting that either if QEMU_CAPS_DRIVE_IOTUNE_MAX is not supported. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1224053 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-27 08:29:37 +02:00
Laine Stump	0726878297	qemu: reorganize loop in qemuDomainAssignPCIAddresses This loop occurs just after we've assured that all devices that require a PCI device have been assigned and all necessary PCI controllers have been added. It is the perfect place to add other potentially auto-generated PCI controller attributes that are dependent on the controller's PCI address (upcoming patch). There is a convenient loop through all controllers at the end of the function, but the patch to add new functionality will be cleaner if we first rearrange that loop a bit. Note that the loop originally was accessing info.addr.pci.bus prior to determining that the pci part of the object was valid. This isn't dangerous in any way, but seemed a bit ugly, so I fixed it.	2015-07-25 10:10:22 -04:00
Cao jin	c1c5eb6fad	fix typo in qemu_monitor Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-07-24 14:29:34 +02:00
Martin Kletzander	a5bdb8459a	Revert "qemu: Use heads parameter for QXL driver" This reverts commit `7b401c3bda`. Until libvirt is able to differentiate whether heads='1' is just a leftover from previous libvirt or whether that's added by user on purpose and also whether the domain was started with the support for qxl's max_outputs, we cannot incorporate this patch into the tree due to compatibility reasons. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-24 13:06:47 +02:00
Luyao Huang	704cf06a14	qemu: fix the error cover issue in SetMemoryParameters https://bugzilla.redhat.com/show_bug.cgi?id=1245476 We won't return the errno after commit `0d7f45ae`, and a clearer error will be set by the code in vircgroup*. Also, we will always report the error "Operation not permitted", because the return value is -1. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-22 11:02:17 +02:00
Peter Krempa	88f6c007c3	cgroup: Drop resource partition from virSystemdMakeScopeName The scope name, even according to our docs is "machine-$DRIVER\x2d$VMNAME.scope" virSystemdMakeScopeName would use the resource partition name instead of "machine-" if it was specified thus creating invalid scope paths. This makes libvirt drop cgroups for a VM that uses custom resource partition upon reconnecting since the detected scope name would not match the expected name generated by virSystemdMakeScopeName. The error is exposed by the following log entry: debug : virCgroupValidateMachineGroup:302 : Name 'machine-qemu\x2dtestvm.scope' for controller 'cpu' does not match 'testvm', 'testvm.libvirt-qemu' or 'machine-test-qemu\x2dtestvm.scope' for a "/machine/test" resource and "testvm" vm. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1238570	2015-07-22 07:12:56 +02:00
Peter Krempa	eae59247c5	qemu: Update state of block job to READY only if it actually is ready Few parts of the code looked at the current progress of the job and assumed that a two phase blockjob is in the _READY state as soon as the progress reached 100% (info.cur == info.end). In current versions of qemu this assumption is invalid and qemu exposes a new flag 'ready' in the query-block-jobs output that is set to true if the job is actually finished. This patch adds internal data handling for reading the 'ready' flag and acting appropriately as long as the flag is present. While this still doesn't fix the virsh client problem with two phase block jobs and the --pivot option, it at least improves the error message: $ virsh blockcommit --wait --verbose vm vda --base vda[1] --active --pivot Block commit: [100 %]error: failed to pivot job for disk vda error: internal error: unable to execute QEMU command 'block-job-complete': The active block job for device 'drive-virtio-disk0' cannot be completed to $ virsh blockcommit --wait --verbose VM vda --base vda[1] --active --pivot Block commit: [100 %]error: failed to pivot job for disk vda error: block copy still active: disk 'vda' not ready for pivot yet	2015-07-21 15:32:59 +02:00
Martin Kletzander	717c99f360	qemu: Reject updating unsupported disk information If one calls update-device with information that is not updatable, libvirt reports success even though no data were updated. The example used in the bug linked below uses updating device with <boot order='2'/> which, in my opinion, is a valid thing to request from user's perspective. Mainly since we properly error out if user wants to update such data on a network device for example. And since there are many things that might happen (update-device on disk basically knows just how to change removable media), check for what's changing and moreover, since the function might be usable in other drivers (updating only disk path is a valid possibility) let's abstract it for any two disks. We can't possibly check for everything since for many fields our code does not properly differentiate between default and unspecified values. Even though this could be changed, I don't feel like it's worth the complexity so it's not the aim of this patch. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1007228	2015-07-20 11:35:54 +02:00
Frediano Ziglio	7b401c3bda	qemu: Use heads parameter for QXL driver Allows specifying the maximum number of heads for the QXL driver. This can actually be a compatibility problem, as heads in the XML configuration was set by default to '1'. Signed-off-by: Frediano Ziglio <fziglio@redhat.com>	2015-07-20 10:35:18 +02:00
Boris Fiuczynski	d01b7c7854	qemu: Make virtio-9p-ccw the default for s390-ccw-virtio machines For s390-ccw-virtio machines the default bus type is set to ccw. Specifying an address element allows overriding the default. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Jason J. Herne <jjherne@us.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com>	2015-07-15 14:37:30 +02:00
Boris Fiuczynski	56f6de93b5	qemu: Support for virtio-9p-ccw Add support for the 9pfs virtio-ccw device that was recently added to qemu. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Jason J. Herne <jjherne@us.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com>	2015-07-15 14:37:30 +02:00
Michal Privoznik	cd043390ff	qemuMigrationRun: Don't leak @fd If we are migrating to an UNIX socket, we accept() a connection from qemu and use that FD to set up a tunnel. However, the FD is not closed as often as it should be. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-15 11:40:41 +02:00
Peter Krempa	c212e0c779	qemu: process: Improve update of maximum balloon state at startup In commit `641a145d73` I've added code that resets the balloon memory value to full size prior to resuming the vCPUs since the size certainly was not reduced at that point. Since qemuProcessStart is used also in code paths with already booted up guests (migration, save/restore) the assumption is not entirely true since the guest might already have been running before. This patch adds a function that queries the monitor rather than using the full size since a balloon event would not be reissued in case we are recovering a saved migration state. Additionally the new function is used also when reconnecting to a VM after libvirtd restart since we might have missed a few balloon events while libvirtd was not running.	2015-07-14 14:47:57 +02:00
Michal Privoznik	1cf25f6334	qemuDomainSetNumaParamsLive: Check for NUMA mode more wisely https://bugzilla.redhat.com/show_bug.cgi?id=1232663 In one of my previous patches (`bcd9a564`) I've tried to fix the problem that we blindly assumed strict NUMA mode for guests. This led to several problems like us pinning a domain onto a nodeset via libnuma along with CGroups. Once the nodeset was changed by user, well, it did not result in desired effect. See the original commit for more info. But, the commit I wrote had a bug: when NUMA parameters are changed on a running domain we require domain to be strictly pinned onto a nodeset. Due to a typo a condition was mis-evaluated. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-14 10:29:19 +02:00
John Ferlan	c71f0654fc	nodeinfo: Add sysfs_prefix to nodeGetMemoryStats Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00
John Ferlan	b97b30480d	nodeinfo: Add sysfs_prefix to nodeCapsInitNUMA Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_CPU_PATH which is a derivative of SYSFS_SYSTEM_PATH Use cpupath for nodeCapsInitNUMAFake and remove SYSFS_CPU_PATH	2015-07-13 15:59:32 -04:00
John Ferlan	29e4f2243f	nodeinfo: Add sysfs_prefix to nodeGetInfo Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00
John Ferlan	f1c6179f0d	nodeinfo: Add sysfs_prefix to nodeGetCPUMap Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00
John Ferlan	f1a43a0f91	nodeinfo: Add sysfs_prefix to nodeGetCPUCount Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00
Michal Privoznik	45cc2fca5c	qemuProcessHandleMigrationStatus: Update migration status more frequently After Jirka's migration patches libvirt is listening on migration events from qemu instead of actively polling on the monitor. There is, however, a little regression (introduced in `6d2edb6a42`). The problem is, the current status of migration job is updated in qemuProcessHandleMigrationStatus if and only if migration job was started. But eventually every asynchronous job may result in migration. Therefore, since this job is not strictly a migration job, internal state was not updated and later checks failed: virsh # save fedora22 /tmp/fedora22_ble.save error: Failed to save domain fedora22 to /tmp/fedora22_ble.save error: operation failed: domain save job: is not active Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-13 15:07:12 +02:00
Jiri Denemark	2e7cea2435	qemu: Use error from Finish instead of "unexpectedly failed" When QEMU exits on destination during migration, the source reports either success (if the failure happened at the very end) or unhelpful "unexpectedly failed" error message. However, the Finish API called on the destination may report a real error so let's use it instead of the generic one. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-10 11:47:13 +02:00
Jiri Denemark	44c42b564d	qemu: Don't report false error from MigrateFinish virDomainMigrateFinish* APIs were unfortunately designed to return the pointer to the domain on destination and NULL on error. This looks OK in normal cases but the same API is also called when we know migration failed and thus we expect Finish to return NULL even if it actually did all it was supposed to do without any error. The call is defined to return nonnull domain pointer over RPC, which means returning NULL will always result in an error being sent. If this was not in fact an error, the API itself wouldn't set anything to the thread local virError, which makes the RPC layer come up with its own "Library function returned error but did not set virError" error. This is quite confusing and also hard to detect by the caller. This patch adds a special error code which can be used to check that Finish successfully aborted migration. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-10 11:47:13 +02:00
Jiri Denemark	e68f395fcb	qemu: Remember incoming migration errors If QEMU fails during incoming migration, the domain disappears including a possibly useful error message read from QEMU log file. Let's remember the error in virQEMUDriver so that Finish can report more than just "no such domain". Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-10 11:47:13 +02:00
Jiri Denemark	04d5fb2e0a	qemu: Drop LFs at the end of error from QEMU log Libvirt's error messages do not end with a LF. However, when reading the error from QEMU log, we would read the LF from the log and keep it in the message. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:58:44 +02:00
Jiri Denemark	108a219f02	qemu: Log all arguments of qemuProcessStart Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:57:46 +02:00
Jiri Denemark	3409f5bc4e	qemu: Wait for migration events on domain condition Since we already support the MIGRATION event, we just need to make sure the domain condition is signalled whenever a p2p connection drops or the domain is paused due to IO error and we can avoid waking up every 50 ms to check whether something happened. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:57:30 +02:00
Jiri Denemark	6d2edb6a42	qemu: Update migration state according to MIGRATION event We don't need to call query-migrate every 50ms when we get the current migration state via MIGRATION event. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:53:35 +02:00
Jiri Denemark	66c95964a5	qemuDomainGetJobStatsInternal: Support migration events When QEMU supports migration events the qemuDomainJobInfo structure will no longer be updated with migration statistics. We have to enter a job and explicitly ask QEMU every time virDomainGetJob{Info,Stats} is called. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:44:07 +02:00
Jiri Denemark	3df4d2a45a	qemu: Enable migration events on QMP monitor Even if QEMU supports migration events it doesn't send them by default. We have to enable them by calling migrate-set-capabilities. Let's enable migration events every time we can and clear QEMU_CAPS_MIGRATION_EVENT in case migrate-set-capabilities does not support events. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:44:07 +02:00
Jiri Denemark	a60ee613c4	qemu_monitor: Wire up MIGRATION event Thanks to Juan's work QEMU finally emits an event whenever migration state changes. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-09 21:44:07 +02:00
Guido Günther	c6f9ddf909	qemu: don't use initialized ret in qemuRemoveSharedDevice This fixes CC qemu/libvirt_driver_qemu_impl_la-qemu_conf.lo qemu/qemu_conf.c: In function 'qemuRemoveSharedDevice': qemu/qemu_conf.c:1384:9: error: 'ret' may be used uninitialized in this function [-Werror=maybe-uninitialized]	2015-07-09 19:13:43 +02:00
Pavel Hrdina	28554080ec	qemu_hotplug: try harder to eject media Some guests lock the tray and QEMU eject command will simply fail to eject the media. But the guest OS can handle this attempt to eject the media and can unlock the tray and open it. In this case, we should try again to actually eject the media. If the first attempt fails to detect a tray_open we will fail with error, from monitor. If we receive that event, we know, that the guest properly reacted to the eject request, unlocked the tray and opened it. In this case, we need to run the command again to actually eject the media from the device. The reason to call it again is, that QEMU doesn't wait for the guest to react and report an error, that the tray is locked. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1147471 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-07-09 18:02:05 +02:00
Pavel Hrdina	256496e149	monitor: detect that eject fails because the tray is locked Modify the eject monitor functions to parse the return code and detect, whether the error contains "is locked" to report this type of failure to upper layers. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-07-09 18:02:05 +02:00
Pavel Hrdina	6b278f3ad6	virDomainObjSignal: drop this function There are multiple consumers for the domain condition and we should always wake them all. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-07-09 18:02:05 +02:00
Luyao Huang	b5c2245b0c	qemu: report error for non-existing disk in blockjobinfo Before: # virsh blockjob r7 vdc error: An error occurred, but the cause is unknown After: # virsh blockjob r7 vdc error: Disk 'vdc' not found in the domain Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1241355 Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-09 16:10:05 +02:00
John Ferlan	69b850fe2a	qemu: Fix integer/boolean logic in qemuSetUnprivSGIO Setting of 'val' is a boolean expression, so handle it that way and adjust the check/return logic to be clearer Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-07-09 06:48:05 -04:00
John Ferlan	931ac3cdb5	qemu: Refactor qemuSetUnprivSGIO return values Set to ret = -1 and prove otherwise, like usual Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-07-09 06:48:05 -04:00
John Ferlan	3830795318	qemu: Inline qemuGetHostdevPath Since a future patch will need the device path generated when adding a shared host device, remove qemuGetHostdevPath and inline the two calls into qemuAddSharedHostdev and qemuRemoveSharedHostdev Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-07-09 06:48:05 -04:00
John Ferlan	f6a5cbbfdc	qemu: Refactor qemuCheckSharedDisk to create qemuCheckUnprivSGIO Split out the current function in order to share the code with hostdev in a future patch. Failure to match the expected sgio value against what is stored will cause an error which the caller would need to handle since only the caller has the disk (or eventually hostdev) specific data in order to uniquely identify the disk in an error message. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-07-09 06:48:05 -04:00
John Ferlan	083cbe506b	qemu: Introduce qemuGetHostdevPath Introduce a convenience function to handle formulating the hostdev path Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-07-08 15:07:22 -04:00
John Ferlan	8c43258ed5	qemu: Introduce qemuIsSharedHostdev Add a single boolean function to handle whether the hostdev is shared or not. Use the new function for the qemu{Add\|Remove}SharedHostdev calls as well as qemuSetUnprivSGIO. NB: This third usage fixes a possible bug where if this feature is enabled at some time in the future and the shareable flag wasn't set, the sgio would have been erroneously set. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-07-08 15:07:22 -04:00
Luyao Huang	955d9bb8d0	qemu: report error when shmem has an invalid address If user passes an invalid address for shared memory device to qemu, neither libvirt nor qemu will report an error, but qemu will auto assign a pci address to the shared memory device. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-08 16:30:42 +02:00
Luyao Huang	2c2655744a	conf: use virDomainChrSourceDef to save server path As the backend of shmem server is a unix type chr device, save it in virDomainChrSourceDef, so we can reuse the existing code for chr device. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-08 16:30:42 +02:00
Luyao Huang	ffe96a1593	qemu: Refactor creation of shared memory device commandline Rename qemuBuildShmemDevCmd to qemuBuildShmemDevStr and change the return type so that it can be reused in the device hotplug code later. And split the chardev creation part in a new function qemuBuildShmemBackendStr for reuse in the device hotplug code later. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-08 16:30:42 +02:00
Luyao Huang	e9401342e1	qemu: Assign IDs for shared memory devices Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-08 16:30:42 +02:00
Luyao Huang	e309ea6658	qemu: Auto assign pci addresses for shared memory devices Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1165029 Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-08 16:30:42 +02:00
Ján Tomko	4edf01c92c	Explicitly format the isa-fdc controller for newer q35 machines Since QEMU commit ea96bc6 [1]: i386: drop FDC in pc-q35-2.4+ if neither it nor floppy drives are wanted the floppy controller is no longer implicit. Specify it explicitly on the command line if the machine type version is 2.4 or later. Note that libvirt's floppy drives do not result in QEMU implying the controller, because libvirt uses if=none instead of if=floppy. https://bugzilla.redhat.com/show_bug.cgi?id=1227880 [1] http://git.qemu.org/?p=qemu.git;a=commitdiff;h=ea96bc6	2015-07-08 15:35:35 +02:00
Ján Tomko	4ef21ec192	Separate isa-fdc options generation For the implicit controller, we set them via -global. Separating them will allow reuse for explicit fdc controller as well. No functional impact apart from one extra allocation.	2015-07-08 15:00:10 +02:00
Luyao Huang	0b3fcfb1ea	qemu: move the guest status check before agent config and status check When using the setvcpus command with the --guest option on an offline VM, we get this error: # virsh setvcpus test3 1 --guest error: Guest agent is not responding: QEMU guest agent is not connected However, when the guest is not running, the agent cannot be connected. In this case, reporting that the domain is not running is better than reporting that the agent is not connected. Move the guest status check earlier so the error points out that the guest state is wrong. Logically, too, a running VM is a basic requirement for using the agent; we cannot use the agent if the VM is not running. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-08 10:51:51 +02:00
Luyao Huang	3b2b4114da	qemu: remove deadcode in qemuDomain{HelperGetVcpus\|GetIOThreadsLive} We set the @hostcpus variable but never use it. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-08 10:23:37 +02:00
Pavel Hrdina	d28fefc66a	qemu_driver: live/config checks cleanup Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-07-03 15:30:33 +02:00
John Ferlan	ebd62ebaaa	qemu: Resolve Coverity DEADCODE Commit id 'f967e7a6' didn't place the closing parentheses quite right causing DEADCODE errors since the rc setting/comparison was wrong.	2015-07-01 06:28:12 -04:00
Peter Krempa	4b48ba4af5	conf: qemu: Taint VMs using custom device tree blob Using a custom device tree image may cause unexpected behavior in architectures that use this approach to detect platform devices. Since usually the device tree is generated by qemu and thus it's not normally used let's taint VMs using it to make it obvious as a possible source of problems.	2015-07-01 10:34:25 +02:00
Peter Krempa	91081979dd	qemu: Audit memory size with memory hotplug operations The memory device hot(un)plug was missing calls to the auditing code. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1226234	2015-07-01 10:19:54 +02:00
Peter Krempa	1a13677460	conf: audit: Audit physical memory size rather than balloon request Since the balloon driver does not guarantee that it returns memory to the host, using the value in the audit message is not a good idea. This patch removes auditing from updating the balloon size and reports the total physical size at startup.	2015-07-01 10:18:10 +02:00
Jiri Denemark	ffbafd4e88	qemu: Avoid using ".(null)" in UNIX socket path The code which generates paths for UNIX socket blindly used target name without checking if it was set. Thus for the following device XML <channel type='unix'> <source mode='bind'/> <target type='virtio'/> </channel> we would generate "/var/lib/libvirt/qemu/channel/target/NAME.(null)" path which works but is not really correct. Let's not use the ".target_name" suffix at all if target name is not set. https://bugzilla.redhat.com/show_bug.cgi?id=1226854 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-01 09:47:32 +02:00
Peter Krempa	18c9d1578b	qemu: agent: Don't automatically disable CPU0 via guest agent While CPU0 was made unpluggable in Linux a while ago it's not desirable to unplug it since some parts of the kernel (suspend-to-ram) still depend on it. This patch fixes the vCPU selection code in libvirt so that it will not be disabled.	2015-07-01 09:38:02 +02:00
Luyao Huang	91c9e4d920	qemu: End job even if exiting monitor after OpenGraphics(FD) fails Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-01 08:36:48 +02:00
Ján Tomko	224456fc4a	qemu: properly free addresses on non-serial chardev unplug The target type comparison in qemuDomainDetachChrDevice used the VIR_DOMAIN_CHR_SERIAL_TARGET_TYPE enum, so virtio-serial addresses were not freed properly for channel devices. Call qemuDomainReleaseDeviceAddress unconditionally and decide based on the address type instead of the target/device types.	2015-07-01 08:09:43 +02:00
Luyao Huang	f967e7a669	qemu: fix address allocation on chardev attach Also check the device type when deciding what type the address should be. Commit `9807c47` (aiming to fix another error in address allocation) only checked the target type, but its value is different for different device types. This resulted in an error when trying to attach a channel with target type 'virtio': error: Failed to attach device from channel-file.xml error: internal error: virtio serial device has invalid address type Make the logic for releasing the address dependent only on * the address type * whether it was allocated earlier to avoid copying the device and target type checks. https://bugzilla.redhat.com/show_bug.cgi?id=1230039 Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-07-01 08:09:43 +02:00
John Ferlan	0b32838394	qemu: Add missing on_crash lifecycle type https://bugzilla.redhat.com/show_bug.cgi?id=1201760 When the domain "<on_crash>coredump-destroy</on_crash>" is set, the domain wasn't being destroyed, rather it was being rebooted. Add VIR_DOMAIN_LIFECYCLE_CRASH_COREDUMP_DESTROY to the list of on_crash types that cause "-no-reboot" to be added to the qemu command line.	2015-06-30 11:32:50 -04:00
John Ferlan	5cd985221b	Use the correct symbol for 'onCrash' Although defined the same way, fortunately there hadn't been any deviation. Ensure any assignments to onCrash use VIR_DOMAIN_LIFECYCLE_CRASH_* defs and not VIR_DOMAIN_LIFECYCLE_* defs	2015-06-30 11:32:50 -04:00
Peter Krempa	15632a312e	qemu: event: Clean up VNC monitor handling Get rid of spice specific stuff from the handler func and save a few lines by reflowing the conditions.	2015-06-30 13:50:52 +02:00
Peter Krempa	8df888a532	qemu: event: Properly handle spice events Spice events have mostly similar information present in the event JSON but they differ in the name of the element containing the port. The JSON event also provides connection ID which might be useful in the future. This patch splits up the event parser code into two functions and the SPICE reimplements the event parsing with correct names and drops the VNC only stuff. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1236585	2015-06-30 13:50:45 +02:00
Peter Krempa	4e5d03957a	Revert "Introduce QEMU_CAPS_ARM_VIRT_PCI" The capability was not used up to the feature freeze. This reverts commit `7f3515b4bb`.	2015-06-30 13:31:18 +02:00
Peter Krempa	f1caa42777	qemu: Close the agent connection only on agent channel events processSerialChangedEvent processes events for all channels. Commit `2af51483` broke all agent interaction if a channel other than the agent closes since it did not check that the event actually originated from the guest agent channel. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1236924 Fixes up: https://bugzilla.redhat.com/show_bug.cgi?id=890648	2015-06-30 13:18:02 +02:00
Andrea Bolognani	bb66d93c11	Fix typo incomaptible -> incompatible	2015-06-30 12:26:25 +02:00
Jiri Denemark	365b454ed9	qemu: Fix assignment of the default spicevmc channel name Make sure we only assign the default spicevmc channel name to spicevmc virtio channels. Caused by commits `3269ee65` and `1133ee2b`, which moved the assignment from XML parsing code to QEMU but failed to keep the logic. https://bugzilla.redhat.com/show_bug.cgi?id=1179680 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-30 10:31:29 +02:00
John Ferlan	782355a711	qemu: Resolve Coverity RESOURCE_LEAK Commit id '15fa84acb' added the alias fetch, but forgot to free it.	2015-06-27 06:41:59 -04:00
Laine Stump	1e15be1bbc	qemu: always permit PCI devices to be manually assigned to a PCIe bus When support for the pcie-root and dmi-to-pci-bridge buses on a Q35 machinetype was added, I was concerned that even though qemu at the time allowed plugging a PCI device into a PCIe port, that it might not be supported in the future. To prevent painful backtracking in the possible future where this happened, I disallowed such connections except in a few specific cases requested by qemu developers (indicated in the code with the flag VIR_PCI_CONNECT_TYPE_EITHER_IF_CONFIG). Now that a couple years have passed, there is a clear message from qemu that there is no danger in allowing PCI devices to be plugged into PCIe ports. This patch eliminates VIR_PCI_CONNECT_TYPE_EITHER_IF_CONFIG and changes the code to always allow PCI->PCIe or PCIe->PCI connection *when the PCI address is specified in the config*. (For newly added devices that haven't yet been given a PCI address, the auto-placement still prefers using the correct type of bus).	2015-06-26 13:51:33 -04:00
Laine Stump	1074fc5061	qemu: refactor qemuBuildControllerDevStr to eliminate future duplicate code The PCI case of the switch statement in this function contains another switch statement with a case for each model. Currently every model except pci-root and pcie-root has a check for index > 0 (since only those two can have index==0), and the function should never be called for those two anyway. If we move the check for !pci[e]-root to the top of the pci case, then we can move the check for index > 0 out of the individual model cases. This will save repeating that check for the three new controller models about to be added.	2015-06-26 13:45:40 -04:00
Peter Krempa	78aefb5275	qemu: monitor: Remove qemuMonitorGetBlockExtent Now that qemuMonitorGetAllBlockStatsInfo collects also wr_highest_offset the whole function can be killed.	2015-06-26 16:41:24 +02:00
Peter Krempa	15fa84acbb	qemu: Refactor qemuDomainGetBlockInfo Change the code so that it queries the monitor when the VM is alive.	2015-06-26 16:41:24 +02:00
Peter Krempa	0d130a011c	qemu: monitor: Open-code retrieval of wr_highest_offset Instead of using qemuMonitorJSONDevGetBlockExtent (which I plan to remove later) extract the data in place. Additionally add a flag that will be set when the wr_highest_offset was extracted correctly so that callers can act according to that. The test case addition should help make sure that everything works.	2015-06-26 16:41:24 +02:00
Peter Krempa	21578858c7	qemu: monitor: Fix indentation in qemuMonitorJSONGetOneBlockStatsInfo	2015-06-26 16:05:10 +02:00
Peter Krempa	14062e6fe5	internal: Introduce virCheckNonEmptyStringArgGoto and reuse it The helper makes sure that strings passed to APIs are non-NULL and non-empty. This allows to drop some inlined checks where it does not make sense.	2015-06-26 16:05:10 +02:00
Michal Privoznik	70d75ffc79	qemuBuildMemoryBackendStr: Honour passed @pagesize So far the argument has not much meaning and was practically ignored. This is not good since when doing memory hotplug, the size of desired hugepage backing is passed in that argument. Taking closer look at the tests I'm fixing reveals the bug. For instance, while the following is in the test: <memory model='dimm'> <source> <nodemask>1-3</nodemask> <pagesize unit='KiB'>4096</pagesize> </source> <target> <size unit='KiB'>524287</size> <node>0</node> </target> <address type='dimm' slot='0' base='0x100000000'/> </memory> the generated commandline corresponding to this XML was: -object memory-backend-ram,id=memdimm0,size=536870912,\ host-nodes=1-3,policy=bind Have you noticed? Yes, memory-backend-ram! Nothing can be further away from the right answer. The hugepage backing is requested in the XML and we happily ignore it. This is just not right. It's memory-backend-file which should have been used: -object memory-backend-file,id=memdimm0,prealloc=yes,\ mem-path=/dev/hugepages4M/libvirt/qemu,size=536870912,\ host-nodes=1-3,policy=bind The problem is, that @pagesize passed to qemuBuildMemoryBackendStr (where this part of commandline is built) was ignored. The hugepage to back memory was searched only and only by NUMA nodes pinning. This works only for regular guest NUMA nodes. Then, I'm changing the hugepages size in the test XMLs too. This is simply because in the test suite we create dummy mount points just for 2M and 1G hugepages. And in the test 4M was requested. I'm sticking to 2M, but 1G should just work too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-26 09:23:06 +02:00
Michal Privoznik	f8e9deb1d4	qemuBuildMemoryBackendStr: Fix hugepages lookup process https://bugzilla.redhat.com/show_bug.cgi?id=1196644 This function constructs the backend (host facing) part of the memory device. At the beginning, the configured hugepages are searched to find the best match for given guest NUMA node. Configured hugepages can have a @nodeset attribute to specify on which guest NUMA nodes the hugepages backing should be used. There is, however, one 'corner case'. Users may just tell 'use hugepages to back all the nodes'. In other words: <memoryBacking> <hugepages/> </memoryBacking> <cpu> <numa> <cell id='0' cpus='0-1' memory='1024000' unit='KiB'/> </numa> </cpu> Our code fails in this case. Well, since there's no @nodeset (nor any <page/> child element to <hugepages/>) we fail to look up the default hugepage size to use. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-26 09:15:26 +02:00
Luyao Huang	09444724bc	qemu: Avoid removing persistent config if migration fails When migration fails in qemuMigrationPrepareAny, we unconditionally call qemuDomainRemoveInactive, which should only be called for transient domains. The check for !vm->persistent was accidentally removed by commit `540c339`. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-25 10:18:39 +02:00
Boris Fiuczynski	b831c5b801	Support for the new watchdog model diag288 This patch provides support for the new watchdog model "diag288". Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Daniel Hansel <daniel.hansel@linux.vnet.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com> Reviewed-by: Tony Krowiak <akrowiak@linux.vnet.ibm.com>	2015-06-24 15:26:31 +02:00
Boris Fiuczynski	1238dc29af	Support for a new watchdog action inject-nmi This patch provides support for a new watchdog action "inject-nmi" which allows to define an inject of a non-maskable interrupt into a guest. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Daniel Hansel <daniel.hansel@linux.vnet.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com> Reviewed-by: Tony Krowiak <akrowiak@linux.vnet.ibm.com>	2015-06-24 15:26:31 +02:00
Luyao Huang	898e76f0aa	Improve some errors for openconsole/channel Functions like virDomainOpenConsole() and virDomainOpenChannel() accept NULL as a dev_name parameter. Try using alias for the error message if dev_name is not specified. Before: error: internal error: character device <null> is not using a PTY After: error: internal error: character device serial0 is not using a PTY Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-06-24 10:11:56 +02:00
Eric Farman	22b8a61756	Convert SCSI logical unit from unsigned int to unsigned long long The SCSI Architecture Model defines a logical unit address as 64-bits in length, so change the field accordingly so that the entire value could be stored. Signed-off-by: Eric Farman <farman@linux.vnet.ibm.com>	2015-06-22 16:03:33 -04:00
Eric Farman	3b7983ad6d	Print SCSI address attributes bus, target, unit as unsigned integer The address elements are all unsigned integers, so we should use the appropriate print directive when printing it. Signed-off-by: Eric Farman <farman@linux.vnet.ibm.com>	2015-06-22 16:03:33 -04:00
Eric Blake	40783db3f9	qemu: simplify json parsing Rather than grabbing an arbitrary JSON value and then checking if it has the right type, we might as well request the correct type to begin with. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONIOProcessEvent) (qemuMonitorJSONCommandWithFd, qemuMonitorJSONHandleGraphics) (qemuMonitorJSONGetStatus, qemuMonitorJSONExtractCPUInfo) (qemuMonitorJSONGetVirtType, qemuMonitorJSONGetBalloonInfo) (qemuMonitorJSONGetMemoryStats) (qemuMonitorJSONDevGetBlockExtent) (qemuMonitorJSONGetOneBlockStatsInfo) (qemuMonitorJSONGetAllBlockStatsInfo) (qemuMonitorJSONBlockStatsUpdateCapacityOne) (qemuMonitorJSONBlockStatsUpdateCapacity) (qemuMonitorJSONGetBlockExtent) (qemuMonitorJSONGetMigrationStatusReply) (qemuMonitorJSONGetDumpGuestMemoryCapability) (qemuMonitorJSONAddFd, qemuMonitorJSONQueryRxFilterParse) (qemuMonitorJSONExtractChardevInfo) (qemuMonitorJSONDiskNameLookupOne) (qemuMonitorJSONDiskNameLookup) (qemuMonitorJSONGetAllBlockJobInfo) (qemuMonitorJSONBlockIoThrottleInfo, qemuMonitorJSONGetVersion) (qemuMonitorJSONGetMachines, qemuMonitorJSONGetCPUDefinitions) (qemuMonitorJSONGetCommands, qemuMonitorJSONGetEvents) (qemuMonitorJSONGetKVMState, qemuMonitorJSONGetObjectTypes) (qemuMonitorJSONGetObjectListPaths) (qemuMonitorJSONGetObjectProps, qemuMonitorJSONGetTargetArch) (qemuMonitorJSONGetMigrationCapabilities) (qemuMonitorJSONGetStringArray, qemuMonitorJSONAttachCharDev) (qemuMonitorJSONGetCPUx86Data, qemuMonitorJSONGetIOThreads) (qemuMonitorJSONGetMemoryDeviceInfo): Use shorter idioms. Signed-off-by: Eric Blake <eblake@redhat.com>	2015-06-22 10:38:39 -06:00
Michal Privoznik	50a9099262	virQEMUCapsComputeCmdFlags: Indent correctly There's a small formatting problem in the function. One line is not correctly indented. Fix this. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-22 10:59:06 +02:00
Peter Krempa	a08e796bba	qemu: caps: Fix syntax-check failure in version based capabilities A single-line 'if' body should not be encased in curly braces. Our syntax-check enforces it. Introduced in `7f3515b4bb`	2015-06-22 10:48:30 +02:00
Pavel Fedin	7f3515b4bb	Introduce QEMU_CAPS_ARM_VIRT_PCI This capability specifies that "virt" machine on ARM has PCI controller. Enabled when version is at least 2.3.0. Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-06-21 16:06:47 -04:00
Jiri Denemark	d823fa6f64	qemu: cancel drive mirrors when p2p connection breaks When a connection to the destination host during a p2p migration drops, we know we will have to cancel the migration; it doesn't make sense to waste resources by trying to finish the migration. We already do so after sending "migrate" command to QEMU and we should do it while waiting for drive mirrors to become ready too. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:19:49 +02:00
Jiri Denemark	d29c45587b	qemu: Refactor qemuMigrationWaitForCompletion Checking the status of all parts of migration and aborting it when something failed is a complex thing which makes the waiting loop hard to read. This patch moves all the checks into a separate function similarly to what was done for drive mirror loops. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:12 +02:00
Jiri Denemark	92b5bcccaa	qemu: Don't pass redundant job name around Instead of passing current job name to several functions which already know what the current job is we can generate the name where we actually need to use it. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:12 +02:00
Jiri Denemark	c1a7f199e8	qemu: Refactor qemuMigrationUpdateJobStatus Once we start waiting for migration events instead of polling query-migrate, priv->job.current will not be regularly updated anymore because we will get the current status directly from the events. Thus virDomainGetJob{Info,Stats} will have to query QEMU, but they can't just blindly update priv->job.current structure. This patch introduces qemuMigrationFetchJobStatus which just fills in a caller supplied structure and makes qemuMigrationUpdateJobStatus a tiny wrapper around it. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:12 +02:00
Jiri Denemark	30d94bcdec	qemu: Refactor qemuDomainGetJob{Info,Stats} Move common parts of qemuDomainGetJobInfo and qemuDomainGetJobStats into a separate API (qemuDomainGetJobStatsInternal). Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:12 +02:00
Jiri Denemark	2ad46e5b0e	qemu: Do not poll for spice migration status QEMU_CAPS_SEAMLESS_MIGRATION capability says QEMU supports SPICE_MIGRATE_COMPLETED event. Thus we can just drop all code which polls query-spice and replace it with waiting for the event. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:11 +02:00
Jiri Denemark	bfbbb78ca3	qemu_monitor: Wire up SPICE_MIGRATE_COMPLETED event Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:11 +02:00
Jiri Denemark	d814c70b3b	qemu: Use domain condition for asyncAbort To avoid polling for asyncAbort flag changes. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:11 +02:00
Jiri Denemark	e8f263e0d0	qemu: Cancel disk mirrors after libvirtd restart When libvirtd is restarted during migration, we properly cancel the ongoing migration (unless it managed to almost finish before the restart). But if we were also migrating storage using NBD, we would completely forget about the running disk mirrors. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:11 +02:00
Jiri Denemark	3a18bd2d7a	qemu: Refactor qemuMonitorBlockJobInfo "query-block-jobs" QMP command returns all running block jobs at once, while qemuMonitorBlockJobInfo would only report one. This is not very nice in case we need to check several block jobs. This patch refactors the monitor code to always parse all block jobs and store them in a hash. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:11 +02:00
Jiri Denemark	40cd0290dc	qemu: Make qemuMigrationCancelDriveMirror usable without async job We don't have an async job when reconnecting to existing domains after libvirtd restart. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:10 +02:00
Jiri Denemark	3a0f39bcd8	Pass domain object to private data formatter/parser So that they can format private data (e.g., disk private data) stored elsewhere in the domain object. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:10 +02:00
Jiri Denemark	b247d47f39	qemu: Don't mess with disk->mirrorState This patch reverts commit `76c61cdca2`. VIR_DOMAIN_DISK_MIRROR_STATE_ABORT says we asked for a block job to be aborted rather than saying it was aborted. Let's just use VIR_DOMAIN_DISK_MIRROR_STATE_NONE consistently whenever a block job finishes since no caller depends on VIR_DOMAIN_DISK_MIRROR_STATE_ABORT (anymore) to check whether a block job failed or it was cancelled. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:10 +02:00
Jiri Denemark	a9ba39a1a7	qemu: Abort migration early if disk mirror failed Abort migration as soon as we detect that some of the disk mirrors failed. There's no sense in trying to finish memory migration first. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:10 +02:00
Jiri Denemark	cebb110f73	qemu: Cancel storage migration in parallel Instead of cancelling disk mirrors sequentially, let's just call block-job-cancel for all migrating disks and then wait until all disappear. In case we cancel disk mirrors at the end of successful migration we also need to check all block jobs completed successfully. Otherwise we have to abort the migration. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:10 +02:00
Jiri Denemark	4172b96a3e	qemu: Use domain condition for synchronous block jobs By switching block jobs to use domain conditions, we can drop some pretty complicated code in NBD storage migration. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:15:10 +02:00
Jiri Denemark	39564891f8	qemu: Properly report failed migration Because we are polling we may detect some errors after we asked QEMU for migration status even though they occurred before. If this happens and QEMU reports migration completed successfully, we would happily report the migration succeeded even though we should have cancelled it because of the other error. In practice it is not a big issue now but it will become a much bigger issue once the check for storage migration status is moved inside the loop in qemuMigrationWaitForCompletion. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:13:16 +02:00
Jiri Denemark	e2cc0e667e	qemu: Introduce qemuBlockJobUpdate The wrapper is useful for calling qemuBlockJobEventProcess with the event details stored in disk's privateData, which is the most likely usage of qemuBlockJobEventProcess. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-19 15:13:16 +02:00
Pavel Boldin	93a19e283e	qemu: migration: selective block device migration https://bugzilla.redhat.com/show_bug.cgi?id=1203032 Implement a `migrate_disks' parameter for the QEMU driver. This multi-value parameter can be used to explicitly specify what block devices are to be migrated using the NBD server. Tunnelled migration using NBD is to be done. Signed-off-by: Pavel Boldin <pboldin@mirantis.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-18 16:46:09 +02:00
Michal Privoznik	cb7297c150	qemuMigrationDriveMirror: Force raw format for NBD When playing with disk migration lately, I've noticed this warning in domain logs: WARNING: Image format was not specified for 'nbd://masina:49153/drive-virtio-disk0' and probing guessed raw. Automatically detecting the format is dangerous for raw images, write operations on block 0 will be restricted. Specify the 'raw' format explicitly to remove the restrictions. So I started digging into qemu source code to see what has triggered the warning. I'd expect qemu to know formats of guest's disks since we tell them on command line. This led me to qmp_drive_mirror() where the following can be found: if (!has_format) { format = mode == NEW_IMAGE_MODE_EXISTING ? NULL : bs->drv->format_name; } So, format is automatically initialized from the disk iff mode != "existing". Unfortunately, in migration we are tied to use this mode (NBD doesn't support creating new images). Therefore the only way to avoid this warning is to pass format. The discussion on the mail-list [1] resulted in the code that always forces NBD export as "raw" format. [1] https://www.redhat.com/archives/libvir-list/2015-June/msg00153.html Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Signed-off-by: Pavel Boldin <pboldin@mirantis.com>	2015-06-18 16:46:09 +02:00
Michal Privoznik	9c5efd1afd	qemuMigrationBeginPhase: Fix function header indentation This function is returning a string (domain XML). Since `d3ce7363` when it was first introduced, it was indented incorrectly: static char *qemuMigrationBeginPhase(..) Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-18 16:46:09 +02:00
Peter Krempa	87837b487b	qemu: Fix double space in error message in qemuDomainGetVcpusFlags	2015-06-18 15:31:52 +02:00
Peter Krempa	99e4c1d6fd	qemu: Jump to correct label in qemuDomainPinIOThread If virDomainObjGetDefs used in qemuDomainPinIOThread would fail the code would jump to the 'cleanup' label after acquiring the job, thus the VM would be locked forever. Introduced in commit `cac6d639`.	2015-06-18 15:29:20 +02:00
Peter Krempa	0b416434f8	qemu: 'privileged' flag is not really configuration The privileged flag will not change while the configuration might change. Make the 'privileged' flag member of the driver again and mark it immutable. Should that ever change add an accessor that will group reads of the state.	2015-06-18 15:13:45 +02:00
Peter Krempa	58edccb4a5	qemu: Reuse virDomainObjGetDefs in qemuDomainGetMemoryParameters Simplify the code by restructuring control flow and reusing the better helper.	2015-06-18 15:13:45 +02:00
Peter Krempa	68ba0d8a89	qemu: Refactor qemuDomainGetMemoryParameters Replace the for loops with case inside with temp variables and a macro.	2015-06-18 15:13:45 +02:00
Peter Krempa	809ba20ebe	qemu: Refactor qemuDomainSetNumaParameters Use virDomainObjGetDefs and sanitize the control flow.	2015-06-18 15:13:45 +02:00
Peter Krempa	caa6cd39cc	qemu: Simplify qemuDomainSetInterfaceParameters by using virDomainObjGetDefs	2015-06-18 15:13:45 +02:00
Peter Krempa	875a731d21	qemu: Simplify qemuDomainGetVcpusFlags by using virDomainObjGetOneDef virDomainObjGetOneDef is simpler to use than virDomainObjGetDefs	2015-06-18 15:13:45 +02:00
Peter Krempa	f3a7006545	qemu: Simplify qemuDomainGetEmulatorPinInfo by using virDomainObjGetOneDef virDomainObjGetOneDef is simpler to use than virDomainObjGetDefs	2015-06-18 15:13:45 +02:00
Peter Krempa	2881c381a0	qemu: Simplify qemuDomainGetVcpuPinInfo by using virDomainObjGetOneDef virDomainObjGetOneDef is simpler to use than virDomainObjGetDefs	2015-06-18 15:13:45 +02:00
Peter Krempa	3f0fe2dcb2	qemu: Simplify qemuDomainGetNumaParameters by using virDomainObjGetOneDef	2015-06-18 15:13:45 +02:00
Peter Krempa	7d91a2d944	qemu: Simplify qemuDomainGetInterfaceParameters by using virDomainObjGetOneDef	2015-06-18 15:13:45 +02:00
James Cowgill	f486bb0494	qemu: implement address for isa-serial I needed to specify the iobase address for certain exotic mips configurations. Signed-off-by: James Cowgill <james410@cowgill.org.uk>	2015-06-18 08:17:20 -04:00
Luyao Huang	cb7e13ffbf	qemu: Add a check for slot and base dimm address conflicts When hotplugging a memory device, there wasn't a check to determine if there is a conflict with the address space being used by the to be added memory device and any existing device which is disallowed by qemu. This patch adds a check to ensure the new device address doesn't conflict with any existing device. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-06-18 08:08:42 -04:00
Ján Tomko	6fab625f96	remove redundant condition If the address type is SPAPRVIO, it will match the != NONE condition.	2015-06-18 12:13:00 +02:00
Michal Privoznik	a9a27e602c	virSysinfo: Introduce SMBIOS type 2 support https://bugzilla.redhat.com/show_bug.cgi?id=1220527 This type of information defines attributes of a system baseboard. With one exception: board type is not yet implemented in qemu so it's not introduced here either. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-18 10:10:26 +02:00
Jiri Denemark	beca509e43	qemu: Report all supported machine types in capabilities Some machine types are only reported as canonical names for other machine types, which make it a bit harder to find what machine types are supported by a specific QEMU binary. Ideally, one would just use /capabilities/guest/arch[@name='...']/machine/text() XPath to get a list of all supported machine types, but it doesn't work right now. For example, we report <machine canonical='pc-i440fx-2.3' maxCpus='255'>pc</machine> in guest capabilities, but the corresponding <machine maxCpus='255'>pc-i440fx-2.3</machine> is missing. This is a result of QMP probing. With "-machine ?" parsing QEMU sends us two lines: pc Standard PC (i440FX + PIIX, 1996) (alias of pc-i440fx-2.3) pc-i440fx-2.3 Standard PC (i440FX + PIIX, 1996) (default) while query-machines QMP command reports both in the same entry: {"name": "pc-i440fx-2.3", "is-default": true, "cpu-max": 255, "alias": "pc"} Let's make sure we always report separate <machine/> for both the canonical name and its alias and using the canonical name as the default machine type (i.e., inserting it before its alias) in case is-default is true. https://bugzilla.redhat.com/show_bug.cgi?id=1229666 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-18 09:49:33 +02:00
Boris Fiuczynski	840eca32ab	qemu: monitor: Add memory balloon support for virtio-ccw The search for the memory balloon driver object is extended by a second known name "virtio-balloon-ccw" in support for virtio-ccw. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Daniel Hansel <daniel.hansel@linux.vnet.ibm.com> Reviewed-by: Eric Farman <farman@linux.vnet.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com>	2015-06-15 17:05:32 +02:00
Peter Krempa	4ed2b9a7e4	qemu: emulatorpin: Don't reset pinning when pinning to all cpus Similarly to `a02a161bb8` remove the default pinning assumption from emulatorpin. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1227180	2015-06-15 16:18:49 +02:00
Ján Tomko	243bbcc5db	qemu caps: spell queue	2015-06-15 13:32:44 +02:00
John Ferlan	4fce9e8479	qemu: Do not support 'serial' scsi-block 'lun' devices https://bugzilla.redhat.com/show_bug.cgi?id=1021480 Seems the property has been deprecated for qemu, although seemingly ignored. This patch enforces from a libvirt perspective that a scsi-block 'lun' device should not provide the 'serial' property.	2015-06-15 07:30:29 -04:00
zhang bo	798461a1a3	qemu: update netdevs of the same mac addrs correctly If a guest has multiple network devices with the same MAC address, when we online update the second device, libvirtd always updates the first one. commit `def31e4c` forgot to fix the online updating scenario. We need to use virDomainNetFindIdx() to find the correct network device. Signed-off-by: Zhou Yimin <zhouyimin@huawei.com> Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com>	2015-06-12 13:04:39 -04:00
John Ferlan	785a8940ef	scsi: Need to translate disk source pool in config attach path https://bugzilla.redhat.com/show_bug.cgi?id=1228007 When attaching a scsi volume lun via the attach-device --config or --persistent options, there was no translation of the source pool like there was for the live path, thus the attempt to modify the config would fail since not enough was known about the disk.	2015-06-12 12:20:36 -04:00
Michal Privoznik	87c81cd5ee	qemuBuildDriveStr: s/virBufferEscapeString/virBufferAsprintf/ We are using it to print a value that can't be NULL and does not need any escaping anyway. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-12 16:44:24 +02:00
Michal Privoznik	c88bf57262	virQEMUCapsArch: openrisc vs or32 With a few exceptions, we assume that the qemu binary for a given architecture has the form qemu-system-$arch. Well, openrisc is yet another exception. Its binary is called qemu-system-or32. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-12 16:44:24 +02:00
Michal Privoznik	0b92974c15	virSysinfoDef: Exempt SYSTEM variables Move all the system_* fields into a separate struct. Not only does this simplify the code a bit, it also helps us to identify whether BIOS info is present. We don't have to check all the four variables for being not-NULL, but we can just check the pointer to the struct. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-12 10:42:39 +02:00
Michal Privoznik	3f9cae18fe	virSysinfoDef: Exempt BIOS variables Move all the bios_* fields into a separate struct. Not only does this simplify the code a bit, it also helps us to identify whether BIOS info is present. We don't have to check all the four variables for being not-NULL, but we can just check the pointer to the struct. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-06-12 10:42:34 +02:00
Maxime Leroy	366c22f2bc	qemu: add multiqueue vhost-user support This patch adds the support of queues attribute of the driver element for vhost-user interface type. Example: <interface type='vhostuser'> <mac address='52:54:00:ee:96:6d'/> <source type='unix' path='/tmp/vhost2.sock' mode='client'/> <model type='virtio'/> <driver queues='4'/> </interface> Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1207692 Signed-off-by: Maxime Leroy <maxime.leroy@6wind.com> Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-06-11 14:28:29 +02:00
Martin Kletzander	7971723b98	qemu: Add capability for vhost-user multiqueue The support for this was added in QEMU with commit 830d70db692e374b55555f4407f96a1ceefdcc97. Unfortunately we have to do another ugly version-based capability check. The other option would be not to check for the capability at all and leave that to qemu as it's done with multiqueue tap devices. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-06-11 14:18:41 +02:00
Daniel P. Berrange	d587704cc7	rpc: allow selection of TCP address family By default, getaddrinfo() will return addresses for both IPv4 and IPv6 if both protocols are enabled, and so the RPC code will listen/connect to both protocols too. There may be cases where it is desirable to restrict this to just one of the two protocols, so add an 'int family' parameter to all the TCP related APIs. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2015-06-11 12:11:18 +01:00
Andrea Bolognani	b378c0f10a	qemu: Capitalize "storage" in qemuDomainAttachUSBMassStorageDevice() Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-06-09 13:16:29 +02:00
Cole Robinson	daf2f51445	qemu: caps: Advertise arm 32-on-64 KVM option We need to use qemu-system-aarch64 to run armv7l KVM VMs on an aarch64 host.	2015-06-08 17:56:31 -04:00
Cole Robinson	667370d67e	qemu: caps: qemu-system-aarch64 supports armv7l And it always has, so advertise it similarly to i686	2015-06-08 17:51:06 -04:00
Cole Robinson	29ce1693fa	qemu: command: Support arm 32-on-64 KVM with -cpu aarch64=off qemu 2.3.0 added the -cpu host,aarch64=off option, which allows using qemu-system-aarch64 KVM to run armv7l VMs. Add a capabilities check for it, wire it up in qemu_command, and test the command line generation.	2015-06-08 17:51:06 -04:00
Ján Tomko	568aba8811	Turn qemuMonitorFindBalloonObjectPath into a void function We were effectively ignoring its errors anyway.	2015-06-05 16:19:21 +02:00
Ján Tomko	19c633c678	Do not access the domain definition in qemuMonitorFindBalloonObjectPath The monitor code does not hold the virDomainObjPtr lock and should not access the definition.	2015-06-05 16:19:07 +02:00
Ján Tomko	084ad13774	Only call SetMemoryStatsPeriod for virtio memballoon	2015-06-05 16:19:00 +02:00
Ján Tomko	bf42e6d848	Check for balloon model in qemuDomainSetMemoryStatsPeriod There's no point in calling the monitor if there is no balloon.	2015-06-05 16:17:49 +02:00
Ján Tomko	09ebc10fe1	Only call qemuMonitorGetMemoryStats for virtio memballoon There is nothing to get from the monitor for model='none'.	2015-06-05 16:06:05 +02:00
Ján Tomko	4bfc58b58d	Invert the condition in qemuDomainMemoryStats It only makes sense if qemuMonitorGetMemoryStats is called, but the following patch will make that call conditional.	2015-06-05 16:06:01 +02:00
Ján Tomko	4bcdd9cc9e	Add endjob label to qemuDomainMemoryStats Reduce the indentation level.	2015-06-05 16:05:44 +02:00
Ján Tomko	c09e1729df	Remove path argument from qemuMonitorJSONFindLinkPath All the callers use "/" anyway.	2015-06-05 16:01:23 +02:00
Ján Tomko	5aca919b6c	Introduce qemuMonitorJSONFindLinkPath When traversing through the QOM tree, we're looking for a link to a device, e.g.: link<virtio-balloon-pci> Introduce a helper that will format the link name at the start, instead of doing it every time while recursing through the tree.	2015-06-05 16:00:51 +02:00
Ján Tomko	88710cee75	Move qemuMonitorFindObjectPath to qemu_monitor_json This function is specific to the JSON monitor.	2015-06-05 15:55:22 +02:00
Peter Krempa	0f3e5325f5	qemu: Update balloon info only if job is allowed In qemuDomainUpdateCurrentMemorySize I misplaced the actual update of the balloon size to a place where it may not be initialized. Move it a few lines above.	2015-06-04 14:04:48 +02:00
Martin Kletzander	f833c70b85	qemu: Check for qemu capability when calling virDomainGetBlockIoTune() When getting block device I/O tuning data there is no check for whether QEMU supports such options and the call fails on qemuMonitorGetBlockIoThrottle() when getting the particular throttle data. So try reporting a better error when blkdeviotune is not supported. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1224053 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-06-04 11:30:41 +02:00
Peter Krempa	50a9d0e67d	qemu: Refactor qemuDomainSetVcpusFlags by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	09fc61264a	qemu: Refactor qemuDomainGetEmulatorPinInfo by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	6a34d41894	qemu: Refactor qemuDomainPinEmulator by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	582e8cb908	qemu: Refactor qemuDomainGetVcpuPinInfo by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	7721e7901f	qemu: Refactor qemuDomainPinVcpuFlags by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	8db9610f57	qemu: Refactor qemuDomainSetBlkioParameters by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	6200d86d60	qemu: Refactor qemuDomainChgIOThread by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	cac6d6396d	qemu: Refactor qemuDomainPinIOThread by reusing virDomainObjGetDefs	2015-06-04 10:52:32 +02:00
Peter Krempa	1282b350f3	qemu: Refactor qemuDomainGetIOThreadInfo by reusing virDomainObjGetDefs	2015-06-04 10:52:31 +02:00
Peter Krempa	0ad3d69e80	qemu: Refactor qemuDomainGetVcpusFlags by reusing virDomainObjGetDefs	2015-06-04 10:52:31 +02:00
Peter Krempa	10de2168e4	qemu: Refactor qemuDomainSetMemoryStatsPeriod by reusing virDomainObjGetDefs	2015-06-04 10:52:31 +02:00
Peter Krempa	65258d1f71	qemu: Refactor qemuDomainSetMemoryFlags by reusing virDomainObjGetDefs	2015-06-04 10:52:31 +02:00
Peter Krempa	a02a161bb8	qemu: libxl: vcpupin: Don't reset pinning when pinning to all pcpus In the pre-NUMA ages pinning a vCPU to all pCPUs was equal to deleting the pinning info. Now it does not entirely work that way. Pinning a vCPU to all pCPUs might be a desired operation. Additionally removal of the pinning will result in using the default pinning information at the next boot which might be different from all vcpus. This patch removes the false assumption that we should remove the pinning after pinning to all vCPUs and tweaks the documentation for virsh. A later patch will implement a new flag for the virDomainPinVcpuFlags API that will allow removing the pinning in a sane way.	2015-06-04 10:52:31 +02:00
Peter Krempa	fbbea79890	conf: Store cpu count as unsigned int While we probably won't see machines with more than 65536 cpus for a while, let's store the cpu count as an integer so that we can avoid quite a lot of overflow checks in our code.	2015-06-04 10:52:30 +02:00
Peter Krempa	d3889db071	qemu: Refactor qemuDomainGetInfo Since the returned structure uses "unsigned long" for memory sizes add a few overflow checks to notify the user in case we are not able to represent given values.	2015-06-04 10:52:30 +02:00
Peter Krempa	eaf4320869	qemu: Add helper to update domain balloon size and refactor usage places When qemu does not support the balloon event the current memory size needs to be queried. Since there are two places that implement the same logic, split it out into a function and reuse.	2015-06-04 10:52:30 +02:00
Peter Krempa	641a145d73	qemu: process: Update current balloon state to maximum on vm startup After libvirt issues the balloon resize command, the current balloon size needs to be changed to the maximum memory size since the vCPUs were not started and thus the balloon driver could not return the memory. Since GetXMLDesc and other APIs return the balloon size without updating it in case they are not able to obtain the job and the memory balloon does not support the asynchronous event the sizing might be incorrect.	2015-06-04 10:52:30 +02:00
Ján Tomko	12b949dfb2	maint: remove incorrect apostrophes from 'its'	2015-06-04 10:01:42 +02:00
Peter Krempa	825df8c315	util: process: Refactor and fix virProcessSetAffinity Refactor the function to return the bitmap instead of an integer and the inner workings so that they make more sense. This patch also fixes possible segfault on old systems that was introduced by commit: commit `f1a43a8e41` Author: Hu Tao <hutao@cn.fujitsu.com> Date: Fri Sep 14 15:46:59 2012 +0800 use virBitmap to store cpu affinity info	2015-06-03 14:20:25 +02:00
Peter Krempa	f4c67f0794	qemu: process: Refactor setup of memory ballooning Since the monitor code now supports ullongs when setting balloon size, drop the legacy code with overflow checking. Additionally the comment mentioning that the job is treated as a sync job does not make sense any more since the monitor is entered asynchronously.	2015-06-03 09:42:08 +02:00
Peter Krempa	987b70777a	qemu: monitor: Make qemuMonitorSetBalloon operate on unsigned long long	2015-06-03 09:42:08 +02:00
Peter Krempa	efe8b44a84	monitor: Move documentation for qemuMonitorGetBalloonInfo Document the top level function rather than both bottom level ones. It makes looking the docs up quicker.	2015-06-03 09:42:08 +02:00
Peter Krempa	9f8196ace2	qemu: Refactor qemuDomainHelperGetVcpus by reusing virBitmapToDataBuf Get rid of the unnecessary allocation and copying of the bitmap and clean up some unnecessary temporary variables.	2015-06-03 09:42:07 +02:00
Peter Krempa	f79bfd9b51	qemu: Reuse virBitmapToDataBuf in qemuDomainGetEmulatorPinInfo	2015-06-03 09:42:07 +02:00
Peter Krempa	b6d438e10c	qemu: Use virBitmapToDataBuf in qemuDomainGetVcpuPinInfo Reuse the function so that we can get rid of a lot of temporary allocations.	2015-06-03 09:42:07 +02:00
Peter Krempa	ee3da892f2	conf: Refactor emulatorpin handling Store the emulator pinning cpu mask as a pure virBitmap rather than the virDomainPinDef since it stores only the bitmap and refactor qemuDomainPinEmulator to do the same operations in a much saner way. As a side effect virDomainEmulatorPinAdd and virDomainEmulatorPinDel can be removed since they don't add any value.	2015-06-03 09:42:07 +02:00
Peter Krempa	ff4c42ed7a	qemu: Fix possible crash in qemuProcessSetVcpuAffinities In case when <vcpu ... cpuset=""> is not specified, the vcpupin array is not guaranteed to be allocated to def->vcpus. This would cause a crash for TCG since it does not report thread IDs for vCPUs.	2015-06-03 09:42:07 +02:00
Luyao Huang	0ed3b33535	qemu: Do not release device address on successful RNG attach Commit id '980b265d' neglected to check for a successful status when deciding whether to release the device address for the RNG attach thus the address would be released even though the device was added. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-06-02 14:03:48 -04:00
Luyao Huang	731d9cf251	qemu: Need to return status of RNG device removal Commit id '862473fa' neglected to return the status from the qemuDomainRemoveRNGDevice call in qemuDomainRemoveDevice causing the function to always fail when receiving an RNG device unplug event. Additionally the domain status/state would not be updated in the processDeviceDeletedEvent path. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-06-02 14:03:48 -04:00
Andrea Bolognani	778c56f000	qemu: Automatically add <panic> element for pSeries guests. The guest firmware provides the same functionality as the pvpanic device, and the relevant element should always be present in the domain XML to reflect this fact, so add it after parsing the definition if it wasn't there already.	2015-06-01 06:44:37 -04:00
Andrea Bolognani	7bd769e0ab	qemu: Allow panic device for pSeries guests The guest firmware provides the same functionality as the pvpanic device, which is not available in QEMU on pSeries, so the domain XML should be allowed to contain the <panic> element. On the other hand, unlike the pvpanic device, the guest firmware can't be configured, so report an error if an address has been provided in the XML. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1182388	2015-06-01 06:16:29 -04:00
Andrea Bolognani	b4ac4a4057	qemu: Improve error message for missing QEMU_CAPS_DEVICE_PANIC.	2015-06-01 06:16:23 -04:00
Ján Tomko	0a2581a110	Allocate priv->vioserialaddrs unconditionally When attempting to hotplug a virtio-serial console to a domain that had no virtio-serial controllers (not even those that are added by libvirt when some devices need them) at daemon startup, report a user-friendly error: error: Failed to attach device from console.xml error: internal error: no virtio-serial controllers are available instead of crashing the daemon: Process terminating with default action of signal 11 (SIGSEGV): dumping core Access not within mapped region at address 0x8 at 0x531028F: virDomainVirtioSerialAddrNext (domain_addr.c:916) by 0x531028F: virDomainVirtioSerialAddrAssign (domain_addr.c:1029) by 0x1CBF68: qemuDomainAttachChrDevice (qemu_hotplug.c:1565) by 0x1BCD5E: qemuDomainAttachDeviceLive (qemu_driver.c:7997) by 0x1BCD5E: qemuDomainAttachDeviceFlags (qemu_driver.c:8743) Introduced in v1.2.14-30-g5903378.	2015-05-29 15:26:25 +02:00
Andrea Bolognani	ceab3979de	qemu: Limit rtc-reset-reinjection requirement to x86 only. The QMP command, like the interrupt reinjection logic it's connected to, is only implemented in QEMU when TARGET_I386 is defined, so checking for its availability on any other architecture is pointless. On the other hand, when we're on x86, we should still make sure that rtc-reset-reinjection is available and refuse to set the time otherwise. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1211938	2015-05-27 16:59:25 +02:00
Peter Krempa	27fd559892	qemu: Fix compilation error when enum variable size differs from 'int' Since commit `bcd9a564b6` virDomainNumatuneGetMode returns the value via a pointer rather than in the return value. The change triggered problems with platforms where the compiler decides to use a data type of size different than integer at the point where we typecast it. Work around the issue by using an intermediate variable of the correct type that gets cast back by the default typecasting rules.	2015-05-27 09:06:40 +02:00
John Ferlan	a14eff3847	qemu: Add libvirt version check to refresh capabilities algorithm Rather than an algorithm based solely on libvirtd ctime to refresh the capabilities add the element of the libvirt build version into the equation. Since that version wouldn't be there prior to this code being run - don't fail on reading the capabilities if not found. In this case, the cache will always be rebuilt when a new libvirt version is installed.	2015-05-26 11:48:56 -04:00
John Ferlan	0b4211f905	qemu: Force capabilities cache refresh if libvirtd date is different https://bugzilla.redhat.com/show_bug.cgi?id=1195882 Original commit id 'cbde3589' indicates that the cache file would be discarded if either the QEMU binary or libvirtd 'ctime' changes; however, the code only discarded if the QEMU binary time didn't match or if the new libvirtd ctime was later than what created the cache file. Since many factors come into play with 'ctime' adjustments (including perhaps turning back the hands of time), change the logic to also force a refresh if the ctime of libvirt is different than what's in the cache.	2015-05-26 11:48:56 -04:00
John Ferlan	2f9f7b5fc7	qemu: Resolve Coverity RESOURCE_LEAK Recent changes to the -M/--machine processing code in qemuParseCommandLine caused Coverity to determine there was a possible resource leak with how the 'list' is managed. Rather than try to add virStringFreeList calls everywhere - just promote list to the top of the variables and free it within the error processing code. Also required a couple of other tweaks in order to avoid double frees.	2015-05-26 06:36:09 -04:00
Michal Privoznik	04695f48b2	qemuDomainDetachChrDevice: Fix chardev hot-unplug Not every chardev is plugged onto virtio-serial bus. However, the code introduced in `89e991a2aa` assumes that. Incorrectly. With previous patches we have three options where a chardev can be plugged: virtio-serial, USB and PCI. This commit fixes the detach part. However, since we are not auto allocating USB addresses yet, I'm just marking the place where appropriate code should go. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-21 17:49:02 +02:00
Michal Privoznik	9807c47147	qemuDomainAttachChrDevice: Fix chardev hotplug Not every chardev is plugged onto virtio-serial bus. However, the code introduced in `89e991a2aa` assumes that. Incorrectly. With previous patches we have three options where a chardev can be plugged: virtio-serial, USB and PCI. This commit fixes the attach part. However, since we are not auto allocating USB addresses yet, I'm just marking the place where appropriate code should go. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-21 17:49:02 +02:00
Michal Privoznik	8e33cb41f3	qemu: Implement pci-serial https://bugzilla.redhat.com/show_bug.cgi?id=998813 Implementation is pretty straight-forward. Of course, not all qemus out there support the device, so a new capability is introduced and checked prior to each use of the device. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-21 17:49:02 +02:00
Ján Tomko	886f43ad78	qemu: wire up virDomainSetUserPassword Base-64 encode the password and pass it to the guest agent via the 'guest-set-user-password' command. https://bugzilla.redhat.com/show_bug.cgi?id=1174177	2015-05-21 16:24:02 +02:00
Jiri Denemark	82cffb58a1	Use virDomainDiskByName where appropriate Most virDomainDiskIndexByName callers do not care about the index; what they really want is a disk def pointer. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-21 14:35:02 +02:00
Erik Skultety	fb0b9a2cc5	qemu: Log error if domain uses security driver which is not loaded When starting a domain, if a domain specifies security drivers we do not have loaded, we fail. However we don't check for this during reconnect, so any operation relying on security driver functionality would fail. If someone e.g. starts a domain with selinux driver loaded, then they change the security driver to 'none' in config, restart the daemon and call dump/save/.., QEMU will return an error. As we shouldn't kill the domain, we should at least log an error to let the user know that domain reconnect wasn't completely clean. https://bugzilla.redhat.com/show_bug.cgi?id=1183893	2015-05-21 12:33:52 +02:00
Michal Privoznik	bcd9a564b6	virDomainNumatuneGetMode: Report if numatune was defined So far, we are not reporting if numatune was even defined. The value of zero is blindly returned (which maps onto VIR_DOMAIN_NUMATUNE_MEM_STRICT). Unfortunately, we are making decisions based on this value. Instead, we should not only return the correct value, but report to the caller if the value is valid at all. For better viewing of this patch use '-w'. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-20 14:02:25 +02:00
John Ferlan	99a2d6af2b	Taint domains using cdrom-passthrough https://bugzilla.redhat.com/show_bug.cgi?id=976387 For a domain configured using the host cdrom, we should taint the domain due to problems encountered when the host and guest try to control the tray.	2015-05-20 07:29:13 -04:00
Martin Kletzander	9deb96f9f0	qemu: Fix numatune nodeset reporting Since `af2a1f0587`, qemuDomainGetNumaParameters() returns invalid value for a running guest. The problem is that it is getting the information from cgroups, but the parent cgroup is being left alone since the mentioned commit. Since the running guest's XML is in sync with cgroups, there is no need to look into cgroups (unless someone changes the configuration behind libvirt's back). Returning the info from the definition fixes a bug and is also a cleanup. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1221047 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-05-18 15:22:23 -07:00
Laine Stump	ffc40b63b5	conf: make virNodeDevCapData an official type For some reason a union (_virNodeDevCapData) that had only been declared inside the toplevel struct virNodeDevCapsDef was being used as an argument to functions all over the place. Since it was only a union, the "type" attribute wasn't necessarily sent with it. While this works, it just seems wrong. This patch creates a toplevel typedef for virNodeDevCapData and virNodeDevCapDataPtr, making it a struct that has the type attribute as a member, along with an anonymous union of everything that used to be in union _virNodeDevCapData. This way we only have to change the following: s/union _virNodeDevCapData */virNodeDevCapDataPtr / and s/caps->type/caps->data.type/ This will make me feel less guilty when adding functions that need a pointer to one of these.	2015-05-18 10:22:20 -04:00
Tony Krowiak	740c83f5b5	libvirt: qemu: enable/disable protected key management ops Introduces two new -machine option parameters to the QEMU command to enable/disable the CPACF protected key management operations for a guest: aes-key-wrap='on\|off' dea-key-wrap='on\|off' The QEMU code maps the corresponding domain configuration elements to the QEMU -machine option parameters to create the QEMU command: <cipher name='aes' state='on'> --> aes-key-wrap=on <cipher name='aes' state='off'> --> aes-key-wrap=off <cipher name='dea' state='on'> --> dea-key-wrap=on <cipher name='dea' state='off'> --> dea-key-wrap=off Signed-off-by: Tony Krowiak <akrowiak@linux.vnet.ibm.com> Signed-off-by: Daniel Hansel <daniel.hansel@linux.vnet.ibm.com> Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-18 09:54:16 +02:00
Laine Stump	eadd757cce	qemu: log error when domain has an unsupported IDE controller We have previously effectively ignored all <controller type='ide'> elements in a domain definition. On the i440fx-based machinetypes there is an IDE controller that is included in the chipset and can't be removed (which is the ide controller with index='0'), so it makes sense to ignore that one controller. However, if an i440fx domain definition has a 2nd controller, nothing catches this error (unless you also have a disk attached to it, in which case qemu will complain that you're trying to use the ide controller named "ide1", which doesn't exist), and if any other type of domain has even a single controller defined, it will be incorrectly ignored. Ignoring a bogus controller definition isn't such a big problem, as long as an error is logged when any disk is attached to that non-existent controller. But in the case of q35-based machinetypes, the hardcoded id ("alias" in libvirt terms) of its builtin SATA controller is "ide", which happens to be the same id as the builtin IDE controller on i440fx machinetypes. So libvirt creates a commandline believing that it is connecting the disk to the builtin (but actually nonexistent) IDE controller, qemu thinks that libvirt wanted that disk connected to the builtin SATA controller, and everybody is happy. Until you try to connect a 2nd disk to the IDE controller. Then qemu will complain that you're trying to set unit=1 on a controller that requires unit=0 (SATA controllers are organized differently than IDE controllers). After this patch, if a domain has an IDE controller defined for a machinetype that has no IDE controllers, libvirt will log an error about the controller itself as it is building the qemu commandline (rather than a (possible) error from qemu about disks attached to that controller). This is done by adding IDE to the list of controller types that are handled in the loop that creates controller command strings in qemuBuildCommandline() (previously it would always skip IDE controllers). Then qemuBuildControllerDevStr() is modified to log an appropriate error in the case of IDE controllers. In the future, if we add support for extra IDE controllers (piix3-ide and/or piix4-ide) we can just add it into the IDE case in qemuBuildControllerDevStr(). For now, nobody seems anxious to add extra support for an aging and very slow controller, when there are so many better options available. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1176071 (Fedora)	2015-05-15 15:40:43 -04:00
Laine Stump	b8f345b486	qemu: clean up qemuBuildCommandline loop that builds controller args Reorganize the loop that builds controller args to remove unnecessary duplicated code and superfluous else clauses. No functional change.	2015-05-15 15:38:00 -04:00
Laine Stump	0260506c65	qemu: use controller alias when constructing device/controller args This makes sure that the commandlines generated for devices and controller devices are all using the alias that has been set in the controller's object as the id of the controller, rather than hardcoding a printf (or worse, encoding exceptions to the standard ${controller}${index} into the logic). Since this "fixes" the controller name used for the sata controller, the commandline arg for the sata controller in the sata test case had to be adjusted to be "sata0" instead of "ahci0". All other tests remain unchanged, verifying that the patch causes no other functional change. Because the function that finds a controller alias based on a device def requires a pointer to the full domainDef in order to get the list of controllers, the arglist of a few functions had to have this added.	2015-05-15 15:36:28 -04:00
Laine Stump	75cd7d9b05	qemu: fix exceptions in qemuAssignDeviceControllerAlias There are a few extra exceptions that weren't being accounted for when creating the alias for a controller. This resulted in 1) incorrect status XML, and 2) exceptions/printfs of what should have been directly available in the controller alias when constructing device commandline arguments: 1) The primary (and only) IDE controller on a 440FX machinetype is hardcoded to be "ide" in qemu. 2) The primary SATA controller on a 440FX machinetype is also hardcoded to be "ide" in qemu. 3) On machinetypes that don't support multiple PCI buses, the PCI bus is hardcoded in qemu to have the name "pci". 4) The first usb master controller is "usb", all others are the normal "usb%d". (note that usb controllers that are not a "master" will have the same index, and thus alias, as the master). We needed to pass in the full domainDef and qemuCaps in order to properly make the decisions about these exceptions.	2015-05-15 15:36:21 -04:00
Jiri Denemark	a692277873	qemu: Don't give up on first error in qemuMigrationCancelDriverMirror When cancelling drive mirror, always try to do that for all disks even if it fails for some of them. Report the first error we saw. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-15 08:05:31 +02:00
Jiri Denemark	5139924b8d	qemu: Keep track of what disks are being migrated Instead of redoing the same filtering over and over every time we need to walk through all disks which are being migrated. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-15 08:05:31 +02:00
Jiri Denemark	46a7a49535	Move QEMU-only fields from virDomainDiskDef into privateData Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-15 08:05:31 +02:00
Jiri Denemark	078717e151	Rename virDomainHasBlockjob as qemuDomainHasBlockjob And move it to qemu_domain.[ch] because this API is QEMU-only. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-15 08:05:26 +02:00
Jiri Denemark	890fa6a055	Add privateData to virDomainDiskDef Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-15 08:04:26 +02:00
John Ferlan	361801362f	qemu: Clear auto fill flag when pinning iothread https://bugzilla.redhat.com/show_bug.cgi?id=1218577 Treat pinning an IOThread via API as if someone added an IOThread to ensure the iothreadid doesn't cause the guest to disappear	2015-05-14 08:36:34 -04:00
zhang bo	7eb5b4bf6f	qemuMigrationPrepareAny: Drop useless variable @now As of `eeb008dbfc` the variable is not used anymore. Drop it. Signed-off-by: Wang Yufei <james.wangyufei@huawei.com> Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-13 16:50:20 +02:00
Ján Tomko	083c33b324	Assign virtio-serial addresses after coldplugging a device Otherwise we might allow coldplugging a device that uses an address that is already occupied, creating an unstartable domain. https://bugzilla.redhat.com/show_bug.cgi?id=1220195	2015-05-13 12:20:31 +02:00
Pavel Hrdina	afaffeb873	qemu: vnc: error out for invalid port number In the XML we have the VNC port number, but on the command line QEMU takes a VNC screen number, which is port-5900. We should fail with an error message saying that only ports in the range [5900,65535] are valid. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1164966 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-05-13 10:24:36 +02:00
Luyao Huang	5f6fe84d57	qemu: fix double free when RNG cold-plug fails https://bugzilla.redhat.com/show_bug.cgi?id=1220809 When cold-plugging an RNG device but something fails in qemuDomainAssignAddresses, we will double free the RNG device. Once a device is plugged into the domain, we should set the device pointer to NULL to fix this issue. ... 5 0x00007fb7d180ac8a in virFree at util/viralloc.c:582 6 0x00007fb7d1895cdd in virDomainRNGDefFree at conf/domain_conf.c:19786 7 0x00007fb7d1895d99 in virDomainDeviceDefFree at conf/domain_conf.c:2022 8 0x00007fb7b92b8baf in qemuDomainAttachDeviceFlags at qemu/qemu_driver.c:8785 9 0x00007fb7d190c5d7 in virDomainAttachDeviceFlags at libvirt-domain.c:8488 10 0x00007fb7d23af9d2 in remoteDispatchDomainAttachDeviceFlags at remote_dispatch.h:2842 ... Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-05-12 17:09:14 +02:00
Laine Stump	e27c5c8fcb	qemu: eliminate duplicated code in qemuBuildDriveDevStr() The code to add device type to the commandline was identical for lsi and other models of SCSI controllers, but was duplicated (with the exception of a minor ordering difference of the if-else clauses) for the two cases. This patch replaces those two with a single instance of the code just before the if().	2015-05-11 16:56:26 -04:00
Laine Stump	da558e72c4	qemu: use qemuDomainMachineIsI440FX() in appropriate place This patch makes qemuValidateDevicePCISlotsChipsets() more consistent in appearance by replacing several clauses of an if with the equivalent call to qemuDomainMachineIsI440FX. The if was checking exactly the same items, just in a slightly different order.	2015-05-11 16:49:47 -04:00
Peter Krempa	2f37362e44	qemu: Fix balloon size handling with memory hot(un)plug Since libvirt doesn't call to update the new balloon size in qemu add code that will handle tweaking of the size of the current balloon statistic until qemu reports the new size using the event.	2015-05-11 08:50:36 +02:00
Peter Krempa	85d8ede9eb	qemu: Convert qemuConnectGetAllDomainStats to use new helpers Use the new domain list collection helpers to avoid going through virDomainPtrs. This additionally implements filter capability when called through the api that accepts domain list filters.	2015-05-11 08:47:42 +02:00
Peter Krempa	a5e89ae16e	util: Make the virDomainListFree helper more universal Extend it to a universal helper used for clearing lists of any objects. Note that the argument type is specifically void * to allow implicit typecasting. Additionally add a helper that works on non-NULL terminated arrays once we know the length.	2015-05-11 08:28:53 +02:00
Michal Privoznik	2af51483cc	processSerialChangedEvent: Close agent monitor early https://bugzilla.redhat.com/show_bug.cgi?id=890648 So, imagine you've issued an API that involves guest agent. For instance, you want to query guest's IP addresses. So the API acquires QUERY_JOB, locks the guest agent and issues the agent command. However, for some reason, guest agent replies to initial ping correctly, but then crashes tragically while executing real command (in this case guest-network-get-interfaces). Since initial ping went well, libvirt thinks guest agent is accessible and awaits reply to the real command. But it will never come. What will is a monitor event. Our handler (processSerialChangedEvent) will try to acquire MODIFY_JOB, which will fail obviously because the other thread that's executing the API already holds a job. So the event handler exits early, and the QUERY_JOB is never released nor ended. The way how to solve this is to put flag somewhere in the monitor internals. The flag is called @running and agent commands are issued iff the flag is set. The flag itself is set when we connect to the agent socket. And unset whenever we see DISCONNECT event from the agent. Moreover, we must wake up all the threads waiting for the agent. This is done by signalizing the condition they're waiting on. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-07 11:31:17 +02:00
Michal Privoznik	21e8fc36c6	qemuDomainShutdownFlags: check for domain activeness prior to guest presence Running shutdown with mode agent on a shutoff domain gives cryptic error message: virsh # shutdown --mode agent gentoo error: Failed to shutdown domain gentoo error: Guest agent is not responding: QEMU guest agent is not connected After this patch, the error is more clear: virsh # shutdown --mode agent gentoo error: Failed to shutdown domain gentoo error: Requested operation is not valid: domain is not running Reported-by: Martin Kletzander <mkletzan@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-06 18:03:04 +02:00
Boris Fiuczynski	808e771e83	qemu: multiqueue for ccw devices Allow ccw devices to be used with multiqueues. ccw provides a one to one relation of fds to queues and does not support the vectors option. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Matthew Rosato <mjrosato@linux.vnet.ibm.com> Reviewed-by: Daniel Hansel <daniel.hansel@linux.vnet.ibm.com> Reviewed-by: Cornelia Huck <cornelia.huck@de.ibm.com>	2015-05-06 11:42:42 -04:00
John Ferlan	b8e60f00d8	qemu: Resolve Coverity FORWARD_NULL Coverity points out that qemuMonitorGetAllBlockStatsInfo could return a -1 and thus not fill in 'stats' (leaving it NULL). Then the call to qemuMonitorBlockStatsUpdateCapacity will dereference it.	2015-05-05 20:02:37 -04:00
John Ferlan	3e4ce35926	qemu: Resolve Coverity FORWARD_NULL Coverity complains over the [n]values pairing in virQEMUCapsFreeStringList and rather than make a bunch of "if values" checks prior to calling, by just adding the values check inside the free function we avoid the chance that somehow nvalues is > 0, while values == NULL	2015-05-05 20:02:37 -04:00
John Ferlan	e7664eedaa	qemu: Resolve Coverity FORWARD_NULL Coverity points out it was possible to have a zero return from qemuBuildRNGBackendProps thus not filling in 'props' and then causing a NULL dereference on the next call.	2015-05-05 20:02:37 -04:00
John Ferlan	75dfbb85c7	qemu: Resolve Coverity FORWARD_NULL Coverity notes that ->ifname is used after the VIR_FREE done in the code path after the call to virNetDevMacVLanDeleteWithVPortProfile by a call to virNetDevOpenvswitchRemovePort. Since the ->ifname will be VIR_FREE()'d eventually in virDomainNetDefFree just remove the extraneous VIR_FREE here. When originally added, the Openvswitch code wasn't present and checks were made for non NULL prior to use.	2015-05-05 20:02:36 -04:00
John Ferlan	9ad32e5052	qemu: Resolve Coverity IDENTICAL_BRANCHES Coverity complains that in the error paths both the < 0 condition and the success path after the qemuDomainObjExitMonitor failure will end up going to cleanup. So just use ignore_value in this error path to resolve the complaint.	2015-05-05 20:02:36 -04:00
Michal Privoznik	608c95c76c	qemu: Implement GIC The only version that's supported in QEMU is version 2, currently. Fortunately, it is enabled by aarch64 automatically, so there's nothing for us that needs to be put onto command line. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-05-05 09:45:52 +02:00
Jiri Denemark	fc3601a308	qemu: Properly rename persistent def after migration When migrating a domain while changing its name and using VIR_MIGRATE_PERSIST_DEST flag, libvirt would fail to properly change the name in the persistent definition. The inconsistency results in weird behavior when dumping domain XML, destroying the domain, restarting libvirtd and likely in several other situations. Since the new name is already stored in vm->def->name, we just need to make sure the persistent definition uses this new name too. https://bugzilla.redhat.com/show_bug.cgi?id=1076354 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-04 22:59:51 +02:00
Jiri Denemark	b45ec56f58	qemu: Forbid unsupported parameters for tunnelled migration Neither migrate URI nor listen address make any sense for tunnelled migration. https://bugzilla.redhat.com/show_bug.cgi?id=1066375 https://bugzilla.redhat.com/show_bug.cgi?id=1073233 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-05-04 15:06:33 +02:00
Marc-André Lureau	7d3dc7a084	qemu: add machine vmport argument Fill qemu command line vmport argument as required.	2015-05-04 13:19:38 +02:00
Marc-André Lureau	f5a5f2b7d4	qemu: add virQEMUCapsSupportsVmport The vmport machine argument works with pc machine kind, not with xen for example.	2015-05-04 13:19:38 +02:00
Marc-André Lureau	46ae6b7fc7	qemu: move qemuDomainMachineIs{I440FX,Q35} Move common functions being used by the following virQEMUCapsSupportsVmport commit.	2015-05-04 13:19:38 +02:00
Marc-André Lureau	384a06f5f2	qemu: add QEMU_CAPS_MACHINE_VMPORT_OPT Set the capability based on qmp query, or qemu version. The qmp query includes vmport with 2.2, but no longer with 2.3. It lists only non-machine specific capabilities, so check the qemu version too until a machine-specific query is supported.	2015-05-04 13:19:38 +02:00
Luyao Huang	3f7cad7a8f	qemu: fix the vm deadlock when deleting a nonexistent iothread https://bugzilla.redhat.com/show_bug.cgi?id=1218145 Jump to endjob instead of cleanup to fix this deadlock. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-05-04 12:18:55 +02:00
Pavel Hrdina	28ca8520bb	qemu: use new macros for setvcpus to check flags and cleanup the code Now that we have macros for exclusive flags and flag requirements we can use them to cleanup the code for setvcpus and error out for all wrong flag combination. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-05-04 09:20:01 +02:00
Pavel Hrdina	6e3f9cbc9c	use new macro helpers to check flag requirements Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-05-04 09:20:01 +02:00
John Ferlan	63a368012d	qemu: Fix bus and lun checks when scsi-disk.channel not present Found by Laine and discussed a bit on internal IRC. Commit id `c56fe7f1d6` added support for creating a command line to support scsi-disk.channel. Series was here: http://www.redhat.com/archives/libvir-list/2012-February/msg01052.html Which pointed to a design proposal here: http://permalink.gmane.org/gmane.comp.emulators.libvirt/50428 Which states (in part): Libvirt should check for the QEMU "scsi-disk.channel" property. If it is unavailable, QEMU will only support channel=lun=0 and 0<=target<=7. However, the check added was ensuring that bus != lun and bus != 0. So if bus == lun and both were non zero, we'd never make the second check. Changing this to an or check fixes the check, but is still less readable than just checking each for 0	2015-04-30 16:21:38 -04:00
Peter Krempa	f06d7daaa9	qemu: blockjob: Call qemuDomainSupportsBlockJobs only on online VMs Since the qemu capabilities are not initialized for offline VMs the caller might get suboptimal error message: $ virsh blockjob VM PATH --bandwidth 1 error: unsupported configuration: block jobs not supported with this QEMU binary Move the checks after we make sure that the VM is alive.	2015-04-30 16:46:42 +02:00
Jiri Denemark	6280294574	qemu: Check address type for USB disks Only USB addresses are allowed for USB disks. Report an error if another address is configured. https://bugzilla.redhat.com/show_bug.cgi?id=1043436 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-04-30 15:34:57 +02:00
Michael Chapman	99725f946c	qemu: migration: use sync block job helpers In qemuMigrationDriveMirror we can start all disk mirrors in parallel. We wait until they are all ready, or one of them aborts. In qemuMigrationCancelDriveMirror, we wait until all mirrors are properly stopped. This is necessary to ensure that destination VM is fully in sync with the (paused) source VM. If a drive mirror can not be cancelled, then the destination is not in a consistent state. In this case it is not safe to continue with the migration. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-29 13:11:42 +02:00
Michael Chapman	1e106fee57	qemuDomainBlockJobAbort: use sync block job helpers The !modern code path needs to call qemuBlockJobEventProcess directly. The modern code path will call it via qemuBlockJobSyncWait. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-29 13:11:42 +02:00
Michael Chapman	1ec03c8772	qemuProcessStop: wake up pending sync block jobs Other threads may be blocked in qemuBlockJobSyncWait. Ensure that they're woken up when the domain is stopped. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-29 13:11:42 +02:00
Michael Chapman	89a5e25d05	qemuBlockJobSync: introduce sync block job helpers qemuBlockJobSyncBegin and qemuBlockJobSyncEnd delimit a region of code where block job events are processed "synchronously". qemuBlockJobSyncWait and qemuBlockJobSyncWaitWithTimeout wait for an event generated by a block job. The Wait functions may be called multiple times while the synchronous block job is active. Any pending block job event will be processed only when Wait* or End is called. disk->blockJobStatus is reset by these functions, so if it is needed a pointer to a virConnectDomainEventBlockJobStatus variable should be passed as the last argument. It is safe to pass NULL if you do not care about the block job status. All functions assume the VM object is locked. The Wait* functions will unlock the object for as long as they are waiting. They will return -1 and report an error if the domain exits before an event is received. Typical use is as follows: virQEMUDriverPtr driver; virDomainObjPtr vm; /* locked */ virDomainDiskDefPtr disk; virConnectDomainEventBlockJobStatus status; qemuBlockJobSyncBegin(disk); ... start block job ... if (qemuBlockJobSyncWait(driver, vm, disk, &status) < 0) { /* domain died while waiting for event */ ret = -1; goto error; } ... possibly start other block jobs or wait for further events ... qemuBlockJobSyncEnd(driver, vm, disk, NULL); To perform other tasks periodically while waiting for an event: virQEMUDriverPtr driver; virDomainObjPtr vm; /* locked */ virDomainDiskDefPtr disk; virConnectDomainEventBlockJobStatus status; unsigned long long timeout = 500 * 1000ull; /* milliseconds */ qemuBlockJobSyncBegin(disk); ... start block job ... do { ... do other task ... if (qemuBlockJobSyncWaitWithTimeout(driver, vm, disk, timeout, &status) < 0) { /* domain died while waiting for event */ ret = -1; goto error; } } while (status == -1); qemuBlockJobSyncEnd(driver, vm, disk, NULL); Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-29 13:11:42 +02:00
Michael Chapman	206dbf3f0a	qemuBlockJobEventProcess: move to new source file We will want to use synchronous block jobs from qemu_migration as well, so split this function out into a new source file. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-29 13:11:42 +02:00
Peter Krempa	a83b2e253f	qemu: Validate available slot count for memory devices While qemu would reject the configuration we can check whether it makes sense to plug the device upfront.	2015-04-29 09:40:16 +02:00
Peter Krempa	6705d828fc	qemu: command: Validate that memory devices slot ID is in range slot id, if specified, has to be less than the slots count.	2015-04-29 09:40:16 +02:00
Peter Krempa	ebe0bd5590	qemu: blockCopy: Allow reuse of raw image for shallow block copy The documentation states that for shallow block copy the image has to have the same guest visible content as backing file of the current image if the file is being reused. This condition can be achieved also with a raw file (or a qcow without a backing file) so remove the condition that would disallow it. (This patch additionally fixes crash described in https://bugzilla.redhat.com/show_bug.cgi?id=1215569 )	2015-04-29 09:32:53 +02:00
Zhang Bo	6f5d29f40d	qemu: make qemuMonitorChardevInfoFree non-static It would be used in qemumonitorjsontest, thus we make it non-static. Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com> Signed-off-by: Zhou Yimin <zhouyimin@huawei.com>	2015-04-28 16:50:11 +02:00
John Ferlan	b515339fe7	qemu: Remove need for qemuMonitorIOThreadInfoFree Replace with just VIR_FREE.	2015-04-28 06:33:49 -04:00
John Ferlan	69b16513a5	qemu: qemuProcessDetectIOThreadPIDs invert checks If we received zero iothreads from the monitor, but were perhaps expecting to receive something, then the code was skipping the check to ensure what's in the monitor matches our expectations. So invert the checks to check that what we get back matches expectations and then check there are zero iothreads returned.	2015-04-28 06:33:35 -04:00
John Ferlan	4c2ca5664a	qemu: Remove need for qemuDomainParseIOThreadAlias Rather than have a separate routine to parse the alias of an iothread returned from qemu in order to get the iothread_id value, parse the alias when returning and just return the iothread_id in qemuMonitorIOThreadInfoPtr This set of patches removes the function, changes the "char *name" to "unsigned int" and handles all the fallout.	2015-04-28 06:33:30 -04:00
John Ferlan	d8082d2d44	qemu: Resolve Coverity DEADCODE Coverity notes that the switch() used to check 'connected' values has two DEADCODE paths (_DEFAULT & _LAST). Since 'connected' is a boolean it can only be one or the other (CONNECTED or DISCONNECTED), so it just seems pointless to use a switch to get "all" values. Convert to if-else	2015-04-27 14:55:35 -04:00
John Ferlan	a27ed6e78c	qemu: Add support to Add/Delete IOThreads Add qemuDomainAddIOThread and qemuDomainDelIOThread in order to add or remove an IOThread to/from the host either for live or config options. The implementation for the 'live' option will use the iothreadpids list in order to make decision, while the 'config' option will use the iothreadids list. Additionally, for deletion each may have to adjust the iothreadpin list. IOThreads are implemented by qmp objects, the code makes use of the existing qemuMonitorAddObject or qemuMonitorDelObject APIs. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-04-27 12:36:36 -04:00
John Ferlan	b266486fb9	Move iothreadspin information into iothreadids Remove the iothreadspin array from cputune and replace with a cpumask to be stored in the iothreadids list. Adjust the test output because our printing goes in order of the iothreadids list now.	2015-04-27 12:36:35 -04:00
John Ferlan	8d4614a512	qemu: Use domain iothreadids to IOThread's 'thread_id' Add 'thread_id' to the virDomainIOThreadIDDef as a means to store the 'thread_id' as returned from the live qemu monitor data. Remove the iothreadpids list from _qemuDomainObjPrivate and replace with the new iothreadids 'thread_id' element. Rather than use the default numbering scheme of 1..number of iothreads defined for the domain, use the iothreadid's list for the iothread_id Since iothreadids list keeps track of the iothread_id's, these are now used in place of the many places where a for loop would "know" that the ID was "+ 1" from the array element. The new tests ensure usage of the <iothreadid> values for an exact number of iothreads and the usage of a smaller number of <iothreadid> values than iothreads that exist (and usage of the default numbering scheme).	2015-04-27 12:36:35 -04:00
Martin Kletzander	6957a606a5	Fix check for return value of qemuDomainAgentAvailable() Commit `dcbb243bbc` used the return value of the function as int even though it returns bool. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-04-27 10:11:35 +02:00
Zhang Bo	0a8bd97afa	qemu: fix memleaks in qemuBuildCommandLine free boot_opts_str and boot_order_str both in normal and error paths. Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com>	2015-04-27 10:04:38 +02:00
Jiri Denemark	659dbba37e	qemu: Fix comment for timeDelta Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-04-27 09:43:27 +02:00
zhang bo	21b64552fe	Fix typo in comment about memory binding rather then -> rather than Signed-off-by: YueWenyuan <yuewenyuan@huawei.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-04-27 09:05:29 +02:00
Peter Krempa	a03e2d3a99	qemu: Connect to guest agent after channel hotplug If a user hot-attaches the guest agent channel libvirt would ignore it until the restart of libvirtd or shutdown/destroy and start of the VM itself. This patch adds code that opens or closes the guest agent connection according to the state of the guest agent channel according to connect/disconnect events. To allow opening the channel from the event handler qemuConnectAgent needed to be exported.	2015-04-26 17:19:22 +02:00
Peter Krempa	e1c04108d7	qemu: agent: Differentiate errors when the agent channel was hotplugged When the guest agent channel gets hotplugged to a VM, libvirt would still report that "QEMU guest agent is not configured" rather than stating that the connection was not established yet. Currently the code won't be able to connect to the agent after hotplug but that will change in a later patch. As the qemuFindAgentConfig() helper is quite helpful in this case move it to a more usable place and export it.	2015-04-26 17:19:22 +02:00
Peter Krempa	dcbb243bbc	qemu: Reuse qemuDomainAgentAvailable in qemuDomainInterfaceAddresses	2015-04-26 17:19:22 +02:00
Peter Krempa	c6351cc689	qemu: Fix domain object leak in qemuDomainInterfaceAddresses The API didn't use virDomainObjEndAPI to release the domain object thus it leaked a reference to it.	2015-04-26 17:19:22 +02:00
Peter Krempa	249992876b	qemu: agent: Reuse virJSONValueObjectCreateVArgs in qemuAgentMakeCommand Since the code is now separated into the common helper, we can reuse it instead of maintaining two copies.	2015-04-26 17:19:22 +02:00
Cole Robinson	19425d110b	qemu: Build nvram directory at driver startup Similar to what was done for the channel socket in the previous commit.	2015-04-24 10:30:42 -04:00
Cole Robinson	e31ab02fd0	qemu: Build channel autosocket directory at driver startup Rather than depend on the RPM to put it in place, since this doesn't cover the qemu:///session case. Currently auto allocated socket path is completely busted with qemu:///session https://bugzilla.redhat.com/show_bug.cgi?id=1105274 And because we chown the directory at driver startup now, this also fixes autosocket startup failures when using user/group=root https://bugzilla.redhat.com/show_bug.cgi?id=1044561 https://bugzilla.redhat.com/show_bug.cgi?id=1146886	2015-04-24 10:30:42 -04:00
Cole Robinson	db3ccd582c	qemu: chown autoDumpPath on driver startup Not sure if this is required, but it makes things consistent with the rest of the directories.	2015-04-24 10:30:41 -04:00
Cole Robinson	c19f43ae7e	qemu: conf: Clarify paths that are relative to libDir Rather than duplicate libDir for each new path	2015-04-24 10:30:41 -04:00
Jiri Denemark	aa9f139599	migration: Usable time statistics without requiring NTP virDomainGetJobStats is able to report statistics of a completed migration, however to get usable downtime and total time statistics both hosts have to keep synchronized time. To provide at least some estimation of the times even when NTP daemons are not running on both hosts we can just ignore the time needed to transfer a migration cookie to the destination host. The result will be also inaccurate but a bit more predictable. The total/down time will just be at least what we report. https://bugzilla.redhat.com/show_bug.cgi?id=1213434	2015-04-24 15:02:00 +02:00
Michal Privoznik	620ff93bd3	virDomainObjListFindByName: Return referenced object Every domain that grabs a domain object to work over should reference it to make sure it won't disappear meanwhile. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-24 13:22:45 +02:00
Michal Privoznik	79d14a9930	Introduce virDomainObjEndAPI This is basically turning qemuDomObjEndAPI into a more general function. Other drivers which get a reference to domain objects may benefit from this function too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-24 13:22:45 +02:00
Peter Krempa	5a35b2e599	qemu: cgroup: Fix priorities when setting emulatorpin Use the custom emulator pin setting with the highest priority same as with vcpupin.	2015-04-24 09:59:38 +02:00
zhang bo	eadf41fe31	qemu: Don't fail to reboot domains with unresponsive agent just as what `b8e25c35d7` did, we fall back to the ACPI method when the guest agent is unresponsive in qemuDomainReboot(). Signed-off-by: YueWenyuan <yuewenyuan@huawei.com> Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-23 10:34:32 +02:00
Peter Krempa	bd57977391	qemu: migration: Refactor hostdev validation in migration check The hostdev check can error out right away.	2015-04-22 14:05:50 +02:00
Michal Privoznik	74acc4cabf	processNicRxFilterChangedEvent: Take appropriate actions for NET_TYPE_NETWORK too Because packets going through the egress from a bridge (where our bandwidth limiting takes place) have no information about which interface they came from, the QoS rules that we create instead use the source MAC address of the packets to make their decisions about which QDisc the packet should be in. One flaw in this is that when a guest changed the MAC address it used, packets from the guest would no longer be put into the correct QDisc, but would instead be put in an "unprivileged" class, resulting in the bandwidth "floor" (minimum guaranteed) being no longer honored. Now that libvirt has infrastructure to capture and respond to RX_FILTER_CHANGE events from qemu (sent whenever a guest interface modifies its MAC address, among other things), we can notice when a guest MAC address changes, and update the QoS rules accordingly, so that bandwidth floor is honored even after a guest MAC address change. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-22 09:50:24 +02:00
Luyao Huang	d174394105	qemu: Fix issues with maxMemory in qemuDomainSetMemoryFlags() qemuDomainSetMemoryFlags() would allow to set the initial memory greater than the <maxMemory> field. While the configuration would not work as memory hotplug requires NUMA to be enabled and the qemuDomainSetMemoryFlags() API does not work on NUMA guests this just fixes a corner case. The fix is still worth though as it allows to induce an invalid configuration and make the VM vanish on libvirt restart. Additionally this tweaks error message to be more accurate. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-04-22 09:26:25 +02:00
Laine Stump	38172ed894	qemu: set macvtap physdevs online when macvtap is set online A further fix for: https://bugzilla.redhat.com/show_bug.cgi?id=1113474 Since there is no possibility that any type of macvtap will work if the parent physdev it's attached to is offline, we should bring the physdev online at the same time as the macvtap. When taking the macvtap offline, it's also necessary to take the physdev offline for macvtap passthrough mode (because the physdev has the same MAC address as the macvtap device, so could potentially cause problems with misdirected packets during migration, as outlined in commits 829770 and 879c13). We can't set the physdev offline for other macvtap modes 1) because there may be other macvtap devices attached to the same physdev (and/or the host itself may be using the device) in the other modes whereas passthrough mode is exclusive to one macvtap at a time, and 2) there's no practical reason to do so anyway.	2015-04-21 12:34:29 -04:00
Cole Robinson	95546c43de	qemu: Always refresh capabilities if no <guests> found - Remove all qemu emulators - Restart libvirtd - Install qemu emulators - Call 'virsh version' -> errors The only thing that will force the qemu driver to refresh its cached capabilities info is an explicit API call to GetCapabilities. However in the case when the initial caps lookup at driver connect didn't find a single qemu emulator to poll, the driver is effectively useless and really can't do anything until it's populated some qemu capabilities info. With the above steps, the user would have to either know about the magic refresh capabilities call, or restart libvirtd to pick up the changes. Instead, this patch changes things so that every time a part of the driver requests access to capabilities info, check to see if we've previously seen any emulators. If not, force a refresh. In the case of 'still no emulators found', this is still very quick, so I can't think of a downside. https://bugzilla.redhat.com/show_bug.cgi?id=1000116	2015-04-21 11:14:20 -04:00
Cole Robinson	835cf84b7e	domain: conf: Drop expectedVirtTypes This needs to specified in way too many places for a simple validation check. The ostype/arch/virttype validation checks later in DomainDefParseXML should catch most of the cases that this was covering.	2015-04-20 16:43:43 -04:00
Cole Robinson	747761a79a	caps: Use DomainDataLookup to replace GuestDefault* This revealed that GuestDefaultEmulator was a bit buggy, capable of returning an emulator that didn't match the passed domain type. Fix up the test suite input to continue to pass.	2015-04-20 16:43:13 -04:00
Cole Robinson	4fa6f9b413	caps: Convert to use VIR_DOMAIN_VIRT internally	2015-04-20 16:40:26 -04:00
Cole Robinson	5f7c599456	domain: Convert os.type to VIR_DOMAIN_OSTYPE enum	2015-04-20 16:40:09 -04:00
Cole Robinson	d0440e3269	caps: Switch AddGuest to take VIR_DOMAIN_OSTYPE value Rather than an opencoded string. This should be a no-op	2015-04-20 16:38:09 -04:00
Michal Privoznik	0af9325e6a	domcaps: Check for architecture more wisely https://bugzilla.redhat.com/show_bug.cgi?id=1209948 So we have this bug. The virConnectGetDomainCapabilities() API performs a couple of checks before it produces any result. One of the checks is if the architecture requested by user can be run by the binary (again user provided). However, the check is pretty dumb. It merely compares if the default binary architecture matches the one provided by user. However, a qemu binary can run multiple architectures. For instance: qemu-system-ppc64 can run: ppc, ppcle, ppc64, ppc64le and ppcemb. The default is ppc64, so if user requested something else, like ppc64le, the check would have failed without obvious reason. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-17 17:00:15 +02:00
zhang bo	8be502fd90	qemuDomainShutdownFlags: Set fakeReboot more frequently When a qemu domain is to be rebooted, from outside, at libvirt level it looks like regular shutdown. To really restart the domain, libvirt needs to issue reset command on the monitor once SHUTDOWN event appeared. So, in order to differentiate bare shutdown and reboot libvirt uses a variable within domain private data. It's called fakeReboot. When the reboot API is called, the variable is set, but when the shutdown API is called it must be cleared out. But it was not for every possible case. So if user called virDomainReboot(), and there was no ACPI daemon running inside the guest (so guest didn't initiate shutdown sequence) and then virDomainShutdown(mode=agent) was called, a bad thing happened. We remembered the fakeReboot and instead of shutting the domain down, we just rebooted it. Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com> Signed-off-by: Wang Yufei <james.wangyufei@huawei.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-17 11:36:37 +02:00
Peter Krempa	c44108522b	qemu: monitor: Refactor and fix monitor checking Among all the monitor APIs some were checking if mon is NULL and some were not. It's possible to have mon equal to NULL in case a second call is attempted after entering the monitor, so every single API has to check for the monitor. This patch adds a macro that helps checking the state of the monitor and either refactors existing checking code to use the macro or adds it in case it was missing.	2015-04-16 14:49:46 +02:00
Peter Krempa	25aa7035d3	qemu: bulk stats: Ignore errors from missing/inaccessible disks Rather than erroring out make the best attempt to retrieve other data if disks are inaccessible or missing. The failure will still be logged though. Since the bulk stats API is called on multiple domains an error like this makes the API unusable. This regression was introduced by commit `596a137134` Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1209394	2015-04-16 09:24:04 +02:00
Ján Tomko	b693b2fb73	Emit VIR_DOMAIN_EVENT_ID_DEVICE_ADDED in the QEMU driver Only for devices that have an alias.	2015-04-15 17:06:01 +02:00
Peter Krempa	7961713410	qemu: monitor: Fix qemuMonitorGetAllBlockStatsInfo with HMP Commit `f6563bc3` introduced HMP impl of the function (so that a different uglier function could be removed). Before the HMP code is called there's a leftover check that the monitor is JSON which inhibits the code from working.	2015-04-15 13:58:26 +02:00
Peter Krempa	a745d83fe0	qemu: monitor: @running in qemuMonitorGetStatus is always non-NULL Add the attribute and remove the check.	2015-04-15 13:58:26 +02:00
Peter Krempa	81d14c0252	qemu: monitor: Don't use 'ret' variable where not necessary Quite a lot of places set the 'ret' variable just once right before returning its value. Remove such usage.	2015-04-15 13:58:26 +02:00
Peter Krempa	ee591240c2	qemu: monitor: Ensure that qemuMonitorSetLink is called with non-null name	2015-04-15 13:58:26 +02:00
Peter Krempa	0e9fadd66d	qemu: monitor: Sanitize control flow in qemuMonitorSetCapabilities	2015-04-15 13:58:26 +02:00
Peter Krempa	119aa5d35a	qemu: monitor: Clean up coding style Fix line spacing between functions, ensure that function return type is on a separate line and reflow arguments for VIR_DEBUG statements.	2015-04-15 13:58:25 +02:00
John Ferlan	29359e99bf	snapshot: Change 'index' to 'idx' Forthcoming syntax check rule will disallow usage of 'int index', so change it for snapshot	2015-04-14 11:18:28 -04:00
John Ferlan	f44429a0bc	qemu: Adjust the prototype to match the function Changing the prototype to not have "int *index" since we'll soon be disallowing index as a name. Curiously the original commit (`a4504ac`) for the function used 'int idx' in the function - so they didn't match. Now they do.	2015-04-14 11:18:28 -04:00
Martin Kletzander	f38bd06bff	sparc: Add default PCI root controller It is there even with -nodefaults and -no-user-config, so account for that so we can start sparc domains. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-04-14 14:49:15 +02:00
Huanle Han	c61ded8a7d	qemu: fix index error when clean up vport profile 1. 'last_good_net' indicates the index of the last successfully configured net, so def->nets[last_good_net] should also be cleaned up if an error occurs. 2. If an error occurs in 'virNetDevMacVLanVPortProfileRegisterCallback' (second 'goto err_exit' in loop), we should also do the 'virNetDevVPortProfileDisassociate' cleanup for the 'virNetDevVPortProfileAssociate' (first code block in loop). So we should consider the net successfully configured once the first code block in the loop finishes. Signed-off-by: Huanle Han <hanxueluo@gmail.com>	2015-04-14 14:49:15 +02:00
Shanzhi Yu	1db6212331	qemu: save domain status after set memory parameters After setting memory parameters for a running domain, the change needs to be saved to the live XML; otherwise it will disappear after libvirtd is restarted. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1211548 Signed-off-by: Shanzhi Yu <shyu@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-04-14 13:50:08 +02:00
John Ferlan	d375368790	qemu: Use 'idx' instead of 'index' for variable name Apparently for Xen-devel 'index' is a global and causes a build failure, so just use the shortened 'idx' instead to avoid the conflict. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-04-14 07:18:32 -04:00
Peter Krempa	634285f9c1	qemu: Refactor qemuDomainBlockJobAbort() Change few variable names and refactor the code flow. As an additional bonus the function now fails if the event state is not as expected.	2015-04-14 10:00:56 +02:00
Peter Krempa	8a609afb6f	qemu: drivePivot: Fix assumption when 'block-job-complete' fails QEMU does not abandon the mirror. The job carries on in the synchronised phase and it might be either pivoted again or cancelled. The commit hints that the described behavior was happening in a downstream version. If the command returns false there are two possible options: 1) qemu did not reach the point where it would ask the block job to pivot 2) pivoting failed in the actual qemu coroutine If either of those happens we return failure and reset the condition that waits for the block job to complete. This makes the API fail, but in the case where qemu would actually abandon the mirror the fact is notified via the event and handled asynchronously. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1202704	2015-04-14 10:00:56 +02:00
Peter Krempa	065a81082d	qemu: blockPull: Refactor the rest of qemuDomainBlockJobImpl Since it now handles only block pull code paths we can refactor it and remove tons of cruft.	2015-04-14 10:00:56 +02:00
Peter Krempa	cfc0a3d4ce	qemu: blockjob: Separate qemuDomainBlockJobAbort from qemuDomainBlockJobImpl Sacrifice a few lines of code in favor of the code being more readable.	2015-04-14 10:00:56 +02:00
Peter Krempa	1344a74ef2	qemu: blockjob: Split qemuDomainBlockJobSetSpeed from qemuDomainBlockJobImpl qemuDomainBlockJobImpl became an unmaintainable mess over the years of adding new stuff to it. This patch starts splitting up individual functions from it until it can be killed entirely. In bulk this will add lines of code rather than delete them but it will be traded for maintainability.	2015-04-14 10:00:56 +02:00
Peter Krempa	7db64d6b0a	qemu: monitor: Extract handling of JSON block job error codes My intention is to split qemuMonitorJSONBlockJob() into simpler separate functions for every block job type. Since the error handling code is the same for all block jobs, this patch extracts the code into a separate function that will later be reused in more places. With the new helper qemuMonitorJSONErrorIsClass we can save a few function calls as we can extract the error object once.	2015-04-14 10:00:56 +02:00
Peter Krempa	72613b18ac	qemu: monitor: json: Refactor error code class checker Split out the function that checks the actual error class string into a separate helper as it will be useful later and refactor qemuMonitorJSONHasError to return bool type and remove few useless checks. Basically virJSONValueObjectHasKey are useless here since the next call to virJSONValueObjectGet is checking the return value again (which can't fail at that point). By removing the first check we save a function call.	2015-04-14 10:00:56 +02:00
Peter Krempa	64985217bc	qemu: Fix condition for checking vcpu when pinning vcpus Previously we checked that the vcpu we are trying to set is in range of the number of threads presented by qemu. The problem is that if the VM is offline the count is 0. Since the condition subtracted 1 from the count the number would overflow and the check would never trigger. Replace the condition with more sensible ones with specific error messages. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1208434	2015-04-14 09:31:48 +02:00
Peter Krempa	714b38cb23	qemu: Enforce WWN to be unique among VM's disks Operating systems use the identifier to name the disks. As the name suggests the ID should be unique. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1208009	2015-04-14 08:44:36 +02:00
John Ferlan	2bcc263338	Rename qemuCheckIothreads to qemuCheckIOThreads Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-04-13 17:26:37 -04:00
Erik Skultety	b77ce18a28	virBitmap: Place virBitmapIsAllClear check after virBitmapParse calls This patch adds checks for empty bitmaps right after the calls of virBitmapParse. These only include spots where set APIs are called and where the domain's XML is parsed. Also, it partially reverts commit 983f5a which added a check for invalid nodeset "0,^0" into the virBitmapParse function. This change broke the logic, as an empty bitmap should not cause an error. https://bugzilla.redhat.com/show_bug.cgi?id=1210545	2015-04-13 14:21:02 +02:00
Ján Tomko	74b45e586e	Do not require virtio-pci devices when using virtio-mmio On arm, we probe for virtio--pci devices, but use their virtio--device variants. Set the capabilities based on the -device variants as well, to make them work with qemus with the PCI devices compiled out.	2015-04-13 12:23:13 +02:00
Michal Privoznik	65a88572ad	qemuMigrationPrecreateStorage: Fix debug message When pre-creating storage for domains, we need to find corresponding disk in the XML on the destination (domain XML may differ there, e.g. disk is accessible under different path). For better debugging, I'm printing all info I received on a disk. But there was a typo when printing the disk capacity: "%lluu" instead of "%llu". Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-13 11:40:57 +02:00
Xing Lin	522e81cbb5	qemu_migration.c: sleep first before checking for migration status. The problem with the previous implementation is, even when qemuMigrationUpdateJobStatus() detects a migration job has completed, it will do a sleep for 50 ms (which is unnecessary and only adds up to the VM pause time). Signed-off-by: Xing Lin <xinglin@cs.utah.edu> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-13 09:52:28 +02:00
John Ferlan	b487bb810e	qemu: qemuDomainHotplugVcpus - separate out pin adjustment code Future IOThread setting patches would copy the code anyway, so create and generalize the adding of pindef for the vcpu and the pinning of the thread into their own APIs.	2015-04-10 16:02:40 -04:00
Dmitry Guryanov	0d572b6982	conf: add VIR_DOMAIN_VIDEO_TYPE_PARALLELS video type We support VNC for containers to have the same interface with VMs. At this moment it just renders linux text console. Of course we don't pass any physical devices and don't emulate virtual devices. Our VNC server renders text from terminal master and sends input events from VNC client to terminal. So add special video type VIR_DOMAIN_VIDEO_TYPE_PARALLELS for these pseudo-devices. Signed-off-by: Dmitry Guryanov <dguryanov@parallels.com>	2015-04-10 09:50:29 +02:00
John Ferlan	97a1d94fa0	qemu: qemuDomainHotplugVcpus - separate out the del cgroup and pin Future IOThread setting patches would copy the code anyway, so create and generalize a delete cgroup and pindef for the vcpu into its own API. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-04-09 19:27:08 -04:00
John Ferlan	0ed8e47a7e	qemu: qemuDomainHotplugVcpus - separate out the add cgroup Future IOThread setting patches would copy the code anyway, so create and generalize the add the vcpu to a cgroup into its own API. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-04-09 19:27:08 -04:00
John Ferlan	0456eda317	cgroup: Use virCgroupNewThread Replace the virCgroupNew{Vcpu\|Emulator\|IOThread} calls with the common virCgroupNewThread API Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-04-09 19:27:08 -04:00
Peter Krempa	a45ef3a9cd	qemu: Avoid shadow of 'sync' symbol Old compilers whine that 'sync' is being shadowed in the function introduced in `1eccac1d2d`.	2015-04-09 15:36:26 +02:00
Peter Krempa	7c62f239f4	qemu: blockPivot: Don't pause the VM any more since we don't use drive-reopen Support for drive-reopen was never present in the upstream code so we don't need to pause the VM when doing the block pivot. Kill all the code related to this semi-upstream artifact.	2015-04-09 15:04:30 +02:00
Peter Krempa	db37f3cc3a	qemu: Clean up old leftovers in qemuMonitorDrivePivot There are two leftover unused variables. Remove them and clean up the fallout of the change.	2015-04-09 14:18:48 +02:00
Peter Krempa	3eab2f647a	qemu: blockjob: Use the new helpers in qemuDomainGetBlockJobInfo Refactor the function to use the new helpers.	2015-04-09 14:11:49 +02:00
Peter Krempa	1eccac1d2d	qemu: domain: Add helper to check block job support We need to check that qemu supports block jobs in multiple places. Add a helper to do the check.	2015-04-09 14:11:42 +02:00
Peter Krempa	88dc7e0c2f	qemu: domain: Introduce helper to retrieve domain monitor object In some cases where the function does not need to access the private data this helper may be used to retrieve the monitor object.	2015-04-09 14:11:36 +02:00
Luyao Huang	7cd0cf05f7	fix memleak in qemuRestoreCgroupState 131,088 bytes in 16 blocks are definitely lost in loss record 2,174 of 2,176 at 0x4C29BFD: malloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) by 0x4C2BACB: realloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) by 0x52A026F: virReallocN (viralloc.c:245) by 0x52BFCB5: saferead_lim (virfile.c:1268) by 0x52C00EF: virFileReadLimFD (virfile.c:1328) by 0x52C019A: virFileReadAll (virfile.c:1351) by 0x52A5D4F: virCgroupGetValueStr (vircgroup.c:763) by 0x1DDA0DA3: qemuRestoreCgroupState (qemu_cgroup.c:805) by 0x1DDA0DA3: qemuConnectCgroup (qemu_cgroup.c:857) by 0x1DDB7BA1: qemuProcessReconnect (qemu_process.c:3694) by 0x52FD171: virThreadHelper (virthread.c:206) by 0x82B8DF4: start_thread (pthread_create.c:308) by 0x85C31AC: clone (clone.S:113) Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-04-08 11:56:30 +02:00
Michal Privoznik	ea576ee543	qemuProcessHook: Call virNuma*() only when needed https://bugzilla.redhat.com/show_bug.cgi?id=1198645 Once upon a time, there was a little domain. And the domain was pinned onto a NUMA node and hasn't fully allocated its memory: <memory unit='KiB'>2355200</memory> <currentMemory unit='KiB'>1048576</currentMemory> <numatune> <memory mode='strict' nodeset='0'/> </numatune> Oh little me, said the domain, what will I do with so little memory. If I only had a few megabytes more. But the old admin noticed the whimpering, barely audible to untrained human ear. And good admin he was, he gave the domain yet more memory. But the old NUMA topology witch forbade to allocate more memory on the node zero. So he decided to allocate it on a different node: virsh # numatune little_domain --nodeset 0-1 virsh # setmem little_domain 2355200 The little domain was happy. For a while. Until bad, sharp teeth shaped creature came. Every process in the system was afraid of him. The OOM Killer they called him. Oh no, he's after the little domain. There's no escape. Do you kids know why? Because when the little domain was born, her father, Libvirt, called numa_set_membind(). So even if the admin allowed her to allocate memory from other nodes in the cgroups, the membind() forbid it. So what's the lesson? Libvirt should rely on cgroups, whenever possible and use numa_set_membind() as the last ditch effort. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-08 11:54:31 +02:00
Michael Chapman	cfcdf5ff01	qemu_driver: check caps after starting block job Currently we check qemuCaps before starting the block job. But qemuCaps isn't available on a stopped domain, which means we get a misleading error message in this case: # virsh domstate example shut off # virsh blockjob example vda error: unsupported configuration: block jobs not supported with this QEMU binary Move the qemuCaps check into the block job so that we are guaranteed the domain is running. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-08 11:16:19 +02:00
Michael Chapman	72df8314f0	qemu_migrate: use nested job when adding NBD to cookie qemuMigrationCookieAddNBD is usually called from within an async MIGRATION_OUT or MIGRATION_IN job, so it needs to start a nested job. (The one exception is during the Begin phase when change protection isn't enabled, but qemuDomainObjEnterMonitorAsync will behave the same as qemuDomainObjEnterMonitor in this case.) This bug was encountered with a libvirt client that repeatedly queries the disk mirroring block job info during a migration. If one of these queries occurs just as the Perform migration cookie is baked, libvirt crashes. Relevant logs are as follows: 6701: warning : qemuDomainObjEnterMonitorInternal:1544 : This thread seems to be the async job owner; entering monitor without asking for a nested job is dangerous [1] 6701: info : qemuMonitorSend:972 : QEMU_MONITOR_SEND_MSG: mon=0x7fefdc004700 msg={"execute":"query-block","id":"libvirt-629"} [2] 6699: info : qemuMonitorIOWrite:503 : QEMU_MONITOR_IO_WRITE: mon=0x7fefdc004700 buf={"execute":"query-block","id":"libvirt-629"} [3] 6704: info : qemuMonitorSend:972 : QEMU_MONITOR_SEND_MSG: mon=0x7fefdc004700 msg={"execute":"query-block-jobs","id":"libvirt-630"} [4] 6699: info : qemuMonitorJSONIOProcessLine:203 : QEMU_MONITOR_RECV_REPLY: mon=0x7fefdc004700 reply={"return": [...], "id": "libvirt-629"} 6699: error : qemuMonitorJSONIOProcessLine:211 : internal error: Unexpected JSON reply '{"return": [...], "id": "libvirt-629"}' At [1] qemuMonitorBlockStatsUpdateCapacity sends its request, then waits on mon->notify. At [2] the request is written out to the monitor socket. At [3] qemuMonitorBlockJobInfo sends its request, and also waits on mon->notify. The reply from the first request is received at [4]. However, qemuMonitorJSONIOProcessLine is not expecting this reply since the second request hadn't completed sending. The reply is dropped and an error is returned. 
qemuMonitorIO signals mon->notify twice during its error handling, waking up both of the threads waiting on it. One of them clears mon->msg as it exits qemuMonitorSend; the other crashes: qemuMonitorSend (mon=0x7fefdc004700, msg=<value optimized out>) at qemu/qemu_monitor.c:975 975 while (!mon->msg->finished) { (gdb) print mon->msg $1 = (qemuMonitorMessagePtr) 0x0 Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-08 10:30:17 +02:00
Michael Chapman	e5d729ba42	qemu: fix race between disk mirror fail and cancel If a VM migration is aborted, a disk mirror may be failed by QEMU before libvirt has a chance to cancel it. The disk->mirrorState remains at _ABORT in this case, and this breaks subsequent mirrorings of that disk. We should instead check the mirrorState directly and transition to _NONE if it is already aborted. Do the check after aborting the block job in QEMU to avoid a race. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-08 09:45:47 +02:00
Michael Chapman	77ddd0bba2	qemu: fix error propagation in qemuMigrationBegin If virCloseCallbacksSet fails, qemuMigrationBegin must return NULL to indicate an error occurred. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-08 09:45:47 +02:00
Michael Chapman	7578cc17f5	qemu: fix crash in qemuProcessAutoDestroy The destination libvirt daemon in a migration may segfault if the client disconnects immediately after the migration has begun: # virsh -c qemu+tls://remote/system list --all Id Name State ---------------------------------------------------- ... # timeout --signal KILL 1 \ virsh migrate example qemu+tls://remote/system \ --verbose --compressed --live --auto-converge \ --abort-on-error --unsafe --persistent \ --undefinesource --copy-storage-all --xml example.xml Killed # virsh -c qemu+tls://remote/system list --all error: failed to connect to the hypervisor error: unable to connect to server at 'remote:16514': Connection refused The crash is in: 1531 void 1532 qemuDomainObjEndJob(virQEMUDriverPtr driver, virDomainObjPtr obj) 1533 { 1534 qemuDomainObjPrivatePtr priv = obj->privateData; 1535 qemuDomainJob job = priv->job.active; 1536 1537 priv->jobs_queued--; Backtrace: #0 at qemuDomainObjEndJob at qemu/qemu_domain.c:1537 #1 in qemuDomainRemoveInactive at qemu/qemu_domain.c:2497 #2 in qemuProcessAutoDestroy at qemu/qemu_process.c:5646 #3 in virCloseCallbacksRun at util/virclosecallbacks.c:350 #4 in qemuConnectClose at qemu/qemu_driver.c:1154 ... qemuDomainRemoveInactive calls virDomainObjListRemove, which in this case is holding the last remaining reference to the domain. qemuDomainRemoveInactive then calls qemuDomainObjEndJob, but the domain object has been freed and poisoned by then. This patch bumps the domain's refcount until qemuDomainRemoveInactive has completed. We also ensure qemuProcessAutoDestroy does not return the domain to virCloseCallbacksRun to be unlocked in this case. There is similar logic in bhyveProcessAutoDestroy and lxcProcessAutoDestroy (which call virDomainObjListRemove directly). Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-04-08 09:45:47 +02:00
Michal Privoznik	225aa80246	virQEMUDriverGetConfig: Fix memleak ==19015== 968 (416 direct, 552 indirect) bytes in 1 blocks are definitely lost in loss record 999 of 1,049 ==19015== at 0x4C2C070: calloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) ==19015== by 0x52ADF14: virAllocVar (viralloc.c:560) ==19015== by 0x5302FD1: virObjectNew (virobject.c:193) ==19015== by 0x1DD9401E: virQEMUDriverConfigNew (qemu_conf.c:164) ==19015== by 0x1DDDF65D: qemuStateInitialize (qemu_driver.c:666) ==19015== by 0x53E0823: virStateInitialize (libvirt.c:777) ==19015== by 0x11E067: daemonRunStateInit (libvirtd.c:905) ==19015== by 0x53201AD: virThreadHelper (virthread.c:206) ==19015== by 0xA1EE1F2: start_thread (in /lib64/libpthread-2.19.so) ==19015== by 0xA4EFC8C: clone (in /lib64/libc-2.19.so) Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-07 18:52:27 +02:00
Michal Privoznik	9dbe6f3151	qemuSetupCgroupForVcpu: Fix memleak ==19015== 1,064 (656 direct, 408 indirect) bytes in 2 blocks are definitely lost in loss record 1,002 of 1,049 ==19015== at 0x4C2C070: calloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) ==19015== by 0x52AD74B: virAlloc (viralloc.c:144) ==19015== by 0x52B47CA: virCgroupNew (vircgroup.c:1057) ==19015== by 0x52B53E5: virCgroupNewVcpu (vircgroup.c:1451) ==19015== by 0x1DD85A40: qemuSetupCgroupForVcpu (qemu_cgroup.c:1013) ==19015== by 0x1DDA66EA: qemuProcessStart (qemu_process.c:4844) ==19015== by 0x1DDF1807: qemuDomainObjStart (qemu_driver.c:7265) ==19015== by 0x1DDF1A66: qemuDomainCreateWithFlags (qemu_driver.c:7320) ==19015== by 0x1DDF1ACD: qemuDomainCreate (qemu_driver.c:7337) ==19015== by 0x53F87EA: virDomainCreate (libvirt-domain.c:6820) ==19015== by 0x12690A: remoteDispatchDomainCreate (remote_dispatch.h:3481) ==19015== by 0x126827: remoteDispatchDomainCreateHelper (remote_dispatch.h:3457) Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-07 18:52:26 +02:00
Ján Tomko	1371ea92f0	Auto add virtio-serial controllers In virDomainVirtioSerialAddrNext, add another controller if we've exhausted all ports of the existing controllers. https://bugzilla.redhat.com/show_bug.cgi?id=1076708	2015-04-02 15:00:13 +02:00
Ján Tomko	89e991a2aa	Assign an address when hotplugging a virtio-serial device	2015-04-02 15:00:13 +02:00
Ján Tomko	ee0d97a770	Expand the address set when attaching a virtio-serial controller	2015-04-02 15:00:13 +02:00
Ján Tomko	5903378834	Allocate virtio-serial addresses when starting a domain Instead of always using controller 0 and incrementing port number, respect the maximum port numbers of controllers and use all of them. Ports for virtio consoles are quietly reserved, but not formatted (neither in XML nor on QEMU command line). Also rejects duplicate virtio-serial addresses. https://bugzilla.redhat.com/show_bug.cgi?id=890606 https://bugzilla.redhat.com/show_bug.cgi?id=1076708 Test changes: * virtio-auto.args Filling out the port when just the controller is specified. switched from using maxport + 1 to: first free port on the controller * virtio-autoassign.args Filling out the address when no <address> is specified. Started using all the controllers instead of 0, also discards the bus value. * xml -> xml output of virtio-auto The port assignment is no longer done as a part of XML parsing, so the unspecified values stay 0.	2015-04-02 15:00:13 +02:00
Luyao Huang	20aca080f3	qemuDomainBlockCopy: Check @granularity to be a power of two https://bugzilla.redhat.com/show_bug.cgi?id=1206479 As described in virDomainBlockCopy() parameters description, the VIR_DOMAIN_BLOCK_COPY_GRANULARITY parameter may require the value to have some specific attributes (e.g. be a power of two or fall within a certain range). And in qemu, a power of two is required. However, our code does not check that and lets the qemu operation fail. Moreover, the virsh man page is not as exact as it could be in this respect. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-04-02 11:12:01 +02:00
zhang bo	dd725c53e9	qemu: lifecycle: make agent-mode shutdown and reboot timeout When we shutdown/reboot a guest using agent-mode, if the guest itself blocks indefinitely, libvirt would block in qemuAgentShutdown() forever. Thus, we set a timeout for shutdown/reboot; from our experience, 60 seconds is fine. Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com> Signed-off-by: Wang Yufei <james.wangyufei@huawei.com>	2015-04-02 11:08:48 +02:00
Shanzhi Yu	ffe3d3e886	conf: Rename virDomainHasDiskMirror and detect block jobs properly virDomainHasDiskMirror() currently detects only jobs that add the mirror elements. Since some operations like migration are interlocked by existing block jobs on the given domain the check needs to be instrumented to check regular jobs too. This patch renames virDomainHasDiskMirror to virDomainHasDiskBlockjob and adds an argument that allows to select that it returns true only for block copy jobs as those interlock making the domain persistent. Other two uses trigger on any block job type. Signed-off-by: Shanzhi Yu <shyu@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-04-02 10:37:47 +02:00
Peter Krempa	c2a81eb7e1	qemu: snapshot: Check for block jobs individually If any disk of a VM was involved in a (copy) block job we refused to do a snapshot. As not only copy jobs interlock snapshots and the interlocking is applicable to individual disks only we can make the check in a more individual fashion and interlock all block job types supported by libvirt. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1203628	2015-04-02 10:37:47 +02:00
Ján Tomko	a0482396d7	Remove unused macros In the order of appearance: * MAX_LISTEN - never used added by `23ad665c` (qemud) and addec57 (lock daemon) * NEXT_FREE_CLASS_ID - never used, added by `07d1b6b` * virLockError - never used, added by `eb8268a4` * OPENVZ_MAX_ARG, CMDBUF_LEN, CMDOP_LEN unused since the removal of ADD_ARG_LIT in `d8b31306` * QEMU_NB_PER_CPU_STAT_PARAM - unused since `897808e` * QEMU_CMD_PROMPT, QEMU_PASSWD_PROMPT - unused since `1dc10a7` * TEST_MODEL_WORDSIZE - unused since `c25c18f7` * TEMPDIR - never used, added by `714bef5` * NSIG - workaround around old headers added by commit `60ed1d2` unused since virExec was moved by commit `02e8691` * DO_TEST_PARSE - never used, added by `9afa006` * DIFF_MSEC, GETTIMEOFDAY - unused since `eee6eb6`	2015-04-02 10:27:56 +02:00
Peter Krempa	4c7fb93d5a	qemu: Copy bitmap in a sane way Use virBitmapNewCopy instead of a combination of virBitmapNew and virBitmapCopy.	2015-04-02 10:12:09 +02:00
Peter Krempa	6afb0d04fe	qemu: cgroup: Kill qemuSetupCgroupVcpuPin() The function doesn't make sense. There's a simpler way to achieve the same.	2015-04-02 10:12:08 +02:00
Peter Krempa	8a81264b18	qemu: cgroup: Kill qemuSetupCgroupIOThreadsPin() The function doesn't make sense. There's a simpler way to achieve the same.	2015-04-02 10:12:08 +02:00
Peter Krempa	55072593d8	qemu: cgroup: Rename qemuSetupCgroupEmulatorPin to qemuSetupCgroupCpusetCpus The function is used to set cpuset.cpus in various other helpers.	2015-04-02 10:12:08 +02:00
Peter Krempa	98f08aba8e	qemu: cgroup: Use priv->autoCpuset instead of using qemuPrepareCpumap() Two places would call to qemuPrepareCpumap() with priv->autoNodeset to convert it to a cpuset. Remove the function and use the prepared cpuset automatically.	2015-04-02 10:12:08 +02:00
Peter Krempa	f0fa9080d4	qemu: cgroup: Properly set up vcpu pinning When the default cpuset or automatic numa placement is used libvirt would place the whole parent cgroup in the specified cpuset. This then disallowed to re-pin the vcpus to a different cpu. This patch pins only the vcpu threads to the default cpuset and thus allows to re-pin them later. The following config would fail to start: <domain type='kvm'> ... <vcpu placement='static' cpuset='0-1' current='2'>4</vcpu> <cputune> <vcpupin vcpu='0' cpuset='2-3'/> ... This is a regression since `a39f69d2b`.	2015-04-02 10:12:08 +02:00
Peter Krempa	7095006921	qemu: cgroup: Refactor setup for IOThread cgroups Use the default or auto cpuset if they are provided for IOThreads.	2015-04-02 10:12:08 +02:00
Peter Krempa	c9f9fa25d3	qemu: cgroup: Store auto cpuset instead of re-creating it on demand The automatic cpuset can be stored along with automatic nodeset and it does not have to be recreated when used.	2015-04-02 10:12:08 +02:00
Peter Krempa	630ee5ac6c	qemu: blockjob: Synchronously update backing chain in XML on ABORT/PIVOT When the synchronous pivot option is selected, libvirt would not update the backing chain until the job was exited. Some applications then received invalid data as their job serialized first. This patch removes polling to wait for the ABORT/PIVOT job completion and replaces it with a condition. If a synchronous operation is requested the update of the XML is executed in the job of the caller of the synchronous request. Otherwise the monitor event callback uses a separate worker to update the backing chain with a new job. This is a regression since `1a92c71910` When the ABORT job is finished synchronously you get the following call stack: #0 qemuBlockJobEventProcess #1 qemuDomainBlockJobImpl #2 qemuDomainBlockJobAbort #3 virDomainBlockJobAbort While previously or while using the _ASYNC flag you'd get: #0 qemuBlockJobEventProcess #1 processBlockJobEvent #2 qemuProcessEventHandler #3 virThreadPoolWorker	2015-03-31 08:36:17 +08:00
Peter Krempa	0c4474df4e	qemu: Extract internals of processBlockJobEvent into a helper Later on I'll be adding a condition that will allow to synchronise a SYNC block job abort. The approach will require this code to be called from two different places so it has to be extracted into a helper.	2015-03-31 08:36:14 +08:00
Peter Krempa	6b6c4ab8a6	qemu: processBlockJob: Don't unlock @vm twice Commit `1a92c719` moved code to handle block job events to a different function that is executed in a separate thread. The caller of processBlockJob handles locking and unlocking of @vm, so we should not do it in the function itself.	2015-03-31 08:36:10 +08:00
Peter Krempa	3c6a72d537	qemu: blockCopy: Pass adjusted bandwidth when called via blockRebase The block copy API takes the speed in bytes/s rather than MiB/s that was the prior approach in virDomainBlockRebase. We correctly converted the speed to bytes/s in the old API but we still called the common helper virDomainBlockCopyCommon with the unadjusted variable. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1207122	2015-03-30 17:34:13 +02:00
Michal Privoznik	53eae3e7c3	qemuDomainGetNumaParameters: Check for the correct CGroup controller When getting info on NUMA parameters for domain, virCgroupGetCpusetMems() may be called. However, as of `43b67f2e` the call is guarded by check if memory controller is present. Even though it may be not obvious instantly, NUMA parameters are stored under cpuset controller. Therefore the check needs to look like this: if (!virCgroupHasController(priv->cgroup, VIR_CGROUP_CONTROLLER_CPUSET) \|\| virCgroupGetCpusetMems(priv->cgroup, &nodeset) < 0) { Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-30 15:20:34 +02:00
Shanzhi Yu	c5fbad6623	qemu: end the job when try to blockcopy to non-file destination Blockcopy to a non-file destination is not supported according to the code, but a 'goto endjob' is missing after checking the destination. This leads to calling drive-mirror with wrong parameters. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1206406 Signed-off-by: Shanzhi Yu <shyu@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-03-27 10:30:50 +01:00
Ján Tomko	a96b68e7a4	Rename DomainGetIOThreadsInfo to DomainGetIOThreadInfo While it returns info about multiple threads, the version without the plural is easier to read.	2015-03-26 16:11:10 +01:00
Ján Tomko	9e48f6cf9f	Rename qemuMonitorIOThreadsInfo* to qemuMonitorIOThreadInfo* It only deals with a single thread.	2015-03-26 16:11:10 +01:00
Ján Tomko	cf8b828a72	Rename virDomainIOThreadsInfoFree to virDomainIOThreadInfoFree This function only frees the info for one thread.	2015-03-26 16:11:10 +01:00
Luyao Huang	a0bbdcd788	qemu: command: Fix property name for start address of a pc-dimm module Starting a qemu VM with a memory module that has the base address specified results in the following error: error: internal error: early end of file from monitor: possible problem: 2015-03-26T03:45:52.338891Z qemu-kvm: -device pc-dimm,node=0,memdev=memdimm0, id=dimm0,slot=0,base=4294967296: Property '.base' not found The correct property name for the base address is 'addr'. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-03-26 09:22:21 +01:00
Jiri Denemark	53c8062f7e	qemu: Give hint about -noTSX CPU model Because of the microcode update to Haswell/Broadwell CPUs, existing domains using these CPUs may fail to start even though they used to run just fine. To help users solve this issue we try to suggest switching to -noTSX variant of the CPU model: virsh # start cd error: Failed to start domain cd error: unsupported configuration: guest and host CPU are not compatible: Host CPU does not provide required features: rtm, hle; try using 'Haswell-noTSX' CPU model Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-03-26 09:20:00 +01:00
Peter Krempa	82f349a3a8	qemu: command: Check for empty network source when formatting drive cmd Use the virStorageSourceIsEmpty helper to determine whether the drive source is empty rather than checking for src->path. This will fix start of VM with empty network cdrom that would not report any error.	2015-03-26 08:24:46 +01:00
Peter Krempa	df9361859d	qemu: command: Report error when formatting network source with protocol _NONE The function that formats the string for network drives would return error code but did not set the error message when called on storage source with VIR_STORAGE_NET_PROTOCOL_LAST or _NONE. Report an error in this case if it would ever be called in that way.	2015-03-26 08:24:46 +01:00
Peter Krempa	9d574aa2c9	qemu: domain: Don't leak device alias list While adding tests for status XML parsing and formatting I've noticed that the device alias list is leaked. ==763001== 81 (48 direct, 33 indirect) bytes in 1 blocks are definitely lost in loss record 414 of 514 ==763001== at 0x4C2B8F0: calloc (vg_replace_malloc.c:623) ==763001== by 0x6ACF70F: virAllocN (viralloc.c:191) ==763001== by 0x447B64: qemuDomainObjPrivateXMLParse (qemu_domain.c:727) ==763001== by 0x6B848F9: virDomainObjParseXML (domain_conf.c:15491) ==763001== by 0x6B84CAC: virDomainObjParseNode (domain_conf.c:15608)	2015-03-25 13:27:22 +01:00
Luyao Huang	726072f0d2	qemu: Report better error when memory device source has wrong NUMA node When starting a VM with hotpluggable memory devices the user may specify an invalid source NUMA node. Libvirt would pass through the error from qemu: # virsh start test3 error: Failed to start domain test3 error: internal error: process exited while connecting to monitor: 2015-03-25T01:12:17.205913Z qemu-kvm: -object memory-backend-ram,id=memdimm0 ,size=536870912,host-nodes=1-3,policy=bind: cannot bind memory to host NUMA nodes: Invalid argument This patch adds a check that allows to report better error: # virsh start test3 error: Failed to start domain test3 error: configuration unsupported: NUMA node 1 is unavailable Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-03-25 12:24:40 +01:00
Jiri Denemark	f6fbd36fd4	qemu: Add timing to domain jobs Whenever we fail to acquire a job, we can report how long ago it was locked by another API. https://bugzilla.redhat.com/show_bug.cgi?id=853839 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-03-25 10:00:54 +01:00
Jiri Denemark	b79f25e881	qemu: Track the API which started the current job This is very helpful when we want to log and report why we could not acquire a state change lock. Reporting what job keeps it locked helps with understanding the issue. Moreover, after calling virDomainGetControlInfo, it's possible to tell whether libvirt is just stuck somewhere within the API (or it just forgot to cleanup the job) or whether libvirt is waiting for QEMU to reply. The error message will look like the following: # virsh resume cd error: Failed to resume domain cd error: Timed out during operation: cannot acquire state change lock (held by remoteDispatchDomainSuspend) https://bugzilla.redhat.com/show_bug.cgi?id=853839 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-03-25 10:00:54 +01:00
Jiri Denemark	667cce7b60	Set thread job for every RPC call Since all APIs are also RPC calls, we automatically get all APIs covered with thread jobs. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-03-25 10:00:54 +01:00
Pavel Hrdina	6cf1e11cc0	qemu: fix set vcpus on host without NUMA We don't have to modify cpuset.mems on hosts without NUMA. It also fixes an error message that you get instead of success if you are trying to update vcpus of a guest on a host without NUMA. error: internal error: NUMA isn't available on this host Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-03-24 19:02:58 +01:00
Pavel Hrdina	5cd3c5012d	qemu: cleanup setvcpus Remove unnecessary maximum variable. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-03-24 19:02:58 +01:00
Pavel Hrdina	5bb0666545	qemu: move virDomainLiveConfigHelperMethod right after BeginJob We should call virDomainLiveConfigHelperMethod ASAP because this function transfers VIR_DOMAIN_AFFECT_CURRENT to VIR_DOMAIN_AFFECT_LIVE or VIR_DOMAIN_AFFECT_CONFIG. All other additional checks for those two flags should consider that the user gave us VIR_DOMAIN_AFFECT_CURRENT. Remove the unnecessary check whether the domain is live in case of VIR_DOMAIN_VCPU_GUEST because this check is done by virDomainLiveConfigHelperMethod. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-03-24 19:02:58 +01:00
Ján Tomko	68545ea629	Fix typo in error message by rewriting it completely from: error: unsupported configuration: virtio only support device address type 'PCI' to: error: unsupported configuration: virtio disk cannot have an address of type drive Since we now support CCW addresses as well.	2015-03-24 18:06:38 +01:00
Laine Stump	dae3e24663	qemu: change accidental VIR_WARNING back to VIR_DEBUG While debugging the support for responding to qemu RX_FILTER_CHANGED events, I had changed the "ignoring this event" log message from VIR_DEBUG to VIR_WARN, but forgot to change it back before pushing. Since many guest OSes make enough changes to multicast lists and/or promiscuous mode settings to trigger this message, it's starting to show up as a red herring in bug reports.	2015-03-24 11:51:27 -04:00
Peter Krempa	3b289a81ea	qemu: Implement memory device hotunplug Add code to hot-remove memory devices from qemu. Unfortunately QEMU doesn't support this right now, so this is just for completeness.	2015-03-23 14:41:14 +01:00
Peter Krempa	9b4654f6f1	qemu: Implement memory device hotplug Add code to hot-add memory devices to running qemu instances.	2015-03-23 14:31:30 +01:00
Peter Krempa	96094fb28d	qemu: conf: Add support for memory device cold(un)plug Add a few helpers that allow to operate with memory device definitions on the domain config and use them to implement memory device coldplug in the qemu driver.	2015-03-23 14:25:16 +01:00
Peter Krempa	8b54bffbab	qemu: add support for memory devices Add support to start qemu instance with 'pc-dimm' device. Thanks to the refactors we are able to reuse the existing function to determine the parameters.	2015-03-23 14:25:15 +01:00
Peter Krempa	c5710066e8	qemu: migration: Forbid migration with memory modules lacking info Make sure that libvirt has all vital information needed to reliably represent configuration of guest's memory devices in case of a migration. This patch forbids migration in case the required slot number and module base address are not present (failed to be loaded from qemu via monitor).	2015-03-23 14:25:15 +01:00
Peter Krempa	5cdfaa31c4	qemu: memdev: Add infrastructure to load memory device information When using 'dimm' memory devices with qemu, some of the information like the slot number and base address need to be reloaded from qemu after process start so that it reflects the actual state. The state then allows to use memory devices across migrations.	2015-03-23 14:25:15 +01:00
Peter Krempa	3e4230d270	conf: Add interface to parse and format memory device information This patch adds code that parses and formats configuration for memory devices. A simple configuration would be: <memory model='dimm'> <target> <size unit='KiB'>524287</size> <node>0</node> </target> </memory> A complete configuration of a memory device: <memory model='dimm'> <source> <pagesize unit='KiB'>4096</pagesize> <nodemask>1-3</nodemask> </source> <target> <size unit='KiB'>524287</size> <node>1</node> </target> </memory> This patch preemptively forbids use of the <memory> device in individual drivers so the users are warned right away that the device is not supported.	2015-03-23 14:25:15 +01:00
Peter Krempa	a41185d8d1	qemu: Implement setup of memory hotplug parameters To enable memory hotplug the maximum memory size and slot count need to be specified. As qemu supports now other units than mebibytes when specifying memory, use the new interface in this case.	2015-03-23 14:25:14 +01:00
Peter Krempa	bffb9163a1	conf: Add support for parsing and formatting max memory and slot count Add a XML element that will allow to specify maximum supportable memory and the count of memory slots to use with memory hotplug. To avoid possible confusion and misuse of the new element this patch also explicitly forbids the use of the maxMemory setting in individual drivers's post parse callbacks. This limitation will be lifted when the support is implemented.	2015-03-23 14:25:14 +01:00
Peter Krempa	b15f588385	qemu: monitor: Don't leak @props with non-JSON in qemuMonitorAddObject The function comment states that @props is always consumed, even on failure. This was not true with the failure if the monitor is not using QMP.	2015-03-23 14:25:14 +01:00
Peter Krempa	104011ea8b	qemu: Don't return memory device config on error in qemuBuildMemoryBackendStr In the last section if the function determines that the config is invalid when QEMU doesn't support the memory device the JSON config object would be returned even if it doesn't make sense. Assign the object to be returned only on success.	2015-03-23 14:20:53 +01:00
Boris Fiuczynski	c35b277671	qemu: Set default SCSI controller model for S390 arch When no model is specified in the domain definition for a scsi controller and the architecture is s390, then virtio-scsi is set as the default model. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Daniel Hansel <daniel.hansel@linux.vnet.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com> Reviewed-by: Jens Freimann <jfrei@linux.vnet.ibm.com> Reviewed-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-23 12:19:52 +01:00
Michael Chapman	a1b1805155	qemu: skip precreation of network disks Commit `cf54c60699` introduced the ability to create missing storage volumes during migration. For network disks, however, we may not necessarily be able to detect whether they already exist -- there is no straight-forward way to map the disk to a storage volume, and even if there were it's possible no configured storage pool actually contains the disk. It is better to assume the network disk exists in this case, rather than aborting the migration completely. If the volume really is missing, QEMU will generate an appropriate error later in the migration. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2015-03-23 10:25:20 +01:00
Martin Kletzander	0e7457e501	Fix common misspellings Wikipedia's list of common misspellings [1] has a machine-readable version. This patch fixes those misspellings mentioned in the list which don't have multiple right variants (as e.g. "accension", which can be both "accession" and "ascension"), such misspellings are left untouched. The list of changes was manually re-checked for false positives. [1] https://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-03-23 09:01:30 +01:00
Martin Kletzander	3a0e5b0c20	qemu: Migrate memory on numatune change We've never set the cpuset.memory_migrate value to anything, keeping it on default. However, we allow changing cpuset.mems on live domain. That setting, however, doesn't have any consequence on a domain unless it's going to allocate new memory. I managed to make 'virsh numatune' move all the memory to any node I wanted even without disabling libnuma's numa_set_membind(), so this should be safe to use with it as well. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1198497 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-03-20 13:40:02 +01:00
Luyao Huang	4f06820938	qemu: do not overwrite the error in qemuDomainObjExitMonitor https://bugzilla.redhat.com/show_bug.cgi?id=1196934 When qemu exits during startup, libvirt includes the error from /var/log/libvirt/qemu/vm.log in the error message: $ virsh start test3 error: Failed to start domain test3 error: internal error: early end of file from monitor: possible problem: 2015-02-27T03:03:16.985494Z qemu-kvm: -numa memdev is not supported by machine rhel6.5.0 The check for domain liveness added to qemuDomainObjExitMonitor in commit `dc2fd51f` sometimes overwrites this error: $ virsh start test3 error: Failed to start domain test3 error: operation failed: domain is no longer running Fix the check to only report an error if there is none set. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-03-19 10:17:27 +01:00
John Ferlan	e06e6f1ee3	qemu: Fix two issues in qemuDomainSetVcpus error handling Issue #1 - A call to virBitmapNew did not check if the allocation failed which could lead to a NULL dereference Issue #2 - When deleting the pin entries from the config file, the code loops from the number of elements down to the "new" vcpu count; however, the pin id values are numbered 0..n-1 not 1..n, so the "first" pin attempt would never work. Luckily the check was for whether the incoming 'n' (vcpu id) matched the entry in the array from 0..arraysize rather than a dereference of the 'n' entry	2015-03-18 18:11:22 -04:00
Eric Blake	e2660cb8a6	qemu: track 'cancelling' migration state In qemu 2.3, the migration status will include 'cancelling' in the window between when an asynchronous cancel has been requested and when the migration is actually halted. Previously, qemu hid this state and reported 'active'. Libvirt manages the sequence okay even when the string is unrecognized (that is, it will report an unknown state: Migration: [ 69 %]^Cerror: internal error: unexpected migration status in cancelling. but the migration is still cancelled), but recognizing the string makes for a smoother user experience. * src/qemu/qemu_monitor.h (QEMU_MONITOR_MIGRATION_STATUS_CANCELLING): Add enum. * src/qemu/qemu_monitor.c (qemuMonitorMigrationStatus): Map it. * src/qemu/qemu_migration.c (qemuMigrationUpdateJobStatus): Adjust clients. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONGetMigrationStatusReply): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2015-03-18 14:59:34 -06:00
Laine Stump	451547a422	util: clean up #includes of virnetdevopenvswitch.h virnetdevopenvswitch.h declares a few functions that can be called to add ports to and remove them from OVS bridges, and retrieve the migration data for a port. It does not contain any data definitions that are used by domain_conf.h. But for some reason, domain_conf.h #includes virnetdevopenvswitch.h, when the files that actually use it should be directly #including it. This adds a few lines to the project, but saves all the files that don't need it from the extra computing, and makes the dependencies more clear cut.	2015-03-18 14:43:47 -04:00
zhang bo	39ac323063	util: vhost user: support for bootindex Problem Description: When we set boot order for a vhost-user network interface, we found the boot index doesn't work. Cause of the Problem: In the function qemuBuildVhostuserCommandLine(), the bootindex argument of qemuBuildNicDevStr() is forcibly set to 0, so the bootindex parameter is lost. Solution: Pass the bootindex argument down. Signed-off-by: Gao Haifeng <gaohaifeng.gao@huawei.com> Signed-off-by: Zhang Bo <oscar.zhangbo@huawei.com>	2015-03-18 18:39:09 +01:00
Jiri Denemark	18441ab914	Use PAUSED state for domains that are starting up When libvirt is starting a domain, it reports the state as SHUTOFF until it's RUNNING. This is not ideal because domain startup may take a long time (usually because of some configuration issues, firewalls blocking access to network disks, etc.) and domain lists provided by libvirt look awkward. One can see weird shutoff domains with IDs in a list of active domains or even shutoff transient domains. In any case, it looks more like a bug in libvirt than a normal state a domain goes through. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-03-18 10:08:22 +01:00
Michal Privoznik	3353c7c40a	qemuGetDHCPInterfaces: Don't leak @network The function needs a pointer to the network to get list of DHCP leases. The pointer is obtained via virNetworkLookupByName() which requires callers to free the returned network once no longer needed. Otherwise it's leaked. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-18 09:30:15 +01:00
Michal Privoznik	100fb08cee	qemuAgentGetInterfaces: Don't error out on missing HW address Now that we allow HW address to be not present on our RPC layer, don't error out if qemu-ga hasn't provided any. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-18 09:13:19 +01:00
Eric Blake	f9ea3d6011	qemu: read backing chain names from qemu https://bugzilla.redhat.com/show_bug.cgi?id=1199182 documents that after a series of disk snapshots into existing destination images, followed by active commits of the top image, it is possible for qemu 2.2 and earlier to end up tracking a different name for the image than what it would have had when opening the chain afresh. That is, when starting with the chain 'a <- b <- c', the name associated with 'b' is how it was spelled in the metadata of 'c', but when starting with 'a', taking two snapshots into 'a <- b <- c', then committing 'c' back into 'b', the name associated with 'b' is now the name used when taking the first snapshot. Sadly, older qemu doesn't know how to treat different spellings of the same filename as identical files (it uses strcmp() instead of checking for the same inode), which means libvirt's attempt to commit an image using solely the names learned from qcow2 metadata fails with a cryptic: error: internal error: unable to execute QEMU command 'block-commit': Top image file /tmp/images/c/../b/b not found even though the file exists. Trying to teach libvirt the rules on which name qemu will expect is not worth the effort (besides, we'd have to remember it across libvirtd restarts, and track whether a file was opened via metadata or via snapshot creation for a given qemu process); it is easier to just always directly ask qemu what string it expects to see in the first place. As a safety valve, we validate that any name returned by qemu still maps to the same local file as we have tracked it, so that a compromised qemu cannot accidentally cause us to act on an incorrect file. * src/qemu/qemu_monitor.h (qemuMonitorDiskNameLookup): New prototype. * src/qemu/qemu_monitor_json.h (qemuMonitorJSONDiskNameLookup): Likewise. * src/qemu/qemu_monitor.c (qemuMonitorDiskNameLookup): New function. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONDiskNameLookup) (qemuMonitorJSONDiskNameLookupOne): Likewise. * src/qemu/qemu_driver.c (qemuDomainBlockCommit) (qemuDomainBlockJobImpl): Use it. Signed-off-by: Eric Blake <eblake@redhat.com>	2015-03-17 11:57:53 -06:00
Antoni Segura Puimedon	d490f47ba3	network: Add midonet virtual port type support to qemu Use the utilities introduced in the previous patches so the qemu driver is able to create tap devices that are bound (and unbound on domain destroyal) to Midonet virtual ports. Signed-off-by: Antoni Segura Puimedon <toni+libvirt@midokura.com>	2015-03-17 13:10:17 -04:00
Peter Krempa	d0dc6c0369	qemu: driver: Fix cold-update of removable storage devices Only selected fields from the disk source were copied when cold updating source in a CDROM drive. When such drive was backed by a network file this resulted into corruption of the definition: <disk type='network' device='cdrom'> <driver name='qemu' type='raw' cache='none'/> <source protocol='gluster' name='gluster-vol1(null)'> <host name='localhost'/> </source> <target dev='vdc' bus='virtio'/> <readonly/> <address type='pci' domain='0x0000' bus='0x00' slot='0x0a' function='0x0'/> </disk> Update the whole source instead of cherry-picking elements. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1166024	2015-03-17 17:11:37 +01:00
Peter Krempa	e7974b4f80	qemu: hotplug: Use checker function to check if disk is empty	2015-03-17 17:11:37 +01:00
Nehal J Wani	0977b8aa07	domifaddr: Implement the API for qemu By querying the qemu guest agent with the QMP command "guest-network-get-interfaces" and converting the received JSON output to structured objects. Although "ifconfig" is deprecated, IP aliases created by "ifconfig" are supported by this API. The legacy syntax of an IP alias is: "<ifname>:<alias-name>". Since we want all aliases to be clubbed under parent interface, simply stripping ":<alias-name>" suffices. Note that IP aliases formed by "ip" aren't visible to "ifconfig", and aliases created by "ip" do not have any specific name. But we are lucky, as qemu guest agent detects aliases created by both. src/qemu/qemu_agent.h: * Define qemuAgentGetInterfaces src/qemu/qemu_agent.c: * Implement qemuAgentGetInterface src/qemu/qemu_driver.c: * New function qemuGetDHCPInterfaces * New function qemuDomainInterfaceAddresses src/remote_protocol-sructs: * Define new structs tests/qemuagenttest.c: * Add new test: testQemuAgentGetInterfaces Test cases for IP aliases, 0 or multiple ipv4/ipv6 address(es) Signed-off-by: Nehal J Wani <nehaljw.kkd1@gmail.com>	2015-03-17 15:15:38 +00:00
Martin Kletzander	ad69e8be4a	conf: Use correct type for balloon stats period We're parsing memballoon status period as unsigned int, but when we're trying to set it, both we and qemu use signed int. That means large values will get wrapped around to negative one resulting in error. Basically the same problem as commit `e3a7b874` was dealing with when updating live domain. QEMU changed the accepted value to int64 in commit 1f9296b5, but even values as INT_MAX don't make sense since the value passed means seconds. Hence adding capability flag for this change isn't worth it. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1140958 Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-03-17 12:06:14 +01:00
Martin Kletzander	4fca30e0bd	qemu: Don't duplicate errors when settings stats period In order not to leave old error messages set, this patch refactors the code so the error is reported only when acted upon. The only such place already rewrites any error, so cleaning up all the error reporting in qemuMonitorSetMemoryStatsPeriod() is enough. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-03-17 12:03:34 +01:00
Luyao Huang	4acd2bce26	qemu_command: Fix some indentation and a typo Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-17 11:27:26 +01:00
Peter Krempa	ee744b5b38	qemu: block-commit: Mark disk in block jobs only on successful command Patch `51f9f03a4c` introduces a regression where if a blockCommit operation fails the disk is still marked as being part of a block job but can't be unmarked later.	2015-03-16 17:21:13 +01:00
John Ferlan	4985dde0f5	Replace virDomainVcpuPinDel with virDomainPinDel Since both Vcpu and IOThreads code use the same API's, alter the naming of the API's to remove the "Vcpu" specific reference	2015-03-16 11:54:57 -04:00
John Ferlan	01c8f66275	Replace virDomainIOThreadsPinAdd with virDomainPinAdd This one is no longer necessary since the Vcpu and IOThreads API's share	2015-03-16 11:54:57 -04:00
John Ferlan	cacf27f212	Replace virDomainVcpuPinAdd with virDomainPinAdd Since both Vcpu and IOThreads code use the same API's, alter the naming of the API's to remove the "Vcpu" specific reference	2015-03-16 11:54:57 -04:00
John Ferlan	a8a89270ef	Convert virDomainVcpuPinFindByVcpu into virDomainPinFindByVcpu Since both Vcpu and IOThreads code use the same API's, alter the naming of the API's to remove the "Vcpu" specific reference	2015-03-16 11:54:57 -04:00
John Ferlan	c16723f606	Convert virDomainVcpuPinDefCopy into virDomainPinDefCopy Since both Vcpu and IOThreads code use the same API's, alter the naming of the API's to remove the "Vcpu" specific reference	2015-03-16 11:54:57 -04:00
John Ferlan	24b15ea90d	Convert virDomainVcpuPinDefArrayFree to virDomainPinDefArrayFree Since both Vcpu and IOThreads code use the same API's, alter the naming of the API's to remove the "Vcpu" specific reference	2015-03-16 11:54:57 -04:00
John Ferlan	0a06a1812d	Convert virDomainVcpuPinDefFree to virDomainPinDefFree Since both Vcpu and IOThreads code use the same API's, alter the naming of the API's to remove the "Vcpu" specific reference	2015-03-16 11:54:57 -04:00
John Ferlan	a9f528ab29	Convert virDomainPinDefPtr->vcpuid to virDomainPinDefPtr->id Since we're not specifically a vcpu related structure anymore...	2015-03-16 11:54:57 -04:00
John Ferlan	59ba70237a	Convert virDomainVcpuPinDefPtr to virDomainPinDefPtr As pointed out by jtomko in his review of the IOThreads pinning code: http://www.redhat.com/archives/libvir-list/2015-March/msg00495.html there are some comments sprinkled in indicating IOThreads were using the same structure as the VcpuPin code... This is the first patch of a few that will change the virDomainVcpuPin* structures and code to just virDomainPin* - starting with the data structure naming...	2015-03-16 11:54:56 -04:00
John Ferlan	ffa9f8a19b	qemu: Fix possible memory leak in qemuDomainPinVcpuFlags During his review of the iothreads pin setting code, Pavel noted that there was a potential memory leak with respect to how the newVcpuPin is handled and the goto endjob's in failure paths which would not free the memory. For reference, See: http://www.redhat.com/archives/libvir-list/2015-March/msg00415.html	2015-03-16 11:54:56 -04:00
Peter Krempa	57b215ab25	qemu: command: Add helper to align memory sizes The memory sizes in qemu are aligned up to 1 MiB boundaries. There are two places where this was done once for the total size and then for individual NUMA cell sizes. Add a function that will align the sizes in one place so that it's clear where the sizes are aligned.	2015-03-16 14:32:20 +01:00
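The alignment described in the commit above can be illustrated with a minimal sketch. It assumes sizes are tracked in KiB and rounds them up to the next 1 MiB boundary; the name `align_kib_to_mib` is hypothetical and is not the helper the commit adds.

```c
#include <stdint.h>

/* Hypothetical helper (not libvirt's actual function): round a size
 * given in KiB up to the next 1 MiB (1024 KiB) boundary. Sizes that
 * are already aligned are returned unchanged. Overflow for values
 * within 1 MiB of UINT64_MAX is not handled in this sketch. */
static uint64_t
align_kib_to_mib(uint64_t size_kib)
{
    const uint64_t kib_per_mib = 1024;
    uint64_t rem = size_kib % kib_per_mib;

    return rem ? size_kib + (kib_per_mib - rem) : size_kib;
}
```

Calling one such helper for both the total size and each NUMA cell size keeps the rounding rule in a single place, which is the point the commit message makes.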
Peter Krempa	4f9907cd11	conf: Replace access to def->mem.max_balloon with accessor functions As there are two possible approaches to define a domain's memory size - one used with legacy, non-NUMA VMs configured in the <memory> element and per-node based approach on NUMA machines - the user needs to make sure that both are specified correctly in the NUMA case. To avoid this burden on the user I'd like to replace the NUMA case with automatic totaling of the memory size. To achieve this I need to replace direct access to the virDomainMemtune's 'max_balloon' field with two separate getters depending on the desired size. The two sizes are needed as: 1) Startup memory size doesn't include memory modules in some hypervisors. 2) After startup these count as the usable memory size. Note that the comments for the functions are future aware and document state that will be present after a few later patches.	2015-03-16 14:26:51 +01:00
Peter Krempa	51f9f03a4c	qemu: Disallow concurrent block jobs on a single disk While qemu may be prepared to do this libvirt is not. Forbid the block ops until we fix our code.	2015-03-16 11:22:29 +01:00
Peter Krempa	1a92c71910	qemu: event: Don't fiddle with disk backing trees without a job Surprisingly we did not grab a VM job when a block job finished and we'd happily rewrite the backing chain data. This made it possible to crash libvirt when queueing two backing chains tightly and other badness. To fix it, add yet another handler to the helper thread that handles monitor events that require a job.	2015-03-16 10:57:33 +01:00
Peter Krempa	5c634730b9	qemu: process: Export qemuProcessFindDomainDiskByAlias	2015-03-16 10:57:33 +01:00
Erik Skultety	8464616526	qemu: Check for negative port values in network drive configuration We interpret port values as signed int (convert them from char *), so if a negative value is provided in network disk's configuration, we accept it as valid, however there's an 'unknown cause' error raised later. This error is only accidental because we return the port value in the return code. This patch adds just a minor tweak to the already existing check so we reject negative values the same way as we reject non-numerical strings. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1163553	2015-03-16 09:46:43 +01:00
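A rough sketch of the kind of check the commit above describes: a port string is rejected when it is non-numeric or negative. The function name `parse_port`, the out-parameter design, and the 65535 upper bound are assumptions made for illustration; they are not taken from the libvirt sources.

```c
#include <errno.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical parser (illustration only): stores the port in *port
 * and returns 0 on success, or returns -1 without touching *port when
 * the string is empty, non-numeric, negative, or above 65535 (the
 * upper bound is an assumption of this sketch). */
static int
parse_port(const char *str, int *port)
{
    char *end = NULL;
    long val;

    if (!str || !*str || !port)
        return -1;

    errno = 0;
    val = strtol(str, &end, 10);

    /* Reject conversion errors and trailing garbage such as "80x". */
    if (errno != 0 || end == str || *end != '\0')
        return -1;

    /* Reject negative values the same way as non-numeric strings. */
    if (val < 0 || val > 65535)
        return -1;

    *port = (int)val;
    return 0;
}
```

Returning the status separately from the parsed value avoids the situation the commit message mentions, where a negative port passed back through the return code only looked like an error by accident.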
Ján Tomko	22fd3ac38f	Introduce virBitmapIsBitSet A helper that never returns an error and treats bits out of bitmap range as false. Use it everywhere we use ignore_value on virBitmapGetBit, or loop over the bitmap size.	2015-03-13 15:31:33 +01:00
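The semantics of such a helper can be sketched with a self-contained stand-in. The `struct bitmap` type and the functions `bitmap_get_bit` and `bitmap_is_bit_set` below are hypothetical simplifications, not libvirt's virBitmap implementation; they only show the difference between a getter that can fail and a predicate that treats out-of-range bits as false.

```c
#include <limits.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical minimal bitmap standing in for libvirt's virBitmap. */
struct bitmap {
    size_t nbits;        /* number of valid bits */
    unsigned long *map;  /* backing storage */
};

#define BITS_PER_UNIT (sizeof(unsigned long) * CHAR_BIT)

/* Error-returning getter: -1 if the bit is out of range, else 0. */
static int
bitmap_get_bit(const struct bitmap *bm, size_t bit, bool *result)
{
    if (!bm || bit >= bm->nbits)
        return -1;
    *result = (bm->map[bit / BITS_PER_UNIT] >> (bit % BITS_PER_UNIT)) & 1UL;
    return 0;
}

/* Predicate that never fails: out-of-range bits count as unset. */
static bool
bitmap_is_bit_set(const struct bitmap *bm, size_t bit)
{
    bool set = false;

    if (bitmap_get_bit(bm, bit, &set) < 0)
        return false;
    return set;
}
```

With a predicate like this, callers that previously discarded the getter's return value can test a bit directly in a condition.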
Ján Tomko	a00e5c662b	Error out on an address for isa-serial in QEMU driver. We've never formatted them on the qemu command line. https://bugzilla.redhat.com/show_bug.cgi?id=1164053	2015-03-12 09:13:31 +01:00
John Ferlan	fb562614e3	qemu: Add support to pin IOThreads to specific CPU Add qemuDomainPinIOThread to handle setting the CPU affinity for a specific IOThread	2015-03-11 12:23:55 -04:00
Peter Krempa	d3534a43e6	qemu: monitor: Kill qemuMonitorGetBlockStats(Info,ParamsNumber) The functions and their QMP and HMP implementations are no longer needed since everything is now done via the *AllStats functions.	2015-03-11 11:28:04 +01:00
Peter Krempa	e045587dda	qemu: blockstats: Refactor qemuDomainBlockStatsFlags Now that qemuDomainBlocksStatsGather provides functions of both qemuMonitorGetBlockStatsParamsNumber and qemuMonitorGetBlockStatsInfo we can reuse it and kill a lot of code. Additionally as a bonus qemuDomainBlockStatsFlags will now support summary statistics so add a statement to the virsh man page about that. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1142636	2015-03-11 11:28:04 +01:00
Peter Krempa	cf093414c5	qemu: blockstats: Add support for totalled block statistics In the LXC driver, if the disk path is not provided the API returns total statistics for all disks of the domain. With the new text monitor implementation this can be now done in the qemu driver too. Add code that will total the stats for all disks if the path is not provided.	2015-03-11 11:28:04 +01:00
Peter Krempa	79966411cd	qemu: Split out working code qemuDomainBlockStats Extract the code to look up the disk alias and return the block stats struct so that it can be reused later in qemuDomainBlockStatsFlags. The function uses qemuMonitorGetAllBlockStatsInfo instead of qemuMonitorGetBlockStatsInfo.	2015-03-11 11:28:04 +01:00
Peter Krempa	bdc05128d7	qemu: monitor: Count block stats fields in qemuMonitorGetAllBlockStatsInfo Our virDomainBlockStatsFlags API uses the old approach where, when it's called without the typed parameter array, returns the count of parameters supported by qemu. The supported parameter count is obtained via separate monitor calls which is a waste since we can calculate it when gathering the data. This patch adds code to the qemuMonitorGetAllBlockStatsInfo workers that allows to track the count of supported fields reported by qemu and will allow to remove the old duplicate code.	2015-03-11 11:28:04 +01:00
Peter Krempa	4d8ebb7a35	qemu: monitor: Convert common code to a macro The function that is extracting block stats data from the QMP monitor reply contains a lot of repeated code. Since I'd be changing each of the copies in the next patch, lets convert it to a macro right away.	2015-03-11 11:28:04 +01:00
Peter Krempa	f6563bc361	qemu: monitor: Implement HMP version for listing all block device stats Add a different version of parser for "info blockstats" that basically parses the same information as the existing copy of the function. This will allow us to remove the single device version qemuMonitorGetBlockStatsInfo in the future. The new implementation uses few new helpers so it should be more understandable and provides a test case to verify that it works.	2015-03-11 11:28:04 +01:00
Peter Krempa	4f6b6788c4	qemu: blockstats: Switch to caller allocated hash table Allocate the hash table in the monitor wrapper function instead of the worker itself so that the text monitor impl that will be added in the next patch doesn't have to duplicate it.	2015-03-11 11:28:03 +01:00
Peter Krempa	32288fc9b9	qemu: monitor: Drop parsing of 'errs' from block info The error count statistic is not supported by qemu, so there's no need to pass the variables around if the result is ignored anyways.	2015-03-11 11:28:03 +01:00
Peter Krempa	7e9fb15145	qemu: Use macro to set block stats typed parameters All the setters are the same code except for parameter name and variable, so they can be converted to a macro to save a ton of duplicated code.	2015-03-11 11:28:03 +01:00
Luyao Huang	64595431cd	qemu: Remove unnecessary virReportError on networkGetNetworkAddress return Error messages are already set in all code paths returning -1 from networkGetNetworkAddress, so we don't want to overwrite them. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-03-10 17:29:28 -04:00
Michal Privoznik	954427c354	virQEMUCapsInitQMP: Don't dispose locked @vm When creating qemu capabilities, a dummy virDomainObj is created just because our monitor code expects that. However, the object is created locked already. Then, under cleanup label, we simply unref the object which results in whole domain object to be disposed. The object lock is destroyed subsequently, but hey - it's still locked: ==24845== Thread #14's call to pthread_mutex_destroy failed ==24845== with error code 16 (EBUSY: Device or resource busy) ==24845== at 0x4C3024E: pthread_mutex_destroy (in /usr/lib64/valgrind/vgpreload_helgrind-amd64-linux.so) ==24845== by 0x531F72E: virMutexDestroy (virthread.c:83) ==24845== by 0x5302977: virObjectLockableDispose (virobject.c:237) ==24845== by 0x5302A89: virObjectUnref (virobject.c:265) ==24845== by 0x1DD37866: virQEMUCapsInitQMP (qemu_capabilities.c:3397) ==24845== by 0x1DD37CC6: virQEMUCapsNewForBinary (qemu_capabilities.c:3481) ==24845== by 0x1DD381E2: virQEMUCapsCacheLookup (qemu_capabilities.c:3609) ==24845== by 0x1DD30F8A: virQEMUCapsInitGuest (qemu_capabilities.c:744) ==24845== by 0x1DD31889: virQEMUCapsInit (qemu_capabilities.c:1020) ==24845== by 0x1DD7DD36: virQEMUDriverCreateCapabilities (qemu_conf.c:888) ==24845== by 0x1DDC57C0: qemuStateInitialize (qemu_driver.c:803) ==24845== by 0x53DC743: virStateInitialize (libvirt.c:777) ==24845== Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-10 17:32:12 +01:00
Laine Stump	705242f880	qemu: don't fill in nicindexes for session mode libvirtd Commit `4bbe1029f` fixed a problem in commit `f7afeddc` by moving the call to virNetDevGetIndex() to a location common to all interface types (so that the nicindex array would be filled in for macvtap as well as tap interfaces), but the location was too common, as the original call to virNetDevGetIndex() had been in a section qualified by "if (cfg->privileged)". The result was that the "fixed" libvirtd would try to call virNetDevGetIndex() even for session mode libvirtd, and end up failing with the log message: Unable to open control socket: Operation not permitted To remedy that, this patch qualifies the call to virNetDevGetIndex() in its new location with cfg->privileged. This resolves https://bugzilla.redhat.com/show_bug.cgi?id=1198244	2015-03-10 07:53:10 -04:00
John Ferlan	86ef105a1d	qemu: Change qemuDomainGetEmulatorPinInfo bitmap manipulation Follow-up to the IOThread review on CPU affinity map manipulation: http://www.redhat.com/archives/libvir-list/2015-March/msg00294.html indicates that the GetEmulatorPinInfo could use similar algorithm adjustments which is what this patch does.	2015-03-09 08:11:49 -04:00
John Ferlan	10f2740ae3	qemu: Change qemuDomainGetVcpuPinInfo bitmap manipulation Follow-up to the IOThread review on CPU affinity map manipulation: http://www.redhat.com/archives/libvir-list/2015-March/msg00294.html indicates that the GetVcpuPinInfo could use similar algorithm adjustments which is what this patch does.	2015-03-09 08:11:49 -04:00
John Ferlan	fcb13292b1	qemu: Resolve Coverity CHECKED_RETURN issue By adding a call and check of return of virBitmapToData to the IOThreads code, my Coverity checker lets me know qemuDomainHelperGetVcpus also needs to check the status...	2015-03-09 08:11:49 -04:00
John Ferlan	daefe3f300	qemu: Change/Fix IOThread CPU affinity bitmap manipulation Based on review: http://www.redhat.com/archives/libvir-list/2015-March/msg00294.html Adjust how the cpumap and cpumaplen to be returned are generated	2015-03-09 08:11:43 -04:00
John Ferlan	82649eb7f1	qemu: Implement the qemu driver fetch for IOThreads Depending on the flags passed, either attempt to return the active/live IOThread data for the domain or the config data. The active/live path will call into the Monitor in order to get the IOThread data and then correlate the thread_id's returned from the monitor to the currently running system/threads in order to ascertain the affinity for each iothread_id. The config path will map each of the configured IOThreads and return any configured iothreadspin data Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-03-06 06:49:32 -05:00
Pavel Hrdina	cf521fc8ba	memtune: change the way how we store unlimited value There was a mess in how we stored the unlimited value for memory limits and how we handled values provided by the user. Internally there were two possible ways to store the unlimited value: as 0 or as VIR_DOMAIN_MEMORY_PARAM_UNLIMITED. Because we chose to store memory limits as unsigned long long, we cannot use -1 to represent unlimited. It's much easier for us to say that everything greater than VIR_DOMAIN_MEMORY_PARAM_UNLIMITED means unlimited and leave 0 as a valid value, even though it makes no sense to set a limit to 0. Remove the unnecessary function virCompareLimitUlong. The test update prevents 0 from being misused as unlimited in the future. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1146539 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-03-06 11:52:24 +01:00
Stefan Berger	9954a8bfc2	qemu: Pass file descriptor when using TPM passthrough Pass the TPM file descriptor to QEMU via command line. Instead of passing /dev/tpm0 we now pass /dev/fdset/10 and the additional parameters -add-fd set=10,fd=20. This addresses the use case when QEMU is started with non-root privileges and QEMU cannot open /dev/tpm0 for example. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2015-03-05 18:57:06 -05:00
Stefan Berger	42bee147fe	qemu: Move TPM command line build code into own function Move the TPM command line build code into its own function. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2015-03-05 18:57:06 -05:00
Michal Privoznik	5aee81a0cb	qemu: Allow spaces in disk serial https://bugzilla.redhat.com/show_bug.cgi?id=1195660 There's been a bug report appearing on the qemu-devel list, that libvirt is unable to pass spaces in disk serial number [1]. Not only our RNG schema forbids that, the code is not prepared either. However, with a bit of escaping (if needed) we can allow spaces there. 1: https://lists.gnu.org/archive/html/qemu-devel/2015-02/msg04041.html Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-05 13:35:55 +01:00
Shanzhi Yu	f7c1410b0e	qemu: snapshot: Don't skip check for qcow2 format with network disks When the domain's source disk type is network and the source protocol is rbd or sheepdog, the 'if().. break' ends the current case, which skips the check of whether the driver type is raw or qcow2. As a result, libvirt allows creating an internal snapshot of a running domain with a raw format disk backed by rbd storage. While both protocols support internal snapshots of the disk, qemu is not able to use them as it requires some place to store the memory image. The check whether the disk is backed by a qcow2 image needs to be executed always. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1179533 Signed-off-by: Shanzhi Yu <shyu@redhat.com>	2015-03-04 14:13:06 +01:00
Peter Krempa	3923d98e05	qemu: Allow inactive domains in qemuDomainGetControlInfo() Inactive domains can still be stuck in a job or other problems. Add a way to detect it.	2015-03-04 10:41:30 +01:00
Peter Krempa	31a55c7cb4	qemu: Properly report error state in qemuDomainGetControlInfo() Previously when a domain would get stuck in a domain job due to a programming mistake we'd report the following control state: $ virsh domcontrol domain occupied (1424343406.150s) The timestamp is invalid as the monitor was not entered for that domain. We can use that to detect that the domain has an active job and report a better error instead: $ virsh domcontrol domain error: internal (locking) error	2015-03-04 10:41:30 +01:00
Michal Privoznik	63889e0c77	qemuProcessReconnect: Fill in pid file path https://bugzilla.redhat.com/show_bug.cgi?id=1197600 So, libvirt uses a pid file to track the pid of each started qemu. Whenever a domain is started, its pid is put into the corresponding pid file. The pid file path is generated based on the domain name and stored in the domain object internals. However, it's not stored in the status XML and is therefore lost on daemon restarts. Hence, later, when the domain is being shut down, the daemon does not know which pid file to unlink, and the correct pid file is left behind. To avoid this, let's generate the pid file path again in qemuProcessReconnect(). Reported-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-03-03 12:10:15 +01:00
Pavel Hrdina	a16e5f0a91	qemu: check defaultMode for spice graphics independently Instead of checking defaultMode for every channel that has no mode configured, test it only once outside of the channel loop. This fixes a bug where, if all possible channels are for example set to insecure but defaultMode is set to secure, we wouldn't auto-generate a TLS port. This results in a failure while starting a guest. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1143832 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-03-03 11:42:33 +01:00
Pavel Hrdina	e4983952b4	qemu: remove duplicated code for allocating spice ports We have two different places that need to be updated when touching the code for allocating spice ports. Add a bool option to the 'qemuProcessSPICEAllocatePorts' function to switch between true and fake allocation so we can also use this function in qemu_driver to generate a native domain definition. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-03-03 11:41:46 +01:00
Martin Kletzander	2fd5880b3b	conf: De-duplicate scheduling policy enums Since adding the support for scheduler policy settings in commit `8680ea97`, there are two enums with the same information. That was caused by rewriting the patch since the first draft. Found out thanks to clang, but there was no impact whatsoever. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-03-03 09:26:59 +01:00
Martin Kletzander	b3ea0a8fb8	qemu: Don't crash in qemuDomainOpenChannel() The problem here was that when opening a channel, we were checking whether the channel given is an alias (can't be NULL for a running domain) or its name, which can be NULL (for example with spicevmc). In case of such a domain qemuDomainOpenChannel() made the daemon crash. STREQ_NULLABLE() is safe to use since the code in question is wrapped in "if (name)" and is more readable, so use that instead of checking for non-NULL "vm->def->channels[i]->target.name". Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-03-03 09:26:59 +01:00
John Ferlan	e0e290552b	disk: Disallow duplicated target 'dev' values https://bugzilla.redhat.com/show_bug.cgi?id=1142631 This patch resolves a situation where the same "<target dev='$name'...>" can be used for multiple disks in the domain. While the $name is "mostly" advisory regarding the expected order that the disk is added to the domain and not guaranteed to map to the device name in the guest OS, it still should be unique enough such that other domblk* type operations can be performed. Without the patch, the domblklist will list the same Target twice: $ virsh domblklist $dom Target Source ------------------------------------------------ sda /var/lib/libvirt/images/file.qcow2 sda /var/lib/libvirt/images/file.img Additionally, getting domblkstat, domblkerror, domblkinfo, and other block* type calls will not be able to reference the second target. Fortunately, hotplug disallows adding a "third" sda value: $ qemu-img create -f raw /var/lib/libvirt/images/file2.img 10M $ virsh attach-disk $dom /var/lib/libvirt/images/file2.img sda error: Failed to attach disk error: operation failed: target sda already exists $ BUT, since 'sdb' doesn't exist, one would get the following on the same hotplug attempt when using 'sdb' instead of 'sda': $ virsh attach-disk $dom /var/lib/libvirt/images/file2.img sdb error: Failed to attach disk error: internal error: unable to execute QEMU command 'device_add': Duplicate ID 'scsi0-0-1' for device $ Since we cannot fix this issue at parsing time, the best that can be done so as to not "lose" a domain is to make the check prior to starting the guest with the results as follows: $ virsh start $dom error: Failed to start domain $dom error: XML error: target 'sda' duplicated for disk sources '/var/lib/libvirt/images/file.qcow2' and '/var/lib/libvirt/images/file.img' $ Running 'make check' found a few more instances in the tests where this duplicated target dev value was being used. These also exhibited some duplicated 'id=' values (negating the uniqueness argument of aliases) in the corresponding .args file and of course the *xmlout version of a few input XML files.	2015-03-02 22:38:36 -05:00
Peter Krempa	389634e667	qemu: Forbid setting maximum memory size with the API with NUMA enabled NUMA enabled guest configuration explicitly specifies memory sizes for individual nodes. Allowing the virDomainSetMemoryFlags API (and friends) to change the total doesn't make sense as the individual node configs are not updated in that case. Forbid use of the API in case NUMA is specified.	2015-03-02 16:41:32 +01:00
Ján Tomko	723522328f	Check if domain is running in qemuDomainAgentIsAvailable If the domain is not running, the agent will not respond. Do not even try. https://bugzilla.redhat.com/show_bug.cgi?id=872424	2015-03-02 08:07:56 +01:00
Ján Tomko	fbb94044ea	Pass virDomainObjPtr to qemuDomainAgentAvailable Not just the DomainObj's private data.	2015-03-02 08:07:56 +01:00
Ján Tomko	c8b80b49f6	Check for qemu guest agent availability after getting the job This way checks requiring the job can be done in qemuDomainAgentAvailable.	2015-03-02 08:07:56 +01:00
Ján Tomko	496156807b	Implement VIR_CONNECT_BASELINE_CPU_MIGRATABLE in the x86 cpu driver Filter out non-migratable features if VIR_CONNECT_BASELINE_CPU_MIGRATABLE was specified.	2015-03-02 07:59:12 +01:00
Ján Tomko	995ca6cbf3	Use virBufferTrim when generating boot options Instead of tracking the number of added parameters, add a comma at the end of each one unconditionally and trim the trailing one at the end.	2015-03-02 07:39:09 +01:00
Ján Tomko	354425dcd2	Make -boot arg generation more readable If we combine the boot order on the command line with other boot options, we prepend order= in front of it. Instead of checking if the number of added arguments is between 0 and 2, separate the strings for boot order and options and prepend boot order only if both strings are not empty.	2015-03-02 07:39:09 +01:00
Ján Tomko	92572c3d71	Remove code handling the QEMU_CAPS_DOMID capability This option is xenner-only (since commit `b81a7ece`), and we dropped support for xenner in commit `de9be0a`.	2015-03-02 07:39:09 +01:00
Ján Tomko	9aa316612a	Remove bootloader option from QEMU It was only supported by xenner (since commit `763a59d8`), for which we removed support in commit `de9be0a`. Remove the code generating this command line option, refuse to parse it and delete the outdated tests. https://bugzilla.redhat.com/show_bug.cgi?id=1176050	2015-03-02 07:39:09 +01:00
Ján Tomko	c75f42f331	Really fix XML formatting flags in SaveImageUpdateDef Commit `cf2d4c6` used a logical or instead of bitwise or, effectively passing 1, that is VIR_DOMAIN_XML_INACTIVE. This was caught by a warning when building with clang. https://bugzilla.redhat.com/show_bug.cgi?id=1183869	2015-02-27 12:01:31 +01:00
Laine Stump	4bbe1029f2	qemu: fix ifindex array reported to systemd Commit `f7afeddc` added code to report to systemd an array of interface indexes for all tap devices used by a guest. Unfortunately it not only didn't add code to report the ifindexes for macvtap interfaces (interface type='direct') or the tap devices used by type='ethernet', it ended up sending "-1" as the ifindex for each macvtap or hostdev interface. This resulted in a failure to start any domain that had a macvtap or hostdev interface (or actually any type other than "network" or "bridge"). This patch does the following with the nicindexes array: 1) Modify qemuBuildInterfaceCommandLine() to only fill in the nicindexes array if given a non-NULL pointer to an array (and modify the test jig calls to the function to send NULL). This is because there are tests in the test suite that have type='ethernet' and still have an ifname specified, but that device of course doesn't actually exist on the test system, so attempts to call virNetDevGetIndex() will fail. 2) Even then, only add an entry to the nicindexes array for appropriate types, and do so for all appropriate types ("network", "bridge", and "direct"), but only if the ifname is known (since that is required to call virNetDevGetIndex()).	2015-02-25 13:11:14 -05:00
Laine Stump	118b240808	network: only clear bandwidth if it has been set libvirt was unconditionally calling virNetDevBandwidthClear() for every interface (and network bridge) of a type that supported bandwidth, whether it actually had anything set or not. This doesn't hurt anything (unless ifname == NULL!), but is wasteful. This patch makes sure that all calls to virNetDevBandwidthClear() are qualified by checking that the interface really had some bandwidth setup done, and checks for a null ifname inside virNetDevBandwidthClear(), silently returning success if it is null (as well as removing the ATTRIBUTE_NONNULL from that function's prototype, since we can't guarantee that it is never null, e.g. sometimes a type='ethernet' interface has no ifname as it is provided on the fly by qemu).	2015-02-25 13:09:34 -05:00
Yuri Chornoivan	8a833d1eb0	Fix typos in messages Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-02-25 14:12:51 +01:00
Ján Tomko	52a166f493	Assign default SCSI controller model before checking attribute validity If the qemu binary on x86 does not support lsi SCSI controller, but it supports virtio-scsi, we reject the virtio-specific attributes for no reason. Move the default controller assignment before the check. https://bugzilla.redhat.com/show_bug.cgi?id=1168849	2015-02-25 10:04:58 +01:00
Michal Privoznik	cf2d4c603c	qemu: Use correct flags for ABI stability check in SaveImageUpdateDef https://bugzilla.redhat.com/show_bug.cgi?id=1183869 Soo. you've successfully started yourself a domain. And since you want to use it on your host exclusively you are confident enough to passthrough the host CPU model, like this: <cpu mode='host-passthrough'/> Then, after a while, you want to save the domain into a file (e.g. virsh save dom dom.save). And here comes the trouble. The file consists of two parts: Libvirt header (containing domain XML among other things), and qemu migration data. Now, the domain XML in the header is formatted using special flags (VIR_DOMAIN_XML_SECURE | VIR_DOMAIN_XML_UPDATE_CPU | VIR_DOMAIN_XML_INACTIVE | VIR_DOMAIN_XML_MIGRATABLE). Then, on your way back from the bar, you think of changing something in the XML in the saved file (we have a command for it after all), say listen address for graphics console. So you successfully type in the command: virsh save-image-edit dom.save Change all the bits, and exit the editor. But instead of success you're left with a sad error message: error: unsupported configuration: Target CPU model <null> does not match source Pentium Pro Sigh. Digging into the code you see lines, where we check for ABI stability. The new XML you've produced is compared with the old one from the saved file to see if qemu ABI will break or not. Wait, what? We are using different flags to parse the XML you've provided so we were just lucky it worked in some cases? Yep, that's right. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-25 09:28:54 +01:00
Pavel Hrdina	efd30e2e1c	qemu: fix memory leak while starting a guest In commit `cc41c648` I've re-factored qemuMonitorFindBalloonObjectPath, but missed that there is a memory leak. The "nextpath" variable is overwritten while looping in for cycle and we have to free it before next cycle. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-02-24 16:38:50 +01:00
Stefan Zimmermann	8e6ee9f280	Rework s390 architecture checking Making use of the ARCH_IS_S390 macro introduced with `e808357528` Signed-off-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com>	2015-02-23 14:51:02 -05:00
Stefan Zimmermann	09ab9dcc85	Prevent default creation of usb controller on s390 and s390x Since s390 does not support usb, the default creation of a usb controller for a domain should not occur. Also adjust the s390 test cases by removing usb device instances, since usb devices are no longer created by default for s390. Signed-off-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com>	2015-02-23 14:50:15 -05:00
Cole Robinson	f2f1e388e1	qemu: Fix AAVMF/OVMF #define names The AAVMF and OVMF names were swapped. Reorder the one usage where it matters so behavior doesn't change.	2015-02-21 14:44:46 -05:00
Peter Krempa	103707d4b7	qemu: caps: Add capability bit for the "pc-dimm" device The pc-dimm device represents a RAM memory module.	2015-02-20 19:25:09 +01:00
Peter Krempa	181742d43f	conf: Move all NUMA configuration to virDomainNuma For historical reasons data regarding NUMA configuration were split between the CPU definition and numatune. We cannot do anything about the XML still being split, but we certainly can at least store the relevant data in one place. This patch moves the NUMA stuff to the right place.	2015-02-20 17:50:08 +01:00
Peter Krempa	b9ddb25822	conf: numa: Add setter/getter for NUMA node memory size Add the helpers and refactor places where the value is accessed without them.	2015-02-20 17:50:08 +01:00
Peter Krempa	7800d473f5	conf: numa: Add accessor to NUMA node's memory access mode	2015-02-20 17:50:08 +01:00
Peter Krempa	d9a779a36e	conf: numa: Add accessor for the NUMA node cpu mask Add virDomainNumaGetNodeCpumask() and refactor a few places that would get the cpu mask without the helper.	2015-02-20 17:50:08 +01:00
Peter Krempa	be22d07315	conf: numa: Add helper to get guest NUMA node count and refactor users Add an accessor so that a later refactor is simpler.	2015-02-20 17:50:07 +01:00
Peter Krempa	ba2183a331	qemu: command: Unify retrieval of NUMA cell count in qemuBuildNumaArgStr The function uses the cell count in 6 places. Add a temp variable to hold the count as it will greatly simplify the refactor.	2015-02-20 17:50:07 +01:00
Peter Krempa	fa9930720b	numa: conf: Tweak parameters of virDomainNumatuneSet As virDomainNumatuneSet now doesn't allocate the virDomainNuma object any longer it's not necessary to pass the pointer to a pointer to store the object as it will not change any longer. While touching the parameter definitions I've also changed the name of the parameter to "numa".	2015-02-20 17:50:07 +01:00
Peter Krempa	c03411199e	conf: Allocate domain definition with the new helper Use the virDomainDefNew() helper to allocate the definition instead of doing it via VIR_ALLOC.	2015-02-20 17:43:05 +01:00
Peter Krempa	a3673b225d	conf: Move enum virMemAccess to the NUMA code and rename it Name it virNumaMemAccess and add it to conf/numa_conf.[ch] Note that to avoid a circular dependency the type of the NUMA cell memAccess variable was changed to int. It will be turned back later, once the circular dependency no longer exists.	2015-02-20 17:43:04 +01:00
Peter Krempa	6bc80fa86d	conf: numa: Rename virDomainNumatune to virDomainNuma The structure will gradually become the only place for NUMA related config, thus rename it appropriately.	2015-02-20 17:43:04 +01:00
Michal Privoznik	af20423264	virQEMUCapsCacheLookupCopy: Filter qemuCaps based on machineType Not all machine types support all devices, device properties, backends, etc. So until we create a matrix of [machineType, qemuCaps], let's just filter out some capabilities before we return them to the consumer (which is going to make decisions based on them straight away). Currently, as qemu is unable to tell which capabilities are (not) enabled for given machine types, it's us who has to hardcode the matrix. One day maybe the hardcoding will go away and we can create the matrix dynamically on the fly based on a few monitor calls. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-20 13:28:04 +01:00
Michal Privoznik	37cf163ab2	virQEMUCapsCacheLookupCopy: Pass machine type It will come in handy in the near future when we will filter some capabilities based on it. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-20 13:27:59 +01:00
Michal Privoznik	80c5f10e86	qemuMigrationDriveMirror: Listen to events https://bugzilla.redhat.com/show_bug.cgi?id=1179678 When migrating with storage, libvirt iterates over domain disks and instructs qemu to migrate the ones we are interested in (shared, RO and source-less disks are skipped). The disks are migrated in series. No new disk is transferred until the previous one has been quiesced. This is checked on the qemu monitor via 'query-jobs' command. If the disk has been quiesced, it practically went from copying its content to mirroring state, where all disk writes are mirrored to the other side of migration too. Having said that, there's one inherent error in the design. The monitor command we use reports only active jobs. So if the job fails for whatever reason, we will not see it anymore in the command output. And this can happen fairly simply: just try to migrate a domain with storage. If the storage migration fails (e.g. due to ENOSPC on the destination) we resume the host on the destination and let it run on partly copied disk. The proper fix is what even the comment in the code says: listen for qemu events instead of polling. If storage migration changes state an event is emitted and we can act accordingly: either consider disk copied and continue the process, or consider disk mangled and abort the migration. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-19 14:12:38 +01:00
Michal Privoznik	76c61cdca2	qemuProcessHandleBlockJob: Take status into account Upon BLOCK_JOB_COMPLETED event delivery, we check if the job has completed (in qemuMonitorJSONHandleBlockJobImpl()). For illustration, the event looks something like this: "timestamp": {"seconds": 1423582694, "microseconds": 372666}, "event": "BLOCK_JOB_COMPLETED", "data": {"device": "drive-virtio-disk0", "len": 8412790784, "offset": 409993216, "speed": 8796093022207, "type": "mirror", "error": "No space left on device"}} If "len" does not equal "offset" it's considered an error, and we can clearly see "error" field filled in. However, later in the event processing this case was handled no differently to case of job being aborted via separate API. It's time that we start differentiating these two because of the future work. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-19 14:12:38 +01:00
Michal Privoznik	c37943a068	qemuProcessHandleBlockJob: Set disk->mirrorState more often Currently, upon BLOCK_JOB_* event, disk->mirrorState is not updated each time. The callback code handling the events checks if a blockjob was started via our public APIs prior to setting the mirrorState. However, some block jobs may be started internally (e.g. during storage migration), in which case we don't bother with setting disk->mirror (there's nothing we can set it to anyway), or other fields. But it will come in handy if we update the mirrorState in these cases too. The event wasn't delivered just for fun - we've started the job after all. So, in this commit, the mirrorState is set to whatever job status we've obtained. Of course, there are some actions on some statuses that we want to perform. But instead of if {} else if {} else {} ... enumeration, let's move to switch(). Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-19 14:12:38 +01:00
Peter Krempa	0df2f0404f	qemu: Exit job on error path of qemuDomainSetVcpusFlags() Commit `e105dc9814` moved some code but didn't adjust the jump labels so that the job would be terminated.	2015-02-18 18:17:54 +01:00
Pavel Hrdina	77a9dc0b8d	qemu_cgroup: initialize mem_mask to NULL If 'virNumaGetHostNodeset()' fails then the error path will try to free uninitialized pointer mem_mask. Introduced by commit `af2a1f058`. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-02-17 14:22:50 +01:00
Prerna Saxena	5e4f49ab8a	PowerPC : Forbid NULL CPU model with 'host-model' mode. PowerPC : Forbid NULL CPU model with 'host-model' mode in qemu command line. This ensures that an XML such as following: ... <cpu mode='host-model'> <model fallback='allow'/> </cpu> ... will not generate a '-cpu host,compat=(null)' command line with qemu-system-ppc64. Signed-off-by: Prerna Saxena <prerna@linux.vnet.ibm.com>	2015-02-17 12:20:40 +01:00
Prerna Saxena	bdbe723fcd	PowerPC : Make 'qemu-system-ppc64' the default emulator on ppc64[le]. PowerPC : Explicitly associate 'qemu-system-ppc64' as the default emulator for all 64-bit PowerPC guests ( both Big & Little Endian ) Signed-off-by: Prerna Saxena <prerna@linux.vnet.ibm.com>	2015-02-17 12:20:40 +01:00
Luyao Huang	337265bb52	qemu: fix vm deadlock when try to use numatune in session mode https://bugzilla.redhat.com/show_bug.cgi?id=1126762 Commit 43b67f introduced a deadlock issue when we use numatune to change the numa settings of a vm in session mode. Jump to endjob instead of jumping to cleanup. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-02-17 11:08:00 +01:00
Michal Privoznik	7832fac847	qemuBuildMemoryBackendStr: Report backend requirement more appropriately So, when building the '-numa' command line, the qemuBuildMemoryBackendStr() function does quite a lot of checks to choose the best backend, or to check if one is in fact needed. However, it returned that backend is needed even for this little fella: <numatune> <memory mode="strict" nodeset="0,2"/> </numatune> This can be guaranteed via CGroups entirely, there's no need to use memory-backend-ram to let qemu know where to get memory from. Well, as long as there's no <memnode/> element, which explicitly requires the backend. Long story short, we wouldn't have to care, as qemu works either way. However, the problem is migration (as always). Previously, libvirt would have started qemu with: -numa node,memory=X in this case and restricted memory placement in CGroups. Today, libvirt creates more complicated command line: -object memory-backend-ram,id=ram-node0,size=X -numa node,memdev=ram-node0 Again, one wouldn't find anything wrong with these two approaches. Both work just fine. Unless you try to migrate from the older libvirt into the newer one. These two approaches are, unfortunately, not compatible. My suggestion is, in order to allow users to migrate, let's use the older approach for as long as the newer one is not needed. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-02-17 09:07:09 +01:00
Erik Skultety	c3d9d3bbc9	security: introduce virSecurityManagerCheckAllLabel function We do have a check for valid per-domain security model, however we still do permit an invalid security model for a domain's device (those which are specified with <source> element). This patch introduces a new function virSecurityManagerCheckAllLabel which compares user specified security model against currently registered security drivers. That being said, it also permits 'none' being specified as a device security model. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1165485 Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-02-13 14:37:54 +01:00
Ján Tomko	6ba5d1afec	Wire up mrg_rxbuf option for qemu <interface ...> ... <model type='virtio'/> <driver ...> <host mrg_rxbuf='off'/> </driver> </interface> will result in: -device virtio-net-pci,mrg_rxbuf=off,... https://bugzilla.redhat.com/show_bug.cgi?id=1186886	2015-02-13 12:31:38 +01:00
Daniel P. Berrange	9358b63a0d	qemu: do upfront check for vcpupids being null when querying pinning The qemuDomainHelperGetVcpus attempted to report an error when the vcpupids info was NULL. Unfortunately earlier code would clamp the value of 'maxinfo' to 0 when nvcpupids was 0, so the error reporting would end up being skipped. This led to 'virsh vcpuinfo <dom>' just returning an empty list instead of giving the user a clear error.	2015-02-12 10:02:50 +00:00
Daniel P. Berrange	a103bb105c	qemu: fix setting of VM CPU affinity with TCG In a previous commit I fixed the incorrect handling of vcpu pids for TCG mode QEMU: commit `b07f3d821d` Author: Daniel P. Berrange <berrange@redhat.com> Date: Thu Dec 18 16:34:39 2014 +0000 Don't setup fake CPU pids for old QEMU The code assumes that def->vcpus == nvcpupids, so when we setup fake CPU pids for old QEMU with nvcpupids == 1, we cause the later code to read off the end of the array. This has fun results like sched_setaffinity(0, ...) which changes libvirtd's own CPU affinity, or even better sched_setaffinity($RANDOM, ...) which changes the affinity of a random OS process. The intent was that this would merely disable the ability to set per-vCPU affinity. It should still have been possible to set VM level host CPU affinity. Unfortunately, when you set <vcpu cpuset='0-1'>4</vcpu>, the XML parser will internally take this & initialize an entry in the def->cputune.vcpupin array for every VCPU. IOW this is implicitly being treated as <cputune> <vcpupin cpuset='0-1' vcpu='0'/> <vcpupin cpuset='0-1' vcpu='1'/> <vcpupin cpuset='0-1' vcpu='2'/> <vcpupin cpuset='0-1' vcpu='3'/> </cputune> Even more fun, the faked cputune elements are hidden from view when querying the live XML, because their cpuset mask is the same as the VM default cpumask. The upshot was that it was impossible to set VM level CPU affinity. To fix this we must update qemuProcessSetVcpuAffinities so that it only reports a fatal error if the per-VCPU cpu mask is different from the VM level cpu mask. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2015-02-12 10:02:50 +00:00
Martin Kletzander	104ba5966a	qemu: Add support for setting vCPU and I/O thread scheduler setting Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1178986 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-02-11 17:30:07 +01:00
John Ferlan	92f09dab50	qemu: qemuOpenFileAs - set flag VIR_FILE_OPEN_FORCE_MODE In the event we're falling into the code that tries to create the file in a forked environment (VIR_FILE_OPEN_FORK) we pass different mode bits, but those are never set because the virFileOpenForceOwnerMode has a check if the OPEN_FORCE_MODE bit is set before attempting to change the mode. Since this is a special case it seems reasonable to set u+rw,g+rw,o	2015-02-11 07:29:29 -05:00
Luyao Huang	45853b5289	qemu: fix crash when migrateuri has no scheme https://bugzilla.redhat.com/show_bug.cgi?id=1191355 When we attempt to migrate a vm with a migrateuri that has no scheme: # virsh migrate test4 --live qemu+ssh://lhuang/system --migrateuri 127.0.0.1 target libvirtd will crash because uri->scheme is NULL in qemuMigrationPrepareDirect on this line: if (STRNEQ(uri->scheme, "tcp") && Add a value check before this line. Also fix a bug like this in doNativeMigrate, that could only happen when destination libvirtd returned an incorrect URI. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-02-11 13:20:30 +01:00
Ján Tomko	a7c9c7a6ab	Fix qemu job handling in SetSchedulerParameters Commit `c5ee5cf` added a job to SetSchedulerParameters, but forgot to change one label in the SCHED_RANGE_CHECK macro.	2015-02-10 14:36:03 +01:00
Luyao Huang	862473fa12	qemu: Implement random number generator hotunplug Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-02-10 13:05:23 +01:00
Luyao Huang	980b265d08	qemu: Implement random number generator hotplug Export the required helpers and add backend code to hotplug RNG devices. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-02-10 13:05:22 +01:00
Peter Krempa	fe6acfbd0e	qemu: Implement random number generator cold (un)plug Add support for using the attach/detach device APIs on the inactive configuration to add RNG devices.	2015-02-10 13:05:22 +01:00
Peter Krempa	25e2d89788	qemu: command: Refactor creation of RNG device commandline As the RNG device is using an -object as backend refactor the code to use the JSON to commandline generator so that we can reuse the code later in hotplug.	2015-02-10 13:05:22 +01:00
Peter Krempa	b9f2d781d9	qemu: command: Break some very long lines in qemuBuildRNGDevStr()	2015-02-10 13:05:22 +01:00
Peter Krempa	d7ec244f6e	qemu: command: Shuffle around formatting of alias for RNG device backend Move the alias name right after the object type for rng-egd backend so that we can later use the JSON to commandline generator to create the command line.	2015-02-10 13:05:22 +01:00
Luyao Huang	98e982b455	qemu: command: Make RNG backend device IDs unique Libvirt didn't prefix the random number generator backend object alias with any string thus the device alias and object alias were identical. To avoid possible problems, rename the alias for the backend object and tweak tests to comply with the change. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-02-10 13:05:22 +01:00
Luyao Huang	58a4eee81a	qemu: refactor qemuBuildRNGDeviceArgs to allow reuse in RNG hotplug Rename qemuBuildRNGDeviceArgs to qemuBuildRNGDevStr and change the return type so that it can be reused in the device hotplug code later. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-02-10 13:05:22 +01:00
Luyao Huang	3921d13581	qemu: Add helper to assign RNG device aliases This function is used to assign an alias for a RNG device. It will be later reused when hotplugging RNGs. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-02-10 13:05:22 +01:00
Ján Tomko	8e724e9f3e	Error out when custom tap device path makes no sense It is only usable for NETWORK and BRIDGE type interfaces. Error out when trying to start a domain where the custom tap device path is specified for interfaces of other types, or when the daemon is not privileged. Note that this cannot be checked at definition time, because the comparison is against actual type. https://bugzilla.redhat.com/show_bug.cgi?id=1147195	2015-02-06 12:52:50 +01:00
Daniel P. Berrange	95fd6a91c6	qemu: include libvirt & QEMU versions in QEMU log files It is often helpful to know which version of libvirt and QEMU was present when a guest was first launched. Ensure this info is written into the QEMU log file for each guest.	2015-02-06 10:22:07 +00:00
Luyao Huang	1b2c9ce752	qemu: Properly report error on uuid mismatch in the migration cookie Add the missing jump to the error label when the uuid in the migration cookie XML does not match the uuid of the migrated domain. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-02-05 08:14:36 +01:00
Daniel P. Berrange	b38da58423	Make tests independent of system page size Some code paths have special logic depending on the page size reported by sysconf, which in turn affects the test results. We must mock this so tests always have a consistent page size.	2015-02-02 20:27:43 +00:00
Peter Krempa	b92a003710	qemu: command: Don't combine old and modern NUMA node creation Change done by commit `f309db1f4d` wrongly assumes that qemu can start with a combination of NUMA nodes specified with the "memdev" option and the appropriate backends, and the legacy way by specifying only "mem" as a size argument. QEMU rejects such commandline though: $ /usr/bin/qemu-system-x86_64 -S -M pc -m 1024 -smp 2 \ -numa node,nodeid=0,cpus=0,mem=256 \ -object memory-backend-ram,id=ram-node1,size=12345 \ -numa node,nodeid=1,cpus=1,memdev=ram-node1 qemu-system-x86_64: -numa node,nodeid=1,cpus=1,memdev=ram-node1: qemu: memdev option must be specified for either all or no nodes To fix this issue we need to check if any of the nodes requires the new definition with the backend and if so, then all other nodes have to use it too. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1182467	2015-01-31 08:53:22 +01:00
Peter Krempa	8795adf7d1	qemu: command: Refactor NUMA backend object formatting to use JSON objs With the new JSON to argv formatter we are now able to represent the memory backend definitions in the JSON object format that is reusable for monitor use (hotplug) and then convert it into the shell string. This will avoid having two separate instances of the same code that would create the different formats. Previous refactors now allow to make this step without changes to the test suite.	2015-01-31 08:53:22 +01:00
Peter Krempa	b50b4ef30c	qemu: command: Switch to bytes when formatting size for memory backends QEMU's command line visitor as well as the JSON interface take bytes by default for memory object sizes. Convert mebibytes to bytes so that we can later refactor the existing code for hotplug purposes.	2015-01-31 08:53:22 +01:00
Peter Krempa	a47174c508	qemu: command: Unify values for boolean values when formatting memory backends QEMU's qapi visitor code allows yes/on/y for true and no/off/n for false value of boolean properties. Unify the used style so that we can generate it later and fix test cases.	2015-01-31 08:53:22 +01:00
Peter Krempa	172100ac85	qemu: command: Shuffle around formatting of alias for memory backend objs Move the alias as the second formatted argument and tweak the tests so that a future refactor that will change the order doesn't break tests.	2015-01-31 08:53:22 +01:00
Peter Krempa	db3b1c4a1c	qemu: Extract code to setup memory backing objects Extract the memory backend device code into a separate function so that it can be later easily refactored and reused. Few small changes for future reusability, namely: - new (currently unused) parameter for user specified page size - size of the memory is specified in kibibytes, divided up in the function - new (currently unused) parameter for user specified source nodeset - option to enforce capability check	2015-01-31 08:53:22 +01:00
Peter Krempa	331b2583ec	qemu: command: Add helper to format -object strings from JSON representation Unlike -device, qemu uses a JSON object to add backend "objects" via the monitor rather than the string that would be passed on the commandline. To be able to reuse code parts that configure backends for various devices, this patch adds a helper that will allow generating the command line representations from the JSON property object.	2015-01-31 08:53:22 +01:00
Tony Krowiak	79a8769479	qemu: change macvtap device options in response to NIC_RX_FILTER_CHANGED This patch enables synchronization of the host macvtap device options with the guest device's in response to the NIC_RX_FILTER_CHANGED event. The following device options will be synchronized: * PROMISC * MULTICAST * ALLMULTI Signed-off-by: Tony Krowiak <akrowiak@linux.vnet.ibm.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-01-30 13:16:28 +01:00
John Ferlan	7879d03197	qemu: Don't unconditionally delete file in qemuOpenFileAs https://bugzilla.redhat.com/show_bug.cgi?id=1158034 If we're expecting to create a file somewhere and that fails for some reason during qemuOpenFileAs, then we unlink the path we're attempting to create leaving no way to determine what the "existing" privileges, protections, or labels are that caused the failure (open, change owner and group, change mode, etc.). Furthermore, if we fall into the path where we'll be opening / creating the file using VIR_FILE_OPEN_FORK, we need to first unlink/delete the file we created in the first path; otherwise, the attempt by the child process to open as some specific user:group may fail because the file was already created using nfsnobody:nfsnobody. Again, if we didn't create the file we don't want to blindly delete what already exists. Thus, a second reason for the original check to set need_unlink to false when we find the file with CREAT set, but already existing. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-01-29 15:37:34 -05:00
John Ferlan	8ff383366b	qemu: Adjust EndAsyncJob for qemuDomainSaveInternal error path Commit id '540c339a' to fix issues with reference counting and transient domains moved the qemuDomainObjEndAsyncJob call prior to the attempt to restart the guest CPU's resulting in an error: error: Failed to save domain rhel70 to /tmp/pl/rhel70.save error: internal error: unexpected async job 3 when (ret != 0) - eg, the error path from qemuDomainSaveMemory. This patch will adjust the logic to call the EndAsyncJob only after we've tried to restart the guest CPUs. It also needs to adjust the test for qemuDomainRemoveInactive to add the ret == 0 condition. Additionally, if we get to endjob: because of some error earlier, then we need to save that error in the event the CPU restart logic fails. We don't want to return the error from CPU restart failure, rather we want to return the error from the failed save that caused us to fall into the retry to start the CPU logic. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-01-29 12:10:41 -05:00
Michal Privoznik	436dcf0b74	qemu: Add AAVMF to the list of known UEFIs Well, even though users can pass the list of UEFI:NVRAM pairs at the configure time, we may maintain the list of widely available UEFI ourselves too. And as arm64 begins to rise, OVMF was ported there too. With a slight name change - it's called AAVMF, with AAVMF_CODE.fd being the UEFI firmware and AAVMF_VARS.fd being the NVRAM store file. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-01-29 15:20:47 +01:00
Michal Privoznik	bc03a23149	qemu: Allow UEFI paths to be specified at compile time Up until now there are just two ways how to specify UEFI paths to libvirt. The first one is editing qemu.conf, the other is editing qemu_conf.c and recompile which is not that fancy. So, new configure option is introduced: --with-loader-nvram which takes a list of pairs of UEFI firmware and NVRAM store. This way, the compiled in defaults can be passed during compile time without need to change the code itself. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-01-29 15:20:42 +01:00
Luyao Huang	f76df311e8	qemu: fix cannot set graphic passwd via qemuDomainSaveImageDefineXML https://bugzilla.redhat.com/show_bug.cgi?id=1183890 When we try to update a xml to a image file, we will clear the graphics passwd settings, because we do not pass VIR_DOMAIN_XML_SECURE to qemuDomainDefCopy, qemuDomainDefFormatBuf won't format the passwd. Add VIR_DOMAIN_XML_SECURE flag when we call qemuDomainDefCopy in qemuDomainSaveImageUpdateDef. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-01-28 16:56:34 +01:00
Ján Tomko	21e0e8866e	hotplug: only add a chardev to vmdef after monitor call https://bugzilla.redhat.com/show_bug.cgi?id=1161024 This way the device is in vmdef only if ret = 0 and the caller (qemuDomainAttachDeviceFlags) does not free it. Otherwise it might get double freed by qemuProcessStop and qemuDomainAttachDeviceFlags if the domain crashed in monitor after we've added it to vm->def.	2015-01-28 10:10:54 +01:00
Ján Tomko	daf51be5f1	Split qemuDomainChrInsert into two parts Do the allocation first, then add the actual device. The second part should never fail. This is good for live hotplug where we don't want to remove the device on OOM after the monitor command succeeded. The only change in behavior is that on failure, the vmdef->consoles array is freed, not just the first console.	2015-01-27 18:30:15 +01:00
Daniel P. Berrange	f7afeddce9	qemu: report TAP device indexes to systemd Record the index of each TAP device created and report them to systemd, so they show up in machinectl status for the VM.	2015-01-27 13:57:02 +00:00
Daniel P. Berrange	55ea7be7d9	Removing probing of secondary drivers For stateless, client side drivers, it is never correct to probe for secondary drivers. It is only ever appropriate to use the secondary driver that is associated with the hypervisor in question. As a result the ESX & HyperV drivers have both been forced to do hacks where they register no-op drivers for the ones they don't implement. For stateful, server side drivers, we always just want to use the same built-in shared driver. The exception is virtualbox which is really a stateless driver and so wants to use its own server side secondary drivers. To deal with this virtualbox has to be built as 3 separate loadable modules to allow registration to work in the right order. This can all be simplified by introducing a new struct recording the precise set of secondary drivers each hypervisor driver wants struct _virConnectDriver { virHypervisorDriverPtr hypervisorDriver; virInterfaceDriverPtr interfaceDriver; virNetworkDriverPtr networkDriver; virNodeDeviceDriverPtr nodeDeviceDriver; virNWFilterDriverPtr nwfilterDriver; virSecretDriverPtr secretDriver; virStorageDriverPtr storageDriver; }; Instead of registering the hypervisor driver, we now just register a virConnectDriver instead. This allows us to remove all probing of secondary drivers. Once we have chosen the primary driver, we immediately know the correct secondary drivers to use. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2015-01-27 12:02:04 +00:00
Daniel P. Berrange	7b1ba9566b	Remove use of nwfilterPrivateData from nwfilter driver The nwfilter driver can rely on its global state instead of the connect private data.	2015-01-27 12:02:03 +00:00
Peter Krempa	d13f56f08a	qemu: Fix job handling in qemuDomainSetMetadata The code modifies the domain configuration but doesn't take a MODIFY type job to do so.	2015-01-27 10:39:21 +01:00
Peter Krempa	fb2ed975c3	qemu: Fix job type in qemuDomainGetBlockIoTune The function just queries status so there's no need for a MODIFY type job.	2015-01-27 10:39:21 +01:00
Peter Krempa	c5ee5cfb18	qemu: Fix job handling in qemuDomainSetSchedulerParametersFlags The code modifies the domain configuration but doesn't take a MODIFY type job to do so.	2015-01-27 10:38:47 +01:00
Peter Krempa	4fd7a72075	qemu: Fix job handling in qemuDomainSetMemoryParameters The code modifies the domain configuration but doesn't take a MODIFY type job to do so.	2015-01-27 10:24:04 +01:00
Peter Krempa	e3e72743df	qemu: Fix job handling in qemuDomainSetAutostart The code modifies the domain configuration but doesn't take a MODIFY type job to do so. This patch also fixes a few very long lines of code around the touched parts.	2015-01-27 10:24:04 +01:00
Peter Krempa	79e5603307	qemu: Fix job handling in qemuDomainPinEmulator The code modifies the domain configuration but doesn't take a MODIFY type job to do so.	2015-01-27 10:24:04 +01:00
Peter Krempa	46d950443d	qemu: Fix job handling in qemuDomainPinVcpuFlags The code modifies the domain configuration but doesn't take a MODIFY type job to do it.	2015-01-27 10:24:03 +01:00
Richard W.M. Jones	ee4c13ce1d	aarch64: Support versioned machine types. For distros that want to add versioned machine types, they will add (downstream) machine types like "virt-foo-1.2.3". Detect these as MMIO too. Signed-off-by: Richard W.M. Jones <rjones@redhat.com>	2015-01-23 15:12:33 +00:00
Erik Skultety	b7e6f2fc80	qemu: Add check for PCI bridge placement if there are too many PCI devices Previous patch of this series fixed the issue with adding a new PCI bridge when all the slots were reserved by devices with user specified addresses. In case there are still some PCI devices waiting to get a slot reserved by qemuAssignDevicePCISlots, this means a new bus needs to be created along with a corresponding bridge controller. By adding an additional check, this scenario now results in a reasonable error instead of generating wrong qemu command line.	2015-01-23 14:35:03 +01:00
Erik Skultety	5d6904b991	qemu: Fix auto-adding PCI bridge when all slots are reserved Commit 93c8ca tried to fix the issue with auto-adding of a PCI bridge controller, but didn't work properly in all scenarios. This patch provides a better fix of the issue when all slots on a PCI bus are reserved by devices with user specified addresses and no additional bridges need to be created. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1132900	2015-01-23 14:32:18 +01:00
Erik Skultety	a3ecd63e92	qemu: move PCI slot assignment for PIIX3, Q35 into a separate function In order to be able to test for fully reserved PCI buses, assignment of PCI slots for integrated devices needs to be moved to a separate function. This also might be a good preparation if we decide to add support for other chipsets as well.	2015-01-23 14:26:55 +01:00
Erik Skultety	3fb2a69284	qemu: reorder PCI slot assignment functions Move qemuDomainAssignPCIAddresses after the definition of the static function qemuDomainValidateDevicePCISlotsQ35. This lets us define a new static function using qemuDomainValidateDevicePCISlots* and use it in qemuDomainAssignPCIAddresses without a forward declaration. Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-01-23 14:16:40 +01:00
Peter Krempa	165c34778b	qemu: command: Honor const-correctness in qemuBuildNumaArgStr @def is modified in the function indirectly although it's marked as const.	2015-01-23 13:18:04 +01:00
Erik Skultety	2fbfb3ac41	qemu: Remove dead code in qemuDomainAssignPCIAddresses revert patch As it turned out, fix of dead code 419a22 changed the affected condition from "never true" to "always true", so better fix would be to change the return code of virDomainMaybeAddController from 0 to 1 if a new bridge has been added, thus distinguishing case when we didn't need to add any controller and case we successfully added one. The return code is changed in the next commit	2015-01-23 11:03:45 +01:00
Peter Krempa	b347c0c2a3	CVE-2015-0236: qemu: Check ACLs when dumping security info from snapshots The ACL check didn't check the VIR_DOMAIN_XML_SECURE flag and the appropriate permission for it. Found via code inspection while fixing permissions for save images.	2015-01-22 14:32:54 +01:00
Peter Krempa	03c3c0c874	CVE-2015-0236: qemu: Check ACLs when dumping security info from save image The ACL check didn't check the VIR_DOMAIN_XML_SECURE flag and the appropriate permission for it.	2015-01-22 14:32:54 +01:00
Luyao Huang	860522d26b	qemu: output error when try to hotplug unsupported console type https://bugzilla.redhat.com/show_bug.cgi?id=1164627 When using 'virsh attach-device' to hotplug an unsupported console type into a qemu guest the attachment would succeed as the command line formatter didn't report error in such case. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-01-22 11:17:14 +01:00
Ján Tomko	280ece4af9	qemu: format server interface without a listen address https://bugzilla.redhat.com/show_bug.cgi?id=1130390 The listen address is not mandatory for <interface type='server'> but when it's not specified, we've been formatting it as: -netdev socket,listen=(null):5558,id=hostnet0 which failed with: Device 'socket' could not be initialized Omit the address completely and only format the port in the listen attribute. Also fix the schema to allow specifying a model.	2015-01-21 13:22:36 +01:00
Ján Tomko	d16704fd60	qemu_conf: check for duplicate security drivers Using the same driver multiple times is pointless and it can result in confusing errors: $ virsh start test error: Failed to start domain test error: internal error: security label already defined for VM https://bugzilla.redhat.com/show_bug.cgi?id=1153891	2015-01-19 12:46:37 +01:00
Ján Tomko	5c703ca396	Always check return value of qemuDomainObjExitMonitor Depending on the context, either error out if the domain has disappeared in the meantime, or just ignore the value to allow marking the function as ATTRIBUTE_RETURN_CHECK.	2015-01-19 10:12:32 +01:00
Ján Tomko	3070bc8ee5	Fix vmdef usage after domain crash in monitor on device attach https://bugzilla.redhat.com/show_bug.cgi?id=1161024 If the domain crashed while we were in monitor, we cannot rely on the REALLOC done on live definition, since vm->def now points to the persistent definition. Skip adding the attached devices to domain definition if the domain crashed. In AttachChrDevice, the chardev was already added to the live definition and freed by qemuProcessStop in the case of a crash. Skip the device removal in that case. Also skip audit if the domain crashed in the meantime.	2015-01-19 10:12:32 +01:00
Ján Tomko	6edb97f29a	Fix vmdef usage after domain crash in monitor on device detach https://bugzilla.redhat.com/show_bug.cgi?id=1161024 In the device type-specific functions, exit early if the domain has disappeared, because the cleanup should have been done by qemuProcessStop. Check the return value in processDeviceDeletedEvent and qemuProcessUpdateDevices. Skip audit and removing the device from live def because it has already been cleaned up.	2015-01-19 10:12:07 +01:00
Dmitry Guryanov	c8a6f844c3	add ploop fs driver type Ploop is a pseudo device which makes it possible to access an image in a file as a block device. It is like loop devices, but with additional features such as snapshots and a write tracker, and without double-caching. It is used in PCS for containers and in OpenVZ. You can manage ploop devices and images with the ploop utility (http://git.openvz.org/?p=ploop). Signed-off-by: Dmitry Guryanov <dguryanov@parallels.com>	2015-01-16 14:07:46 +01:00
Martin Kletzander	6514c04c18	qemu: Add support for enabling/disabling PMU This is used as a boolean parameter for the '-cpu' option. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1178853 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-01-16 13:43:46 +01:00
Erik Skultety	419a22d5db	Remove dead code in qemuDomainAssignPCIAddresses We tested for positive return value from virDomainMaybeAddController, but it returns 0 or -1 only resulting in a dead code. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-01-16 10:59:13 +01:00
Erik Skultety	93c8ca9974	qemu: Tweak auto adding PCI bridge controller when extending default PCI bus In case we find out, there are more PCI devices to be connected than there are available slots on the default PCI bus, we automatically add a new bus and a related PCI bridge controller as well. As there are no free slots left on the default PCI bus, PCI bridge controller gets a free slot on a newly created PCI bus which causes qemu to refuse to start the guest. This fix introduces a new function qemuDomainPCIBusFullyReserved which is checked right before we possibly try to reserve a slot for PCI bridge controller. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1132900	2015-01-16 10:38:29 +01:00
Daniel P. Berrange	dd69a14f90	Add support for schema validation when passing in XML The virDomainDefineXMLFlags and virDomainCreateXML APIs both gain new flags allowing them to be told to validate XML. This updates all the drivers to turn on validation in the XML parser when the flags are set	2015-01-15 16:40:27 +00:00
Daniel P. Berrange	c5b6a4a5cb	Change int to size_t in size var for tap/vhost FDs A number of methods take an int for a parameter that indicates the size of an array. The correct type for array sizes is size_t	2015-01-15 11:07:13 +00:00
Daniel P. Berrange	318df5a05f	Add support for systemd-machined CreateMachineWithNetwork systemd-machined introduced a new method CreateMachineWithNetwork that obsoletes CreateMachine. It expects to be given a list of VETH/TAP device indexes for the host side device(s) associated with a container/machine. This falls back to the old CreateMachine method when the new one is not supported.	2015-01-15 11:07:07 +00:00
Luyao Huang	5035279198	qemu: free priv->origname when qemuMigrationPrepareAny fails https://bugzilla.redhat.com/show_bug.cgi?id=1181182 When qemuMigrationPrepareAny hits an error and jumps to cleanup with rc < 0, we forget to clear priv->origname. The stale origname left in priv makes the next migration of this VM fail, because a wrong cookie is generated for it. This patch makes sure priv->origname is NULL when migration fails on the target host. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-01-15 11:32:50 +01:00
Ján Tomko	c749eda4a2	Fix vmdef usage while in monitor in qemu process Make local copy of the disk alias in qemuProcessInitPasswords, instead of referencing the one in domain definition, which might get freed if the domain crashes while we're in monitor. Also copy the memballoon period value.	2015-01-14 19:30:32 +01:00
Ján Tomko	3f21398437	Fix vmdef usage while in monitor in BlockStat* APIs Make a local copy of the disk alias instead of pointing to the domain definition, which might get freed if the domain dies while we're in monitor. Also exit early if that happens.	2015-01-14 19:30:32 +01:00
Ján Tomko	051add2ff9	Fix vmdef usage while in monitor in qemuDomainHotplugVcpus Exit the monitor right after we've done with it to get the virDomainObjPtr lock back, otherwise we might be accessing vm->def while it's being cleaned up by qemuProcessStop. If the domain crashed while we were in the monitor, exit early instead of changing vm->def which is now the persistent definition.	2015-01-14 19:30:32 +01:00
Ján Tomko	dc2fd51fd7	Check for domain liveness in qemuDomainObjExitMonitor The domain might disappear during the time in monitor when the virDomainObjPtr is unlocked, so the caller needs to check if it's still alive. Since most of the callers are going to need it, put the check inside qemuDomainObjExitMonitor and return -1 if the domain died in the meantime.	2015-01-14 19:30:32 +01:00
Pavel Hrdina	ce745914b3	qemu_process: detect updated video ram size values from QEMU QEMU internally updates the size of video memory if the domain XML had provided too low memory size or there are some dependencies for a QXL devices 'vgamem' and 'ram' size. We need to know about the changes and store them into the status XML to not break migration or managedsave through different libvirt versions. The values would be loaded only if the "vgamem_mb" property exists for the device. The presence of the "vgamem_mb" also tells that the "ram_size" and "vram_size" exists for QXL devices. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-01-14 11:55:51 +01:00
Pavel Hrdina	cc41c64878	qemu_monitor: introduce new function to get QOM path The search is done recursively only through QOM object that has a type prefixed with "child<" as this indicate that the QOM is a parent for other QOM objects. The usage is that you give known device name with starting path where to search. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-01-14 11:55:51 +01:00
Pavel Hrdina	e105dc9814	qemu_driver: fix setting vcpus for offline domain Commit `e3435caf` fixed hot-plugging of vcpus with strict memory pinning on NUMA hosts, but unfortunately it also broke updating number of vcpus for offline guests using our API. The issue is that we try to create a cpu cgroup for non-running guest which fails as there are no cgroups for that domain. We should create cgroups and update cpuset.mems only if we are hot-plugging. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-01-14 10:34:20 +01:00
Michal Privoznik	04cf99a6b6	qemu, lxc: Warn if setting QoS on unsupported vNIC types https://bugzilla.redhat.com/show_bug.cgi?id=1165993 So, there are still plenty of vNIC types that we don't know how to set bandwidth on. Let's warn explicitly in case user has requested it instead of pretending everything was set. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-01-14 08:54:49 +01:00
Shanzhi Yu	9f974858dd	qemu: snapshot: inactive external snapshot can't work after libvirtd restart When create inactive external snapshot, after update disk definitions, virDomainSaveConfig is needed, if not after restart libvirtd the new snapshot file definitions in xml will be lost. Reproduce steps: 1. prepare a shut off guest $ virsh domstate rhel7 && virsh domblklist rhel7 shut off Target Source ------------------------------------------------ vda /var/lib/libvirt/images/rhel7.img 2. create external disk snapshot $ virsh snapshot-create rhel7 --disk-only && virsh domblklist rhel7 Domain snapshot 1417882967 created Target Source ------------------------------------------------ vda /var/lib/libvirt/images/rhel7.1417882967 3. restart libvirtd then check guest source file $ service libvirtd restart && virsh domblklist rhel7 Redirecting to /bin/systemctl restart libvirtd.service Target Source ------------------------------------------------ vda /var/lib/libvirt/images/rhel7.img This was first reported by Eric Blake http://www.redhat.com/archives/libvir-list/2014-December/msg00369.html Signed-off-by: Shanzhi Yu <shyu@redhat.com>	2015-01-13 15:59:06 -05:00
Daniel P. Berrange	0ecd685109	Give virDomainDef parser & formatter their own flags The virDomainDefParse* and virDomainDefFormat* methods both accept the VIR_DOMAIN_XML_* flags defined in the public API, along with a set of other VIR_DOMAIN_XML_INTERNAL_* flags defined in domain_conf.c. This is seriously confusing & error prone for a number of reasons: - VIR_DOMAIN_XML_SECURE, VIR_DOMAIN_XML_MIGRATABLE and VIR_DOMAIN_XML_UPDATE_CPU are only relevant for the formatting operation - Some of the VIR_DOMAIN_XML_INTERNAL_* flags only apply to parse or to format, but not both. This patch cleanly separates out the flags. There are two distinct VIR_DOMAIN_DEF_PARSE_* and VIR_DOMAIN_DEF_FORMAT_* flags that are used by the corresponding methods. The VIR_DOMAIN_XML_* flags received via public API calls must be converted to the VIR_DOMAIN_DEF_FORMAT_* flags where needed. The various calls to virDomainDefParse which hardcoded the use of the VIR_DOMAIN_XML_INACTIVE flag change to use the VIR_DOMAIN_DEF_PARSE_INACTIVE flag.	2015-01-13 16:26:12 +00:00
Eric Blake	e1125cebfc	qemu: forbid second blockcommit during active commit https://bugzilla.redhat.com/show_bug.cgi?id=1135339 documents some confusing behavior when a user tries to start an inactive block commit in a second connection while there is already an on-going active commit from a first connection. Eventually, qemu will support multiple simultaneous block jobs, but as of now, it does not; furthermore, libvirt also needs an overhaul before we can support simultaneous jobs. So, the best way to avoid confusing ourselves is to quit relying on qemu to tell us about the situation (where we risk getting in weird states) and instead forbid a duplicate block commit ourselves. Note that we are still relying on qemu to diagnose attempts to interrupt an inactive commit (since we only track XML of an active commit), but as inactive commit is less confusing for libvirt to manage, there is less that can go wrong by leaving that detection up to qemu. * src/qemu/qemu_driver.c (qemuDomainBlockCommit): Hoist check for active commit to occur earlier outside of conditions. Signed-off-by: Eric Blake <eblake@redhat.com>	2015-01-13 08:21:20 -07:00
Daniel P. Berrange	4d2ebc71ce	Add stub virDomainDefineXMLFlags impls Make sure every virt driver implements virDomainDefineXMLFlags by adding a trivial passthrough from the existing impl with no flags set.	2015-01-13 10:38:56 +00:00
Martin Kletzander	adff345e1e	qemu: Allow enabling/disabling features with host-passthrough QEMU supports feature specification with -cpu host and we just skip using that. Since QEMU developers themselves would like to use this feature, this patch modifies the code to work. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1178850 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-01-13 08:51:01 +01:00
Pavel Hrdina	0e502466ac	qxl: change the default value for vgamem_mb to 16 MiB The default value should be 16 MiB instead of 8 MiB. Only really old versions of upstream QEMU used 8 MiB as the default for the vga framebuffer. Without this change, if you update to a libvirt version that introduced the "vgamem" attribute for the QXL video device, the value will be set to 8 MiB, but previously your guest had 16 MiB because we didn't pass any value to the QEMU command line, which means QEMU used its own 16 MiB default. This will affect all users with a guest display resolution higher than 1920x1080. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2015-01-12 14:51:13 +01:00
Michal Privoznik	732586d979	qemu: Fix system pages handling in <memoryBacking/> In one of my previous commits (`311b4a67`) I've tried to allow to pass regular system pages to <hugepages>. However, there was a little bug that wasn't caught. If domain has guest NUMA topology defined, qemuBuildNumaArgStr() function takes care of generating corresponding command line. The hugepages backing for guest NUMA nodes is handled there too. And here comes the bug: the hugepages setting from XML is stored in KiB internally, however, the system pages size was queried and stored in Bytes. So the check whether these two are equal was failing even if it shouldn't. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-01-07 18:32:07 +01:00
Ján Tomko	b073179085	Indentation	2015-01-07 16:35:18 +01:00
Peter Krempa	79bb49a83d	qemu: Don't unref domain after exit from nested async job In commit `540c339a25` the whole domain reference counting was refactored in the qemu driver. Domain jobs now don't need to reference the domain object as they now expect the reference from the calling function. However, the patch forgot to remove the unref call in case we exit the monitor when we were acquiring a nested job. This caused the daemon to crash on a subsequent access to the domain object once we've done an operation requiring a nested job for a monitor access. An easy reproducer case: 1) Start a vm with qcow disks 2) virsh snapshot-create-as DOMNAME 3) virsh dumpxml DOMNAME 4) daemon crashes in a semi-random spot while accessing a now-removed VM object. Fortunately, the commit wasn't released yet, so there are no security implications. Reported-by: Shanzhi Yu <shyu@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2015-01-07 13:47:31 +01:00
Luyao Huang	565d049fd1	qemu: Restore old bandwidth rules when setting new fails https://bugzilla.redhat.com/show_bug.cgi?id=1177723 When setting new bandwidth limits via virDomainSetInterfaceParameters, the old ones are cleared first. However, if setting the new ones fails, the old are already gone and interface is left in inconsistent state. Therefore, right before failing we ought to try to restore the old bandwidth. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-01-06 13:27:43 +01:00
Luyao Huang	a791599cc6	qemu: fix miss goto cleanup in qemuDomainAttachNetDevice The logic here is wrong, probably because a goto cleanup was forgotten. Also fix some small things. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-01-06 11:07:13 +01:00
Luyao Huang	39449f70b9	qemu: use a wrong name for guest panic status https://bugzilla.redhat.com/show_bug.cgi?id=1178652 We get a warning when a guest is in paused status (caused by a kernel panic) and libvirtd is restarted. The warning message looks like this: Qemu reported unknown VM status: 'guest-panicked' This seems to be because we set a wrong status name in qemu_monitor.c; from the qemu qapi-schema.json file we know this status should be named 'guest-panicked'. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2015-01-05 16:55:35 -07:00
Cédric Bosdonnat	aa2cc72100	Domain conf: allow more than one IP address for net devices Add the possibility to have more than one IP address configured for a domain network interface. IP addresses can also have a prefix to define the corresponding netmask.	2015-01-05 20:24:04 +01:00
Martin Kletzander	31354b5b32	qemu: Fix coverity issues after refcount refactoring Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-23 05:34:05 +01:00
Martin Kletzander	540c339a25	qemu: completely rework reference counting There is one problem that causes various errors in the daemon. When domain is waiting for a job, it is unlocked while waiting on the condition. However, if that domain is for example transient and being removed in another API (e.g. cancelling incoming migration), it gets unref'd. If the first call, that was waiting, fails to get the job, it unref's the domain object, and because it was the last reference, it causes clearing of the whole domain object. However, when finishing the call, the domain must be unlocked, but there is no way for the API to know whether it was cleaned or not (unless there is some ugly temporary variable, but let's scratch that). The root cause is that our APIs don't ref the objects they are using and all use the implicit reference that the object has when it is in the domain list. That reference can be removed when the API is waiting for a job. And because each domain doesn't do its ref'ing, it results in the ugly checking of the return value of virObjectUnref() that we have everywhere. This patch changes qemuDomObjFromDomain() to ref the domain (using virDomainObjListFindByUUIDRef()) and adds qemuDomObjEndAPI() which should be the only function in which the return value of virObjectUnref() is checked. This makes all reference counting deterministic and makes the code a bit clearer. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-21 10:48:56 +01:00
Daniel P. Berrange	65686e5a81	disable vCPU pinning with TCG mode Although QMP returns info about vCPU threads in TCG mode, the data it returns is mostly lies. Only the first vCPU has a valid thread_id returned. The thread_id given for the other vCPUs is in fact the main emulator thread. All vCPUs actually run under the same thread in TCG mode. Our vCPU pinning code is not at all able to cope with this so if you try to set CPU affinity per-vCPU you end up with weird errors error: Failed to start domain instance-00000007 error: cannot set CPU affinity on process 24365: Invalid argument Since few people will care about the performance of TCG with strict CPU pinning, let's just disable that for now, so we get a clear error message error: Failed to start domain instance-00000007 error: Requested operation is not valid: cpu affinity is not supported	2014-12-19 11:32:21 +00:00
Daniel P. Berrange	b07f3d821d	Don't setup fake CPU pids for old QEMU The code assumes that def->vcpus == nvcpupids, so when we setup fake CPU pids for old QEMU with nvcpupids == 1, we cause the later code to read off the end of the array. This has fun results like sched_setaffinity(0, ...) which changes libvirtd's own CPU affinity, or even better sched_setaffinity($RANDOM, ...) which changes the affinity of a random OS process.	2014-12-19 11:32:21 +00:00
Michal Privoznik	f309db1f4d	qemu: Create memory-backend-{ram,file} iff needed Libvirt BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1175397 QEMU BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1170093 In qemu there are two interesting arguments: 1) -numa to create a guest NUMA node 2) -object memory-backend-{ram,file} to tell qemu which memory region on which host's NUMA node it should allocate the guest memory from. Combining these two together we can instruct qemu to create a guest NUMA node that is tied to a host NUMA node. And it works just fine. However, depending on machine type used, there might be some issues during migration when OVMF is enabled (see QEMU BZ). While this truly is a QEMU bug, we can help avoid it. The problem lies within the memory backend objects somewhere. Having said that, the fix on our side consists of putting those objects on the command line if and only if needed. For instance, while previously we would construct this (in all ways correct) command line: -object memory-backend-ram,size=256M,id=ram-node0 \ -numa node,nodeid=0,cpus=0,memdev=ram-node0 now we create just: -numa node,nodeid=0,cpus=0,mem=256 because the backend object is obviously not tied to any specific host NUMA node. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-19 07:44:44 +01:00
Ján Tomko	1adda68a1b	Remove redundant cleanup in qemuDomainAttachVirtioDiskDevice Commit `ca91ba7` moved these into the qemuDomainPrepareDisk helper, but forgot to remove them from here as well.	2014-12-18 12:53:56 +01:00
Ján Tomko	1cddf0001f	Fix hotplugging of block device-backed usb disks Commit `ca91ba7` moved qemuSetupDiskCgroup into the qemuDomainPrepareDisk helper, but failed to call it for usb disks. https://bugzilla.redhat.com/show_bug.cgi?id=1175668	2014-12-18 12:53:56 +01:00
Eric Blake	af5c3a1015	qemu: fix memory leak in blockinfo Coverity flagged commit `0282ca45` as introducing a memory leak; in all my refactoring to make capacity probing conditional on whether the image is non-raw, I missed deleting the unconditional probe. * src/qemu/qemu_driver.c (qemuStorageLimitsRefresh): Drop redundant assignment. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-17 16:10:45 -07:00
Ján Tomko	952f8a7394	Fix error message on redirdev caps detection	2014-12-17 16:23:45 +01:00
Luyao Huang	dddd832735	conf: fix cannot start a guest have a shareable network iscsi hostdev https://bugzilla.redhat.com/show_bug.cgi?id=1174569 There's nothing we need to do for shared iSCSI devices in qemuAddSharedHostdev and qemuRemoveSharedHostdev. The iSCSI layer takes care of that for us. Signed-off-by: Luyao Huang <lhuang@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-17 11:23:00 +01:00
Eric Blake	3937ef9cf4	getstats: crawl backing chain for qemu Wire up backing chain recursion. For the first time, it is now possible to get libvirt to expose that qemu tracks read statistics on backing files, as well as report maximum extent written on a backing file during a block-commit operation. For a running domain, where one of the two images has a backing file, I see the traditional output: $ virsh domstats --block testvm2 Domain: 'testvm2' block.count=2 block.0.name=vda block.0.path=/tmp/wrapper.qcow2 block.0.rd.reqs=1 block.0.rd.bytes=512 block.0.rd.times=28858 block.0.wr.reqs=0 block.0.wr.bytes=0 block.0.wr.times=0 block.0.fl.reqs=0 block.0.fl.times=0 block.0.allocation=0 block.0.capacity=1310720000 block.0.physical=200704 block.1.name=vdb block.1.path=/dev/sda7 block.1.rd.reqs=0 block.1.rd.bytes=0 block.1.rd.times=0 block.1.wr.reqs=0 block.1.wr.bytes=0 block.1.wr.times=0 block.1.fl.reqs=0 block.1.fl.times=0 block.1.allocation=0 block.1.capacity=1310720000 vs. the new output: $ virsh domstats --block --backing testvm2 Domain: 'testvm2' block.count=3 block.0.name=vda block.0.path=/tmp/wrapper.qcow2 block.0.rd.reqs=1 block.0.rd.bytes=512 block.0.rd.times=28858 block.0.wr.reqs=0 block.0.wr.bytes=0 block.0.wr.times=0 block.0.fl.reqs=0 block.0.fl.times=0 block.0.allocation=0 block.0.capacity=1310720000 block.0.physical=200704 block.1.name=vda block.1.path=/dev/sda6 block.1.backingIndex=1 block.1.rd.reqs=0 block.1.rd.bytes=0 block.1.rd.times=0 block.1.wr.reqs=0 block.1.wr.bytes=0 block.1.wr.times=0 block.1.fl.reqs=0 block.1.fl.times=0 block.1.allocation=327680 block.1.capacity=786432000 block.2.name=vdb block.2.path=/dev/sda7 block.2.rd.reqs=0 block.2.rd.bytes=0 block.2.rd.times=0 block.2.wr.reqs=0 block.2.wr.bytes=0 block.2.wr.times=0 block.2.fl.reqs=0 block.2.fl.times=0 block.2.allocation=0 block.2.capacity=1310720000 I may later do a patch that trims the output to avoid 0 stats, particularly for backing files (which are more likely to have 0 stats, at least for write statistics when no block-commit is performed). Also, I still plan to expose physical size information (qemu doesn't expose it yet, so it requires a stat, and for block devices, a further open/seek operation). But this patch is good enough without worrying about that yet. * src/qemu/qemu_driver.c (QEMU_DOMAIN_STATS_BACKING): New internal enum bit. (qemuConnectGetAllDomainStats): Recognize new user flag, and pass details to... (qemuDomainGetStatsBlock): ...here, where we can do longer recursion. (qemuDomainGetStatsOneBlock): Output new field. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-17 02:07:44 -07:00
Eric Blake	c2d380bff8	getstats: split block stats reporting for easier recursion In order to report stats on backing chains, we need to separate the output of stats for one block from how we traverse blocks. * src/qemu/qemu_driver.c (qemuDomainGetStatsBlock): Split... (qemuDomainGetStatsOneBlock): ...into new helper. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-17 02:07:44 -07:00
Eric Blake	14ef1f62e3	getstats: prepare for dynamic block.count stat A coming patch will make it optionally possible to list backing chain block stats; in this mode of operation, block.count is no longer the number of <disks> in the domain, but the number of blocks in the array being reported. We still want block.count listed first, but rather than iterate the tree twice (once to count, and once to list stats), it's easier to just touch things up after the fact. * src/qemu/qemu_driver.c (qemuDomainGetStatsBlock): Compute count after the fact. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-17 00:20:21 -07:00
Eric Blake	596a137134	getstats: report block sizes for offline domains The prior refactoring can now be put to use. With the same domain as the earlier commit `7b49926` (one qcow2 disk and an empty cdrom drive): $ virsh domstats --block foo Domain: 'foo' block.count=2 block.0.name=hda block.0.path=/var/lib/libvirt/images/foo.qcow2 block.0.allocation=1309614080 block.0.capacity=42949672960 block.0.physical=1309671424 block.1.name=hdc * src/qemu/qemu_driver.c (qemuDomainGetStatsBlock): Use qemuStorageLimitsRefresh to report offline statistics. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-17 00:20:21 -07:00
Eric Blake	8de6544e98	qemu: refactor blockinfo data gathering Create a helper function that can be reused for gathering block info from virDomainListGetStats. * src/qemu/qemu_driver.c (qemuDomainGetBlockInfo): Split guts... (qemuStorageLimitsRefresh): ...into new helper function. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 23:28:36 -07:00
Eric Blake	0282ca45a0	qemu: fix bugs in blockstats The documentation for virDomainBlockInfo was confusing: it stated that 'physical' was the size of the container, then gave an example of it being the amount of storage used by a sparse file (that is, for a sparse raw image on a regular file, the wording implied capacity==physical, while allocation was smaller; but the example instead claimed physical==allocation). Since we use 'physical' for the last offset of a block device, we should do likewise for regular files. Furthermore, the example claimed that for a qcow2 regular file, allocation==physical. At the time the code was first written, this was true (qcow2 files were allocated sequentially, and were never sparse, so the last sector written happened to also match the disk space occupied); but modern qemu does much better and can punch holes for a qcow2 with allocation < physical. Basically, after this patch, the three fields are now reliably mapped as: 'capacity' - how much storage the guest can see (equal to physical for raw images, determined by image metadata otherwise) 'allocation' - how much storage the image occupies (similar to what 'du' would report) 'physical' - the last offset of the image (similar to what 'ls' would report) 'capacity' can be larger than 'physical' (such as for a qcow2 image that does not vary much from a backing file) or smaller (such as for a qcow2 file with lots of internal snapshots). Likewise, 'allocation' can be (slightly) larger than 'physical' (such as counting the tail of cluster allocations required to round a file size up to filesystem granularity) or smaller (for a sparse file). A block-resize operation changes capacity (which, for raw images, also changes physical); many non-raw images automatically grow physical and allocation as necessary when starting with an allocation smaller than capacity; and even when capacity and physical stay unchanged, allocation can change when converting sectors from holes to data or back. Note that this does not change semantics for qcow2 images stored on block devices; there, we still rely on qemu to report the highest written extent for allocation. So using this API to track when to extend a block device because a qcow2 image is about to exceed a threshold will not see any changes. Also, note that virStorageVolInfo is unfortunately limited to just 'capacity' and 'allocation' (we can't expand it to add 'physical', although we can expand the XML to add it there); historically, that struct's 'allocation' value has reported file size for qcow2 files (what this patch terms 'physical' for a domain block device), but disk usage for raw files (what this patch terms 'allocation'). So follow-up patches will be needed to make storage volumes report the same allocation values and get at physical values, where those differ. * include/libvirt/libvirt-domain.h (_virDomainBlockInfo): Tweak documentation to match saner definition. * src/qemu/qemu_driver.c (qemuDomainGetBlockInfo): For regular files, physical size is capacity, not allocation. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 23:19:08 -07:00
Eric Blake	05e702cfd4	getstats: rearrange blockinfo gathering Ultimately, we want to avoid read()ing a file while qemu is running. We still have to open() block devices to determine their physical size, but that is safer. This patch rearranges code to group together all code that reads the image, to make it easier for later patches to skip the metadata collection when possible. * src/qemu/qemu_driver.c (qemuDomainGetBlockInfo): Check for empty disk up front. Place metadata reading next to use. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 23:13:04 -07:00
Eric Blake	b1802714da	getstats: perform recursion in monitor collection When requested in a later patch, the QMP command results are now examined recursively. As qemu_driver will eventually have to read items out of the hash table as stored by this patch, the computation of backing alias string is done in a shared location. * src/qemu/qemu_domain.h (qemuDomainStorageAlias): New prototype. * src/qemu/qemu_domain.c (qemuDomainStorageAlias): Implement it. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONGetOneBlockStatsInfo) (qemuMonitorJSONBlockStatsUpdateCapacityOne): Perform recursion. (qemuMonitorJSONGetAllBlockStatsInfo) (qemuMonitorJSONBlockStatsUpdateCapacity): Update callers. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 16:14:55 -07:00
Eric Blake	7b11f5e554	getstats: prepare monitor collection for recursion A future patch will allow recursion into backing chains when collecting block stats. This patch should not change behavior, but merely moves out the common code that will be reused once recursion is enabled, and adds the parameter that will turn on recursion. * src/qemu/qemu_monitor.h (qemuMonitorGetAllBlockStatsInfo) (qemuMonitorBlockStatsUpdateCapacity): Add recursion parameter, although it is ignored for now. * src/qemu/qemu_monitor.h (qemuMonitorGetAllBlockStatsInfo) (qemuMonitorBlockStatsUpdateCapacity): Likewise. * src/qemu/qemu_monitor_json.h (qemuMonitorJSONGetAllBlockStatsInfo) (qemuMonitorJSONBlockStatsUpdateCapacity): Likewise. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONGetAllBlockStatsInfo) (qemuMonitorJSONBlockStatsUpdateCapacity): Add parameter, and split... (qemuMonitorJSONGetOneBlockStatsInfo) (qemuMonitorJSONBlockStatsUpdateCapacityOne): ...into helpers. (qemuMonitorJSONGetBlockStatsInfo): Update caller. * src/qemu/qemu_driver.c (qemuDomainGetStatsBlock): Update caller. * src/qemu/qemu_migration.c (qemuMigrationCookieAddNBD): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 16:08:04 -07:00
Eric Blake	89646e69ac	qemu: let blockinfo reuse virStorageSource Right now, grabbing blockinfo always calls stat on the disk, then opens the image to determine the capacity, using a throw-away virStorageSourcePtr. This has a couple of drawbacks: 1. We are calling stat and opening a file on every invocation of the API. However, there are cases where the stats should NOT be changing between successive calls (if a domain is running, no one should be changing the physical size of a block device or raw image behind our backs; capacity of read-only files should not be changing; and we are the gateway to the block-resize command to know when the capacity of read-write files should be changing). True, we still have to use stat in some cases (a sparse raw file changes allocation if it is read-write and the amount of holes is changing, and a read-write qcow2 image stored in a file changes physical size if it was not fully pre-allocated). But for read-only images, even this should be something we can remember from the previous time, rather than repeating every call. 2. We want to enhance the power of virDomainListGetStats, by sharing code. But we already have a virStorageSourcePtr for each disk, and it would be easier to reuse the common structure than to have to worry about the one-off virDomainBlockInfoPtr. While this patch does not optimize reuse of information in point 1, it does get us closer to being able to do so; by updating a structure that survives between consecutive calls. * src/util/virstoragefile.h (_virStorageSource): Add physical, to mirror virDomainBlockInfo; rearrange fields to match public struct. (virStorageSourceCopy): Copy the new field. * src/qemu/qemu_driver.c (qemuDomainGetBlockInfo): Store into storage source, then copy to block info. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 16:05:47 -07:00
Eric Blake	a20c3aafbe	qemu: refactor blockinfo job handling In order for a future patch to virDomainListGetStats to reuse some code for determining disk usage of offline domains, we need to make it easier to pull out part of the guts of grabbing blockinfo. The current implementation grabs a job fairly late in the game, while getstats will already own a job; reordering things so that the job is always grabbed up front in both functions will make it easier to pull out the common code. This patch results in grabbing a job in cases where one was not previously needed, but as it is a query job, it should not be noticeably slower. This patch touches the same code as the fix for CVE-2014-6458 (commit `b799259`); in that patch, we avoided hotplug changing a disk reference during the time of obtaining a monitor lock by copying all data we needed and no longer referencing disk; this patch goes the other way and ensures that by holding the job, the disk cannot be changed so we no longer need to worry about the disk being invalidated across the monitor lock. * src/qemu/qemu_driver.c (qemuDomainGetBlockInfo): Rearrange job control to be outside of disk information. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-16 14:12:24 -07:00
Martin Kletzander	4d1e3943d6	qemu: Free saved error in qemuDomainSetVcpusFlags Commit `e3435caf` added cleanup code to qemuDomainSetVcpusFlags() that was not supposed to reset the error. Usual procedure was done, saving the error to temporary variable, but it was never free'd, but rather leaked. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-16 20:45:05 +01:00
Martin Kletzander	86759ec61a	qemu: Add missing goto error in qemuRestoreCgroupState Commit `af2a1f05` tried clearly separating each condition in qemuRestoreCgroupState() for the sake of readability, however somehow one condition body was missing. That means that the body of the next condition got executed only if both of them were true, which is impossible, thus resulting in dead code and a logic error. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-16 20:44:33 +01:00
Martin Kletzander	e3435caf6a	qemu: Fix hotplugging cpus with strict memory pinning When hot-plugging a VCPU into the guest, kvm needs to allocate some data from the DMA zone, which might be in a memory node that's not allowed in cpuset.mems. Basically the same problem as there was with starting the domain and due to which commit `7e72ac7878` exists. This patch just extends it to hotplugging as well. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1161540 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-16 11:15:27 +01:00
Martin Kletzander	af2a1f0587	qemu: Leave cpuset.mems in parent cgroup alone Instead of setting the value of cpuset.mems once when the domain starts and then re-calculating the value every time we need to change the child cgroup values, leave the cgroup alone and rather set the child data every time there is new cgroup created. We don't leave any task in the parent group anyway. This will ease both current and future code. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-16 11:15:27 +01:00
Martin Kletzander	c74d58ad47	qemu: Save numad advice into qemuDomainObjPrivate Thanks to that we don't need to drag the pointer everywhere and future code will get cleaner. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-16 11:15:27 +01:00
Martin Kletzander	f801a81208	qemu: Remove unnecessary qemuSetupCgroupPostInit function Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-16 11:15:27 +01:00
Luyao Huang	98dee71759	qemu: Auto generate a controller when attach hostdev and chr device https://bugzilla.redhat.com/show_bug.cgi?id=1174154 When we use attach-device to add a hostdev or chr device which has an iscsi address or others (just like guest agent, subsys iscsi disk...), we will find there is no basic controller for our newly attached device. Sometimes this will prevent the guest from starting after we add them (although it can start the second time). Signed-off-by: Luyao Huang <lhuang@redhat.com>	2014-12-15 16:24:01 +01:00
Laine Stump	44292e48a0	qemu: add/remove bridge fdb entries as guest CPUs are started/stopped When libvirt is managing a bridge's forwarding database (FDB) (macTableManager='libvirt'), if we add FDB entries for a new guest interface even before the qemu process is created, then in the case of a migration any other guest attached to the "destination" bridge will have its traffic immediately sent to the destination of the migration even while the source domain is still running (and the destination, of course, isn't). To make sure that traffic from other guests on the new host continues flowing to the old guest until the new one is ready, we have to wait until the new guest CPUs are started to add the FDB entries. Conversely, we need to remove the FDB entries from the bridge any time the guest CPUs are stopped; among other things, this will assure proper operation during a post-copy migration (which is just the opposite of the problem described in the previous paragraph).	2014-12-15 10:07:06 -05:00
Wang Rui	9603bce7b1	qemu: make persistent update of graphics device supported We can change vnc password by using virDomainUpdateDeviceFlags API with live flag. But it can't be changed with config flag. Error is reported as below. error: Operation not supported: persistent update of device 'graphics' is not supported This patch supports the graphics arguments changed with config flag. Signed-off-by: Wang Rui <moon.wangrui@huawei.com>	2014-12-15 15:45:24 +01:00
Wang Rui	dec5f07b9e	qemu: fix alignment of qemuDomainFindGraphics Signed-off-by: Wang Rui <moon.wangrui@huawei.com>	2014-12-15 15:45:24 +01:00
Wang Rui	2609479b54	qemu: report properer error number when change graphics failed It's not supported to change some graphics arguments with '--live'. Replace some error code VIR_ERR_INTERNAL_ERROR and VIR_ERR_INVALID_ARG with VIR_ERR_OPERATION_UNSUPPORTED. Signed-off-by: Wang Rui <moon.wangrui@huawei.com>	2014-12-15 15:45:24 +01:00
Michal Privoznik	311b4a677f	qemu: Allow system pages to <memoryBacking/> https://bugzilla.redhat.com/show_bug.cgi?id=1173507 It occurred to me that OpenStack uses the following XML when not using regular huge pages: <memoryBacking> <hugepages> <page size='4' unit='KiB'/> </hugepages> </memoryBacking> However, since we are expecting to see huge pages only, we fail to startup the domain with following error: libvirtError: internal error: Unable to find any usable hugetlbfs mount for 4 KiB While regular system pages are not huge pages technically, our code is prepared for that and if it helps OpenStack (or other management applications) we should cope with that. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-15 13:36:47 +01:00
Michal Privoznik	ca4f9518b8	virconf: Introduce VIR_CONF_ULONG https://bugzilla.redhat.com/show_bug.cgi?id=1160995 In our config files users are expected to pass several integer values for different configuration knobs. However, majority of them expect a nonnegative number and only a few of them accept a negative number too (notably keepalive_interval in libvirtd.conf). Therefore, a new type to config value is introduced: VIR_CONF_ULONG that is set whenever an integer is positive or zero. With this approach knobs accepting VIR_CONF_LONG should accept VIR_CONF_ULONG too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-15 10:34:18 +01:00
Laine Stump	c5a54917d5	qemu: add a qemuInterfaceStopDevices(), called when guest CPUs stop We now have a qemuInterfaceStartDevices() which does the final activation needed for the host-side tap/macvtap devices that are used for qemu network connections. It will soon make sense to have the converse qemuInterfaceStopDevices() which will undo whatever was done during qemuInterfaceStartDevices(). A function to "stop" a single device has also been added, and is called from the appropriate place in qemuDomainDetachNetDevice(), although this is currently unnecessary - the device is going to immediately be deleted anyway, so any extra "deactivation" will be for naught. The call is included for completeness, though, in anticipation that in the future there may be some required action that isn't nullified by deleting the device. This patch is a part of a more complete fix for: https://bugzilla.redhat.com/show_bug.cgi?id=1081461	2014-12-13 22:20:28 -05:00
Laine Stump	879c13d6cc	qemu: always call qemuInterfaceStartDevices() when starting CPUs The patch that added qemuInterfaceStartDevices() (upstream commit `82977058f5`) had an extra conditional to prevent calling it if the reason for starting the CPUs was VIR_DOMAIN_RUNNING_UNPAUSED or VIR_DOMAIN_RUNNING_SAVE_CANCELED. This was put in by the author as the result of a reviewer asking if it was necessary to ifup the interfaces in all occasions (because these were the two cases where the CPU would have already been started (and stopped) once, so the interface would already be ifup'ed). It turns out that, as long as there is no corresponding qemuInterfaceStopDevices() to ifdown the interfaces anytime the CPUs are stopped, neglecting to ifup when reason is RUNNING_UNPAUSED or RUNNING_SAVE_CANCELED doesn't cause any problems (because it just happens that the interface will have already been ifup'ed by a prior call when the CPU was previously started for some other reason). However, it also doesn't help, and there will soon be a qemuInterfaceStopDevices() function which will ifdown these interfaces when the guest CPUs are stopped, and once that is done, the interfaces will be left down in some cases when they should be up (for example, if a domain is paused and then unpaused). So, this patch is removing the condition in favor of always calling qemuInterfaceStartDevices() when the guest CPUs are started. This patch (and the aforementioned patch) resolve: https://bugzilla.redhat.com/show_bug.cgi?id=1081461	2014-12-13 21:44:45 -05:00
Francesco Romani	cb104ef734	qemu: bulk stats: Fix logic in monitor handling A logic bug in qemuConnectGetAllDomainStats makes the code mark the monitor as available when qemuDomainObjBeginJob fails, instead of when it succeeds, as the correct flow requires. This patch fixes the check and updates the code documentation accordingly. Broken by commit `57023c0a3a`. Signed-off-by: Francesco Romani <fromani@redhat.com>	2014-12-11 11:02:05 +01:00
Matthew Rosato	82977058f5	network: Bring netdevs online later Currently, MAC registration occurs during device creation, which is early enough that, during live migration, you end up with duplicate MAC addresses on still-running source and target devices, even though the target device isn't actually being used yet. This patch proposes to defer MAC registration until right before the guest can actually use the device -- In other words, right before starting guest CPUs. Signed-off-by: Matthew Rosato <mjrosato@linux.vnet.ibm.com> Signed-off-by: Laine Stump <laine@laine.org>	2014-12-10 15:09:01 -05:00
Wang Rui	6ee1c0ff67	maint: clean up the unused variable 'caps' in src/qemu/qemu_*.c Signed-off-by: Wang Rui <moon.wangrui@huawei.com>	2014-12-10 11:21:31 +01:00
Martin Kletzander	57023c0a3a	CVE-2014-8131: Fix possible deadlock and segfault in qemuConnectGetAllDomainStats() When user doesn't have read access on one of the domains he requested, the for loop could exit abruptly or continue and override pointer which pointed to locked object. This patch fixed two issues at once. One is that domflags might have had QEMU_DOMAIN_STATS_HAVE_JOB even when there was no job started (this is fixed by doing domflags |= QEMU_DOMAIN_STATS_HAVE_JOB only when the job was acquired and cleaning domflags on every start of the loop). Second one is that the domain is kept locked when virConnectGetAllDomainStatsCheckACL() fails and continues the loop when it didn't end. Adding a simple virObjectUnlock() and clearing the pointer ought to do. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-10 09:11:57 +01:00
Peter Krempa	2bdcd29c71	qemu: migration: Unlock vm on failed ACL check in protocol v2 APIs Avoid leaving the domain locked on a failed ACL check in qemuDomainMigratePerform() and qemuDomainMigrateFinish2(). Introduced in commit `abf75aea24` (Add ACL checks into the QEMU driver).	2014-12-09 10:10:24 +01:00
Laine Stump	4aae2ed6fb	qemu: always use virDomainNetGetActualBridgeName to get interface's bridge qemuNetworkIfaceConnect() used to have a special case for actualType='network' (a network with forward mode of route, nat, or isolated) to call the libvirt public API to retrieve the bridge being used by a network. That is no longer necessary - since all network types that use a bridge and tap device now get the bridge name stored in the ActualNetDef, we can just always use virDomainNetGetActualBridgeName() instead. (an audit of the two callers to qemuNetworkIfaceConnect() confirms that it is never called for any other type of network, so the dead code in the else statement (logging an internal error if it is called for any other type of network) is eliminated in the process.)	2014-12-08 14:50:50 -05:00
Laine Stump	7cb822c2a5	qemu: setup tap devices for macTableManager='libvirt' When libvirt is managing the MAC table of a Linux host bridge, it must turn off learning and unicast_flood for each tap device attached to that bridge, then add a Forwarding Database (fdb) entry for the tap device using the MAC address from the domain interface config. Once we have disabled learning and flooding, any packet that has a destination MAC address not present in the fdb will be dropped by the bridge. This, along with the opportunistic disabling of promiscuous mode[*], can result in enhanced network performance and a potential slight security improvement. [*] If there is only one device on the bridge with learning/unicast_flood enabled, then that device will automatically have promiscuous mode disabled. If there are no devices with learning/unicast_flood enabled (e.g. for a libvirt "route", "nat", or isolated network that has no physical device attached), then all non-tap devices will have promiscuous mode disabled (tap devices always have promiscuous mode enabled, which may be a bug in the kernel, but in practice has 0 effect). None of this has any effect for kernels prior to 3.15 (upstream kernel commit 2796d0c648c940b4796f84384fbcfb0a2399db84 "bridge: Automatically manage port promiscuous mode"). Even after that, until kernel 3.17 (upstream commit 5be5a2df40f005ea7fb7e280e87bbbcfcf1c2fc0 "bridge: Add filtering support for default_pvid") traffic will not be properly forwarded without manually adding vlan table entries. Unfortunately, although the presence of the first patch is signalled by existence of the "learning" and "unicast_flood" options in sysfs, there is no reliable way to query whether or not the system's kernel has the second of those patches installed; the only thing that can be done is to try the setting and see if traffic continues to pass.	2014-12-08 14:49:09 -05:00
Eric Blake	7b499262cb	getstats: add block.n.path stat I'm about to make block stats optionally more complex to cover backing chains, where block.count will no longer equal the number of <disks> for a domain. For these reasons, it is nicer if the statistics output includes the source path (for local files). This patch doesn't add anything for network disks, although we may decide to add that later. With this patch, I now see the following for the same domain as in the previous patch (one qcow2 file, and an empty cdrom drive): $ virsh domstats --block foo Domain: 'foo' block.count=2 block.0.name=hda block.0.path=/var/lib/libvirt/images/foo.qcow2 block.1.name=hdc * src/libvirt-domain.c (virConnectGetAllDomainStats): Document new field. * tools/virsh.pod (domstats): Document new field. * src/qemu/qemu_driver.c (qemuDomainGetStatsBlock): Return the new stat for local files/block devices. (QEMU_ADD_NAME_PARAM): Add parameter. (qemuDomainGetStatsInterface): Update caller. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-08 11:58:39 -07:00
Eric Blake	56b21dfe0c	getstats: start giving offline block stats I noticed that for an offline domain, 'virsh domstats --block $dom' was producing just the domain name, with no stats. But the older 'virsh domblkinfo' works just fine on offline domains. This patch starts to get us closer, by at least reporting the disk names for an offline domain. With this patch, I now see the following for an offline domain with one qcow2 disk and an empty cdrom drive: $ virsh domstats --block foo Domain: 'foo' block.count=2 block.0.name=hda block.1.name=hdc * src/qemu/qemu_driver.c (qemuDomainGetStatsBlock): Don't short-circuit output of block name. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-08 11:55:12 -07:00
Eric Blake	2f61602edb	getstats: avoid memory leak on OOM qemuDomainGetStatsBlock() could leak a stats hash table if it encountered OOM while populating the virTypedParameters. Oddly, the fix doesn't even touch qemuDomainGetStatsBlock :) * src/qemu/qemu_driver.c (QEMU_ADD_COUNT_PARAM) (QEMU_ADD_NAME_PARAM): Don't return early. (qemuDomainGetStatsInterface): Adjust caller. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-08 09:43:35 -07:00
Daniel P. Berrange	25bf888a66	Report original error when QMP probing fails with new QEMU If probing capabilities via QMP fails, we now have a check that prevents us falling back to -help parsing. Unfortunately the error message "Failed to probe capabilities for /usr/bin/qemu-kvm: unsupported configuration: QEMU 2.1.2 is too new for help parsing" is proving rather unhelpful to the user. We need to be telling them why QMP failed (the root cause), rather than they can't use -help (the side effect). To do this we should capture stderr during QMP probing, and if -help parsing then sees a new QEMU version, we know that QMP should have worked, and so we can show the messages from stderr. The message thus becomes "Failed to probe capabilities for /usr/bin/qemu-kvm: internal error: QEMU / QMP failed: Could not access KVM kernel module: No such file or directory failed to initialize KVM: No such file or directory"	2014-12-05 10:57:46 +00:00
Shanzhi Yu	d1e460136a	qemu: snapshot: Forbid internal snapshot with passthrough devices When attempting to create internal system checkpoint with a passthrough device qemu will report the following error: error: operation failed: Error -22 while writing VM This patch calls the function to check if migration is possible with given VM and thus improves the error to: error: Requested operation is not valid: domain has assigned non-USB host devices Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=874418#c19 Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2014-12-05 11:08:45 +01:00
Peter Krempa	38bde5776a	qemu: process: Avoid uninitialized use two vars when reconnecting to vm `3ecebf0711` breaks the build as it adds a way to jump to cleanup before the 'cfg' object is retrieved and 'priv' is initialized.	2014-12-04 16:24:25 +01:00
Peter Krempa	3ecebf0711	qemu: process: Refactor reconnecting to qemu processes Move entering the job into the thread to simplify the program flow. Also as the code holds a separate reference to the domain object some conditions can be simplified. After this patch qemuDomainObjTransferJob is no longer needed so this patch removes it.	2014-12-04 15:28:39 +01:00
Erik Skultety	fe3691f663	qemu: Fix virsh freeze when blockcopy storage file is removed If someone removes the blockcopy storage file while still in the mirroring phase and then requests a blockjob abort using pivot, the virsh command freezes. This is not an issue with older qemu versions which did not support asynchronous jobs (which we prefer by default). As we have reached the mirroring phase successfully, polling the monitor for blockjob info always returns 1 and the loop never ends. This fix introduces a check for the qemuDomainBlockPivot return code, possibly skipping the asynchronous waiting completely, if an error occurred and asynchronous waiting was the preferred method. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1139567	2014-12-04 09:05:59 +01:00
Peter Krempa	48a055607c	qemu: driver: Reload snapshots and managedsaves prior to reconnecting Reconnect to the VM is a possibly long-running job spawned in a separate thread. We should reload the snapshot defs and managedsave state prior to spawning the thread to avoid blocking of the daemon startup which would serialize on the VM lock. Also the reloading code would violate the domain job held while reconnecting as the loader functions don't create jobs.	2014-12-03 18:50:22 +01:00
Michal Privoznik	cf54c60699	qemu_migration: Precreate missing storage Based on previous commit, we can now precreate missing volumes. While digging out the functionality from storage driver would be nicer, if you've seen the code it's nearly impossible. So I'm going from the other end: 1) For given disk target, disk path is looked up. 2) For the disk path, storage pool is looked up, a volume XML is constructed and then passed to virStorageVolCreateXML() which has all the knowledge how to create raw images, (encrypted) qcow(2) images, etc. One of the advantages of this approach is, we don't have to care about image conversion - qemu does that for us. So for instance, users can transform qcow2 into raw on migration (if the correct XML is passed to the migration API). Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-02 18:02:13 +01:00
Michal Privoznik	e1466dc7fa	qemu_migration: Send disk sizes to the other side Up 'til now, users need to precreate non-shared storage on migration themselves. This is not a very friendly requirement and we should do something about it. In this patch, the migration cookie is extended, so that <nbd/> section does not only contain NBD port, but info on disks being migrated. This patch sends a list of pairs of: <disk target; disk size> to the destination. The actual storage allocation is left for next commit. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-02 17:51:57 +01:00
Michal Privoznik	a714533b2b	qemuMonitorJSONBlockStatsUpdateCapacity: Don't skip disks The function queries the block devices visible to qemu ('query-block') and parses qemu's output. The info is returned in a hash table which is expected to be pre-filled by qemuMonitorJSONGetAllBlockStatsInfo(). However, in the next patch we are not going to call the latter function at all, so we should make the former function add devices into the hash table if not found there. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-12-02 17:51:57 +01:00
John Ferlan	c8230c4ded	Replace virDomainSnapshotFree with virObjectUnref Since virDomainSnapshotFree will call virObjectUnref anyway, let's just use that directly so as to avoid the possibility that we inadvertently clear out a pending error message when using the public API.	2014-12-02 11:03:41 -05:00
John Ferlan	121c09a90b	Replace virNetworkFree with virObjectUnref Since virNetworkFree will call virObjectUnref anyway, let's just use that directly so as to avoid the possibility that we inadvertently clear out a pending error message when using the public API.	2014-12-02 11:03:40 -05:00
John Ferlan	8fb3aee2f8	Replace virDomainFree with virObjectUnref Since virDomainFree will call virObjectUnref anyway, let's just use that directly so as to avoid the possibility that we inadvertently clear out a pending error message when using the public API.	2014-12-02 11:03:40 -05:00
Eduardo Costa	ff018e686a	Fix race condition in qemuGetProcessInfo There is a race condition between the fopen and fscanf calls in qemuGetProcessInfo. If fopen succeeds, there is a small possibility that the file no longer exists before reading from it. Now, if either the fopen or fscanf call fails, the function will behave just as if only fopen had failed. Fixes https://bugzilla.redhat.com/show_bug.cgi?id=1169055 Signed-off-by: Eric Blake <eblake@redhat.com>	2014-12-01 15:42:47 -07:00
John Ferlan	59802f23bc	hotplug: Resolve Coverity FORWARD_NULL Coverity complained that because the cfg->macFilter call checked net->ifname != NULL before calling ebtablesRemoveForwardAllowIn, then the virNetDevOpenvswitchRemovePort call should have the same check. However, if I move the ebtables call prior to the check for TYPE_DIRECT (where there is a VIR_FREE(net->ifname)), then it seems Coverity is happy. Since firewall info is tacked on last during setup, removing it in the opposite order of initialization seems to be natural anyway	2014-12-01 11:07:31 -05:00
Luyao Huang	f8c1fb3d2e	qemu: Make pid available for security managers in qemuProcessAttach There are some small issues in qemuProcessAttach: 1. Fix virSecurityManagerGetProcessLabel always getting pid = 0 by moving 'vm->pid = pid' before the call to virSecurityManagerGetProcessLabel. 2. Use virSecurityManagerGenLabel to get the image label. 3. Fix always setting the selinux label for other security driver labels. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2014-12-01 12:04:38 +01:00
Martin Kletzander	03caa543c2	conf: Add device-related code for panic devices Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1169183 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-01 12:01:27 +01:00
Martin Kletzander	bfeee8dee4	conf: Add device-related code for TPM devices Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1169183 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-12-01 12:01:27 +01:00
Erik Skultety	8e23e0e977	qemu: fix block{commit,copy} abort handling When a block{commit,copy} job was aborted on a domain, the block job handler did not process it correctly, leaving a phantom job in the background. Any further calls to any blockjob cause a "block <jobtype> still active" error. This patch fixes the blockjob handler so that it checks not only for VIR_DOMAIN_BLOCK_JOB_FAILED status, but VIR_DOMAIN_BLOCK_JOB_CANCELED status as well, followed by our existing cleanup routine. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1135169 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2014-12-01 10:09:03 +01:00
Wang Rui	111198210b	qemu: set jobinfo type to FAILED if job is failed in qemuMigrationRun If job is failed in qemuMigrationRun, we expect the jobinfo type as FAILED. But jobinfo type won't be updated until entering qemuMigrationWaitForCompletion. We should make it updated in all conditions. Moreover, we can't use qemuMigrationUpdateJobStatus here because job may fail in libvirt, so we can't query job status from QEMU. Signed-off-by: Wang Rui <moon.wangrui@huawei.com>	2014-12-01 08:17:24 +01:00
Wang Rui	0b0cba4dba	qemu: set jobinfo type to CANCELLED if migration is cancelled in all conditions The migration job status is traced in qemuMigrationUpdateJobStatus which is called in qemuMigrationRun. But if migration is cancelled before the trace such as in qemuMigrationDriveMirror, the jobinfo type won't be updated to CANCELLED. After this patch, we can get jobinfo type CANCELLED if migration is cancelled during drive mirror. Moreover, we can't use qemuMigrationUpdateJobStatus because from qemu's point of view it's just the drive mirror being cancelled and the migration hasn't even started yet. Signed-off-by: Wang Rui <moon.wangrui@huawei.com>	2014-12-01 08:17:24 +01:00
Michal Privoznik	6085d917d5	qemu: Don't track quiesced state of FSs https://bugzilla.redhat.com/show_bug.cgi?id=1160084 As of `b6d4dad11b` (1.2.5) we are trying to keep the status of FSFreeze in the guest. Even though I've tried to fix a couple of corner cases (`6ea54769ba`), it occurred to me just recently that the approach is broken by design. Firstly, there are many other ways to talk to qemu-ga (even through libvirt) that filesystems can be thawed (e.g. qemu-agent-command) without libvirt noticing. Moreover, there are plenty of ways to thaw filesystems without even qemu-ga noticing (yes, qemu-ga keeps internal track of FSFreeze status). So, instead of keeping the track ourselves, or asking qemu-ga for stale state, it's best to let qemu-ga deal with that (and possibly let guest kernel propagate an error). Moreover, there's one bug with the following approach: if fsfreeze command failed, we've executed fsthaw subsequently. So issuing domfsfreeze in virsh gave the following result: virsh # domfsfreeze gentoo Froze 1 filesystem(s) virsh # domfsfreeze gentoo error: Unable to freeze filesystems error: internal error: unable to execute QEMU agent command 'guest-fsfreeze-freeze': The command guest-fsfreeze-freeze has been disabled for this instance virsh # domfsfreeze gentoo Froze 1 filesystem(s) virsh # domfsfreeze gentoo error: Unable to freeze filesystems error: internal error: unable to execute QEMU agent command 'guest-fsfreeze-freeze': The command guest-fsfreeze-freeze has been disabled for this instance Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-11-28 11:22:24 +01:00
Jiri Denemark	9340528a7f	Fix usage of virReportSystemError virReportSystemError is reserved for reporting system errors, calling it with VIR_ERR_* error codes produces error messages that do not make any sense, such as internal error: guest failed to start: Kernel doesn't support user namespace: Link has been severed We should prohibit wrong usage with a syntax-check rule. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2014-11-28 09:42:13 +01:00
Wang Rui	64b84911ce	qemu: add the missing jobinfo type in qemuDomainGetJobInfo Commit `6fcddfcd` refactored job statistics but missed the jobinfo type updated in qemuDomainGetJobInfo. After this patch, we can use virDomainGetJobInfo to get jobinfo type again. Signed-off-by: Wang Rui <moon.wangrui@huawei.com> Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2014-11-25 14:40:19 +01:00
Pavel Hrdina	742d49fa17	qemu-command: introduce new vgamem attribute for QXL video device Add attribute to set vgamem_mb parameter of QXL device for QEMU. This value sets the size of VGA framebuffer for QXL device. Default value in QEMU is 8MB so reuse it also in libvirt to not break things. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1076098 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2014-11-24 22:20:13 +01:00
Pavel Hrdina	24c6ca860e	qemu-command: use vram attribute for all video devices So far we didn't have any option to set video memory size for qemu video devices. There was only the vram (ram for QXL) attribute but it was valid only for the QXL video device. To provide this feature to users QEMU has a dedicated device attribute called 'vgamem_mb' to set the video memory size. We will use the 'vram' attribute for setting video memory size for other QEMU video devices. For the cirrus device we will ignore the vram value because it has hardcoded video size in QEMU. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1076098 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2014-11-24 22:18:18 +01:00
Pavel Hrdina	f480a87aa6	caps: introduce new QEMU capability for vgamem_mb device property Allow setting vgamem size for video devices. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1076098 Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2014-11-24 22:05:56 +01:00

5142 Commits