libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2025-01-04 03:55:20 +00:00

Author	SHA1	Message	Date
Daniel P. Berrange	a45b99ead9	Introduce a more convenient virCgroupNewDetectMachine Instead of requiring drivers to use a combination of calls to virCgroupNewDetect and virCgroupIsValidMachine, combine the two into virCgroupNewDetectMachine Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-25 19:47:30 +01:00
Ján Tomko	926055474d	Don't overwrite errors in qemuTranslateDiskSourcePool Both virStoragePoolFree and virStorageVolFree reset the last error, which might lead to the cryptic message: An error occurred, but the cause is unknown When the volume wasn't found, virStorageVolFree was called with NULL, leading to an error: invalid storage volume pointer in virStorageVolFree This patch changes it to: Storage volume not found: no storage vol with matching name 'tomato'	2013-07-25 13:12:22 +02:00
Daniel P. Berrange	02098ac260	Convert QEMU driver to use virCgroupNewMachine Convert the QEMU driver code to use the new atomic API for setup of cgroups Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-25 11:42:47 +01:00
Martin Kletzander	b4a40dd92d	Use qemuOpenFile in qemu_driver.c On two places, the usage of open() is replaced with qemuOpenFile as that is the preferred method in those cases. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=963881	2013-07-24 14:29:12 +02:00
Martin Kletzander	849df2875d	Make qemuOpenFile aware of per-VM DAC seclabel. Function qemuOpenFile() haven't had any idea about seclabels applied to VMs only, so in case the seclabel differed from the "user:group" from configuration, there might have been issues with opening files. Make qemuOpenFile() VM-aware, but only optionally, passing NULL argument means skipping VM seclabel info completely. However, all current qemuOpenFile() calls look like they should use VM seclabel info in case there is any, so convert these calls as well. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=869053	2013-07-24 14:29:11 +02:00
Laine Stump	3ceb4c7df6	qemu: set/validate slot/connection type when assigning slots for PCI devices Since PCI bridges, PCIe bridges, PCIe switches, and PCIe root ports all share the same namespace, they are all defined as controllers of type='pci' in libvirt (but with a differing model attribute). Each of these controllers has a certain connection type upstream, allows certain connection types downstream, and each can either allow a single downstream connection at slot 0, or connections from slot 1 - 31. Right now, we only support the pci-root and pci-bridge devices, both of which only allow PCI devices to connect, and both which have usable slots 1 - 31. In preparation for adding other types of controllers that have different capabilities, this patch 1) adds info to the qemuDomainPCIAddressBus object to indicate the capabilities, 2) sets those capabilities appropriately for pci-root and pci-bridge devices, and 3) validates that the controller being connected to is the proper type when allocating slots or validating that a user-selected slot is appropriate for a device.. Having this infrastructure in place will make it much easier to add support for the other PCI controller types. While it would be possible to do all the necessary checking by just storing the controller model in the qemyuDomainPCIAddressBus, it greatly simplifies all the validation code to also keep a "flags", "minSlot" and "maxSlot" for each - that way we can just check those attributes rather than requiring a nearly identical switch statement everywhere we need to validate compatibility. You may notice many places where the flags are seemingly hard-coded to QEMU_PCI_CONNECT_HOTPLUGGABLE \| QEMU_PCI_CONNECT_TYPE_PCI This is currently the correct value for all PCI devices, and in the future will be the default, with small bits of code added to change to the flags for the few devices which are the exceptions to this rule. Finally, there are a few places with "FIXME" comments. Note that these aren't indicating places that are broken according to the currently supported devices, they are places that will need fixing when support for new PCI controller models is added. To assure that there was no regression in the auto-allocation of PCI addresses or auto-creation of integrated pci-root, ide, and usb controllers, a new test case (pci-bridge-many-disks) has been added to both the qemuxml2argv and qemuxml2xml tests. This new test defines a domain with several dozen virtio disks but no pci-root or pci-bridges. The .args file of the new test case was created using libvirt sources from before this patch, and the test still passes after this patch has been applied.	2013-07-24 06:45:07 -04:00
Laine Stump	9adafa08e6	qemu: make QEMU_PCI_ADDRESS_(SLOT\|FUNCTION)_LAST less misleading Although these two enums are named ..._LAST, they really had the value of ..._SIZE. This patch changes their values so that, e.g., QEMU_PCI_ADDRESS_SLOT_LAST really is the slot number of the last slot on a PCI bus.	2013-07-24 06:31:28 -04:00
Laine Stump	fcbfd58429	qemu: only check for PIIX3-specific device addrs on pc-* machinetypes The implicit IDE, USB, and video controllers provided by the PIIX3 chipset in the pc-* machinetypes are not present on other machinetypes, so we shouldn't be doing the special checking for them. This patch places those validation checks into a separate function that is only called for machine types that have a PIIX3 chip (which happens to be the i440fx-based pc-* machine types). One qemuxml2argv test data file had to be changed - the pseries-usb-multi test had included a piix3-usb-uhci device, which was being placed at a specific address, and also had slot 2 auto reserved for a video device, but the pseries virtual machine doesn't actually have a PIIX3 chip, so even if there was a piix3-usb-uhci driver for it, the device wouldn't need to reside at slot 1 function 2. I just changed the .argv file to have the generic slot info for the two devices that results when the special PIIX3 code isn't executed.	2013-07-24 06:29:23 -04:00
Laine Stump	23cc535220	qemu: turn qemuDomainPCIAddressBus into a struct qemuDomainPCIAddressBus was an array of QEMU_PCI_ADDRESS_SLOT_LAST uint8_t's, which worked fine as long as every PCI bus was identical. In the future, some PCI busses will allow connecting PCI devices, and some will allow PCIe devices; also some will only allow connection of a single device, while others will allow connecting 31 devices. In order to keep track of that information for each bus, we need to turn qemuDomainPCIAddressBus into a struct, for now with just one member: uint8_t slots[QEMU_PCI_ADDRESS_SLOT_LAST]; Additional members will come in later patches. The item in qemuDomainPCIAddresSet that contains the array of qemuDomainPCIAddressBus is now called "buses" to be more consistent with the already existing "nbuses" (and with the new "slots" array).	2013-07-24 06:24:57 -04:00
Daniel P. Berrange	2049ef9942	Create + setup cgroups atomically for QEMU process Currently the QEMU driver creates the VM's cgroup prior to forking, and then uses a virCommand hook to move the child into the cgroup. This won't work with systemd whose APIs do the creation of cgroups + attachment of processes atomically. Fortunately we have a handshake taking place between the QEMU driver and the child process prior to QEMU being exec()d, which was introduced to allow setup of disk locking. By good fortune this synchronization point can be used to enable the QEMU driver to do atomic setup of cgroups removing the use of the hook script. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-23 22:46:31 +01:00
Daniel P. Berrange	87b2e6fa84	Auto-detect existing cgroup placement Use the new virCgroupNewDetect function to determine cgroup placement of existing running VMs. This will allow the legacy cgroups creation APIs to be removed entirely Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-23 22:46:31 +01:00
John Ferlan	200ed39d0d	qemu_common: Create qemuBuildVolumeString() to process storage pool Split out into its own separate routine	2013-07-23 10:49:02 -04:00
John Ferlan	7fa4a88afa	qemu: Create a common qemuGetSecretString Make the secret fetching code common for qemuBuildRBDString() and qemuBuildDriveURIString() using the virDomainDiskDef.	2013-07-23 10:49:02 -04:00
John Ferlan	b83556d8e7	qemu: Add source pool auth info to virDomainDiskDef for iSCSI During qemuTranslateDiskSourcePool() execution, if the srcpool has been defined with authentication information, then for iSCSI pools copy the authentication and host information to virDomainDiskDef.	2013-07-23 10:49:02 -04:00
Peter Krempa	29c2208c04	qemu: Take error path if acquiring of job fails in qemuDomainSaveInternal Due to a goto statement missed when refactoring in `2771f8b74c` when acquiring of a domain job failed the error path was not taken. This resulted into a crash afterwards as an extra reference was removed from a domain object leading to it being freed. An attempt to list the domains leaded to a crash of the daemon afterwards. https://bugzilla.redhat.com/show_bug.cgi?id=928672	2013-07-23 16:27:56 +02:00
Osier Yang	b6c162d3bb	qemu: Translate the volume type disk source before cgroup setting The translation must be done before both of cgroup and security setting, otherwise since the disk source is not translated yet, it might be skipped on cgroup and security setting.	2013-07-22 14:03:31 -04:00
John Ferlan	1b4eaa6195	qemu: Translate the iscsi pool/volume disk source The difference with already supported pool types (dir, fs, block) is: there are two modes for iscsi pool (or network pools in future), one can specify it either to use the volume target path (the path showed up on host) with mode='host', or to use the remote URI qemu supports (e.g. file=iscsi://example.org:6000/iqn.1992-01.com.example/1) with mode='direct'. For 'host' mode, it copies the volume target path into disk->src. For 'direct' mode, the corresponding info in the one pool source host def is copied to disk->hosts[0].	2013-07-22 14:01:04 -04:00
John Ferlan	1f49b05a82	conf: Introduce virDomainDiskSourceIsBlockType Introduce a new helper to check if the disk source is of block type	2013-07-22 14:01:04 -04:00
Daniel P. Berrange	0d7f45aea7	Convert remainder of cgroups code to report errors Convert the remaining methods in vircgroup.c to report errors instead of returning errno values. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-22 13:09:58 +01:00
Daniel P. Berrange	b64dabff27	Report full errors from virCgroupNew* Instead of returning raw errno values, report full libvirt errors in virCgroupNew* functions. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-22 13:09:58 +01:00
Jiri Denemark	1dfa174ad2	cpu: Store arch in virCPUData	2013-07-22 13:56:54 +02:00
Jiri Denemark	57d52b244b	Replace union cpuData with virCPUData	2013-07-22 13:54:46 +02:00
Viktor Mihajlovski	1a82e01c97	qemu: Shorten SCSI hostdev alias to avoid QEMU failure The alias for hostdevs of type SCSI can be too long for QEMU if larger LUNs are encountered. Here's a real life example: <hostdev mode='subsystem' type='scsi' managed='no'> <source> <adapter name='scsi_host0'/> <address bus='0' target='19' unit='1088634913'/> </source> <address type='drive' controller='0' bus='0' target='0' unit='0'/> </hostdev> this results in a too long drive id, resulting in QEMU yelling Property 'scsi-generic.drive' can't find value 'drive-hostdev-scsi_host0-0-19-1088634913' This commit changes the alias back to the default hostdev$(index) scheme. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-07-22 13:16:29 +02:00
Jiri Denemark	0dfb8a1b9e	qemu: Unplug devices that disappeared when libvirtd was down In case libvirtd is asked to unplug a device but the device is actually unplugged later when libvirtd is not running, we need to detect that and remove such device when libvirtd starts again and reconnects to running domains.	2013-07-19 18:45:48 +02:00
Jiri Denemark	58b147ad07	qemu: Introduce qemuMonitorGetDeviceAliases This API provides a NULL-terminated list of devices which are currently attached to a QEMU domain.	2013-07-19 18:45:47 +02:00
Jiri Denemark	d327ac5328	conf: Make error reporting in virDomainDefFindDevice optional	2013-07-19 17:59:47 +02:00
Eric Blake	fdb3bde31c	security: framework for driver PreFork handler A future patch wants the DAC security manager to be able to safely get the supplemental group list for a given uid, but at the time of a fork rather than during initialization so as to pick up on live changes to the system's group database. This patch adds the framework, including the possibility of a pre-fork callback failing. For now, any driver that implements a prefork callback must be robust against the possibility of being part of a security stack where a later element in the chain fails prefork. This means that drivers cannot do any action that requires a call to postfork for proper cleanup (no grabbing a mutex, for example). If this is too prohibitive in the future, we would have to switch to a transactioning sequence, where each driver has (up to) 3 callbacks: PreForkPrepare, PreForkCommit, and PreForkAbort, to either clean up or commit changes made during prepare. * src/security/security_driver.h (virSecurityDriverPreFork): New callback. * src/security/security_manager.h (virSecurityManagerPreFork): Change signature. * src/security/security_manager.c (virSecurityManagerPreFork): Optionally call into driver, and allow returning failure. * src/security/security_stack.c (virSecurityDriverStack): Wrap the handler for the stack driver. * src/qemu/qemu_process.c (qemuProcessStart): Adjust caller. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-07-18 15:19:36 -06:00
Jiri Denemark	984c01ba5c	qemu: Emit VIR_DOMAIN_EVENT_ID_DEVICE_REMOVED events	2013-07-18 15:28:45 +02:00
Jiri Denemark	3fbf78bdf3	qemu: Remove devices only after DEVICE_DELETED event	2013-07-18 15:28:45 +02:00
Jiri Denemark	ab47cc9bf9	qemu: Add support for DEVICE_DELETED event	2013-07-18 15:28:45 +02:00
Jiri Denemark	d077cda4e9	qemu: Separate char device removal into a standalone function	2013-07-18 15:18:04 +02:00
Peter Krempa	bac2182041	qemu: Cleanup coding style nits in qemu_cgroup.c	2013-07-18 14:58:12 +02:00
Osier Yang	a39f69d2bb	qemu: Set cpuset.cpus for domain process When either "cpuset" of <vcpu> is specified, or the "placement" of <vcpu> is "auto", only setting the cpuset.mems might cause the guest starting to fail. E.g. ("placement" of both <vcpu> and <numatune> is "auto"): 1) Related XMLs <vcpu placement='auto'>4</vcpu> <numatune> <memory mode='strict' placement='auto'/> </numatune> 2) Host NUMA topology % numactl --hardware available: 8 nodes (0-7) node 0 cpus: 0 4 8 12 16 20 24 28 node 0 size: 16374 MB node 0 free: 11899 MB node 1 cpus: 32 36 40 44 48 52 56 60 node 1 size: 16384 MB node 1 free: 15318 MB node 2 cpus: 2 6 10 14 18 22 26 30 node 2 size: 16384 MB node 2 free: 15766 MB node 3 cpus: 34 38 42 46 50 54 58 62 node 3 size: 16384 MB node 3 free: 15347 MB node 4 cpus: 3 7 11 15 19 23 27 31 node 4 size: 16384 MB node 4 free: 15041 MB node 5 cpus: 35 39 43 47 51 55 59 63 node 5 size: 16384 MB node 5 free: 15202 MB node 6 cpus: 1 5 9 13 17 21 25 29 node 6 size: 16384 MB node 6 free: 15197 MB node 7 cpus: 33 37 41 45 49 53 57 61 node 7 size: 16368 MB node 7 free: 15669 MB 4) cpuset.cpus will be set as: (from debug log) 2013-05-09 16:50:17.296+0000: 417: debug : virCgroupSetValueStr:331 : Set value '/sys/fs/cgroup/cpuset/libvirt/qemu/toy/cpuset.cpus' to '0-63' 5) The advisory nodeset got from querying numad (from debug log) 2013-05-09 16:50:17.295+0000: 417: debug : qemuProcessStart:3614 : Nodeset returned from numad: 1 6) cpuset.mems will be set as: (from debug log) 2013-05-09 16:50:17.296+0000: 417: debug : virCgroupSetValueStr:331 : Set value '/sys/fs/cgroup/cpuset/libvirt/qemu/toy/cpuset.mems' to '0-7' I.E, the domain process's memory is restricted on the first NUMA node, however, it can use all of the CPUs, which will likely cause the domain process to fail to start because of the kernel fails to allocate memory with the the memory policy as "strict". % tail -n 20 /var/log/libvirt/qemu/toy.log ... 2013-05-09 05:53:32.972+0000: 7318: debug : virCommandHandshakeChild:377 : Handshake with parent is done char device redirected to /dev/pts/2 (label charserial0) kvm_init_vcpu failed: Cannot allocate memory ... Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2013-07-18 14:57:57 +02:00
Martin Kletzander	b7f1c0c387	Add virtio-scsi to fallback models of scsi controller When user does not specify any model for scsi controller, or worse, no controller at all, but libvirt automatically adds scsi controller with no model, we are not searching for virtio-scsi and thus this can fail for example on qemu which doesn't support lsi logic adapter. This means that when qemu on x86 doesn't support lsi53c895a and the user adds the following to an XML without any scsi controller: <disk ...> ... <target dev='sda'> </disk> libvirt fails like this: # virsh define asdf.xml error: Failed to define domain from asdf.xml error: internal error Unable to determine model for scsi controller Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=974943	2013-07-18 14:36:57 +02:00
Michal Privoznik	272769becc	qemu: Move close callbacks handling into util/virclosecallbacks.c	2013-07-18 14:16:53 +02:00
Michal Privoznik	b7658f6234	qemuDomainDetachChrDevice: Don't leak @charAlias Moreover, since virAsprintf now does report OOM error, there's no need to call virReportOOMError in error path.	2013-07-18 14:16:53 +02:00
Ján Tomko	23e938ee63	virAsprintf: correctly check return value When virAsprintf was changed from a function to a macro reporting OOM error in `dc6f2da`, it was documented as returning 0 on success. This is incorrect, it returns the number of bytes written as asprintf does. Some of the functions were converted to use virAsprintf's return value directly, changing the return value on success from 0 to >= 0. For most of these, this is not a problem, but the change in virPCIDriverDir breaks PCI passthrough. The return value check in virhashtest pre-dates virAsprintf OOM conversion. vmwareMakePath seems to be unused.	2013-07-18 14:05:46 +02:00
Daniel P. Berrange	040d996342	Merge virCommandPreserveFD / virCommandTransferFD Merge the virCommandPreserveFD / virCommandTransferFD methods into a single virCommandPasFD method, and use a new VIR_COMMAND_PASS_FD_CLOSE_PARENT to indicate their difference in behaviour Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-18 12:18:24 +01:00
Michal Privoznik	e80e07f657	qemuDomainGetSchedulerType: Prefer qemuDomObjFromDomain In all qemu APIs we tend to prefer qemuDomObjFromDomain over virDomainObjListFindByUUID. But somehow the qemuDomainGetSchedulerType left unattended.	2013-07-17 12:37:15 +02:00
Jiri Denemark	53f3739afe	qemu: Separate host device removal into a standalone function	2013-07-16 20:29:04 +02:00
Jiri Denemark	ac68a785cc	qemu: Separate net device removal into a standalone function	2013-07-16 20:29:04 +02:00
Jiri Denemark	92758a71d8	qemu: Separate controller removal into a standalone function	2013-07-16 20:29:04 +02:00
Jiri Denemark	a22ae222ee	qemu: Separate disk device removal into a standalone function	2013-07-16 20:29:04 +02:00
Jiri Denemark	89b7bb75d7	qemu: Add qemuDomainReleaseDeviceAddress to remove any address	2013-07-16 20:29:04 +02:00
Eric Blake	cbe31911ad	build: avoid compiler warning on shadowed name Introduced in commit 24b08219; compilation on RHEL 6.4 complained: qemu/qemu_hotplug.c: In function 'qemuDomainAttachChrDevice': qemu/qemu_hotplug.c:1257: error: declaration of 'remove' shadows a global declaration [-Wshadow] /usr/include/stdio.h:177: error: shadowed declaration is here [-Wshadow] * src/qemu/qemu_hotplug.c (qemuDomainAttachChrDevice): Avoid the name 'remove'. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-07-16 10:11:32 -06:00
Peter Krempa	dfc692350a	qemu: Fix double free of returned JSON array in qemuAgentGetVCPUs() A part of the returned monitor response was freed twice and caused crashes of the daemon when using guest agent cpu count retrieval. # virsh vcpucount dom --guest Introduced in v1.0.6-48-gc6afcb0	2013-07-16 16:51:36 +02:00
John Ferlan	2431269bd3	Implement the virDomainSetMemoryStatsPeriod for QEMU driver Implement the new API that will handle setting the balloon driver statistics collection period in order to enable or disable the collection dynamically.	2013-07-16 08:44:53 -04:00
John Ferlan	ab60062117	Add capability to fetch balloon stats This patch will add the qemuMonitorJSONGetMemoryStats() to execute a "guest-stats" on the balloonpath using "get-qom" replacing the former mechanism which looked through the "query-ballon" returned data for the fields. The "query-balloon" code only returns 'actual' memory. Rather than duplicating the existing code, have the JSON API use the GetBalloonInfo API. A check in the qemuMonitorGetMemoryStats() will be made to ensure the balloon driver path has been set. Since the underlying JSON code can return data not associated with the balloon driver, we don't fail on a failure to get the balloonpath. Of course since we've made the check, we can then set the ballooninit flag. Getting the path here is primarily due to the process reconnect path which doesn't attempt to set the collection period.	2013-07-16 08:44:52 -04:00
John Ferlan	ffdf82a9da	Determine whether to start balloon memory stats gathering. At vm startup and attach attempt to set the balloon driver statistics collection period based on the value found in the domain xml file. This is not done at reconnect since it's possible that a collection period was set on the live guest and making the set period call would reset to whatever value is stored in the config file. Setting the stats collection period has a side effect of searching through the qom-list output for the virtio balloon driver and making sure that it has the right properties in order to allow setting of a collection period and eventually fetching of statistics. The walk through the qom-list is expensive and thus the balloonpath will be saved in the monitor private structure as well as a flag indicating that the initialization has already been attempted (in the event that a path is not found, no sense to keep checking). This processing model conforms to the qom object model model which requires setting object properties after device startup. That is, it's not possible to pass the period along via the startup code as it won't be recognized.	2013-07-16 08:44:52 -04:00
Alex Jia	96518d4316	qemu: Prevent crash of libvirtd without guest agent configuration If users haven't configured guest agent then qemuAgentCommand() will dereference a NULL 'mon' pointer, which causes crash of libvirtd when using agent based cpu (un)plug. With the patch, when the qemu-ga service isn't running in the guest, a expected error "error: Guest agent is not responding: Guest agent not available for now" will be raised, and the error "error: argument unsupported: QEMU guest agent is not configured" is raised when the guest hasn't configured guest agent. GDB backtrace: (gdb) bt #0 virNetServerFatalSignal (sig=11, siginfo=<value optimized out>, context=<value optimized out>) at rpc/virnetserver.c:326 #1 <signal handler called> #2 qemuAgentCommand (mon=0x0, cmd=0x7f39300017b0, reply=0x7f394b090910, seconds=-2) at qemu/qemu_agent.c:975 #3 0x00007f39429507f6 in qemuAgentGetVCPUs (mon=0x0, info=0x7f394b0909b8) at qemu/qemu_agent.c:1475 #4 0x00007f39429d9857 in qemuDomainGetVcpusFlags (dom=<value optimized out>, flags=9) at qemu/qemu_driver.c:4849 #5 0x00007f3957dffd8d in virDomainGetVcpusFlags (domain=0x7f39300009c0, flags=8) at libvirt.c:9843 How to reproduce? # To start a guest without guest agent configuration # then run the following cmdline # virsh vcpucount foobar --guest error: End of file while reading data: Input/output error error: One or more references were leaked after disconnect from the hypervisor error: Failed to reconnect to the hypervisor RHBZ: https://bugzilla.redhat.com/show_bug.cgi?id=984821 Signed-off-by: Alex Jia <ajia@redhat.com> Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2013-07-16 14:14:07 +02:00
Michal Privoznik	24b0821926	qemu: Implement chardev hotplug on live level Since previous patches has prepared everything for us, we may now implement live hotplug of a character device.	2013-07-16 11:47:39 +02:00
Michal Privoznik	75f0fd5112	qemu: Implement chardev hotplug on config level There are two levels on which a device may be hotplugged: config and live. The config level requires just an insert or remove from internal domain definition structure, which is exactly what this patch does. There is currently no implementation for a chardev update action, as there's not much to be updated. But more importantly, the only thing that can be updated is path or socket address by which chardevs are distinguished. So the update action is currently not supported.	2013-07-16 11:47:39 +02:00
John Ferlan	50336d871a	Add qemuMonitorJSONSetObjectProperty() method for QMP qom-set command Add a new qemuMonitorJSONSetObjectProperty() method to support invocation of the 'qom-set' JSON monitor command with a provided path, property, and expected data type to set. NOTE: The set API was added only for the purpose of the qemumonitorjsontest The test code uses the same "/machine/i440fx" property as the get test and attempts to set the "realized" property to "true" (which it should be set at anyway).	2013-07-15 12:26:16 -04:00
John Ferlan	bdce278984	Add qemuMonitorJSONGetObjectProperty() method for QMP qom-get command Add a new qemuMonitorJSONGetObjectProperty() method to support invocation of the 'qom-get' JSON monitor command with a provided path, property, and expected data type return. The qemuMonitorJSONObjectProperty is similar to virTypedParameter; however, a future patch will extend it a bit to include a void pointer to balloon driver statistic data. NOTE: The ObjectProperty structures and API are added only for the purpose of the qemumonitorjsontest The provided test will execute a qom-get on "/machine/i440fx" which will return a property "realized".	2013-07-15 12:26:16 -04:00
John Ferlan	d76a89780b	Add qemuMonitorJSONGetObjectListPaths() method for QMP qom-list command Add a new qemuMonitorJSONGetObjectListPaths() method to support invocation of the 'qom-list' JSON monitor command with a provided path. NOTE: The ListPath structures and API's are added only for the purpose of the qemumonitorjsontest The returned list of paired data fields of "name" and "type" that can be used to peruse QOM configuration data and eventually utilize for the balloon statistics. The test does a "{"execute":"qom-list", "arguments": { "path": "/"}}" which returns "{"return": [{"name": "machine", "type": "child<container>"}, {"name": "type", "type": "string"}]}" resulting in a return of an array of 2 elements with [0].name="machine", [0].type="child<container>". The [1] entry appears to be a header that could be used some day via a command such as "virsh qemuobject --list" to format output.	2013-07-15 12:26:15 -04:00
Matthew Rosato	97f97a4907	qemu: add macvlan delete to qemuDomainAttachNetDevice cleanup If an error occurs during qemuDomainAttachNetDevice after the macvtap was created in qemuPhysIfaceConnect, the macvtap device gets left behind. This patch adds code to the cleanup routine to delete the macvtap. Signed-off-by: Matthew Rosato <mjrosato@linux.vnet.ibm.com> Reviewed-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-07-15 10:43:03 -04:00
Laine Stump	9e37f57f43	pci: make virPCIDeviceReset more autonomous I recently patches the callers to virPCIDeviceReset() to not call it if the current driver for a device was vfio-pci (since that driver will always reset the device itself when appropriate. At the time, Dan Berrange suggested that I could instead modify virPCIDeviceReset to check the currently bound driver for the device, and decide for itself whether or not to go ahead with the reset. This patch removes the previously added checks, and replaces them with a check down in virPCIDeviceReset(), as suggested. The functional difference here is that previously we were deciding based on either the hostdev configuration or the value of stubDriverName in the virPCIDevice object, but now we are actually comparing to the "driver" link in the device's sysfs entry directly. In practice, both should be the same.	2013-07-15 10:43:03 -04:00
Michal Privoznik	797b1ffce1	qemuBuildChrDeviceCommandLine: Don't leak devstr It's caller's responsibility to free return value of qemuBuildChrDeviceStr().	2013-07-15 16:25:11 +02:00
Jincheng Miao	945b18eb7d	Change domain controller index type to unsigned Error out on negative index values. https://bugzilla.redhat.com/show_bug.cgi?id=981261	2013-07-12 14:55:04 +02:00
Michal Privoznik	f293d76333	qemu: Introduce qemuBuildChrDeviceStr The function being introduced is responsible for creating command line argument for '-device' for given character device. Based on the chardev type, it calls appropriate qemuBuild.*ChrDeviceStr(), e.g. qemuBuildSerialChrDeviceStr() for serial chardev and so on.	2013-07-12 11:00:28 +02:00
Michal Privoznik	2a9a5bef97	qemu_command: Honour chardev alias assignment with a function The chardev alias assignment is going to be needed in a separate places, so it should be moved into a separate function rather than copying code randomly around.	2013-07-12 11:00:08 +02:00
Michal Privoznik	0f7a7ce5ff	qemu_monitor: Introduce qemuMonitorDetachCharDev This function wraps 'chardev-remove' qemu monitor command around. It takes chardev alias as its single argument besides qemu monitor pointer.	2013-07-12 11:00:04 +02:00
Michal Privoznik	4a51447abe	qemu_monitor: Introduce qemuMonitorAttachCharDev The function being introduced is responsible for preparing and executing 'chardev-add' qemu monitor command. Moreover, in case of PTY chardev, the corresponding pty path is updated.	2013-07-12 11:00:01 +02:00
Michal Privoznik	41e826d539	qemu_monitor_json: Move InetSocketAddress build to a separate function Currently, we are building InetSocketAddress qemu json type within the qemuMonitorJSONNBDServerStart function. However, other future functions may profit from the code as well. So it should be moved into a static function.	2013-07-12 10:59:57 +02:00
John Ferlan	a5fcea5513	qemu_hostdev: Resolve Coverity issue Recent changes uncovered a possibility that 'last_processed_hostdev_vf' was set to -1 in 'qemuPrepareHostdevPCIDevices' and would cause problems in for loop end condition in the 'resetvfnetconfig' label if the variable was never set to 'i' due to 'qemuDomainHostdevNetConfigReplace' failure.	2013-07-11 14:18:12 -04:00
Michal Privoznik	95ff6a3993	qemu: Fix hot (un-)plug error codes and messages With current code, error reporting for unsupported devices for hot plug, unplug and update is total mess. The VIR_ERR_CONFIG_UNSUPPORTED error code is reported instead of VIR_ERR_OPERATION_UNSUPPORTED. Moreover, the error messages are not helping to find the root cause (lack of implementation).	2013-07-11 16:19:10 +02:00
Jiri Denemark	f24e90d542	qemu: Slightly increase memory limit For low-memory domains (roughly under 400MB) our automatic memory limit computation comes up with a limit that's too low. This is because the 0.5 multiplication does not add enough for such small values. Let's increase the constant part of the computation to fix this.	2013-07-11 11:17:47 +02:00
Daniel P. Berrange	50760e2a8a	Convert 'int i' to 'size_t i' in src/qemu files Convert the type of loop iterators named 'i', 'j', k', 'ii', 'jj', 'kk', to be 'size_t' instead of 'int' or 'unsigned int', also santizing 'ii', 'jj', 'kk' to use the normal 'i', 'j', 'k' naming Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-10 17:55:15 +01:00
Ján Tomko	f38c8185f9	Fix crash when multiple event callbacks were registered CVE-2013-2230 Don't overwrite the callback ID returned by virDomainEventStateRegisterID in ret by 0. Introduced by `abf75aea`.	2013-07-10 13:02:30 +02:00
Ján Tomko	5744d96f21	qemu: fix double free in qemuMigrationPrepareDirect Remove assignment of the string freed by virURIFree to hostname, since it's not used anywhere. Double free introduced by `ddf8ad8`, useless code introduced by `f03dcc5`. https://bugzilla.redhat.com/show_bug.cgi?id=977961	2013-07-10 12:48:54 +02:00
Michal Privoznik	e987a30dfa	Adapt to VIR_ALLOC and virAsprintf in src/qemu/*	2013-07-10 11:07:32 +02:00
Michal Privoznik	f2d5e864a2	Adapt to VIR_ALLOC and virAsprintf in src/conf/*	2013-07-10 11:07:31 +02:00
Eric Blake	5598f81fe6	maint: fix typo in qemu error message Introduced in commit `d47eff88`. * src/qemu/qemu_driver.c (qemuDomainSetVcpusFlags): Fix spelling. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-07-09 11:39:07 -06:00
Jiri Denemark	59cc0fe5aa	qemu: Set RLIMIT_MEMLOCK when memoryBacking/locked is used If a domain is configured to have all its memory locked, we need to set RLIMIT_MEMLOCK so that QEMU is actually allowed to lock the memory.	2013-07-08 12:35:28 +02:00
Jiri Denemark	6d8ebc7538	qemu: Use qemuDomainMemoryLimit when computing memory for VFIO	2013-07-08 12:35:27 +02:00
Jiri Denemark	e0e438af00	qemu: Move memory limit computation to a reusable function	2013-07-08 12:35:27 +02:00
Jiri Denemark	86dba8f3de	Don't spam logs with "port 0 must be in range" errors Whenever virPortAllocatorRelease is called with port == 0, it complains that the port is not in an allowed range, which is expectable as the port was never allocated. Let's make virPortAllocatorRelease ignore 0 ports in a similar way free() ignores NULL pointers.	2013-07-08 12:27:58 +02:00
Jiri Denemark	0d7dc70824	qemu: Release correct websocket port	2013-07-08 12:27:58 +02:00
Jiri Denemark	d4ce75ba76	Paused domain should remain paused after migration https://bugzilla.redhat.com/show_bug.cgi?id=981139 If a domain is paused before migration starts, we need to tell that to the destination libvirtd to prevent it from resuming the domain at the end of migration. This regression was introduced by commit `5379bb0`.	2013-07-08 12:27:58 +02:00
Jiri Denemark	db0a18a165	Fix NULL dereference caused by ACL filtering of domains Caused by `763973607d`.	2013-07-04 16:55:53 +02:00
Daniel P. Berrange	763973607d	Add access control filtering of domain objects Ensure that all APIs which list domain objects filter them against the access control system. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-07-03 15:54:53 +01:00
Martin Kletzander	a72582cb91	qemu: Allow seamless migration for domains with multiple graphics Since commit `23e8b5d8`, the code is refactored in a way that supports domains with multiple graphics elements and commit `37b415200` allows starting such domains. However none of those commits take migration into account. Even though qemu doesn't support relocation for anything else than SPICE and for no more than one graphics, there is no reason to hardcode one graphics into this part of the code as well.	2013-07-03 14:58:01 +02:00
Martin Kletzander	556808ec9d	qemu: Don't miss errors when changing graphics passwords Commit `23e8b5d8e7` forgot to check the return value for all calls to qemuDomainChangeGraphicsPasswords().	2013-07-03 14:56:13 +02:00
Chen Fan	36bac65d8a	qemu: Implement 'oncrash' coredump events when guest panicked Add doDumpCoreToAutoPath to implement 'coredump-destroy' and 'coredump-restart' events of the 'on_crash' in the XML when domain crashed.	2013-07-02 12:02:31 -06:00
Chen Fan	9aa527dccb	qemu: Implement 'oncrash' events when guest panicked Add monitor callback API domainGuestPanic, that implements 'destroy', 'restart' and 'preserve' events of the 'on_crash' in the XML when domain crashed.	2013-07-02 12:02:30 -06:00
Chen Fan	e8ccf7ed8a	qemu: expose qemuProcessShutdownOrReboot() Later code will need this outside of qemu_process.c	2013-07-02 12:02:27 -06:00
Chen Fan	bcf0c14491	qemu: refactor processWatchdogEvent Split the code to make the driver workpool more generalized	2013-07-02 12:02:27 -06:00
Michal Privoznik	bc09c5d335	qemuNodeDeviceDetachFlags: Avoid use of uninitialized variables After `abf75aea24` the compiler screams: qemu/qemu_driver.c: In function 'qemuNodeDeviceDetachFlags': qemu/qemu_driver.c:10693:9: error: 'domain' may be used uninitialized in this function [-Werror=maybe-uninitialized] pci = virPCIDeviceNew(domain, bus, slot, function); ^ qemu/qemu_driver.c:10693:9: error: 'bus' may be used uninitialized in this function [-Werror=maybe-uninitialized] qemu/qemu_driver.c:10693:9: error: 'slot' may be used uninitialized in this function [-Werror=maybe-uninitialized] qemu/qemu_driver.c:10693:9: error: 'function' may be used uninitialized in this function [-Werror=maybe-uninitialized] Since the other functions qemuNodeDeviceReAttach and qemuNodeDeviceReset looks exactly the same, I've initialized the variables there as well. However, I am still wondering why those functions don't matter to gcc while the first one does.	2013-07-02 12:39:14 +02:00
Peter Krempa	cbba3268eb	qemu: Improve info message and remove a variable in qemuDomainManagedSave Mention the domain name that is being saved and remove the unneeded variable that only stores a constant.	2013-07-02 09:53:19 +02:00
Ján Tomko	c34107dfd3	qemu: fix return value of qemuDomainBlockPivot on errors If qemuMonitorBlockJob returned 0, qemuDomainBlockPivot might return 0 even if an error occured. https://bugzilla.redhat.com/show_bug.cgi?id=977678	2013-07-02 07:51:51 +02:00
Ján Tomko	87bbf83f99	qemu: indentation fix	2013-07-01 17:41:22 +02:00
Michal Novotny	ff96888991	qemu: Implement CPUs check against machine type's cpu-max Implement check whether (maximum) vCPUs doesn't exceed machine type's cpu-max settings. On older versions of QEMU the check is disabled. Signed-off-by: Michal Novotny <minovotn@redhat.com>	2013-07-01 14:30:42 +02:00
Laine Stump	a47b9e879c	qemu: fix infinite loop in OOM error path A loop in qemuPrepareHostdevPCIDevices() intended to cycle through all the objects on the list pcidevs was doing "while (listcount > 0)", but nothing in the body of the loop was reducing the size of the list - it was instead removing items from a different list. It has now been safely changed to a for() loop.	2013-06-25 18:24:56 -04:00
Laine Stump	b2a2d00f57	pci: fix dangling pointer in qemuDomainReAttachHostdevDevices (This isn't as bad as it sounds - it's only a problem in case of an OOM error.) qemuGetActivePciHostDeviceList() had been creating a list that contained pointers to objects that were also on the activePciHostdevs list. In case of an OOM error, this newly created list would be virObjectUnref'ed, which would cause everything on the list to be freed. But all of those objects would still be on the activePciHostdevs list, which could have very bad consequences if that list was ever again accessed. The solution used here is to populate the new list with copies of the objects from the original list. It turns out that on return from qemuGetActivePciHostDeviceList(), the caller would almost immediately go through all the device objects and "steal" them (i.e. remove the pointer from the list but not delete it) all from either one list or the other; we now instead just delete (remove from the list and free) each device from one list or the other, so in the end we have the same state.	2013-06-25 18:24:50 -04:00
Laine Stump	1d829e1306	pci: rename virPCIDeviceGetVFIOGroupDev to virPCIDeviceGetIOMMUGroupDev I realized after the fact that it's probably better in the long run to give this function a name that matches the name of the link used in sysfs to hold the group (iommu_group). I'm changing it now because I'm about to add several more functions that deal with iommu groups.	2013-06-25 18:07:38 -04:00
Laine Stump	ee1d1f3b54	pci: eliminate unused driver arg from virPCIDeviceDetach The driver arg to virPCIDeviceDetach is no longer used (the name of the stub driver is now set in the virPCIDevice object, and virPCIDeviceDetach retrieves it from there). Remove it.	2013-06-25 18:03:52 -04:00
Jiri Denemark	d2664daf1b	qemu: Implement support for VIR_MIGRATE_PARAM_GRAPHICS_URI	2013-06-25 16:41:58 +02:00
Jiri Denemark	35461438cb	Implement extensible migration APIs in qemu driver	2013-06-25 16:41:58 +02:00
Jiri Denemark	1004d6323a	qemu: Move internals of Confirm phase to qemu_migration.c	2013-06-25 16:41:57 +02:00
Jiri Denemark	ecd811310c	qemu: Move common parts of Prepare phase to qemu_migration.c	2013-06-25 16:41:57 +02:00
Jiri Denemark	d3ce7363f3	qemu: Move internals of Begin phase to qemu_migration.c	2013-06-25 16:41:57 +02:00
Laine Stump	1eeab6e6de	qemu: don't reset PCI devices being assigned with VFIO I just learned that VFIO resets PCI devices when they are assigned to guests / returned to the host, so it is redundant for libvirt to reset the devices. This patch inhibits calling virPCIDeviceReset to devices that will be/were assigned using VFIO.	2013-06-24 23:07:07 -04:00
Jiri Denemark	c40ed4168a	Rename virTypedParameterArrayValidate as virTypedParamsValidate	2013-06-25 00:38:24 +02:00
Laine Stump	9b4a666608	pci: make virPCIDeviceDetach consistent in behavior virPCIDeviceDetach would previously sometimes consume the input device object (to put it on the inactive list) and sometimes not. Avoiding memory leaks required checking beforehand to see if the device was already on the list, and freeing the device object in the caller only if there wasn't already an identical object on the inactive list. This patch makes it consistent - virPCIDeviceDetach will never consume the input virPCIDevice object; if it needs to put one on the inactive list, it will create a copy and put that on the list. This way the caller knows that it is always their responsibility to free the device object they created.	2013-06-24 17:35:13 -04:00
Laine Stump	53e52b4ac3	pci: change stubDriver from const char* to char* Previously stubDriver was always set from a string literal, so it was okay to use a const char * that wasn't freed when the virPCIDevice was freed. This will not be the case in the near future, so it is now a char* that is allocated in virPCIDeviceSetStubDriver() and freed during virPCIDeviceFree().	2013-06-24 17:33:29 -04:00
Daniel P. Berrange	abf75aea24	Add ACL checks into the QEMU driver Insert calls to the ACL checking APIs in all QEMU driver entrypoints. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-06-24 15:25:43 +01:00
Ján Tomko	d3c8788492	qemu: check if block I/O limits fit into long long We can only pass values up to LLONG_MAX through JSON and QEMU checks if the int64_t number is not negative at startup since 1.5.0. https://bugzilla.redhat.com/show_bug.cgi?id=974010	2013-06-24 14:18:14 +02:00
Ján Tomko	19f75d5eeb	qemu: add hv_vapic and hv_spinlocks support XML: <features> <hyperv> <vapic state='on'/> <spinlocks state='on' retries='4096'/> </hyperv> </features> results in the following QEMU command line: qemu -cpu <cpu_model>,hv_vapic,hv_spinlocks=0x1000 https://bugzilla.redhat.com/show_bug.cgi?id=784836	2013-06-21 13:24:44 +02:00
Ján Tomko	800b51d7b0	conf: add vapic and spinlocks to hyperv features Add new CPU features for HyperV: vapic for virtual APIC support spinlocks for setting spinlock support <features> <hyperv> <vapic state='on'/> <spinlocks state='on' retries='4096'/> </hyperv> </features> https://bugzilla.redhat.com/show_bug.cgi?id=784836	2013-06-21 12:33:46 +02:00
Jiri Denemark	adb7b0b562	qemu: Make probing for commands declarative	2013-06-21 09:32:42 +02:00
Jiri Denemark	61a2841493	qemu: Make probing for events declarative	2013-06-21 09:32:42 +02:00
Jim Fehlig	24d0e67aba	build: Fix build with -Werror Commit `752596b5` broke the build with -Werror qemu/qemu_hotplug.c: In function 'qemuDomainChangeGraphics': qemu/qemu_hotplug.c:1980:39: error: declaration of 'listen' shadows a global declaration [-Werror=shadow] Fix with s/listen/newlisten/	2013-06-20 12:59:19 -06:00
Michal Privoznik	752596b5dd	qemuDomainChangeGraphics: Check listen address change by listen type Currently, we have a bug when updating a graphics device. A graphics device can have a listen address set. This address is either defined by user (in which case it's type is VIR_DOMAIN_GRAPHICS_LISTEN_TYPE_ADDRESS) or it can be inherited from a network (in which case it's type is VIR_DOMAIN_GRAPHICS_LISTEN_TYPE_NETWORK). However, in both cases we have a listen address to process (e.g. during migration, as I've tried to fix in `7f15ebc7`). Later, when a user tries to update the graphics device (e.g. set a password), we check if listen addresses match the original as qemu doesn't know how to change listen address yet. Hence, users are required to not change the listen address. The implementation then just dumps listen addresses and compare them. Previously, while dumping the listen addresses, NULL was returned for NETWORK. After my patch, this is no longer true, and we get a listen address for olddev even if it is a type of NETWORK. So we have a real string on one side, the NULL from user's XML on the other side and hence we think user wants to change the listen address and we refuse it. Therefore, we must take the type of listen address into account as well.	2013-06-20 19:41:53 +02:00
John Ferlan	b237545341	qemu: Resolve issue with GetScheduler APIs for non running domain As a consequence of the cgroup layout changes from commit '632f78ca', the qemuDomainGetSchedulerParameters[Flags]()' and qemuGetSchedulerType() APIs failed to return data for a non running domain. This can be seen through a 'virsh schedinfo <domain>' command which returns: Scheduler : Unknown error: Requested operation is not valid: cgroup CPU controller is not mounted Prior to that change a non running domain would return: Scheduler : posix cpu_shares : 0 vcpu_period : 0 vcpu_quota : 0 emulator_period: 0 emulator_quota : 0 This patch will restore the capability to return configuration only data for a non running domain regardless of whether cgroups are available.	2013-06-19 15:01:48 -04:00
Peter Krempa	5379bb0f33	migration: Don't propagate VIR_MIGRATE_ABORT_ON_ERROR This flag is meant for errors happening on the source of the migration and isn't used on the destination. To allow better migration compatibility, don't propagate it to the destination.	2013-06-18 14:52:26 +02:00
Peter Krempa	cf6d56ac43	migration: Make erroring out on I/O error controllable by flag Paolo Bonzini pointed out that it's actually possible to migrate a qemu instance that was paused due to I/O error and it will be able to work on the destination if the storage is accessible. This patch introduces flag VIR_MIGRATE_ABORT_ON_ERROR that cancels the migration in case an I/O error happens while it's being performed and allows migration without this flag. This flag can be possibly used for other error reasons that may be introduced in the future.	2013-06-18 14:52:26 +02:00
Jiri Denemark	ddf8ad82eb	qemu: Avoid leaking uri in qemuMigrationPrepareDirect	2013-06-18 14:49:20 +02:00
Michal Privoznik	9da7b11bcd	qemu_migration: Move waiting for SPICE migration Currently, we wait for SPICE to migrate in the very same loop where we wait for qemu to migrate. This has a disadvantage of slowing seamless migration down. One one hand, we should not kill the domain until all SPICE data has been migrated. On the other hand, there is no need to wait in the very same loop and hence slowing down 'cont' on the destination. For instance, if users are watching a movie, they can experience the movie to be stopped for a couple of seconds, as processors are not running nor on src nor on dst as libvirt waits for SPICE to migrate. We should move the waiting phase to migration CONFIRM phase.	2013-06-18 14:32:52 +02:00
Guannan Ren	0ad9025ef4	qemu: set QEMU_CAPS_DEVICE_VIDEO_PRIMARY cap flag in QMP detection When qemu >= 1.20, it is safe to use -device for primary video device as described in `4c993d8ab`. So, we are missing the cap flag in QMP capabilities detection, this flag can be initialized safely in virQEMUCapsInitQMPBasic.	2013-06-18 16:57:48 +08:00
Ján Tomko	07966f6a8b	qemu: allow restore with non-migratable XML input Convert input XML to migratable before using it in qemuDomainSaveImageOpen. XML in the save image is migratable, i.e. doesn't contain implicit controllers. If these controllers were in a non-default order in the input XML, the ABI check would fail. Removing and re-adding these controllers fixes it. https://bugzilla.redhat.com/show_bug.cgi?id=834196	2013-06-13 16:58:30 +02:00
Peter Krempa	5f719f217e	qemu: Forbid migration of machines with I/O errors Such machine can't be successuflly migrated unles the I/O error has recovered and might lead to data corruption. Forbid this kind of migration.	2013-06-11 14:52:26 +02:00
Peter Krempa	caa467db62	qemu: Cancel migration if guest encoutners I/O error while migrating During a live migration the guest may receive a disk access I/O error. In this state the guest is unable to continue running on a remote host after migration as some state may be present in the kernel and not migrated. With this patch, the migration is canceled in such case so it can either continue on the source if the I/O issues are recovered or has to be destroyed anyways.	2013-06-11 14:52:26 +02:00
Michal Privoznik	6546017c50	qemu_migrate: Dispose listen address if set from config https://bugzilla.redhat.com/show_bug.cgi?id=971485 As of `d7f9d82753` we copy the listen address from the qemu.conf config file in case none has been provided via XML. But later, when migrating, we should not include such listen address in the migratable XML as it is something autogenerated, not requested by user. Moreover, the binding to the listen address will likely fail, unless the address is '0.0.0.0' or its IPv6 equivalent. This patch introduces a new boolean attribute to virDomainGraphicsListenDef to distinguish autofilled listen addresses. However, we must keep the attribute over libvirtd restarts, so it must be kept within status XML.	2013-06-11 14:11:46 +02:00
Jiri Denemark	9313a6a7fc	qemu: Fix memory leak in Prepare phase Avoid leaking virDomainDef if Prepare phase fails before it gets to qemuMigrationPrepareAny.	2013-06-11 13:27:52 +02:00
Peter Krempa	c2093b2aba	Fix commit `29c1e913e4` This patch fixes changes done in commit `29c1e913e4` that was pushed without implementing review feedback. The flag introduced by the patch is changed to VIR_DOMAIN_VCPU_GUEST and documentation makes the difference between regular hotplug and this new functionality more explicit. The virsh options that enable the use of the new flag are changed to "--guest" and the documentation is fixed too.	2013-06-10 09:52:49 +02:00
Michal Privoznik	cdd823c073	qemuDomainGetVcpusFlags: Initialize ncpuinfo Currently, there's a path to use the ncpuinfo variable uninitialized, which leads to a compiler warning: qemu/qemu_driver.c: In function 'qemuDomainGetVcpusFlags': qemu/qemu_driver.c:4573:9: error: 'ncpuinfo' may be used uninitialized in this function [-Werror=maybe-uninitialized] for (i = 0; i < ncpuinfo; i++) { ^	2013-06-07 16:42:24 +02:00
Peter Krempa	c12b2be516	qemu: Implement new QMP command for cpu hotplug This patch implements support for the "cpu-add" QMP command that plugs CPUs into a live guest. The "cpu-add" command was introduced in QEMU 1.5. For the hotplug to work machine type "pc-i440fx-1.5" is required.	2013-06-07 16:19:20 +02:00
Peter Krempa	d47eff88fe	qemu: Implement support for VIR_DOMAIN_VCPU_AGENT in qemuDomainSetVcpusFlags This patch adds support for agent-based cpu disabling and enabling to qemuDomainSetVcpusFlags() API.	2013-06-07 15:58:25 +02:00
Peter Krempa	c6afcb052c	qemu: Implement request of vCPU state using the guest agent This patch implements the VIR_DOMAIN_VCPU_AGENT flag for the qemuDomainGetVcpusFlags() libvirt API implementation.	2013-06-07 15:58:25 +02:00
Peter Krempa	3099c063e3	qemu_agent: Introduce helpers for agent based CPU hot(un)plug The qemu guest agent allows to online and offline CPUs from the perspective of the guest. This patch adds helpers that call 'guest-get-vcpus' and 'guest-set-vcpus' guest agent functions and convert the data for internal libvirt usage.	2013-06-07 15:58:24 +02:00
Peter Krempa	82e119f5cd	qemu: Use bool instead of int in qemuMonitorSetCPU APIs The 'online' parameter has only two possible values. Use a bool for it.	2013-06-07 15:57:03 +02:00
Michal Privoznik	b72ba1da36	qemuDomainMigrateGraphicsRelocate: Use then new virSocketAddrIsWildcard Since we have the new internal API to check for wildcard address, we can use it instead of parsing and formatting.	2013-06-07 15:27:17 +02:00
Osier Yang	e31b5cf393	qemu: Report the offset from host UTC for RTC_CHANGE event https://bugzilla.redhat.com/show_bug.cgi?id=964177 Though both libvirt and QEMU's document say RTC_CHANGE returns the offset from the host UTC, qemu actually returns the offset from the specified date instead when specific date is provided (-rtc base=$date). It's not safe for qemu to fix it in code, it worked like that for 3 years, changing it now may break other QEMU use cases. What qemu tries to do is to fix the document: http://lists.gnu.org/archive/html/qemu-devel/2013-05/msg04782.html And in libvirt side, instead of replying on the value from qemu, this converts the offset returned from qemu to the offset from host UTC, by: /* * a: the offset from qemu RTC_CHANGE event * b: The specified date (-rtc base=$date) * c: the host date when libvirt gets the RTC_CHANGE event * offset: What libvirt will report */ offset = a + (b - c); The specified date (-rtc base=$date) is recorded in clock's def as an internal only member (may be useful to exposed outside?). Internal only XML tag "basetime" is introduced to not lose the guest's basetime after libvirt restarting/reloading: <clock offset='variable' adjustment='304' basis='utc' basetime='1370423588'/>	2013-06-07 14:45:08 +08:00
Ján Tomko	d60570b315	qemu: simplify CPU command line parsing Use virStringSplit. Change the 'error' label to 'cleanup' to prevent memory leaks on error.	2013-06-06 17:30:08 +02:00
Ján Tomko	5debc7224a	qemu: change two-state int parameters to bool	2013-06-06 17:22:53 +02:00
Ján Tomko	85f9178160	Remove redundant two-state integers	2013-06-06 17:22:53 +02:00
Ján Tomko	e557766c3b	Replace two-state local integers with bool Found with 'git grep "= 1"'.	2013-06-06 17:22:53 +02:00
Michal Privoznik	e5fa9db17e	qemu: Reformat listen address prior to checking Currently, a listen address for a SPICE server can be specified. Later, when the domain is migrated, we need to relocate the graphics which involves telling new destination to the SPICE server. However, we can't just assume the listen address is the new location, because the listen address can be ANYCAST (0.0.0.0 for IPv4, :: for IPv6). In which case, we want to pass the remote hostname. But there are some troubles with ANYCAST. In both IPv4 and IPv6 it has many ways for specifying such address. For instance, in IPv4: 0, 0.0, 0.0.0, 0.0.0.0. The number of variations gets bigger in IPv6 world. Hence, in order to check for ANYCAST address sanely, we should take the provided listen address, parse it and format back in it's full form. Which is exactly what this patch does.	2013-06-06 08:31:09 +02:00
Eric Blake	1add9c78da	maint: don't use config.h in .h files Enforce the rule that .h files don't need to (redundantly) include <config.h>. * cfg.mk (sc_prohibit_config_h_in_headers): New rule. (_virsh_includes): Delete; instead, inline a smaller number of exclusions... (exclude_file_name_regexp--sc_require_config_h) (exclude_file_name_regexp--sc_require_config_h_first): ...here. * daemon/libvirtd.h (includes): Fix offenders. * src/driver.h (includes): Likewise. * src/gnutls_1_0_compat.h (includes): Likewise. * src/libxl/libxl_conf.h (includes): Likewise. * src/libxl/libxl_driver.h (includes): Likewise. * src/lxc/lxc_conf.h (includes): Likewise. * src/lxc/lxc_driver.h (includes): Likewise. * src/lxc/lxc_fuse.h (includes): Likewise. * src/network/bridge_driver.h (includes): Likewise. * src/phyp/phyp_driver.h (includes): Likewise. * src/qemu/qemu_conf.h (includes): Likewise. * src/util/virnetlink.h (includes): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-06-05 05:53:25 -06:00
Osier Yang	8da9516a84	qemu: Abstract code for the cpu controller setting into a helper	2013-06-05 19:25:48 +08:00
Guannan Ren	ed91e32b08	snapshot: remove mutually exclusive memory and disk-only duplicate check The work was done at the time of snapshot xmlstring parsing if (offline && def->memory && def->memory != VIR_DOMAIN_SNAPSHOT_LOCATION_NONE) { virReportError(...); }	2013-06-05 10:37:45 +08:00
Peter Krempa	6e5b36d5d2	qemu: Properly report guest agent errors on command passthrough The code for arbitrary guest agent passthrough was horribly broken since introduction. Fix it to correctly report errors.	2013-06-03 17:25:27 +02:00
Laine Stump	2ea45647bc	qemu: prevent termination of guests w/hostdev on driver reconnect This should resolve: https://bugzilla.redhat.com/show_bug.cgi?id=959191 The problem was that qemuUpdateActivePciHostdevs was returning 0 (success) when no hostdevs were present, but would otherwise return -1 (failure) even when it completed successfully. It is only called from qemuProcessReconnect(), and when qemuProcessReconnect got back an error, it would not only stop reconnecting, but would terminate the guest qemu process "to remove danger of it ending up running twice if user tries to start it again later". (This bug was introduced in commit `011cf7ad`, which was pushed between v1.0.2 and v1.0.3, so all maintenance branches from v1.0.3 up to 1.0.5 will need this one line patch applied.)	2013-05-31 14:57:55 -04:00
Ján Tomko	2136327e23	qemu: escape literal IPv6 address in NBD migration A literal IPv6 must be escaped, otherwise migration fails with: unable to execute QEMU command 'drive-mirror': address resolution failed for f0::0d:5901: Servname not supported for ai_socktype since QEMU treats everything after the first ':' as the port.	2013-05-31 17:21:10 +02:00
Peter Krempa	177046753f	qemu: snapshot: Don't kill access to disk if snapshot creation fails If snapshot creation failed for example due to invalid use of the "REUSE_EXTERNAL" flag, libvirt killed access to the original image file instead of the new image file. On machines with selinux this kills the whole VM as the selinux context is enforced immediately. * qemu_driver.c:qemuDomainSnapshotUndoSingleDiskActive(): - Kill access to the new image file instead of the old one. Partially resolves: https://bugzilla.redhat.com/show_bug.cgi?id=906639	2013-05-31 15:41:59 +02:00
Peter Krempa	6c23d60961	qemu: Fix damaged whitespace After deleting "WithDriver" from the async job function the code was unaligned.	2013-05-31 15:35:37 +02:00
Eric Blake	9fda950f5c	build: work around cygwin header bug A bug in Cygwin [1] and poor error messages from gcc [2] lead to this confusing compilation error: qemu/qemu_monitor.c:418:9: error: passing argument 2 of 'sendmsg' from incmpatible pointer type /usr/include/sys/socket.h:42:11: note: expected 'const struct msghdr ' but argument is of type 'struct msghdr ' [1] http://cygwin.com/ml/cygwin/2013-05/msg00451.html [2] http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57475 * src/qemu/qemu_monitor.c (includes): Include <sys/socket.h> before <sys/un.h>. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-30 14:51:58 -06:00
Eric Blake	f43bb1dc20	build: cast [ug]id_t when printing This is a recurring problem for cygwin :) For example, see commit `23a4df88`. qemu/qemu_driver.c: In function 'qemuStateInitialize': qemu/qemu_driver.c:691:13: error: format '%d' expects type 'int', but argument 8 has type 'uid_t' [-Wformat] * src/qemu/qemu_driver.c (qemuStateInitialize): Add casts. * daemon/remote.c (remoteDispatchAuthList): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-30 10:36:16 -06:00
Eric Blake	19a7f9fffb	build: port qemu to cygwin A cygwin build of the qemu driver fails with: qemu/qemu_process.c: In function 'qemuPrepareCpumap': qemu/qemu_process.c:1803:31: error: 'CPU_SETSIZE' undeclared (first use in this function) CPU_SETSIZE is a Linux extension in <sched.h>; a bit more portable is using sysconf if _SC_NPROCESSORS_CONF is defined (several platforms have it, including Cygwin). Ultimately, I would have preferred to use gnulib's 'nproc' module, but it is currently under an incompatible license. * src/qemu/qemu_conf.h (QEMUD_CPUMASK_LEN): Provide definition on cygwin. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-30 06:39:27 -06:00
Cole Robinson	98bbda00cf	qemu: migration: error if tunnelled + storage specified Since as the code indicates it doesn't work yet, so let's be explicit about it.	2013-05-29 12:31:00 -04:00
Cole Robinson	5751fc4f4e	qemu: migration: Improve p2p error if we can't open conn By actually showing the Open() error to the user	2013-05-29 12:31:00 -04:00
Michal Privoznik	d10cfaec3b	qemuOpenVhostNet: Decrease vhostfdSize on open failure Currently, if there's an error opening /dev/vhost-net (e.g. because it doesn't exist) but it's not required we proceed with vhostfd array filled with -1 and vhostfdSize unchanged. Later, when constructing the qemu command line only non-negative items within vhostfd array are taken into account. This means, vhostfdSize may be greater than the actual count of non-negative items in vhostfd array. This results in improper command line arguments being generated, e.g.: -netdev tap,fd=21,id=hostnet0,vhost=on,vhostfd=(null)	2013-05-29 09:20:04 +02:00
Cole Robinson	406d8a9809	qemu: Don't report error on successful media eject If we are just ejecting media, ret == -1 even after the retry loop determines that the tray is open, as requested. This means media disconnect always report's error. Fix it, and fix some other mini issues: - Don't overwrite the 'eject' error message if the retry loop fails - Move the retries decrement inside the loop, otherwise the final loop might succeed, yet retries == 0 and we will raise error - Setting ret = -1 in the disk->src check is unneeded - Fix comment typos cc: mprivozn@redhat.com	2013-05-28 11:45:19 -04:00
Jiri Denemark	c6f2523fb1	qemu: Fix build without gnutls "error" label in qemuMigrationCookieGraphicsAlloc is now used unconditionally thanks to VIR_STRDUP.	2013-05-27 10:19:36 +02:00
Sergey Fionov	2697c8a116	qemu: save domain state to XML after reboot Currently qemuDomainReboot() does reboot in two phases: qemuMonitorSystemPowerdown() and qemuProcessFakeReboot(). qemuMonitorSystemPowerdown() shutdowns the domain and saves domain state/reason as VIR_DOMAIN_SHUTDOWN_UNKNOWN. qemuProcessFakeReboot() sets domain state/reason to VIR_DOMAIN_RESUMED_UNPAUSED but does not save domain state changes. Subsequent restart of libvirtd leads to restoring domain state/reason to saved that is VIR_DOMAIN_SHUTDOWN_UNKNOWN and to automatic shutdown of the domain. This commit adds virDomainSaveStatus() into qemuProcessFakeReboot() to avoid unexpected shutdowns.	2013-05-24 15:29:22 -06:00
Michal Privoznik	0fc5d09cbb	Adapt to new VIR_STRNDUP behavior With previous patch, we accept negative value as length of string to duplicate. So there is no need to pass strlen(src) in case we want to do duplicate the whole string.	2013-05-24 17:00:39 +02:00
Martin Kletzander	5af3ce8277	Fix blkdeviotune for shutoff domain Function qemuDomainSetBlockIoTune() was checking QEMU capabilities even when !(flags & VIR_DOMAIN_AFFECT_LIVE) and the domain was shutoff, resulting in the following problem: virsh # domstate asdf; blkdeviotune asdf vda --write-bytes-sec 100 shut off error: Unable to change block I/O throttle error: unsupported configuration: block I/O throttling not supported with this QEMU binary Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=965016	2013-05-24 13:29:20 +02:00
Ján Tomko	2326006410	qemu: fix NBD migration to hosts with IPv6 enabled Since `f03dcc5` we use [::] as the listening address both on qemu command line in -incoming and in nbd-server-start QMP command. However the latter requires just :: without the braces.	2013-05-23 17:55:34 +02:00
Michal Privoznik	a88fb3009f	Adapt to VIR_STRDUP and VIR_STRNDUP in src/qemu/*	2013-05-23 09:56:38 +02:00
Michal Privoznik	03eb06632a	qemu: Enable multiqueue network	2013-05-22 17:34:02 +02:00
Michal Privoznik	1f24f68225	qemu: Adapt qemuBuildInterfaceCommandLine to to multiqueue net In order to learn libvirt multiqueue several things must be done: 1) The '/dev/net/tun' device needs to be opened multiple times with IFF_MULTI_QUEUE flag passed to ioctl(fd, TUNSETIFF, &ifr); 2) Similarly, '/dev/vhost-net' must be opened as many times as in 1) in order to keep 1:1 ratio recommended by qemu and kernel folks. 3) The command line construction code needs to switch from 'fd=X' to 'fds=X:Y:...:Z' and from 'vhostfd=X' to 'vhostfds=X:Y:...:Z'. 4) The monitor handling code needs to learn to pass multiple FDs.	2013-05-22 17:24:27 +02:00
Michal Privoznik	565c07f171	qemu: Move interface cmd line construction into a separate function Currently, we have one huge function to construct qemu command line. This is very ineffective esp. if there's a fault somewhere.	2013-05-22 17:05:36 +02:00
Guannan Ren	3c53984412	qemu: add ', share=<policy>' to qemu commandline example: qemu ${otherargs} \ -vnc 127.0.0.1:0,share=allow-exclusive	2013-05-22 19:18:48 +08:00
Guannan Ren	d377d02dc4	qemu: new vnc display sharing policy caps flag QEMU_CAPS_VNC_SHARE_POLICY (qemu >= 1.1)	2013-05-22 19:18:37 +08:00
Osier Yang	66194f71df	src/qemu: Remove the whitespace before ';'	2013-05-21 23:41:44 +08:00
Osier Yang	58f8e0cd58	qemu: Don't remove the "return 0" Commit `f60a50c795` intended to remove the warning only, but not with the "return 0" together.	2013-05-21 23:08:57 +08:00
Guannan Ren	ceae74608c	qemu: fix a typo in qemuAddSharedDevice	2013-05-21 18:38:57 +08:00
Michal Privoznik	543af79a14	qemuDomainChangeEjectableMedia: Unlock domain while waiting for event In `84c59ffa` I've tried to fix changing ejectable media process. The process should go like this: 1) we need to call 'eject' on the monitor 2) we should wait for 'DEVICE_TRAY_MOVED' event 3) now we can issue 'change' command However, while waiting in step 2) the domain monitor was locked. So even if qemu reported the desired event, the proper callback was not called immediately. The monitor handling code needs to lock the monitor in order to read the event. So that's the first lock we must not hold while waiting. The second one is the domain lock. When monitor handling code reads an event, the appropriate callback is called then. The first thing that each callback does is locking the corresponding domain as a domain or its device is about to change state. So we need to unlock both monitor and VM lock. Well, holding any lock while sleep()-ing is not the best thing to do anyway.	2013-05-21 10:42:21 +02:00
Osier Yang	3a6204cbbd	qemu: Add callback struct for qemuBuildCommandLine Since `0d70656afd`, it starts to access the sysfs files to build the qemu command line (by virSCSIDeviceGetSgName, which is to find out the scsi generic device name by adpater🚌target:unit), there is no way to work around, qemu wants to see the scsi generic device like "/dev/sg6" anyway. And there might be other places which need to access sysfs files when building qemu command line in future. Instead of increasing the arguments of qemuBuildCommandLine, this introduces a new callback for qemuBuildCommandLine, and thus tests can register their own callbacks for sysfs test input files accessing. * src/qemu/qemu_command.h: (New callback struct qemuBuildCommandLineCallbacks; extern buildCommandLineCallbacks) * src/qemu/qemu_command.c: (wire up the callback struct) * src/qemu/qemu_driver.c: (Use the new syntax of qemuBuildCommandLine) * src/qemu/qemu_hotplug.c: Likewise * src/qemu/qemu_process.c: Likewise * tests/testutilsqemu.[ch]: (Helper testSCSIDeviceGetSgName; callback struct testCallbacks;) * tests/qemuxml2argvtest.c: (Use testCallbacks) * src/tests/qemuxmlnstest.c: (Like above)	2013-05-20 20:14:19 +08:00
Osier Yang	479d5991cd	qemu: Abstract code for cpuset controller setting into a helper	2013-05-20 19:57:00 +08:00
Osier Yang	9f2455d359	qemu: Abstract code for devices controller setting into a helper	2013-05-20 19:52:35 +08:00
Osier Yang	f60a50c795	qemu: Abstract code for memory controller setting into a helper	2013-05-20 19:39:54 +08:00
Osier Yang	2fd16df7b5	qemu: Abstract the code for blkio controller setting into a helper	2013-05-20 19:24:45 +08:00
Guannan Ren	6459af6a43	qemu: report useful error failling to destroy domain gracefully Resolves:https://bugzilla.redhat.com/show_bug.cgi?id=927620 #kill -STOP `pidof qemu-kvm` #virsh destroy $guest --graceful error: Failed to destroy domain testVM error: An error occurred, but the cause is unknown With --graceful, SIGTERM always is emitted to kill driver process, but it won't success till burning out waiting time in case of process being stopped. But domain destroy without --graceful can work, SIGKILL will be emitted to the stopped process after 10 secs which always kills a process even one that is currently stopped. So report an error after burning out waiting time in this case.	2013-05-17 22:22:46 +08:00
Osier Yang	6aa4fc656d	qemu: Check conflicts for shared scsi host device Just like previous patches, this changes qemuCheckSharedDisk into qemuCheckSharedDevice, which takes a virDomainDeviceDefPtr argument instead.	2013-05-17 19:26:33 +08:00
Daniel P. Berrange	c2cf5f1c2a	Fix failure to detect missing cgroup partitions Change `bbe97ae968` caused the QEMU driver to ignore ENOENT errors from cgroups, in order to cope with missing /proc/cgroups. This is not good though because many other things can cause ENOENT and should not be ignored. The callers expect to see ENXIO when cgroups are not present, so adjust the code to report that errno when /proc/cgroups is missing Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-17 10:25:15 +01:00
Jiri Denemark	fd74f74fe6	qemu: Implement support for locking domain's memory pages	2013-05-16 23:21:58 +02:00
Martin Kletzander	0471637d56	qemu: Fix cgroup handling when setting VCPU BW Commit `632f78c` introduced a regression which causes schedinfo being unable to set some parameters. When migrating to priv->cgroup there was missing variable left out and due to passed NULL to underlying function, the setting failed. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=963592	2013-05-16 22:13:29 +02:00
Osier Yang	a842df78ea	qemu: Set unpriv_sgio for scsi host device	2013-05-17 01:00:01 +08:00
Osier Yang	0453bcdfc3	qemu: Refactor qemuSetUnprivSGIO to support scsi host device Just like what previous patches do, it refactors qemuSetUnprivSGIO to take the virDomainDeviceDefPtr as argument instead.	2013-05-17 00:57:01 +08:00
Osier Yang	99fdd434bc	qemu: Move qemuSetUnprivSGIO into qemu_conf.c unpriv_sgio setting is tight with the shared device helpers, let's put them together in qemu_conf.c	2013-05-17 00:51:58 +08:00
Osier Yang	ead4391562	Rename virDomainDiskSGIO to virDomainDeviceSGIO SCSI host device will also support "sgio", and perhaps we could use "sgio" in other places too in future, renaming the enum to reuse.	2013-05-17 00:43:38 +08:00
Osier Yang	1d94b3e760	qemu: Manage shared device entry for scsi host device This adds the shared device entry when starting domain (more exactly, when preparing host devices), and remove the entry when destroying domain (when reattaching host devices).	2013-05-17 00:34:29 +08:00
Osier Yang	aeda1ff12d	qemu: Refactor the helpers to track shared scsi host device This changes the helpers qemu{Add,Remove}SharedDisk into qemu{Add,Remove}SharedDevice, as most of the code in the helpers can be reused for scsi host device. To track the shared scsi host device, first it finds out the device path (e.g. /dev/s[dr]) which is mapped to the sg device, and use device ID of the found device path (/dev/s[dr]) as the hash key. This is because of the device ID is not unique between between /dev/s[dr]* and /dev/sg*, e.g. % sg_map /dev/sg0 /dev/sda /dev/sg1 /dev/sr0 % ls -l /dev/sda brw-rw----. 1 root disk 8, 0 May 2 19:26 /dev/sda %ls -l /dev/sg0 crw-rw----. 1 root disk 21, 0 May 2 19:26 /dev/sg0	2013-05-17 00:32:09 +08:00
Osier Yang	539d0e19fd	qemu: Rename qemu_driver->sharedDisks to qemu_driver->sharedDevices "Shared disk" is not only the thing we should care about after "scsi hostdev" is introduced. A same scsi device can be used as "disk" for one domain, and as "scsi hostdev" for another domain at the same time. That's why this patch renames qemu_driver->sharedDisks. Related functions and structs are also renamed.	2013-05-16 23:48:27 +08:00
Viktor Mihajlovski	9684bb11fd	qemu: Fix crash in migration of graphics-less guests. Commit `7f15ebc7a2` introduced a bug happening when guests without a <graphics> element are migrated. The initialization of listenAddress happens unconditionally from the cookie even if the cookie->graphics pointer was NULL. Moved the initialization to where it is safe. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-05-16 15:48:34 +02:00
Osier Yang	a7c4202cdd	qemu: Support discard for disk QEMU introduced "discard" option for drive since commit a9384aff53, <...> @var{discard} is one of "ignore" (or "off") or "unmap" (or "on") and controls whether @dfn{discard} (also known as @dfn{trim} or @dfn{unmap}) requests are ignored or passed to the filesystem. Some machine types may not support discard requests. </...> This patch exposes the support in libvirt. QEMU supported "discard" for "-drive" since v1.5.0-rc0: % git tag --contains a9384aff53 contains v1.5.0-rc0 v1.5.0-rc1 So this only detects the capability bit using virQEMUCapsProbeQMPCommandLine.	2013-05-15 19:01:00 +08:00
John Ferlan	efdcc92faa	Handle the domain event 'on_reboot' and 'on_poweroff' settings	2013-05-15 06:25:41 -04:00
John Ferlan	0e034efaf9	Adjust usage of qemu -no-reboot and -no-shutdown options During building of the qemu command line determine whether to add/use the '-no-reboot' option only if each of the 'on' events want to to destroy the domain; otherwise, use the '-no-shutdown' option. Prior to this change both could be on the command line, which while allowed could be construed as a conflict.	2013-05-15 06:19:32 -04:00
Martin Kletzander	85ec7ff6fd	qemu: Add VNC WebSocket support Adding a VNC WebSocket support for QEMU driver. This functionality is in upstream qemu from commit described as v1.3.0-982-g7536ee4, so the capability is being recognized based on QEMU version for now.	2013-05-15 09:48:05 +02:00
Osier Yang	77b54b9661	qemu: New XML to disable memory merge at guest startup QEMU introduced command line "-mem-merge=on\|off" (defaults to on) to enable/disable the memory merge (KSM) at guest startup. This exposes it by new XML: <memoryBacking> <nosharepages/> </memoryBacking> The XML tag is same with what we used internally for old RHEL.	2013-05-15 11:25:45 +08:00
Eric Blake	d12bbd6a7d	qemu: detect -machine mem-merge capability * src/qemu/qemu_capabilities.h: New capability bit. * src/qemu/qemu_capabilities.c (virQEMUCapsProbeQMPCommandLine): New function, based on qemuMonitorGetCommandLineOptionParameters, which was introduced by commit bd56d0d813; use it to set new capability bit. (virQEMUCapsInitQMP): Use new function.	2013-05-15 11:25:42 +08:00
Daniel P. Berrange	2a2bc1517a	Forbid use of ':' in RBD pool names The QEMU command line syntax for RBD disks is file=rbd:pool/image:opt1=val1:opt2=val2... There is no way to escape the ':' if it appears in the pool or image name. Thus it must be explicitly forbidden if it occurs in the libvirt XML. People are known to be abusing the lack of escaping in current libvirt to pass arbitrary args to QEMU. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-14 15:02:42 +01:00
Eric Blake	0b923ba3c8	qemu: fix bad free Commit `bd56d0d8` could lead to freeing an uninitialized pointer: qemu/qemu_monitor_json.c: In function 'qemuMonitorJSONGetCommandLineOptionParameters': qemu/qemu_monitor_json.c:4284: warning: 'cmd' may be used uninitialized in this function * src/qemu/qemu_monitor_json.c (qemuMonitorJSONGetCommandLineOptionParameters): Initialize variable. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-13 16:48:55 -06:00
Eric Blake	bd56d0d813	qemu: query command line options in QMP Ever since the conversion to using only QMP for probing features of qemu 1.2 and newer, we have been unable to detect features that are added only by additional command line options. For example, we'd like to know if '-machine mem-merge=on' (added in qemu 1.5) is present. To do this, we will take advantage of qemu 1.5's query-command-line-parameters QMP call [1]. This patch wires up the framework for probing the command results; if the QMP command is missing, or if a particular command line option does not output any parameters (for example, -net uses a polymorphic parser, which showed up as no parameters as of qemu 1.5), we silently treat that command as having no results. [1] https://lists.gnu.org/archive/html/qemu-devel/2013-04/msg05180.html * src/qemu/qemu_monitor.h (qemuMonitorGetOptions) (qemuMonitorSetOptions) (qemuMonitorGetCommandLineOptionParameters): New functions. * src/qemu/qemu_monitor_json.h (qemuMonitorJSONGetCommandLineOptionParameters): Likewise. * src/qemu/qemu_monitor.c (_qemuMonitor): Add cache field. (qemuMonitorDispose): Clean it. (qemuMonitorGetCommandLineOptionParameters): Implement new function. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONGetCommandLineOptionParameters): Likewise. (testQemuMonitorJSONGetCommandLineParameters): Test it. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-13 15:15:54 -06:00
Eric Blake	082274ea41	qemu: simplify string cleanup No need to open code a string list cleanup, if we are nice to the caller by guaranteeing a NULL-terminated result. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONGetCPUDefinitions) (qemuMonitorJSONGetCommands, qemuMonitorJSONGetEvents) (qemuMonitorJSONGetObjectTypes, qemuMonitorJSONGetObjectProps): Use simpler cleanup. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-13 15:15:54 -06:00
Eric Blake	764bb5e5aa	qemu: use bool in monitor struct Follows on the heels of other bool cleanups, such as commit `93002b98`. * src/qemu/qemu_monitor.h (qemuMonitorOpen, qemuMonitorOpenFD): Update json parameter type. * src/qemu/qemu_monitor.c (qemuMonitorOpen, qemuMonitorOpenFD): Likewise. (_qemuMonitor): Adjust field type. * src/qemu/qemu_domain.h (_qemuDomainObjPrivate): Likewise. * src/qemu/qemu_domain.c (qemuDomainObjPrivateXMLParse): Adjust client. * src/qemu/qemu_process.c (qemuProcessStart): Likewise. * tests/qemumonitortestutils.c (qemuMonitorTestNew): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-13 15:15:54 -06:00
Han Cheng	8f76ad9992	qemu: Add hotplug support for scsi host device This adds both attachment and detachment support for scsi host device. Signed-off-by: Han Cheng <hanc.fnst@cn.fujitsu.com> Signed-off-by: Osier Yang <jyang@redhat>	2013-05-14 00:12:42 +08:00
Jim Fehlig	bbe97ae968	Fix starting domains when kernel has no cgroups support Found that I was unable to start existing domains after updating to a kernel with no cgroups support # zgrep CGROUP /proc/config.gz # CONFIG_CGROUPS is not set # virsh start test error: Failed to start domain test error: Unable to initialize /machine cgroup: Cannot allocate memory virCgroupPartitionNeedsEscaping() correctly returns errno (ENOENT) when attempting to open /proc/cgroups on such a system, but it was being dropped in virCgroupSetPartitionSuffix(). Change virCgroupSetPartitionSuffix() to propagate errors returned by its callees. Also check for ENOENT in qemuInitCgroup() when determining if cgroups support is available.	2013-05-13 09:27:46 -06:00
Osier Yang	7d763acaf2	qemu: Refactor helpers for USB device attachment It's better to put the usb related codes into qemuDomainAttachHostUsbDevice instead of qemuDomainAttachHostDevice. And in the old qemuDomainAttachHostDevice, just stealing the "usb" from driver->activeUsbHostdevs leaks the memory.	2013-05-13 21:51:55 +08:00
Han Cheng	ea74c07636	qemu: Introduce activeScsiHostdevs list for scsi host devices Although virtio-scsi supports SCSI PR (Persistent Reservations), the device on host may do not support it. To avoid losing data, Just like PCI and USB pass through devices, only one live guest is allowed per SCSI host pass through device." Signed-off-by: Han Cheng <hanc.fnst@cn.fujitsu.com>	2013-05-13 21:26:06 +08:00
Daniel P. Berrange	13579d4544	Add 'nbd' as a valid filesystem driver type The <filesystem> element can now accept a <driver type='nbd'/> as an alternative to 'loop'. The benefit of NBD is support for non-raw disk image formats. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-13 13:15:19 +01:00
Daniel P. Berrange	ada14b86cc	Add support for storage format in FS <driver> Extend the <driver> element in filesystem devices to allow a storage format to be set. The new attribute uses 'format' to reflect the storage format. This is different from the <driver> element in disk devices which use 'type' to reflect the storage format. This is because the 'type' attribute on filesystem devices is already used for the driver backend, for which the disk devices use the 'name' attribute. Arggggh. Anyway for disks we have <driver name="qemu" type="raw"/> And for filesystems this change means we now have <driver type="loop" format="raw"/> Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-13 13:15:19 +01:00
Han Cheng	6eb42e38e8	qemu: Allow the scsi-generic device in cgroup This adds the scsi-generic device into the device controller's whitelist, so that it's allowed to used by the qemu process. Signed-off-by: Han Cheng <hanc.fnst@cn.fujitsu.com> Signed-off-by: Osier Yang <jyang@redhat.com>	2013-05-13 19:08:34 +08:00
Osier Yang	bab6ee6b30	qemu: Support bootindex for scsi host device	2013-05-13 19:08:32 +08:00
Osier Yang	f4bb7b4807	Introduce <readonly> for hostdev Since it's generic enough to be used by other types in future, I put it in <hostdev> as sub-element, though now it's only used by scsi host device.	2013-05-13 19:02:40 +08:00
Han Cheng	0d70656afd	qemu: Build qemu command line for scsi host device Except the scsi host device's controller is "lsilogic", mapping between the libvirt attributes and scsi-generic properties is: libvirt qemu ----------------------------------------- controller bus ($libvirt_controller.0) bus channel target scsi-id unit lun For scsi host device with "lsilogic" controller, the mapping is: ('target (libvirt)' must be 0, as it's not used; 'unit (libvirt) must <= 7). libvirt qemu ---------------------------------------------------------- controller && bus bus ($libvirt_controller.$libvirt_bus) unit scsi-id It's not good to hardcode/hard-check limits of these attributes, and even worse, these limits are not documented, one has to find out by either testing or reading the qemu code, I'm looking forward to qemu expose limits like these one day). For example, exposing "max_target", "max_lun" for megasas: static const struct SCSIBusInfo megasas_scsi_info = { .tcq = true, .max_target = MFI_MAX_LD, .max_lun = 255, .transfer_data = megasas_xfer_complete, .get_sg_list = megasas_get_sg_list, .complete = megasas_command_complete, .cancel = megasas_command_cancel, }; Example of the qemu command line (lsilogic controller): -drive file=/dev/sg2,if=none,id=drive-hostdev-scsi_host7-0-0-0 \ -device scsi-generic,bus=scsi0.0,scsi-id=8,\ drive=drive-hostdev-scsi_host7-0-0-0,id=hostdev-scsi_host7-0-0-0 Example of the qemu command line (virtio-scsi controller): -drive file=/dev/sg2,if=none,id=drive-hostdev-scsi_host7-0-0-0 \ -device scsi-generic,bus=scsi0.0,channel=0,scsi-id=128,lun=128,\ drive=drive-hostdev-scsi_host7-0-0-0,id=hostdev-scsi_host7-0-0-0 Signed-off-by: Han Cheng <hanc.fnst@cn.fujitsu.com> Signed-off-by: Osier Yang <jyang@redhat.com>	2013-05-13 18:50:16 +08:00
Han Cheng	b238c0bec1	qemu: New cap flags for scsi-generic Adding two cap flags for scsi-generic: QEMU_CAPS_SCSI_GENERIC QEMU_CAPS_SCSI_GENERIC_BOOTINDEX Signed-off-by: Han Cheng <hanc.fnst@cn.fujitsu.com> Signed-off-by: Osier Yang <jyang@redhat.com>	2013-05-13 18:30:26 +08:00
Daniel P. Berrange	f493d83fbd	Cope with missing swap cgroup controls It is possible to build a kernel without swap cgroup controls present. This causes a fatal error when querying memory parameters. Treat missing swap controls as meaning "unlimited". The fatal error remains if the user tries to actually change the limit. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-10 19:57:18 +01:00
Laine Stump	a2c1bedbd8	util: fix virFileOpenAs return value and resulting error logs This resolves: https://bugzilla.redhat.com/show_bug.cgi?id=851411 https://bugzilla.redhat.com/show_bug.cgi?id=955500 The first problem was that virFileOpenAs was returning fd (-1) in one of the error cases rather than ret (-errno), so the caller thought that the error was EPERM rather than ENOENT. The second problem was that some log messages in the general purpose qemuOpenFile() function would always say "Failed to create" even if the caller hadn't included O_CREAT (i.e. they were trying to open an existing file). This fixes virFileOpenAs to jump down to the error return (which returns ret instead of fd) in the previously mentioned incorrect failure case of virFileOpenAs(), removes all error logging from virFileOpenAs() (since the callers report it), and modifies qemuOpenFile to appropriately use "open" or "create" in its log messages. NB: I seriously considered removing logging from all callers of virFileOpenAs(), but there is at least one case where the caller doesn't want virFileOpenAs() to log any errors, because it's just going to try again (qemuOpenFile()). We can't simply make a silent variation of virFileOpenAs() though, because qemuOpenFile() can't make the decision about whether or not it wants to retry until after virFileOpenAs() has already returned an error code. Likewise, I also considered changing virFileOpenAs() to return -1 with errno set on return, and may still do that, but only as a separate patch, as it obscures the intent of this patch too much.	2013-05-10 13:09:25 -04:00
Ján Tomko	c075f89fa2	don't mention disk controllers in generic controller errors The controller element supports non-disk controller types too. https://bugzilla.redhat.com/show_bug.cgi?id=960958	2013-05-09 14:25:11 +02:00
Daniel P. Berrange	a605b7e041	Unmerge attach/update/modify device APIs in drivers The LXC, QEMU, and LibXL drivers have all merged their handling of the attach/update/modify device APIs into one large 'xxxxDomainModifyDeviceFlags' which then does a 'switch()' based on the actual API being invoked. While this saves some lines of code, it is not really all that significant in the context of the driver API impls as a whole. This merger of the handling of different APIs creates pain when wanting to automated analysis of the code and do things which are specific to individual APIs. The slight duplication of code from unmerged the API impls, is preferrable to allow for easier automated analysis. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-08 10:47:48 +01:00
Daniel P. Berrange	449e6b1b58	Pull parsing of migration xml up into QEMU driver APIs Currently the parsing of XML is pushed down into the various migration helper APIs. This makes it difficult to insert the correct access control checks, since one helper API services many public APIs. Pull the parsing of XML up to the top level of the QEMU driver APIs	2013-05-08 10:47:48 +01:00
Daniel P. Berrange	03a600368e	Don't allow renaming of domains by the backdoor Several APIs allow for custom XML to be passed in. This is checked for ABI stability, which will ensure the UUID is not being changed. There isn't validation that the name did not change though. This could allow renaming of guests via the backdoor, which in turn could allow for bypassing access control restrictions based on names. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-08 10:47:47 +01:00
Daniel P. Berrange	4a044d0256	Separate internal node suspend APIs from public API The individual hypervisor drivers were directly referencing APIs in virnodesuspend.c in their virDriverPtr struct. Separate these methods, so there is always a wrapper in the hypervisor driver. This allows the unused virConnectPtr args to be removed from the virnodesuspend.c file. Again this will ensure that ACL checks will only be performed on invocations that are directly associated with public API usage. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-08 10:47:47 +01:00
Daniel P. Berrange	1c6d4ca557	Separate internal node device APIs from public API The individual hypervisor drivers were directly referencing APIs in src/nodeinfo.c in their virDriverPtr struct. Separate these methods, so there is always a wrapper in the hypervisor driver. This allows the unused virConnectPtr args to be removed from the nodeinfo.c file. Again this will ensure that ACL checks will only be performed on invocations that are directly associated with public API usage. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-08 10:47:47 +01:00
Daniel P. Berrange	ead630319d	Separate virGetHostname() API contract from driver APIs Currently the virGetHostname() API has a bogus virConnectPtr parameter. This is because virtualization drivers directly reference this API in their virDriverPtr tables, tieing its API design to the public virConnectGetHostname API design. This also causes problems for access control checks since these must only be done for invocations from the public API, not internal invocation. Remove the bogus virConnectPtr parameter, and make each hypervisor driver provide a dedicated function for the driver API impl. This will allow access control checks to be easily inserted later. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-08 10:47:47 +01:00
Ján Tomko	dcea5a492f	get rid of virBufferAsprintf where possible Use virBufferAddLit or virBufferAddChar instead.	2013-05-07 17:38:58 +02:00
Laine Stump	8cd40e7e0d	qemu: allocate network connections sooner during domain startup VFIO device assignment requires a cgroup ACL to be setup for access to the /dev/vfio/nn "group" device for any devices that will be assigned to a guest. In the case of a host device that is allocated from a pool, it was being allocated during qemuBuildCommandLine(), which is called by qemuProcessStart() after the all-encompassing qemuSetupCgroup() was called, meaning that the standard Cgroup ACL setup wasn't creating ACLs for these devices allocated from pools. One possible solution was to manually add a single ACL down inside qemuBuildCommandLine() when networkAllocateActualDevice() is called, but that has two problems: 1) the function that adds the cgroup ACL requires a virDomainObjPtr, which isn't available in qemuBuildCommandLine(), and 2) we really shouldn't be doing network device setup inside qemuBuildCommandLine() anyway. Instead, I've created a new function called qemuNetworkPrepareDevices() which is called just before qemuPrepareHostDevices() during qemuProcessStart() (explanation of ordering in the comments), i.e. well before the call to qemuSetupCgroup(). To minimize code churn in a patch that will be backported to 1.0.5-maint, qemuNetworkPrepareDevices only does networkAllocateActualDevice() and the bare amount of setup required for type='hostdev network devices, but it eventually should do all device setup for guest network devices. Note that some of the code that was previously needed in qemuBuildCommandLine() is no longer required when networkAllocateActualDevice() is called earlier: * qemuAssignDeviceHostdevAlias() is already done further down in qemuProcessStart(). * qemuPrepareHostdevPCIDevices() is called by qemuPrepareHostDevices() which is called after qemuNetworkPrepareDevices() in qemuProcessStart(). As hinted above, this new function should be moved into a separate qemu_network.c (or similarly named) file along with qemuPhysIfaceConnect(), qemuNetworkIfaceConnect(), and qemuOpenVhostNet(), and expanded to call those functions as well, then the nnets loop in qemuBuildCommandLine() should be reduced to only build the commandline string (which itself can be in a separate qemuInterfaceBuilldCommandLine() function as suggested by Michal). However, this will require storing away an array of tapfd and vhostfd that are needed for the commandline, so I would rather do that in a separate patch and leave this patch at the minimum to fix the bug.	2013-05-07 11:36:43 -04:00
Boris Fiuczynski	bde1731613	qemu: Enable the capability bit for -no-kvm-pit-reinjection on x86 only On architectures not supporting the Intel specific programmable interval timer, like e.g. S390, starting a domain with a clock definition containing a pit timer results in the error "Option no-kvm-pit-reinjection not supported for this target". By moving the capability enablement for -no-kvm-pit-reinjection from the InitQMPBasic section into the x86_64 and i686 only enablement section all other architectures are no longer automatically enabled. In addition architecture related capabilities enablements have refactored into a new architecture bound capabilities initialization function. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-05-07 14:42:40 +02:00
Peter Krempa	246d0068ac	qemu: Do fake auto-allocation of ports when generating native command When attempting to generate the native command line from an XML file that uses graphics port auto allocation, the generated commandline wouldn't be valid. This patch adds fake autoallocation of ports as done when starting the actual machine.	2013-05-06 22:13:22 +02:00
Laine Stump	52ba0f6e1c	qemu: fix stupid typos in VFIO cgroup setup/teardown I must have looked at this a couple dozen times before I noticed it had "!=" instead of "==". Not doing this setup prevented qemu from doing anything with the vfio group device.	2013-05-03 14:32:54 -04:00
Daniel P. Berrange	848a08bc94	Fix warning about unsupported cookie flags in QEMU driver The QEMU migration code unconditionally sets the 'persistent' cookie flag on the source host. The dest host, however, only allows it during parsing if VIR_MIGRATE_PERSIST_DEST was set. Make the source host only set it if this flag is present. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-03 14:06:15 +01:00
Eric Blake	22d12905e6	build: avoid non-portable cast of pthread_t POSIX says pthread_t is opaque. We can't guarantee if it is scaler or a pointer, nor what size it is; and BSD differs from Linux. We've also had reports of gcc complaining on attempts to cast it, if we use a cast to the wrong type (for example, pointers have to be cast to void* or intptr_t before being narrowed; while casting a function return of scalar pthread_t to void* triggers a different warning). Give up on casts, and use unions to get at decent bits instead. And rather than futz around with figuring which 32 bits of a potentially 64-bit pointer are most likely to be unique, convert the rest of the code base to use 64-bit values when using a debug id. Based on a report by Guido Günther against kFreeBSD, but with a fix that doesn't regress commit `4d970fd29` for FreeBSD. * src/util/virthreadpthread.c (virThreadSelfID, virThreadID): Use union to get at a decent bit representation of thread_t bits. * src/util/virthread.h (virThreadSelfID, virThreadID): Alter signature. * src/util/virthreadwin32.c (virThreadSelfID, virThreadID): Likewise. * src/qemu/qemu_domain.h (qemuDomainJobObj): Alter type of owner. * src/qemu/qemu_domain.c (qemuDomainObjTransferJob) (qemuDomainObjSetJobPhase, qemuDomainObjReleaseAsyncJob) (qemuDomainObjBeginNestedJob, qemuDomainObjBeginJobInternal): Fix clients. * src/util/virlog.c (virLogFormatString): Likewise. * src/util/vireventpoll.c (virEventPollInterruptLocked): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-05-03 06:30:22 -06:00
Daniel P. Berrange	377ac10c8f	Remove redundant () in expression The use of () in a simple boolean comparison was not required Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-05-03 10:29:07 +01:00
Michal Privoznik	7c9a2d88cd	virutil: Move string related functions to virstring.c The source code base needs to be adapted as well. Some files include virutil.h just for the string related functions (here, the include is substituted to match the new file), some include virutil.h without any need (here, the include is removed), and some require both.	2013-05-02 16:56:55 +02:00
Michal Privoznik	297c99a567	qemu: Generate agent socket path if missing It's not desired to force users imagine path for a socket they are not even supposed to connect to. On the other hand, we already have a release where the qemu agent socket path is exposed to XML, so we cannot silently drop it from there. The new path is generated in form: $LOCALSTATEDIR/lib/libvirt/qemu/channel/target/$domain.$name for qemu system mode, and $XDG_CONFIG_HOME/qemu/lib/channel/target/$domain.$name for qemu session mode.	2013-05-02 16:40:24 +02:00
Laine Stump	e482693b24	pci: autolearn name of stub driver, remove from arglist virPCIDeviceReattach and virPCIDeviceUnbindFromStub (called by virPCIDeviceReattach) had previously required the name of the stub driver as input. This is unnecessary, because the name of the driver the device is currently bound to can be found by looking at the link: /sys/bus/pci/dddd:bb:ss.ff/driver Instead of requiring that the name of the expected stub driver name and only unbinding if that one name is matched, we no longer take a driver name in the arglist for either of these functions. virPCIDeviceUnbindFromStub just compares the name of the currently bound driver to a list of "well known" stubs (right now contains "pci-stub" and "vfio-pci" for qemu, and "pciback" for xen), and only performs the unbind if it's one of those devices. This allows virsh nodedevice-reattach to work properly across a libvirtd restart, and fixes a couple of cases where we were erroneously still hard-coding "pci-stub" as the drive name. For some unknown reason, virPCIDeviceReattach had been calling modprobe on the stub driver prior to unbinding the device. This was problematic because we no longer know the name of the stub driver in that function. However, it is pointless to probe for the stub driver at that time anyway - because the device is bound to the stub driver, we are guaranteed that it is already loaded, and so that call to modprobe has been removed.	2013-05-02 02:09:29 -04:00
Viktor Mihajlovski	3a82f628a9	S390: Do not generate a default USB controller For s390 we don't want to have a default USB device generated even if QEMU is silently tolerating -usb on the command line. This may change in the future. Another reason to avoid the USB controller is that it implies a PCI bus which might cause a regression at some later point in time. The following change will set the USB controller model to 'none' unless a model or address has been specified, which can be the case if a legacy definition is loaded or the XML writer knows what she/he's doing. Requiring the user to explicitly disable USB on systems not supporting it seems cumbersome. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-04-30 19:18:43 -06:00
Laine Stump	f6966b6277	qemu: fix failure to start with spice graphics and no tls Commit `eca3fdf` inadvertantly caused a failure to start for any domain with the following in its config: <graphics type='spice' autoport='yes'/> The problem is that when tlsPort == 0 and defaultMode == "any" (which is the default for defaultMode), this would be flagged in the code as "needTLSPort", and if there was then no spice tls config, the new error+fail would happen. This patch checks for the case of defaultMode == "any", and in that case simply doesn't allocate a TLS port (since that's probably not what the user wanted, and it would have failed later anyway.). It does leave the error in place for cases when the user specifically asked to use tls in one way or another, though.	2013-04-30 18:20:53 -04:00
John Ferlan	d0761c18a4	Resolve valgrind error As a result of commit id '19c345f2', 'make -C tests valgrind' has the following for qemuxml2argvtest: ==22482== 197 (80 direct, 117 indirect) bytes in 1 blocks are definitely lost in loss record 101 of 120 ==22482== at 0x4A06B6F: calloc (vg_replace_malloc.c:593) ==22482== by 0x4C6F301: virAlloc (viralloc.c:124) ==22482== by 0x4C840FC: virSaveLastError (virerror.c:308) ==22482== by 0x431882: qemuBuildCommandLine (qemu_command.c:8204) ==22482== by 0x41E8F0: testCompareXMLToArgvHelper (qemuxml2argvtest.c:155) ==22482== by 0x41FE9F: virtTestRun (testutils.c:157) ==22482== by 0x419DEB: mymain (qemuxml2argvtest.c:654) ==22482== by 0x4204DA: virtTestMain (testutils.c:719) ==22482== by 0x39D0821A04: (below main) (libc-start.c:225) ==22482==	2013-04-30 13:26:22 -04:00
Martin Kletzander	a6a10a52eb	Fix typo in augeas comment	2013-04-30 16:31:40 +02:00
Ján Tomko	29bd350bf6	qemu: report an error if memballoon has wrong address type qemuBuildMemballoonDevStr returns NULL if memballoon doesn't have the right address type, but it doesn't report an error, leading to: error: An error occurred, but the cause is unknown Report a helpful error message instead, e.g.: error: XML error: memballoon unsupported with address type 'usb'	2013-04-30 10:23:44 +02:00
Ján Tomko	11fc1beab6	qemu: assign addresses when converting xml to native This adds addresses to domxml-to-native output and chooses the correct virtio devices for ccw and s390 machines. https://bugzilla.redhat.com/show_bug.cgi?id=957077	2013-04-30 10:23:44 +02:00
Peter Krempa	eca3fdf738	qemu: Error out if spice port autoallocation is requested, but disabled When a user requests auto-allocation of the spice TLS port but spice TLS is disabled in qemu.conf, we start the machine and let qemu fail instead of erroring out sooner. Add an error message so that this doesn't happen.	2013-04-30 09:43:12 +02:00
Laine Stump	811143c0b6	qemu: put usb cgroup setup in common function The USB-specific cgroup setup had been inserted inline in qemuDomainAttachHostUsbDevice and qemuSetupCgroup, but now there is a common cgroup setup function called for all hostdevs, so it makes sens to put the usb-specific setup there and just rely on that function being called. The one thing I'm uncertain of here (and a reason for not pushing until after release) is that previously hostdev->missing was checked only when starting a domain (and cgroup setup for the device skipped if missing was true), but with this consolidation, it is now checked in the case of hotplug as well. I don't know if this will have any practical effect (does it make sense to hotplug a "missing" usb device?)	2013-04-29 21:52:28 -04:00
Laine Stump	6e13860cb4	qemu: add vfio devices to cgroup ACL when appropriate PCIO device assignment using VFIO requires read/write access by the qemu process to /dev/vfio/vfio, and /dev/vfio/nn, where "nn" is the VFIO group number that the assigned device belongs to (and can be found with the function virPCIDeviceGetVFIOGroupDev) /dev/vfio/vfio can be accessible to any guest without danger (according to vfio developers), so it is added to the static ACL. The group device must be dynamically added to the cgroup ACL for each vfio hostdev in two places: 1) for any devices in the persistent config when the domain is started (done during qemuSetupCgroup()) 2) at device attach time for any hotplug devices (done in qemuDomainAttachHostDevice) The group device must be removed from the ACL when a device it "hot-unplugged" (in qemuDomainDetachHostDevice()) Note that USB devices are already doing their own cgroup setup and teardown in the hostdev-usb specific function. I chose to make the new functions generic and call them in a common location though. We can then move the USB-specific code (which is duplicated in two locations) to this single location. I'll be posting a followup patch to do that.	2013-04-29 21:52:28 -04:00
Ján Tomko	dfb4834940	qemu: honor allowDiskFormatProbing when parsing command line My commit `024e9af` broke this.	2013-04-29 15:52:02 +02:00
Ján Tomko	379e4bcce5	qemu: prevent invalid reads in qemuAssignDevicePCISlots Don't reserve slot 2 for video if the machine has no PCI buses. Error out when the user specifies a video device without a PCI address when there are no PCI buses. (This wouldn't work on a machine with no PCI bus anyway since we do add PCI addresses for video devices to the command line)	2013-04-27 12:55:46 +02:00
Ján Tomko	877bc08947	qemu: don't always reserve PCI addresses for implicit controllers In the past we automatically added a USB controller and assigned it a PCI address (0:0:1.2) even on machines without a PCI bus. This didn't break machines with no PCI bus because the command line for it is just '-usb', with no mention of the PCI bus. The implicit IDE controller (reserved address 0:0:1.1) has no command line at all. Commit `b33eb0dc` removed the ability to reserve PCI addresses on machines without a PCI bus. This made them stop working, since there would always be the implicit USB controller. Skip the reservation of addresses for these controllers when there is no PCI bus, instead of failing.	2013-04-27 12:55:46 +02:00
Laine Stump	19635f7d0d	conf: remove extraneous _TYPE from driver backend enums This isn't strictly speaking a bugfix, but I realized I'd gotten a bit too verbose when I chose the names for VIR_DOMAIN_HOSTDEV_PCI_BACKEND_TYPE_*. This shortens them all a bit.	2013-04-26 21:51:12 -04:00
Paolo Bonzini	2d80fbb14d	qemu: launch bridge helper from libvirtd <source type='bridge'> uses a helper application to do the necessary TUN/TAP setup to use an existing network bridge, thus letting unprivileged users use TUN/TAP interfaces. However, libvirt should be preventing QEMU from running any setuid programs at all, which would include this helper program. From a security POV, any setuid helper needs to be run by libvirtd itself, not QEMU. This is what this patch does. libvirt now invokes the setuid helper, gets the TAP fd and then passes it to QEMU in the normal manner. The path to the helper is specified in qemu.conf. As a small advantage, this adds a <target dev='tap0'/> element to the XML of an active domain using <interface type='bridge'>. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-04-26 15:37:51 -06:00
Ján Tomko	a12475bd44	qemu: don't assign a PCI address to 'none' USB controller Adjust the usb-none test, since it gives the memballoon a lower PCI slot now. Add a test for 'none' controller on s390, which doesn't have PCI buses.	2013-04-26 20:06:01 +02:00
Laine Stump	9395894585	qemu: set qemu process' RLIMIT_MEMLOCK when VFIO is used VFIO requires all of the guest's memory and IO space to be lockable in RAM. The domain's max_balloon is the maximum amount of memory the domain can have (in KiB). We add a generous 1GiB to that for IO space (still much better than KVM device assignment, where the KVM module actually ignores the process limits and locks everything anyway), and convert from KiB to bytes. In the case of hotplug, we are changing the limit for the already existing qemu process (prlimit() is used under the hood), and for regular commandline additions of vfio devices, we schedule a call to setrlimit() that will happen after the qemu process is forked.	2013-04-26 10:23:46 -04:00
Laine Stump	7bdf459d2c	qemu: use new virCommandSetMax(Processes\|Files) These were previously being set in a custom hook function, but now that virCommand directly supports setting them, we can eliminate that part of the hook and call the APIs directly.	2013-04-26 10:23:46 -04:00
Laine Stump	eaff16113a	qemu: implement virNodeDeviceDetachFlags backend The differences from virNodeDeviceDettach are very minor: 1) Check that the flags are 0. 2) Set the virPCIDevice's stubDriver according to the driverName that is passed in. 3) Call virPCIDeviceDetach with a NULL stubDriver, indicating it should get the name of the stub driver from the virPCIDevice object.	2013-04-25 21:28:10 -04:00
Laine Stump	cc0a918872	qemu: bind/unbind stub driver according to config <driver name='x'/> If the config for a device has specified <driver name='vfio'/>, "backend" in the pci part of the hostdev object will be set to ..._VFIO. In this case, when creating a virPCIDevice set the stubDriver to "vfio-pci", otherwise set it to "pci-stub". We will rely on the lower levels to report an error if the vfio driver isn't loaded. The detach/attach functions in virpci.c will pay attention to the stubDriver setting in the device, and bind/unbind the appropriate driver when preparing hostdevs for the domain. Note that we don't yet attempt to do anything to mark active any other devices in the same vfio "group" as a single device that is being marked active. We do need to do that, but in order to get basic VFIO functionality testing sooner rather than later, initially we'll just live with more cryptic errors when someone tries to do that.	2013-04-25 21:28:10 -04:00
Laine Stump	731b0f36f1	qemu: use vfio-pci on commandline when appropriate The device option for vfio-pci is nearly identical to that for pci-assign - only the configfd parameter isn't supported (or needed). Checking for presence of the bootindex parameter is done separately from constructing the commandline, similar to how it is done for pci-assign. This patch contains tests to check for proper commandline construction. It also includes tests for parser-formatter-parser roundtrips (xml2xml), because those tests use the same data files, and would have failed had they been included before now. qemu: xml/args tests for VFIO hostdev and <interface type='hostdev'/> These should be squashed in with the patch that adds commandline handling of vfio (they would fail at any earlier time).	2013-04-25 21:28:10 -04:00
Laine Stump	9f80fc1bd5	conf: put hostdev pci address in a struct There will soon be other items related to pci hostdevs that need to be in the same part of the hostdevsubsys union as the pci address (which is currently a single member called "pci". This patch replaces the single member named pci with a struct named pci that contains a single member named "addr".	2013-04-25 21:23:38 -04:00
Laine Stump	5b90ef0847	qemu: detect vfio-pci device and its bootindex parameter QEMU_CAPS_DEVICE_VFIO_PCI is set if the device named "vfio-pci" is supported in the qemu binary. QEMU_CAPS_VFIO_PCI_BOOTINDEX is set if the vfio-pci device supports the "bootindex" parameter; for some reason, the bootindex parameter wasn't included in early versions of vfio support (qemu 1.4) so we have to check for it separately from vfio itself.	2013-04-25 21:23:38 -04:00
Eric Blake	b121584f58	qemu: fix build error with older platforms Jim Fehlig reported on IRC that older gcc/glibc triggers this warning: cc1: warnings being treated as errors qemu/qemu_domain.c: In function 'qemuDomainDefFormatBuf': qemu/qemu_domain.c:1297: error: declaration of 'remove' shadows a global declaration [-Wshadow] /usr/include/stdio.h:157: error: shadowed declaration is here [-Wshadow] make[3]: *** [libvirt_driver_qemu_impl_la-qemu_domain.lo] Error 1 Fix it like we have done in the past (such as commit `2e6322a`). * src/qemu/qemu_domain.c (qemuDomainDefFormatBuf): Avoid shadowing a function name. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-04-25 11:26:58 -06:00
Ján Tomko	5c9cffea23	qemu: auto-add pci-root to 'pc-i440*' machines too Commit `b33eb0d` missed this machine type.	2013-04-25 17:29:27 +02:00
Michal Privoznik	01d5a97210	qemu_command.c: Fix whitespacing within for() After `9d6e56db` the syntax-check was unhappy due to wrong whitespacing: src/qemu/qemu_command.c:1637: for ( ; a.slot < QEMU_PCI_ADDRESS_SLOT_LAST; a.slot++) { maint.mk: incorrect whitespace around brackets, see HACKING for rules make: *** [bracket-spacing-check] Error 1	2013-04-25 13:52:49 +02:00
Michal Privoznik	6ddbabf938	qemu_conf: Don't discard strdup OOM error After `78d7c3c5` we are strdup()-ing path to qemu-bridge-helper. However, the check for its return value is missing. So it is possible we've ignored the OOM error silently.	2013-04-25 13:45:37 +02:00
Ján Tomko	9d6e56dbce	qemu: auto-add bridges and allow using them Add a "dry run" address allocation to figure out how many bridges will be needed for all the devices without explicit addresses. Auto-add just enough bridges to put all the devices on, or up to the bridge with the largest specified index.	2013-04-25 13:19:40 +02:00
Ján Tomko	b33eb0dca1	qemu: auto-add pci-root controller for pc machine types <controller type='pci' index='0' model='pci-root'/> is auto-added to pc* machine types. Without this controller PCI bus 0 is not available and no PCI addresses are assigned by default. Since older libvirt supported PCI bus 0 even without this controller, it is removed from the XML when migrating.	2013-04-25 13:05:10 +02:00
liguang	d350a34caf	qemu: build command line for pci-bridge device Signed-off-by: Ján Tomko <jtomko@redhat.com>	2013-04-25 12:54:59 +02:00
Ján Tomko	024e9af3e5	qemu: call post-parse callbacks when parsing command line too Now we set the default disk driver name when parsing the qemu command line too, hence all the test changes. Assume format type is 'auto' when none is specified on qemu command line.	2013-04-25 12:10:22 +02:00
Osier Yang	48f43940e9	qemu: Fix the indention Pushed under trivial rule.	2013-04-25 17:13:33 +08:00
Li Zhang	dfd0e4f7f2	qemu: Add command line builder and parser for NVRAM. This patch is to add command line builder and parser for NVRAM device, and add test cases. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2013-04-25 16:50:45 +08:00
Michal Privoznik	19c345f2fe	qemuBuildCommandLine: Don't overwrite errors with NWFilter's one Currently, if there has been an error in building command line process after virtual interfaces has been created, the flow jumps to 'error' label, where virDomainConfNWFilterTeardown() is called. This may report an error as well, but should not overwrite the original cause why we jumped to 'error' label.	2013-04-25 08:59:49 +02:00
Wido den Hollander	e3e866aee0	qemu: Don't require a block or file when looking for an alias This for example prohibits you to use iotune for Ceph or Sheepdog devices. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2013-04-24 16:29:26 -06:00
Osier Yang	18b428980f	Change the tag name "num_queues" into "queues" Instead of making a choice between the underscore and camelCase, this simply changes "num_queues" into "queues", which is also consistent with Michal's multiple queue support for interface.	2013-04-24 23:36:07 +08:00
Peter Krempa	20cb7f3a41	qemu: Improve handling of channels when generating SPICE command line Improve error reporting and generating of SPICE command line arguments according to the need to enable TLS. If TLS is disabled, there's no need to pass the certificate dir to qemu. This patch resolves: https://bugzilla.redhat.com/show_bug.cgi?id=953126	2013-04-24 14:37:57 +02:00
Peter Krempa	7b4a630484	qemu: Do sensible auto allocation of SPICE port numbers With this patch, if the autoport attribute is used, the code will sensibly auto allocate the ports only if needed.	2013-04-24 14:37:20 +02:00
Daniel P. Berrange	90430791ae	Make driver method names consistent with public APIs Ensure that all drivers implementing public APIs use a naming convention for their implementation that matches the public API name. eg for the public API virDomainCreate make sure QEMU uses qemuDomainCreate and not qemuDomainStart Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-24 11:00:18 +01:00
Daniel P. Berrange	abe038cfc0	Extend previous check to validate driver struct field names Ensure that the driver struct field names match the public API names. For an API virXXXX we must have a driver struct field xXXXX. ie strip the leading 'vir' and lowercase any leading uppercase letters. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-24 10:59:53 +01:00
Peter Krempa	23090823f1	qemu: Split out SPICE port allocation into a separate function Later on this function will be used to do more sophisticated checks and determination if port allocation is needed.	2013-04-23 21:30:56 +02:00
Peter Krempa	bd15ee89a7	qemu: Use switch instead of ifs in qemuBuildGraphicsCommandLine Switch the function from a bunch of ifs to a switch statement with correct type and reflow some code. Also fix comment in enum describing possible graphics types	2013-04-23 21:30:55 +02:00
Peter Krempa	66135c7208	qemu: Split out code to generate VNC command line Decrease size of qemuBuildGraphicsCommandLine() by splitting out spice-related code into qemuBuildGraphicsVNCCommandLine(). This patch also fixes 2 possible memory leaks on error path in the code that was split-out. The buffer containing the already generated options and a listen address string could be leaked. Also break a few very long lines and reflow code that fits now.	2013-04-23 21:30:55 +02:00
Peter Krempa	d05b6844c9	qemu: Split out code to generate SPICE command line Decrease size of qemuBuildGraphicsCommandLine() by splitting out spice-related code into qemuBuildGraphicsSPICECommandLine(). This patch also fixes 2 possible memory leaks on error path in the code that was split-out. The buffer containing the already generated options and a listen address string could be leaked. Also break a few very long lines.	2013-04-23 21:30:55 +02:00
Jiri Denemark	6d4804858e	qemu: Use -machine accel=tcg\|kvm when available This is a better interface to choose accelerator than guessing whether we should enable or disable kvm to get the right one.	2013-04-23 21:19:35 +02:00
Jiri Denemark	cfe24c1a18	qemu: Move -enable-kvm and friends earlier in the command line	2013-04-23 21:19:35 +02:00
Peter Krempa	fa006c4fdd	qemu: Fix setting of memory tunables Refactoring done in `19c6ad9ac7` didn't correctly take into account the order cgroup limit modification needs to be done in. This resulted into errors when decreasing the limits. The operations need to take place in this order: decrease hard limit change swap hard limit or change swap hard limit increase hard limit This patch also fixes the check if the hard_limit is less than swap_hard_limit to print better error messages. For this purpose I introduced a helper function virCompareLimitUlong to compare limit values where value of 0 is equal to unlimited. Additionally the check is now applied also when the user does not provide all of the tunables through the API and in that case the currently set values are used. This patch resolves: https://bugzilla.redhat.com/show_bug.cgi?id=950478	2013-04-23 07:10:56 +02:00
Jiri Denemark	6d1b3edc6e	qemu: Ignore libvirt logs when reading QEMU error output When QEMU fails to start, libvirt read its error output and reports it back in an error message. However, when libvirtd is configured to log debug messages, one would get the following unhelpful garbage: virsh # start cd error: Failed to start domain cd error: internal error process exited while connecting to monitor: \ 2013-04-22 14:24:54.214+0000: 2194219: debug : virFileClose:72 : \ Closed fd 21 2013-04-22 14:24:54.214+0000: 2194219: debug : virFileClose:72 : \ Closed fd 27 2013-04-22 14:24:54.215+0000: 2194219: debug : virFileClose:72 : \ Closed fd 3 2013-04-22 14:24:54.215+0000: 2194220: debug : virExec:602 : Run \ hook 0x7feb8f600bf0 0x7feb86ef9300 2013-04-22 14:24:54.215+0000: 2194220: debug : qemuProcessHook:2507 \ : Obtaining domain lock 2013-04-22 14:24:54.216+0000: 2194220: debug : \ virDomainLockProcessStart:170 : plugin=0x7feb780261f0 \ dom=0x7feb7802a360 paused=1 fd=0x7feb86ef8ec4 2013-04-22 14:24:54.216+0000: 2194220: debug : \ virDomainLockManagerNew:128 : plugin=0x7feb780261f0 \ dom=0x7feb7802a360 withResources=1 2013-04-22 14:24:54.216+0000: 2194220: debug : \ virLockManagerPluginGetDriver:297 : plugin=0x7feb780261f0 2013-04-22 14:24:54.216+0000: 2194220: debug : \ virLockManagerNew:321 : driver=0x7feb8ef08640 type=0 nparams=5 \ params=0x7feb86ef8d60 flags=0 2013-04-22 14:24:54.216+000 instead of (the output with this patch applied): virsh # start cd error: Reconnected to the hypervisor error: Failed to start domain cd error: internal error process exited while connecting to monitor: \ char device redirected to /dev/pts/33 (label charserial0) qemu-system-x86_64: -drive file=/home/vm/systemrescuecd-x86-1.2.0.\ iso,if=none,id=drive-ide0-1-0,readonly=on,format=raw,cache=none: \ could not open disk image /home/vm/systemrescuecd-x86-1.2.0.iso: \ Permission denied	2013-04-22 20:13:40 +02:00
Jiri Denemark	e4bdba8d7f	qemu: Move QEMU log reading into a separate function	2013-04-22 20:13:40 +02:00
Daniel P. Berrange	1e05073fbb	Replace more cases of /system with /machine The change in commit `aed4986322` was incomplete, missing a couple of cases of /system. This caused failure to start VMs. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-22 17:11:36 +01:00
Daniel P. Berrange	aed4986322	Change default resource partition to /machine After discussions with systemd developers it was decided that a better default policy for resource partitions is to have 3 default partitions at the top level /system - system services /machine - virtual machines / containers /user - user login session This ensures that the default policy isolates guest from user login sessions & system services, so a mis-behaving guest can't consume 100% of CPU usage if other things are contending for it. Thus we change the default partition from /system to /machine Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-22 12:10:12 +01:00
Osier Yang	a71ec98841	qemu: Fix the wrong expression Wrong use of the parentheses causes "rc" always having a boolean value, either "1" or "0", and thus we can't get the detailed error message when it fails: Before (I only have 1 node): % virsh numatune f18 --nodeset 12 error: Unable to change numa parameters error: unable to set numa tunable: Unknown error -1 After: virsh numatune f18 --nodeset 12 error: Unable to change numa parameters error: unable to set numa tunable: Invalid argument	2013-04-22 18:56:20 +08:00
Ján Tomko	6f45099723	qemu: rename CheckSlot to SlotInUse Also change its return value from int to bool.	2013-04-19 18:16:01 +02:00
Ján Tomko	5d29ca063d	qemu: switch PCI address set from hash table to an array Each bus is represented as an array of 32 8-bit integers where each bit represents a PCI function and each byte represents a PCI slot. Uses just one bus so far.	2013-04-19 18:16:01 +02:00
Ján Tomko	db180a1d31	qemu: move PCI address check out of qemuPCIAddressAsString Create a new function qemuPCIAddressValidate and call it everywhere the user might supply an incorrect address: * qemuCollectPCIAddress for domain definition * qemuDomainPCIAddressEnsureAddr and ReleaseSlot for hotplug Slot and function shouldn't be wrong at this point, since values out of range should be rejected by the XML parser.	2013-04-19 17:50:54 +02:00
Ján Tomko	62940d6c68	qemu: QEMU_PCI constant consistency Change QEMU_PCI_ADDRESS_LAST_SLOT to the number of slots in the bus, not the maximum slot value, to match QEMU_PCI_ADDRESS_LAST_FUNCTION and rename them both to have _LAST at the end.	2013-04-19 17:50:54 +02:00
Ján Tomko	ba8b8ddb7f	qemu: print PCI address hexadecimally in errors Use the same formatting as we do for XML in error and debug outputs.	2013-04-19 17:50:54 +02:00
Ján Tomko	8e5928de98	qemu: make qemuComparePCIDevice aware of multiple buses Bus and domain need to be checked as well, otherwise we might get false positives when searching for multi-function devices.	2013-04-19 17:50:54 +02:00
Li Zhang	88c6159ca7	Set legacy USB option with default for ppc64. Currently, -device xxx still doesn't work well for ppc64 platform. It's better use legacy USB option with default for ppc64. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-19 11:30:49 +01:00
Ján Tomko	4327df7eee	qemu: fix default spice password setting Set spice password even if default VNC password hasn't been set. https://bugzilla.redhat.com/show_bug.cgi?id=953720	2013-04-19 07:08:30 +02:00
Paolo Bonzini	78d7c3c569	qemu_conf: add new configuration key bridge_helper Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-04-18 14:58:33 -06:00
Tal Kain	9b3322c766	qemu: simplify use of virArchFromHost Reusing the result of virArchFromHost instead of calling it multiple times Signed-off-by: Tal Kain <tal.kain@ravellosystems.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-04-18 06:42:11 -06:00
Osier Yang	09d2547f96	qemu: Allow the disk wwn to have "0x" prefix The recent qemu requires "0x" prefix for the disk wwn, this patch changes virValidateWWN to allow the prefix, and prepend "0x" if it's not specified. E.g. qemu-kvm: -device scsi-hd,bus=scsi0.0,channel=0,scsi-id=0,lun=0,\ drive=drive-scsi0-0-0-0,id=scsi0-0-0-0,wwn=6000c60016ea71ad: Property 'scsi-hd.wwn' doesn't take value '6000c60016ea71ad' Though it's a qemu regression, but it's nice to allow the prefix, and doesn't hurt for us to always output "0x".	2013-04-17 23:05:56 +08:00
Osier Yang	bc95be5dea	cleanup: Remove the duplicate header Detected by a simple Shell script: for i in $(git ls-files -- '.[ch]'); do awk 'BEGIN { fail=0 } /# include.\.h/{ match($0, /["<][^">][">]/) arr[substr($0, RSTART+1, RLENGTH-2)]++ } END { for (key in arr) { if (arr[key] > 1) { fail=1 printf("%d %s\n", arr[key], key) } } if (fail == 1) exit 1 }' $i if test $? != 0; then echo "Duplicate header(s) in $i" fi done; A later patch will add the syntax-check to avoid duplicate headers.	2013-04-17 15:49:35 +08:00
Stefan Berger	8b934a5cb6	Check for unsupported QMP command Check for an unsupported QMP command when using the query-tpm-models and query-tpm-types commands before checking for general errors in order to avoid error messages in the log. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2013-04-16 07:05:21 -04:00
Stefan Berger	f62cb55666	Revert checking for QMP query-tpm-models Revert the patch checking for the QMP query-tpm-models command. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2013-04-16 07:05:21 -04:00
Peter Krempa	cbf8ebaad4	qemu_agent: Add support for appending arrays to commands Add support for array elements for agent commands just like `64d5e815` did for monitor commands	2013-04-16 10:38:30 +02:00
Stefan Berger	3208c562b4	Check for QMP query-tpm-models Check for QMP query-tpm-models and set a capability flag. Do not use this QMP command if it is not supported. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2013-04-15 16:46:53 -04:00
Daniel P. Berrange	767596bdb4	Remove non-functional code for setting up non-root cgroups The virCgroupNewDriver method had a 'bool privileged' param. If a false value was ever passed in, it would simply not work, since non-root users don't have any privileges to create new cgroups. Just delete this broken code entirely and make the QEMU driver skip cgroup setup in non-privileged mode Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-15 17:35:31 +01:00
Daniel P. Berrange	db44eb1b5f	Change default cgroup layout for QEMU/LXC and honour XML config Historically QEMU/LXC guests have been placed in a cgroup layout that is $LOCATION-OF-LIBVIRTD/libvirt/{qemu,lxc}/$VMNAME This is bad for a number of reasons - The cgroup hierarchy gets very deep which seriously impacts kernel performance due to cgroups scalability limitations. - It is hard to setup cgroup policies which apply across services and virtual machines, since all VMs are underneath the libvirtd service. To address this the default cgroup location is changed to be /system/$VMNAME.{lxc,qemu}.libvirt This puts virtual machines at the same level in the hierarchy as system services, allowing consistent policy to be setup across all of them. This also honours the new resource partition location from the XML configuration, for example <resource> <partition>/virtualmachines/production</partitions> </resource> will result in the VM being placed at /virtualmachines/production/$VMNAME.{lxc,qemu}.libvirt NB, with the exception of the default, /system, path which is intended to always exist, libvirt will not attempt to auto-create the partitions in the XML. It is the responsibility of the admin/app to configure the partitions. Later libvirt APIs will provide a way todo this. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-15 17:35:31 +01:00
Daniel P. Berrange	aa8604dd45	Add a new virCgroupNewPartition for setting up resource partitions A resource partition is an absolute cgroup path, ignoring the current process placement. Expose a virCgroupNewPartition API for constructing such cgroups Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-15 17:35:31 +01:00
Daniel P. Berrange	04c18d25f1	Rename virCgroupForXXX to virCgroupNewXXX Rename all the virCgroupForXXX methods to use the form virCgroupNewXXX since they are all constructors. Also make sure the output parameter is the last one in the list, and annotate all pointers as non-null. Fix up all callers, and make sure they use true/false not 0/1 for the boolean parameters Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-15 17:35:31 +01:00
Daniel P. Berrange	632f78caaf	Store a virCgroupPtr instance in qemuDomainObjPrivatePtr Instead of calling virCgroupForDomain every time we need the virCgrouPtr instance, just do it once at Vm startup and cache a reference to the object in qemuDomainObjPrivatePtr until shutdown of the VM. Removing the virCgroupPtr from the QEMU driver state also means we don't have stale mount info, if someone mounts the cgroups filesystem after libvirtd has been started Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-15 17:35:31 +01:00
Peter Krempa	63b68f3cb4	qemu: Report also domain name in error message when domain object wasn't found Report the errors as: Domain not found: no domain with matching uuid '41414141-4141-4141-4141-414141414141' (crashtest) instead of: Domain not found: no domain with matching uuid '41414141-4141-4141-4141-414141414141'	2013-04-15 09:43:54 +02:00
Peter Krempa	54a99ba867	qemu: Refactor lookup of domain object Use the helper to lookup the domain object in the remaining places. This patch also fixes error reporting when the domain was not found in several functions that were printing the raw UUID buffer instead of the formatted string. The offending functions were: qemuDomainGetInterfaceParameters qemuDomainSetInterfaceParameters qemuGetSchedulerParametersFlags qemuSetSchedulerParametersFlags qemuDomainGetNumaParameters qemuDomainSetNumaParameters qemuDomainGetMemoryParameters qemuDomainSetMemoryParameters qemuDomainGetBlkioParameters qemuDomainSetBlkioParameters qemuDomainGetCPUStats	2013-04-15 09:43:54 +02:00
Osier Yang	00b6828dc2	cleanup: Change datatype of graphic's members to boolean	2013-04-13 13:28:36 +08:00
Stefan Berger	291cfb83f3	TPM support for QEMU command line For TPM passthrough device support create command line parameters like: -tpmdev passthrough,id=tpm-tpm0,path=/dev/tpm0,cancel-path=/sys/class/misc/tpm0/device/cancel -device tpm-tis,tpmdev=tpm-tpm0,id=tpm0 Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com> Reviewed-by: Corey Bryant <coreyb@linux.vnet.ibm.com> Tested-by: Corey Bryant <coreyb@linux.vnet.ibm.com>	2013-04-12 16:55:46 -04:00
Stefan Berger	22feb0d3e7	QEMU Cgroup support for TPM passthrough Some refactoring for virDomainChrSourceDef type of devices so we can use common code. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com> Reviewed-by: Corey Bryant <coreyb@linux.vnet.ibm.com> Tested-by: Corey Bryant <coreyb@linux.vnet.ibm.com>	2013-04-12 16:55:46 -04:00
Stefan Berger	f447ff5982	Convert QMP strings into QEMU capability bits Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com> Reviewed-by: Corey Bryant <coreyb@linux.vnet.ibm.com> Tested-by: Corey Bryant <coreyb@linux.vnet.ibm.com>	2013-04-12 16:55:45 -04:00
Stefan Berger	ed1f031850	Add QMP probing for TPM Probe for QEMU's QMP TPM support by querying the lists of supported TPM models (query-tpm-models) and backend types (query-tpm-types). The setting of the capability flags following the strings returned from the commands above is only provided in the patch where domain_conf.c gets TPM support due to dependencies on functions only introduced there. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com> Reviewed-by: Corey Bryant <coreyb@linux.vnet.ibm.com> Tested-by: Corey Bryant <coreyb@linux.vnet.ibm.com>	2013-04-12 16:55:45 -04:00
Li Zhang	a6e37aedff	Add USB option capability To avoid the collision for creating USB controllers in machine->init() and -device xx command line, it needs to set usb=off to avoid one USB controller created in machine->init(). So that libvirt can use -device or -usb to create USB controller sucessfully. So QEMU_CAPS_MACHINE_USB_OPT capability is added, and it is for QEMU v1.3.0 onwards which supports USB option. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2013-04-12 10:56:03 +01:00
Jiri Denemark	88624b5d4c	qemu: Do not report unsafe migration for local files When migrating a domain with disk images stored locally (and using storage migration), we should not complain about unsafe migration no matter what cache policy is used for that disk.	2013-04-11 21:57:50 +02:00
Peter Krempa	608d149e97	qemu: Try to use QMP for send-key if supported Instead of always using HMP use the QMP send-key command introduced in qemu 1.3.	2013-04-11 16:42:30 +02:00
Michal Privoznik	7f15ebc7a2	qemu: Set correct migrate host in client_migrate_info https://bugzilla.redhat.com/show_bug.cgi?id=920441 Currently, we are discarding listen attribute from qemu cookie even though we strive to gather it. This result in not so cool bug: if user have different networks, one for management/migration, and one for VNC/SPICE we pass incorrect host to the qemu in client_migrate_info. What we actually pass is remote hostname, while we should be passing remote listen address. It doesn't matter as long as these two are the same, but they don't need necessary to be like that.	2013-04-11 12:32:17 +02:00
Ján Tomko	74bff25090	qemu: fix crash in qemuOpen If the path part of connection URI is not present, cfg is used unitialized. https://bugzilla.redhat.com/show_bug.cgi?id=950855	2013-04-11 11:41:22 +02:00
Osier Yang	e9e37538bb	cleanup: Change datatype of disk->readonly to boolean	2013-04-11 11:36:44 +08:00
Osier Yang	1bbc1e7524	cleanup: Change datatype of hostdev->missing to boolean	2013-04-11 11:36:28 +08:00
Osier Yang	9fda2f5cc9	Cleanup: Change datatype of hostdev->managed to boolean	2013-04-11 11:31:02 +08:00
Han Cheng	5bc5a44db9	conf: Change help function The helper function to look up disk controller model may be used by scsi hostdev. But it should be changed to use device info. Signed-off-by: Han Cheng <hanc.fnst@cn.fujitsu.com>	2013-04-09 22:21:16 +08:00
Peter Krempa	b0216da8ee	qemu: Remove now obsolete assignment of default network card model for s390 hosts This effectively reverts commit `539d73dbf6` as the changes aren't needed after introduction of the XML post parse callbacks.	2013-04-09 15:47:58 +02:00
Peter Krempa	74ba039f82	qemu: Clean up network device CLI generator With the default model assigned in the parse callback, this code is now obsolete.	2013-04-09 15:47:58 +02:00
Viktor Mihajlovski	d8ddf522a0	qemu: Use correct default model on s390 Commit `a68d672667` breaks networking on s390 as it changes the default network card model.	2013-04-09 15:47:58 +02:00
Daniel P. Berrange	dca927c82f	Rename virCgroupMounted to virCgroupHasController & make it more robust The virCgroupMounted method is badly named, since a controller can be mounted, but disabled in the current object. Rename the method to be virCgroupHasController. Also make it tolerant to a NULL virCgroupPtr and out-of-range controller index, to avoid duplication of these checks in all callers Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-08 14:49:12 +01:00
Osier Yang	70bb34eb2e	qemu: Allow volume type disk for device 'lun' This allows one use block type volume as the disk source for device 'lun'.	2013-04-08 19:10:34 +08:00
Osier Yang	a9762b730b	qemu: Support sgio setting for volume type disk	2013-04-08 19:10:12 +08:00
Osier Yang	464d4e559c	qemu: Support shareable volume type disk Since the source is already translated before. This just adds the checking. Move !disk->shared and !disk->src to improve the performance a bit.	2013-04-08 19:08:47 +08:00
Osier Yang	60b78b33e1	qemu: Translate the pool disk source earlier To support "shareable" for volume type disk, we have to translate the source before trying to add the shared disk entry. To achieve the goal, this moves the helper qemuTranslateDiskSourcePool into src/qemu/qemu_conf.c, and introduce an internal only member (voltype) for struct _virDomainDiskSourcePoolDef, to record the underlying volume type for use when building the drive string. Later patch will support "shareable" volume type disk.	2013-04-08 19:02:34 +08:00
Osier Yang	43404fee37	Support startupPolicy for 'volume' disk "startupPolicy" is only valid for file type storage volume, otherwise it fails on starting the domain.	2013-04-08 18:54:37 +08:00
Osier Yang	db94a1d3a0	qemu: Translate the pool disk source when building drive string This adds a new helper qemuTranslateDiskSourcePool which uses the storage pool/vol APIs to translate the disk source before building the drive string. Network volume is not supported yet. Disk chain for volume type disk may be supported later, but before I'm confident it doesn't break anything, it's just disabled now.	2013-04-08 18:54:17 +08:00
Osier Yang	fd1432c7ae	qemu: Error out if the bitmap for pinning is all clear For both "live" and "config" changes of vcpupin and emulatorpin, an all clear bitmap doesn't make sense, and it can just cause corruptions. E.g (similar for emulatorpin). % virsh vcpupin hame 0 8,^8 --config % virsh vcpupin hame VCPU: CPU Affinity ---------------------------------- 0: 1: 0-63 2: 0-63 3: 0-63 % virsh dumpxml hame \| grep cpuset <vcpupin vcpu='0' cpuset=''/> % virsh start hame error: Failed to start domain hame error: An error occurred, but the cause is unknown	2013-04-06 10:16:59 +08:00
Osier Yang	d4bf0a9378	qemu: Support multiple queue virtio-scsi This introduce a new attribute "num_queues" (same with the good name QEMU uses) for virtio-scsi controller. An example of the XML: <controller type='scsi' index='0' model='virtio-scsi' num_queues='8'/> The corresponding QEMU command line: -device virtio-scsi-pci,id=scsi0,num_queues=8,bus=pci.0,addr=0x3 \	2013-04-06 10:08:47 +08:00
Peter Krempa	ce65b43589	qemu: Remove maximum cpu limit when setting processor count using the API When setting processor count for a domain using the API libvirt enforced a maximum processor count, while it isn't enforced when taking the XML path. This patch removes the check to match the XML.	2013-04-05 15:36:00 +02:00
Daniel P. Berrange	56f27b3bbc	Don't create dirs in cgroup controllers we don't want to use Currently when getting an instance of virCgroupPtr we will create the path in all cgroup controllers. Only at the virt driver layer are we attempting to filter controllers. This is bad because the mere act of creating the dirs in the controllers can have a functional impact on the kernel, particularly for performance. Update the virCgroupForDriver() method to accept a bitmask of controllers to use. Only create dirs in the controllers that are requested. When creating cgroups for domains, respect the active controller list from the parent cgroup Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-05 10:41:54 +01:00
Peter Krempa	482e5f159c	virCaps: get rid of defaultConsoleTargetType callback This patch refactors various places to allow removing of the defaultConsoleTargetType callback from the virCaps structure. A new console character device target type is introduced - VIR_DOMAIN_CHR_CONSOLE_TARGET_TYPE_NONE - to mark that no type was specified in the XML. This type is at the end converted to the standard VIR_DOMAIN_CHR_CONSOLE_TARGET_TYPE_SERIAL. Other types that are different from this default have to be processed separately in the device post parse callback.	2013-04-04 22:42:39 +02:00
Peter Krempa	46becc18ba	virCaps: get rid of macPrefix field Use the virDomainXMLConf structure to hold this data and tweak the code to avoid semantic change. Without configuration the KVM mac prefix is used by default. I chose it as it's in the privately administered segment so it should be usable for any purposes.	2013-04-04 22:42:38 +02:00
Peter Krempa	8960d65674	virCaps: get rid of hasWideScsiBus Use the virDomainXMLConf structure to hold this data.	2013-04-04 22:42:38 +02:00
Peter Krempa	b299084988	virCaps: get rid of defaultDiskDriverType Use the qemu specific callback to fill this data in the qemu driver as it's the only place where it was used and fix tests as the qemu test capability object didn't configure the defaults for the tests.	2013-04-04 22:42:38 +02:00
Peter Krempa	b5def001cc	virCaps: get rid of emulatorRequired This patch removes the emulatorRequired field and associated infrastructure from the virCaps object. Instead the driver specific callbacks are used as this field isn't enforced by all drivers. This patch implements the appropriate callbacks in the qemu and lxc driver and moves to check to that location.	2013-04-04 22:42:38 +02:00
Peter Krempa	9ea249e7d9	virCaps: get rid of defaultDiskDriverName This patch removes the defaultDiskDriverName from the virCaps structure. This particular default value is used only in the qemu driver so this patch uses the recently added callback to fill the driver name if it's needed instead of propagating it through virCaps.	2013-04-04 22:42:38 +02:00
Peter Krempa	a68d672667	qemu: Record the default NIC model in the domain XML This patch implements the devices post parse callback and uses it to fill the default qemu network card model into the XML if none is specified. Libvirt assumes that the network card model for qemu is the "rtl8139". Record this in the XML using the new callback to avoid user confusion.	2013-04-04 22:41:20 +02:00
Peter Krempa	ad0d10b2b1	conf callback: Rearrange function parameters Move the xmlopt and caps arguments to the end of the argument list.	2013-04-04 22:41:19 +02:00
Peter Krempa	43b99fc4c0	conf: Add post XML parse callbacks and prepare for cleaning of virCaps This patch adds instrumentation that will allow hypervisor drivers to fill and validate domain and device definitions after parsed by the XML parser. With this patch, after the XML is parsed, a callback to the driver is issued requesting to fill and validate driver specific details of the configuration. This allows to use sensible defaults and checks on a per driver basis at the time the XML is parsed. Two callback pointers are stored in the new virDomainXMLConf object: * virDomainDeviceDefPostParseCallback (devicesPostParseCallback) - called for a single device parsed and for every single device in a domain config. A virDomainDeviceDefPtr is passed along with the domain definition and virCaps. * virDomainDefPostParseCallback, (domainPostParseCallback) - A callback that is meant to process the domain config after it's parsed. A virDomainDefPtr is passed along with virCaps. Both types of callbacks support arbitrary opaque data passed for the callback functions. Errors may be reported in those callbacks resulting in a XML parsing failure.	2013-04-04 22:29:48 +02:00
Peter Krempa	e84b19316a	maint: Rename xmlconf to xmlopt and virDomainXMLConfig to virDomainXMLOption This patch is the result of running: for i in $(git ls-files \| grep -v html \| grep -v \.po$ ); do sed -i -e "s/virDomainXMLConf/virDomainXMLOption/g" -e "s/xmlconf/xmlopt/g" $i done and a few manual tweaks.	2013-04-04 22:18:56 +02:00
Eric Blake	e52a31d166	qemu: fix memory leak on -machine usage error Commit `f84b92ea` introduced a memory leak on error; John Ferlan reported that valgrind caught it during 'make check'. * src/qemu/qemu_command.c (qemuBuildMachineArgStr): Plug leak.	2013-04-03 11:55:18 -06:00
Peter Krempa	24ca8fae64	qemu-blockjob: Fix limit of bandwidth for block jobs to supported value The JSON generator is able to represent only values less than LLONG_MAX, fix the bandwidth limit checks when converting to value to catch overflows before they reach the generator.	2013-04-03 16:38:51 +02:00
Peter Krempa	43b6f304bc	qemu: Fix crash when updating media with shared device Mimic the fix done in `02b9097274` to fix crash by accessing an already freed structure. Also copy the explaining comment why the pointer can't be accessed any more.	2013-04-02 23:15:00 +02:00
Peter Krempa	6bd94a1b59	Use virMacAddrFormat instead of manual mac address formatting Format the address using the helper instead of having similar code in multiple places. This patch also fixes leak of the MAC address string in ebtablesRemoveForwardAllowIn() and ebtablesAddForwardAllowIn() in src/util/virebtables.c	2013-04-02 15:53:43 +02:00
Li Zhang	f84b92ea19	Optimize machine option to set more options with it Currently, -machine option is used only when dump-guest-core is set. To use options defined in machine option for newer version of QEMU, it needs to use -machine xxx, and to be compatible with older version -M, this patch adds QEMU_CAPS_MACHINE_OPT capability for newer version which supports -machine option. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-04-02 07:02:34 -06:00
Eric Blake	6f7e4ea359	smartcard: spell ccid-card-emulated qemu property correctly Reported by Anthony Messina in https://bugzilla.redhat.com/show_bug.cgi?id=904692 Present since introduction of smartcard support in commit `f5fd9baa` * src/qemu/qemu_command.c (qemuBuildCommandLine): Match qemu spelling. * tests/qemuxml2argvdata/qemuxml2argv-smartcard-host-certificates.args: Fix broken test.	2013-04-02 06:23:33 -06:00
Ján Tomko	f03dcc5df1	qemu: Allow migration over IPv6 Allow migration over IPv6 by listening on [::] instead of 0.0.0.0 when QEMU supports it (QEMU_CAPS_IPV6_MIGRATION) and there is at least one v6 address configured on the system. Use virURIParse in qemuMigrationPrepareDirect to allow parsing IPv6 addresses, which would cause an 'incorrect :port' error message before. Move setting of migrateFrom from qemuMigrationPrepare{Direct,Tunnel} after domain XML parsing, since we need the QEMU binary path from it to get its capabilities. Bug: https://bugzilla.redhat.com/show_bug.cgi?id=846013	2013-04-02 11:23:47 +02:00
John Ferlan	9a80050e52	Resolve valgrind failure Code added by commit id '523207fe8' TEST: qemuxml2argvtest ........................................ 40 ........................................ 80 ........................................ 120 ........................................ 160 ........................................ 200 ........................................ 240 ................................. 273 OK ==30993== 39 bytes in 1 blocks are definitely lost in loss record 33 of 87 ==30993== at 0x4A0887C: malloc (vg_replace_malloc.c:270) ==30993== by 0x41E501: fakeSecretGetValue (qemuxml2argvtest.c:33) ==30993== by 0x427591: qemuBuildDriveURIString (qemu_command.c:2571) ==30993== by 0x42C502: qemuBuildDriveStr (qemu_command.c:2627) ==30993== by 0x4335FC: qemuBuildCommandLine (qemu_command.c:6443) ==30993== by 0x41E8A0: testCompareXMLToArgvHelper (qemuxml2argvtest.c:154 ==30993== by 0x41FE8F: virtTestRun (testutils.c:157) ==30993== by 0x418BE3: mymain (qemuxml2argvtest.c:506) ==30993== by 0x4204CA: virtTestMain (testutils.c:719) ==30993== by 0x38D6821A04: (below main) (in /usr/lib64/libc-2.16.so) ==30993== ==30993== 46 bytes in 1 blocks are definitely lost in loss record 64 of 87 ==30993== at 0x4A0887C: malloc (vg_replace_malloc.c:270) ==30993== by 0x38D690A167: __vasprintf_chk (in /usr/lib64/libc-2.16.so) ==30993== by 0x4CB28E7: virVasprintf (stdio2.h:210) ==30993== by 0x4CB29A3: virAsprintf (virutil.c:2017) ==30993== by 0x4275B4: qemuBuildDriveURIString (qemu_command.c:2580) ==30993== by 0x42C502: qemuBuildDriveStr (qemu_command.c:2627) ==30993== by 0x4335FC: qemuBuildCommandLine (qemu_command.c:6443) ==30993== by 0x41E8A0: testCompareXMLToArgvHelper (qemuxml2argvtest.c:154 ==30993== by 0x41FE8F: virtTestRun (testutils.c:157) ==30993== by 0x418BE3: mymain (qemuxml2argvtest.c:506) ==30993== by 0x4204CA: virtTestMain (testutils.c:719) ==30993== by 0x38D6821A04: (below main) (in /usr/lib64/libc-2.16.so) ==30993== ==30993== 385 (56 direct, 329 indirect) bytes in 1 blocks are definitely los ==30993== at 0x4A06B6F: calloc (vg_replace_malloc.c:593) ==30993== by 0x4C6B2CF: virAllocN (viralloc.c:152) ==30993== by 0x4C9C7EB: virObjectNew (virobject.c:191) ==30993== by 0x4D21810: virGetSecret (datatypes.c:642) ==30993== by 0x41E5D5: fakeSecretLookupByUsage (qemuxml2argvtest.c:51) ==30993== by 0x4D4BEC5: virSecretLookupByUsage (libvirt.c:15295) ==30993== by 0x4276A9: qemuBuildDriveURIString (qemu_command.c:2565) ==30993== by 0x42C502: qemuBuildDriveStr (qemu_command.c:2627) ==30993== by 0x4335FC: qemuBuildCommandLine (qemu_command.c:6443) ==30993== by 0x41E8A0: testCompareXMLToArgvHelper (qemuxml2argvtest.c:154 ==30993== by 0x41FE8F: virtTestRun (testutils.c:157) ==30993== by 0x418BE3: mymain (qemuxml2argvtest.c:506) ==30993== PASS: qemuxml2argvtest Interesting side note is that running the test singularly via 'make -C tests check TESTS=qemuxml2argvtest' didn't trip the valgrind error; however, running during 'make -C tests valgrind' did cause the error to be seen.	2013-04-01 13:13:31 -04:00
Guannan Ren	1cb03d4e4b	qemu:release qemu config object when qemu driver shutdown	2013-03-28 12:07:27 +08:00
Guido Günther	ea2e31fa5b	qemu: Don't set address type too early during virtio disk hotplug `f946462e14` changed behavior by settings VIR_DOMAIN_DEVICE_ADDRESS_TYPE_PCI upfront. If we do so before invoking qemuDomainPCIAddressEnsureAddr we merely try to set the PCI slot via qemuDomainPCIAddressReserveSlot instead reserving a new address via qemuDomainPCIAddressSetNextAddr which fails with $ ~/run-tck-test domain/200-disk-hotplug.t ./scripts/domain/200-disk-hotplug.t .. # Creating a new transient domain ./scripts/domain/200-disk-hotplug.t .. 1/5 # Attaching the new disk /var/lib/jenkins/jobs/libvirt-tck-build/workspace/scratchdir/200-disk-hotplug/extra.img # Failed test 'disk has been attached' # at ./scripts/domain/200-disk-hotplug.t line 67. # died: Sys::Virt::Error (libvirt error code: 1, message: internal error unable to reserve PCI address 0:0:0.0 # )	2013-03-26 18:54:41 +01:00
Michal Privoznik	ceb31795af	qemu: Set migration FD blocking Since we switched from direct host migration scheme to the one, where we connect to the destination and then just pass a FD to a qemu, we have uncovered a qemu bug. Qemu expects migration FD to block. However, we are passing a nonblocking one which results in cryptic error messages like: qemu: warning: error while loading state section id 2 load of migration failed The bug is already known to Qemu folks, but we should workaround already released Qemus. Patch has been originally proposed by Stefan Hajnoczi <stefanha@gmail.com>	2013-03-26 17:16:27 +01:00
Eric Blake	7524cd893e	Revert "qemu: detect multi-head qxl via more than version check" This reverts commit `5ac846e42e`. After further discussions with Alon Levy, I learned the following: The use of '-vga qxl' vs. '-device qxl-vga' is completely orthogonal to whether ram_size can be exposed. Downstream distros are interested in backporting support for multi-head qxl, but this can be done in one of two ways: 1. Support one head per PCI device. If you do this, then it makes sense to have full control over the PCI address of each device. For full control, you need '-device qxl-vga' instead of '-vga qxl'. 2. Support multiple heads through a single PCI device. If you do this, then you need to allocate more RAM to that PCI device (enough ram to cover the multiple screens). Here, the device is hard-coded to 0:0:2.0, both in qemu and libvirt code. Apparently, backporting ram_size changes to allow multiple heads in a single device is much easier than backporting multiple device support. Furthermore, the presence or absence of qxl-vga.surfaces is no different than the presence or absence of qxl-vga.ram_size; both properties can be applied regardless of whether you have one PCI device (-vga qxl) or multiple (-device qxl-vga), so this property is NOT a good witness of whether '-device qxl-vga' support has been backported. Downstream RHEL will NOT be using this patch; and worse, leaving this patch in risks doing the wrong thing if compiling upstream libvirt on RHEL, so the best course of action is to revert it. That means that libvirt will go back to only using '-device qxl-vga' for qemu >= 1.2, but this is just fine because we know of no distros that plan on backporting multiple PCI address support to any older version of qemu. Meanwhile, downstream can still use ram_size to pack multiple heads through a single PCI device.	2013-03-25 08:38:35 -06:00
Paolo Bonzini	9f7a9aee37	qemu: add support for LSI MegaRAID SAS1078 (aka megasas) SCSI controller This does nothing more than adding the new device and capability. The device is present since QEMU 1.2.0. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-03-22 12:11:14 +08:00
Paolo Bonzini	523207fe8c	qemu: pass iscsi authorization credentials A better way to do this would be to use a configuration file like [iscsi "target-name"] user = name password = pwd and pass it via -readconfig. This would remove the username and password from the "ps" output. For now, however, keep this solution. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-03-22 12:10:23 +08:00
Paolo Bonzini	8110a8249d	domain: make port optional for network disks Only sheepdog actually required it in the code, and we can use 7000 as the default---the same value that QEMU uses for the simple "sheepdog:VOLUME" syntax. With this change, the schema can be fixed to allow no port. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-03-22 12:10:23 +08:00
Paolo Bonzini	c820fbff9f	qemu: support passthrough for iscsi disks This enables usage of commands like persistent reservations. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-03-22 12:10:23 +08:00
Paolo Bonzini	1a308ee015	qemu: add support for libiscsi libiscsi provides a userspace iSCSI initiator. The main advantage over the kernel initiator is that it is very easy to provide different initiator names for VMs on the same host. Thus libiscsi supports usage of persistent reservations in the VM, which otherwise would only be possible with NPIV. libiscsi uses "iscsi" as the scheme, not "iscsi+tcp". We can change this in the tests (while remaining backwards-compatible manner, because QEMU uses TCP as the default transport for both Gluster and NBD). Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-03-22 12:10:22 +08:00
Peter Krempa	a584eaa5ff	qemu: Un-mark volume as mirrored/copied if blockjob copy fails When the blockjob fails for some reason an event is emitted but the disk wasn't unmarked as being part of a active block copy operation.	2013-03-21 12:32:03 +01:00
Michal Privoznik	cb86e9d39b	qemu: s/VIR_ERR_NO_SUPPORT/VIR_ERR_OPERATION_UNSUPPORTED The VIR_ERR_NO_SUPPORT error code is reserved for cases where an API is not implemented in a driver. It definitely should not be used when an API execution fails due to unsupported operation.	2013-03-21 09:26:15 +01:00
Osier Yang	65f61e4594	qemu: Add the new disk src into shared disk table when updating disk We should record the new disk src in the shared disk table for updating disk (CD-ROM or Floppy) API. Fortunately, we only allow to update the disk source now, otherwise we might also want to set the unpriv_sgio setting.	2013-03-21 12:20:36 +08:00
Li Zhang	a67aebd699	Clean redundant code about VCPU string checking Now that VCPU number are removed from qemu_monitor_text.c (commit `cc78d7ba`), VCPU string checking also should be removed. Report-by: John Ferlan <jferlan@redhat.com> Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2013-03-20 16:06:20 -06:00
Gao feng	45e9d27ad8	NUMA: cleanup for numa related codes Intend to reduce the redundant code,use virNumaSetupMemoryPolicy to replace virLXCControllerSetupNUMAPolicy and qemuProcessInitNumaMemoryPolicy. This patch also moves the numa related codes to the file virnuma.c and virnuma.h Signed-off-by: Gao feng <gaofeng@cn.fujitsu.com>	2013-03-20 19:37:00 +08:00
Gao feng	763edb5ebe	rename qemuGetNumadAdvice to virNumaGetAutoPlacementAdvice qemuGetNumadAdvice will be used by LXC driver, rename it to virNumaGetAutoPlacementAdvice and move it to virnuma.c Signed-off-by: Gao feng <gaofeng@cn.fujitsu.com>	2013-03-19 15:55:40 -06:00
Olivia Yin	0b3509e245	qemu: add dtb option support The "dtb" option sets the filename for the device tree. If without this option support, "-dtb file" will be converted into <qemu:commandline> in domain XML file. For example, '-dtb /media/ram/test.dtb' will be converted into <qemu:commandline> <qemu:arg value='-dtb'/> <qemu:arg value='/media/ram/test.dtb'/> </qemu:commandline> This is not very friendly. This patchset add special <dtb> tag like <kernel> and <initrd> which is easier for user to write domain XML file. <os> <type arch='ppc' machine='ppce500v2'>hvm</type> <kernel>/media/ram/uImage</kernel> <initrd>/media/ram/ramdisk</initrd> <dtb>/media/ram/test.dtb</dtb> <cmdline>root=/dev/ram rw console=ttyS0,115200</cmdline> </os> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-03-19 15:48:58 -06:00
Jiri Denemark	ef3cd6473f	qemu: Fix startupPolicy regression Commit `82d5fe5437` qemu: check backing chains even when cgroup is omitted added backing file checks just before the code that removes optional disks if they are not present. However, the backing chain code fails in case the disk file does not exist, which makes qemuProcessStart fail regardless on configured startupPolicy. Note that startupPolicy implementation is still wrong after this patch since it only check the first file in a possible chain. It should rather check the complete backing chain. But this is an existing limitation that can be solved later. After all, startupPolicy is most useful for CDROM images and they won't make use of backing files in most cases.	2013-03-18 14:11:58 +01:00
Paolo Bonzini	eebbb232e6	qemu: support URI syntax for NBD QEMU 1.3 and newer support an alternative URI-based syntax to specify the location of an NBD server. Libvirt can keep on using the old syntax in general, but only the URI syntax supports IPv6 addresses. The URI syntax also supports relative paths to Unix sockets. These should never be used but aren't explicitly blocked either by the parser, so support it just in case. The URI syntax is intentionally compatible with Gluster's, and the code can be reused. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-03-15 15:47:50 -06:00
Paolo Bonzini	be2a15dd60	qemu: support NBD with Unix sockets This reuses the XML format that was introduced for Gluster. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-03-15 15:27:56 -06:00
Paolo Bonzini	0aa9f522c4	qemu: support named nbd exports These are supported by nbd-server and by the NBD server that QEMU embeds for live image access. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-03-15 15:12:41 -06:00
Paolo Bonzini	db95213e59	qemu: rewrite NBD command-line builder and parser Move the code to an external function, and structure it to prepare the addition of new features in the next few patches. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-03-15 14:52:43 -06:00
Paolo Bonzini	af9474557e	qemu: do not support non-network disks without -drive QEMU added -drive in 2007, and NBD in 2008. Both appeared first in release 0.10.0. Thus the code to support network disks without -drive is dead, and in fact it incorrectly escapes commas. Drop it. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-03-15 08:34:06 -06:00
Li Zhang	cc78d7ba0e	Remove contiguous CPU indexes assumption When getting CPUs' information, it assumes that CPU indexes are not contiguous. But for ppc64 platform, CPU indexes are not contiguous because SMT is needed to be disabled, so CPU information is not right on ppc64 and vpuinfo, vcpupin can't work corretly. This patch is to remove the assumption to be compatible with ppc64. Test: 4 vcpus are assigned to one VM and execute vcpuinfo command. Without patch: There is only one vcpu informaion can be listed. With patch: All vcpus' information can be listed correctly. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2013-03-15 17:56:17 +08:00
Viktor Mihajlovski	4c1d1497e2	S390: Enable virtio-scsi and virtio-rng Newer versions of QEMU support virtio-scsi and virtio-rng devices on the virtio-s390 and ccw buses. Adding capability detection, address assignment and command line generation for that. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-03-14 15:34:54 -06:00
Viktor Mihajlovski	6c92773256	qemu: Rename virtio-scsi capability QEMU_CAPS_VIRTIO_SCSI_PCI implies that virtio-scsi is only supported for the PCI bus, which is not the case. Remove the _PCI suffix. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-03-14 14:56:11 -06:00
Eric Blake	5ac846e42e	qemu: detect multi-head qxl via more than version check Multi-head QXL support is so useful that distros have started to backport it to qemu earlier than 1.2. After discussion with Alon Levy, we determined that the existence of the qxl-vga.surfaces property is a reliable indicator of whether '-device qxl-vga' works, or whether we have to stick to the older '-vga qxl'. I'm leaving in the existing check for QEMU_CAPS_DEVICE_VIDEO_PRIMARY tied to qemu 1.2 and newer (in case qemu is built without qxl support), but for those distros that backport qxl, this additional capability check will allow the correct command line for both RHEL 6.3 (which lacks the feature) and RHEL 6.4 (where qemu still claims to be version 0.12.2.x, but has backported multi-head qxl). * src/qemu/qemu_capabilities.c (virQEMUCapsObjectPropsQxlVga): New property test. (virQEMUCapsExtractDeviceStr): Probe for backport of new capability to qemu earlier than 1.2. * tests/qemuhelpdata/qemu-kvm-1.2.0-device: Update test. * tests/qemuhelpdata/qemu-1.2.0-device: Likewise. * tests/qemuhelpdata/qemu-kvm-0.12.1.2-rhel62-beta-device: Likewise.	2013-03-14 09:38:20 -06:00
Peter Krempa	32bd699f55	virtio-rng: Add rate limiting options for virtio-RNG Qemu's implementation of virtio RNG supports rate limiting of the entropy used. This patch exposes the option to tune this functionality. This patch is based on qemu commit 904d6f588063fb5ad2b61998acdf1e73fb4 The rate limiting is exported in the XML as: <devices> ... <rng model='virtio'> <rate bytes='123' period='1234'/> <backend model='random'/> </rng> ...	2013-03-14 13:28:10 +01:00
J.B. Joret	f946462e14	S390: Add hotplug support for s390 virtio devices We didn't yet expose the virtio device attach and detach functionality for s390 domains as the device hotplug was very limited with the old virtio-s390 bus. With the CCW bus there's full hotplug support for virtio devices in QEMU, so we are adding this to libvirt too. Since the virtio hotplug isn't limited to PCI anymore, we change the function names from xxxPCIyyy to xxxVirtioyyy, where we handle all three virtio bus types. Signed-off-by: J.B. Joret <jb@linux.vnet.ibm.com> Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-03-13 18:13:09 -06:00
Viktor Mihajlovski	608512b24a	S390: QEMU driver support for CCW addresses This commit adds the QEMU driver support for CCW addresses. The current QEMU only allows virtio devices to be attached to the CCW bus. We named the new capability indicating that support QEMU_CAPS_VIRTIO_CCW accordingly. The fact that CCW devices can only be assigned to domains with a machine type of s390-ccw-virtio requires a few extra checks for machine type in qemu_command.c on top of querying QEMU_CAPS_VIRTIO_{CCW\|S390}. The majority of the new functions deals with CCW address generation and management. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-03-13 17:14:38 -06:00
Michal Privoznik	3b94239ffb	qemu_driver: Try KVM_CAP_MAX_VCPUS only if defined With our recent patch (`1715c83b5f`) we thrive to get the correct number of maximal VCPUs. However, we are using a constant from linux/kvm.h which may be not defined in every distro. Hence, we should guard usage of the constant with ifdef preprocessor directive. This was introduced in kernel: commit 8c3ba334f8588e1d5099f8602cf01897720e0eca Author: Sasha Levin <levinsasha928@gmail.com> Date: Mon Jul 18 17:17:15 2011 +0300 KVM: x86: Raise the hard VCPU count limit The patch raises the hard limit of VCPU count to 254. This will allow developers to easily work on scalability and will allow users to test high VCPU setups easily without patching the kernel. To prevent possible issues with current setups, KVM_CAP_NR_VCPUS now returns the recommended VCPU limit (which is still 64) - this should be a safe value for everybody, while a new KVM_CAP_MAX_VCPUS returns the hard limit which is now 254. $ git desc 8c3ba334f v3.1-rc7-48-g8c3ba33	2013-03-13 14:31:29 +01:00
Peter Krempa	27cf98e2d1	virCaps: conf: start splitting out irrelevat data The virCaps structure gathered a ton of irrelevant data over time that. The original reason is that it was propagated to the XML parser functions. This patch aims to create a new data structure virDomainXMLConf that will contain immutable data that are used by the XML parser. This will allow two things we need: 1) Get rid of the stuff from virCaps 2) Allow us to add callbacks to check and add driver specific stuff after domain XML is parsed. This first attempt removes pointers to private data allocation functions to this new structure and update all callers and function that require them.	2013-03-13 09:27:14 +01:00
Jiri Denemark	57bb725aca	qemu: Avoid NULL dereference in qemuSharedDiskEntryFree At least one caller may call qemuSharedDiskEntryFree with NULL as the first argument. Let's make the function similar to other *Free functions and do nothing in such case.	2013-03-12 09:10:41 +01:00
Peter Krempa	1715c83b5f	qemu: Fix retrieval of maximum number of vCPUs on KVM hosts The detection of the maximum number of cpus used incorrect ioctl argument value. This flaw caused that on kvm hosts this returns always "160" as the maximum. This is just a recommended maximum value. The real value is higher than that. This patch tweaks the detection function to behave as described by the kernel docs: https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/tree/Documentation/virtual/kvm/api.txt?id=refs/tags/v3.9-rc2#n199	2013-03-11 18:01:55 +01:00
Michal Privoznik	5a791c8995	qemuDomainBlockStatsFlags: Guard disk lookup with a domain job When there are two concurrent threads, we may dereference a NULL pointer, even though it has been checked before: 1. Thread1: starts executing qemuDomainBlockStatsFlags() with nparams != 0. It finds given disk and successfully pass check for disk->info.alias not being NULL. 2. Thread2: starts executing qemuDomainDetachDeviceFlags() on the very same disk as Thread1 is working on. 3. Thread1: gets to qemuDomainObjBeginJob() where it sets a job on a domain. 4. Thread2: also tries to set a job. However, we are not guaranteed which thread wins. So assume it's Thread2 who can continue. 5. Thread2: does the actual detach and frees disk->info.alias 6. Thread2: quits the job 7. Thread1: now successfully acquires the job, and accesses a NULL pointer.	2013-03-08 13:09:32 +01:00
Daniel P. Berrange	82793a2a55	Convert QEMU driver to use virLogProbablyLogMessage The current QEMU code for skipping log messages only skips over 'debug' message, switch to virLogProbablyLogMessage to make sure it skips over all of them	2013-03-07 18:56:52 +00:00
Guannan Ren	0047d5d6e8	qemu: update domain live xml for virsh memtune with --live flag virsh subcommand memtune forgot updating domain live xml after setting cgroup value.	2013-03-06 11:46:33 +08:00
Satoru Moriya	464ad16f5c	qemu: fix wrong evaluation in qemuDomainSetMemoryParameters `19c6ad9a` (qemu: Refactor qemuDomainSetMemoryParameters) introduced a new macro, VIR_GET_LIMIT_PARAMETER(PARAM, VALUE). But if statement in the macro is not correct and so set_XXXX flags are set to false in the wrong. As a result, libvirt ignores all memtune parameters. This patch fixes the conditional expression to work correctly. Signed-off-by: Satoru Moriya <satoru.moriya@hds.com>	2013-03-04 18:34:28 +01:00
Peter Krempa	9933a6b2fa	qemu: Remove managed save flag from VM when starting with --force-boot At the start of the guest after the image is unlinked the state wasn't touched up to match the state on disk.	2013-03-04 12:10:28 +01:00
Christophe Fergeau	aff6942c23	qemu: Use -1 as unpriviledged uid/gid Commit `f506a4c1` changed virSetUIDGID() to be a noop when uid/gid are -1, while it used to be a noop when they are <= 0. The changes in this commit broke creating new VMs in GNOME Boxes as qemuDomainCheckDiskPresence gets called during domain creation/startup, which in turn calls virFileAccessibleAs which fails after calling virSetUIDGID(0, 0) (Boxes uses session libvirtd). virSetUIDGID is called with (0, 0) as these are the default user/group values in virQEMUDriverConfig for session libvirtd. This commit changes virQEMUDriverConfigNew to use -1 as the unpriviledged uid/gid. I've also looked at the various places where cfg->user is used, and they all seem to handle -1 correctly.	2013-03-04 08:50:09 +01:00
Daniel P. Berrange	9c4ecb3e8e	Revert hack for autodestroy in qemuProcessStop This reverts the hack done in commit `568a6cda27` Author: Jiri Denemark <jdenemar@redhat.com> Date: Fri Feb 15 15:11:47 2013 +0100 qemu: Avoid deadlock in autodestroy since we now have a fix which avoids the deadlock scenario entirely	2013-03-01 10:18:27 +00:00
Daniel P. Berrange	96b893f092	Fix deadlock in QEMU close callback APIs There is a lock ordering problem in the QEMU close callback APIs. When starting a guest we have a lock on the VM. We then set a autodestroy callback, which acquires a lock on the close callbacks. When running auto-destroy, we obtain a lock on the close callbacks, then run each callbacks - which obtains a lock on the VM. This causes deadlock if anyone tries to start a VM, while autodestroy is taking place. The fix is to do autodestroy in 2 phases. First obtain all the callbacks and remove them from the list under the close callback lock. Then invoke each callback from outside the close callback lock. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-03-01 10:16:29 +00:00
Daniel P. Berrange	7ccad0b16d	Fix crash in QEMU auto-destroy with transient guests When the auto-destroy callback runs it is supposed to return NULL if the virDomainObjPtr is no longer valid. It was not doing this for transient guests, so we tried to virObjectUnlock a mutex which had been freed. This often led to a crash. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-03-01 10:16:29 +00:00
Jiri Denemark	e4e28220b5	qemu: Make sure qemuProcessStart is run within a job qemuProcessStart expects to be run with a job already set and every caller except for qemuMigrationPrepareAny use it correctly. This bug can be observed in libvirtd logs during incoming migration as warning : qemuDomainObjEnterMonitorInternal:979 : This thread seems to be the async job owner; entering monitor without asking for a nested job is dangerous	2013-03-01 08:32:08 +01:00
Serge Hallyn	4f773a8c30	Fix a message typo As pointed out in https://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/1034661 The sentence "The function of PCI device addresses must less than 8" does not quite make sense. Update that to read "The function of PCI device addresses must be less than 8" Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2013-02-28 15:29:10 -07:00
Michal Privoznik	b8e25c35d7	qemu: Don't fail to shutdown domains with unresponsive agent Currently, qemuDomainShutdownFlags() chooses the agent method of shutdown whenever the agent is configured. However, this assumption is not enough as the guest agent may be unresponsive at the moment. So unless guest agent method has been explicitly requested, we should fall back to the ACPI method.	2013-02-28 12:24:34 +01:00
Viktor Mihajlovski	adfa3469bb	qemu: virConnectGetVersion returns bogus value The unitialized local variable qemuVersion can cause an random value to be returned for the hypervisor version, observable with virsh version. Introduced by commit `b46f7f4a0b` Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-02-28 11:48:02 +01:00
Paolo Bonzini	0a562de1ff	qemu: fix use-after-free when parsing NBD disk disk->src is still used for disks->hosts->name, do not free it. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-02-27 22:02:01 -07:00
Daniel P. Berrange	7f544a4c8f	Don't try to add non-existant devices to ACL The QEMU driver has a list of devices nodes that are whitelisted for all guests. The kernel has recently started returning an error if you try to whitelist a device which does not exist. This causes a warning in libvirt logs and an audit error for any missing devices. eg 2013-02-27 16:08:26.515+0000: 29625: warning : virDomainAuditCgroup:451 : success=no virt=kvm resrc=cgroup reason=allow vm="vm031714" uuid=9d8f1de0-44f4-a0b1-7d50-e41ee6cd897b cgroup="/sys/fs/cgroup/devices/libvirt/qemu/vm031714/" class=path path=/dev/kqemu rdev=? acl=rw Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Daniel P. Berrange	d0b3ee55ec	Fix typo in internal VIR_QEMU_PROCESS_START_AUTODESROY constant s/VIR_QEMU_PROCESS_START_AUTODESROY/VIR_QEMU_PROCESS_START_AUTODESTROY/ Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Daniel P. Berrange	279336c5d8	Avoid spamming logs with cgroups warnings The code for putting the emulator threads in a separate cgroup would spam the logs with warnings 2013-02-27 16:08:26.731+0000: 29624: warning : virCgroupMoveTask:887 : no vm cgroup in controller 3 2013-02-27 16:08:26.731+0000: 29624: warning : virCgroupMoveTask:887 : no vm cgroup in controller 4 2013-02-27 16:08:26.732+0000: 29624: warning : virCgroupMoveTask:887 : no vm cgroup in controller 6 This is because it has only created child cgroups for 3 of the controllers, but was trying to move the processes from all the controllers. The fix is to only try to move threads in the controllers we actually created. Also remove the warning and make it return a hard error to avoid such lazy callers in the future. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Daniel P. Berrange	b4a124efc3	Fix autodestroy of QEMU guests The virQEMUCloseCallbacksRunOne method was passing a uuid string to virDomainObjListFindByUUID, when it actually expected to get a raw uuid buffer. This was not caught by the compiler because the method was using a 'void *uuid' instead of first casting it to the expected type. This regression was accidentally caused by refactoring in commit `568a6cda27` Author: Jiri Denemark <jdenemar@redhat.com> Date: Fri Feb 15 15:11:47 2013 +0100 qemu: Avoid deadlock in autodestroy Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Eric Blake	25dc8ba08b	qemu: -numa doesn't (yet) support disjoint range https://bugzilla.redhat.com/show_bug.cgi?id=896092 mentions that qemu 1.4 and earlier only accept a simple start-stop range for the cpu=... argument of -numa. Libvirt would attempt to use -numa cpu=1,3 for a disjoint range, which did not work as intended. Upstream qemu will be adding a new syntax for disjoint cpu ranges in 1.5; but the design for that syntax is still under discussion at the time of this patch. So for libvirt 1.0.3, it is safest to just reject attempts to build an invalid qemu command line; in the future, we can add a capability bit and translate to the final accepted design for selecting a disjoint cpu range in numa. * src/qemu/qemu_command.c (qemuBuildNumaArgStr): Reject disjoint ranges.	2013-02-27 09:31:42 -07:00
Daniel P. Berrange	02b9097274	Fix crash changing CDROM media This change tried to fix a crash with changing CDROM media but failed to actually do so commit `d0172d2b1b` Author: Osier Yang <jyang@redhat.com> Date: Tue Feb 19 20:27:45 2013 +0800 qemu: Remove the shared disk entry if the operation is ejecting or updating It was still accessing disk->src, when the entire 'disk' object has been free'd already. Even if it weren't free'd, accessing the 'src' value of virDomainDiskDef is not allowed without first validating disk->type is file or block. Just remove the broken code entirely. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-26 17:45:31 +00:00
Paolo Bonzini	45dc3f1703	qemu: do not set unpriv_sgio if neither supported nor requested Currently we call virSetDeviceUnprivSGIO with val == 0 if a block device has an sgio attribute. But for sgio='filtered', we know that a kernel with no unpriv_sgio support will always behave as the user wanted. In this case, there is no need to call the function and report a (bogus) error. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-02-26 13:46:52 +01:00
Eric Blake	6abd5ea124	qemu: minor monitor lock cleanups If virCondInit fails (okay, so that's unlikely), then we end up attempting a virObjectUnlock() on the cleanup path, even though we don't hold a lock. This is not guaranteed to be safe. While at it, I noticed a couple places where we were referencing mon->fd outside locks. * src/qemu/qemu_monitor.c (qemuMonitorOpenInternal): Minimize lock duration. mon->watch doesn't need clean up on error. (qemuMonitorGetBlockExtent, qemuMonitorBlockResize): Don't dereference fd outside of lock.	2013-02-25 17:36:51 -07:00
Eric Blake	29424d1acd	qemu: don't override earlier json error I built without yajl support, and noticed a strange failure message in qemumonitorjsontest: 2013-02-22 16:12:37.503+0000: 19812: error : virJSONValueToString:1119 : internal error No JSON parser implementation is available 2013-02-22 16:12:37.503+0000: 19812: error : qemuMonitorJSONCommandWithFd:253 : out of memory While a later patch will fix the test to skip when json is not present, this patch avoids overriding the more useful error message from virJSONValueToString returning NULL. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONCommandWithFd): Don't override message. (qemuMonitorJSONCheckError): Don't print NULL. * src/qemu/qemu_agent.c (qemuAgentCommand): Don't override message. (qemuAgentCheckError): Don't print NULL. (qemuAgentArbitraryCommand): Properly fail on OOM.	2013-02-25 17:36:03 -07:00
Peter Krempa	19c6ad9ac7	qemu: Refactor qemuDomainSetMemoryParameters The new TypedParam helper APIs allow to simplify this function significantly. This patch integrates the fix in `75e5bec97b` by correctly ordering the setting functions instead of reordering the parameters.	2013-02-25 17:24:34 +01:00
Peter Krempa	820019fcdf	qemu: Implement support for EGD backend for virtio-rng This patch adds a new capability bit QEMU_CAPS_OBJECT_RNG_EGD and code to support the egd backend for the VirtIO RNG device. The device is added by 3 qemu command line options: -chardev socket,id=charrng0,host=1.2.3.4,port=1234 (communication backend) -object rng-egd,chardev=charrng0,id=rng0 (RNG protocol client) -device virtio-rng-pci,rng=rng0,bus=pci.0,addr=0x4 (the RNG device)	2013-02-25 10:55:14 +01:00
Peter Krempa	234a55604e	qemu: Implement support for default 'random' backend for virtio-rng This patch implements support for the virtio-rng-pci device and the rng-random backend in qemu. Two capabilities bits are added to track support for those: QEMU_CAPS_DEVICE_VIRTIO_RNG - for the device support and QEMU_CAPS_OBJECT_RNG_RANDOM - for the backend support. qemu is invoked with these additional parameters if the device is enabled: -object rng-random,id=rng0,filename=/test/phile (to add the backend) -device virtio-rng-pci,rng=rng0,bus=pci.0,addr=0x4 (to add the device)	2013-02-25 10:46:19 +01:00
Michal Privoznik	1e54685fc7	qemu_migration: Cancel running jobs on failed migration If a migration fails, we need to stop all block jobs running so qemu doesn't try to send data to destination over and over again.	2013-02-23 08:51:30 +01:00
Michal Privoznik	ae21b9bde6	qemu_migration: Stop NBD server at Finish phase At the end of migration, it is important to stop NBD server and thus release all allocated resources.	2013-02-23 08:42:57 +01:00
Michal Privoznik	7b7600b3e6	qemu_migration: Introduce qemuMigrationDriveMirror This function does the source part of NBD magic. It invokes drive-mirror on each non shared and RW disk with a source and wait till the mirroring process completes. When it does we can proceed with migration. Currently, an active waiting is done: every 500ms libvirt asks qemu if block-job is finished or not. However, once the job finishes, qemu doesn't report its progress so we can only assume if the job finished successfully or not. The better solution would be to listen to the event which is sent as soon as the job finishes. The event does contain the result of job.	2013-02-23 08:42:54 +01:00
Michal Privoznik	86d90b3abd	qemu_migration: Introduce qemuMigrationStartNBDServer() We need to start NBD server and feed it with all non-<shared/>, RW and source-full disks. Moreover, with new virPortAllocator we must ensure the borrowed port for NBD server will be returned if either migration completes or qemu process is torn down.	2013-02-23 08:25:09 +01:00
Michal Privoznik	f1748e34e2	qemu: Introduce nbd-server-stop command This will be used after all migration work is done to stop NBD server running on destination. It doesn't take any arguments, just issues a command.	2013-02-23 08:16:42 +01:00
Michal Privoznik	c833d8111d	qemu: Introduce nbd-server-add command This will be used with new migration scheme. This patch creates basically just monitor stub functions. Wiring them into something useful is done in later patches.	2013-02-23 08:06:37 +01:00
Michal Privoznik	bb6359e8d4	qemu: Introduce nbd-server-start command This will be used with new migration scheme. This patch creates basically just monitor stub functions. Wiring them into something useful is done in later patches.	2013-02-23 07:58:13 +01:00
Michal Privoznik	121d4cfb9a	Introduce NBD migration cookie This migration cookie is meant for two purposes. The first is to be sent in begin phase from source to destination to let it know we support new implementation of VIR_MIGRATE_NON_SHARED_{DISK,INC} so destination can start NBD server. Then, the second purpose is, destination can let us know, on which port the NBD server is running.	2013-02-23 07:49:56 +01:00
Michal Privoznik	e9a6704f99	qemu: Introduce NBD_SERVER capability This just keeps track whether qemu knows nbd-server-* commands so we can use it during migration or not.	2013-02-23 07:33:43 +01:00
Jiri Denemark	492afb8202	qemu: Implement virDomainMigrate*CompressionCache	2013-02-22 17:36:00 +01:00
Jiri Denemark	8def32916d	qemu: Implement virDomainGetJobStats	2013-02-22 17:35:59 +01:00
Jiri Denemark	4121a77c1a	qemu: Parse more fields from query-migrate QMP command As a side effect, this also fixes reporting disk migration process. It was added to memory migration progress, which was wrong. Disk progress has dedicated fields in virDomainJobInfo structure.	2013-02-22 17:35:59 +01:00
Jiri Denemark	94f59b9ece	qemu: Add support for compressed migration	2013-02-22 17:35:58 +01:00
Eric Blake	82d5fe5437	qemu: check backing chains even when cgroup is omitted https://bugzilla.redhat.com/show_bug.cgi?id=896685 points out a regression caused by commit `38c4a9c` - libvirt only labels the backing chain if the backing chain cache is populated, but the code to populate the cache was only conditionally performed if cgroup labeling was necessary. * src/qemu/qemu_cgroup.c (qemuSetupCgroup): Hoist cache setup... * src/qemu/qemu_process.c (qemuProcessStart): ...earlier into caller, where it is now unconditional.	2013-02-21 12:32:56 -07:00
Peter Krempa	db07957646	qemu: Refactor error paths in virQEMUDriverCreateCapabilities Change the error label to "error" and simplify some error paths.	2013-02-21 11:04:34 +01:00
Jiri Denemark	568a6cda27	qemu: Avoid deadlock in autodestroy Since closeCallbacks were turned into virObjectLockable, we can no longer call virQEMUCloseCallbacks APIs from within a registered close callback.	2013-02-21 10:38:28 +01:00
Jiri Denemark	3898ba7f2c	qemu: Turn closeCallbacks into virObjectLockable To avoid having to hold the qemu driver lock while iterating through close callbacks and calling them. This fixes a real deadlock when a domain which is being migrated from another host gets autodestoyed as a result of broken connection to the other host.	2013-02-21 10:27:24 +01:00
Guannan Ren	091831633f	qemu: fix an off-by-one error in qemuDomainGetPercpuStats The max value of number of cpus to compute(id) should not be equal or greater than max cpu number. The bug ocurrs when id value is equal to max cpu number which leads to the off-by-one error in the following for loop. # virsh cpu-stats guest --start 1 error: Failed to virDomainGetCPUStats() error: internal error cpuacct parse error	2013-02-21 11:27:35 +08:00
Osier Yang	5c9034bf05	qemu: Fix the memory leak Found by John Ferlan (coverity script)	2013-02-21 10:33:49 +08:00
John Ferlan	2bff35d5bb	Remove a couple of misplaced VIR_FREE	2013-02-20 12:43:00 -05:00
Michal Privoznik	0eeedf52e7	qemu: Run lzop with '--ignore-warn' Currently, if lzop decompression binary produces a warning, it doesn't exit with zero status but 2 instead. Terrifying, but true. However, warnings may be ignored using '--ignore-warn' command line argument. Moreover, in which case, the exit status will be zero.	2013-02-20 18:10:01 +01:00
Osier Yang	d0172d2b1b	qemu: Remove the shared disk entry if the operation is ejecting or updating For both AttachDevice and UpdateDevice APIs, if the disk device is 'cdrom' or 'floppy', the operations could be ejecting, updating, and inserting. For either ejecting or updating, the shared disk entry of the original disk src has to be removed, because it's not useful anymore. And since the original disk def will be changed, new disk def passed as argument will be free'ed in qemuDomainChangeEjectableMedia, so we need to copy the orignal disk def before qemuDomainChangeEjectableMedia, to use it for qemuRemoveSharedDisk.	2013-02-21 00:31:24 +08:00
Osier Yang	0db7ff59cc	qemu: Move the shared disk adding and sgio setting prior to attaching The disk def could be free'ed by qemuDomainChangeEjectableMedia, which can thus cause crash if we reference the disk pointer. On the other hand, we have to remove the added shared disk entry from the table on error codepath.	2013-02-21 00:31:24 +08:00
Osier Yang	d0e4b76204	qemu: Update shared disk table when reconnecting qemu process	2013-02-21 00:31:24 +08:00
Osier Yang	a4504ac184	qemu: Record names of domain which uses the shared disk in hash table The hash entry is changed from "ref" to {ref, @domains}. With this, the caller can simply call qemuRemoveSharedDisk, without afraid of removing the entry belongs to other domains. qemuProcessStart will obviously benifit from it on error codepath (which calls qemuProcessStop to do the cleanup).	2013-02-21 00:31:24 +08:00
Osier Yang	371df778eb	qemu: Merge qemuCheckSharedDisk into qemuAddSharedDisk Based on moving various checking into qemuAddSharedDisk, this avoids the caller using it in wrong ways. Also this adds two new checking for qemuCheckSharedDisk (disk device not 'lun' and kernel doesn't support unpriv_sgio simply returns 0).	2013-02-21 00:31:24 +08:00
Osier Yang	dab878a861	qemu: Add checking in helpers for sgio setting This moves the various checking into the helpers, to avoid the callers missing the checking.	2013-02-21 00:31:24 +08:00
Jiri Denemark	69660042fb	qemu: Do not ignore mandatory features in migration cookie Due to "feature"/"features" nasty typo, any features marked as mandatory by one side of a migration are silently considered optional by the other side. The following is the code that formats mandatory features in migration cookie: for (i = 0 ; i < QEMU_MIGRATION_COOKIE_FLAG_LAST ; i++) { if (mig->flagsMandatory & (1 << i)) virBufferAsprintf(buf, " <feature name='%s'/>\n", qemuMigrationCookieFlagTypeToString(i)); }	2013-02-20 15:24:01 +01:00
Ján Tomko	bc28e56b35	qemu: switch PCI address alocation to use virDevicePCIAddress Some functions were using virDomainDeviceInfo where virDevicePCIAddress would suffice. Some were only using integers for slots and functions, assuming the bus numbers are always 0. Switch from virDomainDeviceInfoPtr to virDevicePCIAddressPtr: qemuPCIAddressAsString qemuDomainPCIAddressCheckSlot qemuDomainPCIAddressReserveAddr qemuDomainPCIAddressReleaseAddr Switch from int slot to virDevicePCIAddressPtr: qemuDomainPCIAddressReserveSlot qemuDomainPCIAddressReleaseSlot qemuDomainPCIAddressGetNextSlot Deleted functions (they would take the same parameters as ReserveAddr/ReleaseAddr do now.) qemuDomainPCIAddressReserveFunction qemuDomainPCIAddressReleaseFunction	2013-02-20 13:57:59 +01:00
Jiri Denemark	5d6f636764	qemu: Use atomic ops for driver->nactive	2013-02-19 19:11:23 +01:00
Guido Günther	272be1a840	qemu: pass "-1" as uid/gid for unprivileged qemu so we don't try to change uid/git to 0 when probing capabilities.	2013-02-18 12:08:38 -06:00
Doug Goldstein	41046256fe	Add capabilities bit for -no-kvm-pit-reinjection The conversion to qemuCaps dropped the ability with qemu{,-kvm} 1.2 and newer to set the lost tick policy for the PIT. While the -no-kvm-pit-reinjection option is depreacated, it is still supported at least through 1.4, it is better to not lose the functionality.	2013-02-18 12:03:52 -06:00
Laine Stump	0345c7281b	qemu: let virCommand set child process security labels/uid/gid The qemu driver had been calling virSecurityManagerSetProcessLabel() from a "pre-exec hook" function that is run after the child is forked, but before exec'ing qemu. This is problematic because the uid and gid of the child are set by the security driver, but capabilities are dropped by virCommand - such separation doesn't work; the two operations must be done together or the capabilities do not transfer properly to the child process. This patch switches to using virSecurityManagerSetChildProcessLabel(), which is called prior to virCommandRun() (rather than being called during virCommandrun() by the hook function), and doesn't set the UID/GID/security label directly, but instead merely informs virCommand what it should set them all to when the time is appropriate. This lets virCommand choose to do the uid/gid and caps dropping all at the same time if it wants (it does want to, but isn't doing so yet; that's for an upcoming patch).	2013-02-13 16:11:16 -05:00
Laine Stump	6a8ecc373e	qemu: replace exec hook with virCommandSetUID/GID in qemuCaps* Setting the uid/gid of the child process was the only thing done by the hook function in this case, and that can now be done more simply with virCommandSetUID/GID.	2013-02-13 16:11:15 -05:00
Daniel P. Berrange	a9e97e0c30	Remove qemuDriverLock from almost everywhere With the majority of fields in the virQEMUDriverPtr struct now immutable or self-locking, there is no need for practically any methods to be using the QEMU driver lock. Only a handful of helper APIs in qemu_conf.c now need it	2013-02-13 11:10:30 +00:00
Daniel P. Berrange	61b52d2e38	Fix potential deadlock across fork() in QEMU driver The hook scripts used by virCommand must be careful wrt accessing any mutexes that may have been held by other threads in the parent process. With the recent refactoring there are 2 potential flaws lurking, which will become real deadlock bugs once the global QEMU driver lock is removed. Remove use of the QEMU driver lock from the hook function by passing in the 'virQEMUDriverConfigPtr' instance directly. Add functions to the virSecurityManager to be invoked before and after fork, to ensure the mutex is held by the current thread. This allows it to be safely used in the hook script in the child process. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-12 11:05:31 +00:00
Daniel P. Berrange	8cdd5faf46	Pass virQEMUDriverPtr into APIs managed shared disk list Currently the APIs for managing the shared disk list take a virHashTablePtr as the primary argument. This is bad because it requires the caller to deal with locking of the QEMU driver. Switch the APIs to take the full virQEMUDriverPtr instance Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-11 12:48:22 +00:00
Daniel P. Berrange	48b49a631a	Serialize execution of security manager APIs Add locking to virSecurityManagerXXX APIs, so that use of the security drivers is internally serialized. This avoids the need to rely on the global driver locks to achieve serialization Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-11 12:33:44 +00:00
Daniel P. Berrange	11d926659b	Turn virSecurityManager into a virObjectLockable To enable locking to be introduced to the security manager objects later, turn virSecurityManager into a virObjectLockable class Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-11 12:33:41 +00:00
Laine Stump	66d9bc00ab	qemu: support vhost-net for generic ethernet devices From qemu's point of view these are still just tap devices, so there's no reason they shouldn't work with vhost-net; as a matter of fact, Raja Sivaramakrishnan <srajag00@yahoo.com> verified on libvir-list that at least the qemu_command.c part of this patch works: https://www.redhat.com/archives/libvir-list/2012-December/msg01314.html (the hotplug case is extrapolation on my part).	2013-02-08 13:13:55 -05:00
Daniel P. Berrange	020a030786	Stop accessing driver->caps directly in QEMU driver The 'driver->caps' pointer can be changed on the fly. Accessing it currently requires the global driver lock. Isolate this access in a single helper, so a future patch can relax the locking constraints. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-08 11:49:16 +00:00
Daniel P. Berrange	32803ba409	Rename 'qemuCapsXXX' to 'virQEMUCapsXXX' To avoid confusion between 'virCapsPtr' and 'qemuCapsPtr' do some renaming of various fucntions/variables. All instances of 'qemuCapsPtr' are renamed to 'qemuCaps'. To avoid that clashing with the 'qemuCaps' typedef though, rename the latter to virQEMUCaps. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-08 11:49:14 +00:00
Daniel P. Berrange	fed92f08db	Turn virCapabilities into a virObject To enable virCapabilities instances to be reference counted, turn it into a virObject. All cases of virCapabilitiesFree turn into virObjectUnref Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-08 11:34:26 +00:00
Daniel P. Berrange	5b984370f6	Fix comment about virCgroupPtr locking rules in QEMU driver The virCgroupPtr instance APIs are safe to use without locking in the QEMU driver, since all internal state they rely on is immutable. Update the comment to reflect this. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-08 11:34:25 +00:00
Michal Privoznik	0d36f228a4	virCondDestroy: Lose attribute RETURN_CHECK We are wrapping it in ignore_value() anyway.	2013-02-08 09:12:11 +01:00
Michal Privoznik	4ca6f5089f	Drop useless virFileWrapperFdCatchError We are requesting for stderr catching for all cases in virFileWrapperFdNew(). There is no need to have a separate function just to report an error, esp. when we can do it in virFileWrapperFdClose().	2013-02-08 09:11:51 +01:00
John Ferlan	890b6b351f	qemu_command: Resolve resource leaks found by Valgrind The qemuParseGlusterString() replaced dst->src without a VIR_FREE() of what was in there before. The qemuBuildCommandLine() did not properly free the boot_buf depending on various usages. The qemuParseCommandLineDisk() had numerous paths that didn't clean up the virDomainDiskDefPtr def properly. Adjust the logic to go through an error: label before cleanup in order to free the resource.	2013-02-07 14:08:14 -05:00
John Ferlan	75fabbdf3f	qemu_hotplug: Need to call virUSBDeviceFree()	2013-02-05 17:11:06 -05:00
Daniel P. Berrange	0f5e3f136f	Initialize qemuImageBinary path at startup	2013-02-05 19:22:26 +00:00
Daniel P. Berrange	34589575bd	Introduce annotations for virQEMUDriverPtr fields Annotate the fields in virQEMUDriverPtr to indicate the locking rules for their use Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-05 19:22:26 +00:00
Daniel P. Berrange	011cf7ad10	Protect USB/PCI device list access in QEMU with dedicated locks Currently the activePciHostdevs, inactivePciHostdevsd and activeUsbHostdevs lists are all implicitly protected by the QEMU driver lock. Now that the lists all inherit from the virObjectLockable, we can make the locking explicit, removing the dependency on the QEMU driver lock for correctness. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-05 19:22:26 +00:00
Daniel P. Berrange	0f9ef55814	Convert virPCIDeviceList and virUSBDeviceList into virObjectLockable To allow modifications to the lists to be synchronized, convert virPCIDeviceList and virUSBDeviceList into virObjectLockable classes. The locking, however, will not be self-contained. The users of these classes will have to call virObjectLock/Unlock in the critical regions. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-05 19:22:26 +00:00
Daniel P. Berrange	77c3015f9c	Rename all USB device functions to have a standard name prefix Rename all the usbDeviceXXX and usbXXXDevice APIs to have a fixed virUSBDevice name prefix	2013-02-05 19:22:25 +00:00
Daniel P. Berrange	3e86e8f327	Fix leak of usbDevice struct when initializing cgroups When iterating over USB host devices to setup cgroups, the usbDevice object was leaked in both LXC and QEMU driers Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-05 19:22:25 +00:00
Daniel P. Berrange	202535601c	Rename all PCI device functions to have a standard name prefix Rename all the pciDeviceXXX and pciXXXDevice APIs to have a fixed virPCIDevice name prefix	2013-02-05 19:22:25 +00:00
Daniel P. Berrange	b46f7f4a0b	Remove pointless 'qemuVersion' field from virQEMUDriverPtr The QEMU driver struct has a 'qemuVersion' field that was previously used to cache the version lookup from capabilities. With the recent QEMU capabilities rewrite the caching happens at a lower level so this field is pointless. Removing it avoids worries about locking when updating it. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-05 19:22:25 +00:00
Daniel P. Berrange	6ffcab65c9	Use atomic ops to increment nextvmid Use atomic ops to increment nextvmid and encapsulate it in a method to prevent accidental non-atomic access	2013-02-05 19:22:25 +00:00
Daniel P. Berrange	eea87129f1	Merge virDomainObjListIsDuplicate into virDomainObjListAdd The duplicate VM checking should be done atomically with virDomainObjListAdd, so shoud not be a separate function. Instead just use flags to indicate what kind of checks are required. This pair, used in virDomainCreateXML: if (virDomainObjListIsDuplicate(privconn->domains, def, 1) < 0) goto cleanup; if (!(dom = virDomainObjListAdd(privconn->domains, privconn->caps, def, false))) goto cleanup; Changes to if (!(dom = virDomainObjListAdd(privconn->domains, privconn->caps, def, VIR_DOMAIN_OBJ_LIST_ADD_CHECK_LIVE, NULL))) goto cleanup; This pair, used in virDomainRestoreFlags: if (virDomainObjListIsDuplicate(privconn->domains, def, 1) < 0) goto cleanup; if (!(dom = virDomainObjListAdd(privconn->domains, privconn->caps, def, true))) goto cleanup; Changes to if (!(dom = virDomainObjListAdd(privconn->domains, privconn->caps, def, VIR_DOMAIN_OBJ_LIST_ADD_LIVE \| VIR_DOMAIN_OBJ_LIST_ADD_CHECK_LIVE, NULL))) goto cleanup; This pair, used in virDomainDefineXML: if (virDomainObjListIsDuplicate(privconn->domains, def, 0) < 0) goto cleanup; if (!(dom = virDomainObjListAdd(privconn->domains, privconn->caps, def, false))) goto cleanup; Changes to if (!(dom = virDomainObjListAdd(privconn->domains, privconn->caps, def, 0, NULL))) goto cleanup;	2013-02-05 19:22:25 +00:00
Eric Blake	753020dc2c	qemu: don't log failure during QMP add-fd probe Otherwise, we get a lot of scary (but harmless) noise in the logs: 2013-02-05 15:35:48.555+0000: 8637: error : qemuMonitorJSONCheckError:353 : internal error unable to execute QEMU command 'add-fd': Parameter 'fdset-id' expects an existing fdset-id one for every qemu 1.2 binary that we probe. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONAddFd): During probe, avoid logging failures.	2013-02-05 10:46:12 -07:00
Daniel P. Berrange	37abd47165	Turn virDomainObjList into an opaque virObject As a step towards making virDomainObjList thread-safe turn it into an opaque virObject, preventing any direct access to its internals. As part of this a new method virDomainObjListForEach is introduced to replace all existing usage of virHashForEach	2013-02-05 15:49:25 +00:00
Daniel P. Berrange	4f6ed6c33a	Rename all domain list APIs to have virDomainObjList prefix The APIs names for accessing the domain list object are very inconsistent. Rename them all to have a standard virDomainObjList prefix.	2013-02-05 15:49:25 +00:00
Daniel P. Berrange	b090aa7d55	Introduce a virQEMUDriverConfigPtr object Currently the virQEMUDriverPtr struct contains an wide variety of data with varying access needs. Move all the static config data into a dedicated virQEMUDriverConfigPtr object. The only locking requirement is to hold the driver lock, while obtaining an instance of virQEMUDriverConfigPtr. Once a reference is held on the config object, it can be used completely lockless since it is immutable. NB, not all APIs correctly hold the driver lock while getting a reference to the config object in this patch. This is safe for now since the config is never updated on the fly. Later patches will address this fully. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-05 15:49:25 +00:00
Michal Privoznik	137229bf4a	qemu: Catch stderr of image compression binary If a compression binary prints something to stderr, currently it is discarded. However, it can contain useful data from debugging POV, so we should catch it.	2013-02-05 15:45:21 +01:00
Michal Privoznik	cc6c425f94	qemu: Catch stderr of image decompression binary If a decompression binary prints something to stderr, currently it is discarded. However, it can contain useful data from debugging POV, so we should catch it.	2013-02-05 15:45:21 +01:00
Stefan Berger	410b335d23	Add support for QEMU -add-fd support detection Add support for QEMU -add-fd command line parameter detection. This intentionally rejects qemu 1.2, where 'add-fd' QMP did not allow full control of set ids, and where there was no command line counterpart, but accepts qemu 1.3. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-01-31 10:23:28 -07:00
Eric Blake	7b2c5893c2	qemu: expose qemu 1.3 add-fd monitor command Add entry points for calling the qemu 'add-fd' and 'remove-fd' monitor commands. There is no entry point for 'query-fdsets'; the assumption is that a developer can use virsh qemu-monitor-command domain '{"execute":"query-fdsets"}' when debugging issues, and that meanwhile, libvirt is responsible enough to remember what fds it associated with what fdsets. Likewise, on the 'add-fd' command, it is assumed that libvirt will always pass a set id, rather than letting qemu autogenerate the next available id number. * src/qemu/qemu_monitor.c (qemuMonitorAddFd, qemuMonitorRemoveFd): New functions. * src/qemu/qemu_monitor.h (qemuMonitorAddFd, qemuMonitorRemoveFd): New prototypes. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONAddFd) (qemuMonitorJSONRemoveFd): New functions. * src/qemu/qemu_monitor_json.h (qemuMonitorJSONAddFd) (qemuMonitorJSONRemoveFd): New prototypes.	2013-01-31 10:23:28 -07:00
Michal Privoznik	93e5a1432d	qemu: Destroy domain on decompression binary error https://bugzilla.redhat.com/show_bug.cgi?id=894723 Currently, if qemuProcessStart() succeeds, but it's decompression binary that returns nonzero status, we don't kill the qemu process, but remove it from internal domain list, leaving the qemu process hanging around totally uncontrolled.	2013-01-29 09:51:47 +01:00
Michal Privoznik	84c59ffaec	qemu_hotplug: Rework media changing process https://bugzilla.redhat.com/show_bug.cgi?id=892289 It seems like with new udev within guest OS, the tray is locked, so we need to: - 'eject' - wait for tray to open - 'change' Moreover, even when doing bare 'eject', we should check for 'tray_open' as guest may have locked the tray. However, the waiting phase shouldn't be unbounded, so I've chosen 10 retries maximum, each per 500ms. This should give enough time for guest to eject a media and open the tray.	2013-01-27 08:47:48 +01:00
Michal Privoznik	319ed26437	qemu_monitor: Fix tray-open attribute in query-block With our code, we fail to query for tray-open attribute currently. That's because in HMP it is 'tray-open' and in QMP it's 'tray_open'. It always has been. However, we got it exactly the opposite.	2013-01-25 14:39:48 +01:00
Daniel P. Berrange	c29eafc890	Fix bogus reporting of KVM support for non-native emulators A logic bug meant we reported KVM was possible for every architecture, merely based on whether the query-kvm command exists. We should instead have been doing it based on whether the query-kvm command returns 'present: 1' Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-25 10:47:54 +00:00
Daniel P. Berrange	d7a3700ee7	Move QEMU capabilities initialization later in QEMU startup Currently QEMU capabilities are initialized before the QEMU driver sets ownership on its various directories. The upshot is that if you change the user/group in the qemu.conf file, libvirtd will fail to probe QEMU the first time it is run after the config change. Moving QEMU capabilities initialization to after the chown() calls fixes this Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-25 10:41:48 +00:00
Daniel P. Berrange	1b253a102f	Fix performance & reliabilty of QMP probing This previous commit commit `1a50ba2cb0` Author: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com> Date: Mon Nov 26 15:17:13 2012 +0100 qemu: Fix QMP Capabability Probing Failure which attempted to make sure the QEMU process used for probing ran as the right user id, caused serious performance regression and unreliability in probing. The -daemonize switch in QEMU guarantees that the monitor socket is present before the parent process exits. This means libvirtd is guaranteed to be able to connect immediately. By switching from -daemonize to the virCommandDaemonize API libvirtd was no longer synchronized with QEMU's startup process. The result was that the QEMU monitor failed to open and went into its 200ms sleep loop. This happened for all 25 binaries resulting in 5 seconds worth of sleeping at libvirtd startup. In addition sometimes when libvirt connected, QEMU would be partially initialized and crash causing total failure to probe that binary. This commit reverts the previous change, ensuring we do use the -daemonize flag to QEMU. Startup delay is cut from 7 seconds to 2 seconds on my machine, which is on a par with what it was prior to the capabilities rewrite. To deal with the fact that QEMU needs to be able to create the pidfile, we switch pidfile location fron runDir to libDir, which QEMU is guaranteed to be able to write to. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-25 10:41:48 +00:00
Michal Privoznik	2eb54c74ff	qemuDomainSendKey: Relax the qemu driver locking Currently, there is no reason to hold qemu driver locked throughout whole API execution. Moreover, we can use the new qemuDomObjFromDomain() internal API to lookup domain then.	2013-01-25 07:39:19 +01:00
Josh Durgin	c1509ab47e	qemu: escape ipv6 for rbd network disk hosts Hosts for rbd are ceph monitor daemons. These have fixed IP addresses, so they are often referenced by IP rather than hostname for convenience, or to avoid relying on DNS. Using IPv4 addresses as the host name works already, but IPv6 addresses require rbd-specific escaping because the colon is used as an option separator in the string passed to qemu. Escape these colons, and enclose the IPv6 address in square brackets so it is distinguished from the port, which is currently mandatory. Acked-by: Osier Yang <jyang@redhat.com> Signed-off-by: Josh Durgin <josh.durgin@inktank.com>	2013-01-25 11:48:24 +08:00
Eric Blake	339bdd99a1	snapshot: fix state after external snapshot of S3 domain https://bugzilla.redhat.com/show_bug.cgi?id=876829 complains that if a guest is put into S3 state (such as via virsh dompmsuspend) and then an external snapshot is taken, qemu forcefully transitions the domain to paused, but libvirt doesn't reflect that change internally. Thus, a user has to use 'virsh suspend' to get libvirt back in sync with qemu state, and if the user doesn't know this trick, then the guest appears hung. * src/qemu/qemu_driver.c (qemuDomainSnapshotCreateActiveExternal): Track fact that qemu wakes up a suspended domain on migration.	2013-01-24 16:55:55 -07:00
Daniel P. Berrange	bbc663b1c3	Fix crash free'ing securityDriverNames in QEMU driver The previous fix to avoid leaking securityDriverNames forgot to handle the case of securityDriverNames being NULL, leading to a crash Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-24 18:36:37 +00:00
Daniel P. Berrange	d200363ee6	Fix leak of securityDriverNames When shutting down, the QEMU driver forgot to free the securityDriverNames string list Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-24 14:13:26 +00:00
Daniel P. Berrange	4e4c6620e2	Avoid use of free'd memory in auto destroy callback The autodestroy callback code has the following function called from a hash iterator qemuDriverCloseCallbackRun(void payload, const void name, void opaque) { ... char uuidstr = name ... dom = closeDef->cb(data->driver, dom, data->conn); if (dom) virObjectUnlock(dom); virHashRemoveEntry(data->driver->closeCallbacks, uuidstr); } The closeDef->cb function may well cause the current callback to be removed, if it shuts down 'dom'. As such the use of 'uuidstr' in virHashRemoveEntry is accessing free'd memory. We must make a copy of the uuid str before invoking the callback to be safe. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-24 14:13:26 +00:00
Peter Krempa	87b4c10c6c	capabilities: Switch CPU data in NUMA topology to a struct This will allow storing additional topology data in the NUMA topology definition. This patch changes the storage type and fixes fallout of the change across the drivers using it. This patch also changes semantics of adding new NUMA cell information. Until now the data were re-allocated and copied to the topology definition. This patch changes the addition function to steal the pointer to a pre-allocated structure to simplify the code.	2013-01-24 10:53:00 +01:00
Viktor Mihajlovski	053e813a30	S390: Enhance memballoon handling for virtio-s390 The way in that memory balloon suppression was handled for S390 is flawed for a number or reasons. 1. Just preventing the default balloon to be created in the case of VIR_ARCH_S390[X] is not sufficient. An explicit memballoon element in the guest definition will still be honored, resulting both in a -balloon option and the allocation of a PCI bus address, neither being supported. 2. Prohibiting balloon for S390 altogether at a domain_conf level is no good solution either as there's work in progress on the QEMU side to implement a virtio-balloon device, although in conjunction with a new machine type. Suppressing the balloon should therefore be done at the QEMU driver level depending on the present capabilities. Therefore we remove the conditional suppression of the default balloon in domain_conf.c. Further, we are claiming the memballoon device for virtio-s390 during device address assignment to prevent it from being considered as a PCI device. Finally, we suppress the generation of the balloon command line option if this is a virtio-s390 machine. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-01-23 15:08:07 -07:00
Viktor Mihajlovski	7b3a9f754e	qemu: Re-add driver unlock to qemuDomainSendKey Should have been done in commit `56fd513` already, but was missed due to oversight: qemuDomainSendKey didn't release the driver lock in its cleanup section. This fixes an issue introduced by commit `8c5d2ba`. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-01-23 15:01:07 -07:00
Michal Privoznik	d960d06fc0	qemu_agent: Ignore expected EOFs https://bugzilla.redhat.com/show_bug.cgi?id=892079 One of my previous patches (`f2a4e5f176`) tried to fix crashing libvirtd on domain detroy. However, we need to copy pattern from qemuProcessHandleMonitorEOF() instead of decrementing reference counter. The rationale for this is, if qemu process is dying due to domain being destroyed, we obtain EOF on both the monitor and agent sockets. However, if the exit is expected, qemuProcessStop is called, which cleans both agent and monitor sockets up. We want qemuAgentClose() to be called iff the EOF is not expected, so we don't leak an FD and memory. Moreover, there could be race with qemuProcessHandleMonitorEOF() which could have already closed the agent socket, in which case we don't want to do anything.	2013-01-23 15:35:44 +01:00
Alon Levy	55bfd020d8	qemu: Support ram bar size for qxl devices Adds a "ram" attribute globally to the video.model element, that changes the resulting qemu command line only if video.type == "qxl". <video> <model type='qxl' ram='65536' vram='65536' heads='1'/> </video> That attribute gets a default value of 641024. The schema is unchanged for other video element types. The resulting qemu command line change is the addition of -global qxl-vga.ram_size=<ram>1024 or -global qxl.ram_size=<ram>1024 For the main and secondary qxl devices respectively. The default for the qxl ram bar is 641024 kilobytes (the same as the default qxl vram bar size).	2013-01-22 10:40:45 -07:00
John Ferlan	6c2e4c3856	qemu: Add coverity[negative_returns] tag This avoids "Event negative_returns: A negative constant "-1" is passed as an argument to a parameter that cannot be negative.". The called function uses -1 to determine whether it needs to traverse all the hostdevs.	2013-01-22 16:59:45 +01:00
Peter Krempa	f4ece17665	qemu: Forbid snapshot names starting with '.' Forbid the names to match the loading procedure of snapshots.	2013-01-22 11:54:52 +01:00
Peter Krempa	790f912b46	qemu: Reject attempts to create snapshots with names containig '/' The snapshot name is used to create path to the definition save file. When the name contains slashes the creation of the file fails. Reject such names.	2013-01-21 11:48:45 +01:00
Peter Krempa	27054e1217	qemu: Don't return success if creation of snapshot save file fails When the snapshot definition can't be saved, the qemuDomainSnapshotCreate function succeeded without filling some of the fields in the internal definition. This patch removes the snapshot and returns failure if the XML file cannot be written.	2013-01-21 11:48:45 +01:00
Michal Privoznik	31bee8572f	Log flags passed to qemuMigrationPrepare{Tunnel,Direct} APIs We are already logging other arguments passed, however, @flags were missing there.	2013-01-18 18:14:00 +01:00
Daniel P. Berrange	81621f3e6e	Fix race condition when destroying guests When running virDomainDestroy, we need to make sure that no other background thread cleans up the domain while we're doing our work. This can happen if we release the domain object while in the middle of work, because the monitor might detect EOF in this window. For this reason we have a 'beingDestroyed' flag to stop the monitor from doing its normal cleanup. Unfortunately this flag was only being used to protect qemuDomainBeginJob, and not qemuProcessKill This left open a race condition where either libvirtd could crash, or alternatively report bogus error messages about the domain already having been destroyed to the caller Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-18 15:45:38 +00:00
Peter Krempa	5c13ed4f02	qemu: Simplify condition with already extracted flag	2013-01-18 13:19:52 +01:00
John Ferlan	e44d240092	qemu: Check valid activeDev before calling pciDeviceSetUsedBy	2013-01-17 23:46:35 +01:00
Viktor Mihajlovski	56fd513458	qemu: Double mutex unlock in qemuDomainModifyDeviceFlags The driver mutex was unlocked in qemuDomainModifyDeviceFlags before entering qemuDomainObjBeginJobWithDriver where it will be unlocked once more leaving it in an undefined state. The result was that two threads were simultaneously looking up the domain hash table during multiple parallel device attach/detach operations. Luckily this triggered a virHashIterationError. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-01-17 11:57:00 -07:00
Daniel P. Berrange	da5a8aee2b	Avoid integer wrap on remotePortMax in QEMU driver The QEMU driver default max port is 65535, but it then increments this by 1 to 65536. This maps to 0 in an unsigned short :-( This was apparently done so that for() loops could use "< max" instead of "<= max". Remove this insanity and just make the loop do the right thing.	2013-01-17 13:52:33 +00:00
Ján Tomko	31494974c4	qemu: fix QEMU_CAPS_NO_ACPI detection In commit `c4bbaaf8`, caps->arch was checked uninitialized, rendering the whole check useless. This patch moves the conditional setting of QEMU_CAPS_NO_ACPI to qemuCapsInitQMP, and removes the no longer needed exception for S390. It also clears the flag for all non-x86 archs instead of just S390 in qemuCapsInitHelp.	2013-01-16 17:37:04 +01:00
Daniel P. Berrange	dfb1022c72	Convert QEMU driver over to use virPortAllocator APIs Replace the current QEMU driver code for managing port reservations with the new virPortAllocator APIs.	2013-01-16 11:02:58 +00:00
Daniel P. Berrange	325b02b5a3	Convert virDomainObj, qemuAgent, qemuMonitor, lxcMonitor to virObjectLockable The virDomainObj, qemuAgent, qemuMonitor, lxcMonitor classes all require a mutex, so can be switched to use virObjectLockable Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-16 11:02:58 +00:00
Peter Krempa	761fc48136	qemu: Don't update count of vCPUs if hot-unplug has failed After live change of cpu counts, the number of processor threads is verified. This patch makes use of this approach to check if qemu ignored the request for cpu hot-unplug and report an appropriate message.	2013-01-15 23:43:10 +01:00
Daniel P. Berrange	69218922e8	Allow for multi-level inheritance of virObject classes Currently all classes must directly inherit from virObject. This allows for arbitrarily deep hierarchy. There's not much to this aside from chaining up the 'dispose' handlers from each class & providing APIs to check types. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-15 19:21:31 +00:00
Daniel P. Berrange	bccd4a8cbc	Rename HAVE_GNUTLS to WITH_GNUTLS	2013-01-14 13:26:47 +00:00
Daniel P. Berrange	6f736c83e5	Convert HAVE_NUMACTL to WITH_NUMACTL Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-14 13:25:06 +00:00
Peter Krempa	6e1e26e9a7	qemu: Fix grouping of capabilities strings Commit `f8d478b6df` broke the grouping by five items.	2013-01-11 17:43:49 +01:00
Daniel P. Berrange	654c709baa	Convert yajl check to use LIBVIRT_CHECK_LIB_ALT Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-11 11:03:23 +00:00
Daniel P. Berrange	49a1c16027	Convert HAVE_YAJL into WITH_YAJL Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-11 11:03:22 +00:00
Chunyan Liu	66b4693269	pass stub driver name instead of pciFindStubDriver Pass stub driver name directly to pciDettachDevice and pciReAttachDevice to fit for different libvirt drivers. For example, qemu driver prefers pci-stub, but Xen prefers pciback. Signed-off-by: Chunyan Liu <cyliu@suse.com>	2013-01-10 11:30:09 -05:00
Guannan Ren	fc66c1603c	qemu: add usb-net caps flag QEMU_CAPS_DEVICE_USB_NET /* -device usb-net */	2013-01-10 21:56:31 +08:00
Guannan Ren	e3a04455fa	qemu: add usb-serial support Add an optional 'type' attribute to <target> element of serial port device. There are two choices for its value, 'isa-serial' and 'usb-serial'. For backward compatibility, when attribute 'type' is missing the 'isa-serial' will be chosen as before. Libvirt XML sample <serial type='pty'> <target type='usb-serial' port='0'/> <address type='usb' bus='0' port='1'/> </serial> qemu commandline: qemu ${other_vm_args} \ -chardev pty,id=charserial0 \ -device usb-serial,chardev=charserial0,id=serial0,bus=usb.0,port=1	2013-01-10 21:29:20 +08:00
Guannan Ren	f8d478b6df	qemu: add usb-serial caps flag QEMU_CAPS_DEVICE_USB_SERIAL /* -device usb-serial */	2013-01-10 21:26:50 +08:00
Michal Privoznik	f2a4e5f176	qemu_agent: Remove agent reference only when disposing it https://bugzilla.redhat.com/show_bug.cgi?id=892079 With current code, if user calls virDomainPMSuspendForDuration() followed by virDomainDestroy(), the former API checks for qemu agent presence, which will evaluate as true (if agent is configured). While talking to qemu agent, the qemu driver is unlocked, so the latter API starts executing. However, if machine dies meanwhile, libvirtd gets EOF on the agent socket and qemuProcessHandleAgentEOF() is called. The handler clears reference to qemu agent while the destroy API already holding a reference to it. This leads to NULL dereferencing later in the code. Therefore, the agent pointer should be set to NULL only if we are the exclusive owner of it.	2013-01-10 10:32:54 +01:00
Eric Blake	7034531814	maint: fix comment typo While OOM can have knock-on effects that trash a system, generally the first symptom is one of memory thrashing. * src/qemu/qemu_cgroup.c (qemuSetupCgroup): Reword slightly.	2013-01-09 16:45:59 -07:00
Andres Lagar-Cavilla	aedfcce33e	Add RESUME event listener to qemu monitor. Perform all the appropriate plumbing. When qemu/KVM VMs are paused manually through a monitor not-owned by libvirt, libvirt will think of them as "paused" event after they are resumed and effectively running. With this patch the discrepancy goes away. This is meant to address bug 892791. Signed-off-by: Andres Lagar-Cavilla <andres@lagarcavilla.org>	2013-01-09 10:17:40 +01:00
Daniel P. Berrange	f587c27768	Make TLS support conditional Add checks for existence of GNUTLS and automatically disable it if not found. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-08 20:57:31 +00:00
Michal Privoznik	3c83df679e	qemu: Relax hard RSS limit Currently, if there's no hard memory limit defined for a domain, libvirt tries to calculate one, based on domain definition and magic equation and set it upon the domain startup. The rationale behind was, if there's a memory leak or exploit in qemu, we should prevent the host system trashing. However, the equation was too tightening, as it didn't reflect what the kernel counts into the memory used by a process. Since many hosts do have a swap, nobody hasn't noticed anything, because if hard memory limit is reached, process can continue allocating memory on a swap. However, if there is no swap on the host, the process gets killed by OOM killer. In our case, the qemu process it is. To prevent this, we need to relax the hard RSS limit. Moreover, we should reflect more precisely the kernel way of accounting the memory for process. That is, even the kernel caches are counted within the memory used by a process (within cgroups at least). Hence the magic equation has to be changed: limit = 1.5 * (domain memory + total video memory) + (32MB for cache per each disk) + 200MB	2013-01-08 16:32:11 +01:00
J.B. Joret	db2b6861dc	S390: Enable SCLP Console in QEMU driver This is the QEMU backend code for the SCLP console support. It includes SCLP capability detection, QEMU command line generation and a test case. Signed-off-by: J.B. Joret <jb@linux.vnet.ibm.com> Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-01-08 11:37:52 +01:00
Daniel P. Berrange	198c992d26	Speed up fallback to legacy non-QMP probing Since we daemonized QEMU for capabilities probing there is a long time if QEMU fails to launch. This is because we're not passing in any virDomainObjPtr instance and thus the monitor code can not check to see if the PID is still alive. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-01-07 18:13:54 +00:00
Daniel P. Berrange	038cffd831	Only initialize capabilities after setting dir permissions The current code is initializing capabilities before setting directory permissions. Thus the QEMU binaries being run may not have the ability to create the UNIX monitor socket on the first run of libvirtd.	2013-01-07 18:13:39 +00:00
Osier Yang	1279e421b2	qemu: Check if the shared disk's cdbfilter conflicts with others This prevents domain starting and disk attaching if the shared disk's setting conflicts with other active domain(s), E.g. A domain with "sgio" set as "filtered", however, another active domain is using it set as "unfiltered".	2013-01-07 21:39:20 +08:00
Osier Yang	278f87c4b5	qemu: set unpriv_sgio when starting domain and attaching disk This ignores the default "filtered" if unpriv_sgio is not supported by kernel, but for explicit request "filtered", it error out for domain starting.	2013-01-07 21:39:06 +08:00
Osier Yang	d7ead3e19a	qemu: Add a hash table for the shared disks This introduces a hash table for qemu driver, to store the shared disk's info as (@major:minor, @ref_count). @ref_count is the number of domains which shares the disk. Since we only care about if the disk support unprivileged SG_IO commands, and the SG_IO commands only make sense for block disk, this patch only manages (add/remove hash entry) the shared disk for block disk. * src/qemu/qemu_conf.h: (Add member 'sharedDisks' of type virHashTablePtr; Declare helpers qemuGetSharedDiskKey, qemuAddSharedDisk and qemuRemoveSharedDisk) * src/qemu/qemu_conf.c (Implement the 3 helpers) * src/qemu/qemu_process.c (Update 'sharedDisks' when domain starting and shutdown) * src/qemu/qemu_driver.c (Update 'sharedDisks' when attaching or detaching disk).	2013-01-07 21:35:19 +08:00
Peter Krempa	731a5a4df7	snapshot: qemu: Allow redefinition of external snapshots A redefinition of an external inactive snapshot/checkpoint wasn't possible without this change.	2013-01-05 08:40:01 +01:00
Peter Krempa	709b0f37c5	snapshot: qemu: Fix segfault and vanishing snapshots when redefining When the disk alignment check done while redefining an existing snapshot failed, the qemu driver attempted to free the existing snapshot. As in the cleanup path the definition of the snapshot wasn't assigned, the cleanup code dereferenced a NULL pointer. This patch changes the behavior on error paths while redefining snapshot in two ways: 1) On failure, modifications done on the snapshot definition object are rolled back. 2) The previous definition of the data isn't freed until it's certain it won't be needed any more. This change avoids the segfault and additionally the snapshot doesn't vanish if redefinition fails for some reason.	2013-01-05 08:40:01 +01:00
Peter Krempa	4494b11f8f	snapshot: qemu: Separate logic blocks with newlines	2013-01-05 08:40:00 +01:00
John Eckersberg	346e43ecfd	qemu: Implement virDomainOpenChannel API	2013-01-04 19:03:32 -07:00
John Eckersberg	66a0664974	conf: Add unix socket support to virChrdevOpen This also changes the function signature to take a virDomainChrSourceDefPtr instead of just a path, since it needs to differentiate behavior based on source->type.	2013-01-04 18:07:11 -07:00
John Eckersberg	3c971c675a	conf: Rename console-specific identifiers to be more generic The functionality provided in virchrdev.c (previously virconsole.c) is applicable to other types of character devices besides consoles, such as channels. This patch is just code motion, renaming things such as "console" or "pty", instead using more general terms such as "character device" or "device path".	2013-01-04 17:43:21 -07:00
John Eckersberg	4c85421c6c	conf: Rename virconsole.* to virchrdev.* This is just code motion, in preparation to rename identifiers to be less console-specific.	2013-01-04 17:26:30 -07:00
Michal Privoznik	632c60edde	qemu: Detect VGA_QXL capability correctly Since `4c993d8a` we failed to set this important capability, which allows starting a domain with QXL video card. We set DEVICE_QXL capability bit instead, which is not necessary wrong. Anyway, if qemu supports the new '-device qxl' it supports older '-vga qxl' as well. The latter is used for the primary (the first) qxl video card, the former for other video cards.	2013-01-04 15:37:09 +01:00
Ján Tomko	b7a443fcbb	qemu: fix a segfault in qemuProcessWaitForMonitor Commit `b3f2b4ca5c` left buf unallocated in the case of QMP capability probing being used, leading to a segfault in strlen in the cleanup path. This patch opens the log and allocates the buffer if QMP probing was used, so we can display the helpful error message.	2013-01-04 11:00:43 +01:00
Michal Privoznik	b3f2b4ca5c	qemu: Don't parse log output when starting up a domain Despite our great effort we still parsed qemu log output. We wouldn't notice unless upcoming qemu 1.4 changed the format of the logs slightly. Anyway, now we should gather all interesting knobs like pty paths from monitor. Moreover, since for historical reasons the first console can be just an alias to the first serial port, we need to check this and copy the pty path if that's the case to the first console.	2013-01-03 09:56:51 +01:00
Michal Privoznik	fe915278c1	Revert "qemu: Adapt to new log format" This reverts commit `28224c4d2a` which shouldn't be needed at all because with current qemu we obtain all paths from 'query-chardev' output. We ought not parse log output at all anymore.	2013-01-02 11:52:18 +01:00
Michal Privoznik	28224c4d2a	qemu: Adapt to new log format Since 586502189edf9fd0f89a83de96717a2ea826fdb0 qemu commit, the log lines reporting chardev's path has changed from: $ ./x86_64-softmmu/qemu-system-x86_64 -serial pty -serial pty -monitor pty char device redirected to /dev/pts/5 char device redirected to /dev/pts/6 char device redirected to /dev/pts/7 to: $ ./x86_64-softmmu/qemu-system-x86_64 -serial pty -serial pty -monitor pty char device compat_monitor0 redirected to /dev/pts/5 char device serial0 redirected to /dev/pts/6 char device serial1 redirected to /dev/pts/7 However, with current code we are not prepared for such change, which results in us being unable to start any domain.	2012-12-30 12:12:21 +01:00
Michal Privoznik	a14768c9d3	qemu: Convert some APIs to use qemuDomObjFromDomain Many internal qemu APIs must find domain object from passed virDomainPtr. And with function Peter's introduced, we can use it instead of copying multiple lines among code.	2012-12-24 09:34:13 +01:00
Michal Privoznik	8c5d2bad12	qemu: Relax locking in DomainHasManagedSaveImage and DomainMonitorCommand There is no need to hold qemu lock during the whole execution of these two APIs.	2012-12-24 09:34:13 +01:00
Viktor Mihajlovski	fec9822eeb	S390: Re-enable capability probing for virtio devices. Since we switched to QMP probing, the object types are spelled out explicitly, i.e. virtio-net-pci. This has effectively disabled the capability detection of s390 virtio devices. The trivial fix is to add the s390 virtio types explicitly to qemuCapsObjectProps. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-12-21 14:20:28 -07:00
Eric Blake	08230dbd7d	blockjob: fix memleak that prevented block pivot https://bugzilla.redhat.com/show_bug.cgi?id=888426 The code for doing a block-copy was supposed to track the destination file in drive->mirror, but was set up to do all mallocs prior to starting the copy so that OOM wouldn't leave things partially started. However, the wrong variable was being written; later in the code we silently did 'disk->mirror = mirror' which was still NULL, and thus leaking memory and leaving libvirt to think that the mirror job was never started, which prevented a pivot operation after a copy. Problem introduced in commit `35c7701c6`. * src/qemu/qemu_driver.c (qemuDomainBlockCopy): Initialize correct variable.	2012-12-21 12:43:49 -07:00
Daniel P. Berrange	f24404a324	Rename virterror.c virterror_internal.h to virerror.{c,h}	2012-12-21 11:19:50 +00:00
Daniel P. Berrange	556cf5f617	Rename xml.{c,h} to virxml.{c,h}	2012-12-21 11:19:50 +00:00
Daniel P. Berrange	e861b31275	Rename uuid.{c,h} to viruuid.{c,h}	2012-12-21 11:19:49 +00:00
Daniel P. Berrange	44f6ae27fe	Rename util.{c,h} to virutil.{c,h}	2012-12-21 11:19:49 +00:00
Daniel P. Berrange	404174cad3	Rename threads.{c,h} to virthread.{c,h}	2012-12-21 11:19:49 +00:00
Daniel P. Berrange	20463736cc	Rename threadpool.{c,h} to virthreadpool.{c,h}	2012-12-21 11:19:48 +00:00
Daniel P. Berrange	88ba722c12	Rename sysinfo.{c,h} to virsysinfo.{c,h}	2012-12-21 11:19:48 +00:00
Daniel P. Berrange	05dc8398dd	Rename storage_file.{c,h} to virstoragefile.{c,h}	2012-12-21 11:19:48 +00:00
Daniel P. Berrange	fde9df8dcc	Rename stats_linux.{c,h} to virstatslinux.{c,h}	2012-12-21 11:19:48 +00:00
Daniel P. Berrange	f56c773bf8	Merge processinfo.{c,h} into virprocess.{c,h}	2012-12-21 11:19:45 +00:00
Daniel P. Berrange	3ddddd98c3	Rename pci.{c,h} to virpci.{c,h}	2012-12-21 11:17:14 +00:00
Daniel P. Berrange	ab9b7ec2f6	Rename memory.{c,h} to viralloc.{c,h}	2012-12-21 11:17:14 +00:00
Daniel P. Berrange	936d95d347	Rename logging.{c,h} to virlog.{c,h}	2012-12-21 11:17:14 +00:00
Daniel P. Berrange	6a095d0851	Rename json.{c,h} to virjson.{c,h}	2012-12-21 11:17:13 +00:00
Daniel P. Berrange	ebc8db5189	Rename hostusb.{c,h} to virusb.{c,h}	2012-12-21 11:17:13 +00:00
Daniel P. Berrange	30f3a005ff	Rename hooks.{c,h} to virhook.{c,h}	2012-12-21 11:17:13 +00:00
Daniel P. Berrange	f14b5bce73	Rename ebtables.{c,h} to virebtables.{c,h}	2012-12-21 11:17:13 +00:00
Daniel P. Berrange	0f8454101d	Rename conf.{c,h} to virconf.{c,h}	2012-12-21 11:17:13 +00:00
Daniel P. Berrange	04d9510f50	Rename command.{c,h} to vircommand.{c,h}	2012-12-21 11:17:13 +00:00
Daniel P. Berrange	2005f7b552	Rename buf.{c,h} to virbuffer.{c,h} Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-21 11:17:12 +00:00
Daniel P. Berrange	a27e4fbb72	Rename bitmap.{c,h} to virbitmap.{c,h} Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-21 11:17:12 +00:00
Daniel P. Berrange	f9c7020c1f	Rename cgroup.{h,c} to vircgroup.{h,c} To bring in line with new naming practice, rename the= src/util/cgroup.{h,c} files to vircgroup.{h,c} Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-21 11:17:12 +00:00
Li Zhang	da3d40c0eb	Support all backend serial devices for pSeries guest Currently, it only considers PTY backend serial devices for pseries. It need to support all kinds of serial devices. This patch is to fix the problem which is that it doesn't work when specifying source type as file. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2012-12-20 16:19:49 -07:00
Li Zhang	c4bbaaf877	Remove QEMU_CAPS_NO_ACPI capability for non-x86 platform ACPI is only supported on x86 platform, PPC can't support it. So QEMU_CAPS_NO_ACPI shouldn't be set. This patch is to remove QEMU_CAPS_NO_ACPI capability for non-x86 platform. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2012-12-20 16:15:57 -07:00
Daniel P. Berrange	012ff583fe	Change string form of VIR_ARCH_ITANIUM back to ia64 Historically there was an inconsistency in handling of the itanium arch. The xen driver & CPU model code treated it as 'ia64' but the QEMU capabilities code used 'itanium'. On the grounds that no one has ever seriously used itanium with QEMU, while RHEL shipped itanium with Xen, we should favour 'ia64' as the canonical format	2012-12-19 10:56:37 +00:00
Martin Kletzander	b72c97e732	fix typo in the word affinities This patch fixes just the word Affinites to Affinities (it's really painful to search in TAGS without being able to find the right function).	2012-12-19 02:17:38 +01:00
Daniel P. Berrange	aaf1636875	Convert QEMU capabilities code to use virArch Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-18 18:54:50 +00:00
Daniel P. Berrange	1846b80be8	Convert CPU APIs to use virArch Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-18 16:53:03 +00:00
Daniel P. Berrange	c25c18f71b	Convert capabilities / domain_conf to use virArch Convert the host capabilities and domain config structs to use the virArch datatype. Update the parsers and all drivers to take account of datatype change Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-18 16:53:03 +00:00
Daniel P. Berrange	2f4a139a4c	Convert QEMU command line builder to virArch APIs Use virArch APIs to determine host architecture when launching QEMU. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-18 16:53:03 +00:00
Daniel P. Berrange	df5928ea56	Allow passing a vroot into security manager hostdev labelling When LXC labels USB devices during hotplug, it is running in host context, so it needs to pass in a vroot path to the container root. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-17 17:50:51 +00:00
Guannan Ren	aa51202b72	qemu: use newer -device video device in qemu commandline '-device VGA' maps to '-vga std' '-device cirrus-vga' maps to '-vga cirrus' '-device qxl-vga' maps to '-vga qxl' (there is also '-device qxl' for secondary devices) '-device vmware-svga' maps to '-vga vmware' For qemu(>=1.2), we can use -device to replace -vga for video device. For the primary video device, the patch tries to use 0x2 slot for matching old qemu. If the 0x2 slot is allocated already, the addr property could help for using any available slot. For qemu(< 1.2), we keep using -vga for primary device.	2012-12-17 14:02:50 +08:00
Guannan Ren	4c993d8ab5	qemu: add qemu vga devices caps and one cap to mark them usable QEMU_CAPS_DEVICE_QXL -device qxl QEMU_CAPS_DEVICE_VGA -device VGA QEMU_CAPS_DEVICE_CIRRUS_VGA -device cirrus-vga QEMU_CAPS_DEVICE_VMWARE_SVGA -device vmware-svga QEMU_CAPS_DEVICE_VIDEO_PRIMARY /* safe to use -device XXX for primary video device */ Fix a typo in qemuCapsObjectTypes, the string 'qxl' here should be -device qxl rather than -vga [...\|qxl\|..]	2012-12-17 13:55:50 +08:00
Eric Blake	70743daeec	build: minor build fixes for BSD Noticed these while building on FreeBSD. * src/qemu/qemu_monitor.c (qemuMonitorBlockInfoLookup): Rename variable to avoid 'devname' collision. * src/qemu/qemu_driver.c (qemuDomainInterfaceStats): Mark unused variable.	2012-12-14 12:14:52 -07:00
Laine Stump	9cf8734e7c	qemu: don't fail update netdev on bridge detach failure When a network device's bridge connection is changed by virDomainUpdateDevice, libvirt first removes the netdev's tap from its old bridge, then adds it to the new bridge. Sometimes, due to a network being destroyed while a guest device is still attached, the tap may already be "removed" from the old bridge (or the old bridge may not even exist any more); the existing code was needlessly failing the update when this happened, making it impossible to recover from the situation without completely detaching (i.e. removing) the netdev from the guest and re-attaching. Instead of failing the entire operation when removal of the tap from the old bridge fails, this patch changes qemuDomainChangeNetBridge to just log a warning and continue, allowing a reasonable recover from the situation. (you'll appreciate this change if you ever accidentally destroy a network while your guests are still using it).	2012-12-14 07:14:10 -05:00
Daniel P. Berrange	f199f75e9b	Refactor creation of lock manager plugins Refactor virLockManagerPluginNew() so that the caller does not need to pass in the config file path itself - just the config directory and driver name. Fix QEMU to actually pass in a config file when creating the default lock manager plugin, rather than NULL. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-13 15:26:57 +00:00
Daniel P. Berrange	f6bd0a8899	Fix memory leak in QEMU QMP capabilities initialization The qemuCapsInitQMP method never frees the QEMU 'package' version string.	2012-12-13 14:45:53 +00:00
Roman Bogorodskiy	9a2f36ec04	Qemu FreeBSD: fix compilation * Autotools changes: - Don't assume Qemu is Linux-only - Check Linux headers only on Linux - Disable firewalld on FreeBSD * Initctl: Initctl seem to present only on Linux, so stub it on other platforms * Raw I/O: Linux-only as well * Headers cleanup	2012-12-12 11:59:53 -07:00
Roman Bogorodskiy	b467e9323c	Drop mntent.h include. It's no longer used and also causes build fail on FreeBSD.	2012-12-12 11:07:24 -07:00
Peter Krempa	ed0bfd04f8	qemu: Improve error reporting from qemuDomainManagedSaveRemove Report an error if unlink of the managedsave file fails.	2012-12-12 14:34:12 +01:00
Peter Krempa	a02579141e	qemu: Small code cleanups in the managedsave functions Save a few lines moving assignments into conditions and fix braces position.	2012-12-12 14:34:12 +01:00
Peter Krempa	2745177b34	qemu: Refactor managed save functions to use domain lookup helpers	2012-12-12 14:34:12 +01:00
Peter Krempa	7fc06b0480	qemu: Add a new domain lookup helper and improve the docs This patch adds a new domain lookup helper qemuDomObjFromDomainDriver that lookups the domain and leaves the driver locked. The driver is returned as the second argument of that function. If the lookup fails the driver is unlocked to help avoid cleanup codepaths. This patch also improves docs for the helpers.	2012-12-12 14:34:12 +01:00
Serge Hallyn	88bd1a644b	add security hook for permitting hugetlbfs access When a qemu domain is backed by huge pages, apparmor needs to grant the domain rw access to files under the hugetlbfs mount point. Add a hook, called in qemu_process.c, which ends up adding the read-write access through virt-aa-helper. Qemu will be creating a randomly named file under the mountpoint and unlinking it as soon as it has mmap()d it, therefore we cannot predict the full pathname, but for the same reason it is generally safe to provide access to $path/**. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2012-12-11 14:27:20 -07:00
Peter Krempa	08379dbd45	qemu: reuse qemuMigrationIsAllowed when doing save and managedsave Save and managedsave both use migration to file. This patch reuses qemuMigrationIsAllowed to check if the migration could happen before trying.	2012-12-11 19:48:37 +01:00
Peter Krempa	98e92ba83b	qemu: snapshot: Report better error message if migration isn't allowed Qemu doesn't support migration on guests with host devices. This patch adds a check to ensure migration is safe before actually doing so.	2012-12-11 19:48:37 +01:00
Peter Krempa	e5d3ab5e21	qemu: Make qemuMigrationIsAllowed more reusable This patch exports qemuMigrationIsAllowed and adds a new parameter to it to denote if it's a remote migration or a local migration. Local migrations are used in snapshots and saving of the machine state and have fewer restrictions. This patch also adjusts callers of the function and tweaks some error messages to be more universal.	2012-12-11 19:48:37 +01:00
Ján Tomko	6543a459ef	qemu: assume seccomp sandbox is supported since qemu 1.2 Currently there is no way to detect it via QMP and requesting "-sandbox off" works correctly even if it was compiled out, so this will work unless someone both requests the sandbox in qemu.conf and builds QEMU without the support for it.	2012-12-11 18:52:29 +01:00
Michal Privoznik	67159f1c60	bandwidth: Create hierarchical shaping classes These classes can borrow unused bandwidth. Basically, only egress qdsics can have classes, therefore we can do this kind of traffic shaping only on host's outgoing, that is domain's incoming traffic.	2012-12-11 18:36:55 +01:00
Peter Krempa	a912977a65	qemu: snapshot: Remove memory image if external checkpoint fails When the disk snapshot part of an external system checkpoint fails the memory image is retained. This patch adds code to remove the image in such case.	2012-12-11 13:59:14 +01:00
Peter Krempa	d5b2828763	qemu: snapshot: Don't leak XML definition if restarting of CPUs fails In case the snapshot code isn't able to restart CPUs after an external checkpoint we would leak a copy of the domains XML definition. This patch fixes the cleanup path.	2012-12-11 13:48:15 +01:00
Ján Tomko	07b64de505	qemu: fix uninitialized variable warning in doPeer2PeerMigrate False positive, but it breaks the build with gcc-4.6.3. qemu/qemu_migration.c:2931:37: error: 'offline' may be used uninitialized in this function [-Werror=uninitialized] qemu/qemu_migration.c:2887:10: note: 'offline' was declared here	2012-12-11 13:38:22 +01:00
Peter Krempa	46b0c93332	qemu: Restart CPUs with valid async job type when doing external snapshots When restarting CPUs after an external snapshot, the restarting function was called without the appropriate async job type. This caused that a new sync job wasn't created and allowed races in the monitor.	2012-12-11 11:20:53 +01:00
liguang	8b9bf7879b	Add support for offline migration Offline migration transfers inactive definition of a domain (which may or may not be active). After successful completion, the domain remains in its current state on source host and is defined but inactive on destination host. It's a bit more clever than virDomainGetXMLDesc() on source host followed by virDomainDefineXML() on destination host, as offline migration will run pre-migration hook to update the domain XML on destination host. Currently, copying non-shared storage is not supported during offline migration. Offline migration can be requested with a new migration flag called VIR_MIGRATE_OFFLINE (which has to be combined with VIR_MIGRATE_PERSIST_DEST flag).	2012-12-10 21:52:15 +01:00
Laine Stump	e5577872cb	qemu: eliminate bogus error log when changing netdev's bridge This fixes a problem that showed up during testing of: https://bugzilla.redhat.com/show_bug.cgi?id=881480 Due to a logic error in the function that gets the name of the bridge an interface connects to, any time a bridge was specified directly (type='bridge') rather than indirectly (type='network'), An error would be logged (although the operation would then complete successfully): Network type 6 is not supported The final virReportError() in the function qemuDomainNetGetBridgeName() was apparently avoided in the past with a "goto cleanup" at the end of each case, but the case of bridge somehow no longer has that final goto cleanup. The proper solution is anyway to not rely on goto's, but put the error log inside an else {} clause, so that it's executed only if the type is neither bridge nor network (in reality, this function should only ever be called for those two types, that's why this is an internal error). While making this change, the error message was also tuned to be more correct (since it's not really the type of the network, but the type of the interface, and it is otherwise supported, it's just that the interface type in question doesn't have a bridge device associated with it, or at least we don't know how to get it).	2012-12-10 13:17:41 -05:00
Viktor Mihajlovski	539d73dbf6	S390: Assign default model "virtio" for network interfaces If a network interface model is not specified, libvirt will run into an unchecked NULL pointer coredump. On the other hand if the empty model is ignored, a PCI bus address would be generated, which is not supported by S390. Since the only valid network type model for S390 is virtio, we use this as the default value, which is the same for QEMU. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-12-10 14:57:17 +01:00
Cole Robinson	3130541ebf	qemu: capabilities: fix machine name/canonical swappage Things are supposed to look like: <machine canonical='pc-0.12'>pc</machine> But are currently swapped. This can cause many VMs to revert to having machine type='pc' which will affect save/restore across qemu upgrades.	2012-12-07 11:30:34 -05:00
Osier Yang	b718ded39a	qemu: Allow the user to specify vendor and product for disk QEMU supports setting vendor and product strings for disk since 1.2.0 (only scsi-disk, scsi-hd, scsi-cd support it), this patch exposes it with new XML elements <vendor> and <product> of disk device.	2012-12-07 16:53:27 +08:00
Jiri Denemark	6910318798	qemu: Fix memory (and FD) leak on PCI device detach Unmanaged PCI devices were only leaked if pciDeviceListAdd failed but managed devices were always leaked. And leaking PCI device is likely to leave PCI config file descriptor open. This patch fixes qemuReattachPciDevice to either free the PCI device or add it to the inactivePciHostdevs list.	2012-12-05 13:45:34 +01:00
Jiri Denemark	ea1a9b5fdd	qemu: Don't free PCI device if adding it to activePciHostdevs fails The device is still referenced from pcidevs and freeing it would leave an invalid pointer there.	2012-12-05 13:45:34 +01:00
Jiri Denemark	935550c6d3	qemu: Fix error code when attaching existing device An attempt to attach device that is already attached to a domain results in the following error: virsh # attach-device rhel6 pci2 --persistent error: Failed to attach device from pci2 error: invalid argument: device is already in the domain configuration The "invalid argument" error code looks wrong, we usually use "operation invalid" when the action cannot be done in current state.	2012-12-05 13:45:34 +01:00
Osier Yang	9ee809d60c	qemu: Simplify the code "disk" is initialized to "dev->data.disk" in the beginning of the function.	2012-12-05 12:45:10 +08:00
Eric Blake	149fa591c1	qemu: improve error for failed JSON commands Only one error in qemu_monitor was already using the relatively new OPERATION_UNSUPPORTED error, even though it is a better fit for all of the messages related to options that are unsupported due to the version of qemu in use rather than due to a user's XML or .conf file choice. Suggested by Osier Yang. * src/qemu/qemu_monitor.c (qemuMonitorSendFileHandle) (qemuMonitorAddHostNetwork, qemuMonitorRemoveHostNetwork) (qemuMonitorAttachDrive, qemuMonitorDiskSnapshot) (qemuMonitorDriveMirror, qemuMonitorTransaction) (qemuMonitorBlockCommit, qemuMonitorDrivePivot) (qemuMonitorBlockJob, qemuMonitorSystemWakeup) (qemuMonitorGetVersion, qemuMonitorGetMachines) (qemuMonitorGetCPUDefinitions, qemuMonitorGetCommands) (qemuMonitorGetEvents, qemuMonitorGetKVMState) (qemuMonitorGetObjectTypes, qemuMonitorGetObjectProps) (qemuMonitorGetTargetArch): Use better error category.	2012-12-04 15:56:03 -07:00
Eric Blake	3bef4adf73	qemu: nicer error message if live disk snapshot unsupported Without this patch, attempts to create a disk snapshot when qemu is too old results in a cryptic message: virsh # snapshot-create 23 --disk-only error: operation failed: Failed to take snapshot: unknown command: 'snapshot_blkdev' Now it reports: virsh # snapshot-create 23 --disk-only error: unsupported configuration: live disk snapshot not supported with this QEMU binary All versions of qemu that support live disk snapshot also support QMP (basically upstream qemu 1.1 and later, and backports to RHEL 6.2). * src/qemu/qemu_capabilities.h (QEMU_CAPS_DISK_SNAPSHOT): New capability. * src/qemu/qemu_capabilities.c (qemuCaps): Track it. (qemuCapsProbeQMPCommands): Set it. * src/qemu/qemu_driver.c (qemuDomainSnapshotCreateDiskActive): Use it. * src/qemu/qemu_monitor.c (qemuMonitorDiskSnapshot): Simplify. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONDiskSnapshot): Likewise. * src/qemu/qemu_monitor_text.h (qemuMonitorTextDiskSnapshot): Delete. * src/qemu/qemu_monitor_text.c (qemuMonitorTextDiskSnapshot): Likewise.	2012-12-04 15:53:41 -07:00
Daniel P. Berrange	79b8a56995	Replace polling for active VMs with signalling by drivers Currently to deal with auto-shutdown libvirtd must periodically poll all stateful drivers. Thus sucks because it requires acquiring both the driver lock and locks on every single virtual machine. Instead pass in a "inhibit" callback to virStateInitialize which drivers can invoke whenever they want to inhibit shutdown due to existance of active VMs. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-04 12:14:04 +00:00
Daniel P. Berrange	8f9a69317d	Make QEMU perform managed save of all VMs on stop of libvirtd When the virStateStop() method is invoked, perform a managed save of all VMs currently running Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-12-04 12:07:49 +00:00
Laine Stump	258fb278f2	qemu: support live update of an interface's filter Since we can't (currently) rely on the ability to provide blanket support for all possible network changes by calling the toplevel netdev hostside disconnect/connect functions (due to qemu only supporting a lockstep between initialization of host side and guest side of devices), in order to support live change of an interface's nwfilter we need to make a special purpose function to only call the nwfilter teardown and setup functions if the filter for an interface (or its parameters) changes. The pattern is nearly identical to that used to change the bridge that an interface is connected to. This patch was inspired by a request from Guido Winkelmann <guido@sagersystems.de>, who tested an earlier version.	2012-12-03 14:35:58 -05:00
Daniel P. Berrange	dff4a753c4	Move reboot/shutdown flags combination check into QEMU driver The fact that only the guest agent, or ACPI flag can be used when requesting reboot/shutdown is merely a limitation of the QEMU driver impl at this time. Thus it should not be in libvirt.c code Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-11-30 19:18:27 +00:00
Viktor Mihajlovski	3c465728bf	qemu: Fix up the default machine type for QMP probing The default machine type must be stored in the first element of the caps->machineTypes array. This was done for help output parsing but not for QMP probing. Added a helper function qemuSetDefaultMachine to apply the same fix up for both probing methods. Further, it was necessary to set caps->nmachineTypes after QMP probing. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-11-30 11:56:57 -07:00
Guido Günther	d01e427e01	Fix uninitialized variables detecet by http://honk.sigxcpu.org:8001/job/libvirt-build/348/console	2012-11-30 19:12:06 +01:00
Eric Blake	3d7f6649e8	qemu: don't attempt undefined QMP commands https://bugzilla.redhat.com/show_bug.cgi?id=872292 Libvirt should not attempt to call a QMP command that has not been documented in qemu.git - if future qemu introduces a command by the same name but with subtly different semantics, then libvirt will be broken when trying to use that command. We also had some code that could never be reached - some of our commands have an alternate for new vs. old qemu HMP commands; but if we are new enough to support QMP, we only need a fallback to the new HMP counterpart, and don't need to try for a QMP counterpart for the old HMP version. See also this attempt to convert the three snapshot commands to QMP: https://lists.gnu.org/archive/html/qemu-devel/2012-07/msg01597.html although it looks like that will still not happen before qemu 1.3. That thread eventually decided that qemu would use the name 'save-vm' rather than 'savevm', which mitigates the fact that libvirt's attempt to use a QMP 'savevm' would be broken, but we might not be as lucky on the other commands. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONSetCPU) (qemuMonitorJSONAddDrive, qemuMonitorJSONDriveDel) (qemuMonitorJSONCreateSnapshot, qemuMonitorJSONLoadSnapshot) (qemuMonitorJSONDeleteSnapshot): Use only HMP fallback for now. (qemuMonitorJSONAddHostNetwork, qemuMonitorJSONRemoveHostNetwork) (qemuMonitorJSONAttachDrive, qemuMonitorJSONGetGuestDriveAddress): Delete; QMP implies QEMU_CAPS_DEVICE, which prefers AddNetdev, RemoveNetdev, and AddDrive anyways (qemu_hotplug.c has all callers). * src/qemu/qemu_monitor.c (qemuMonitorAddHostNetwork) (qemuMonitorRemoveHostNetwork, qemuMonitorAttachDrive): Reflect deleted commands. * src/qemu/qemu_monitor_json.h (qemuMonitorJSONAddHostNetwork) (qemuMonitorJSONRemoveHostNetwork, qemuMonitorJSONAttachDrive): Likewise.	2012-11-30 09:51:09 -07:00
Eric Blake	ddd103d342	storage: fix scsi detach regression with cgroup ACLs https://bugzilla.redhat.com/show_bug.cgi?id=876828 Commit `38c4a9cc` introduced a regression in hot unplugging of disks from qemu, where cgroup device ACLs were no longer being revoked (thankfully not a security hole: cgroup ACLs only prevent open() of the disk; so reverting the ACL prevents future abuse but doesn't stop abuse from an fd that was already opened before the ACL change). Commit `1b2ebf95` overlooked that there were two spots affected. * src/qemu/qemu_hotplug.c (qemuDomainDetachDiskDevice): Transfer backing chain before deletion. * src/qemu/qemu_driver.c (qemuDomainDetachDeviceDiskLive): Fix spacing (partly to ensure a different-looking patch).	2012-11-30 08:26:34 -07:00
Peter Krempa	6c5c4b8d4d	qemu: Refactor error reporting in qemu driver configuration parser This patch adds two labels and gets rid of a ton of duplicated code. This patch also fixes some error message and switches most of them to proper error reporting functions.	2012-11-29 22:23:16 +01:00
Peter Krempa	7aba113ca7	qemu: Refactor config parameter retrieval This patch adds macros to help retrieve configuration values from qemu driver's configuration. Some configuration options are grouped together in the process.	2012-11-29 21:54:16 +01:00
Daniel P. Berrange	f4ea67f5b3	Turn some dual-state int parameters into booleans The virStateInitialize method and several cgroups methods were using an 'int privileged' parameter or similar for dual-state values. These are better represented with the bool type. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-11-29 16:14:43 +00:00
Jiri Denemark	c0ee3d3b54	qemu: Remove full stop from error messages	2012-11-29 14:16:48 +01:00
Guido Günther	d521119c09	Don't fail hard when we can't connect to the monitor As of `1a50ba2cb0` we fail to connect to the monitor instead of getting an exit status != 0 from qemu itself. This breaks capabilities probing for the non QMP case.	2012-11-29 13:54:44 +01:00
Daniel P. Berrange	b7aba48bca	Rename misc QEMU structs/enums to use normal naming style Replace the following names * struct qemu_snap_remove with virQEMUSnapRemovePtr * struct qemu_snap_reparent with virQEMUSnapReparentPtr * struct qemu_save_header with virQEMUSaveHeaderPtr * enum qemu_save_formats with virQEMUSaveFormat Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-11-28 18:17:31 +00:00
Daniel P. Berrange	4738c2a7e7	Replace 'struct qemud_driver *' with virQEMUDriverPtr Remove the obsolete 'qemud' naming prefix and underscore based type name. Introduce virQEMUDriverPtr as the replacement, in common with LXC driver naming style	2012-11-28 18:17:25 +00:00
Michal Privoznik	4ded3fb1c2	maint: Fix use of invalid reboot flags Throughout the code, we've always used VIR_DOMAIN_SHUTDOWN* flags even for virDomainReboot() API and its implementation. Fortunately, the appropriate macros has the same value. But if we want to keep things consistent, we should be using the correct macros. This patch doesn't break anything, luckily.	2012-11-28 17:45:30 +01:00
Ján Tomko	7794e02c56	util: check for NULL parameter in virFileWrapperFdCatchError This reverts `8927c0e` qemu: fix a crash when save file can't be opened and allows virFileWrapperFdCatchError to be called with NULL instead.	2012-11-29 00:00:39 +08:00
Peter Krempa	d3337028f5	qemu: Fix error messages when dispatching guest agent commands Error messages produced while dispatching guest agent commands didn't have an apparent reference to the fact that they are dealing with guest agent commands. This patch fixes up some of the messages to contain that reference.	2012-11-28 16:36:34 +01:00
Peter Krempa	86727836c2	qemu: Drop word "either" from comments for agent monitor functions	2012-11-28 16:36:34 +01:00
Michal Privoznik	6092fea93a	qemu: Implement virDomainFSTrim using qemu guest agent. As said in previous patch, @mountPoint must be NULL and @flags zero because qemu guest agent doesn't support these arguments yet. If qemu learns them, we can start supporting them as well.	2012-11-28 16:15:01 +01:00
Viktor Mihajlovski	856a482207	qemu: Add QEMU version computation to QMP probing With QMP capability probing, the version was not set. virsh version returns: ... Cannot extract running QEMU hypervisor version This is fixed by computing caps->version from QMP major, minor, micro values. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-11-28 14:54:44 +00:00
Viktor Mihajlovski	1a50ba2cb0	qemu: Fix QMP Capabability Probing Failure QMP Capability probing will fail if QEMU cannot bind to the QMP monitor socket in the qemu_driver->libDir directory. That's because the child process is stripped of all capabilities and this directory is chown'ed to the configured QEMU user/group (normally qemu:qemu) by the QEMU driver. To prevent this from happening, the driver startup will now pass the QEMU uid and gid down to the capability probing code. All capability probing invocations of QEMU will be run with the configured QEMU uid instead of libvirtd's. Furter, the pid file handling is moved to libvirt, as QEMU cannot write to the qemu_driver->runDir (root:root). This also means that the libvirt daemonizing must be used. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-11-28 14:54:29 +00:00
Viktor Mihajlovski	7a95eccc81	qemu: Wait for monitor socket even without pid If qemuMonitorOpenUnix is called without a related pid, i.e. for QMP probing, a connect failure can happen as the result of a race. Without a pid there is no retry and thus we give up too early. This changes the code to retry if no pid is supplied. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-11-28 14:54:21 +00:00
Ján Tomko	8927c0eab6	qemu: fix a crash when save file can't be opened In qemuDomainSaveMemory, wrapperFd might be NULL and should be checked before calling virFileWrapperFdCatchError. Same in doCoreDump. Bug: https://bugzilla.redhat.com/show_bug.cgi?id=880919	2012-11-28 10:24:31 +01:00
Daniel P. Berrange	7492276317	s/qemud/qemu/ in QEMU driver sources Change some legacy function names to use 'qemu' as their prefix instead of 'qemud' which was a hang over from when the QEMU driver ran inside a separate daemon Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-11-27 19:36:36 +00:00
Eric Blake	1b2ebf9502	storage: fix device detach regression with cgroup ACLs https://bugzilla.redhat.com/show_bug.cgi?id=876828 Commit `38c4a9cc` introduced a regression in hot unplugging of disks from qemu, where cgroup device ACLs were no longer being revoked (thankfully not a security hole: cgroup ACLs only prevent open() of the disk; so reverting the ACL prevents future abuse but doesn't stop abuse from an fd that was already opened before the ACL change). The actual regression is due to a latent bug. The hot unplug code was computing the set of files needing cgroup ACL revocation based on the XML passed in by the user, rather than based on the domain's details on which disk was being deleted. As long as the revoke path was always recomputing the backing chain, this didn't really matter; but now that we want to compute the chain exactly once and remember that computation, we need to hang on to the backing chain until after the revoke has happened. * src/qemu/qemu_hotplug.c (qemuDomainDetachPciDiskDevice): Transfer backing chain before deletion.	2012-11-27 08:02:26 -07:00
Harsh Prateek Bora	c33c36d28f	qemu: Add support for gluster protocol based network storage backend. Qemu accepts gluster protocol as supported storage backend beside others. Signed-off-by: Harsh Prateek Bora <harsh@linux.vnet.ibm.com>	2012-11-27 10:19:22 +01:00
Harsh Prateek Bora	a2d2b80fbd	Add Gluster protocol as supported network disk backend This patch introduces the RNG schema and updates necessary data strucutures to allow various hypervisors to make use of Gluster protocol as one of the supported network disk backend. Next patch will add support to make use of this feature in Qemu since it now supports Gluster protocol as one of the network based storage backend. Two new optional attributes for <host> element are introduced - 'transport' and 'socket'. Valid transport values are tcp, unix or rdma. If none specified, tcp is assumed. If transport is unix, socket specifies path to unix socket. This patch allows users to specify disks on gluster backends like this: <disk type='network' device='disk'> <driver name='qemu' type='raw'/> <source protocol='gluster' name='Volume1/image'> <host name='example.org' port='6000' transport='tcp'/> </source> <target dev='vda' bus='virtio'/> </disk> <disk type='network' device='disk'> <driver name='qemu' type='raw'/> <source protocol='gluster' name='Volume2/image'> <host transport='unix' socket='/path/to/sock'/> </source> <target dev='vdb' bus='virtio'/> </disk> Signed-off-by: Harsh Prateek Bora <harsh@linux.vnet.ibm.com>	2012-11-27 10:19:22 +01:00
Eric Blake	7e5aa78d0f	build: avoid C99 for loop Although we require various C99 features, we don't yet require a complete C99 compiler. On RHEL 5, compilation complained: qemu/qemu_command.c: In function 'qemuBuildGraphicsCommandLine': qemu/qemu_command.c:4688: error: 'for' loop initial declaration used outside C99 mode * src/qemu/qemu_command.c (qemuBuildGraphicsCommandLine): Declare variable sooner. * src/qemu/qemu_process.c (qemuProcessInitPasswords): Likewise.	2012-11-26 15:28:25 -07:00
Martin Kletzander	03cd6e4ae8	conf: Report sensible error for invalid disk name The error "... but the cause is unknown" appeared for XMLs similar to this: <disk type='file' device='cdrom'> <driver name='qemu' type='raw'/> <source file='/dev/zero'/> <target dev='sr0'/> </disk> Notice unsupported disk type (for the driver), but also no address specified. The first part is not a problem and we should not abort immediately because of that, but the combination with the address unknown was causing an unspecified error. While fixing this, I added an error to one place where this return value was not managed properly.	2012-11-22 15:23:40 +01:00
Scott Sullivan	f0e72b2f5c	qemu: fix RBD attach regression I have been testing libvirt v1.0.0 for deployment within my organization, and in the process discovered what appears to be a bug that breaks virsh attach-device, when attaching an RBD volume to an instance. First, here is the error presented, with v1.0.0 (this worked in v0.10.2): [root@host ~]# virsh attach-device W5APQ8 G84VV1.xml error: Failed to attach device from G84VV1.xml error: cannot open file 'dc3-1-test/G84VV1': No such file or directory Using git bisect, I narrowed the problem down to this as the first commit to break this setup: `4d34c92947` is the first bad commit	2012-11-21 12:33:23 -07:00
Alon Levy	283aafdb29	qemu/qemu_command.c: fix indent of label	2012-11-20 19:57:39 +01:00
Alon Levy	37b415200d	qemu: graphics support for simultaneous one of each sdl, vnc, spice	2012-11-20 19:57:39 +01:00
Alon Levy	23e8b5d8e7	qemu: refactor graphics code to not hardcode a single display The check for a single display remains so no new functionality is added.	2012-11-20 19:57:39 +01:00
Eric Blake	0b5617a607	snapshot: make cloning of domain definition easier Upcoming patches for revert-and-clone branching of snapshots need to be able to copy a domain definition; make this step reusable. * src/conf/domain_conf.h (virDomainDefCopy): New prototype. * src/conf/domain_conf.c (virDomainObjCopyPersistentDef): Split... (virDomainDefCopy): ...into new function. (virDomainObjSetDefTransient): Use it. * src/libvirt_private.syms (domain_conf.h): Export it. * src/qemu/qemu_driver.c (qemuDomainRevertToSnapshot): Use it.	2012-11-20 08:41:45 -07:00
liguang	63158d586b	qemu: Beautify code indent in migration codes Signed-off-by: liguang <lig.fnst@cn.fujitsu.com>	2012-11-16 16:42:09 +08:00
Viktor Mihajlovski	a2b3d7cff8	qemu, lxc: Change host CPU number detection logic. The drivers for QEMU and LXC use virNodeGetInfo only to determine the number of host CPUs. On Linux hosts nodeGetCPUCount has less overhead.	2012-11-15 08:48:19 -07:00
Ján Tomko	a4c19459aa	qemu: add bootindex for usb-host and usb-redir devices Allow bootindex to be specified for redirected USB devices and host USB devices. Bug: https://bugzilla.redhat.com/show_bug.cgi?id=805414	2012-11-14 19:03:18 -07:00
Michal Privoznik	9f87247235	qemu: Don't force port=0 for SPICE If domain uses only TLS port we don't want to add 'port=0' explicitly to command line.	2012-11-14 10:07:27 +01:00
Peter Krempa	30f1bccf33	snapshot: qemu: Fix detection of external snapshots when deleting This patch adds a helper to determine if snapshots are external and uses the helper to fix detection of those in snapshot deletion code. Snapshots are external if they have an external memory image or if the disk locations are external. As mixed snapshots are forbidden for now we need to check just one disk to know.	2012-11-13 20:36:26 +01:00
Michal Privoznik	ab5e7d4977	qemu: Allow migration to be cancelled at prepare phase Currently, if user calls virDomainAbortJob we just issue 'migrate_cancel' and hope for the best. However, if user calls the API in wrong phase when migration hasn't been started yet (perform phase) the cancel request is just ignored. With this patch, the request is remembered and as soon as perform phase starts, migration is cancelled.	2012-11-12 10:39:39 +01:00
Viktor Mihajlovski	b1c88c1476	capabilities: defaultConsoleTargetType can depend on architecture For S390, the default console target type cannot be of type 'serial'. It is necessary to at least interpret the 'arch' attribute value of the os/type element to produce the correct default type. Therefore we need to extend the signature of defaultConsoleTargetType to account for architecture. As a consequence all the drivers supporting this capability function must be updated. Despite the amount of changed files, the only change in behavior is that for S390 the default console target type will be 'virtio'. N.B.: A more future-proof approach could be to to use hypervisor specific capabilities to determine the best possible console type. For instance one could add an opaque private data pointer to the virCaps structure (in case of QEMU to hold capsCache) which could then be passed to the defaultConsoleTargetType callback to determine the console target type. Seems to be however a bit overengineered for the use case... Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-11-09 09:20:59 -07:00
Peter Krempa	02cf57c0d0	qemu: Fix domain ID numbering race condition When the libvirt daemon is restarted it tries to reconnect to running qemu domains. Since commit `d38897a5d4` the re-connection code runs in separate threads. In the original implementation the maximum of domain ID's (that is used as an initializer for numbering guests created next) while libvirt was reconnecting to the guest. With the threaded implementation this opens a possibility for race conditions with the thread that is autostarting guests. When there's a guest running with id 1 and the daemon is restarted. The autostart code is reached first and spawns the first guest that should be autostarted as id 1. This results into the following unwanted situation: # virsh list Id Name State ---------------------------------------------------- 1 guest1 running 1 guest2 running This patch extracts the detection code before the re-connection threads are started so that the maximum id of the guests being reconnected to is known. The only semantic change created by this is if the guest with greatest ID quits before we are able to reconnect it's ID is used anyway as the greatest one as without this patch the greatest ID of a process we could successfuly reconnect to would be used.	2012-11-09 00:12:38 +01:00
Peter Krempa	e124f49890	qemu: Fix function header formating of 2 functions Headers of qemuDomainSnapshotLoad and qemuDomainNetsRestart were improperly formatted.	2012-11-08 13:45:45 +01:00
Peter Krempa	9b5a514b31	snapshot: qemu: Add support for external inactive snapshots This patch adds support for external disk snapshots of inactive domains. The snapshot is created by calling using qemu-img by calling: qemu-img create -f format_of_snapshot -o backing_file=/path/to/src,backing_fmt=format_of_backing_image /path/to/snapshot in case the backing image format is known or probing is allowed and otherwise: qemu-img create -f format_of_snapshot -o backing_file=/path/to/src /path/to/snapshot on each of the disks selected for snapshotting. This patch also modifies the snapshot preparing function to support creating external snapshots and to sanitize arguments. For now the user isn't able to mix external and internal snapshots but this restriction might be lifted in the future.	2012-11-08 11:27:34 +01:00
Michal Privoznik	a08fc66d90	qemu: Emit event if 'cont' fails Some operations, APIs needs domain to be paused prior operation can be performed, e.g. (managed-) save of a domain. The processors should be restored in the end. However, if 'cont' fails for some reason, we log a message but this is not sufficient as an event should be emitted as well. Mgmt application can then decide what to do.	2012-11-07 12:06:09 +01:00
Peter Krempa	fb58f8e2a4	qemu: Don't corrupt pointer in qemuDomainSaveMemory() The code that was split out into the qemuDomainSaveMemory expands the pointer containing the XML description of the domain that it gets from higher layers. If the pointer changes the old one is invalid and the upper layer function tries to free it causing an abort. This patch changes the expansion of the original string to a new allocation and copy of the contents.	2012-11-06 14:45:27 +01:00
Michal Privoznik	0f720ab35a	qemu: Add controllers in specified order qemu is sensitive to the order of arguments passed. Hence, if a device requires a controller, the controller cmd string must precede device cmd string. The same apply for controllers, when for instance ccid controller requires usb controller. So controllers create partial ordering in which they should be added to qemu cmd line.	2012-11-06 10:11:34 +01:00
Michal Privoznik	77b93dbc3e	qemu: Wrap controllers code into dummy loop which just re-indent code and prepare it for next patch.	2012-11-06 10:11:34 +01:00
Peter Krempa	0dac29d89f	snapshot: qemu: Remove restrictions preventing external checkpoints Some of the pre-snapshot check have restrictions wired in regarding configuration options that influence taking of external checkpoints. This patch removes restrictions that would inhibit taking of such a snapshot.	2012-11-04 20:17:57 +01:00
Peter Krempa	f569b87f51	snapshot: qemu: Add support for external checkpoints This patch adds support to take external system checkpoints. The functionality is layered on top of the previous disk-only snapshot code. When the checkpoint is requested the domain memory is saved to the memory image file using migration to file. (The user may specify to take the memory image while the guest is live with the VIR_DOMAIN_SNAPSHOT_CREATE_LIVE flag.) The memory save image shares format with the image created by virDomainSave() API.	2012-11-04 16:53:32 +01:00
Peter Krempa	b5fd404471	snapshot: qemu: Rename qemuDomainSnapshotCreateActive Before now, libvirt supported only internal snapshots for active guests. This patch renames this function to qemuDomainSnapshotCreateActiveInternal to prepare the grounds for external active snapshots.	2012-11-03 15:06:09 +01:00
Peter Krempa	2a59a3d597	snapshot: qemu: Add async job type for snapshots The new external system checkpoints will require an async job while the snapshot is taken. This patch adds QEMU_ASYNC_JOB_SNAPSHOT to track this job type.	2012-11-03 14:57:43 +01:00
Peter Krempa	2771f8b74c	qemu: Split out domain memory saving code to allow reuse The code that saves domain memory by migration to file can be reused while doing external checkpoints of a machine. This patch extracts the common code and places it in a separate function.	2012-11-03 11:49:41 +01:00
Peter Krempa	ec69ca14f9	qemu: Clean up snapshot retrieval to use the new helper Two other places were left with the old code to look up snapshots. Change them to use the snapshot lookup helper.	2012-11-03 11:26:39 +01:00
Peter Krempa	d0fc6dc831	qemu: Fix possible race when pausing guest When pausing the guest while migration is running (to speed up convergence) the virDomainSuspend API checks if the migration job is active before entering the job. This could cause a possible race if the virDomainSuspend is called while the job is active but ends before the Suspend API enters the job (this would require that the migration is aborted). This would cause a incorrect event to be emitted.	2012-11-02 20:18:46 +01:00
Eric Blake	de76cae971	snapshot: merge pre-snapshot checks Both system checkpoint snapshots and disk snapshots were iterating over all disks, doing a final sanity check before doing any work. But since future patches will allow offline snapshots to be either external or internal, it makes sense to share the pass over all disks, and then relax restrictions in that pass as new modes are implemented. Future patches can then handle external disks when the domain is offline, then handle offline --disk-snapshot, and finally, combine with migration to file to gain a complete external system checkpoint snapshot of an active domain without using 'savevm'. * src/qemu/qemu_driver.c (qemuDomainSnapshotDiskPrepare) (qemuDomainSnapshotIsAllowed): Merge... (qemuDomainSnapshotPrepare): ...into one function. (qemuDomainSnapshotCreateXML): Update caller.	2012-11-02 10:19:03 -06:00
Eric Blake	e260e401a5	snapshot: populate new XML info for qemu snapshots Now that the XML supports listing internal snapshots, it is worth always populating the <memory> and <disks> element to match. * src/qemu/qemu_driver.c (qemuDomainSnapshotCreateXML): Always parse disk info and set memory info.	2012-11-02 10:11:50 -06:00
Daniel P. Berrange	1c04f99970	Remove spurious whitespace between function name & open brackets The libvirt coding standard is to use 'function(...args...)' instead of 'function (...args...)'. A non-trivial number of places did not follow this rule and are fixed in this patch. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-11-02 13:36:49 +00:00
Guannan Ren	1851a0c864	qemu: use default machine type if missing it in qemu command line BZ:https://bugzilla.redhat.com/show_bug.cgi?id=871273 when using virsh qemu-attach to attach an existing qemu process, if it misses the -M option in qemu command line, libvirtd crashed because the NULL value of def->os.machine in later use. Example: /usr/libexec/qemu-kvm -name foo \ -cdrom /var/lib/libvirt/images/boot.img \ -monitor unix:/tmp/demo,server,nowait \ error: End of file while reading data: Input/output error error: Failed to reconnect to the hypervisor This patch tries to set default machine type if the value of def->os.machine is still NULL after qemu command line parsing.	2012-11-02 12:55:29 +08:00
Doug Goldstein	ba804d9fd1	qemu: QMP capabilities support starts with 1.2 Per the code comment in qemuCapsInitQMPBasic() and commit `43e23c7`, we should only use QMP for capabilities probing starting with 1.2 and newer. The old code had dead logic that probed on 1.0 and newer. Signed-off-by: Eric Blake <eblake@redhat.com>	2012-11-01 17:50:02 -06:00
Stefan Hajnoczi	23d47b33a2	qemu: Fix name comparison in qemuMonitorJSONBlockIoThrottleInfo() The string comparison logic was inverted and matched the first drive that does not have the name we search for. Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>	2012-11-01 13:23:27 -06:00
Stefan Hajnoczi	04ee70bfda	qemu: Keep QEMU host drive prefix in BlkIoTune The QEMU -drive id= begins with libvirt's QEMU host drive prefix ("drive-"), which is stripped off in several places two convert between host ("-drive") and guest ("-device") device names. In the case of BlkIoTune it is unnecessary to strip the QEMU host drive prefix because we operate on "info block"/"query-block" output that uses host drive names. Stripping the prefix incorrectly caused string comparisons to fail since we were comparing the guest device name against the host device name. Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>	2012-11-01 13:03:26 -06:00
Daniel P. Berrange	6fea88a119	Fix arch detection for qemu-system-i386 with QMP QEMU uses 'i386' for its 32-bit x86 architecture, but libvirt wants that to be 'i686', so we must fix it up Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2012-11-01 09:16:37 +00:00
Eric Blake	4dbd6e9654	build: prefer mkostemp for multi-thread safety https://bugzilla.redhat.com/show_bug.cgi?id=871756 Commit `cd1e8d1` assumed that systems new enough to have journald also have mkostemp; but this is not true for uclibc. For that matter, use of mkstemp[s] is unsafe in a multi-threaded program. We should prefer mkostemp[s] in the first place. * bootstrap.conf (gnulib_modules): Add mkostemp, mkostemps; drop mkstemp and mkstemps. * cfg.mk (sc_prohibit_mkstemp): New syntax check. * tools/virsh.c (vshEditWriteToTempFile): Adjust caller. * src/qemu/qemu_driver.c (qemuDomainScreenshot) (qemudDomainMemoryPeek): Likewise. * src/secret/secret_driver.c (replaceFile): Likewise. * src/vbox/vbox_tmpl.c (vboxDomainScreenshot): Likewise.	2012-10-31 10:06:10 -06:00
Martin Kletzander	10c5212b10	qemu: Fix EmulatorPinInfo without emulatorpin https://bugzilla.redhat.com/show_bug.cgi?id=871312 Recent fixes made almost all the right steps to make emulator pinned to the cpuset of the whole domain in case <emulatorpin> isn't specified, but qemudDomainGetEmulatorPinInfo still reports all the CPUs even when cpuset is specified. This patch fixes that.	2012-10-31 16:27:02 +01:00
Martin Kletzander	037a49dc66	Make non-KVM machines work with QMP probing When there is no 'qemu-kvm' binary and the emulator used for a machine is, for example, 'qemu-system-x86_64' that, by default, runs without kvm enabled, libvirt still supplies '-no-kvm' option to this process, even though it does not recognize such option (making the start of a domain fail in that case). This patch fixes building a command-line for QEMU machines without KVM acceleration and is based on following assumptions: - QEMU_CAPS_KVM flag means that QEMU is running KVM accelerated machines by default (without explicitly requesting that using a command-line option). It is the closest to the truth according to the code with the only exception being the comment next to the flag, so it's fixed in this patch as well. - QEMU_CAPS_ENABLE_KVM flag means that QEMU is, by default, running without KVM acceleration and in case we need KVM acceleration it needs to be explicitly instructed to do so. This is partially true for the past (this option essentially means that QEMU recognizes the '-enable-kvm' option, even though it's almost the same).	2012-10-31 08:31:49 +01:00
Vladislav Bogdanov	81af5336ac	qemu: pass -usb and usb hubs earlier, so USB disks with static address are handled properly	2012-10-30 08:54:32 +01:00
Vladislav Bogdanov	8f708761c0	qemu: Do not ignore address for USB disks	2012-10-30 08:54:28 +01:00
Michal Privoznik	34e8f63a32	qemu: Report errors from iohelper Currently, we use iohelper when saving/restoring a domain. However, if there's some kind of error (like I/O) it is not propagated to libvirt. Since it is not qemu who is doing the actual write() it will not get error. The iohelper does. Therefore we should check for iohelper errors as it makes libvirt more user friendly.	2012-10-29 17:04:26 +01:00
Ján Tomko	0b121614a2	xml: print uuids in the warning In the XML warning, we print a virsh command line that can be used to edit that XML. This patch prints UUIDs if the entity name contains special characters (like shell metacharacters, or "--" that would break parsing of the XML comment). If the entity doesn't have a UUID, just print the virsh command that can be used to edit it.	2012-10-29 14:38:43 +01:00
Jiri Denemark	23f5e74ed3	Revert "qemu: Do not require hostuuid in migration cookie" This reverts commit `8d75e47ede`. Libvirt was never released with support for migration cookies without hostuuid.	2012-10-29 09:04:27 +01:00
Cole Robinson	9a2975786b	qemu: Fix domxml-to-native network model conversion https://bugzilla.redhat.com/show_bug.cgi?id=636832	2012-10-27 12:20:49 -04:00
Eric Blake	dd0a7040f7	build: typo fix for qemu cpu affinity Introduced in commit `0039a32f`. * src/qemu/qemu_process.c (qemuPrepareCpumap): s/covert/convert/	2012-10-27 08:09:51 -06:00
Eric Blake	5a3501be9e	blockjob: relabel entire existing chain When using block copy to pivot over to a new chain, the backing files for the new chain might still need labeling (particularly if the user passes --reuse-ext with a relative backing file name). Relabeling a file that is already labeled won't hurt, so this just labels the entire chain at the point of the pivot. Doing the relabel of the chain uses the fact that we already safely probed the file type of an external file at the start of the block copy. * src/qemu/qemu_driver.c (qemuDomainBlockPivot): Relabel chain before asking qemu to pivot.	2012-10-27 07:43:39 -06:00
Eric Blake	35c7701c64	blockjob: allow mirroring under SELinux and cgroup Use the recent addition of qemuDomainPrepareDiskChainElement to obtain locking manager lease, permit a block device through cgroups, and set the SELinux label; then audit the fact that we hand a new file over to qemu. Alas, releasing the lease and label at the end of the mirroring is a trickier prospect (we would have to trace the backing chain of both source and destination, and be sure not to revoke rights to any part of the chain that is shared), so for now, virDomainBlockJobAbort still leaves things with additional access granted (as block-pull and block-commit have the same problem of not clamping access after completion, a future cleanup would cover all three commands). * src/qemu/qemu_driver.c (qemuDomainBlockCopy): Set up labeling.	2012-10-27 07:43:39 -06:00
Eric Blake	8ee5073c1e	blockjob: allow for existing files in block-copy Support the REUSE_EXT flag, in part by copying sanity checks from snapshot code. This code introduces a case of probing an external file for its type; such an action would be a security risk if the existing file is supposed to be raw but the contents resemble some other format; however, since the virDomainBlockRebase API has a flag to force treating the file as raw rather than probe, we can assume that probing is safe in all other instances. Besides, if we don't probe or force raw, then qemu will. * src/qemu/qemu_driver.c (qemuDomainBlockRebase): Allow REUSE_EXT flag. (qemuDomainBlockCopy): Wire up flag, and add some sanity checks.	2012-10-27 07:43:39 -06:00
Eric Blake	c1eb38053d	blockjob: implement block copy for qemu Minimal patch to wire up all the pieces in the previous patches to actually enable a block copy job. By minimal, I mean that qemu creates the file (that is, no REUSE_EXT flag support yet), SELinux must be disabled, a lock manager is not informed, and the audit logs aren't updated. But those will be added as improvements in future patches. This patch is designed so that if we ever add a future API virDomainBlockCopy with more bells and whistles (such as letting the user specify a destination image format different than the source), where virDomainBlockRebase is a wrapper around the simpler portions of the new functionality, then the new API can just reuse the new qemuDomainBlockCopy function and already support _SHALLOW and _REUSE_EXT flags. Also note that libvirt.c already filtered the new flags if _COPY is not present, so that we are not impacting the case of BlockRebase being a wrapper around BlockPull. * src/qemu/qemu_driver.c (qemuDomainBlockCopy): New function. (qemuDomainBlockRebase): Call it when appropriate.	2012-10-27 07:43:39 -06:00
Eric Blake	400ac797ef	blockjob: make block pivot safer Since libvirt drops locks between issuing a monitor command and getting a response, it is possible for libvirtd to be restarted before getting a response on a block-job-complete command; worse, it is also possible for the guest to shut itself down during the window while libvirtd is down, ending the qemu process. A management app needs to know if the pivot happened (and the destination file contains guest contents not in the source) or failed (and the source file contains guest contents not in the destination), but since the job is finished, 'query-block-jobs' no longer tracks the status of the job, and if the qemu process itself has disappeared, even 'query-block' cannot be checked to ask qemu its current state. At the time of this patch, the design for persistent bitmap has not been clarified, so a followup patch will be needed once qemu actually figures out how to expose it, and we figure out how to use it. In the meantime, we have a solution that avoids the worst of the problem. [This problem was first analyzed with the RHEL 6.3 __com.redhat_drive-reopen command; which partly explains why upstream qemu 1.3 ditched the drive-reopen idea and went with block-job-complete plus persistent bitmap instead.] If we surround 'drive-reopen' with a pause/resume pair, then we can guarantee that the guest cannot modify either source or destination files in the window of libvirtd uncertainty, and the management app is guaranteed that either libvirt knows the outcome and reported it correctly; or that on libvirtd restart, the guest will still be paused and that the qemu process cannot have disappeared due to guest shutdown; and use that as a clue that the management app must implement recovery protocol, with both source and destination files still being in sync and with 'query-block' still being an option as part of that recovery. My testing shows that the pause window will typically be only a fraction of a second. * src/qemu/qemu_driver.c (qemuDomainBlockPivot): Pause around drive-reopen. (qemuDomainBlockJobImpl): Update caller.	2012-10-27 07:43:38 -06:00
Eric Blake	eaba79d22e	blockjob: support pivot operation on cancel This is the bare minimum to end a copy job (of course, until a later patch adds the ability to start a copy job, this patch doesn't do much in isolation; I've just split the patches to ease the review). This patch intentionally avoids SELinux, lock manager, and audit actions. Also, if libvirtd restarts at the exact moment that a 'block-job-complete' is in flight, the proposed proper way to detect the outcome of that would be with a persistent bitmap and some additional query commands when libvirtd restarts. This patch is enough to test the common case of success when used correctly, while saving the subtleties of proper cleanup for worst-case errors for later. When a mirror job is started, cancelling the job safely reverts back to the source disk, regardless of whether the destination is in phase 1 (streaming, in which case the destination is worthless) or phase 2 (mirroring, in which case the destination is synced up to the source at the time of the cancel). Our existing code does just fine in either phase, other than some bookkeeping cleanup; this implements live block copy. Ideas for future enhancements via new flags: Depending on when persistent bitmap support is added, it may be worth adding a VIR_DOMAIN_REBASE_COPY_ATOMIC flag that fails up front if we detect an older qemu with risky pivot operation. Interesting side note: while snapshot-create --disk-only creates a copy of the disk at a point in time by moving the domain on to a new file (the copy is the file now in the just-extended backing chain), blockjob --abort of a copy job creates a copy of the disk while keeping the domain on the original file. There may be potential improvements to the snapshot code to exploit block copy over multiple disks all at one point in time. And, if 'block-job-cancel' were made part of 'transaction', you could copy multiple disks at the same point in time without pausing the domain. This also implies we may want to add a --quiesce flag to virDomainBlockJobAbort, so that when breaking a mirror (whether by cancel or pivot), the side of the mirror that we are abandoning is at least in a stable state with regards to guest I/O. * src/qemu/qemu_driver.c (qemuDomainBlockJobAbort): Accept new flag. (qemuDomainBlockPivot): New helper function. (qemuDomainBlockJobImpl): Implement it.	2012-10-27 07:43:38 -06:00
Eric Blake	edecd45c78	blockjob: return appropriate event and info Handle the new type of block copy event and info. Of course, this patch does nothing until a later patch actually allows the creation/abort of a block copy job. * include/libvirt/libvirt.h.in (VIR_DOMAIN_BLOCK_JOB_READY): New block job status. * src/libvirt.c (virDomainBlockRebase): Document the event. * src/qemu/qemu_monitor_json.c (eventHandlers): New event. (qemuMonitorJSONHandleBlockJobReady): New function. (qemuMonitorJSONGetBlockJobInfoOne): Translate new job type. (qemuMonitorJSONHandleBlockJobImpl): Handle new event and job type. * src/qemu/qemu_process.c (qemuProcessHandleBlockJob): Recognize the event to minimize snooping. * src/qemu/qemu_driver.c (qemuDomainBlockJobImpl): Snoop a successful info query to save effort on a pivot request.	2012-10-27 07:43:38 -06:00
Eric Blake	b3822ed04a	blockjob: react to active block copy For now, disk migration via block copy job is not implemented in libvirt. But when we do implement it, we have to deal with the fact that qemu does not yet provide an easy way to re-start a qemu process with mirroring still intact. Paolo has proposed an idea for a persistent dirty bitmap that might make this possible, but until that design is complete, it's hard to say what changes libvirt would need. Even something like 'virDomainSave' becomes hairy, if you realize the implications that 'virDomainRestore' would be stuck with recreating the same mirror layout. But if we step back and look at the bigger picture, we realize that the initial client of live storage migration via disk mirroring is oVirt, which always uses transient domains, and that if a transient domain is destroyed while a mirror exists, oVirt can easily restart the storage migration by creating a new domain that visits just the source storage, with no loss in data. We can make life a lot easier by being cowards for now, forbidding certain operations on a domain. This patch guarantees that we never get in a state where we would have to restart a domain with a mirroring block copy, by preventing saves, snapshots, migration, hot unplug of a disk in use, and conversion to a persistent domain (thankfully, it is still relatively easy to 'virsh undefine' a running domain to temporarily make it transient, run tests on 'virsh blockcopy', then 'virsh define' to restore the persistence). Later, if the qemu design is enhanced, we can relax our code. The change to qemudDomainDefine looks a bit odd for undoing an assignment, rather than probing up front to avoid the assignment, but this is because of how virDomainAssignDef combines both a lookup and assignment into a single function call. * src/conf/domain_conf.h (virDomainHasDiskMirror): New prototype. * src/conf/domain_conf.c (virDomainHasDiskMirror): New function. * src/libvirt_private.syms (domain_conf.h): Export it. * src/qemu/qemu_driver.c (qemuDomainSaveInternal) (qemuDomainSnapshotCreateXML, qemuDomainRevertToSnapshot) (qemuDomainBlockJobImpl, qemudDomainDefine): Prevent dangerous actions while block copy is already in action. * src/qemu/qemu_hotplug.c (qemuDomainDetachDiskDevice): Likewise. * src/qemu/qemu_migration.c (qemuMigrationIsAllowed): Likewise.	2012-10-27 07:43:38 -06:00
Eric Blake	6d264c9182	blockjob: add qemu capabilities related to block jobs Upstream qemu 1.3 is adding two new monitor commands, 'drive-mirror' and 'block-job-complete'[1], which can drive live block copy and storage migration. [Additionally, RHEL 6.3 had backported an earlier version of most of the same functionality, but under the names '__com.redhat_drive-mirror' and '__com.redhat_drive-reopen' and with slightly different JSON arguments, and has been using patches similar to these upstream patches for several months now.] The libvirt API virDomainBlockRebase as already committed for 0.9.12 is flexible enough to expose the basics of block copy, but some additional features in the 'drive-mirror' qemu command, such as setting error policy, setting granularity, or using a persistent bitmap, may later require a new libvirt API virDomainBlockCopy. I will wait to add that API until we know more about what qemu 1.3 will finally provide. This patch caters only to the upstream qemu 1.3 interface, although I have proven that the changes for RHEL 6.3 can be isolated to just qemu_monitor_json.c, and the rest of this series will gracefully handle either interface once the JSON differences are papered over in a downstream patch. For consistency with other block job commands, libvirt must handle the bandwidth argument as MiB/sec from the user, even though qemu exposes the speed argument as bytes/sec; then again, qemu rounds up to cluster size internally, so using MiB hides the worst effects of that rounding if you pass small numbers. [1]https://lists.gnu.org/archive/html/qemu-devel/2012-10/msg04123.html * src/qemu/qemu_capabilities.h (QEMU_CAPS_DRIVE_MIRROR) (QEMU_CAPS_DRIVE_REOPEN): New bits. * src/qemu/qemu_capabilities.c (qemuCaps): Name them. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONCheckCommands): Set them. (qemuMonitorJSONDriveMirror, qemuMonitorDrivePivot): New functions. * src/qemu/qemu_monitor_json.h (qemuMonitorJSONDriveMirror) (qemuMonitorDrivePivot): Declare them. * src/qemu/qemu_monitor.c (qemuMonitorDriveMirror) (qemuMonitorDrivePivot): New passthroughs. * src/qemu/qemu_monitor.h (qemuMonitorDriveMirror) (qemuMonitorDrivePivot): Declare them.	2012-10-27 07:43:37 -06:00
Laine Stump	def31e4c58	qemu: fix attach/detach of netdevs with matching mac addrs This resolves: https://bugzilla.redhat.com/show_bug.cgi?id=862515 which describes inconsistencies in dealing with duplicate mac addresses on network devices in a domain. (at any rate, it resolves almost everything, and prints out an informative error message for the one problem that isn't solved, but has a workaround.) A synopsis of the problems: 1) you can't do a persistent attach-interface of a device with a mac address that matches an existing device. 2) you can do a live attach-interface of such a device. 3) you can directly edit a domain and put in two devices with matching mac addresses. 4) When running virsh detach-device (live or config), only MAC address is checked when matching the device to remove, so the first device with the desired mac address will be removed. This isn't always the one that's wanted. 5) when running virsh detach-interface (live or config), the only two items that can be specified to match against are mac address and model type (virtio, etc) - if multiple netdevs match both of those attributes, it again just finds the first one added and assumes that is the only match. Since it is completely valid to have multiple network devices with the same MAC address (although it can cause problems in many cases, there are valid use cases), what is needed is: 1) remove the restriction that prohibits doing a persistent add of a netdev with a duplicate mac address. 2) enhance the backend of virDomainDetachDeviceFlags to check for something that is guaranteed unique (but still work with just mac address, as long as it yields only a single results. This patch does three things: 1) removes the check for duplicate mac address during a persistent netdev attach. 2) unifies the searching for both live and config detach of netdevices in the subordinate functions of qemuDomainModifyDeviceFlags() to use the new function virDomainNetFindIdx (which matches mac address and PCI address if available, checking for duplicates if only mac address was specified). This function returns -2 if multiple matches are found, allowing the callers to print out an appropriate message. Steps 1 & 2 are enough to fully fix the problem when using virsh attach-device and detach-device (which require an XML description of the device rather than a bunch of commandline args) 3) modifies the virsh detach-interface command to check for multiple matches of mac address and show an error message suggesting use of the detach-device command in cases where there are multiple matching mac addresses. Later we should decide how we want to input a PCI address on the virsh commandline, and enhance detach-interface to take a --address option, eliminating the need to use detach-device * src/conf/domain_conf.c * src/conf/domain_conf.h * src/libvirt_private.syms * added new virDomainNetFindIdx function * removed now unused virDomainNetIndexByMac and virDomainNetRemoveByMac * src/qemu/qemu_driver.c * remove check for duplicate max from qemuDomainAttachDeviceConfig * use virDomainNetFindIdx/virDomainNetRemove instead of virDomainNetRemoveByMac in qemuDomainDetachDeviceConfig * use virDomainNetFindIdx instead of virDomainIndexByMac in qemuDomainUpdateDeviceConfig * src/qemu/qemu_hotplug.c * use virDomainNetFindIdx instead of a homespun loop in qemuDomainDetachNetDevice. * tools/virsh-domain.c: modified detach-interface command as described above	2012-10-26 20:47:54 -04:00
Eric Blake	4fbf322fe9	cpustat: fix regression when cpus are offline It turns out that the cpuacct results properly account for offline cpus, and always returns results for every possible cpu, not just the online ones. So there is no need to check the map of online cpus in the first place, merely only a need to know the maximum possible cpu. Meanwhile, virNodeGetCPUBitmap had a subtle change from returning the maximum id to instead returning the width of the bitmap (one larger than the maximum id) in commit `2f4c5338`, which made this code encounter some off-by-one logic leading to bad error messages when a cpu was offline: $ virsh cpu-stats dom error: Failed to virDomainGetCPUStats() error: An error occurred, but the cause is unknown Cleaning this up unraveled a chain of other unused variables. * src/qemu/qemu_driver.c (qemuDomainGetPercpuStats): Drop pointless check for cpumap changes, and use correct number of cpus. Simplify signature. (qemuDomainGetCPUStats): Adjust caller. * src/nodeinfo.h (nodeGetCPUCount): New prototype. (nodeGetCPUBitmap): Drop unused parameter. * src/nodeinfo.c (nodeGetCPUBitmap): Likewise. (nodeGetCPUMap): Adjust caller. (nodeGetCPUCount): New function. * src/libvirt_private.syms (nodeinfo.h): Export it.	2012-10-26 15:34:52 -06:00
Viktor Mihajlovski	e3ba67037b	virNodeGetCPUMap: Implement driver support Driver support added for: - test: pretending 8 host CPUS, 3 being online - qemu, lxc, openvz, uml: using nodeGetCPUMap Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2012-10-25 11:20:15 -06:00
Eric Blake	2f4c5338a6	nodeinfo: improve probing node cpu bitmap Callers should not need to know what the name of the file to be read in the Linux-specific version of nodeGetCPUmap; furthermore, qemu cares about online cpus, not present cpus, when determining which cpus to skip. While at it, I fixed the fact that we were computing the maximum online cpu id by doing a slow iteration, when what we really want to know is the max available cpu. * src/nodeinfo.h (nodeGetCPUmap): Rename... (nodeGetCPUBitmap): ...and simplify signature. * src/nodeinfo.c (linuxParseCPUmax): New function. (linuxParseCPUmap): Simplify and alter signature. (nodeGetCPUBitmap): Change implementation. * src/libvirt_private.syms (nodeinfo.h): Reflect rename. * src/qemu/qemu_driver.c (qemuDomainGetPercpuStats): Update caller.	2012-10-25 11:20:08 -06:00
Osier Yang	a6bd7c22ea	qemu: Prohibit chaning affinity of domain process if placement is 'auto' On one hand, numad probably will manage the affinity of domain process dynamically in future. On the other hand, even numad won't manage it, it still could confusion. Let's make things simpler enough to avoid the lair for now.	2012-10-24 22:26:11 +08:00
Osier Yang	bb81021bfe	qemu: Keep the affinity when creating cgroup for emulator thread When the cpu placement model is "auto", it sets the affinity for domain process with the advisory nodeset from numad, however, creating cgroup for the domain process (called emulator thread in some contexts) later overrides that with pinning it to all available pCPUs. How to reproduce: * Configure the domain with "auto" placement for <vcpu>, e.g. <vcpu placement='auto'>4</vcpu> * % virsh start dom * % cat /proc/$dompid/status Though the emulator cgroup cause conflicts, but we can't simply prohibit creating it, as other tunables are still useful, such as "emulator_period", which is used by API virDomainSetSchedulerParameter. So this patch doesn't prohibit creating the emulator cgroup, but inherit the nodeset from numad, and reset the affinity for domain process. * src/qemu/qemu_cgroup.h: Modify definition of qemuSetupCgroupForEmulator to accept the passed nodenet * src/qemu/qemu_cgroup.c: Set the affinity with the passed nodeset	2012-10-24 21:46:24 +08:00
Osier Yang	0039a32fca	qemu: Add helper to prepare cpumap for affinity setting Abstract the codes to prepare cpumap into a helper a function, which can be used later. * src/qemu/qemu_process.h: Declare qemuPrepareCpumap * src/qemu/qemu_process.c: Implement qemuPrepareCpumap, and use it.	2012-10-24 21:24:10 +08:00
Kyle Mestery	2f3e2c0c43	qemu_migration: Transport OVS per-port data during live migration Transport Open vSwitch per-port data during live migration by using the utility functions virNetDevOpenvswitchGetMigrateData() and virNetDevOpenvswitchSetMigrateData(). Signed-off-by: Kyle Mestery <kmestery@cisco.com>	2012-10-23 15:26:04 -04:00
Kyle Mestery	694d0c520b	qemu_migration: Add hooks to transport network data during migration Add the ability for the Qemu V3 migration protocol to include transporting network configuration. A generic framework is proposed with this patch to allow for the transfer of opaque data. Signed-off-by: Kyle Mestery <kmestery@cisco.com> Signed-off-by: Laine Stump <laine@laine.org>	2012-10-23 15:26:04 -04:00
Eric Blake	33eaebe48e	snapshot: sanity check when reusing file for snapshot The snapshot code when reusing an existing file had hard-to-read logic, as well as a missing sanity check: REUSE_EXT should require the destination to already be present. * src/qemu/qemu_driver.c (qemuDomainSnapshotDiskPrepare): Require destination on REUSE_EXT, rename variable for legibility.	2012-10-22 15:10:16 -06:00
Cole Robinson	e58dfad4a4	qemu: Don't use -enable-nesting with qemu 1.2.0+ Since the option doesn't exist. Fixes booting with cpu mode='host-model' and qemu 1.2.0	2012-10-22 16:15:12 -04:00
Doug Goldstein	2da776b1d6	qemu: Don't blindly assume VNC is supported Currently it's assumed that qemu always supports VNC, however it is definitely possible to compile qemu without VNC support so we should at the very least check for it and handle that correctly.	2012-10-22 23:16:17 +08:00

... 12 13 14 15 16 ...

3371 Commits