libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-12-29 09:05:25 +00:00

Author	SHA1	Message	Date
Daniel P. Berrange	9c4ecb3e8e	Revert hack for autodestroy in qemuProcessStop This reverts the hack done in commit `568a6cda27` Author: Jiri Denemark <jdenemar@redhat.com> Date: Fri Feb 15 15:11:47 2013 +0100 qemu: Avoid deadlock in autodestroy since we now have a fix which avoids the deadlock scenario entirely	2013-03-01 10:18:27 +00:00
Daniel P. Berrange	96b893f092	Fix deadlock in QEMU close callback APIs There is a lock ordering problem in the QEMU close callback APIs. When starting a guest we have a lock on the VM. We then set a autodestroy callback, which acquires a lock on the close callbacks. When running auto-destroy, we obtain a lock on the close callbacks, then run each callbacks - which obtains a lock on the VM. This causes deadlock if anyone tries to start a VM, while autodestroy is taking place. The fix is to do autodestroy in 2 phases. First obtain all the callbacks and remove them from the list under the close callback lock. Then invoke each callback from outside the close callback lock. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-03-01 10:16:29 +00:00
Daniel P. Berrange	7ccad0b16d	Fix crash in QEMU auto-destroy with transient guests When the auto-destroy callback runs it is supposed to return NULL if the virDomainObjPtr is no longer valid. It was not doing this for transient guests, so we tried to virObjectUnlock a mutex which had been freed. This often led to a crash. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-03-01 10:16:29 +00:00
Jiri Denemark	e4e28220b5	qemu: Make sure qemuProcessStart is run within a job qemuProcessStart expects to be run with a job already set and every caller except for qemuMigrationPrepareAny use it correctly. This bug can be observed in libvirtd logs during incoming migration as warning : qemuDomainObjEnterMonitorInternal:979 : This thread seems to be the async job owner; entering monitor without asking for a nested job is dangerous	2013-03-01 08:32:08 +01:00
Jim Fehlig	438a3850db	Fix starting qemu instances when apparmor driver is enabled With the apparmor security driver enabled, qemu instances fail to start # grep ^security_driver /etc/libvirt/qemu.conf security_driver = "apparmor" # virsh start test-kvm error: Failed to start domain test-kvm error: internal error security label already defined for VM The model field of virSecurityLabelDef object is always populated by virDomainDefGetSecurityLabelDef(), so remove the check for a NULL model when verifying if a label is already defined for the instance. Checking for a NULL model and populating it later in AppArmorGenSecurityLabel() has been left in the code to be consistent with virSecuritySELinuxGenSecurityLabel().	2013-02-28 17:35:59 -07:00
Serge Hallyn	4f773a8c30	Fix a message typo As pointed out in https://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/1034661 The sentence "The function of PCI device addresses must less than 8" does not quite make sense. Update that to read "The function of PCI device addresses must be less than 8" Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2013-02-28 15:29:10 -07:00
Michal Privoznik	b8e25c35d7	qemu: Don't fail to shutdown domains with unresponsive agent Currently, qemuDomainShutdownFlags() chooses the agent method of shutdown whenever the agent is configured. However, this assumption is not enough as the guest agent may be unresponsive at the moment. So unless guest agent method has been explicitly requested, we should fall back to the ACPI method.	2013-02-28 12:24:34 +01:00
Viktor Mihajlovski	adfa3469bb	qemu: virConnectGetVersion returns bogus value The unitialized local variable qemuVersion can cause an random value to be returned for the hypervisor version, observable with virsh version. Introduced by commit `b46f7f4a0b` Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-02-28 11:48:02 +01:00
Paolo Bonzini	0a562de1ff	qemu: fix use-after-free when parsing NBD disk disk->src is still used for disks->hosts->name, do not free it. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2013-02-27 22:02:01 -07:00
Daniel P. Berrange	7f544a4c8f	Don't try to add non-existant devices to ACL The QEMU driver has a list of devices nodes that are whitelisted for all guests. The kernel has recently started returning an error if you try to whitelist a device which does not exist. This causes a warning in libvirt logs and an audit error for any missing devices. eg 2013-02-27 16:08:26.515+0000: 29625: warning : virDomainAuditCgroup:451 : success=no virt=kvm resrc=cgroup reason=allow vm="vm031714" uuid=9d8f1de0-44f4-a0b1-7d50-e41ee6cd897b cgroup="/sys/fs/cgroup/devices/libvirt/qemu/vm031714/" class=path path=/dev/kqemu rdev=? acl=rw Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Daniel P. Berrange	d0b3ee55ec	Fix typo in internal VIR_QEMU_PROCESS_START_AUTODESROY constant s/VIR_QEMU_PROCESS_START_AUTODESROY/VIR_QEMU_PROCESS_START_AUTODESTROY/ Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Daniel P. Berrange	279336c5d8	Avoid spamming logs with cgroups warnings The code for putting the emulator threads in a separate cgroup would spam the logs with warnings 2013-02-27 16:08:26.731+0000: 29624: warning : virCgroupMoveTask:887 : no vm cgroup in controller 3 2013-02-27 16:08:26.731+0000: 29624: warning : virCgroupMoveTask:887 : no vm cgroup in controller 4 2013-02-27 16:08:26.732+0000: 29624: warning : virCgroupMoveTask:887 : no vm cgroup in controller 6 This is because it has only created child cgroups for 3 of the controllers, but was trying to move the processes from all the controllers. The fix is to only try to move threads in the controllers we actually created. Also remove the warning and make it return a hard error to avoid such lazy callers in the future. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Daniel P. Berrange	b4a124efc3	Fix autodestroy of QEMU guests The virQEMUCloseCallbacksRunOne method was passing a uuid string to virDomainObjListFindByUUID, when it actually expected to get a raw uuid buffer. This was not caught by the compiler because the method was using a 'void *uuid' instead of first casting it to the expected type. This regression was accidentally caused by refactoring in commit `568a6cda27` Author: Jiri Denemark <jdenemar@redhat.com> Date: Fri Feb 15 15:11:47 2013 +0100 qemu: Avoid deadlock in autodestroy Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-27 22:51:24 +00:00
Eric Blake	25dc8ba08b	qemu: -numa doesn't (yet) support disjoint range https://bugzilla.redhat.com/show_bug.cgi?id=896092 mentions that qemu 1.4 and earlier only accept a simple start-stop range for the cpu=... argument of -numa. Libvirt would attempt to use -numa cpu=1,3 for a disjoint range, which did not work as intended. Upstream qemu will be adding a new syntax for disjoint cpu ranges in 1.5; but the design for that syntax is still under discussion at the time of this patch. So for libvirt 1.0.3, it is safest to just reject attempts to build an invalid qemu command line; in the future, we can add a capability bit and translate to the final accepted design for selecting a disjoint cpu range in numa. * src/qemu/qemu_command.c (qemuBuildNumaArgStr): Reject disjoint ranges.	2013-02-27 09:31:42 -07:00
Laine Stump	db2536a627	Revert "Add support for <option> tag in network config" This reverts commit `383ebc4694`. We decided the xml for this feature needed more thought to make sure we are doing it the best way, in particular wrt option values that have multiple items.	2013-02-27 10:55:24 -05:00
Peter Krempa	923e4bf810	util: Add docs for virXMLProp string To avoid confusion about usage of this function explicitly document that this function returns copy of the attribute string.	2013-02-27 11:49:03 +01:00
Michal Privoznik	c69331b500	libvirt.c: Make VIR_MIGRATE_NON_SHARED_{INC,DISK} mutually exclusive These two flags in fact are mutually exclusive. Requesting them both doesn't make any sense regardless of hypervisor driver. Hence, we have to make it within libvirt.c file instead of fixing it in each driver.	2013-02-27 09:07:18 +01:00
Eric Blake	76f352690a	build: don't use gcrypt.h without gnutls support Eugene Marcotte reported that if gcrypt-devel (a prereq of gnutls-devel) is not present, then compilation fails due to an unconditional use of <gcrypt.h>. * src/libvirt.c (includes): Properly guard use of gcrypt.h.	2013-02-26 16:43:03 -07:00
Eric Blake	d76f3e0643	Revert "S390: domain_conf support for CCW" This reverts commit `0bbbd42c30`. The design for this feature is not complete, and may change the name of the 'schid' attribute. Revert requested by Viktor Mihajlovski.	2013-02-26 16:32:15 -07:00
Doug Goldstein	01207bb703	interface: udev backend coverity NULL deref This fixes a potential NULL deref identified by John Ferlan <jferlan@redhat.com> if scandir() didn't return an expected value.	2013-02-26 13:30:17 -06:00
Daniel P. Berrange	464c92c051	Remove some C99 variable decls in parallels driver The parallels storage driver declared some loop variables inside the for(;;). This is not allowed by libvirt coding standards Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-26 17:54:52 +00:00
Daniel P. Berrange	02b9097274	Fix crash changing CDROM media This change tried to fix a crash with changing CDROM media but failed to actually do so commit `d0172d2b1b` Author: Osier Yang <jyang@redhat.com> Date: Tue Feb 19 20:27:45 2013 +0800 qemu: Remove the shared disk entry if the operation is ejecting or updating It was still accessing disk->src, when the entire 'disk' object has been free'd already. Even if it weren't free'd, accessing the 'src' value of virDomainDiskDef is not allowed without first validating disk->type is file or block. Just remove the broken code entirely. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-26 17:45:31 +00:00
Ján Tomko	633d859b73	libvirt: fix error message when connection can't be opened VIR_ERR_NO_CONNECT already contains "no connection driver available". This patch changes: no connection driver available for No connection for URI hello to: no connection driver available for hello Bug: https://bugzilla.redhat.com/show_bug.cgi?id=851413	2013-02-26 15:01:03 +01:00
Paolo Bonzini	45dc3f1703	qemu: do not set unpriv_sgio if neither supported nor requested Currently we call virSetDeviceUnprivSGIO with val == 0 if a block device has an sgio attribute. But for sgio='filtered', we know that a kernel with no unpriv_sgio support will always behave as the user wanted. In this case, there is no need to call the function and report a (bogus) error. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2013-02-26 13:46:52 +01:00
Eric Blake	6abd5ea124	qemu: minor monitor lock cleanups If virCondInit fails (okay, so that's unlikely), then we end up attempting a virObjectUnlock() on the cleanup path, even though we don't hold a lock. This is not guaranteed to be safe. While at it, I noticed a couple places where we were referencing mon->fd outside locks. * src/qemu/qemu_monitor.c (qemuMonitorOpenInternal): Minimize lock duration. mon->watch doesn't need clean up on error. (qemuMonitorGetBlockExtent, qemuMonitorBlockResize): Don't dereference fd outside of lock.	2013-02-25 17:36:51 -07:00
Eric Blake	29424d1acd	qemu: don't override earlier json error I built without yajl support, and noticed a strange failure message in qemumonitorjsontest: 2013-02-22 16:12:37.503+0000: 19812: error : virJSONValueToString:1119 : internal error No JSON parser implementation is available 2013-02-22 16:12:37.503+0000: 19812: error : qemuMonitorJSONCommandWithFd:253 : out of memory While a later patch will fix the test to skip when json is not present, this patch avoids overriding the more useful error message from virJSONValueToString returning NULL. * src/qemu/qemu_monitor_json.c (qemuMonitorJSONCommandWithFd): Don't override message. (qemuMonitorJSONCheckError): Don't print NULL. * src/qemu/qemu_agent.c (qemuAgentCommand): Don't override message. (qemuAgentCheckError): Don't print NULL. (qemuAgentArbitraryCommand): Properly fail on OOM.	2013-02-25 17:36:03 -07:00
Peter Krempa	34f1a618a5	conf: Avoid leaking of RNG device definition	2013-02-25 22:31:11 +01:00
Peter Krempa	19c6ad9ac7	qemu: Refactor qemuDomainSetMemoryParameters The new TypedParam helper APIs allow to simplify this function significantly. This patch integrates the fix in `75e5bec97b` by correctly ordering the setting functions instead of reordering the parameters.	2013-02-25 17:24:34 +01:00
Doug Goldstein	65bb1b9795	interface: Fix udev backend bridge device display The bridge device was showing the vnet devices created for the domains as connected to the bridge. libvirt should only show host devices when trying to get the interface definition rather than the domain devices as well.	2013-02-25 10:06:29 -06:00
Philipp Hahn	5474870cda	storage: cast -1 for uid_t\|gid_t uid_t and gid_t are opaque types, ranging from s32 to u32 to u64. Explicitly cast the magic -1 to the appropriate type. Signed-off-by: Philipp Hahn <hahn@univention.de>	2013-02-25 15:46:32 +01:00
Philipp Hahn	81af880a6f	storage: Cast uid_t\|gid_t to unsigned int uid_t and gid_t are opaque types, ranging from s32 to u32 to u64. Explicitly cast them to unsigned int for printing. Signed-off-by: Philipp Hahn <hahn@univention.de>	2013-02-25 15:46:32 +01:00
Philipp Hahn	7b4b59fa4a	util: Fix printf format for uid_t\|gid_t The uid_t\|gid_t values are explicitly casted to "unsigned long", but the printf() still used "%d", which is for signed values. Change the format to "%u". Signed-off-by: Philipp Hahn <hahn@univention.de>	2013-02-25 15:46:32 +01:00
Peter Krempa	820019fcdf	qemu: Implement support for EGD backend for virtio-rng This patch adds a new capability bit QEMU_CAPS_OBJECT_RNG_EGD and code to support the egd backend for the VirtIO RNG device. The device is added by 3 qemu command line options: -chardev socket,id=charrng0,host=1.2.3.4,port=1234 (communication backend) -object rng-egd,chardev=charrng0,id=rng0 (RNG protocol client) -device virtio-rng-pci,rng=rng0,bus=pci.0,addr=0x4 (the RNG device)	2013-02-25 10:55:14 +01:00
Peter Krempa	234a55604e	qemu: Implement support for default 'random' backend for virtio-rng This patch implements support for the virtio-rng-pci device and the rng-random backend in qemu. Two capabilities bits are added to track support for those: QEMU_CAPS_DEVICE_VIRTIO_RNG - for the device support and QEMU_CAPS_OBJECT_RNG_RANDOM - for the backend support. qemu is invoked with these additional parameters if the device is enabled: -object rng-random,id=rng0,filename=/test/phile (to add the backend) -device virtio-rng-pci,rng=rng0,bus=pci.0,addr=0x4 (to add the device)	2013-02-25 10:46:19 +01:00
Peter Krempa	1d28aa05a6	conf: Add RNG device ABI compatibility check	2013-02-25 10:43:05 +01:00
Peter Krempa	e2a0dfcfe9	conf: Add support for RNG device configuration in XML This patch adds basic configuration support for the RNG device supporting the virtio model with the "random" and "egd" backend types as described in the schema in the previous patch.	2013-02-25 10:36:31 +01:00
Peter Krempa	5e3c344785	conf: Add fake switch statement to warn for new device types This patch adds a fake switch statement to force the compiler to warn after a new device type was added. This should remind the contributor to add the new device also to this iterator function.	2013-02-25 09:36:24 +01:00
Gene Czarcinski	0b73a763f3	use client id for IPv6 DHCP host definition Originally, only a host name was used to associate a DHCPv6 request with a specific IPv6 address. Further testing demonstrates that this is an unreliable method and, instead, a client-id or DUID needs to be used. According to DHCPv6 standards, this id can be a duid-LLT, duid-LL, or duid-UUID even though dnsmasq will accept almost any text string. Although validity checking of a specified string makes sure it is hexadecimal notation with bytes separated by colons, there is no rigorous check to make sure it meets the standard. Documentation and schemas have been updated. Signed-off-by: Gene Czarcinski <gene@czarc.net> Signed-off-by: Laine Stump <laine@laine.org>	2013-02-25 02:49:06 -05:00
Laine Stump	c720ddd6ac	util: fix typo in comment	2013-02-24 23:21:28 -05:00
Ján Tomko	c9ad54db40	storage: qemu-img: change INFO to DEBUG For really old qemu-img binaries which do not support specifying the format of the backing file, display a DEBUG message instead of INFO that this can't be done.	2013-02-24 18:11:40 +01:00
Ján Tomko	81738cd014	storage: move qemu-img options before positional arguments Modify the expected output of storagevolxml2argv tests as well.	2013-02-24 18:11:40 +01:00
Ján Tomko	c6e87d1a4b	storage: separate qemu-img command generation and execution This allows us to create a test for the generated command line.	2013-02-24 18:11:40 +01:00
Ján Tomko	41c73f66bc	storage: move flag setting after declarations	2013-02-24 18:11:40 +01:00
Michal Privoznik	1e54685fc7	qemu_migration: Cancel running jobs on failed migration If a migration fails, we need to stop all block jobs running so qemu doesn't try to send data to destination over and over again.	2013-02-23 08:51:30 +01:00
Michal Privoznik	ae21b9bde6	qemu_migration: Stop NBD server at Finish phase At the end of migration, it is important to stop NBD server and thus release all allocated resources.	2013-02-23 08:42:57 +01:00
Michal Privoznik	7b7600b3e6	qemu_migration: Introduce qemuMigrationDriveMirror This function does the source part of NBD magic. It invokes drive-mirror on each non shared and RW disk with a source and wait till the mirroring process completes. When it does we can proceed with migration. Currently, an active waiting is done: every 500ms libvirt asks qemu if block-job is finished or not. However, once the job finishes, qemu doesn't report its progress so we can only assume if the job finished successfully or not. The better solution would be to listen to the event which is sent as soon as the job finishes. The event does contain the result of job.	2013-02-23 08:42:54 +01:00
Michal Privoznik	86d90b3abd	qemu_migration: Introduce qemuMigrationStartNBDServer() We need to start NBD server and feed it with all non-<shared/>, RW and source-full disks. Moreover, with new virPortAllocator we must ensure the borrowed port for NBD server will be returned if either migration completes or qemu process is torn down.	2013-02-23 08:25:09 +01:00
Michal Privoznik	f1748e34e2	qemu: Introduce nbd-server-stop command This will be used after all migration work is done to stop NBD server running on destination. It doesn't take any arguments, just issues a command.	2013-02-23 08:16:42 +01:00
Michal Privoznik	c833d8111d	qemu: Introduce nbd-server-add command This will be used with new migration scheme. This patch creates basically just monitor stub functions. Wiring them into something useful is done in later patches.	2013-02-23 08:06:37 +01:00
Michal Privoznik	bb6359e8d4	qemu: Introduce nbd-server-start command This will be used with new migration scheme. This patch creates basically just monitor stub functions. Wiring them into something useful is done in later patches.	2013-02-23 07:58:13 +01:00
Michal Privoznik	121d4cfb9a	Introduce NBD migration cookie This migration cookie is meant for two purposes. The first is to be sent in begin phase from source to destination to let it know we support new implementation of VIR_MIGRATE_NON_SHARED_{DISK,INC} so destination can start NBD server. Then, the second purpose is, destination can let us know, on which port the NBD server is running.	2013-02-23 07:49:56 +01:00
Michal Privoznik	e9a6704f99	qemu: Introduce NBD_SERVER capability This just keeps track whether qemu knows nbd-server-* commands so we can use it during migration or not.	2013-02-23 07:33:43 +01:00
Pieter Hollants	383ebc4694	Add support for <option> tag in network config This patch adds support for a new <option>-Tag in the <dhcp> block of network configs, based on a subset of the fifth proposal by Laine Stump in the mailing list discussion at https://www.redhat.com/archives/libvir-list/2012-November/msg01054.html. Any such defined option will result in a dhcp-option=<number>,"<value>" statement in the generated dnsmasq configuration file. Currently, DHCP options can be specified by number only and there is no whitelisting or blacklisting of option numbers, which should probably be added. Signed-off-by: Pieter Hollants <pieter@hollants.com> Signed-off-by: Laine Stump <laine@laine.org>	2013-02-22 19:45:19 -05:00
Sage Weil	c2092abfb9	storage: use f_frsize, not f_bsize, for calculating available space The bfree and blocks fields are supposed to be in units of frsize. We were calculating capacity correctly using those units, but the available calculation was using bsize instead. Most file systems report these as the same value specifically because many programs are buggy, but that is no reason to rely on that behavior, or to behave inconsistently. This bug has been present since `e266ded` (2008) and `aa296e6c`, when the code was originally introduced (the latter via cut and paste). Signed-off-by: Sage Weil <sage@newdream.net>	2013-02-22 17:01:25 -07:00
Eric Blake	91ffb42c2c	maint: check all symfiles for sorting On FreeBSD, I got a 'make check' failure: GEN check-symsorting Symbol block at ./libvirt_atomic.syms:4: viratomic.h not found * src/Makefile.am (SYM_FILES): New define. (check-symsorting): Check on all symfiles, even when not used. * src/libvirt_atomic.syms: Fix offender.	2013-02-22 16:48:12 -07:00
Jiri Denemark	492afb8202	qemu: Implement virDomainMigrate*CompressionCache	2013-02-22 17:36:00 +01:00
Jiri Denemark	be80186732	remote: Implement virDomainMigrate*CompressionCache	2013-02-22 17:35:59 +01:00
Jiri Denemark	34f44e911f	Introduce virDomainMigrate*CompressionCache APIs Introduce virDomainMigrateGetCompressionCache and virDomainMigrateSetCompressionCache APIs.	2013-02-22 17:35:59 +01:00
Jiri Denemark	8def32916d	qemu: Implement virDomainGetJobStats	2013-02-22 17:35:59 +01:00
Jiri Denemark	4121a77c1a	qemu: Parse more fields from query-migrate QMP command As a side effect, this also fixes reporting disk migration process. It was added to memory migration progress, which was wrong. Disk progress has dedicated fields in virDomainJobInfo structure.	2013-02-22 17:35:59 +01:00
Jiri Denemark	34fd94278a	remote: Implement virDomainGetJobStats	2013-02-22 17:35:58 +01:00
Jiri Denemark	98b9acf5aa	remote: Auto-allocate params in remoteDeserializeTypedParameters remoteDeserializeTypedParameters can now be called with either preallocated params array (size of which is announced by nparams) or it can allocate params array according to the number of parameters received from the server.	2013-02-22 17:35:58 +01:00
Jiri Denemark	4dd00f4238	Introduce virDomainGetJobStats API This is an extensible version of virDomainGetJobInfo.	2013-02-22 17:35:58 +01:00
Jiri Denemark	94f59b9ece	qemu: Add support for compressed migration	2013-02-22 17:35:58 +01:00
Christophe Fergeau	2ed7e7eedc	conf: Don't leak 'ram' video property on error It's only freed on normal returns from virDomainVideoDefParseXML, but not when erroring out.	2013-02-22 09:31:19 +01:00
Christophe Fergeau	9879fce9f6	conf: Don't leak 'primary' video property on error It's only freed on normal returns from virDomainVideoDefParseXML, but not when erroring out.	2013-02-22 09:31:19 +01:00
Doug Goldstein	058273d096	interface: dev type support for bond interfaces Patch has been accepted into net-next's 3.9 queue to correctly expose bond interfaces with the 'bond' devtype.	2013-02-21 18:00:06 -06:00
Doug Goldstein	69ab26bd33	interface: add bond support to udev backend The udev backend now supports bond interfaces.	2013-02-21 18:00:06 -06:00
Doug Goldstein	2f272ad759	interface: Improve udev backend device type id Refactored the interface device type identification to make it more clear about the operations. Add support for udev devtype to detect VLANs on Linux 3.7 and newer. Move VLAN detection based on device name to fallback case.	2013-02-21 18:00:06 -06:00
Doug Goldstein	37099f0020	interface: Refactor interface vlan to helper func Mechanical move to break up udevIfaceGetIfaceDef() into different helpers for each of the interface types to hopefully make the code easier to follow. This moves the vlan code to udevIfaceGetIfaceDefVlan().	2013-02-21 17:31:41 -06:00
Doug Goldstein	f5f7f4fe4d	interface: udev bridge code error handling updates Based on feedback from Laine Stump, improve a number of the error handling cases to report the issue to the user instead of not generating data or giving vague errors. Added the bridge device name to every error message as well to make it clear which bridge failed.	2013-02-21 17:31:40 -06:00
Doug Goldstein	8a26ee04d8	interface: Refactor udev bridge to helper func Mechanical move to break up udevIfaceGetIfaceDef() into different helpers for each of the interface types to hopefully make the code easier to follow. This moves the bridge code to udevIfaceGetIfaceDefBridge().	2013-02-21 17:15:35 -06:00
Eric Blake	82d5fe5437	qemu: check backing chains even when cgroup is omitted https://bugzilla.redhat.com/show_bug.cgi?id=896685 points out a regression caused by commit `38c4a9c` - libvirt only labels the backing chain if the backing chain cache is populated, but the code to populate the cache was only conditionally performed if cgroup labeling was necessary. * src/qemu/qemu_cgroup.c (qemuSetupCgroup): Hoist cache setup... * src/qemu/qemu_process.c (qemuProcessStart): ...earlier into caller, where it is now unconditional.	2013-02-21 12:32:56 -07:00
Peter Krempa	f47b39719e	conf: Refactor ABI stability checking and break long lines Get rid of the "identical" variable in the ABI stability checks in favor of return statements and break or refactor very long lines where possible.	2013-02-21 11:04:35 +01:00
Peter Krempa	2a60ce323b	conf: Refactor cpumask handling Declare local variables at the start of the block and fix trivial formatting issues.	2013-02-21 11:04:34 +01:00
Peter Krempa	00a9da9b88	conf: Reformat many function headers in domain_conf.c Many of the headers were using the old style and even overflowing the 80 column mark.	2013-02-21 11:04:34 +01:00
Peter Krempa	6d35a84cc8	conf: Fix label naming in virDomainDefFormatInternal The label named "cleanup" was used in error cases only. Change it to "error".	2013-02-21 11:04:34 +01:00
Peter Krempa	1550d3165b	conf: Ensure that new devices are added to conf copy function Use the correct type and get rid of "default" label in switch to make the compiler complain if a new device type is added.	2013-02-21 11:04:34 +01:00
Peter Krempa	db07957646	qemu: Refactor error paths in virQEMUDriverCreateCapabilities Change the error label to "error" and simplify some error paths.	2013-02-21 11:04:34 +01:00
Peter Krempa	cdf0ba1c3a	conf: Improve core dump config error message The message didn't seem to be much helpful	2013-02-21 11:04:34 +01:00
Jiri Denemark	568a6cda27	qemu: Avoid deadlock in autodestroy Since closeCallbacks were turned into virObjectLockable, we can no longer call virQEMUCloseCallbacks APIs from within a registered close callback.	2013-02-21 10:38:28 +01:00
Jiri Denemark	3898ba7f2c	qemu: Turn closeCallbacks into virObjectLockable To avoid having to hold the qemu driver lock while iterating through close callbacks and calling them. This fixes a real deadlock when a domain which is being migrated from another host gets autodestoyed as a result of broken connection to the other host.	2013-02-21 10:27:24 +01:00
Guannan Ren	091831633f	qemu: fix an off-by-one error in qemuDomainGetPercpuStats The max value of number of cpus to compute(id) should not be equal or greater than max cpu number. The bug ocurrs when id value is equal to max cpu number which leads to the off-by-one error in the following for loop. # virsh cpu-stats guest --start 1 error: Failed to virDomainGetCPUStats() error: internal error cpuacct parse error	2013-02-21 11:27:35 +08:00
Osier Yang	5c9034bf05	qemu: Fix the memory leak Found by John Ferlan (coverity script)	2013-02-21 10:33:49 +08:00
John Ferlan	903f43ce6d	keepalive: Guard against integer overflow Don't allow interval to be > MAX_INT/1000 in virKeepAliveStart() Guard against possible overflow in virKeepAliveTimeout() by setting the timeout to be MAX_INT/1000 since the math following will multiply it by 1000.	2013-02-20 16:56:59 -05:00
Jim Fehlig	567779e51a	libxl: Fix setting of disk backend The libxl driver was setting the backend field of libxl_device_disk structure to LIBXL_DISK_BACKEND_TAP when the driver element of disk configuration was not specified. This needlessly forces the use of blktap driver, which may not be loaded in dom0 https://bugzilla.redhat.com/show_bug.cgi?id=912488 Ian Campbell suggested that LIBXL_DISK_BACKEND_UNKNOWN is a better default in this case https://www.redhat.com/archives/libvir-list/2013-February/msg01126.html	2013-02-20 14:52:10 -07:00
John Ferlan	2bff35d5bb	Remove a couple of misplaced VIR_FREE	2013-02-20 12:43:00 -05:00
Michal Privoznik	0eeedf52e7	qemu: Run lzop with '--ignore-warn' Currently, if lzop decompression binary produces a warning, it doesn't exit with zero status but 2 instead. Terrifying, but true. However, warnings may be ignored using '--ignore-warn' command line argument. Moreover, in which case, the exit status will be zero.	2013-02-20 18:10:01 +01:00
Sergey Fionov	1d8193ee8a	Fix memory leak in virNetClientIOWriteMessage Commit `18937c3ae0` introduced the memory leak when client->msg.fds is copied to thecall->msg and then never freed.	2013-02-20 17:56:35 +01:00
Osier Yang	d0172d2b1b	qemu: Remove the shared disk entry if the operation is ejecting or updating For both AttachDevice and UpdateDevice APIs, if the disk device is 'cdrom' or 'floppy', the operations could be ejecting, updating, and inserting. For either ejecting or updating, the shared disk entry of the original disk src has to be removed, because it's not useful anymore. And since the original disk def will be changed, new disk def passed as argument will be free'ed in qemuDomainChangeEjectableMedia, so we need to copy the orignal disk def before qemuDomainChangeEjectableMedia, to use it for qemuRemoveSharedDisk.	2013-02-21 00:31:24 +08:00
Osier Yang	0db7ff59cc	qemu: Move the shared disk adding and sgio setting prior to attaching The disk def could be free'ed by qemuDomainChangeEjectableMedia, which can thus cause crash if we reference the disk pointer. On the other hand, we have to remove the added shared disk entry from the table on error codepath.	2013-02-21 00:31:24 +08:00
Osier Yang	d0e4b76204	qemu: Update shared disk table when reconnecting qemu process	2013-02-21 00:31:24 +08:00
Osier Yang	a4504ac184	qemu: Record names of domain which uses the shared disk in hash table The hash entry is changed from "ref" to {ref, @domains}. With this, the caller can simply call qemuRemoveSharedDisk, without afraid of removing the entry belongs to other domains. qemuProcessStart will obviously benifit from it on error codepath (which calls qemuProcessStop to do the cleanup).	2013-02-21 00:31:24 +08:00
Osier Yang	371df778eb	qemu: Merge qemuCheckSharedDisk into qemuAddSharedDisk Based on moving various checking into qemuAddSharedDisk, this avoids the caller using it in wrong ways. Also this adds two new checking for qemuCheckSharedDisk (disk device not 'lun' and kernel doesn't support unpriv_sgio simply returns 0).	2013-02-21 00:31:24 +08:00
Osier Yang	dab878a861	qemu: Add checking in helpers for sgio setting This moves the various checking into the helpers, to avoid the callers missing the checking.	2013-02-21 00:31:24 +08:00
Eric Blake	6ea7b3e8d1	maint: enforce private symbol section sorting Automating a sorting check is the only way to ensure we don't regress. Suggested by Dan Berrange. * src/check-symsorting.pl (check_sorting): Add a parameter, validate that groups are in order, and that files exist. * src/Makefile.am (check-symsorting): Adjust caller. * src/libvirt_private.syms: Fix typo. * src/libvirt_linux.syms: Fix file name. * src/libvirt_vmx.syms: Likewise. * src/libvirt_xenxs.syms: Likewise. * src/libvirt_sasl.syms: Likewise. * src/libvirt_libssh2.syms: Likewise. * src/libvirt_esx.syms: Mention file name. * src/libvirt_openvz.syms: Likewise.	2013-02-20 08:27:03 -07:00
Jiri Denemark	69660042fb	qemu: Do not ignore mandatory features in migration cookie Due to "feature"/"features" nasty typo, any features marked as mandatory by one side of a migration are silently considered optional by the other side. The following is the code that formats mandatory features in migration cookie: for (i = 0 ; i < QEMU_MIGRATION_COOKIE_FLAG_LAST ; i++) { if (mig->flagsMandatory & (1 << i)) virBufferAsprintf(buf, " <feature name='%s'/>\n", qemuMigrationCookieFlagTypeToString(i)); }	2013-02-20 15:24:01 +01:00
Ján Tomko	bc28e56b35	qemu: switch PCI address alocation to use virDevicePCIAddress Some functions were using virDomainDeviceInfo where virDevicePCIAddress would suffice. Some were only using integers for slots and functions, assuming the bus numbers are always 0. Switch from virDomainDeviceInfoPtr to virDevicePCIAddressPtr: qemuPCIAddressAsString qemuDomainPCIAddressCheckSlot qemuDomainPCIAddressReserveAddr qemuDomainPCIAddressReleaseAddr Switch from int slot to virDevicePCIAddressPtr: qemuDomainPCIAddressReserveSlot qemuDomainPCIAddressReleaseSlot qemuDomainPCIAddressGetNextSlot Deleted functions (they would take the same parameters as ReserveAddr/ReleaseAddr do now.) qemuDomainPCIAddressReserveFunction qemuDomainPCIAddressReleaseFunction	2013-02-20 13:57:59 +01:00
Eric Blake	f190a636e5	maint: sort private syms to reflect recent header renames Purely mechanical (roughly, s/\n/~/; s/~~/\n/; sort by line; s/~/\n/) * src/libvirt_private.syms: Sort sections by header file name.	2013-02-19 14:59:05 -07:00
Eric Blake	8a256f3cb4	maint: fix header file owners of private symbols Recent renames were not reflected into the comments of libvirt_private.syms; furthermore, since we mix private headers from several directories into this file, knowing where the file lives can be helpful. * src/libvirt_private.sym: Reflect recent names.	2013-02-19 14:59:05 -07:00
Natanael Copa	bac8b2ca09	net: use structs for address and port ranges We pass over the address/port start/end values many times so we put them in structs. Signed-off-by: Natanael Copa <ncopa@alpinelinux.org> Signed-off-by: Laine Stump <laine@laine.org>	2013-02-19 14:42:18 -05:00
Natanael Copa	1716e7a6c5	net: add support for specifying port range for forward mode nat Let users set the port range to be used for forward mode NAT: ... <forward mode='nat'> <nat> <port start='1024' end='65535'/> </nat> </forward> ... Signed-off-by: Natanael Copa <ncopa@alpinelinux.org> Signed-off-by: Laine Stump <laine@laine.org>	2013-02-19 14:42:18 -05:00
Natanael Copa	905629f47e	net: support set public ip range for forward mode nat Support setting which public ip to use for NAT via attribute address in subelement <nat> in <forward>: ... <forward mode='nat'> <address start='1.2.3.4' end='1.2.3.10'/> </forward> ... This will construct an iptables line using: '-j SNAT --to-source <start>-<end>' instead of: '-j MASQUERADE' Signed-off-by: Natanael Copa <ncopa@alpinelinux.org> Signed-off-by: Laine Stump <laine@laine.org>	2013-02-19 14:42:18 -05:00
Jiri Denemark	5d6f636764	qemu: Use atomic ops for driver->nactive	2013-02-19 19:11:23 +01:00
Jiri Denemark	921af42980	rpc: Avoid deadlock when closing client connection We need to drop the server lock before calling virObjectUnlock(client) since in case we had the last reference to the client, its dispose callback would be called and that could possibly try to lock the server and cause a deadlock. This is exactly what happens when there is only one QEMU domain running and it is marked to be autodestroyed when the connection dies. This results in qemuProcessAutoDestroy -> qemuProcessStop -> virNetServerRemoveShutdownInhibition call sequence, where the last function locks the server.	2013-02-19 10:52:37 +01:00
Jiri Denemark	ee1d6d91d1	Avoid resetting errors in virTypedParamsFree The function does not report any errors so there should be no need too reset an existing error first. Moreover, virTypedParamsFree is mostly called in cleanup phase where it has the potential to reset any useful reported earlier.	2013-02-19 10:52:37 +01:00
Eric Blake	e086deda3c	build: force correct gcc syntax for attribute_nonnull Gcc lets you do: int ATTRIBUTE_NONNULL(1) foo(void param); int foo(void param) ATTRIBUTE_NONNULL(1); int ATTRIBUTE_NONNULL(1) foo(void param) { ... } but chokes on: int foo(void param) ATTRIBUTE_NONNULL(1) { ... } However, since commit `eefb881`, we have intentionally been disabling ATTRIBUTE_NONNULL because of lame gcc handling of the attribute (that is, gcc doesn't do decent warning reporting, then compiles code that mysteriously fails if you break the contract of the attribute, which is surprisingly easy to do), leaving it on only for Coverity (which does a much better job of improved static analysis when the attribute is present). But completely eliding the macro makes it too easy to write code that uses the fourth syntax option, if you aren't using Coverity. So this patch forces us to avoid syntax errors, even when not using the attribute under gcc. It also documents WHY we disable the warning under gcc, rather than forcing you to find the commit log. * src/internal.h (ATTRIBUTE_NONNULL): Expand to empty attribute, rather than nothing, when on gcc.	2013-02-18 14:17:40 -07:00
Guido Günther	272be1a840	qemu: pass "-1" as uid/gid for unprivileged qemu so we don't try to change uid/git to 0 when probing capabilities.	2013-02-18 12:08:38 -06:00
Doug Goldstein	41046256fe	Add capabilities bit for -no-kvm-pit-reinjection The conversion to qemuCaps dropped the ability with qemu{,-kvm} 1.2 and newer to set the lost tick policy for the PIT. While the -no-kvm-pit-reinjection option is depreacated, it is still supported at least through 1.4, it is better to not lose the functionality.	2013-02-18 12:03:52 -06:00
Doug Goldstein	5eb621fcb3	interface: fix udev backend use after free udevIfaceListAllInterface() used the udev_device after it had its ref count decremented which results in a use after free issue.	2013-02-18 08:59:23 -06:00
John Ferlan	676688b69b	security: Remove unnecessary checks for mgr == NULL Coverity found the DACGenLabel was checking for mgr == NULL after a possible dereference; however, in order to get into the function the virSecurityManagerGenLabel would have already dereferenced sec_managers[i] so the check was unnecessary. Same check is made in SELinuxGenSecurityLabel.	2013-02-16 07:44:35 -05:00
John Ferlan	277aaeeebf	vircommand: Remove unnecessary sa_assert Changes from commit '3178df9a' removed the need for the sa_assert(infd).	2013-02-16 07:44:35 -05:00
Stefan Berger	b7d00de2bd	Fix libvirt upgrade path when nwfilter is used Between revision `65fb9d49` and before this patch, an upgrade of libvirt while VMs are running and instantiating iptables filtering rules due to nwfilter rules, may leave stray iptables rules behind when shutting VMs down. Left-over iptables rules may look like this: Chain FP-vnet0 (1 references) target prot opt source destination DROP tcp -- 0.0.0.0/0 0.0.0.0/0 tcp spt:122 ACCEPT all -- 0.0.0.0/0 0.0.0.0/0 [...] Chain libvirt-out (1 references) target prot opt source destination FO-vnet0 all -- 0.0.0.0/0 0.0.0.0/0 [goto] PHYSDEV match --physdev-out vnet0 The reason is that the recent nwfilter code only removed filtering rules in the libvirt-out chain that contain the --physdev-is-bridged parameter. Older rules didn't match and were not removed. Note that the user-defined chain FO-vnet0 could not be removed due to the reference from the rule in libvirt-out. Often the work around may be done through service iptables restart kill -SIGHUP $(pidof libvirtd) This patch now also removes older libvirt versions' iptables rules. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2013-02-15 21:33:37 -05:00
Eric Blake	d1333dd0fb	storage: don't follow backing chain symlinks too eagerly If you have a qcow2 file /path1/to/file pointed to by symlink /path2/symlink, and pass qemu /path2/symlink, then qemu treats a relative backing file in the qcow2 metadata as being relative to /path2, not /path1/to. Yes, this means that it is possible to create a qcow2 file where the choice of WHICH directory and symlink you access its contents from will then determine WHICH backing file (if any) you actually find; the results can be rather screwy, but we have to match what qemu does. Libvirt and qemu default to creating absolute backing file names, so most users don't hit this. But at least VDSM uses symlinks and relative backing names alongside the --reuse-external flags to libvirt snapshot operations, with the result that libvirt was failing to follow the intended chain of backing files, and then backing files were not granted the necessary sVirt permissions to be opened by qemu. See https://bugzilla.redhat.com/show_bug.cgi?id=903248 for more gory details. This fixes a regression introduced in commit `8250783`. I tested this patch by creating the following chain: ls /home/eblake/Downloads/Fedora.iso # raw file for base cd /var/lib/libvirt/images qemu-img create -f qcow2 \ -obacking_file=/home/eblake/Downloads/Fedora.iso,backing_fmt=raw one mkdir sub cd sub ln -s ../one onelink qemu-img create -f qcow2 \ -obacking_file=../sub/onelink,backing_fmt=qcow2 two mv two .. ln -s ../two twolink qemu-img create -f qcow2 \ -obacking_file=../sub/twolink,backing_fmt=qcow2 three mv three .. ln -s ../three threelink then pointing my domain at /var/lib/libvirt/images/sub/threelink. Prior to this patch, I got complaints about missing backing files; afterwards, I was able to verify that the backing chain (and hence DAC and SELinux relabels) of the entire chain worked. * src/util/virstoragefile.h (_virStorageFileMetadata): Add directory member. * src/util/virstoragefile.c (absolutePathFromBaseFile): Drop, replaced by... (virFindBackingFile): ...better function. (virStorageFileGetMetadataInternal): Add an argument. (virStorageFileGetMetadataFromFD, virStorageFileChainLookup) (virStorageFileGetMetadata): Update callers.	2013-02-15 16:07:01 -07:00
Eric Blake	2485f92153	storage: refactor metadata lookup Prior to this patch, we had the callchains: external users \-> virStorageFileGetMetadataFromFD \-> virStorageFileGetMetadataFromBuf virStorageFileGetMetadataRecurse \-> virStorageFileGetMetadataFromFD \-> virStorageFileGetMetadataFromBuf However, a future patch wants to add an additional parameter to the bottom of the chain, for use by virStorageFileGetMetadataRecurse, without affecting existing external callers. Since there is only a single caller of the internal function, we can repurpose it to fit our needs, with this patch giving us: external users \-> virStorageFileGetMetadataFromFD \-> virStorageFileGetMetadataInternal virStorageFileGetMetadataRecurse / \-> virStorageFileGetMetadataInternal * src/util/virstoragefile.c (virStorageFileGetMetadataFromFD): Move most of the guts... (virStorageFileGetMetadataFromBuf): ...here, and rename... (virStorageFileGetMetadataInternal): ...to this. (virStorageFileGetMetadataRecurse): Use internal helper.	2013-02-15 16:07:00 -07:00
Eric Blake	b7df4f92d6	storage: prepare for refactoring virStorageFileGetMetadataFromFD is the only caller of virStorageFileGetMetadataFromBuf; and it doesn't care about the difference between a return of 0 (total success) or 1 (metadata was inconsistent, but pointer was populated as best as possible); only about a return of -1 (could not read metadata or out of memory). Changing the return type, and normalizing the variable names used, will make merging the functions easier in the next commit. * src/util/virstoragefile.c (virStorageFileGetMetadataFromBuf): Change return value, and rename some variables. (virStorageFileGetMetadataFromFD): Rename some variables.	2013-02-15 16:07:00 -07:00
Eric Blake	5e4946d4d9	storage: rearrange functions No semantic change; done so the next patch doesn't need a forward declaration of a static function. * src/util/virstoragefile.c (virStorageFileProbeFormatFromBuf): Hoist earlier.	2013-02-15 16:07:00 -07:00
Eric Blake	c51c3e45d5	build: more mingw fixes More mingw build failures: CCLD libvirt-lxc.la /usr/lib64/gcc/i686-w64-mingw32/4.7.2/../../../../i686-w64-mingw32/bin/ld: cannot find libvirt_lxc.def: No such file or directory CC virportallocatortest-virportallocatortest.o ../../tests/virportallocatortest.c: In function 'main': ../../tests/virportallocatortest.c:195:1: error: implicit declaration of function 'setenv' [-Werror=implicit-function-declaration] * src/Makefile.am (GENERATED_SYM_FILES): Also generate libvirt_lxc.def. * bootstrap.conf (gnulib_modules): Import setenv.	2013-02-15 15:45:52 -07:00
Eric Blake	660db5bf72	build: fix mingw build Commits `2025356` and `ba72cb12` introduced typos. * src/util/virpci.c (virPCIIsVirtualFunction) [!__linux__]: Fix function name. * src/util/virutil.c (virGetDeviceID): Fix attribute spelling.	2013-02-15 15:05:25 -07:00
Eric Blake	ec2cc0f860	build: fix vircommand build on mingw CC libvirt_util_la-vircommand.lo ../../src/util/vircommand.c:2358:1: error: 'virCommandHandshakeChild' defined but not used [-Werror=unused-function] The function is only implemented inside #ifndef WIN32. * src/util/vircommand.c (virCommandHandshakeChild): Hoist earlier, so that win32 build doesn't hit an unused forward declaration.	2013-02-15 13:16:46 -07:00
Eric Blake	ec8a2d0327	regex: gnulib guarantees that we have regex support No need to use HAVE_REGEX_H - our use of gnulib guarantees that the header exists and works, regardless of platform. Similarly, we can unconditionally assume a compiling <sys/wait.h> (although the mingw version of this header is not full-featured). * src/storage/storage_backend.c: Drop useless conditional. * tests/testutils.c: Likewise.	2013-02-15 13:01:23 -07:00
Jiri Denemark	2e5d7798df	vbox: Don't overwrite errors returned by virDomainDeviceDefParse	2013-02-14 09:05:36 +01:00
Laine Stump	7a2e845a86	util: maintain caps when running command with uid != 0 virCommand was previously calling virSetUIDGID() to change the uid and gid of the child process, then separately calling virSetCapabilities(). This did not work if the desired uid was != 0, since a setuid to anything other than 0 normally clears all capabilities bits. The solution is to use the new virSetUIDGIDWithCaps(), sending it the uid, gid, and capabilities bits. This will get the new process setup properly. Since the static functions virSetCapabilities() and virClearCapabilities are no longer called, they have been removed. NOTE: When combined with "filecap $path-to-qemu sys_rawio", this patch will make CAP_SYS_RAWIO (which is required for passthrough of generic scsi commands to a guest - see commits `e8daeeb`, `177db08`, `397e6a7`, and `74e0349`) be retained by qemu when necessary. Apparently that capability has been broken for non-root qemu ever since it was originally added.	2013-02-13 16:11:16 -05:00
Laine Stump	e11451f42e	util: virSetUIDGIDWithCaps - change uid while keeping caps Normally when a process' uid is changed to non-0, all the capabilities bits are cleared, even those explicitly set with calls to capng_update()/capng_apply() made immediately before setuid. And after the process' uid has been changed, it no longer has the necessary privileges to add capabilities back to the process. In order to set a non-0 uid while still maintaining any capabilities bits, it is necessary to either call capng_change_id() (which unfortunately doesn't currently call initgroups to setup auxiliary group membership), or to perform the small amount of calisthenics contained in the new utility function virSetUIDGIDWithCaps(). Another very important difference between the capabilities setting/clearing in virSetUIDGIDWithCaps() and virCommand's virSetCapabilities() (which it will replace in the next patch) is that the new function properly clears the capabilities bounding set, so it will not be possible for a child process to set any new capabilities. A short description of what is done by virSetUIDGIDWithCaps(): 1) clear all capabilities then set all those desired by the caller (in capBits) plus CAP_SETGID, CAP_SETUID, and CAP_SETPCAP (which is needed to change the capabilities bounding set). 2) call prctl(), telling it that we want to maintain current capabilities across an upcoming setuid(). 3) switch to the new uid/gid 4) again call prctl(), telling it we will no longer want capabilities maintained if this process does another setuid(). 5) clear the capabilities that we added to allow us to setuid/setgid/change the bounding set (unless they were also requested by the caller via the virCommand API). Because the modification/maintaining of capabilities is intermingled with setting the uid, this is necessarily done in a single function, rather than having two independent functions. Note that, due to the way that effective capabilities are computed (at time of execve) for a process that has uid != 0, the file capabilities of the binary being executed must also have the desired capabilities bit(s) set (see "man 7 capabilities"). This can be done with the "filecap" command. (e.g. "filecap /usr/bin/qemu-kvm sys_rawio").	2013-02-13 16:11:16 -05:00
Laine Stump	c0e3e685cd	util: drop capabilities immediately after changing uid/gid of child This is an interim measure to make sure everything still works in this order. The next step will be to perform capabilities drop and setuid/gid as a single operation (which is the only way to keep any capabilities when switching to a non-root uid).	2013-02-13 16:11:16 -05:00
Laine Stump	0345c7281b	qemu: let virCommand set child process security labels/uid/gid The qemu driver had been calling virSecurityManagerSetProcessLabel() from a "pre-exec hook" function that is run after the child is forked, but before exec'ing qemu. This is problematic because the uid and gid of the child are set by the security driver, but capabilities are dropped by virCommand - such separation doesn't work; the two operations must be done together or the capabilities do not transfer properly to the child process. This patch switches to using virSecurityManagerSetChildProcessLabel(), which is called prior to virCommandRun() (rather than being called during virCommandrun() by the hook function), and doesn't set the UID/GID/security label directly, but instead merely informs virCommand what it should set them all to when the time is appropriate. This lets virCommand choose to do the uid/gid and caps dropping all at the same time if it wants (it does want to, but isn't doing so yet; that's for an upcoming patch).	2013-02-13 16:11:16 -05:00
Laine Stump	7bf1aa0b9b	security: add new virSecurityManagerSetChildProcessLabel API The existing virSecurityManagerSetProcessLabel() API is designed so that it must be called after forking the child process, but before exec'ing the child. Due to the way the virCommand API works, that means it needs to be put in a "hook" function that virCommand is told to call out to at that time. Setting the child process label is a basic enough need when executing any process that virCommand should have a method of doing that. But virCommand must be told what label to set, and only the security driver knows the answer to that question. The new virSecurityManagerSetChildProcessLabel() API is the way to transfer the knowledge about what label to set from the security driver to the virCommand object. It is given a virCommandPtr, and each security driver calls the appropriate virCommand* API to tell virCommand what to do between fork and exec. 1) in the case of the DAC security driver, it calls virCommandSetUID/GID() to set a uid and gid that must be set for the child process. 2) for the SELinux security driver, it calls virCommandSetSELinuxLabel() to save a copy of the char* that will be sent to setexeccon_raw() after forking the child process. 3) for the AppArmor security drivers, it calls virCommandSetAppArmorProfile() to save a copy of the char* that will be sent to aa_change_profile() after forking the child process. With this new API in place, we will be able to remove virSecurityManagerSetProcessLabel() from any virCommand pre-exec hooks. (Unfortunately, the LXC driver uses clone() rather than virCommand, so it can't take advantage of this new security driver API, meaning that we need to keep around the older virSecurityManagerSetProcessLabel(), at least for now.)	2013-02-13 16:11:16 -05:00
Laine Stump	6c3f3d0d89	util: add security label setting to virCommand virCommand gets two new APIs: virCommandSetSELinuxLabel() and virCommandSetAppArmorProfile(), which both save a copy of a null-terminated string in the virCommand. During virCommandRun, if the string is non-NULL and we've been compiled with AppArmor and/or SELinux security driver support, the appropriate security library function is called for the child process, using the string that was previously set. In the case of SELinux, setexeccon_raw() is called, and for AppArmor, aa_change_profile() is called. This functionality has been added so that users of virCommand can use the upcoming virSecurityManagerSetChildProcessLabel() prior to running a child process, rather than needing to setup a hook function to be called (and in turn call virSecurityManagerSetProcessLabel()) during the setup of the child process.	2013-02-13 16:11:15 -05:00
Laine Stump	4a56e80fa5	build: define SECDRIVER_LIBS in Makefile.am This makes it simpler to include the necessary system security driver libraries for a particular system. For this patch, several existing conditional sections from the Makfile were replaced; I'll later be adding SECDRIVER_LIBS to libvirt_util_la_LIBADD, because vircommand.c will be calling a function from $securitylib.	2013-02-13 16:11:15 -05:00
Laine Stump	dd5bb43962	qemu: replace exec hook with virCommandSetUID/GID in storage_backend	2013-02-13 16:11:15 -05:00
Laine Stump	6a8ecc373e	qemu: replace exec hook with virCommandSetUID/GID in qemuCaps* Setting the uid/gid of the child process was the only thing done by the hook function in this case, and that can now be done more simply with virCommandSetUID/GID.	2013-02-13 16:11:15 -05:00
Laine Stump	f506a4c115	util: make virSetUIDGID a NOP only when uid or gid is -1 Rather than treating uid:gid of 0:0 as a NOP, we blindly pass that through to the lower layers. However, we do check for a requested value of "-1" to mean "don't change this setting". setregid() and setreuid() already interpret -1 as a NOP, so this is just an optimization, but we are also calling getpwuid_r and initgroups, and it's unclear what the former would do with a uid of -1.	2013-02-13 16:11:15 -05:00
Laine Stump	417182b072	util: add virCommandSetUID and virCommandSetGID If a uid and/or gid is specified for a command, it will be set just after the user-supplied post-fork "hook" function is called. The intent is that this can replace user hook functions that set uid/gid. This moves the setting of uid/gid and dropping of capabilities closer to each other, which is important since the two should really be done at the same time (libcapng provides a single function that does both, which we will be unable to use, but want to mimic as closely as possible).	2013-02-13 16:11:15 -05:00
Laine Stump	ad5cb11be6	util: refactor virCommandHook into virExec and virCommandHandshakeChild	2013-02-13 16:11:15 -05:00
Laine Stump	5f2ce53984	util: eliminate extra args from virExec All args except "cmd" in the call to virExec are now redundant, since they can all be found in cmd, so remove the args and reference the data directly in cmd. One exception to this is that "infd" was being modified within virExec, and modifying the original in cmd caused make check failures, so cmd->infd is copied to a local, and the local is used during virExec().	2013-02-13 16:11:15 -05:00
Laine Stump	b6decc57b1	util: eliminate generic hook from virExecWithHook virExecWithHook is only called from one place, so it always has the same "hook" function (virHookCommand), and the data sent to that function is always a virCommandPtr, so eliminate the function and generic data from the arglist, and replace it with "virCommandPtr cmd". The call to (hook)(data) is replaced with "virHookCommand(cmd)". Finally, virExecWithHook is renamed to virExec. Indentation has been updated only for code that will remain after the next patch, which will remove all other args to virExec (since they are now redundant, as they're all members of virCommandPtr).	2013-02-13 16:11:15 -05:00
Daniel P. Berrange	a9e97e0c30	Remove qemuDriverLock from almost everywhere With the majority of fields in the virQEMUDriverPtr struct now immutable or self-locking, there is no need for practically any methods to be using the QEMU driver lock. Only a handful of helper APIs in qemu_conf.c now need it	2013-02-13 11:10:30 +00:00
Michal Privoznik	3178df9afa	virCommand: Don't misuse the eventloop for async IO Currently, if a command wants to do asynchronous IO, a callback is registered in the libvirtd eventloop to handle writes and reads. However, there's a race in virCommandWait. The eventloop may already be executing the callback, while virCommandWait is mangling internal state of virCommand. To deal with it, we need to either introduce locking or spawn a separate thread where we poll() on stdio from child. The former, however, requires to unlock all mutexes held, as the event loop may execute other callbacks which tries to lock one of the mutexes, deadlock and thus never wake us up. So it's safer to spawn a separate thread.	2013-02-13 09:54:19 +01:00
Eric Blake	069b5c5a58	xen: clean up the mess with cpumap Commit `8b55992f` added some Coverity comments to silence what was a real bug in the code. Since then, we've had a miserable run of trying to fix the underlying problem (commits `c059cde` and `ba5193c`), and still have a problem on 32-bit machines. This fixes the problem for once and for all, by realizing that on older xen, cpumap_t is identical to uint64_t, and using the new virendian.h to do the transformation from the API (documented to be little-endian) to the host structure. * src/xen/xen_hypervisor.c (virXen_setvcpumap): Do the conversion correctly. Finally.	2013-02-12 09:00:17 -07:00
Eric Blake	731ad69240	util: use new virendian.h macros This makes code easier to read, by avoiding lines longer than 80 columns and removing the repetition from the callers. * src/util/virstoragefile.c (qedGetHeaderUL, qedGetHeaderULL): Delete in favor of more generic macros. (qcow2GetBackingStoreFormat, qcowXGetBackingStore) (qedGetBackingStore, virStorageFileMatchesVersion) (virStorageFileGetMetadataInternal): Use new macros. * src/cpu/cpu_x86.c (x86VendorLoad): Likewise.	2013-02-12 09:00:17 -07:00
Eric Blake	c6f1060ca7	util: add virendian.h macros We have several cases where we need to read endian-dependent data regardless of host endianness; rather than open-coding these call sites, it will be nicer to funnel things through a macro. The virendian.h file can be expanded to add writer functions, and/or 16-bit access patterns, if needed. Also, if we need to turn things into a function to avoid multiple evaluations of buf, that can be done later. But for now, a macro worked. * src/util/virendian.h: New file. * src/Makefile.am (UTIL_SOURCES): Ship it. * tests/virendiantest.c: New test. * tests/Makefile.am (test_programs, virendiantest_SOURCES): Run the test. * .gitignore: Ignore built file.	2013-02-12 09:00:15 -07:00
Daniel P. Berrange	d1c7b00b60	Release VM lock before acquiring virDomainObjListPtr lock When removing a VM from the virDomainObjListPtr, we must not be holding the VM lock while acquiring the list lock. Re-order code to ensure that we can release the VM lock early.	2013-02-12 11:06:03 +00:00
Daniel P. Berrange	61b52d2e38	Fix potential deadlock across fork() in QEMU driver The hook scripts used by virCommand must be careful wrt accessing any mutexes that may have been held by other threads in the parent process. With the recent refactoring there are 2 potential flaws lurking, which will become real deadlock bugs once the global QEMU driver lock is removed. Remove use of the QEMU driver lock from the hook function by passing in the 'virQEMUDriverConfigPtr' instance directly. Add functions to the virSecurityManager to be invoked before and after fork, to ensure the mutex is held by the current thread. This allows it to be safely used in the hook script in the child process. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-02-12 11:05:31 +00:00
Eric Blake	736a87b916	build: fix compilation of selinux on RHEL 5 On RHEL 5, I got: security/security_selinux.c: In function 'getContext': security/security_selinux.c:971: warning: unused parameter 'mgr' [-Wunused-parameter] * src/security/security_selinux.c (getContext): Mark potentially unused parameter.	2013-02-11 17:12:25 -07:00
Viktor Mihajlovski	0bbbd42c30	S390: domain_conf support for CCW Add necessary handling code for the new s390 CCW address type to virDomainDeviceInfo. Further, introduce memory management, XML parsing, output formatting and range validation for the new virDomainDeviceCCWAddress type. Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com>	2013-02-11 16:38:21 -07:00
Laine Stump	6627ea3cbf	build: fix make check of remote_protocol-structs Broken by incorrect formatting / spelling of remote_nonnull in commit `39758e7567`	2013-02-11 15:34:49 -05:00
John Ferlan	ba5193c8d8	hypervisor: Restore pm initialization Adjustment for 'c059cdeaf' due to older compiler complaint about pm not being initialized even though the j&7 == 0 does the trick.	2013-02-12 00:32:57 +08:00
Osier Yang	fb2e465362	nodedev: Implement virNodeDeviceLookupSCSIHostByWWN This just simply changes nodeDeviceLookupByWWN to be not static, and its name into nodeDeviceLookupSCSIHostByWWN. And use that for udev and HAL backends.	2013-02-12 00:23:57 +08:00
Osier Yang	39758e7567	remote: Wire up the remote protocol Like virNodeDeviceCreateXML, virNodeDeviceLookupSCSIHostByWWN has to be treated specially when generating the RPC codes. Also new rules are added in fixup_name to keep the name SCSIHostByWWN.	2013-02-12 00:23:57 +08:00
Osier Yang	efed366eb7	Introduce API virNodeDeviceLookupSCSIHostByWWN Since the name (like scsi_host10) is not stable for vHBA, (it can be changed either after recreating or system rebooting), current API virNodeDeviceLookupByName is not nice to use for management app in this case. (E.g. one wants to destroy the vHBA whose name has been changed after system rebooting, he has to find out current name first). Later patches will support the persistent vHBA via storage pool, with which one can identify the vHBA stably by the wwnn && wwpn pair. So this new API comes.	2013-02-12 00:23:57 +08:00

1 2 3 4 5 ...

8934 Commits