libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-12-26 07:36:19 +00:00

Author	SHA1	Message	Date
Nehal J Wani	969493f91d	Fix memory leak in virSCSIDeviceListDel() While running virscsitest, it was found that valgrind pointed out the following memory leak: ==320== 5 bytes in 1 blocks are definitely lost in loss record 4 of 37 ==320== at 0x4A069EE: malloc (vg_replace_malloc.c:270) ==320== by 0x3E6CE81171: strdup (strdup.c:43) ==320== by 0x4CB28DF: virStrdup (virstring.c:554) ==320== by 0x4CAC987: virSCSIDeviceSetUsedBy (virscsi.c:289) ==320== by 0x402321: test2 (virscsitest.c:100) ==320== by 0x403231: virtTestRun (testutils.c:199) ==320== by 0x402121: mymain (virscsitest.c:180) ==320== by 0x4039AD: virtTestMain (testutils.c:782) ==320== by 0x3E6CE1ED1C: (below main) (libc-start.c:226) ==320== Introduced by commit `fd243fc`. Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-26 11:41:40 +01:00
Michal Privoznik	c0d162c68c	virNetDevVethCreate: Serialize callers Consider dozen of LXC domains, each of them having this type of interface: <interface type='network'> <mac address='52:54:00:a7:05:4b'/> <source network='default'/> </interface> When starting these domain in parallel, all workers may meet in virNetDevVethCreate() where a race starts. Race over allocating veth pairs because allocation requires two steps: 1) find first nonexistent '/sys/class/net/vnet%d/' 2) run 'ip link add ...' command Now consider two threads. Both of them find N as the first unused veth index but only one of them succeeds allocating it. The other one fails. For such cases, we are running the allocation in a loop with 10 rounds. However this is very flaky synchronization. It should be rather used when libvirt is competing with other process than when libvirt threads fight each other. Therefore, internally we should use mutex to serialize callers, and do the allocation in loop (just in case we are competing with a different process). By the way we have something similar already since `1cf97c87`. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-26 08:50:47 +01:00
Eric Blake	fa2e4dbfd6	build: fix cgroups on non-Linux Running ./autobuild.sh detected a mingw failure: CCLD libvirt.la Cannot export virCgroupGetPercpuStats: symbol not defined Cannot export virCgroupSetOwner: symbol not defined * src/util/vircgroup.c (virCgroupGetPercpuStats) (virCgroupSetOwner): Implement stubs. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-25 17:38:46 -07:00
Jim Fehlig	4d975deddd	libxl: queue domain event earlier in shutdown handler The shutdown handler may restart a domain when handling a reboot event or when <on_*> is set to 'restart'. Restarting consists of calling libxlVmCleanup followed by libxlVmStart. libxlVmStart will emit a VIR_DOMAIN_EVENT_STARTED event, but the SHUTDOWN event is not emitted until exiting the shutdown handler, after the STARTED event. This patch changes the logic a bit to queue the event at the start of the shutdown action, ensuring it is queued before any subsequent events that may be generated while executing the shutdown action. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-25 10:54:04 -07:00
Laine Stump	2122cf3979	network: include plugged interface XML in "plugged" network hook The network hook script gets called whenever an interface is plugged into or unplugged from a network, but even though the full XML of both the network and the domain is included, there is no reasonable way to determine what exact resources the plugged interface is using: 1) Prior to a recent patch which modified the status XML of interfaces to include the information about actual hardware resources used, it would be possible to scan through the domain XML output sent to the hook, and from there find the correct interface, but that interface definition would not include any runtime info (e.g. bandwidth or vlan taken from a portgroup, or which physdev was used in case of a macvtap network). 2) After the patch modifying the status XML of interfaces, the network name would no longer be included in the domain XML, so it would be completely impossible to determine which interface was the one being plugged. To solve that problem, this patch includes a single <interface> element at the beginning of the XML sent to the network hook for "plugged" and "unplugged" (just inside <hookData>) that is the status XML of the interface being plugged. This XML will include all info gathered from the chosen network and portgroup. NB: due to hardcoded spaces in all of the device Format() functions, the <interface> element inside the <hookData> will be indented by 6 spaces rather than 2. I had intended to fix this, but it turns out that to make virDomainNetDefFormat() indentation relative, I would have to do the same to virDomainDeviceInfoFormat(), and that function is called from 19 places - making that a prerequisite of this patch would cause too many merge difficulties if we needed to backport network hooks, so I chose to ignore the problem here and fix the problem for all* devices in a followup later.	2014-02-25 16:07:36 +02:00
Laine Stump	7d5bf48474	conf: output actual netdev status in <interface> XML Until now, the "live" XML status of an <interface type='network'> device would always show the network information, rather than the exact hardware device that was used. It would also show the name of any portgroup the interface belonged to, rather than providing the configuration that was derived from that portgroup. As an example, given the following network definition: [A] <network> <name>testnet</name> <forward type='bridge' dev='p4p1_0'> <interface dev='p4p1_0'/> <interface dev='p4p1_1'/> <interface dev='p4p1_2'/> <interface dev='p4p1_3'/> </forward> <portgroup name='admin'> <bandwidth> <inbound average='1000' peak='5000' burst='1024'/> <outbound average='128' peak='256' burst='256'/> </bandwidth> </portgroup> </network> and the following domain <interface>: [B] <interface type='network'> <source network='testnet' portgroup='admin'/> </interface> the output of "virsh dumpxml $domain" while the domain was running would yield something like this: [C] <interface type='network'> <source network='testnet' portgroup='admin'/> <target dev='macvtap0'/> <alias name='net0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x03' function='0x0'/> </interface> In order to learn the exact bandwidth information of the interface, a management application would need to retrieve the XML for testnet, then search for the portgroup named "admin". Even worse, there was no simple and standard way to learn which host physdev the macvtap0 device is attached to. Internally, libvirt has always kept this information in the virDomainDef that is held in memory, as well as storing it in the (libvirt-internal-only) domain status XML (in /var/run/libvirt/qemu/$domain.xml). In order to not confuse the runtime "actual state" with the config of the device, it's internally stored like this: [D] <interface type='network'> <source network='testnet' portgroup='admin'/> <actual type='direct'> <source dev='p4p1_0' mode='bridge'/> <bandwidth> <inbound average='1000' peak='5000' burst='1024'/> <outbound average='128' peak='256' burst='256'/> </bandwidth> </actual> <target dev='macvtap0'/> <alias name='net0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x03' function='0x0'/> </interface> This was never exposed outside of libvirt though, because I thought it would be too awkward for a management application to need to look in two places for the same information, but I also wasn't sure that it would be okay to overwrite the config info (in this case "<source network='testnet' portgroup='admin'/>") with the actual runtime info (everything inside <actual> above). Now we have a need for this information to be made available to management applications (in particular, so that a network "plugged" hook will have full information about the device that is being plugged in), so it's time to take the leap and decide that it is acceptable for the config info to be replaced with actual runtime state (but only when reporting domain live status, not when saving state in /var/run/libvirt/qemu/$domain.xml - that remains the same so that there is no loss of information). That is what this patch does - once applied, the output of "virsh dumpxml $domain" when the domain is running will contain something like this: [E] <interface type='direct'> <source dev='p4p1_0' mode='bridge'/> <bandwidth> <inbound average='1000' peak='5000' burst='1024'/> <outbound average='128' peak='256' burst='256'/> </bandwidth> <target dev='macvtap0'/> <alias name='net0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x03' function='0x0'/> </interface> In effect, everything that is internally stored within <actual> is moved up a level to where a management application will expect it. This means that the management application will only look in a single place to learn - the type of interface in use, the name of the physdev (if relevant), the <bandwidth>, <vlan>, and <virtualport> settings in use. The potential downside is that a management app looking at this output will not see that the physdev 'p4p1_0' was actually allocated from the network named 'testnet', or that the bandwidth numbers were taken from the portgroup 'admin'. However, if they are interested in that info, they can always get the "inactive" XML for the domain. An example of where this could cause problems is in virt-manager's network device display, which shows the status of the device, but allows you to edit that status info and save it as the new config. Previously virt-manager would always display the information in example [C] above, and allow editing that. With this patch, it will instead display what is in [E] and allow editing it directly, which could lead to some confusion. I would suggest that virt-manager have an "edit" button which would change the display from the "live" xml to the "inactive" xml, so that editing would be done on that; such a change would both handle the new situation, and also be compatible with older releases.	2014-02-25 16:06:43 +02:00
Laine Stump	9da98aa5e1	conf: new function virDomainActualNetDefContentsFormat This function is currently only called from one place, but in a subsequent patch will be called from a 2nd place. The new function exactly replicates the original behavior of the part of virDomainActualNetDefFormat() that it replaces, but takes a virDomainNetDefPtr instead of virDomainActualNetDefPtr, and uses the virDomainNetGetActual*() functions whenever possible, rather than reaching into def->data.network.actual - this is to be sure that we are reporting exactly what is being used internally, just in case there are any discrepancies (there shouldn't be).	2014-02-25 16:04:26 +02:00
Laine Stump	65487c0fc5	conf: re-situate <bandwidth> element in <interface> This moves the call to virNetDevBandwidthFormat() in virDomainNetDefFormat() to be called right after the call to virNetDevVPortProfileFormat(), so that a single chunk of that function can be placed inside an if that conditionally calls virDomainActualNetDefContentsFormat() instead (next patch). The re-ordering necessitates modifying a couple of test data files.	2014-02-25 16:03:05 +02:00
Laine Stump	7c39214cd4	conf: make virDomainNetDefFormat a public function We will need to call virDomainNetDefFormat() from the network hook (in the network driver).	2014-02-25 16:01:39 +02:00
Laine Stump	79358733b0	conf: handle null pointer in virNetDevVlanFormat Other Format() functions (e.g. virNetDevBandwidthFormat()) return with no action when called with a NULL Def pointer. This makes virNetDevVlanFormat() consistent with that behavior.	2014-02-25 15:56:12 +02:00
Laine Stump	6d4ffae4fc	conf: clarify what is returned for actual bandwidth and vlan In practice, if a virDomainNetDef has a virDomainActualNetDef allocated, the ActualNetDef will always contain the bandwidth and vlan data from the NetDef (unless there was also a portgroup involved - see networkAllocateActualDevice()). However, virDomainNetGetActual(Bandwidth\|Vlan)() were coded to make it appear as if it might be possible to have a valid bandwidth/vlan in the NetDef, but a NULL in the ActualNetDef. Believing this un-truth could lead to writing unnecessarily defensive code when dealing with the virDomainGetActual() functions, so this patch makes it more obvious: If there is an ActualNetDef, it will always have a copy of the various appropriate bits from its parent NetDef, and the virDomainGetActual function will always return the data from the ActualNetDef, not from the NetDef. The reason for this effective-NOP patch is that a subsequent patch to change virDomainNetDefFormat will rely on the above rule.	2014-02-25 15:55:19 +02:00
Wido den Hollander	60f70542f9	rbd: Set timeout options for librados These timeout values make librados/librbd return -ETIMEDOUT when a operation is blocking due to a failing/unreachable Ceph cluster. By having the operations time out libvirt will not block.	2014-02-25 11:14:44 +01:00
Wido den Hollander	761491eb7c	rbd: Include return statuses from librados/librbd in logging With this information it's easier for the user to debug what is going wrong.	2014-02-25 11:14:28 +01:00
Jim Fehlig	cfad607b23	libxl: handle on_crash coredump actions Add support for coredump-{destroy,restart} actions of <on_crash> event. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-24 10:39:44 -07:00
Jim Fehlig	c2de456e4e	libxl: add dump dir to libxlDriverConfig object Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-24 10:27:53 -07:00
Jim Fehlig	51b9b39127	libxl: honor domain lifecycle event configuration The libxl driver was ignoring the <on_> domain event configuration, causing e.g. a domain to be rebooted even when on_reboot is set to destroy. This patch honors the <on_> configuration in the shutdown event handler. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-24 10:26:52 -07:00
Richard Weinberger	6fb42d7cdc	Ensure systemd cgroup ownership is delegated to container with userns This function is needed for user namespaces, where we need to chmod() the cgroup to the initial uid/gid such that systemd is allowed to use the cgroup. Signed-off-by: Richard Weinberger <richard@nod.at> Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-24 15:35:47 +00:00
Roman Bogorodskiy	8ca5f46c59	bhyve: implement node information reporting - Implement nodeGetCPUStats using nodeGetCPUStats() - Implement nodeGetMemoryStats using nodeGetMemoryStats()	2014-02-24 19:03:46 +04:00
Daniel P. Berrange	66e3a3e914	Add virStringReplace method for substring replacement Add a virStringReplace method to virstring.{h,c} to perform substring matching and replacement Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-24 10:51:22 +00:00
Manuel VIVES	12aa71dfde	Add virStringSearch method for regex matching Add a virStringSearch method to virstring.{c,h} which performs a regex match against a string and returns the matching substrings. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-24 10:46:28 +00:00
Michal Privoznik	68954fb25c	virNetServerRun: Notify systemd that we're accepting clients Systemd does not forget about the cases, where client service needs to wait for daemon service to initialize and start accepting new clients. Setting a dependency in client is not enough as systemd doesn't know when the daemon has initialized itself and started accepting new clients. However, it offers a mechanism to solve this. The daemon needs to call a special systemd function by which the daemon tells "I'm ready to accept new clients". This is exactly what we need with libvirtd-guests (client) and libvirtd (daemon). So now, with this change, libvirt-guests.service is invoked not any sooner than libvirtd.service calls the systemd notify function. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-24 10:54:48 +01:00
Michal Privoznik	ba79e3879e	virSystemdCreateMachine: Set dependencies for slices https://bugzilla.redhat.com/show_bug.cgi?id=1031696 When creating a new domain, we let systemd know about it by calling CreateMachine() function via dbus. Systemd then creates a scope and places domain into it. However, later when the host is shutting down, systemd computes the shutdown order to see what processes can be shut down in parallel. And since we were not setting dependencies at all, the slices (and thus domains) were most likely killed before libvirt-guests.service. So user domains that had to be saved, shut off, whatever were in fact killed. This problem can be solved by letting systemd know that scopes we're creating must not be killed before libvirt-guests.service. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-24 10:21:00 +01:00
Ján Tomko	57e17a74b7	Ignore additional fields in iscsiadm output There has been a new field introduced in iscsiadm --mode session output [1], but our regex only expects four fields. This breaks startup of iscsi pools: error: Failed to start pool iscsi error: internal error: cannot find session Fix this by ignoring anything after the fourth field. https://bugzilla.redhat.com/show_bug.cgi?id=1067173 [1] https://github.com/mikechristie/open-iscsi/commit/181af9a	2014-02-21 10:35:57 +01:00
Ján Tomko	abf1daf0d7	Add a stub for virCgroupGetDomainTotalCpuStats Commit `6515889` broke the build on FreeBSD: In function `qemuDomainGetCPUStats': /../../src/qemu/qemu_driver.c:16102: undefined reference to `virCgroupGetDomainTotalCpuStats'	2014-02-21 09:10:48 +01:00
Jim Fehlig	84a6209d7f	libxl: queue shutdown event on domain shutdown Emit libvirt shutdown event when receiving LIBXL_SHUTDOWN_REASON_POWEROFF event from libxl. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-20 15:50:06 -07:00
Jim Fehlig	d716d942e2	libxl: always use libxlVmCleanupJob in shutdown thread Commit `e4a0e900` missed calling libxlVmCleanupJob in the shutdown handler when processing a reboot event. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-20 11:50:33 -07:00
Eric Blake	60f7303c15	qemu: adjust maxmem/maxvcpu computation https://bugzilla.redhat.com/show_bug.cgi?id=1038363 If a domain has a different maximum for persistent and live maxmem or max vcpus, then it is possible to hit cases where libvirt refuses to adjust the current values or gets halfway through the adjustment before failing. Better is to determine up front if the change is possible for all requested flags. Based on an idea by Geoff Franks. * src/qemu/qemu_driver.c (qemuDomainSetMemoryFlags): Compute correct maximum if both live and config are being set. (qemuDomainSetVcpusFlags): Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-20 11:27:16 -07:00
Daniel P. Berrange	432a3fee3b	Rename virDomainGetRootFilesystem to virDomainGetFilesystemForTarget The virDomainGetRootFilesystem method can be generalized to allow any filesystem path to be obtained. While doing this, start a new test case for purpose of testing various helper methods in the domain_conf.{c,h} files, such as this one. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-20 15:50:46 +00:00
Daniel P. Berrange	cb9b3bc257	Fix multiple bugs in LXC domainMemoryStats driver The virCgroupXXX APIs' return value must be checked for being less than 0, not equal to 0. An VIR_ERR_OPERATION_INVALID error must also be raised when the VM is not running to prevent a crash on NULL priv->cgroup field. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-20 15:32:49 +00:00
Thorsten Behrens	0bd2ccdecc	Widening API change - accept empty path for virDomainBlockStats And provide domain summary stat in that case, for lxc backend. Use case is a container inheriting all devices from the host, e.g. when doing application containerization.	2014-02-20 16:20:09 +01:00
Thorsten Behrens	dcc85c603e	Implement lxcDomainBlockStats* for lxc driver Adds lxcDomainBlockStatsFlags and lxcDomainBlockStats functions.	2014-02-20 16:20:09 +01:00
Thorsten Behrens	4b3b2f6ceb	Implement domainGetCPUStats for lxc driver.	2014-02-20 16:20:09 +01:00
Thorsten Behrens	65158899b7	Make qemuGetDomainTotalCPUStats a virCgroup function. To reuse this from other drivers, like lxc.	2014-02-20 16:20:09 +01:00
Thorsten Behrens	192604ddee	Implement domainMemoryStats API slot for LXC driver.	2014-02-20 16:20:09 +01:00
Thorsten Behrens	a2bb187c7e	Add util virCgroupGetBlkioIo*Serviced methods. This reads blkio stats from blkio.throttle.io_service_bytes and blkio.throttle.io_serviced.	2014-02-20 16:20:09 +01:00
Richard Weinberger	39aad72510	lxc: Add destroy support for suspended domains Destroying a suspended domain needs special action. We cannot simply terminate all process because they are frozen. Do deal with that we send them SIGKILL and thaw them. Upon wakeup the process sees the pending signal and dies immediately. Signed-off-by: Richard Weinberger <richard@nod.at>	2014-02-20 10:46:31 +01:00
Ján Tomko	057d26b2ac	Fix build of portallocator on mingw IN6ADDR_ANY_INIT does not seem to be working as expected on MinGW: error: missing braces around initializer [-Werror=missing-braces] .sin6_addr = IN6ADDR_ANY_INIT, Use the in6addr_any variable instead. Reported by Daniel P. Berrange.	2014-02-20 10:16:07 +01:00
Michal Privoznik	83c404ff9b	networkRunHook: Run hook only if possible Currently, networkRunHook() is called in networkAllocateActualDevice and friends. These functions, however, doesn't necessarily work on networks, For example, if domain's interface is defined in this fashion: <interface type='bridge'> <mac address='52:54:00:0b:3b:16'/> <source bridge='virbr1'/> <model type='rtl8139'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x09' function='0x0'/> </interface> The networkAllocateActualDevice jumps directly onto 'validate' label as the interface is not type of 'network'. Hence, @network is left initialized to NULL and networkRunHook(network, ...) is called. One of the things that the hook function does is dereference @network. Soupir. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-20 08:56:17 +01:00
Jim Fehlig	e6dcb0e2a1	libxl: use job functions in libxlDomainSetSchedulerParametersFlags Modify operation that needs to wait in the queue of modify jobs. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:01 -07:00
Jim Fehlig	7d9ff81603	libxl: use job functions in libxlDomainSetAutostart Setting autostart is a modify operation that needs to wait in the queue of modify jobs. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:01 -07:00
Jim Fehlig	85ff3d7aec	libxl: use job functions in device attach and detach functions These operations aren't necessarily time consuming, but need to wait in the queue of modify jobs. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:01 -07:00
Jim Fehlig	7df46cff6b	libxl: use job functions in vcpu set and pin functions These operations aren't necessarily time consuming, but need to wait in the queue of modify jobs. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:01 -07:00
Jim Fehlig	f9e6b7024c	libxl: use job functions in libxlDomainCoreDump Dumping a domain's core can take considerable time. Use the recently added job functions and unlock the virDomainObj while dumping core. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	341870b10d	libxl: use job functions in domain save operations Saving domain memory and cpu state can take considerable time. Use the recently added job functions and unlock the virDomainObj while saving the domain. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	e4a0e900d3	libxl: use job functions when cleaning up a domain When explicitly destroying a domain (libxlDomainDestroyFlags), or handling an out-of-band domain shutdown event, cleanup the domain in the context of a job. Introduce libxlVmCleanupJob to wrap libxlVmCleanup in a job block.	2014-02-19 11:10:00 -07:00
Jim Fehlig	f5bc5bd4df	libxl: use job functions in libxlDomain{Suspend,Resume} These operations aren't necessarily time consuming, but need to wait in the queue of modify jobs. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	ac1444c35f	libxl: use job functions in libxlDomainSetMemoryFlags Large balloon operation can be time consuming. Use the recently added job functions and unlock the virDomainObj while ballooning. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	491593e840	libxl: use job functions in libxlVmStart Creating a large domain could potentially be time consuming. Use the recently added job functions and unlock the virDomainObj while the create operation is in progress. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	4b4b61c329	libxl: Add job support to libxl driver Follows the pattern used in the QEMU driver for managing multiple, simultaneous jobs within the driver. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	343119a44b	libxl: remove libxlVmReap function This function, which only has five call sites, simply calls libxl_domain_destroy and libxlVmCleanup. Call those functions directly at the call sites, allowing more control over how a domain is destroyed and cleaned up. This patch maintains the existing semantic, leaving changes to a subsequent patch. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Jim Fehlig	219d34cfe2	libxl: always set vm id to -1 on shutdown Once a domain has reached the shutdown state, set its ID to -1. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-19 11:10:00 -07:00
Oleg Strikov	41b9b71877	qemu: Use virtio network device for aarch64/virt This patch changes network device type used by default from rtl8139 to virtio when architecture type is aarch64 and machine type is virt. Qemu doesn't support any other machine types for aarch64 right now and we can't make any other aarch64-specific tuning in this function yet. Signed-off-by: Oleg Strikov <oleg.strikov@canonical.com>	2014-02-19 10:46:10 -05:00
Roman Bogorodskiy	0eb4a5f4f1	bhyve: add a basic driver At this point it has a limited functionality and is highly experimental. Supported domain operations are: * define * start * destroy * dumpxml * dominfo It's only possible to have only one disk device and only one network, which should be of type bridge.	2014-02-19 14:21:50 +00:00
Li Zhang	cffa51b81d	Add a default USB keyboard and USB mouse for PPC64 There is no keyboard working on PPC64 and PS2 mouse is only for X86 when graphics are enabled. Add a USB keyboard and USB mouse for PPC64 when graphics are enabled. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:31 +01:00
Li Zhang	2a81430c85	xen: format xen config for USB keyboard Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:31 +01:00
Li Zhang	78730478aa	qemu: format qemu command line for USB keyboard Format qemu command line for USB keyboard and add test cases for it. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:31 +01:00
Li Zhang	f5ffd45f4c	qemu: Add USB keyboard capability Add USB keyboard capability probing and test cases. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:31 +01:00
Li Zhang	b39275954b	conf: Remove the implicit PS2 devices for non-X86 platforms PS2 devices only work on X86 platform, other platforms may need USB devices instead. Athough it doesn't influence the QEMU command line, it's not right to add PS2 mouse/keyboard for non-X86 platform. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:31 +01:00
Li Zhang	bc18373391	conf: Add keyboard input device type There is no keyboard support currently in libvirt. For some platforms (PPC64 QEMU) this makes graphics unusable, since the keyboard is not implicit and it can't be added via libvirt. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:31 +01:00
Li Zhang	f608a713f6	conf: Add one interface to add default input devices Use it for the default mouse. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com> Signed-off-by: Ján Tomko <jtomko@redhat.com>	2014-02-19 09:16:30 +01:00
Michal Privoznik	4d88294483	bridge_driver.h: Fix build --without-network The networkNotifyActualDevice function is accepting two arguments, not one: qemu/qemu_process.c: In function 'qemuProcessNotifyNets': qemu/qemu_process.c:2776:47: error: macro "networkNotifyActualDevice" passed 2 arguments, but takes just 1 if (networkNotifyActualDevice(def, net) < 0) ^ Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-18 19:52:39 +01:00
Ján Tomko	adc8b2afbb	Fix conflicting types of virInitctlSetRunLevel `aebbcdd` didn't change the non-linux definition of the function, breaking the build on FreeBSD: ../../src/util/virinitctl.c:164: error: conflicting types for 'virInitctlSetRunLevel' ../../src/util/virinitctl.h:40: error: previous declaration of 'virInitctlSetRunLevel' was here	2014-02-18 15:05:06 +01:00
Michal Privoznik	9de7309125	network: Taint networks that are using hook script Basically, the idea is copied from domain code, where tainting exists for a while. Currently, only one taint reason exists - VIR_NETWORK_TAINT_HOOK to mark those networks which caused invoking of hook script. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-18 14:46:49 +01:00
Michal Privoznik	f1ab06e43d	network: Introduce network hooks There might be some use cases, where user wants to prepare the host or its environment prior to starting a network and do some cleanup after the network has been shut down. Consider all the functionality that libvirt doesn't currently have as an example what a hook script can possibly do. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-18 14:46:49 +01:00
Michal Privoznik	e0a31274ec	network_conf: Expose virNetworkDefFormatInternal In the next patch I'm going to need the network format function that takes virBuffer as argument. However, slightly change of name is more appropriate then: virNetworkDefFormatBuf to match the rest of functions that format an object to buffer. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-18 14:46:48 +01:00
Daniel P. Berrange	5fc590ad9f	CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC hotunplug code Rewrite multiple hotunplug functions to to use the virProcessRunInMountNamespace helper. This avoids risk of a malicious guest replacing /dev with an absolute symlink, tricking the driver into changing the host OS filesystem. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-18 12:59:14 +00:00
Daniel P. Berrange	1cadeafcaa	CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC chardev hostdev hotplug Rewrite lxcDomainAttachDeviceHostdevMiscLive function to use the virProcessRunInMountNamespace helper. This avoids risk of a malicious guest replacing /dev with a absolute symlink, tricking the driver into changing the host OS filesystem. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-18 12:59:14 +00:00
Daniel P. Berrange	1754c7f0ab	CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC block hostdev hotplug Rewrite lxcDomainAttachDeviceHostdevStorageLive function to use the virProcessRunInMountNamespace helper. This avoids risk of a malicious guest replacing /dev with a absolute symlink, tricking the driver into changing the host OS filesystem. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-18 12:59:11 +00:00
Daniel P. Berrange	7fba01c15c	CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC USB hotplug Rewrite lxcDomainAttachDeviceHostdevSubsysUSBLive function to use the virProcessRunInMountNamespace helper. This avoids risk of a malicious guest replacing /dev with a absolute symlink, tricking the driver into changing the host OS filesystem. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-18 12:59:07 +00:00
Daniel P. Berrange	4dd3a7d5bc	CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC disk hotplug Rewrite lxcDomainAttachDeviceDiskLive function to use the virProcessRunInMountNamespace helper. This avoids risk of a malicious guest replacing /dev with a absolute symlink, tricking the driver into changing the host OS filesystem. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-18 12:59:05 +00:00
Eric Blake	aebbcdd33c	CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC shutdown/reboot code Use helper virProcessRunInMountNamespace in lxcDomainShutdownFlags and lxcDomainReboot. Otherwise, a malicious guest could use symlinks to force the host to manipulate the wrong file in the host's namespace. Idea by Dan Berrange, based on an initial report by Reco <recoverym4n@gmail.com> at http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=732394 Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-18 12:59:02 +00:00
Daniel P. Berrange	7c72ef6f55	Add helper for running code in separate namespaces Implement virProcessRunInMountNamespace, which runs callback of type virProcessNamespaceCallback in a container namespace. This uses a child process to run the callback, since you can't change the mount namespace of a thread. This implies that callbacks have to be careful about what code they run due to async safety rules. Idea by Dan Berrange, based on an initial report by Reco <recoverym4n@gmail.com> at http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=732394 Signed-off-by: Daniel Berrange <berrange@redhat.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-18 12:45:41 +00:00
Daniel P. Berrange	c321bfc5c3	Add virFileMakeParentPath helper function Add a helper function which takes a file path and ensures that all directory components leading up to the file exist. IOW, it strips the filename part of the path and passes the result to virFileMakePath. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-18 12:39:06 +00:00
Daniel P. Berrange	c3eb12cace	Move check for cgroup devices ACL upfront in LXC hotplug The check for whether the cgroup devices ACL is available is done quite late during LXC hotplug - in fact after the device node is already created in the container in some cases. Better to do it upfront so we fail immediately. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-17 15:40:01 +00:00
Daniel P. Berrange	d24e6b8b1e	Disks are always block devices, never character devices The LXC disk hotplug code was allowing block or character devices to be given as disk. A disk is always a block device. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-17 15:39:55 +00:00
Daniel P. Berrange	2c2bec94d2	Fix reset of cgroup when detaching USB device from LXC guests When detaching a USB device from an LXC guest we must remove the device from the cgroup ACL. Unfortunately we were telling the cgroup code to use the guest /dev path, not the host /dev path, and the guest device node had already been unlinked. This was, however, fortunate since the code passed &priv->cgroup instead of priv->cgroup, so would have crash if the device node were accessible. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-17 15:39:55 +00:00
Daniel P. Berrange	a537827d15	Record hotplugged USB device in LXC live guest config After hotplugging a USB device, the LXC driver forgot to add the device def to the virDomainDefPtr. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-17 15:39:37 +00:00
Daniel P. Berrange	c364897222	Fix path used for USB device attach with LXC The LXC code missed the 'usb' component out of the path /dev/bus/usb/$BUSNUM/$DEVNUM, so it failed to actually setup cgroups for the device. This was in fact lucky because the call to virLXCSetupHostUsbDeviceCgroup was also mistakenly passing '&priv->cgroup' instead of just 'priv->cgroup'. So once the path is fixed, libvirtd would then crash trying to access the bogus virCgroupPtr pointer. This would have been a security issue, were it not for the bogus path preventing the pointer reference being reached. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-17 15:11:06 +00:00
Daniel P. Berrange	7a44af963e	Don't block use of USB with containers virDomainDefCompatibleDevice blocks use of USB if no USB controller is present. This is not correct for containers since devices can be assigned directly regardless of any controllers. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-02-17 15:11:06 +00:00
Michal Privoznik	3b2c279449	qemu: Implement VIR_DOMAIN_TAINT_HOOK Currently, there's just one place where we care if hook script is changing the domain XML: migration hook for incoming migration. In all other places where a hook script is executed, we don't read the XML back from the script. Anyway, the hook script can alter domain XML and hence we should taint it if the script did. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-17 11:38:15 +01:00
Michal Privoznik	287d30a816	virDomainTaintFlags: Introduce VIR_DOMAIN_TAINT_HOOK This new flag is to be used for tainting domains which XML definition was altered at runtime by a hook script. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-17 11:38:15 +01:00
Peter Krempa	98bbc8d59a	Revert "storage: Introduce internal pool support" The internal pools were an idea in one of the first iterations of the gluster series, which we decided not to use. Somehow the patch still got pushed. Remove it as the internal flag isn't needed. This reverts commit `362da8209d`.	2014-02-14 17:39:37 +01:00
Peter Krempa	0a584b1fcd	lxc: Don't shadow global symbol "link" Yet another variable name frowned upon by older compilers. Introduced in commit `b73c029d`.	2014-02-14 14:01:45 +01:00
Ján Tomko	0ee9081215	Support IPv6 in port allocator Also try to bind on IPv6 to check if the port is occupied. Change the mocked bind in the test to return EADDRINUSE for some ports only for the IPv4/IPv6 socket if we're testing on a host with IPv6 compiled in. Also mock socket() to make it fail with EAFNOTSUPPORTED if LIBVIRT_TEST_IPV4ONLY is set in the environment, to simulate a host without IPv6 support in the kernel. The tests are repeated again with this variable set. https://bugzilla.redhat.com/show_bug.cgi?id=1025407	2014-02-14 13:18:35 +01:00
Ján Tomko	531bc0bbd0	Split out bind() from virPortAllocatorAcquire	2014-02-14 13:18:35 +01:00
Peter Krempa	ad95fa5957	storage: gluster: Don't leak private data when storage file init fails In `a44b7b87bc` I've introduced a function that initializes a storage file wrapper object on gluster based volumes. The initialization function leaks the private data pointer in case of failure. This patch fixes it. Reported by John Ferlan.	2014-02-14 13:08:39 +01:00
Peter Krempa	8d8b32b0da	storage: Fix build with older compilers afeter gluster snapshot series In commit `e32268184b` I accidentally added twice a typedef for virStorageFileBackend when I moved it between files across patch iterations. The double declaration breaks build on older compilers in RHEL5 and FreeBSD. Remove the spurious definition.	2014-02-14 11:46:37 +01:00
Peter Krempa	3cf074ee40	qemu: snapshot: Add support for external active snapshots on gluster Add support for gluster backed images as sources for snapshots in the qemu driver. This will also simplify adding further network backed volumes as sources for snapshot in case qemu will support them.	2014-02-14 11:07:29 +01:00
Peter Krempa	7183d7d2e8	qemu: snapshot: Use new APIs to detect presence of existing storage files Use the new storage driver based "stat" api to detect exiting files just as we did with local files.	2014-02-14 11:07:29 +01:00
Peter Krempa	8f4091d677	qemu: Switch snapshot deletion to the new API functions Use the new storage driver APIs to delete snapshot backing files in case of failure instead of directly relying on "unlink". This will help us in the future when we will be adding network based storage without local representation in the host.	2014-02-14 11:07:29 +01:00
Peter Krempa	a44b7b87bc	storage: Add storage file backends for gluster Implement storage backend functions to deal with gluster volumes and implement the "stat" and "unlink" backend APIs.	2014-02-14 11:07:23 +01:00
Peter Krempa	e62d09b155	storage: add file functions for local and block files Implement the "stat" and "unlink" function for "file" volumes and "stat" for "block" volumes using the regular system calls.	2014-02-14 10:47:57 +01:00
Peter Krempa	e32268184b	storage: Add file storage APIs in the default storage driver Add APIs that will allow to use the storage driver to assist in operations on files even for remote filesystems without native representation as files in the host.	2014-02-14 10:47:56 +01:00
Peter Krempa	6fb5a397bf	conf: Move qemuSnapshotDiskGetActualType to virDomainSnapshotDiskGetActualType All the data for getting the actual type is present in the snapshot config. There is no need to have this function private to the qemu driver and it will be re-used later in other parts of libvirt	2014-02-14 10:47:56 +01:00
Peter Krempa	f8f020da0a	conf: Move qemuDiskGetActualType to virDomainDiskGetActualType All the data for getting the actual type is present in the domain config. There is no need to have this function private to the qemu driver and it will be re-used later in other parts of libvirt	2014-02-14 10:47:56 +01:00
Cédric Bosdonnat	7554a85be2	lxc from native: removed now remaining useless line	2014-02-13 07:55:05 -05:00
Philipp Hahn	760498fdc7	Fix stream related spelling mistakes Remove double "is". Consistent spelling of all-uppercase I/O. Signed-off-by: Philipp Hahn <hahn@univention.de>	2014-02-13 11:12:02 +01:00
Cédric Bosdonnat	3d58fa3f85	LXC from native: convert blkio throttle config	2014-02-12 17:52:47 +00:00
Cédric Bosdonnat	a09bbc024d	LXC from native: map vlan network type The problem with VLAN is that the user still has to manually create the vlan interface on the host. Then the generated configuration will use it as a nerwork hostdev device. So the generated configurations of the following two fragments are equivalent (see rhbz#1059637). lxc.network.type = phys lxc.network.link = eth0.5 lxc.network.type = vlan lxc.network.link = eth0 lxc.network.vlan.id = 5	2014-02-12 17:52:47 +00:00
Cédric Bosdonnat	d1520c5c9a	LXC from native: map block filesystems	2014-02-12 17:52:47 +00:00
Cédric Bosdonnat	0f13a525d2	LXC from native: map lxc.arch to /domain/os/type@arch	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	5b8bfb0276	LXC from native: add lxc.cgroup.blkio.* mapping	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	281e2990ee	LXC from native: map lxc.cgroup.cpuset.*	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	4f3f7aea6c	LXC from native: map lxc.cgroup.cpu.*	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	13b9946eb5	LXC from native: migrate memory tuning	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	99d8cddfbe	LXC from native: convert lxc.id_map into <idmap>	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	8e45b88772	LXC from native: convert macvlan network configuration	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	f01fe54e75	LXC from native: convert lxc.tty to console devices	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	69fc236243	LXC from native: convert phys network types to net hostdev devices	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	b73c029d83	LXC from native: migrate veth network configuration Some of the LXC configuration properties aren't migrated since they would only cause problems in libvirt-lxc: * lxc.network.ipv[46]: LXC driver doesn't setup IP address of guests, see rhbz#1059624 * lxc.network.name, see rhbz#1059630	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	7bfd6e97ec	LXC from native: implement no network conversion If no network configuration is provided, LXC only provides the loopback interface. To match this, we need to use the privnet feature. LXC will also define a 'none' network type in its 1.0.0 version that fits libvirt LXC driver's default.	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	a41680f8c5	LXC from native: migrate fstab and lxc.mount.entry Tmpfs relative size and default 50% size values aren't supported as we have no idea of the available memory at the conversion time.	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	197b13e5d9	LXC from native: import rootfs LXC rootfs can be either a directory or a block device or an image file. The first two types have been implemented, but the image file is still to be done since LXC auto-guesses the file format at mount time and the LXC driver doesn't support the 'auto' format.	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	7195c807b2	LXC driver: started implementing connectDomainXMLFromNative This function aims at converting LXC configuration into a libvirt domain XML description to help users migrate from LXC to libvirt. Here is an example of how the lxc configuration works: virsh -c lxc:/// domxml-from-native lxc-tools /var/lib/lxc/migrate_test/config It is possible that some parts couldn't be properly mapped into a domain XML fragment, so users should carefully review the result before creating the domain. fstab files in lxc.mount lines will need to be merged into the configuration file as lxc.mount.entry. As we can't know the amount of memory of the host, we have to set a default value for max_balloon that users will probably want to adjust.	2014-02-12 17:52:46 +00:00
Cédric Bosdonnat	3daa14834a	Improve virConf parse to handle LXC config format virConf now honours a VIR_CONF_FLAG_LXC_FORMAT flag to handle LXC configuration files. The differences are that property names can contain '.' character and values are all strings without any bounding quotes. Provide a new virConfWalk function calling a handler on all non-comment values. This function will be used by the LXC conversion code to loop over LXC configuration lines.	2014-02-12 17:52:46 +00:00
Eric Blake	6831c1d327	event: pass reason for PM events Commit `57ddcc23` (v0.9.11) introduced the pmwakeup event, with an optional 'reason' field reserved for possible future expansion. But it failed to wire the field through RPC, so even if we do add a reason in the future, we will be unable to get it back to the user. Worse, commit `7ba5defb` (v1.0.0) repeated the same mistake with the pmsuspend_disk event. As long as we are adding new RPC calls, we might as well fix the events to actually match the signature so that we don't have to add yet another RPC in the future if we do decide to start using the reason field. * src/remote/remote_protocol.x (remote_domain_event_callback_pmwakeup_msg) (remote_domain_event_callback_pmsuspend_msg) (remote_domain_event_callback_pmsuspend_disk_msg): Add reason field. * daemon/remote.c (remoteRelayDomainEventPMWakeup) (remoteRelayDomainEventPMSuspend) (remoteRelayDomainEventPMSuspendDisk): Pass reason to client. * src/conf/domain_event.h (virDomainEventPMWakeupNewFromDom) (virDomainEventPMSuspendNewFromDom) (virDomainEventPMSuspendDiskNewFromDom): Require additional parameter. * src/conf/domain_event.c (virDomainEventPMClass): New class. (virDomainEventPMDispose): New function. (virDomainEventPMWakeupNew, virDomainEventPMSuspendNew) (virDomainEventPMSuspendDiskNew) (virDomainEventDispatchDefaultFunc): Use new class. src/remote/remote_driver.c (remoteDomainBuildEventPM): Pass reason through. * src/remote_protocol-structs: Regenerate. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-12 10:48:16 -07:00
Eric Blake	158795d20e	event: convert remaining domain events to new style Following the patterns established by lifecycle events, this creates all the new RPC calls needed to pass callback IDs for every domain event, and changes the limits in client and server codes to use modern style when possible. I've tested all combinations: both 'old client and new server' and 'new client and old server' continue to work with the old RPCs, and 'new client and new server' benefit from server-side filtering with the new RPCs. * src/remote/remote_protocol.x (REMOTE_PROC_DOMAIN_EVENT_): Add REMOTE_PROC_DOMAIN_EVENT_CALLBACK_ counterparts. * daemon/remote.c (remoteRelayDomainEvent): Send callbackID via newer RPC when used with new-style registration. (remoteDispatchConnectDomainEventCallbackRegisterAny): Extend to cover all domain events. src/remote/remote_driver.c (remoteDomainBuildEvent): Add new Callback and Helper functions. (remoteEvents): Match order of RPC numbers, register new handlers. (remoteConnectDomainEventRegisterAny) (remoteConnectDomainEventDeregisterAny): Extend to cover all domain events. src/remote_protocol-structs: Regenerate. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-12 10:48:16 -07:00
Eric Blake	355ea62650	event: client RPC protocol tweaks for domain lifecycle events The counterpart to the server RPC additions; here, a single function can serve both old and new calls, while incoming events must be serviced by two different functions. Again, some wise choices in our XDR made it easier to share code managing similar events. While this only supports lifecycle events, it covers the harder part of how Register and RegisterAny interact; the remaining 15 events will be a mechanical change in a later patch. For Register, we now have a callbackID locally for more efficient cleanup if the RPC fails; we also prefer to use the newer RPC where we know it is supported (the older RPC must be used if we don't know if RegisterAny is supported). * src/remote/remote_driver.c (remoteEvents): Register new RPC event handler. (remoteDomainBuildEventLifecycle): Move guts... (remoteDomainBuildEventLifecycleHelper): ...here. (remoteDomainBuildEventCallbackLifecycle): New function. (remoteConnectDomainEventRegister) (remoteConnectDomainEventDeregister) (remoteConnectDomainEventRegisterAny) (remoteConnectDomainEventDeregisterAny): Use new RPC when supported.	2014-02-12 10:48:16 -07:00
Eric Blake	caaf6ba1b6	event: prepare client to track domain callbackID We want to convert over to server-side events, even for older APIs. To do that, the client side of the remote driver wants to distinguish between legacy virConnectDomainEventRegister and normal virConnectDomainEventRegisterAny, while knowing the client callbackID and the server's serverID for both types of registration. The client also needs to probe whether the server supports server-side filtering. However, for ease of review, we don't actually use the new RPCs until a later patch. * src/conf/object_event_private.h (virObjectEventStateCallbackID): Add parameter. * src/conf/object_event.c (virObjectEventCallbackListAddID) (virObjectEventStateRegisterID): Separate legacy from callbackID. (virObjectEventStateCallbackID): Pass through parameter. (virObjectEventCallbackLookup): Let legacy and global domain lifecycle events share a common remoteID. * src/conf/network_event.c (virNetworkEventStateRegisterID): Update caller. * src/conf/domain_event.c (virDomainEventStateRegister) (virDomainEventStateRegisterID, virDomainEventStateDeregister): Likewise. (virDomainEventStateRegisterClient) (virDomainEventStateCallbackID): Implement new functions. * src/conf/domain_event.h (virDomainEventStateRegisterClient) (virDomainEventStateCallbackID): New prototypes. * src/remote/remote_driver.c (private_data): Add field. (doRemoteOpen): Probe server feature. (remoteConnectDomainEventRegister) (remoteConnectDomainEventRegisterAny): Use new function. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-12 10:48:15 -07:00
Eric Blake	0372295770	event: server RPC protocol tweaks for domain lifecycle events This patch adds some new RPC call numbers, but for ease of review, they sit idle until a later patch adds the client counterpart to drive the new RPCs. Also for ease of review, I limited this patch to just the lifecycle event; although converting the remaining 15 domain events will be quite mechanical. On the server side, we have to have a function per RPC call, largely with duplicated bodies (the key difference being that we store in our callback opaque pointer whether events should be fired with old or new style); meanwhile, a single function can drive multiple RPC messages. With a strategic choice of XDR struct layout, we can make the event generation code for both styles fairly compact. I debated about adding a tri-state witness variable per connection (values 'unknown', 'legacy', 'modern'). It would start as 'unknown', move to 'legacy' if any RPC call is made to a legacy event call, and move to 'modern' if the feature probe is made; then the event code could issue an error if the witness state is incorrect (a legacy RPC call while in 'modern', a modern RPC call while in 'unknown' or 'legacy', and a feature probe while in 'legacy' or 'modern'). But while it might prevent odd behavior caused by protocol fuzzing, I don't see that it would prevent any security holes, so I considered it bloat. Note that sticking @acl markers on the new RPCs generates unused functions in access/viraccessapicheck.c, because there is no new API call that needs to use the new checks; however, having a consistent .x file is worth the dead code. * src/libvirt_internal.h (VIR_DRV_FEATURE_REMOTE_EVENT_CALLBACK): New feature. * src/remote/remote_protocol.x (REMOTE_PROC_CONNECT_DOMAIN_EVENT_CALLBACK_REGISTER_ANY) (REMOTE_PROC_CONNECT_DOMAIN_EVENT_CALLBACK_DEREGISTER_ANY) (REMOTE_PROC_DOMAIN_EVENT_CALLBACK_LIFECYCLE): New RPCs. * daemon/remote.c (daemonClientCallback): Add field. (remoteDispatchConnectDomainEventCallbackRegisterAny) (remoteDispatchConnectDomainEventCallbackDeregisterAny): New functions. (remoteDispatchConnectDomainEventRegisterAny) (remoteDispatchConnectDomainEventDeregisterAny): Mark legacy use. (remoteRelayDomainEventLifecycle): Change message based on legacy or new use. (remoteDispatchConnectSupportsFeature): Advertise new feature. * src/remote_protocol-structs: Regenerate. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-12 10:48:15 -07:00
Michael Chapman	74cf8202d2	storage: handle NULL return from virGetStorageVol virGetStorageVol can return NULL on out-of-memory. If it does, cleanly abort the volume clone operation. Signed-off-by: Michael Chapman <mike@very.puzzling.org>	2014-02-12 15:18:43 +01:00
Ján Tomko	7236a473f0	Revert "storage: disk: Separate creating of the volume from building" This reverts commit `67ccf91bf2`. We only generate the volume key after we've built it, but the storage driver expects it to be filled after createVol finishes. Squash the volume building back with creating to fulfill this expectation.	2014-02-12 14:54:05 +01:00
Ján Tomko	42bbde2d06	Revert "storage: lvm: Separate creating of the volume from building" This reverts commit `af1fb38f55`. With it, creating new logical volumes fails: https://www.redhat.com/archives/libvir-list/2014-February/msg00658.html In the storage driver, we expect CreateVol to fill out the volume key, but the LVM backend fills the key with the uuid reported by lvs after the logical volume is created.	2014-02-12 14:51:05 +01:00
Cédric Bosdonnat	d385239260	Fixed build with clang. Two unused global variables, and DBUS_TYPE_INVALID used as a const char*. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-12 06:36:17 -07:00
Oleg Strikov	69fba97f63	qemu: Implement a stub cpuArchDriver.baseline() handler for aarch64 Openstack Nova calls virConnectBaselineCPU() during initialization of the instance to get a full list of CPU features. This patch adds a stub to aarch64-specific code to handle this request (no actual work is done). That's enough to have this stub with limited functionality because qemu/kvm backend supports only 'host-passthrough' cpu mode on aarch64. Signed-off-by: Oleg Strikov <oleg.strikov@canonical.com>	2014-02-11 17:34:55 -05:00
Jim Fehlig	2fbfedeb0d	libxl: fix libxlDoDomainSave documentation Update the function's comment, which was missed when removing use of the driver lock everywhere.	2014-02-11 11:03:53 -07:00
Jim Fehlig	3d8a3d6e5b	libxl: register for domain events immediately after creation A small fix for the possiblitiy of jumping to an error path before registering for domain events, preventing receiving important ones like shutdown and death.	2014-02-11 11:03:53 -07:00
Jim Fehlig	e20bf46741	libxl: rename libxlCreateDomEvents to libxlDomEventsRegister libxlDomEventsRegister better reflects its purpose: register for domain events from libxl.	2014-02-11 11:03:53 -07:00
Ján Tomko	47fa97a799	Rename 'index' in virCapabilitiesGetCpusForNode This shadows the index function on some systems (RHEL-6.4, FreeBSD 9): ../../src/conf/capabilities.c: In function 'virCapabilitiesGetCpusForNode': ../../src/conf/capabilities.c:1005: warning: declaration of'index' shadows a global declaration [-Wshadow] /usr/include/strings.h:57: warning: shadowed declaration is here [-Wshadow]	2014-02-11 16:35:33 +01:00
Pradipta Kr. Banerjee	cd921cf077	Handle non-sequential NUMA node numbers On some platforms like IBM PowerNV the NUMA node numbers can be non-sequential. For eg. numactl --hardware o/p from such a machine looks as given below node distances: node 0 1 16 17 0: 10 40 40 40 1: 40 10 40 40 16: 40 40 10 40 17: 40 40 40 10 The NUMA nodes are 0,1,16,17 Libvirt uses sequential index as NUMA node numbers and this can result in crash or incorrect results. Signed-off-by: Shivaprasad G Bhat <sbhat@linux.vnet.ibm.com> Signed-off-by: Pradipta Kr. Banerjee <bpradip@in.ibm.com>	2014-02-11 14:44:20 +00:00
Peter Krempa	037ffda3c7	storage: gluster: Set volume metadata in a separate function Extract the metadata setting code into a separate function for future use.	2014-02-11 13:46:32 +01:00
Martin Kletzander	d27e6bc40f	qemu: introduce spiceport chardev backend Add a new backend for any character device. This backend uses channel in spice connection. This channel is similar to spicevmc, but all-purpose in contrast to spicevmc. Apart from spicevmc, spiceport-backed chardev will not be formatted into the command-line if there is no spice to use (with test for that as well). For this I moved the def->graphics counting to the start of the function so its results can be used in rest of the code even in the future. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-02-11 13:43:55 +01:00
Martin Kletzander	296a4791eb	qemu: remove pointless condition This patch is here just to ease the code review and make related changes look more sensible. Apart from removing the condition this is merely a whitespace (indentation) change. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-02-11 13:43:55 +01:00
Martin Kletzander	a53e504052	qemu: rework '-serial none' Limiting ourselves to qemu without QEMU_CAPS_DEVICE capability, we used '-serial none' only if there was no serial device defined in the domain XML. This means that if we want to have a possibility of the device being defined in XML, but not used in the command-line (e.g. when it's pointless), we'll fail to attach '-serial none' to the command-line (when skipping the device's command-line building and the device being the only one). Since there is no such device, this patch doesn't actually do anything, but enables easier future additions in this manner. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-02-11 13:43:55 +01:00
Martin Kletzander	5b189541ac	conf: introduce spiceport chardev backend Add a new character device backend called 'spiceport' that uses spice's channel for communications and apart from spicevmc can be used as a backend for any character device from libvirt's point of view. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-02-11 13:43:55 +01:00
Wido den Hollander	0227889ab0	rbd: Use rbd_create3 to create RBD format 2 images by default This new RBD format supports snapshotting and cloning. By having libvirt create images in format 2 end-users of the created images can benefit from the new RBD format. Older versions of libvirt can work with this new RBD format as long as librbd supports format 2. RBD format is supported by librbd since version 0.56 (Ceph Bobtail). Signed-off-by: Wido den Hollander <wido@widodh.nl>	2014-02-11 12:10:22 +00:00
Joel SIMOES	9741006333	Libvirt lose sheepdogs volumes on pool refresh or restart. When restarting sheepdog pool, all volumes are missing. This patch add automatically all volume from the added pool. Adding last Daniel P. Berrange's syntaxes correction. Adding vol on separeted function 'inspired' from parallels_storage : parallelsAddDiskVolume	2014-02-11 11:32:04 +00:00
Laine Stump	0144d72963	build: correctly check for SOICGIFVLAN GET_VLAN_VID_CMD command In order to make a client-only build successful on RHEL4 (yes, you read that correctly!), commit `3ed2e54` modified src/util/virnetdev.c so that the functional version of virNetDevGetVLanID() was only compiled if GET_VLAN_VID_CMD was defined. However, it is never defined, but is only an enum value, so the proper version was no longer compiled even on platforms that support it. This resulted in the vlan tag not being properly set for guest traffic on VEPA mode guest macvtap interfaces that were bound to a vlan interface (that's the only place that libvirt currently uses virNetDevGetVLanID) Since there is no way to compile conditionally based on the presence of an enum value, this patch modifies configure.ac to check for said enum value with AC_CHECK_DECLS(), which #defines HAVE_DECL_GET_VLAN_VID_CMD to 1 if it's successful compiling a test program that uses GET_VLAN_VID_CMD (and still #defines it, but to 0, if it's not successful). We can then make the compilation of virNetDevGetVLanID() conditional on the value of HAVE_DECL_GET_VLAN_VID_CMD.	2014-02-11 01:43:38 +02:00
Yuri Myasoedov	cc25e45158	maint: fix line numbers in check-aclrules reports Reset line numbering on each input file in check-aclrules.pl. Otherwise it reports wrong line numbers in its error messages. Signed-off-by: Yuri Myasoedov <ymyasoedov@yandex.ru> Signed-off-by: Roman Bogorodskiy <bogorodskiy@gmail.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-10 14:07:22 -07:00
Michal Privoznik	28900766d5	virNetworkLoadState: Disallow mangled 'floor' element In the network status XML we may have the <floor/> element with the 'sum' attribute. The attribute represents sum of all 'floor'-s of computed over each interface connected to the network (this is needed to guarantee certain bandwidth for certain domain). The sum is therefore a number. However, if the number was mangled (e.g. by an user's interference to network status file), we've just ignored it without refusing to parse such file. This was all due to 'goto error' missing. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-10 19:26:16 +01:00
Peter Krempa	9bf629ab60	qemu: Use correct permissions when determining the image chain The code took into account only the global permissions. The domains now support per-vm DAC labels and per-image DAC labels. Use the most specific label available.	2014-02-10 15:49:59 +01:00
Michal Privoznik	e209c07760	networkStartNetwork: Be more verbose The lack of debug printings might be frustrating in the future. Moreover, this function doesn't follow the usual pattern we have in the rest of the code: int ret = -1; /* do some work / ret = 0; cleanup: / some cleanup work */ return ret; Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-10 11:47:24 +01:00
Peter Krempa	600bca592b	qemu: hyperv: Add support for timer enlightenments Add a new <timer> for the HyperV reference time counter enlightenment and the iTSC reference page for Windows guests. This feature provides a paravirtual approach to track timer events for the guest (similar to kvmclock) with the option to use real hardware clock on systems with a iTSC with compensation across various hosts.	2014-02-10 11:30:10 +01:00
Peter Krempa	8ffaa42d7b	conf: Enforce supported options for certain timers According to the documentation various timer options are only supported by certain timer types. Add a post parse check to verify that the user didn't specify invalid options. Also fix the qemu command line parsing function to set correct default values for the kvmclock timer so that it passes the new check.	2014-02-10 11:17:32 +01:00
Peter Krempa	bbd392ff86	schema: Fix guest timer specification schema according to the docs According to the documentation describing various tunables for domain timers not all the fields are supported by all the driver types. Express these in the RNG: - rtc, platform: Only these support the "track" attribute. - tsc: only one to support "frequency" and "mode" attributes - hpet, pit: tickpolicy/catchup attribute/element - kvmclock: no extra attributes are supported Additionally the attributes of the <catchup> element for tickpolicy='catchup' are optional according to the parsing code. Express this in the XML and fix a spurious space added while formatting the <catchup> element and add tests for it.	2014-02-10 11:09:14 +01:00
John Ferlan	b60644f38f	virpci: Resolve coverity issues Coverity complains about "USE_AFTER_FREE" due to how virPCIDeviceSetStubDriver "could" return either -1, 0, or 1 from the VIR_STRDUP() and then possibly makes a call to virPCIDeviceDetach(). The only way this could happen is if NULL were passed as the "driver" name and virStrdup() returned 0. Since the calling functions check < 0 on the initial function call, the 0 possibility causes Coverity to complain. To fix this - enforce that the second parameter is not NULL using ATTRIBUTE_NONNULL(2) for the function prototype, then in virPCIDeviceDetach add an sa_assert(dev->stubDriver). This will result in Coverity not complaining any more.	2014-02-07 10:58:24 -05:00
Christophe Fergeau	f336b1cccb	Add glusterfs to VIR_CONNECT_LIST_STORAGE_POOLS_FILTERS_POOL_TYPE If it's not present in this list, we won't be able to get only glusterfs pools when using virConnectListAllStoragePools.	2014-02-07 10:26:46 +01:00
Martin Kletzander	440a1aa508	qemu: keep pre-migration domain state after failed migration Couple of codepaths shared the same code which can be moved out to a function and on one of such places, qemuMigrationConfirmPhase(), the domain was resumed even if it wasn't running before the migration started. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1057407 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-02-07 10:07:38 +01:00
Jim Fehlig	630b645695	libxl: remove unneeded locking of driver when restoring libxlDomainRestoreFlags acquires the driver lock while reading the domain config from the save file and adding it to libxlDriverPrivatePtr->domains. But virDomainObjList provides self-locking APIs, so remove the needless driver locking. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-06 10:39:32 -07:00
Jim Fehlig	778067e195	libxl: improve subprocess handling If available, let libxl handle reaping any children it creates by specifying libxl_sigchld_owner_libxl_always_selective_reap. This feature was added to improve subprocess handling in libxl when used in an application that does not install a SIGCHLD handler like libvirt http://lists.xen.org/archives/html/xen-devel/2014-01/msg01555.html Prior to this patch, it is possible to hit asserts in libxl when reaping subprocesses, particularly during simultaneous operations on multiple domains. With this patch, and the corresponding changes to libxl, I no longer see the asserts. Note that the libxl changes will be included in Xen 4.4.0. Previous Xen versions will be susceptible to hitting the asserts even with this patch applied to the libvirt libxl driver. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-06 10:20:31 -07:00
Jim Fehlig	03b3f8940a	libxl: handle domain shutdown events in a thread Handling the domain shutdown event within the event handler seems a bit unfair to libxl's event machinery. Domain "shutdown" could take considerable time. E.g. if the shutdown reason is reboot, the domain must be reaped and then started again. Spawn a shutdown handler thread to do this work, allowing libxl's event machinery to go about its business. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-06 10:17:58 -07:00
Jim Fehlig	eaa8d9b2c7	libxl: remove list of timer registrations from libxlDomainObjPrivate Due to some misunderstanding of requirements libxl places on timer handling, I introduced the half-brained idea of maintaining a list of timeouts that the driver could force to expire before freeing a libxlDomainObjPrivate (and hence libxl_ctx). But testing all the latest versions of Xen supported by the libxl driver (4.2.3, 4.3.1, 4.4.0 RC3), I see that libxl will handle this just fine and there is no need to force expiration behind libxl's back. Indeed it may be harmful to do so. This patch removes the timer list, allowing libxl to handle cleanup of its timer registrations. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-06 10:08:11 -07:00
Jim Fehlig	cda52dbfe5	libxl: fix leaking libxlDomainObjPrivate When libxl registers an FD with the libxl driver, the refcnt of the associated libxlDomainObjPrivate object is incremented. The refcnt is decremented when libxl deregisters the FD. But some FDs are only deregistered when their libxl ctx is freed, which unfortunately is done in the libxlDomainObjPrivate dispose function. With references held by the FDs, libxlDomainObjPrivate is never disposed. I added the ref/unref in FD registration/deregistration when adding the same in timer registration/deregistration. For timers, this is a simple approach to ensuring the libxlDomainObjPrivate is not disposed prior to their expirtation, which libxl guarantees will occur. It is not needed for FDs, and only causes libxlDomainObjPrivate to leak. This patch removes the reference on libxlDomainObjPrivate for FD registrations, but retains them for timer registrations. Tested on the latest releases of Xen supported by the libxl driver: 4.2.3, 4.3.1, and 4.4.0 RC3. Signed-off-by: Jim Fehlig <jfehlig@suse.com>	2014-02-06 10:06:26 -07:00
Matthieu Coudron	0778fc1ab9	qemu_driver: Introduce <filesystem/> support in device attach/detach This commit allows to attach/detach a <filesystem> device in qemu. For this purpose I'm introducing two new functions: virDomainFSInsert() and virDomainFSRemove() and adding necessary code in the qemu driver. It compares filesystems based on their "destination" folder. So if two filesystems share the same destination, they are considered equal and the qemu driver would reject the insertion. Signed-off-by: Matthieu Coudron <mattator@gmail.com>	2014-02-06 17:20:03 +01:00
Matthieu Coudron	8fc98ac8cf	virDomainHostdev{Insert,Delete}: Replace VIR_REALLOC_N by VIR_{APPEND,DELETE}_ELEMENT With this change the code gets shorter and more readable. Signed-off-by: Matthieu Coudron <mattator@gmail.com>	2014-02-06 17:10:26 +01:00
Roman Bogorodskiy	3b00df01fb	BSD: implement nodeGetCPUStats Implementation obtains CPU usage information using kern.cp_time and kern.cp_times sysctl(8)s and reports CPU utilization.	2014-02-06 14:09:15 +01:00
Jiri Denemark	05bf937572	qemu: Fix crash in virDomainMemoryStats with old qemu If virDomainMemoryStats was run on a domain with virtio balloon driver running on an old qemu which supports QMP but does not support qom-list QMP command, libvirtd would crash. The reason is we did not check if qemuMonitorJSONGetObjectListPaths failed and moreover we even stored its result in an unsigned integer type.	2014-02-06 11:29:29 +01:00
Peter Krempa	5d2691cc4c	qemu: blockjob: Print correct file name in error message When attempting a blockcommit from the top layer, the base argument passed is NULL. This will be dereferenced when attempting a commit with an empty image chain. Output the real volume path instead: virsh blockcommit --verbose --path vda --domain DOMNAME --wait error: invalid argument: top '/path/somefile' in chain for 'vda' has no backing file instead of: error: invalid argument: top '(null)' in chain for 'vda' has no backing file	2014-02-06 10:43:57 +01:00
Peter Krempa	cc3d335b76	maint: Change the text of the NULLSTR() macro to "<null>" Eric Blake suggested to change this message to be different from the glibc's NULL deref protection message in printf to be able to differentiate errors.	2014-02-06 10:43:57 +01:00
Michal Privoznik	51bea5df5d	qemuBuildClockArgStr: Allow localtime clock basis https://bugzilla.redhat.com/show_bug.cgi?id=1046192 Commit `b8bf79a`, which adds clock='variable', forgets to check localtime basis in qemuBuildClockArgStr(). So that localtime basis could not be used. Reported-by: Jincheng Miao <jmiao@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-02-06 07:51:07 +01:00
Ján Tomko	0db9b0883c	Generate a valid imagelabel even for type 'none' Commit `2ce63c1` added imagelabel generation when relabeling is turned off. But we weren't filling out the sensitivity for type 'none' labels, resulting in an invalid label: $ virsh managedsave domain error: unable to set security context 'system_u:object_r:svirt_image_t' on fd 28: Invalid argument	2014-02-05 19:47:30 +01:00
Eric Blake	f34ea654de	maint: fix grammar in conf file Noticed a misuse of 'to' while testing my event regression under polkit ACLs, and decided to review the entire conf files for other legibility bugs. * daemon/libvirtd.conf: Use correct grammar. * src/qemu/qemu.conf: Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-05 10:40:14 -07:00
Eric Blake	11f20e43f1	event: move event filtering to daemon (regression fix) https://bugzilla.redhat.com/show_bug.cgi?id=1058839 Commit `f9f56340` for CVE-2014-0028 almost had the right idea - we need to check the ACL rules to filter which events to send. But it overlooked one thing: the event dispatch queue is running in the main loop thread, and therefore does not normally have a current virIdentityPtr. But filter checks can be based on current identity, so when libvirtd.conf contains access_drivers=["polkit"], we ended up rejecting access for EVERY event due to failure to look up the current identity, even if it should have been allowed. Furthermore, even for events that are triggered by API calls, it is important to remember that the point of events is that they can be copied across multiple connections, which may have separate identities and permissions. So even if events were dispatched from a context where we have an identity, we must change to the correct identity of the connection that will be receiving the event, rather than basing a decision on the context that triggered the event, when deciding whether to filter an event to a particular connection. If there were an easy way to get from virConnectPtr to the appropriate virIdentityPtr, then object_event.c could adjust the identity prior to checking whether to dispatch an event. But setting up that back-reference is a bit invasive. Instead, it is easier to delay the filtering check until lower down the stack, at the point where we have direct access to the RPC client object that owns an identity. As such, this patch ends up reverting a large portion of the framework of commit `f9f56340`. We also have to teach 'make check' to special-case the fact that the event registration filtering is done at the point of dispatch, rather than the point of registration. Note that even though we don't actually use virConnectDomainEventRegisterCheckACL (because the RegisterAny variant is sufficient), we still generate the function for the purposes of documenting that the filtering takes place. Also note that I did not entirely delete the notion of a filter from object_event.c; I still plan on using that for my upcoming patch series for qemu monitor events in libvirt-qemu.so. In other words, while this patch changes ACL filtering to live in remote.c and therefore we have no current client of the filtering in object_event.c, the notion of filtering in object_event.c is still useful down the road. * src/check-aclrules.pl: Exempt event registration from having to pass checkACL filter down call stack. * daemon/remote.c (remoteRelayDomainEventCheckACL) (remoteRelayNetworkEventCheckACL): New functions. (remoteRelayEvent): Use new functions. * src/conf/domain_event.h (virDomainEventStateRegister) (virDomainEventStateRegisterID): Drop unused parameter. * src/conf/network_event.h (virNetworkEventStateRegisterID): Likewise. * src/conf/domain_event.c (virDomainEventFilter): Delete unused function. * src/conf/network_event.c (virNetworkEventFilter): Likewise. * src/libxl/libxl_driver.c: Adjust caller. * src/lxc/lxc_driver.c: Likewise. * src/network/bridge_driver.c: Likewise. * src/qemu/qemu_driver.c: Likewise. * src/remote/remote_driver.c: Likewise. * src/test/test_driver.c: Likewise. * src/uml/uml_driver.c: Likewise. * src/vbox/vbox_tmpl.c: Likewise. * src/xen/xen_driver.c: Likewise. Signed-off-by: Eric Blake <eblake@redhat.com>	2014-02-05 08:03:31 -07:00
Laine Stump	eafb53fec2	network: disallow <bandwidth>/<mac> for bridged/macvtap/hostdev networks https://bugzilla.redhat.com/show_bug.cgi?id=1057321 pointed out that we weren't honoring the <bandwidth> element in libvirt networks using <forward mode='bridge'/>. In fact, these networks are just a method of giving a libvirt network name to an existing Linux host bridge on the system, and libvirt doesn't have enough information to know where to set such limits. We are working on a method of supporting network bandwidths for some specific cases of <forward mode='bridge'/>, but currently libvirt doesn't support it. So the proper thing to do now is just log an error when someone tries to put a <bandwidth> element in that type of network. (It's unclear if we will be able to do proper bandwidth limiting for macvtap networks, and most definitely we will not be able to support it for hostdev networks). While looking through the network XML documentation and comparing it to the networkValidate function, I noticed that we also ignore the presence of a mac address in the config in the same cases, rather than failing so that the user will understand that their desired action has not been taken. This patch updates networkValidate() (which is called any time a persistent network is defined, or a transient network created) to log an error and fail if it finds either a <bandwidth> or <mac> element and the network forward mode is anything except 'route'. 'nat', or nothing. (Yes, neither of those elements is acceptable for any macvtap mode, nor for a hostdev network). NB: This does not cause failure to start any existing network that contains one of those elements, so someone might have erroneously defined such a network in the past, and that network will continue to function unmodified. I considered it too disruptive to suddenly break working configs on the next reboot after a libvirt upgrade.	2014-02-05 15:04:58 +02:00
John Ferlan	19259574d5	Honor blacklist for modprobe command https://bugzilla.redhat.com/show_bug.cgi?id=1045124 When loading modules, libvirt does not honor the modprobe blacklist. Use the new virKModLoad() API in order to attempt load with blacklist check. Use the new virKModIsBlacklisted() API to check if the failure to load was due to the blacklist Signed-off-by: John Ferlan <jferlan@redhat.com>	2014-02-04 10:43:53 -05:00
John Ferlan	4a2179ea92	utils: Introduce functions for kernel module manipulation virKModConfig() - Return a buffer containing kernel module configuration virKModLoad() - Load a specific module into the kernel configuration virKModUnload() - Unload a specific module from the kernel configuration virKModIsBlacklisted() - Determine whether a module is blacklisted within the kernel configuration	2014-02-04 08:52:27 -05:00
Laine Stump	0d0a7bf45a	qemu: be sure we're using the updated value of backend during hotplug commit `f094aaac` changed qemuPrepareHostdevPCIDevices() such that it may modify the "backend" (vfio vs. legacy kvm) setting in the virHostdevDef. However, qemuDomainAttachHostPciDevice() (used by hotplug) copies the backend setting into a local before calling qemuPrepareHostdevPCIDevices(), and then later makes a decision based on that pre-change value. The result is that, if the backend had been set to "default" (i.e. not specified in the config) and was later updated to "VFIO" by qemuPrepareHostdevPCIDevices(), the qemu process' MacMemLock is not increased (as is required for VFIO device assignment). This patch delays making the local copy of backend until after its potential modification.	2014-02-04 14:05:09 +02:00
Laine Stump	66f75925eb	network: change default of forwardPlainNames to 'yes' The previous patch fixed "forwardPlainNames" so that it really is doing only what is intended, but left the default to be "forwardPlainNames='no'". Discussion around the initial version of that patch led to the decision that the default should instead be "forwardPlainNames='yes'" (i.e. the original behavior before commit f3886825). This patch makes that change to the default.	2014-02-04 12:00:26 +02:00
Laine Stump	f69a6b987d	network: only prevent forwarding of DNS requests for unqualified names In commit `f386825` we began adding the options --domain-needed --local=/$mydomain/ to all dnsmasq commandlines with the stated reason of preventing forwarding of DNS queries for names that weren't fully qualified domain names ("FQDN", i.e. a name that included some "."s and a domain name). This was later changed to domain-needed local=/$mydomain/ when we moved the options from the dnsmasq commandline to a conf file. The original patch on the list, and discussion about it, is here: https://www.redhat.com/archives/libvir-list/2012-August/msg01594.html When a domain name isn't specified (mydomain == ""), the addition of "domain-needed local=//" will prevent forwarding of domain-less requests to the virtualization host's DNS resolver, but if a domain is specified, the addition of "local=/domain/" will prevent forwarding of any requests for qualified names within that domain that aren't resolvable by libvirt's dnsmasq itself. An example of the problems this causes - let's say a network is defined with: <domain name='example.com'/> <dhcp> .. <host mac='52:54:00:11:22:33' ip='1.2.3.4' name='myguest'/> </dhcp> This results in "local=/example.com/" being added to the dnsmasq options. If a guest requests "myguest" or "myguest.example.com", that will be resolved by dnsmasq. If the guest asks for "www.example.com", dnsmasq will not know the answer, but instead of forwarding it to the host, it will return NOT FOUND to the guest. In most cases that isn't the behavior an admin is looking for. A later patch (commit `4f595ba`) attempted to remedy this by adding a "forwardPlainNames" attribute to the <dns> element. The idea was that if forwardPlainNames='yes' (default is 'no'), we would allow unresolved names to be forwarded. However, that patch was botched, in that it only removed the "domain-needed" option when forwardPlainNames='yes', and left the "local=/mydomain/". Really we should have been just including the option "--domain-needed --local=//" (note the lack of domain name) regardless of the configured domain of the network, so that requests for names without a domain would be treated as "local to dnsmasq" and not forwarded, but all others (including those in the network's configured domain) would be forwarded. We also shouldn't include either of those options if forwardPlainNames='yes'. This patch makes those corrections. This patch doesn't remedy the fact that default behavior was changed by the addition of this feature. That will be handled in a subsequent patch.	2014-02-04 12:00:26 +02:00
Martin Kletzander	b44f9e7ec9	spice: don't force user to specify spicevmc channel We support only one spicevmc channel name anyway and the code is prepared to use the default one, there's only one check missing. It is also mentioned in the documentation already and helps defining domains with spice vdagent for people using virsh. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2014-02-03 09:46:47 +01:00
John Ferlan	5c36e63198	Resolve Coverity dead_error_begin Coverity complains about default: label in libxl_driver.c not be able to be reached. It's by design for the code and since it's not necessary in the code nor does it elicit any compiler/make check warnings - just remove it rather than adding a coverity[dead_error_begin] tag. While I'm at it, lxc_driver.c and nodeinfo.c have the same design, so I removed the default labels and the existing coverity tags.	2014-01-31 12:48:01 -05:00
Daniel P. Berrange	6e5c79a1b5	Push nwfilter update locking up to top level The NWFilter code has as a deadlock race condition between the virNWFilter{Define,Undefine} APIs and starting of guest VMs due to mis-matched lock ordering. In the virNWFilter{Define,Undefine} codepaths the lock ordering is 1. nwfilter driver lock 2. virt driver lock 3. nwfilter update lock 4. domain object lock In the VM guest startup paths the lock ordering is 1. virt driver lock 2. domain object lock 3. nwfilter update lock As can be seen the domain object and nwfilter update locks are not acquired in a consistent order. The fix used is to push the nwfilter update lock upto the top level resulting in a lock ordering for virNWFilter{Define,Undefine} of 1. nwfilter driver lock 2. nwfilter update lock 3. virt driver lock 4. domain object lock and VM start using 1. nwfilter update lock 2. virt driver lock 3. domain object lock This has the effect of serializing VM startup once again, even if no nwfilters are applied to the guest. There is also the possibility of deadlock due to a call graph loop via virNWFilterInstantiate and virNWFilterInstantiateFilterLate. These two problems mean the lock must be turned into a read/write lock instead of a plain mutex at the same time. The lock is used to serialize changes to the "driver->nwfilters" hash, so the write lock only needs to be held by the define/undefine methods. All other methods can rely on a read lock which allows good concurrency. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-01-30 18:00:20 +00:00
Daniel P. Berrange	0240d94c36	Remove windows thread implementation in favour of pthreads There are a number of pthreads impls available on Win32 these days, in particular the mingw64 project has a good impl. Delete the native windows thread implementation and rely on using pthreads everywhere. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-01-30 18:00:20 +00:00
Daniel P. Berrange	c065984b58	Add a read/write lock implementation Add virRWLock backed up by a POSIX rwlock primitive Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-01-30 18:00:20 +00:00
Daniel P. Berrange	94e0906839	Skip check-augeas-lockd when QEMU is disabled The check-augeas-lockd test depends on the file locking/qemu-lockd.conf, so must be skipped when QEMU is disabled. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2014-01-30 18:00:20 +00:00
Osier Yang	b1b81efe9a	util: Accept test data path for scsi device's sg_path Commit `10c9ceff6d` intended to introduce new argument for the testing purpose, but it missed the similar changing of the device's sg_path. The problem was hidden since my laptop has the /dev/sg0 and /dev/sg1. A later patch will modify the tests accordingly. Signed-off-by: Osier Yang <jyang@redhat.com> Reported-by: Pavel Hrdina <phrdina@redhat.com>	2014-01-30 16:34:43 +01:00
Osier Yang	f406aa25f2	qemu: Fix the error message for scsi host device's shareable checking This fixes the wrong argument order.	2014-01-30 16:50:10 +08:00
Osier Yang	10c9ceff6d	util: Add one argument for several scsi utils To support passing the path of the test data to the utils, one more argument is added to virSCSIDeviceGetSgName, virSCSIDeviceGetDevName, and virSCSIDeviceNew, and the related code is changed accordingly. Later tests for the scsi utils will be based on this patch. Signed-off-by: Osier Yang <jyang@redhat.com>	2014-01-30 15:48:28 +08:00
Osier Yang	fd243fc4ad	qemu: Don't fail if the SCSI host device is shareable between domains It doesn't make sense to fail if the SCSI host device is specified as "shareable" explicitly between domains (NB, it works if and only if the device is specified as "shareable" for all domains, otherwise it fails). To fix the problem, this patch introduces an array for virSCSIDevice struct, which records all the names of domain which are using the device (note that the recorded domains must specify the device as shareable). And the change on the data struct brings on many subsequent changes in the code. Prior to this patch, the "shareable" tag didn't work as expected, it actually work like "non-shareable". So this patch also added notes in formatdomain.html to declare the fact. * src/util/virscsi.h: - Remove virSCSIDeviceGetUsedBy - Change definition of virSCSIDeviceGetUsedBy and virSCSIDeviceListDel - Add virSCSIDeviceIsAvailable * src/util/virscsi.c: - struct virSCSIDevice: Change "used_by" to be an array; Add "n_used_by" as the array count - virSCSIDeviceGetUsedBy: Removed - virSCSIDeviceFree: frees the "used_by" array - virSCSIDeviceSetUsedBy: Copy the domain name to avoid potential memory corruption - virSCSIDeviceIsAvailable: New - virSCSIDeviceListDel: Change the logic, for device which is already in the list, just remove the corresponding entry in "used_by". And since it's only used in one place, we can safely removing the code to find out the dev in the list first. - Copyright updating * src/libvirt_private.sys: - virSCSIDeviceGetUsedBy: Remove - virSCSIDeviceIsAvailable: New * src/qemu/qemu_hostdev.c: - qemuUpdateActiveScsiHostdevs: Check if the device existing before adding it to the list; - qemuPrepareHostdevSCSIDevices: Error out if the not all domains use the device as "shareable"; Also don't try to add the device to the activeScsiHostdevs list if it already there; And make more sensible error w.r.t the current "shareable" value in driver->activeScsiHostdevs. - qemuDomainReAttachHostScsiDevices: Change the logic according to the changes on helpers. Signed-off-by: Osier Yang <jyang@redhat.com>	2014-01-30 15:46:24 +08:00
Roman Bogorodskiy	d779d218d4	maint: add configure checks for BSD CPU affinity Check for presence of sys/cpuset.h header and cpuset_getaffinity() in configure instead of just using #ifdef __FreeBSD__ for that code.	2014-01-29 12:11:48 -07:00
Michal Privoznik	122cd16982	Revert "networkAllocateActualDevice: Set QoS for bridgeless networks too" This reverts commit `2996e6be19` and some parts of `2636dc8c4d`. The former one tried to implement QoS setting on bridgeless networks. However, as discussed upstream [1], the patch is far away from being useful in even a single case. The whole idea of network QoS is to have aggregated limits over several interfaces. This patch is doing completely the opposite when merging two QoS settings (from the network and the domain interface) into one which is then set at the domain interface itself, not the network. The latter one is the test for the previous one. Now none of them makes sense. 1: https://www.redhat.com/archives/libvir-list/2014-January/msg01441.html Conflicts: tests/virnetdevbandwidthtest.c: New test has been introduced since then.	2014-01-29 19:01:19 +01:00
Michal Privoznik	550a2ceffb	virCommand: Introduce virCommandSetDryRun There are some units within libvirt that utilize virCommand API to run some commands and deserve own unit testing. These units are, however, not desired to be rewritten to dig virCommand API usage out. As a great example virNetDevBandwidth could be used. The problem with the bandwidth unit is: it uses virCommand API heavily. Therefore we need a mechanism to not really run a command, but rather see its string representation after which we can decide if the unit construct the correct sequence of commands or not. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-01-29 18:01:36 +01:00
Peter Krempa	7076b4b72c	snapshot: Add support for specifying snapshot disk backing type Add support for specifying various types when doing snapshots. This will later allow to do snapshots on network backed volumes. Disks of type 'volume' are not supported by snapshots (yet). Also amend the test suite to check parsing of the various new disk types that can now be specified.	2014-01-29 12:56:35 +01:00
Jim Fehlig	37564b471d	xen: fix parsing xend http response Commit `df36af58` broke parsing of http response from xend. The prior use of atoi() would happily parse e.g. a string containing "200 OK\r\n", whereas virStrToLong_i() will fail when called with a NULL end_ptr. Change the calls to virStrToLong_i() to provide a non-NULL end_ptr.	2014-01-28 18:32:49 -07:00
Jiri Denemark	580ddf0d34	cpu: Try to use source CPU model in virConnectBaselineCPU https://bugzilla.redhat.com/show_bug.cgi?id=1049391 When all source CPU XMLs contain just a single CPU model (with a possibly varying set of additional feature elements), virConnectBaselineCPU will try to use this CPU model in the computed guest CPU. Thus, when used on just a single CPU (useful with VIR_CONNECT_BASELINE_CPU_EXPAND_FEATURES), the result will not use a different CPU model. If the computed CPU uses the source model, set fallback mode to 'forbid' to make sure the guest CPU will always be as close as possible to the source CPUs.	2014-01-28 21:27:37 +01:00
Jiri Denemark	802f157e8c	cpu: Fix VIR_CONNECT_BASELINE_CPU_EXPAND_FEATURES https://bugzilla.redhat.com/show_bug.cgi?id=1049391 VIR_CONNECT_BASELINE_CPU_EXPAND_FEATURES flag for virConnectBaselineCPU did not work if the resulting guest CPU would disable some features present in its base model. This patch makes sure we won't try to add such features twice.	2014-01-28 21:27:37 +01:00
Ján Tomko	1c1242a2ed	Reword error message for oversized cpu time fields	2014-01-28 08:48:32 +01:00
Ján Tomko	61cf8d8461	Simplify linuxNodeGetCPUStats Split out the repetitive code.	2014-01-28 08:48:31 +01:00
Roman Bogorodskiy	c022fbc9bb	BSD: implement virProcess{Get,Set}Affinity Implement virProcess{Get,Set}Affinity() using cpuset_getaffinity() and cpuset_setaffinity() calls. Quick search showed that they are only available on FreeBSD, so placed it inside existing #ifdef blocks for FreeBSD instead of adding configure checks.	2014-01-27 09:51:55 -07:00
Pradipta Kr. Banerjee	c6320d3463	Add hw random number generator (/dev/hwrng) to cgroup ACL Creating a qemu VM with /dev/hwrng as backend RNG device throws the following error - "Could not open '/dev/hwrng': Permission denied" This patch fixes the issue Signed-off-by: Pradipta Kr. Banerjee <bpradip@in.ibm.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2014-01-27 09:48:39 -07:00
Michal Privoznik	2996e6be19	networkAllocateActualDevice: Set QoS for bridgeless networks too https://bugzilla.redhat.com/show_bug.cgi?id=1055484 Currently, libvirt's XML schema of network allows QoS to be defined for every network even though it has no bridge. For instance: <network> <name>vdsm-no-bridge</name> <forward mode='passthrough'> <interface dev='em1.10'/> </forward> <bandwidth> <inbound average='1000' peak='5000' burst='1024'/> <outbound average='1000' burst='1024'/> </bandwidth> </network> The bandwidth limitations can be, however, applied even on such networks. In fact, they are going to be applied on the interface that will be connected to the network on a domain startup. This approach, however, has one limitation. With bridged networks, there are two points where QoS can be set: bridge and domain interface. The lower limit of the two is enforced then. For instance, if the interface has 10Mbps average, but the network only 1Mbps, there's no way for interface to transmit packets faster than the 1Mbps limit. With two points this is enforced by kernel. With only one point, we must combine both QoS settings into one which is set afterwards. Look at virNetDevBandwidthMinimal() and you'll understand immediately what I mean. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2014-01-27 12:11:27 +01:00
Ján Tomko	5099f745e6	Add test for linuxNodeGetCPUStats Check if cpu stats are read correctly from a sample /proc/stat collected from a 24 CPU machine.	2014-01-27 11:04:02 +01:00
Ján Tomko	b3b44c572c	Move test-local declarations to nodeinfopriv.h linuxNodeInfoCPUPopulate is only used in the nodeinfo.c file and in the test suite.	2014-01-27 11:04:02 +01:00
Oleg Strikov	29ea437e40	qemu: Enable 'host-passthrough' cpu mode for aarch64 This patch allows libvirt user to specify 'host-passthrough' cpu mode while using qemu/kvm backend on aarch64. It uses 'host' as a CPU model name instead of some other stub (correct CPU detection is not implemented yet) to allow libvirt user to specify 'host-model' cpu mode as well. Signed-off-by: Oleg Strikov <oleg.strikov@canonical.com> (crobinso: fix some indentation)	2014-01-25 12:10:02 -05:00
John Ferlan	46a0737e13	Block info query: Add check for transient domain Currently the qemuDomainGetBlockInfo will return allocation == physical for most backing stores. For a qcow2 block backed device it's possible to return the highest lv extent allocated from qemu for an active guest. That is a value where allocation != physical and one would hope be less. However, if the guest is not running, then the code falls back to returning allocation == physical. This turns out to be problematic for rhev which monitors the size of the backing store. During a migration, before the VM has been started on the target and while it is deemed inactive on the source, there's a small window of time where the allocation is returned as physical triggering the code to extend the file unnecessarily. Since rhev uses transient domains and this is edge condition for a transient domain, rather than returning good status and allocation == physical when this "window of opportunity" exists, this patch will check for a transient (or non persistent) domain and return a failure to the caller rather than returning the defaults. For a persistent domain, the defaults will be returned. The description for the virDomainGetBlockInfo has been updated to describe the phenomena.	2014-01-24 11:37:18 -05:00
Gao feng	71f7d5840f	qemu: remove memset params array to zero in qemuDomainGetPercpuStats the array params is allocated by VIR_ALLOC_N in remoteDispatchDomainGetCPUStats. it had been set to zero. No need to reset it to zero again, and this reset here is incorrect too, nparams * ncpus is the array length not the size of params array. Signed-off-by: Gao feng <gaofeng@cn.fujitsu.com>	2014-01-24 16:31:53 +08:00
Osier Yang	88ae5dc759	storage: Fix the memory leak The return value of virGetFCHostNameByWWN is a strdup'ed string. Also add comments to declare that the caller should take care of freeing it.	2014-01-23 21:39:05 +08:00
Osier Yang	7519958735	util: Fix the indention Left in the git cache without commit before pushing. Pushed under build breaker and trivial rule.	2014-01-23 18:16:11 +08:00
Osier Yang	2b66504ded	util: Add "shareable" field for virSCSIDevice struct Unlike the host devices of other types, SCSI host device XML supports "shareable" tag. This patch introduces it for the virSCSIDevice struct for a later patch use (to detect if the SCSI device is shareable when preparing the SCSI host device in QEMU driver).	2014-01-23 17:52:33 +08:00
Osier Yang	2340f0196f	storage: Fix autostart of pool with "fc_host" type adapter The "checkPool" is a bit different for pool with "fc_host" type source adapter, since the vHBA it's based on might be not created yet (it's created by "startPool", which is involked after "checkPool" in storageDriverAutostart). So it should not fail, otherwise the "autostart" of the pool will fail either. The problem is easy to reproduce: * Enable "autostart" for the pool * Restart libvirtd service * Check the pool's state	2014-01-23 17:50:29 +08:00

... 2 3 4 5 6 ...

11423 Commits