libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-12-30 17:45:23 +00:00

Author	SHA1	Message	Date
Laine Stump	d5e6d1cfc7	Revert "qemu: Allow to plug virtio-net-pci into PCIe slot" This reverts commit `ede34470fd`, which was apparently written based on testing performed before commits `1e15be1` and 9a12b6 were pushed upstream. Once those two patches are in place, commit `ede34470` is redundant, and can even cause incorrect/unexpected behavior when auto-assigning addresses for virtio-net devices.	2015-08-12 11:23:29 -04:00
Michal Privoznik	f7fba69bd7	networkBandwidthGenericChecks: Drop useless check There's a check right at the beginning of the function that shortcuts if the function was called over all NULL arguments. However, this was meant just as a fool-proof check so that we don't crash if function is used in a bad manner. Anyway, it makes Coverity unhappy as it then thinks any of the arguments could be NULL. Well, with the current state of the code it can't. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-12 10:57:36 +02:00
Michal Privoznik	ef6b3b625d	networkBandwidthUpdate: Don't blindly dereference pointers It may happen that an interface don't have any bandwidth set and a new one is to be set. In that case, @ifaceBand will be NULL. This will cause troubles later in the code when deciding what to do. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-12 10:57:36 +02:00
Cole Robinson	c7790408d7	domain: Fix crash if trying to live update disk <serial> If you pass <disk><serial> XML to UpdateDevice, and the original device didn't have a <serial> block, libvirtd crashes trying to read the original NULL serial string. Use _NULLABLE string comparisons to avoid the crash. A couple other properties needed the change too.	2015-08-11 17:33:34 -04:00
Laine Stump	9bd16ad3b4	qemu: fix qemuDomainSupportsPCI() for ARM machines of "virt" machinetype Commit `e8d5517` updated the domain post-parse to automatically add pcie-root et al for certain ARM "virt" machinetypes, but didn't update the function qemuDomainSupportsPCI() which is called later on when we are auto-assigning PCI addresses and default settings for the PCI controller <model> and <target> attributes. The result was that PCI addresses weren't assigned, and the controllers didn't have their attribute default values set, leading to an error when the domain was started, e.g.: internal error: autogenerated dmi-to-pci-bridge options not set This patch adds the same check made in the earlier patch to qemuDomainSupportsPCI(), so that PCI address auto-assignment and target/model default values will be set.	2015-08-11 16:11:05 -04:00
Guido Günther	fbb27088ee	virNetSocketCheckProtocols: handle EAI_NONAME as IPv6 unavailable When running the test suite using "unshare -n" we might have IPv6 but no configured addresses. Due to AI_ADDRCONFIG getaddrinfo then fails with EAI_NONAME which we should then treat as IPv6 unavailable.	2015-08-11 22:08:50 +02:00
Laine Stump	bfaaa2b681	util: don't overwrite stack when getting ethtool gfeatures This fixes the crash described here: https://www.redhat.com/archives/libvir-list/2015-August/msg00162.html In short, we were calling ioctl(SIOCETHTOOL) pointing to a too-short object that was a local on the stack, resulting in the memory past the end of the object being overwritten. This was because the struct used by the ETHTOOL_GFEATURES command of SIOCETHTOOL ends with a 0-length array, but we were telling ethtool that it could use 2 elements on the array. The fix is to allocate the necessary memory with VIR_ALLOC_VAR(), including the extra length needed for a 2 element array at the end.	2015-08-11 15:29:14 -04:00
Andrea Bolognani	133c25c81c	cpu: Fix segfault in the ppc64 driver Commit `adb865d` introduced some changes in ppc64DriverNodeData() that cause libvirtd to crash on startup unless this patch is applied as well.	2015-08-11 18:05:04 +02:00
Michal Privoznik	b044e3257f	qemu: Implement VIR_DOMAIN_BANDWIDTH_IN_FLOOR Well, there are just two places that needs adjustment: qemuDomainGetInterfaceParameters - to report the @floor qemuDomainSetInterfaceParameters - now that the function has been fixed, we can allow updating @floor too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Michal Privoznik	5ee6d243fc	qemuDomainSetInterfaceParameters: Use new functions to update bandwidth As sketched in previous commits, imagine the following scenario: virsh # domiftune gentoo vnet0 inbound.average: 100 inbound.peak : 0 inbound.burst : 0 outbound.average: 100 outbound.peak : 0 outbound.burst : 0 virsh # domiftune gentoo vnet0 --inbound 0 virsh # shutdown gentoo Domain gentoo is being shutdown virsh # list --all error: Failed to list domains error: Cannot recv data: Connection reset by peer Program received signal SIGSEGV, Segmentation fault. 0x00007fffe80ea221 in networkUnplugBandwidth (net=0x7fff9400c1a0, iface=0x7fff940ea3e0) at network/bridge_driver.c:4881 4881 net->floor_sum -= ifaceBand->in->floor; This is rather unfortunate. We should not SIGSEGV here. The problem is, that while in the second step the inbound QoS was cleared out, the network part of it was not updated (moreover, we don't report that vnet0 had inbound.floor set). Internal structure therefore still had some fragments left (e.g. class_id). So when qemuProcessStop() started to clean up the environment it got to networkUnplugBandwidth(). Here, class_id is set therefore function assumes that there is an inbound QoS. This actually is a fair assumption to make, there's no need for a special QoS box in network's QoS when there's no QoS to set. Anyway, the problem is not the networkUnplugBandwidth() rather than qemuDomainSetInterfaceParameters() which completely forgot about QoS being disperse (some parts are set directly on interface itself, some on bridge the interface is plugged into). Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Michal Privoznik	812932bea2	bridge_driver: Introduce networkBandwidthUpdate So, if a domain vNIC's bandwidth has been successfully set, it's possible that because @floor is set on network's bridge, this part may need updating too. And that's exactly what this function does. While the previous commit introduced a function to check if @floor can be satisfied, this does all the hard work. In general, there may be three, well four possibilities: 1) No change in @floor value (either it remain unset, or its value hasn't changed) 2) The @floor value has changed from a non-zero to a non-zero value 3) New @floor is to be set 4) Old @floor must be cleared out The difference between 2), 3) and 4) is, that while in 2) the QoS tree on the network's bridge already has a special class for the vNIC, in 3) the class must be created from scratch. In 4) it must be removed. Fortunately, we have helpers for all three interesting cases. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Michal Privoznik	41a1531de5	bridge_driver: Introduce networkBandwidthChangeAllowed When a domain vNIC's bandwidth is to be changed (at runtime) it is possible that guaranteed minimal bandwidth (@floor) will change too. Well, so far it is, because we still don't have an implementation that allows setting it dynamically, so it's effectively erased on: #virsh domiftune $dom vnet0 --inbound 0 However, that's slightly unfortunate. We do some checks on domain startup to see if @floor can be guaranteed. We ought do the same if QoS is changed at runtime. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Michal Privoznik	45090449c4	virNetDevBandwidthUpdateRate: turn class_id into integer This is no functional change. It's just that later in the series we will need to pass class_id as an integer. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Michal Privoznik	327bc16a05	virNetDevParseMcast: Avoid magic constant There is no guarantee that an enum start it mapped onto a value of zero. However, we are guaranteed that enum items are consecutive integers. Moreover, it's a pity to define an enum to avoid using magical constants but then using them anyway. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-11 16:10:32 +02:00
Martin Kletzander	1f24c1494a	conf: Don't try formating non-existing addresses Commit `a6f9af8292` added checking for address colisions between starting and ending addresses of forwarding addresses, but forgot that there might be no addresses set at all. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-11 16:07:41 +02:00
Andrea Bolognani	344d1675e8	cpu: Forbid model fallback in the ppc64 driver Unlike what happens on x86, on ppc64 you can't mix and match CPU features to obtain the guest CPU you want regardless of the host CPU, so the concept of model fallback doesn't apply. Make sure CPU definitions emitted by the driver, eg. as output of the cpuBaseline() and cpuUpdate() calls, reflect this fact.	2015-08-11 15:25:28 +02:00
Andrea Bolognani	dee2247afa	cpu: Implement backwards compatibility in the ppc64 driver All previously recognized CPU models (POWER7_v2.1, POWER7_v2.3, POWER7+_v2.1 and POWER8_v1.0) are internally converted to the corrisponding generation name so that existing guests don't stop working.	2015-08-11 15:25:24 +02:00
Andrea Bolognani	36300d2ba1	cpu: Add POWER8NVL information to CPU map XML This is yet another variation of POWER8. The PVR information comes from arch/powerpc/kernel/cputable.c in the Linux kernel tree.	2015-08-11 14:09:43 +02:00
Andrea Bolognani	5d0aa93c50	cpu: Parse and use PVR masks in the ppc64 driver Instead of relying on a hard-coded mask value, read it from the CPU map XML and use it when looking up models by PVR.	2015-08-11 14:09:37 +02:00
Andrea Bolognani	d87359af5e	cpu: Simplify ppc64 part of CPU map XML Use multiple PVRs per CPU model to reduce the number of models we need to keep track of. Remove specific CPU models (eg. POWER7+_v2.1): the corresponding generic CPU model (eg. POWER7) should be used instead to ensure the guest can be booted on any compatible host. Get rid of all the entries that did not match any of the CPU models supported by QEMU, like power8 and power8e. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1250977	2015-08-11 14:05:35 +02:00
Andrea Bolognani	59fcc96195	cpu: Support multiple PVRs in the ppc64 driver This will allow us to perform PVR matching more broadly, eg. consider both POWER8 and POWER8E CPUs to be the same even though they have different PVR values.	2015-08-11 14:05:25 +02:00
Andrea Bolognani	adb865df85	cpu: Align ppc64 CPU data with x86 Use a typedef instead of the plain struct and heap allocation. This will make it easier to extend the ppc64 specific CPU data later on.	2015-08-11 11:04:57 +02:00
Andrea Bolognani	d574094d30	cpu: Use ppc64Compute() to implement ppc64DriverCompare() This ensures comparison of two CPU definitions will be consistent regardless of the fact that it is performed using cpuCompare() or cpuGuestData(). The x86 driver uses the same exact code.	2015-08-11 11:04:57 +02:00
Andrea Bolognani	96b2c7459c	cpu: CPU model names have to match on ppc64 Limitations of the POWER architecture mean that you can't run eg. a POWER7 guest on a POWER8 host when using KVM. This applies to all guests, not just those using VIR_CPU_MATCH_STRICT in the CPU definition; in fact, exact and strict CPU matching are basically the same on ppc64. This means, of course, that hosts using different CPUs have to be considered incompatible as well. Change ppc64Compute(), called by cpuGuestData(), to reflect this fact and update test cases accordingly. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1250977	2015-08-11 11:04:57 +02:00
Andrea Bolognani	8382136d42	cpu: Never skip CPU model name check in ppc64 driver ppc64Compute(), called by cpuNodeData(), is used not only to retrieve the driver-specific data associated to a guest CPU definition, but also to check whether said guest CPU is compatible with the host CPU. If the user is not interested in the CPU data, it's perfectly fine to pass a NULL pointer instead of a return location, and the compatibility data returned should not be affected by this. One of the checks, specifically the one on CPU model name, was however only performed if the return location was non-NULL.	2015-08-11 11:04:57 +02:00
Andrea Bolognani	cb8c0e1102	cpu: Remove ISA information from CPU map XML The information is not used anywhere in libvirt. No functional changes.	2015-08-11 11:04:57 +02:00
Andrea Bolognani	b85b51f2a5	cpu: Reorder functions in the ppc64 driver Having the functions grouped together this way will avoid further shuffling around down the line. No functional changes.	2015-08-11 11:04:56 +02:00
Andrea Bolognani	c238d16af4	cpu: Simplify ppc64ModelFromCPU()	2015-08-11 11:04:56 +02:00
Andrea Bolognani	4590f0678f	cpu: Simplify NULL handling in ppc64 driver Use briefer checks, eg. (!model) instead of (model == NULL), and avoid initializing to NULL a pointer that would be assigned in the first line of the function anyway. Also remove a pointless NULL assignment. No functional changes.	2015-08-11 11:04:56 +02:00
Andrea Bolognani	2686bf2292	cpu: Mark driver functions in ppc64 driver Use the ppc64Driver prefix for all functions that are used to fill in the cpuDriverPPC64 structure, ie. those that are going to be called by the generic CPU code. This makes it clear which functions are exported and which are implementation details; it also gets rid of the ambiguity that affected the ppc64DataFree() function which, despite what the name suggested, was not related to ppc64DataCopy() and could not be used to release the memory allocated for a virCPUppc64Data* instance. No functional changes.	2015-08-11 11:04:56 +02:00
Laine Stump	f4f1d18dc4	qemu: fail on attempts to use <filterref> for non-tap network connections nwfilter uses iptables and ebtables, which only work properly on tap-based network connections (not on macvtap, for example), but we just ignore any <filterref> elements for other types of networks, potentially giving users a false sense of security. This patch checks the network type and fails/logs an error if any domain <interface> has a <filterref> when the connection isn't using a tap device. This resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1180011	2015-08-10 13:08:41 -04:00
Laine Stump	a6f9af8292	network: validate network NAT range This patch modifies virSocketAddrGetRange() to function properly when the containing network/prefix of the address range isn't known, for example in the case of the NAT range of a virtual network (since it is a range of addresses on the host, not within the network itself). We then take advantage of this new functionality to validate the NAT range of a virtual network. Extra test cases are also added to verify that virSocketAddrGetRange() works properly in both positive and negative cases when the network pointer is NULL. This is the real fix for: https://bugzilla.redhat.com/show_bug.cgi?id=985653 Commits 1e334a and 48e8b9 had earlier been pushed as fixes for that bug, but I had neglected to read the report carefully, so instead of fixing validation for the NAT range, I had fixed validation for the DHCP range. sigh.	2015-08-10 13:06:56 -04:00
Martin Kletzander	cf0404455c	qemu: Enable ioeventfd usage for virtio-scsi controllers Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1150484 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-10 15:05:34 +02:00
Martin Kletzander	35eecddee3	conf: Add ioeventfd option for controllers This will be used with a virtio-scsi controller later on. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-10 15:05:34 +02:00
Michal Privoznik	2a5d3f227d	virNetDevBandwidthParseRate: Reject negative values https://bugzilla.redhat.com/show_bug.cgi?id=1022292 The following XML really does not make any sense: <inbound average="-1" burst="-2" peak="-3" floor="-4"/> There can't be a negative packet rate. Well, so far we haven't assigned any meaning to it. So reject it unless users harm themselves, because otherwise we turn the negative numbers into really big values. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-10 13:47:48 +02:00
Cao jin	17cba9fb51	qemuMonitorOpenInternal: remove redundant code There's no need to set mon->fd to a dummy value since it's initialized to proper value just a few lines below. Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-08-10 13:47:33 +02:00
Martin Kletzander	a8743c3938	rpc: Remove keepalive_required option Since its introduction in 2011 (particularly in commit `f4324e3292`), the option doesn't work. It just effectively disables all incoming connections. That's because the client private data that contain the 'keepalive_supported' boolean, are initialized to zeroes so the bool is false and the only other place where the bool is used is when checking whether the client supports keepalive. Thus, according to the server, no client supports keepalive. Removing this instead of fixing it is better because a) apparently nobody ever tried it since 2011 (4 years without one month) and b) we cannot know whether the client supports keepalive until we get a ping or pong keepalive packet. And that won't happen until after we dispatched the ConnectOpen call. Another two reasons would be c) the keepalive_required was tracked on the server level, but keepalive_supported was in private data of the client as well as the check that was made in the remote layer, thus making all other instances of virNetServer miss this feature unless they all implemented it for themselves and d) we can always add it back in case there is a request and a use-case for it. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-10 13:15:56 +02:00
Cao jin	b1ad57ecd2	fix typo in comments Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-08-10 09:39:13 +02:00
Michal Privoznik	bc359f77f3	virDomainCoreDumpWithFormat: Mention enum for @dumpformat So the API takes @dumpformat argument. This is what makes it special when compared to virDomainCoreDump. The argument is there so that users can choose the format of resulting core dump file. And to ease them the choosing process we even have an enum with supported values across all the hypervisors. But we don't mention the enum in the function description anywhere. Fix it! Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-10 08:40:27 +02:00
Laine Stump	6a21bc119e	network: verify proper address family in updates to <host> and <range> By specifying parentIndex in a call to virNetworkUpdate(), it was possible to direct libvirt to add a dhcp range or static host of a non-matching address family to the <dhcp> element of an <ip>. For example, given: <ip address='192.168.122.1' netmask='255.255.255.0'/> <ip family='ipv6' address='2001:db6:ca3:45::1' prefix='64'/> you could provide a static host entry with an IPv4 address, and specify that it be added to the 2nd <ip> element (index 1): virsh net-update default add ip-dhcp-host --parent-index 1 \ '<host mac="52:54:00:00:00:01" ip="192.168.122.45"/>' This would be happily added with no error (and no concern of any possible future consequences). This patch checks that any dhcp range or host element being added to a network ip's <dhcp> subelement has addresses of the same family as the ip element they are being added to. This resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1184736	2015-08-10 02:38:41 -04:00
Laine Stump	7d69387cd6	qemu: support new pci controller model "pcie-switch-downstream-port" This is backed by the qemu device xio3130-downstream. It can only be connected to a pcie-switch-upstream-port (x3130-upstream) on the upstream side.	2015-08-09 22:32:00 -04:00
Laine Stump	76379a6ec1	conf: new pcie-controller model "pcie-switch-downstream-port" This controller can be connected only to a port on a pcie-switch-upstream-port. It provides a single hotpluggable port that will accept any PCI or PCIe device, as well as any device requiring a pcie-*-port (the only current example of such a device is the pcie-switch-upstream-port).	2015-08-09 22:30:47 -04:00
Laine Stump	ad1748a1aa	qemu: add capabilities bit for device xio3130-downstream The downstream ports of an x3130-upstream switch can each have one of these plugged into them (and that is the only place they can be connected). Each xio3130-downstream provides a single PCIe port that can have PCI or PCIe devices hotplugged into it. Apparently an entire set of x3130-upstream + several xio3130-downstreams can be hotplugged as a unit, but it's not clear to me yet how that would be done, since qemu only allows attaching a single device at a time. This device will be used to implement the "pcie-switch-downstream-port" model of pci controller.	2015-08-09 22:29:25 -04:00
Laine Stump	cb99086d1b	qemu: support new pci controller model "pcie-switch-upstream-port" this is backed by the qemu device x3130-upstream. It can only plug into a pcie-root-port or pcie-switch-downstream-port.	2015-08-09 22:16:10 -04:00
Laine Stump	38ea9515af	conf: new pci controller model "pcie-switch-upstream-port" This controller can be connected only to a pcie-root-port or a pcie-switch-downstream-port (which will be added in a later patch), which is the reason for the new connect type VIR_PCI_CONNECT_TYPE_PCIE_PORT. A pcie-switch-upstream-port provides 32 ports (slot=0 to slot=31) on the downstream side, which can only have pci controllers of model "pcie-switch-downstream-port" plugged into them, which is the reason for the other new connect type VIR_PCI_CONNECT_TYPE_PCIE_SWITCH.	2015-08-09 22:12:29 -04:00
Laine Stump	4cde758808	qemu: add capabilities bit for device x3130-upstream This is the upstream part of a PCIe switch. It connects to a PCIe port (but not PCI) on the upstream side, and can have up to 31 xio3130-downstream controllers (but no other types of devices) connected to its downstream side. This device will be used to implement the "pcie-switch-upstream-port" model of pci controller.	2015-08-09 22:02:16 -04:00
Laine Stump	16328520f6	qemu: support new pci controller model "pcie-root-port" This is backed by the qemu device ioh3420. chassis and port from the <target> subelement are used to store/set the respective qemu device options for the ioh3420. Currently, chassis is set to be the index of the controller, and port is set to "(slot << 3) + function" (per suggestion from Alex Williamson).	2015-08-09 21:58:55 -04:00
Laine Stump	dce3b8beb3	conf: new pci controller model "pcie-root-port" This controller can be connected (at domain startup time only - not hotpluggable) only to a port on the pcie root complex ("pcie-root" in libvirt config), hence the new connect type VIR_PCI_CONNECT_TYPE_PCIE_ROOT. It provides a hotpluggable port that will accept any PCI or PCIe device. New attributes must be added to the controller <target> subelement for this - chassis and port are guest-visible option values that will be set by libvirt with values derived from the controller's index and pci address information.	2015-08-09 21:52:52 -04:00
Laine Stump	408b100a06	qemu: add capabilities bit for device ioh3420 This is a PCIE "root port". It connects only to a port of the integrated pcie.0 bus of a Q35 machine (can't be hotplugged), and provides a single PCIe port that can have PCI or PCIe devices hotplugged into it. This device will be used to implement the "pcie-root-port" model of pci controller.	2015-08-09 21:44:11 -04:00
Laine Stump	18c104516e	qemu: implement <target chassisNr='n'/> subelement/attribute of <controller> This uses the new subelement/attribute in two ways: 1) If a "pci-bridge" pci controller has no chassisNr attribute, it will automatically be set to the controller's index as soon as the controller's PCI address is known (during qemuDomainAssignPCIAddresses()). 2) when creating the commandline for a pci-bridge device, chassisNr will be used to set qemu's chassis_nr option (rather than the previous practice of hard-coding it to the controller's index).	2015-08-09 21:40:40 -04:00
Laine Stump	8dc88aeed6	conf: add new <target> subelement with chassisNr attribute to <controller> There are some configuration options to some types of pci controllers that are currently automatically derived from other parts of the controller's configuration. For example, in qemu a pci-bridge controller has an option that is called "chassis_nr"; up until now libvirt has always set chassis_nr to the index of the pci-bridge. So this: <controller type='pci' model='pci-bridge' index='2'/> will always result in: -device pci-bridge,chassis_nr=2,... on the qemu commandline. In the future we may decide there is a better way to derive that option, but even in that case we will need for existing domains to retain the same chassis_nr they were using in the past - that is something that is visible to the guest so it is part of the guest ABI and changing it would lead to problems for migrating guests (or just guests with very picky OSes). The <target> subelement has been added as a place to put the new "chassisNr" attribute that will be filled in by libvirt when it auto-generates the chassisNr; it will be saved in the config, then reused any time the domain is started: <controller type='pci' model='pci-bridge' index='2'> <model type='pci-bridge'/> <target chassisNr='2'/> </controller> The one oddity of all this is that if the controller configuration is changed (for example to change the index or the pci address where the controller is plugged in), the items in <target> will not be re-generated, which might lead to conflict. I can't really see any way around this, but fortunately if there is a material conflict qemu will let us know and we will pass that on to the user.	2015-08-09 21:35:00 -04:00
Laine Stump	572ebdbce7	qemu: implement <model> subelement to <controller> This patch provides qemu support for the contents of <model> in <controller> for the two existing PCI controller types that need it (i.e. the two controller types that are backed by a device that must be specified on the qemu commandline): 1) pci-bridge - sets <model> name attribute default as "pci-bridge" 2) dmi-to-pci-bridge - sets <model> name attribute default as "i82801b11-bridge". These both match current hardcoded practice. The defaults are set at the end of qemuDomainAssignPCIAddresses(). This can't be done earlier because some of the options that will be autogenerated need full PCI address info for the controller, and because qemuDomainAssignPCIAddresses() might create extra controllers which would need default settings added, and that hasn't yet been done at the time the PostParse callbacks are being run. qemuDomainAssignPCIAddresses() is still called prior to the XML being written to disk, though, so the autogenerated defaults are persistent. qemu capabilities bits aren't checked when the domain is defined, but rather when the commandline is actually created (so the domain can possibly be defined on a host that doesn't yet have support for the given device, or a host different from the one where it will eventually be run). When the commandline is being generated we compare the modelName to known qemu device names implementing the given type of controller, and check the capabilities bit for that device.	2015-08-09 21:33:58 -04:00
Laine Stump	bf20251048	conf: add new <model> subelement with name attribute to <controller> This new subelement is used in PCI controllers: the toplevel attribute "model" of a controller denotes what kind of PCI controller is being described, e.g. a "dmi-to-pci-bridge", "pci-bridge", or "pci-root". But in the future there will be different implementations of some of those types of PCI controllers, which behave similarly from libvirt's point of view (and so should have the same model), but use a different device in qemu (and present themselves as a different piece of hardware in the guest). In an ideal world we (i.e. "I") would have thought of that back when the pci controllers were added, and used some sort of type/class/model notation (where class was used in the way we are now using model, and model was used for the actual manufacturer's model number of a particular family of PCI controller), but that opportunity is long past, so as an alternative, this patch allows selecting a particular implementation of a pci controller with the "name" attribute of the <model> subelement, e.g.: <controller type='pci' model='dmi-to-pci-bridge' index='1'> <model name='i82801b11-bridge'/> </controller> In this case, "dmi-to-pci-bridge" is the kind of controller (one that has a single PCIe port upstream, and 32 standard PCI ports downstream, which are not hotpluggable), and the qemu device to be used to implement this kind of controller is named "i82801b11-bridge". Implementing the above now will allow us in the future to add a new kind of dmi-to-pci-bridge that doesn't use qemu's i82801b11-bridge device, but instead uses something else (which doesn't yet exist, but qemu people have been discussing it), all without breaking existing configs. (note that for the existing "pci-bridge" type of PCI controller, both the model attribute and <model> name are 'pci-bridge'. This is just a coincidence, since it turns out that in this case the device name in qemu really is a generic 'pci-bridge' rather than being the name of some real-world chip)	2015-08-09 21:29:27 -04:00
Laine Stump	f8fe8f0345	conf: more useful error message when pci function is out of range If a pci address had a function number out of range, the error message would be: Insufficient specification for PCI address which is logged by virDevicePCIAddressParseXML() after virDevicePCIAddressIsValid returns a failure. This patch enhances virDevicePCIAddressIsValid() to optionally report the error itself (since it is the place that decides which part of the address is "invalid"), and uses that feature when calling from virDevicePCIAddressParseXML(), so that the error will be more useful, e.g.: Invalid PCI address function=0x8, must be <= 7 Previously, virDevicePCIAddressIsValid didn't check for the theoretical limits of domain or bus, only for slot or function. While adding log messages, we also correct that ommission. (The RNG for PCI addresses already enforces this limit, which by the way means that we can't add any negative tests for this - as far as I know our domainschematest has no provisions for passing XML that is supposed to fail). Note that virDevicePCIAddressIsValid() can only check against the absolute maximum attribute values for any possible PCI controller, not for the actual maximums of the specific controller that this device is attaching to; fortunately there is later more specific validation for guest-side PCI addresses when building the set of assigned PCI addresses. For host-side PCI addresses (e.g. for <hostdev> and for network device pools), we rely on the error that will be logged when it is found that the device doesn't actually exist. This resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1004596	2015-08-08 18:37:35 -04:00
Michal Privoznik	82af954c52	virDomainDefParseXML: Check for malicious cpu ids in <numa/> https://bugzilla.redhat.com/show_bug.cgi?id=1176020 Some users think this is a good idea: <vcpu placement='static'>4</vcpu> <cpu mode='host-model'> <model fallback='allow'/> <numa> <cell id='0' cpus='0-1' memory='1048576' unit='KiB'/> <cell id='1' cpus='9-10' memory='2097152' unit='KiB'/> </numa> </cpu> It's not. Lets therefore introduce a check and discourage them in doing so. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-07 17:19:07 +02:00
Michal Privoznik	8f2535dec1	numa_conf: Introduce virDomainNumaGetMaxCPUID This function should return the greatest CPU number set in /domain/cpu/numa/cell/@cpus. The idea is that we should compare the returned value against /domain/vcpu value. Yes, there exist users who think the following is a good idea: <vcpu placement='static'>4</vcpu> <cpu mode='host-model'> <model fallback='allow'/> <numa> <cell id='0' cpus='0-1' memory='1048576' unit='KiB'/> <cell id='1' cpus='9-10' memory='2097152' unit='KiB'/> </numa> </cpu> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-07 17:19:03 +02:00
Peter Krempa	8dc2725925	qemu: Fix reporting of physical capacity for block devices Qemu reports physical size 0 for block devices. As `15fa84acbb` changed the behavior of qemuDomainGetBlockInfo to just query the monitor this created a regression since we didn't report the size correctly any more. This patch adds code to refresh the physical size of a block device by opening it and seeking to the end and uses it both in qemuDomainGetBlockInfo and also in qemuDomainGetStatsOneBlock that was broken since it was introduced in this respect. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1250982	2015-08-07 13:28:50 +02:00
Shivaprasad G Bhat	e3810db34f	Allow vfio hotplug of a device to the domain which owns the iommu The commit `7e72de4` didn't consider the hotplug scenarios. The patch addresses the hotplug case whereby if atleast one of the pci function is owned by a guest, the hotplug of other functions/devices in the same iommu group to the same guest goes through successfully. Signed-off-by: Shivaprasad G Bhat <sbhat@linux.vnet.ibm.com>	2015-08-06 17:55:03 +02:00
Michal Privoznik	c646814438	qemuDomainDefPostParse: Adjust indent While reviewing `e8d551725` I've noticed a few unaligned lines. Fix this. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-06 15:33:01 +02:00
Pavel Fedin	ede34470fd	qemu: Allow to plug virtio-net-pci into PCIe slot virtio-net-pci adapter is capable to use irqfd with vhost-net only in MSI-X mode, which appears to be available only on PCIe bus, at least on ARM Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-08-06 14:28:05 +02:00
Pavel Fedin	8b78ec011c	qemu: Build correct command line for PCI NICs on ARM Legacy -net option works correctly only with embedded device models, which do not require any bus specification. Therefore, we should use -device for PCI hardware Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-08-06 14:25:02 +02:00
Pavel Fedin	e8d5517254	qemu: Add PCI-Express root to ARM virt machine Here we assume that if qemu supports generic PCI host controller, it is a part of virt machine and can be used for adding PCI devices. In qemu this is actually a PCIe bus, so we also declare multibus capability so that 0'th bus is specified to qemu correctly as 'pcie.0' Signed-off-by: Pavel Fedin <p.fedin@samsung.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-08-06 14:24:51 +02:00
Pavel Fedin	8a482abf75	qemu: Introduce QEMU_CAPS_OBJECT_GPEX This capability specifies that qemu can implement generic PCI host controller. It is often used for virtual environments, including ARM. Signed-off-by: Pavel Fedin <p.fedin@samsung.com>	2015-08-06 13:59:22 +02:00
Peter Krempa	6da3b694cc	qemu: Forbid image pre-creation for non-shared storage migration Libvirt doesn't reliably know the location of the backing chain when pre-creating images for non-shared migration. This isn't a problem for full copy, but incremental copy requires the information. Forbid pre-creating the image in cases where incremental migration is required. This limitation can perhaps be lifted once libvirt will fully support loading of backing chain information from the XML. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1249587	2015-08-05 17:24:59 +02:00
Andrea Bolognani	4ff3e939e7	cpu: Indentation changes in the ppc64 driver	2015-08-05 13:30:16 +02:00
Andrea Bolognani	3d1515890b	cpu: Rename {powerpc,ppc} => ppc64 (internal symbols) Update the names of the symbols used internally by the driver. No functional changes.	2015-08-05 13:30:16 +02:00
Andrea Bolognani	e89733c4a4	cpu: Rename {powerpc,ppc} => ppc64 (exported symbols) Only the symbols exported by the driver have been updated; the driver implementation itself still uses the old names internally. No functional changes.	2015-08-05 13:30:16 +02:00
Andrea Bolognani	ef770f0160	cpu: Rename {powerpc,ppc} => ppc64 (filesystem) The driver only supports VIR_ARCH_PPC64 and VIR_ARCH_PPC64LE. Just shuffling files around and updating the build system accordingly. No functional changes.	2015-08-05 13:30:16 +02:00
John Ferlan	a16871fef7	conf: Resolve Coverity FORWARD_NULL The recent changes to perform SCSI device address checks during the post parse callbacks ran afoul of the Coverity checker since the changes assumed that the 'xmlopt' parameter to virDomainDeviceDefPostParse would be non NULL (commit id 'ca2cf74e87'); however, what was missed is there was an "if (xmlopt &&" check being made, so Coverity believed that it could be possible for a NULL 'xmlopt'. Checking the various calling paths seemingly disproves that. If called from virDomainDeviceDefParse, there were two other possible calls that would end up dereffing, so that path could not be NULL. If called via virDomainDefPostParseDeviceIterator via virDomainDefPostParse there are two callers (virDomainDefParseXML and qemuParseCommandLine) which deref xmlopt either directly or through another call. So I'm removing the check for non-NULL xmlopt.	2015-08-05 05:44:46 -04:00
John Ferlan	36025c552c	conf: Allow error reporting in virDomainDiskSourceIsBlockType Rather than provide a somewhat generic error message when the API returns false, allow the caller to supply a "report = true" option in order to cause virReportError's to describe which of the 3 paths that can cause failure. Some callers don't care about what caused the failure, they just want to have a true/false - for those, calling with report = false should be sufficient.	2015-08-04 07:19:25 -04:00
Kothapally Madhu Pavan	d9557572ae	Avoid starting a PowerPC VM with floppy disk PowerPC pseries based VMs do not support a floppy disk controller. This prohibits libvirt from creating qemu command with floppy device. Signed-off-by: Kothapally Madhu Pavan <kmp@linux.vnet.ibm.com> https://bugzilla.redhat.com/show_bug.cgi?id=1180486 Signed-off-by: Ján Tomko <jtomko@redhat.com>	2015-08-04 10:17:07 +02:00
Kothapally Madhu Pavan	020a178318	Caps: Disable floppy disk for PowerPC VM PowerPC pseries based VMs do not support a floppy disk controller. This prohibits libvirt from adding floppy disk for a PowerPC pseries VM. Signed-off-by: Kothapally Madhu Pavan <kmp@linux.vnet.ibm.com>	2015-08-04 10:16:20 +02:00
John Ferlan	e1dbce1589	conf: Change when virDomainDiskDefAssignAddress is called Rather than calling virDomainDiskDefAssignAddress during the parsing of the XML, moving the setting of disk addresses into the domain/device post processing. Commit id '37588b25' which introduced VIR_DOMAIN_DEF_PARSE_DISK_SOURCE in order to avoid generating the address which wasn't required will not be affected by this as all it cared about was processing the source XML. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 17:06:15 -04:00
John Ferlan	9bcdb21005	conf: Remove unused param from virDomainHostdevDefParseXML Remove unused xmlopt param Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	0d8b24f6b6	conf: Change when virDomainHostdevAssignAddress is called Rather than calling virDomainHostdevAssignAddress during the parsing of the XML, move the setting of a default hostdev address to domain/ device post processing. Since the parse code no longer generates an address, we can remove the virDomainDefMaybeAddHostdevSCSIcontroller since the call to virDomainHostdevAssignAddress will attempt to add the controllers that were not already defined in the XML. This patch will also enforce that the address type is type 'drive' when a SCSI subsystem <hostdev> element is provided with an <address>. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	0785966d03	conf: Try controller add when searching hostdev bus for unit If virDomainControllerSCSINextUnit failed to find a slot on the current VIR_DOMAIN_CONTROLLER_TYPE_SCSI controller(s), try to add a new controller; otherwise, there may be multiple unit=0 entries for the same "next" controller. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	83f2b62c1f	conf: Add check for host address type while checking in use While searching the hostdevs the drive type can be either _TYPE_DRIVE or _TYPE_NONE. If the type is _TYPE_NONE on the first scsi_host, then there is an erroneous "match" that the address already exists. Although this works by chance currently because hostdev's are added one at a time and 'nhostdevs' would be zero, thus returning false for the first hostdev added, a future patch will move the hostdev address assignment into post processing resulting in the bad match. This code is only called by path's expecting either drive or none. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	ca2cf74e87	conf: Add xmlopt to virDomainDeviceDefPostParseInternal Add the xmlopt parameter that was saved during virDomainDefPostParse to the parameters. A future patch will use it. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	0a41871562	conf: Move hostdev and disk address validations Move the functions above the post processing for upcoming patch Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	b79c52d8c2	conf: Add 'bus' and 'target' to SCSI address conflict checks Modify virDomainDriveAddressIsUsedBy{Disk\|Hostdev} and virDomainSCSIDriveAddressIsUsed to take 'bus' and 'target' parameters. Will be used by future patches for more complete address conflict checks Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
John Ferlan	8b97ba2952	conf: Remove extraneous check in virDomainHostdevAssignAddress Since the only way virDomainHostdevAssignAddress can be called is from within virDomainHostdevDefParseXML when hostdev->source.subsys.type is VIR_DOMAIN_HOSTDEV_SUBSYS_TYPE_SCSI, thus there's no need for redundancy. Signed-off-by: John Ferlan <jferlan@redhat.com>	2015-08-03 16:48:45 -04:00
Andrea Bolognani	88c4c32af1	nodeinfo: Fix build failure when KVM headers are not available Compiler error: ../../src/nodeinfo.c: In function 'nodeGetThreadsPerSubcore': ../../src/nodeinfo.c:2393: error: label 'out' defined but not used [-Wunused-label] ../../src/nodeinfo.c:2352: error: unused parameter 'arch' [-Wunused-parameter]	2015-08-03 17:14:16 +02:00
Martin Kletzander	c43c661fe4	qemu: Remove double unlock for domains The virDomainObjListRemove() function unlocks a domain that it's given due to legacy code. And because of that code, which should be refactored, that last virObjectUnlock() cannot be just removed. So instead, lock it right back for qemu for now. All calls to qemuDomainRemoveInactive() are followed by code that unlocks the domain again, plus the domain should be locked during qemuDomainObjEndJob(), so the right place to lock it is right after virDomainObjListRemove(). The only place where this would cause a problem is the autodestroy callback, so we need to get another reference there and uref+unlock it afterwards. Luckily, returning NULL from that function doesn't mean an error, and only means that it doesn't need to be unlocked anymore. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-03 16:59:20 +02:00
Shivaprasad G Bhat	014208c4d0	nodeinfo: Fix output on PPC64 KVM hosts The nodeinfo is reporting incorrect number of cpus and incorrect host topology on PPC64 KVM hosts. The KVM hypervisor on PPC64 needs only the primary thread in a core to be online, and the secondaries offlined. While scheduling a guest in, the kvm scheduler wakes up the secondaries to run in guest context. The host scheduling of the guests happen at the core level(as only primary thread is online). The kvm scheduler exploits as many threads of the core as needed by guest. Further, starting POWER8, the processor allows splitting a physical core into multiple subcores with 2 or 4 threads each. Again, only the primary thread in a subcore is online in the host. The KVM-PPC scheduler allows guests to exploit all the offline threads in the subcore, by bringing them online when needed. (Kernel patches on split-core http://www.spinics.net/lists/kvm-ppc/msg09121.html) Recently with dynamic micro-threading changes in ppc-kvm, makes sure to utilize all the offline cpus across guests, and across guests with different cpu topologies. (https://www.mail-archive.com/kvm@vger.kernel.org/msg115978.html) Since the offline cpus are brought online in the guest context, it is safe to count them as online. Nodeinfo today discounts these offline cpus from cpu count/topology calclulation, and the nodeinfo output is not of any help and the host appears overcommited when it is actually not. The patch carefully counts those offline threads whose primary threads are online. The host topology displayed by the nodeinfo is also fixed when the host is in valid kvm state. Signed-off-by: Shivaprasad G Bhat <sbhat@linux.vnet.ibm.com> Signed-off-by: Andrea Bolognani <abologna@redhat.com>	2015-08-03 08:38:46 -04:00
Ossi Herrala	d9c9e138f2	rpc: Fix slow volume download (virsh vol-download) Use I/O vector (iovec) instead of one huge memory buffer as suggested in https://bugzilla.redhat.com/show_bug.cgi?id=1026137#c7. This avoids doing memmove() to big buffers and performance doesn't degrade if source (virNetClientStreamQueuePacket()) is faster than sink (virNetClientStreamRecvPacket()). Resolves: http://bugzilla.redhat.com/1026137 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-08-03 13:08:00 +02:00
Cao jin	e7fef6d00e	There is no virDomainFindBy{ID, Name, UUID} anymore s/virDomainFindBy/virDomainObjListFindBy/ Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-08-03 13:08:00 +02:00
Luyao Huang	1439eb32af	qemu: fix some api cannot work when disable cpuset in conf If cpuset is disabled or not available, it libvirt must not use it. Mainly for actions that do not need it and can use sched_setaffinity() or numa_membind() instead, because they will fail without good reason. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1244664 Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-08-03 13:08:00 +02:00
Jiri Denemark	e8d0166e1d	qemu: Do not reset labels when migration fails When stopping a domain on the destination host after a failed migration, we need to avoid reseting security labels since the domain is still running on the source host. While we were correctly doing so in some cases, there were still some paths which did this wrong. https://bugzilla.redhat.com/show_bug.cgi?id=1242904 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-31 15:15:12 +02:00
Jiri Denemark	40a6dd9c16	qemu: Properly check for incoming migration job In addition to checking the current asynchronous job qemuMigrationJobIsActive reports an error if the current job does not match the one we asked for. Let's just check the job directly since we are not interested in the error in qemuProcessHandleMonitorEOF. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2015-07-31 15:15:12 +02:00
Peter Krempa	136f3de411	qemu: Reject migration with memory-hotplug if destination doesn't support it If destination libvirt doesn't support memory hotplug since all the support was introduced by adding new elements the destination would attempt to start qemu with an invalid configuration. The worse part is that qemu might hang in such situation. Fix this by sending a required migration feature called 'memory-hotplug' to the destination. If the destination doesn't recognize it it will fail the migration. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1248350	2015-07-30 16:44:02 +02:00
Cédric Bosdonnat	e5e8406e9c	Fix syntax-check: missing "%s"	2015-07-30 11:10:48 +02:00
Cédric Bosdonnat	0aedcbd37c	Load nbd module before running qemu-nbd So far qemu-nbd is run even if the nbd kernel module isn't loaded. This leads to errors when the user starts his lxc container while libvirt could easily load the nbd module automatically.	2015-07-30 09:55:37 +02:00
Erik Skultety	b2960501c7	qemu: Adjust VM id allocation Our atomic increment (virAtomicIntInc) uses (if available) gcc __sync_add_and_fetch builtin. In qemu driver though, we'd profit more from __sync_fetch_and_add builtin. To keep it simplistic, this patch adjusts qemu driver initialization rather than adding a new atomic increment macro.	2015-07-29 09:15:44 +02:00
Peter Krempa	dbb0baa5a7	lxc: Don't accidentaly reset autostart flag in virLXCProcessCleanup virDomainDeleteConfig is meant to delete the persistent config and thus it resets vm->autostart. Copy parts of qemuProcessRemoveDomainStatus to a new helper to avoid using the incorrect function. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1230071	2015-07-28 18:55:39 +02:00
Daniel P. Berrange	afe69e6582	remote: fix typo in remoteDomainOpenGraphicsFD The remoteDomainOpenGraphicsFD method was using the wrong RPC arg struct remote_domain_open_graphics_args instead of remote_domain_open_graphics_fd_args. Fortunately both structs had identical contents so there was no functional bug, but to avoid confusing future maintainers, we should fix it. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2015-07-27 12:53:36 +01:00
Martin Kletzander	7868f01783	admin: Tiny cleanups First hunk changes the use of srcdir to top_srcdir so it complies with other rules in the Makefile. Second one removes the need of remote_protocol.h in admin_protocol.h as it was suggested and worked in, but this one line was missed apparently. Last one just removes the 'remote' naming from admin protocol specification, just so it's cleaner. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-27 09:33:24 +02:00
Martin Kletzander	ba167186cf	qemu: Check for iotune_max support properly Commit `d506a51aeb` meant to check for QEMU_CAPS_DRIVE_IOTUNE_MAX, but checked for QEMU_CAPS_DRIVE_IOTUNE instead. That's clearly visible from the diff, but it got in. Because of that, we were supplying information unknown for QEMU if it wasn't new enough and we couldn't even properly handle the error, leading to "Unexpected error". Also iops_size came at the same time with all the other "_max" options, so check whether we're not setting that either if QEMU_CAPS_DRIVE_IOTUNE_MAX is not supported. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1224053 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-27 08:29:37 +02:00
Laine Stump	e143107240	conf: add virDomainControllerDefNew() There are some non-0 default values in virDomainControllerDef (and will soon be more) that are easier to not forget if the remembering is done by a single initializer function (rather than inline code after allocating the obejct with generic VIR_ALLOC().	2015-07-25 10:10:31 -04:00
Laine Stump	0726878297	qemu: reorganize loop in qemuDomainAssignPCIAddresses This loop occurs just after we've assured that all devices that require a PCI device have been assigned and all necessary PCI controllers have been added. It is the perfect place to add other potentially auto-generated PCI controller attributes that are dependent on the controller's PCI address (upcoming patch). There is a convenient loop through all controllers at the end of the function, but the patch to add new functionality will be cleaner if we first rearrange that loop a bit. Note that the loop originally was accessing info.addr.pci.bus prior to determining that the pci part of the object was valid. This isn't dangerous in any way, but seemed a bit ugly, so I fixed it.	2015-07-25 10:10:22 -04:00
Laine Stump	d4cf72af17	conf: pay attention to bus minSlot/maxSlot when autoassigning PCI addresses The function that auto-assigns PCI addresses was written with the hardcoded assumptions that any PCI bus would have slots available starting at 1 and ending at 31. This isn't true for many types of controllers (some have a single slot/port at 0, some have slots/ports from 0 to 31). This patch updates that function to remove the hardcoded assumptions. It will properly find/assign addresses for devices that can only connect to pcie-(root\|downstream)-port (which have minSlot/maxSlot of 0/0) or a pcie-switch-upstream-port (0/31). It still will not auto-create a new bus of the proper kind for these connections when one doesn't exist, that task is for another day.	2015-07-25 10:08:03 -04:00
Chris J Arges	c6eea54008	storage: allow zero capacity with non-backing file to be created In commit `155ca616e`, a change was introduced that no longer allowed defining volumes via XML with a capacity of '0'. Because we check for info.size_arg to be non-zero, this use-case fails. This patch allows info.size_arg to be zero if no backing store is specified. Signed-off-by: Chris J Arges <chris.j.arges@canonical.com>	2015-07-24 11:22:20 -04:00
John Ferlan	136f17efd1	nodeinfo: Check for SYSFS_INFINIBAND_DIR before open Commit id 'ac3ed2085' causes 'virsh nodedev-list --cap net' to fail on any system without SYSFS_INFINIBAND_DIR (/sys/class/infiniband). Rather than assume it's there and fail on the attempt to open the non-existent directory, check if it's there - if not, return success and move on. Also fix caller to check < 0 upon return. As reported by Suren Hajyan <shajyan@redhat.com> from run of unit tests	2015-07-24 09:41:06 -04:00
Cao jin	c1c5eb6fad	fix typo in qemu_monitor Signed-off-by: Cao jin <caoj.fnst@cn.fujitsu.com>	2015-07-24 14:29:34 +02:00
Martin Kletzander	a5bdb8459a	Revert "qemu: Use heads parameter for QXL driver" This reverts commit `7b401c3bda`. Until libvirt is able to differentiate whether heads='1' is just a leftover from previous libvirt or whether that's added by user on purpose and also whether the domain was started with the support for qxl's max_outputs, we cannot incorporate this patch into the tree due to compatibility reasons. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-24 13:06:47 +02:00
Luyao Huang	b70b5ff41a	test: introduce a function in test driver to check get vcpupin info As there is a regression in use vcpupin get info, introduce a new function to test the virsh client. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-24 06:49:54 -04:00
Laine Stump	03b6bdcab3	conf: reorganize virNetworkDHCPDefParseXML This makes the range and static host array management in virNetworkDHCPDefParseXML() more similar to what is done in virNetworkDefUpdateIPDHCPRange() and virNetworkDefUpdateIPDHCPHost() - they use VIR_APPEND_ELEMENT rather than a combination of VIR_REALLOC_N() and separate incrementing of the array size. The one functional change here is that a memory leak of the contents of the last (unsuccessful) virNetworkDHCPHostDef was previously leaked in certain failure conditions, but it is now properly cleaned up.	2015-07-23 16:38:08 -04:00
Andrea Bolognani	f86c45ca0c	nodeinfo: Check for errors when reading core_id	2015-07-23 12:01:19 +02:00
Roman Bogorodskiy	6cb9ef1bab	bhyve: add UTC clock support Bhyve as of r279225 (FreeBSD -CURRENT) or r284894 (FreeBSD 10-STABLE) supports using UTC time offset via the '-u' argument to bhyve(8). By default it's still using localtime. Make the bhyve driver use UTC clock if it's requested by specifying <clock offset='utc'> in domain XML and if the bhyve(8) binary supports the '-u' flag.	2015-07-22 19:05:09 +03:00
Roman Bogorodskiy	830344d6e7	netdev: fix build on FreeBSD Commit `ac3ed20` breaks build on FreeBSD with: CC util/libvirt_util_la-virnetdev.lo util/virnetdev.c:2967:1: error: unused function 'virNetDevRDMAFeature' [-Werror,-Wunused-function] virNetDevRDMAFeature(const char *ifname, ^ So hide virNetDevRDMAFeature function under the #ifdef 'SIOCETHTOOL' and 'HAVE_STRUCT_IFREQ' section. Pushed under the build breaker rule.	2015-07-22 18:37:00 +03:00
Luyao Huang	704cf06a14	qemu: fix the error cover issue in SetMemoryParameters https://bugzilla.redhat.com/show_bug.cgi?id=1245476 We won't return the errno after commit `0d7f45ae`, and the more clearly error will be set in the code in vircgroup*. Also We will always report error "Operation not permitted", because the return is -1. Signed-off-by: Luyao Huang <lhuang@redhat.com>	2015-07-22 11:02:17 +02:00
Andrea Bolognani	6395ec1cf0	nodeinfo: Calculate present and online CPUs only once Move the calls to the respective functions from virNodeParseNode(), which is executed once for every NUMA node, to linuxNodeInfoCPUPopulate(), which is executed just once per host.	2015-07-22 10:50:53 +02:00
Andrea Bolognani	05be606282	nodeinfo: Use a bitmap to keep track of node CPUs Keep track of what CPUs belong to the current node while walking through the sysfs node entry, so we don't need to do it a second time immediately afterwards. This also allows us to loop through all CPUs that are part of a node in guaranteed ascending order, which is something that is required for some upcoming changes.	2015-07-22 10:37:25 +02:00
Andrea Bolognani	b909e9fb2c	nodeinfo: Use nodeGetOnlineCPUBitmap() when parsing node No need to look up the online status of each CPU separately when we can get all the information in one go.	2015-07-22 10:37:20 +02:00
Andrea Bolognani	b7b506475c	nodeinfo: Phase out cpu_set_t usage Swap out all instances of cpu_set_t and replace them with virBitmap, which some of the code was already using anyway. The changes are pretty mechanical, with one notable exception: an assumption has been added on the max value we can run into while reading either socket_it or core_id. While this specific assumption was not in place before, we were using cpu_set_t improperly by not making sure not to set any bit past CPU_SETSIZE or explicitly allocating bigger bitmaps; in fact the default size of a cpu_set_t, 1024, is way too low to run our testsuite, which includes core_id values in the 2000s.	2015-07-22 10:14:02 +02:00
Andrea Bolognani	c1df42d734	nodeinfo: Rename nodeGetCPUBitmap() to nodeGetOnlineCPUBitmap() The new name makes it clear that the returned bitmap contains the information about which CPUs are online, not eg. which CPUs are present. No behavioral change.	2015-07-22 10:14:02 +02:00
Andrea Bolognani	ccd0ea7ef5	nodeinfo: Remove out parameter from nodeGetCPUBitmap() Not all users of this API will need the size of the returned bitmap; those who do can simply call virBitmapSize() themselves.	2015-07-22 10:14:01 +02:00
Andrea Bolognani	37f73e4ad5	nodeinfo: Add old kernel compatibility to nodeGetPresentCPUBitmap() If the cpu/present file is not available, we assume that the kernel is too old to support non-consecutive CPU ids and return a bitmap with all the bits set to represent this fact. This assumption is already exploited in nodeGetCPUCount(). This means users of this API can expect the information to always be available unless an error has occurred, and no longer need to treat the NULL return value as a special case. The error message has been updated as well.	2015-07-22 10:14:01 +02:00
Andrea Bolognani	a2e2add1f1	nodeinfo: Rename linuxParseCPUmax() to linuxParseCPUCount() The original name was confusing because the function returns the number of CPUs, not the maximum CPU id. The comment above the function has been updated to reflect this. No behavioral changes.	2015-07-22 10:14:01 +02:00
Andrea Bolognani	6fecc4017d	nodeinfo: Introduce linuxGetCPUOnlinePath()	2015-07-22 10:14:01 +02:00
Andrea Bolognani	bd87f07c25	nodeinfo: Introduce linuxGetCPUGlobalPath() This is just a more generic version of linuxGetCPUPresentPath(), which is now implemented by calling the new function appropriately.	2015-07-22 10:14:01 +02:00
Andrea Bolognani	2a6801892a	nodeinfo: Fix nodeGetCPUBitmap()'s fallback code path During the recent refactoring/cleanups, a bug has been introduced that caused all CPUs to be reported as online unless the sysfs cpu/present file was available. This commit fixes the fallback code path by building the directory path passed to virNodeGetCpuValue() correctly.	2015-07-22 09:57:57 +02:00
Andrea Bolognani	c30ae1864f	nodeinfo: Add nodeGetPresentCPUBitmap() to libvirt_private.syms	2015-07-22 09:57:57 +02:00
Peter Krempa	88f6c007c3	cgroup: Drop resource partition from virSystemdMakeScopeName The scope name, even according to our docs is "machine-$DRIVER\x2d$VMNAME.scope" virSystemdMakeScopeName would use the resource partition name instead of "machine-" if it was specified thus creating invalid scope paths. This makes libvirt drop cgroups for a VM that uses custom resource partition upon reconnecting since the detected scope name would not match the expected name generated by virSystemdMakeScopeName. The error is exposed by the following log entry: debug : virCgroupValidateMachineGroup:302 : Name 'machine-qemu\x2dtestvm.scope' for controller 'cpu' does not match 'testvm', 'testvm.libvirt-qemu' or 'machine-test-qemu\x2dtestvm.scope' for a "/machine/test" resource and "testvm" vm. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1238570	2015-07-22 07:12:56 +02:00
Peter Krempa	eae59247c5	qemu: Update state of block job to READY only if it actually is ready Few parts of the code looked at the current progress of and assumed that a two phase blockjob is in the _READY state as soon as the progress reached 100% (info.cur == info.end). In current versions of qemu this assumption is invalid and qemu exposes a new flag 'ready' in the query-block-jobs output that is set to true if the job is actually finished. This patch adds internal data handling for reading the 'ready' flag and acting appropriately as long as the flag is present. While this still doesn't fix the virsh client problem with two phase block jobs and the --pivot option, it at least improves the error message: $ virsh blockcommit --wait --verbose vm vda --base vda[1] --active --pivot Block commit: [100 %]error: failed to pivot job for disk vda error: internal error: unable to execute QEMU command 'block-job-complete': The active block job for device 'drive-virtio-disk0' cannot be completed to $ virsh blockcommit --wait --verbose VM vda --base vda[1] --active --pivot Block commit: [100 %]error: failed to pivot job for disk vda error: block copy still active: disk 'vda' not ready for pivot yet	2015-07-21 15:32:59 +02:00
Moshe Levi	ac3ed2085f	nodedev: add RDMA and tx-udp_tnl-segmentation NIC capabilities Adding functionality to libvirt that will allow it query the interface for the availability of RDMA and tx-udp_tnl-segmentation Offloading NIC capabilities Here is an example of the feature XML definition: <device> <name>net_eth4_90_e2_ba_5e_a5_45</name> <path>/sys/devices/pci0000:00/0000:00:03.0/0000:08:00.1/net/eth4</path> <parent>pci_0000_08_00_1</parent> <capability type='net'> <interface>eth4</interface> <address>90:e2:ba:5e:a5:45</address> <link speed='10000' state='up'/> <feature name='rx'/> <feature name='tx'/> <feature name='sg'/> <feature name='tso'/> <feature name='gso'/> <feature name='gro'/> <feature name='rxvlan'/> <feature name='txvlan'/> <feature name='rxhash'/> <feature name='rdma'/> <feature name='txudptnl'/> <capability type='80203'/> </capability> </device>	2015-07-21 07:08:35 -04:00
Roman Bogorodskiy	e46791e003	nodeinfo: fix build on FreeBSD Currently, build fails on FreeBSD with: CC libvirt_driver_la-nodeinfo.lo nodeinfo.c:1941:56: error: use of undeclared identifier 'SYSFS_SYSTEM_PATH' const char *prefix = sysfs_prefix ? sysfs_prefix : SYSFS_SYSTEM_PATH; ^ 1 error generated. This is caused by commit `b97b3048` that added sysfs_prefix to nodeCapsInitNUMA and used SYSFS_CPU_PATH. Fix it by unconditionally defining SYSFS_CPU_PATH instead of defining it under #ifdef __linux__.	2015-07-20 14:01:49 +03:00
Martin Kletzander	717c99f360	qemu: Reject updating unsupported disk information If one calls update-device with information that is not updatable, libvirt reports success even though no data were updated. The example used in the bug linked below uses updating device with <boot order='2'/> which, in my opinion, is a valid thing to request from user's perspective. Mainly since we properly error out if user wants to update such data on a network device for example. And since there are many things that might happen (update-device on disk basically knows just how to change removable media), check for what's changing and moreover, since the function might be usable in other drivers (updating only disk path is a valid possibility) let's abstract it for any two disks. We can't possibly check for everything since for many fields our code does not properly differentiate between default and unspecified values. Even though this could be changed, I don't feel like it's worth the complexity so it's not the aim of this patch. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1007228	2015-07-20 11:35:54 +02:00
Martin Kletzander	0aa81bbdc3	Escape left brace as new perl suggests After upgrade to perl-5.22.0, it started complaining about one of our scripts. The thing is that even though it works, it wants all curly brackets escaped properly. The change is not functional, it merely gets rid of the following error: Unescaped left brace in regex is deprecated, passed through in regex; marked by <-- HERE in m/^enum { <-- HERE / at -e line 3. There is one more error like this that I'm getting, but it is because of GNU automake bug #21001: https://debbugs.gnu.org/cgi/bugreport.cgi?bug=21001 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-20 10:35:18 +02:00
Frediano Ziglio	7b401c3bda	qemu: Use heads parameter for QXL driver Allows to specify maximum number of head to QXL driver. Actually can be a compatiblity problem as heads in the XML configuration was set by default to '1'. Signed-off-by: Frediano Ziglio <fziglio@redhat.com>	2015-07-20 10:35:18 +02:00
Christophe Fergeau	60d5ed8c52	storage: Fix pool building when directory already exists Currently, when trying to virsh pool-define/virsh pool-build a new 'dir' pool, if the target directory already exists, virsh pool-build/virStoragePoolBuild will error out. This is a change of behaviour compared to eg libvirt 1.2.13 This is caused by the wrong type being used for the dir_create_flags variable in virStorageBackendFileSystemBuild , it's defined as a bool but is used as a flag bit field so should be unsigned int (this matches the type virDirCreate expects for this variable). This should fix https://bugzilla.gnome.org/show_bug.cgi?id=752417 (GNOME Boxes) and https://bugzilla.redhat.com/show_bug.cgi?id=1244080 (downstream virt-manager).	2015-07-17 15:24:18 +02:00
Daniel P. Berrange	406ee8c226	rpc: ensure daemon is spawn even if dead socket exists The auto-spawn code would originally attempt to spawn the daemon for both ENOENT and ECONNREFUSED errors from connect(). The various refactorings eventually lost this so we only spawn the daemon on ENOENT. The result is if the daemon exits uncleanly, so that the socket is left in the filesystem, we will never be able to auto-spawn the daemon again.	2015-07-17 12:46:43 +01:00
Michal Privoznik	54012746ae	viraccessperm.h: Fix some typos Like s/authoriation/authorization/ and s/requries/requires/ Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-17 09:41:31 +02:00
John Ferlan	279238fea3	rbd: Return error from rbd_create for message processing Resolving an error reporting bug introduced by commit id '761491e' which just took the return of virStorageBackendRBDCreateImage and used it as the basis for the message generated. This would generate EPERM regardless of error seen.	2015-07-16 12:31:25 -04:00
Wido den Hollander	045cac32fd	rbd: Use RBD format 2 by default when creating images. We used to look at the librbd code version and depending on that we would invoke rbd_create3() or rbd_create(). Since librbd version 0.67.9 we can however tell RBD that it should create rbd format 2 images even if we invoke rbd_create(). The less options we pass to librbd, the more we can lean on the sane defaults it uses. For rbd_create3() we had things like the stripe count and unit hardcoded in libvirt and that might cause problems down the road. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2015-07-16 12:31:20 -04:00
Boris Fiuczynski	d01b7c7854	qemu: Make virtio-9p-ccw the default for s390-ccw-virtio machines For s390-ccw-virtio machines the default bus type is set to ccw. Specifing an address element allows to override the default. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Jason J. Herne <jjherne@us.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com>	2015-07-15 14:37:30 +02:00
Boris Fiuczynski	56f6de93b5	qemu: Support for virtio-9p-ccw Adding the recently in qemu added 9pfs support for virtio-ccw. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Jason J. Herne <jjherne@us.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com>	2015-07-15 14:37:30 +02:00
Michal Privoznik	cd043390ff	qemuMigrationRun: Don't leak @fd If we are migrating to an UNIX socket, we accept() a connection from qemu and use that FD to set up a tunnel. However, the FD is not closed as often as it should be. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-15 11:40:41 +02:00
Cédric Bosdonnat	4749fec10d	lxc: wait for nbd device to be up to get its PIDs The nbd device pid file doesn't appear immediately after starting qemu-nbd: adding a small loop to wait for it before getting it's processes PIDs.	2015-07-15 10:16:15 +02:00
Cédric Bosdonnat	8dd8df6f7c	Fix qemu-nbd cleanup crashes The virLXCControllerAppendNBDPids function didn't properly initialize pids and npids. In case of failure it was crashing when freeing those.	2015-07-15 10:16:14 +02:00
Andrea Bolognani	aa6c3fee86	nodeinfo: Formatting changes	2015-07-14 17:11:36 -04:00
Andrea Bolognani	75f6f54546	nodeinfo: Make sysfs_prefix usage more consistent Make sure sysfs_prefix, when present, is always the first argument to a function; don't use a different name to refer to it; check whether it is NULL, and hence SYSFS_SYSTEM_PATH should be used, only when using it directly and not just passing it down to another function; always pass down the same value we've been passed when calling another function.	2015-07-14 17:11:36 -04:00
Peter Krempa	c212e0c779	qemu: process: Improve update of maximum balloon state at startup In commit `641a145d73` I've added code that resets the balloon memory value to full size prior to resuming the vCPUs since the size certainly was not reduced at that point. Since qemuProcessStart is used also in code paths with already booted up guests (migration, save/restore) the assumption is not entirely true since the guest might already been running before. This patch adds a function that queries the monitor rather than using the full size since a balloon event would not be reissued in case we are recovering a saved migration state. Additionally the new function is used also when reconnecting to a VM after libvirtd restart since we might have missed a few balloon events while libvirtd was not running.	2015-07-14 14:47:57 +02:00
Michal Privoznik	1cf25f6334	qemuDomainSetNumaParamsLive: Check for NUMA mode more wisely https://bugzilla.redhat.com/show_bug.cgi?id=1232663 In one of my previous ptaches (`bcd9a564`) I've tried to fix the problem that we blindly assumed strict NUMA mode for guests. This led to several problems like us pinning a domain onto a nodeset via libnuma among with CGroups. Once the nodeset was changed by user, well, it did not result in desired effect. See the original commit for more info. But, the commit I wrote had a bug: when NUMA parameters are changed on a running domain we require domain to be strictly pinned onto a nodeset. Due to a typo a condition was mis-evaluated. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2015-07-14 10:29:19 +02:00
Martin Kletzander	0e3ad241f3	network: Add another collision check into networkCheckRouteCollision The comment above that function says: "This function can be a lot more exhaustive, ...", so let's be. Check for collisions between routes in the system and static routes being added explicitly from the <route/> element of the network XML. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1094205 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-14 09:56:44 +02:00
Martin Kletzander	0f10eb6a28	conf: Add getter for network routes Add virNetworkDefGetRouteByIndex() similarly to virNetworkDefGetIpByIndex(), but for routes. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2015-07-14 08:04:49 +02:00
Kothapally Madhu Pavan	bb31f4532b	nodeinfo: fix to parse present cpus rather than possible cpus This patch resolves a situation where a core is defective and is not in the present mask during boot. Optionally a host can have empty sockets could be brought online if the socket is added. In this case the present mask contains the cpu's that are actually there in the sockets even though they might be offline for some reason. This patch excludes the cpu's that are offline because the socket is defective/empty by checking the present mask before reading the cpu directory. Otherwise, the nodeinfo on such hosts always displays wrong output which includes the defective/empty sockets as set of offline cpu's. Signed-off-by: Kothapally Madhu Pavan <kmp@linux.vnet.ibm.com>	2015-07-13 16:07:44 -04:00
John Ferlan	c71f0654fc	nodeinfo: Add sysfs_prefix to nodeGetMemoryStats Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00
John Ferlan	b97b30480d	nodeinfo: Add sysfs_prefix to nodeCapsInitNUMA Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_CPU_PATH which is a derivative of SYSFS_SYSTEM_PATH Use cpupath for nodeCapsInitNUMAFake and remove SYSFS_CPU_PATH	2015-07-13 15:59:32 -04:00
John Ferlan	29e4f2243f	nodeinfo: Add sysfs_prefix to nodeGetInfo Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00
John Ferlan	f1c6179f0d	nodeinfo: Add sysfs_prefix to nodeGetCPUMap Add the sysfs_prefix argument to the call to allow for setting the path for tests to something other than SYSFS_SYSTEM_PATH.	2015-07-13 15:59:32 -04:00

1 2 3 4 5 ...

15121 Commits