libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2025-01-04 03:55:20 +00:00

Author	SHA1	Message	Date
Andrea Bolognani	c2e60ad0e5	qemu: Forbid <memoryBacking><locked> without <memtune><hard_limit> In order for memory locking to work, the hard limit on memory locking (and usage) has to be set appropriately by the user. The documentation mentions the requirement already: with this patch, it's going to be enforced by runtime checks as well, by forbidding a non-compliant guest from being defined as well as edited and started. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1316774	2017-02-07 18:43:10 +01:00
Michal Privoznik	7f0b382522	qemuDomainAttachDeviceMknod: Don't loop endlessly When working with symlinks it is fairly easy to get into a loop. Don't. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 13:20:19 +01:00
Michal Privoznik	3f5fcacf89	qemuDomainAttachDeviceMknod: Deal with symlinks Similarly to one of the previous commits, we need to deal properly with symlinks in hotplug case too. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 13:20:17 +01:00
Michal Privoznik	4ac847f93b	qemuDomainCreateDevice: Don't loop endlessly When working with symlinks it is fairly easy to get into a loop. Don't. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 13:18:32 +01:00
Michal Privoznik	54ed672214	qemuDomainCreateDevice: Properly deal with symlinks Imagine you have a disk with the following source set up: /dev/disk/by-uuid/$uuid (symlink to) -> /dev/sda After `cbc45525cb` the transitive end of the symlink chain is created (/dev/sda), but we need to create any item in chain too. Others might rely on that. In this case, /dev/disk/by-uuid/$uuid comes from domain XML thus it is this path that secdriver tries to relabel. Not the resolved one. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 13:18:10 +01:00
Michal Privoznik	b621291f5c	qemuDomain{Attach,Detach}Device NS helpers: Don't relabel devices After the previous commit this has become a redundant step. Also, setting up devices in the namespace and setting their label later on are two different steps and should not be done at once. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 10:40:53 +01:00
Michal Privoznik	0f0fcc2cd4	qemu_security: Use more transactions The idea is to move all the seclabel setting to security driver. Having the relabel code spread all over the place looks very messy. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 10:40:53 +01:00
Michal Privoznik	3e6839d4e8	qemuSecurityRestoreAllLabel: Don't use transactions Because of the nature of security driver transactions, it is impossible to use them properly. The thing is, transactions enter the domain namespace and commit all the seclabel changes. However, in RestoreAllLabel() this is impossible - the qemu process, the only process running in the namespace, is gone. And thus is the namespace. Therefore we shouldn't use the transactions as there is no namespace to enter. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 10:40:53 +01:00
Michal Privoznik	0a4652381f	qemuDomainPrepareDisk: Fix ordering The current ordering is as follows: 1) set label 2) create the device in namespace 3) allow device in the cgroup While this might work for now, it will definitely not work if the security driver would use transactions as in that case there would be no device to relabel in the domain namespace as the device is created in the second step. Swap steps 1) and 2) to allow security driver to use more transactions. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-02-07 10:40:53 +01:00
Nitesh Konkar	4f405ebd1d	qemu: Fix indentation in qemu_interface.h Signed-off-by: Nitesh Konkar <nitkon12@linux.vnet.ibm.com>	2017-02-01 09:27:48 +01:00
Martin Kletzander	bb5d6379a0	qemu: Don't lose group_name Now that we have a function for properly assigning the blockdeviotune info, let's use it instead of dropping the group name on every assignment. Otherwise it will not work with both --live and --config options. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2017-01-31 20:19:35 +01:00
Martin Kletzander	8336cbca21	qemu: Fix indentation in qemu_domain.h for RNG Namespaces Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2017-01-31 16:13:32 +01:00
Ján Tomko	3ac97c2ded	qemu: Add enough USB hubs to accommodate all devices Commit `815d98a` started auto-adding one hub if there are more USB devices than available USB ports. This was a strange choice, since there might be even more devices. Before USB address allocation was implemented in libvirt, QEMU automatically added a new USB hub if the old one was full. Adjust the logic to try adding as many hubs as will be needed to plug in all the specified devices. https://bugzilla.redhat.com/show_bug.cgi?id=1410188	2017-01-31 13:09:08 +01:00
Ján Tomko	de325472cc	qemu: assign USB addresses on redirdev hotplug too https://bugzilla.redhat.com/show_bug.cgi?id=1375410	2017-01-30 16:17:35 +01:00
Michal Privoznik	a5cae75a3e	qemuBuildChrChardevStr: Don't leak @charAlias ==12618== 110 bytes in 10 blocks are definitely lost in loss record 269 of 295 ==12618== at 0x4C2AE5F: malloc (vg_replace_malloc.c:297) ==12618== by 0x1CFC6DD7: vasprintf (vasprintf.c:73) ==12618== by 0x1912B2FC: virVasprintfInternal (virstring.c:551) ==12618== by 0x1912B411: virAsprintfInternal (virstring.c:572) ==12618== by 0x50B1FF: qemuAliasChardevFromDevAlias (qemu_alias.c:638) ==12618== by 0x518CCE: qemuBuildChrChardevStr (qemu_command.c:4973) ==12618== by 0x522DA0: qemuBuildShmemBackendChrStr (qemu_command.c:8674) ==12618== by 0x523209: qemuBuildShmemCommandLine (qemu_command.c:8789) ==12618== by 0x526135: qemuBuildCommandLine (qemu_command.c:9843) ==12618== by 0x48B4BA: qemuProcessCreatePretendCmd (qemu_process.c:5897) ==12618== by 0x4378C9: testCompareXMLToArgv (qemuxml2argvtest.c:498) ==12618== by 0x44D5A6: virTestRun (testutils.c:180) Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-30 10:38:03 +01:00
Martin Kletzander	b425245520	qemu: Add better message for some invalid block I/O settings For example when both total_bytes_sec and total_bytes_sec_max are set, but the former gets cleaned due to new call setting, let's say, read_bytes_sec, we end up with this weird message for the command: $ virsh blkdeviotune fedora vda --read-bytes-sec 3000 error: Unable to change block I/O throttle error: unsupported configuration: value 'total_bytes_sec_max' cannot be set if 'total_bytes_sec' is not set So let's make it more descriptive. This is how it looks after the change: $ virsh blkdeviotune fedora vda --read-bytes-sec 3000 error: Unable to change block I/O throttle error: unsupported configuration: cannot reset 'total_bytes_sec' when 'total_bytes_sec_max' is set Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1344897 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2017-01-29 19:57:13 +01:00
Martin Kletzander	87ee705183	qemu: Miscellaneous Block I/O tune cleanups Well, just two. One indentation and the usage of 'ret'. Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2017-01-29 19:53:52 +01:00
Martin Kletzander	e9d75343d4	qemu: Only set group_name when actually requested We were setting it based on whether it was supported and that led to setting it to NULL, which our JSON code caught. However, it ended up producing the following results: $ virsh blkdeviotune fedora vda --total-bytes-sec-max 2000 error: Unable to change block I/O throttle error: internal error: argument key 'group' must not have null value Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2017-01-29 19:46:51 +01:00
Michal Privoznik	572eda12ad	qemu: Implement mtu on interface Not only should we set the MTU on the host end of the device, but we should also let qemu know what MTU we set. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-26 10:00:01 +01:00
Michal Privoznik	b020cf73fe	domain_conf: Introduce <mtu/> to <interface/> So far we allow to set MTU for libvirt networks. However, not all domain interfaces have to be plugged into a libvirt network and even if they are, they might want to have a different MTU (e.g. for testing purposes). Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-26 09:59:56 +01:00
Chen Hanxiao	980f2a35c7	qemu_domain: add timestamp in tainting of guests log The guest tainting log lacked a timestamp, which made it hard to track down guest issues, such as whether a guest powerdown was caused by qemu-monitor-command or by other issues inside the guest. Having a timestamp in the tainting log helps when checking the guest's /var/log/messages. Signed-off-by: Chen Hanxiao <chenhanxiao@gmail.com>	2017-01-21 12:34:19 -05:00
Jiri Denemark	6cb204b7ac	qemu: Reset hostModelInfo in virQEMUCapsReset Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2017-01-20 15:52:56 +01:00
Michal Privoznik	57b5e27d3d	qemu: set default vhost-user ifname Based on work of Mehdi Abaakouk <sileht@sileht.net>. When parsing vhost-user interface XML and no ifname is found we can try to fill it in in post parse callback. The way this works is we try to make up interface name from given socket path and then ask openvswitch whether it knows the interface. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-20 15:42:12 +01:00
Peter Krempa	1d4fd2dd0f	qemu: hotplug: Properly emit "DEVICE_DELETED" event when unplugging memory The event needs to be emitted after the last monitor call, so that it's not possible to find the device in the XML accidentally while the vm object is unlocked. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1414393	2017-01-20 14:24:35 +01:00
Daniel P. Berrange	b9cc6316c0	qemu: catch failure of drive_add Previously when QEMU failed "drive_add" due to an error opening a file it would report "could not open disk image" These days though, QEMU reports "Could not open '/tmp/virtd-test_e3hnhh5/disk1.qcow2': Permission denied" which we were not detecting as an error condition. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2017-01-19 10:56:53 +00:00
Peter Krempa	9d14cf595a	qemu: Move cpu hotplug code into qemu_hotplug.c Move all the worker code into the appropriate file. This will also allow testing of cpu hotplug.	2017-01-18 09:57:06 +01:00
Peter Krempa	5570f26763	qemu: Prepare for reuse of qemuDomainSetVcpusLive Extract the call to qemuDomainSelectHotplugVcpuEntities outside of qemuDomainSetVcpusLive and decide whether to hotplug or unplug the entities specified by the cpumap using a boolean flag. This will allow to use qemuDomainSetVcpusLive in cases where we prepare the list of vcpus to enable or disable by other means.	2017-01-18 09:57:06 +01:00
Peter Krempa	5cd670fea8	qemu: monitor: More strict checking of 'query-cpus' if hotplug is supported In cases where CPU hotplug is supported by qemu force the monitor to reject invalid or broken responses to 'query-cpus'. It's expected that the command returns usable data in such case.	2017-01-18 09:57:06 +01:00
Jiri Denemark	f66b185c46	qemu: Don't leak hostCPUModelInfo in virQEMUCaps Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2017-01-17 14:36:52 +01:00
Michal Privoznik	d0baf54e53	qemu: Actually unshare() iff running as root https://bugzilla.redhat.com/show_bug.cgi?id=1413922 While all the code that deals with qemu namespaces correctly detects whether we are running as root (and turn into NO-OP for qemu:///session) the actual unshare() call is not guarded with such check. Therefore any attempt to start a domain under qemu:///session shall fail as unshare() is reserved for root. The fix consists of moving unshare() call (for which we have a wrapper called virProcessSetupPrivateMountNS) into qemuDomainBuildNamespace() where the proper check is performed. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Tested-by: Richard W.M. Jones <rjones@redhat.com>	2017-01-17 13:23:56 +01:00
Daniel P. Berrange	2d0c4947ab	Revert "perf: Add cache_l1d perf event support" This reverts commit `ae16c95f1b`.	2017-01-16 16:54:34 +00:00
Collin L. Walling	e8a43f1995	qemu-capabilities: Fix query-cpu-model-expansion on s390 with older kernel When running on s390 with a kernel that does not support cpu model checking and with a Qemu new enough to support query-cpu-model-expansion, the gathering of qemu capabilities will fail. Qemu responds to the query-cpu-model-expansion qmp command with an error because the needed kernel ioctl does not exist. When this happens a guest cannot even be defined due to missing qemu capabilities data. This patch fixes the problem by silently ignoring generic errors stemming from calls to query-cpu-model-expansion. Reported-by: Farhan Ali <alifm@linux.vnet.ibm.com> Signed-off-by: Collin L. Walling <walling@linux.vnet.ibm.com> Signed-off-by: Jason J. Herne <jjherne@linux.vnet.ibm.com>	2017-01-13 16:55:58 +01:00
Michal Privoznik	93a062c3b2	qemu: Copy SELinux labels for namespace too When creating new /dev/* for qemu, we do chown() and copy ACLs to create the exact copy from the original /dev. I thought that copying SELinux labels is not necessary as SELinux will choose sane defaults. Surprisingly, it does not, leaving the namespace with the following labels: crw-rw-rw-. root root system_u:object_r:tmpfs_t:s0 random crw-------. root root system_u:object_r:tmpfs_t:s0 rtc0 drwxrwxrwt. root root system_u:object_r:tmpfs_t:s0 shm crw-rw-rw-. root root system_u:object_r:tmpfs_t:s0 urandom As a result, domain is unable to start: error: internal error: process exited while connecting to monitor: Error in GnuTLS initialization: Failed to acquire random data. qemu-kvm: cannot initialize crypto: Unable to initialize GNUTLS library: Failed to acquire random data. The solution is to copy the SELinux labels as well. Reported-by: Andrea Bolognani <abologna@redhat.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-13 14:45:52 +01:00
Jiri Denemark	19e06cfa25	qemu: Ignore non-boolean CPU model properties The query-cpu-model-expansion is currently implemented for s390(x) only and all CPU properties it returns are booleans. However, x86 implementation will report more types of properties. Without making the code more tolerant older libvirt would fail to probe newer QEMU versions. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2017-01-12 11:58:25 +01:00
Jiri Denemark	ec23791517	qemu: Don't check CPU model property key The qemuMonitorJSONParseCPUModelProperty function is a callback for virJSONValueObjectForeachKeyValue and is called for each key/value pair, thus it doesn't really make sense to check whether key is NULL. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2017-01-12 11:58:25 +01:00
Michal Privoznik	cbc45525cb	qemuDomainCreateDevice: Canonicalize paths So far the decision whether /dev/* entry is created in the qemu namespace is really simple: does the path start with "/dev/"? This can be easily fooled by providing path like the following (for any considered device like disk, rng, chardev, ..): /dev/../var/lib/libvirt/images/disk.qcow2 Therefore, before making the decision the path should be canonicalized. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-11 18:08:13 +01:00
Michal Privoznik	49f326edc0	qemu: Use namespaces iff available on the host kernel So far the namespaces were turned on by default unconditionally. For all non-Linux platforms we provided stub functions that just ignored whatever namespaces setting there was in qemu.conf and returned 0 to indicate success. Moreover, we didn't really check if namespaces are available on the host kernel. This is suboptimal as we might have ignored user setting. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-11 18:07:43 +01:00
Michal Privoznik	41816751a7	util: Introduce virFileMoveMount This is a simple wrapper over mount(). However, not every system out there is capable of moving a mount point. Therefore, instead of having to deal with this fact in all the places of our code we can have a simple wrapper and deal with this fact at just one place. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-11 18:06:30 +01:00
Michal Privoznik	2ff8c30548	qemuDomainSetupAllInputs: Update debug message Due to a copy-paste error, the debug message reads: Setting up disks It should have been: Setting up inputs. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-11 17:39:24 +01:00
Laine Stump	5949b53aec	conf: eliminate virDomainPCIAddressReleaseSlot() in favor of ...Addr() Surprisingly there was a virDomainPCIAddressReleaseAddr() function already, but it was completely unused. Since we don't reserve entire slots at once any more, there is no need to release entire slots either, so we just replace the single call to virDomainPCIAddressReleaseSlot() with a call to virDomainPCIAddressReleaseAddr() and remove the now unused function. The keen observer may be concerned that ...Addr() doesn't call virDomainPCIAddressValidate(), as ...Slot() did. But really the validation was pointless anyway - if the device hadn't been suitable to be connected at that address, it would have failed validation before ever being reserved in the first place, so by definition it will pass validation when it is being unplugged. (And anyway, even if something "bad" happened and we managed to have a device incorrectly at the given address, we would still want to be able to free it up for use by a device that did validate properly).	2017-01-11 05:00:34 -05:00
Laine Stump	6cc2014202	qemu: rename qemuDomainPCIAddressReserveNextSlot() to ...Addr() This function doesn't actually reserve an entire slot any more, it reserves a single PCI address, so this name is more appropriate.	2017-01-11 05:00:08 -05:00
Laine Stump	c5aea19d56	qemu: remove qemuDomainPCIAddressReserveNextAddr() This function is only called in two places, and the function itself is just adding a single argument and calling virDomainPCIAddressReserveNextAddr(), so we can remove it and instead call virDomainPCIAddressReserveNextAddr() directly. (The main motivation for doing this is to free up the name so that qemuDomainPCIAddressReserveNextSlot() can be renamed in the next patch, as its current name is now inaccurate and misleading).	2017-01-11 04:59:42 -05:00
Laine Stump	27b0f971c4	conf: rename virDomainPCIAddressReserveSlot() to ...Addr() This function doesn't actually reserve an entire slot any more, it reserves a single PCI address, so this name is more appropriate.	2017-01-11 04:58:32 -05:00
Laine Stump	905859a6e5	qemu: replace virDomainPCIAddressReserveAddr with virDomainPCIAddressReserveSlot All occurrences of the former use fromConfig=true, and that's exactly how virDomainPCIAddressReserveSlot() calls virDomainPCIAddressReserveAddr(), so just use Slot() so that Addr() can be made static to conf/domain_addr.c (both functions will be renamed in upcoming patches).	2017-01-11 04:55:06 -05:00
Laine Stump	b59bbdba4b	conf: fix fromConfig argument to virDomainPCIAddressValidate() fromConfig should be true if the caller wants virDomainPCIAddressValidate() to loosen restrictions on its interpretation of the pciConnectFlags. In particular, either PCI_DEVICE or PCIE_DEVICE will be counted as equivalent to both, and HOTPLUG will be ignored. In a few cases where libvirt was manually overriding automatic address assignment, it was setting fromConfig to false when validating the hardcoded manual override. This patch changes those to fromConfig=true as a preemptive strike against any future bugs that might otherwise surface.	2017-01-11 04:51:54 -05:00
Laine Stump	79901543b9	conf: fix fromConfig argument to virDomainPCIAddressReserveAddr() Although setting virDomainPCIAddressReserveAddr()'s fromConfig=true is correct when a PCI address is coming from a domain's config, the true purpose of the fromConfig argument is to lower restrictions on what kind of device can plug into what kind of controller - if fromConfig is true, then a PCIE_DEVICE can plug into a slot that is marked as only compatible with PCI_DEVICE (and vice versa), and the HOTPLUG flag is ignored. For a long time there have been several calls to virDomainPCIAddressReserveAddr() that have fromConfig incorrectly set to false - it's correct that the addresses aren't coming from user config, but they are coming from hardcoded exceptions in libvirt that should, if anything, pay even less attention to following the pciConnectFlags (under the assumption that the libvirt programmer knew what they were doing). See commit `b87703cf7` for an example of an actual bug caused by the incorrect setting of the "fromConfig" argument to virDomainPCIAddressReserveAddr(). Although they haven't resulted in any reported bugs, this patch corrects all the other incorrect settings of fromConfig in calls to virDomainPCIAddressReserveAddr().	2017-01-11 04:47:12 -05:00
Laine Stump	48d39cf96d	conf: aggregate multiple devices on a slot when assigning PCI addresses If a PCI device has VIR_PCI_CONNECT_AGGREGATE_SLOT set in its pciConnectFlags, then during address assignment we allow multiple instances of this type of device to be auto-assigned to multiple functions on the same device. A slot is used for aggregating multiple devices only if the first device assigned to that slot had VIR_PCI_CONNECT_AGGREGATE_SLOT set, but any device types that have AGGREGATE_SLOT set might be mix/matched on the same slot. (NB: libvirt should never set the AGGREGATE_SLOT flag for a device type that might need to be hotplugged. Currently it is only planned for pcie-root-port and possibly other PCI controller types, and none of those are hotpluggable anyway) There aren't yet any devices that use this flag. That will be in a later patch.	2017-01-11 04:43:22 -05:00
Laine Stump	8f4008713a	qemu: use virDomainPCIAddressSetAllMulti() to set multi when needed If there are multiple devices assigned to the different functions of a single PCI slot, they will not work properly if the device at function 0 doesn't have its "multi" attribute turned on, so it makes sense for libvirt to turn it on during PCI address assignment. Setting multi then assures that the new setting is stored in the config (so it will be used next time the domain is started), preventing any potential problems in the case that a future change in the configuration eliminates the devices on all non-0 functions (multi will still be set for function 0 even though it is the only function in use on the slot, which has no useful purpose, but also doesn't cause any problems). (NB: If we were to instead just decide on the setting for multifunction at runtime, a later removal of the non-0 functions of a slot would result in a silent change in the guest ABI for the remaining device on function 0 (although it may seem like an inconsequential guest ABI change, it is a guest ABI change to turn off the multi bit).)	2017-01-11 04:42:08 -05:00
Laine Stump	9ff9d9f5a9	conf: eliminate concept of "reserveEntireSlot" setting reserveEntireSlot really accomplishes nothing - instead of going to the trouble of computing the value for reserveEntireSlot and then possibly setting all functions of the slot as in-use, we can just set the in-use bit only for the specific function being used by a device. Later we will know from the context (the PCI connect flags, and whether we are reserving a specific address or asking for "the next available") whether or not it is okay to allocate other functions on the same slot. Although it's not used yet, we allow specifying "-1" for the function number when looking for the "next available slot" - this is going to end up meaning "return the lowest available function in the slot", but since we currently only provide a function from an otherwise unused slot, "-1" ends up meaning "0".	2017-01-11 04:36:34 -05:00
Laine Stump	9838cad9cd	conf: use struct instead of int for each slot in virDomainPCIAddressBus When keeping track of which functions of which slots are allocated, we will need to have more information than just the current bitmap with a bit for each function that is currently stored for each slot in a virDomainPCIAddressBus. To prepare for adding more per-slot info, this patch changes "uint8_t slots" into "virDomainPCIAddressSlot slot", which currently has a single member named "functions" that serves the same purpose previously served directly by "slots".	2017-01-11 04:29:48 -05:00

5945 Commits