libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-12-26 23:55:23 +00:00

Author	SHA1	Message	Date
Michal Privoznik	d0baf54e53	qemu: Actually unshare() iff running as root https://bugzilla.redhat.com/show_bug.cgi?id=1413922 While all the code that deals with qemu namespaces correctly detects whether we are running as root (and turn into NO-OP for qemu:///session) the actual unshare() call is not guarded with such check. Therefore any attempt to start a domain under qemu:///session shall fail as unshare() is reserved for root. The fix consists of moving unshare() call (for which we have a wrapper called virProcessSetupPrivateMountNS) into qemuDomainBuildNamespace() where the proper check is performed. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Tested-by: Richard W.M. Jones <rjones@redhat.com>	2017-01-17 13:23:56 +01:00
Michal Privoznik	406e390962	qemu: Drop qemuDomainDeleteNamespace After previous commits, this function is no longer needed. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-10 13:04:57 +01:00
Michal Privoznik	6de3f11637	qemuProcessLaunch: fix indentation Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2017-01-05 14:38:45 +01:00
Wangjing (King, Euler)	3afaae4984	qemu: snapshot: restart CPUs when recover from interrupted snapshot job If we restart libvirtd while VM was doing external memory snapshot, VM's state be updated to paused as a result of running a migration-to-file operation, and then VM will be left as paused state. In this case we must restart the VM's CPUs to resume it. Signed-off-by: Wang King <king.wang@huawei.com>	2017-01-05 10:47:03 +01:00
Peter Krempa	e8f167a623	qemu: monitor: Don't resume lockspaces in resume event handler After qemu delivers the resume event it's already running and thus it's too late to enter lockspaces since it may already have modified the disk. The code only creates false log entries in the case when locking is enabled. The lockspace needs to be acquired prior to starting cpus.	2016-12-15 09:46:41 +01:00
Michal Privoznik	eadaa97548	qemu: Enter the namespace on relabelling Instead of trying to fix our security drivers, we can use a simple trick to relabel paths in both namespace and the host. I mean, if we enter the namespace some paths are still shared with the host so any change done to them is visible from the host too. Therefore, we can just enter the namespace and call SetAllLabel()/RestoreAllLabel() from there. Yes, it has slight overhead because we have to fork in order to enter the namespace. But on the other hand, no complexity is added to our code. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-12-15 09:25:16 +01:00
Michal Privoznik	bb4e529664	qemu: Spawn qemu under mount namespace Prime time. When it comes to spawning qemu process and relabelling all the devices it's going to touch, there's inherent race with other applications in the system (e.g. udev). Instead of trying convincing udev to not touch libvirt managed devices, we can create a separate mount namespace for the qemu, and mount our own /dev there. Of course this puts more work onto us as we have to maintain /dev files on each domain start and device hot(un-)plug. On the other hand, this enhances security also. From technical POV, on domain startup process the parent (libvirtd) creates: /var/lib/libvirt/qemu/$domain.dev /var/lib/libvirt/qemu/$domain.devpts The child (which is going to be qemu eventually) calls unshare() to create new mount namespace. From now on anything that child does is invisible to the parent. Child then mounts tmpfs on $domain.dev (so that it still sees original /dev from the host) and creates some devices (as explained in one of the previous patches). The devices have to be created exactly as they are in the host (including perms, seclabels, ACLs, ...). After that it moves $domain.dev mount to /dev. What's the $domain.devpts mount there for then you ask? QEMU can create PTYs for some chardevs. And historically we exposed the host ends in our domain XML allowing users to connect to them. Therefore we must preserve devpts mount to be shared with the host's one. To make this patch as small as possible, creating of devices configured for domain in question is implemented in next patches. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-12-15 09:25:16 +01:00
Viktor Mihajlovski	283e290434	qemu: Allow use of hot plugged host CPUs if no affinity set If the cpuset cgroup controller is disabled in /etc/libvirt/qemu.conf QEMU virtual machines can in principle use all host CPUs, even if they are hot plugged, if they have no explicit CPU affinity defined. However, there's libvirt code supposed to handle the situation where the libvirt daemon itself is not using all host CPUs. The code in qemuProcessInitCpuAffinity attempts to set an affinity mask including all defined host CPUs. Unfortunately, the resulting affinity mask for the process will not contain the offline CPUs. See also the sched_setaffinity(2) man page. That means that even if the host CPUs come online again, they won't be used by the QEMU process anymore. The same is true for newly hot plugged CPUs. So we are effectively preventing that QEMU uses all processors instead of enabling it to use them. It only makes sense to set the QEMU process affinity if we're able to actually grow the set of usable CPUs, i.e. if the process affinity is a subset of the online host CPUs. There's still the chance that for some reason the deliberately chosen libvirtd affinity matches the online host CPU mask by accident. In this case the behavior remains as it was before (CPUs offline while setting the affinity will not be used if they show up later on). Signed-off-by: Viktor Mihajlovski <mihajlov@linux.vnet.ibm.com> Tested-by: Matthew Rosato <mjrosato@linux.vnet.ibm.com>	2016-12-13 18:25:00 -05:00
Nikolay Shirokovskiy	1215965a4c	qemu: mark user defined websocket as used We need extra state variable to distinguish between autogenerated and user defined cases after auto generation is done.	2016-12-09 07:54:34 -05:00
Nikolay Shirokovskiy	b07cfd724f	qemu: Refactor qemuProcessGraphicsReservePorts Use switch for enums rather than if/else conditions.	2016-12-09 07:40:46 -05:00
Michal Privoznik	ce937d3710	security: Drop virSecurityManagerSetHugepages Since its introduction in 2012 this internal API did nothing. Moreover we have the same API that does exactly the same: virSecurityManagerDomainSetPathLabel. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-12-08 15:45:52 +01:00
Michal Privoznik	f55afd83b1	qemu: Create hugepage path on per domain basis If you've ever tried running a huge page backed guest under different user than in qemu.conf, you probably failed. Problem is even though we have corresponding APIs in the security drivers, there's no implementation and thus we don't relabel the huge page path. But even if we did, so far all of the domains share the same path: /hugepageMount/libvirt/qemu Our only option there would be to set 0777 mode on the qemu dir which is totally unsafe. Therefore, we can create dir on per-domain basis, i.e.: /hugepageMount/libvirt/qemu/domainName and chown domainName dir to the user that domain is configured to run under. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-12-08 15:45:52 +01:00
Laine Stump	9b0848d523	qemu: propagate virQEMUDriver object to qemuDomainDeviceCalculatePCIConnectFlags If libvirtd is running unprivileged, it can open a device's PCI config data in sysfs, but can only read the first 64 bytes. But as part of determining whether a device is Express or legacy PCI, qemuDomainDeviceCalculatePCIConnectFlags() will be updated in a future patch to call virPCIDeviceIsPCIExpress(), which tries to read beyond the first 64 bytes of the PCI config data and fails with an error log if the read is unsuccessful. In order to avoid creating a parallel "quiet" version of virPCIDeviceIsPCIExpress(), this patch passes a virQEMUDriverPtr down through all the call chains that initialize the qemuDomainFillDevicePCIConnectFlagsIterData, and saves the driver pointer with the rest of the iterdata so that it can be used by qemuDomainDeviceCalculatePCIConnectFlags(). This pointer isn't used yet, but will be used in an upcoming patch (that detects Express vs legacy PCI for VFIO assigned devices) to examine driver->privileged.	2016-11-30 15:28:07 -05:00
Jiri Denemark	0355de2e77	qemuProcessReconnect: Avoid relabeling images after migration Restarting libvirtd on the source host at the end of migration when a domain is already running on the destination would cause image labels to be reset effectively killing the domain. Commit `e8d0166e1d` fixed similar issue on the destination host, but kept the source always resetting the labels, which was mostly correct except for the specific case handled by this patch. https://bugzilla.redhat.com/show_bug.cgi?id=1343858 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-11-29 12:37:04 +01:00
Peter Krempa	b87a11340f	qemu: capabilities: Don't partially reprobe caps on process reconnect Thanks to the complex capability caching code virQEMUCapsProbeQMP was never called when we were starting a new qemu VM. On the other hand, when we are reconnecting to the qemu process we reload the capability list from the status XML file. This means that the flag preventing the function being called was not set and thus we partially reprobed some of the capabilities. The recent addition of CPU hotplug clears the QEMU_CAPS_QUERY_HOTPLUGGABLE_CPUS if the machine does not support it. The partial re-probe on reconnect results in attempting to call the unsupported command and then killing the VM. Remove the partial reprobe and depend on the stored capabilities. If it will be necessary to reprobe the capabilities in the future, we should do a full reprobe rather than this partial one.	2016-11-28 10:02:36 +01:00
Jiri Denemark	7bf6f345e0	qemu: Probe CPU models for KVM and TCG CPU models (and especially some additional details which we will start probing for later) differ depending on the accelerator. Thus we need to call query-cpu-definitions in both KVM and TCG mode to get all data we want. Tests in tests/domaincapstest.c are temporarily switched to TCG to avoid having to squash even more stuff into this single patch. They will all be switched back later in separate commits. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-11-25 20:34:27 +01:00
Michal Privoznik	c2a5a4e7ea	virstring: Unify string list function names We have a couple of functions that operate over NULL terminated lists of strings. However, our naming sucks: virStringJoin virStringFreeList virStringFreeListCount virStringArrayHasString virStringGetFirstWithPrefix We can do better: virStringListJoin virStringListFree virStringListFreeCount virStringListHasString virStringListGetFirstWithPrefix Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-11-25 13:54:05 +01:00
Nikolay Shirokovskiy	3c1c56781d	qemu: drop write-only agentStart	2016-11-23 11:31:14 +03:00
Nikolay Shirokovskiy	6ba861ae36	qemu: agent: cleanup agent error flag correctly Sometimes after domain restart agent is unavailable even if it is up and running in guest. Diagnostic message is "QEMU guest agent is not available due to an error" that is 'priv->agentError' is set. Investigation shows that 'priv->agent' is not NULL, so error flag is set probably during domain shutdown process and not cleaned up eventually. The patch is quite simple - just clean up error flag unconditionally upon domain stop. Other hunks address other cases when error flag is not cleaned up. 1. processSerialChangedEvent. We need to clean error flag unconditionally here too. For example if upon first 'connected' event we fail to connect and set error flag and then connect on second 'connected' event then error flag will remain set erroneously and make agent unavailable. 2. qemuProcessHandleAgentEOF. If error flag is set and we get EOF we need to change state (and diagnostic) from 'error' to 'not connected'.	2016-11-23 11:14:44 +03:00
Nikolay Shirokovskiy	851ae08e3e	qemu: agent: handle agent connection errors in one place qemuConnectAgent return -1 or -2 in case of different errors. A. -1 is a case of unsuccessful connection to guest agent. B. -2 is a case of destroyed domain during connection attempt. All qemuConnectAgent callers handle the first error the same way so let's move this logic into qemuConnectAgent itself. Patched function returns 0 in case A and -1 in case B.	2016-11-23 11:14:11 +03:00
Marc Hartmayer	1c122e737e	Refactoring: Use virHostdevIsSCSIDevice() Use the util function virHostdevIsSCSIDevice() to simplify if statements. Signed-off-by: Marc Hartmayer <mhartmay@linux.vnet.ibm.com> Reviewed-by: Bjoern Walk <bwalk@linux.vnet.ibm.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com>	2016-11-22 14:37:36 +01:00
Marc Hartmayer	505bc9b025	qemu: Fix improper union member access on hostdevs Add missing checks if a hostdev is a subsystem/SCSI device before access the union member 'subsys'/'scsi'. Also fix indentation and simplify qemuDomainObjCheckHostdevTaint(). Signed-off-by: Marc Hartmayer <mhartmay@linux.vnet.ibm.com> Reviewed-by: Bjoern Walk <bwalk@linux.vnet.ibm.com> Reviewed-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com>	2016-11-22 14:37:36 +01:00
Jiri Denemark	d73422c186	cpu: Introduce virCPUConvertLegacy API PPC driver needs to convert POWERx_v* legacy CPU model names into POWERx to maintain backward compatibility with existing domains. This patch adds a new step into the guest CPU configuration work flow which CPU drivers can use to convert legacy CPU definitions. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-11-15 15:49:16 +01:00
Jiri Denemark	2a2ce08a6d	cpu: Make models array in virCPUTranslate constant The API doesn't change the array so let's make it constant. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-11-15 15:49:16 +01:00
Peter Krempa	93d9ff3da0	qemu: process: detect if dimm aliases are broken on reconnect Detect on reconnect to a running qemu VM whether the alias of a hotpluggable memory device (dimm) does not match the dimm slot number where it's connected to. This is necessary as qemu is actually considering the alias as machine ABI used to connect the backend object to the dimm device. This will require us to keep them consistent so that we can reliably restore them on migration. In some situations it was currently possible to create a mismatched configuration and qemu would refuse to restore the migration stream. To avoid breaking existing VMs we'll need to keep the old algorithm though.	2016-11-10 17:36:55 +01:00
John Ferlan	daf5c651f0	qemu: Add a secret object to/for a char source dev Add the secret object so the 'passwordid=' can be added if the command line if there's a secret defined in/on the host for TCP chardev TLS objects. Preparation for the secret involves adding the secinfo to the char source device prior to command line processing. There are multiple possibilities for TCP chardev source backend usage. Add test for at least a serial chardev as an example.	2016-10-26 07:18:25 -04:00
Pavel Hrdina	0298531b29	domain: Add optional 'tls' attribute for TCP chardev Add an optional "tls='yes\|no'" attribute for a TCP chardev. For QEMU, this will allow for disabling the host config setting of the 'chardev_tls' for a domain chardev channel by setting the value to "no" or to attempt to use a host TLS environment when setting the value to "yes" when the host config 'chardev_tls' setting is disabled, but a TLS environment is configured via either the host config 'chardev_tls_x509_cert_dir' or 'default_tls_x509_cert_dir' Signed-off-by: John Ferlan <jferlan@redhat.com> Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-10-24 16:05:33 +02:00
John Ferlan	77a12987a4	Introduce virDomainChrSourceDefNew for virDomainChrDefPtr Change the virDomainChrDef to use a pointer to 'source' and allocate that pointer during virDomainChrDefNew. This has tremendous "fallout" in the rest of the code which mainly has to change source.$field to source->$field. Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-10-21 14:03:36 -04:00
John Ferlan	a99d9082ac	qemu: Remove unnecessary cfg fetch/unref qemuProcessPrepareDomain has no need to fetch/unref the cfg, so remove it. Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-10-17 15:38:32 -04:00
Michal Privoznik	507032d98d	virDomainNetGetActualType: Return type is virDomainNetType This function for some weird reason returns integer instead of virDomainNetType type. It is important to return the correct type so that we know what values we can expect. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-10-14 10:15:51 +08:00
Michal Privoznik	b7d2d4af2b	src: Treat PID as signed This initially started as a fix of some debug printing in virCgroupDetect. However it turned out that other places suffer from the similar problem. While dealing with pids, esp. in cases where we cannot use pid_t for ABI stability reasons, we often chose an unsigned integer type. This makes no sense as pid_t is signed. Also, new syntax-check rule is introduced so we won't repeat this mistake. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-10-13 17:58:56 +08:00
Pavel Hrdina	33af92a91c	qemu_process: always check capabilities for video devices Before this patch we've checked qemu capabilities for video devices only while constructing qemu command line using "-device" option. Since we support qemu only if "-device" option is present we can use the same capabilities to check also video devices while using "-vga" option to construct qemu command line. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-10-12 17:46:47 +02:00
Pavel Hrdina	8fed30d004	qemu_process: move video validation out of qemu_command Runtime validation that depend on qemu capabilities should be moved into qemuProcessStartValidateXML. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-10-12 17:46:47 +02:00
Pavel Hrdina	34a4447bd4	qemu_capabilities: join capabilities for qxl and qxl-vga devices This patch simplifies QEMU capabilities for QXL video device. QEMU exposes this device as qxl-vga and qxl and they are both the same device with the same set of parameters, the only difference is that qxl-vga includes VGA compatibility. Based on QEMU code they are tied together so it's safe to check only for presence of only one of them. This patch also removes an invalid test case "video-qxl-sec-nodevice" where there is only qxl-vga device and qxl device is not present. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-10-12 17:46:47 +02:00
Pavel Hrdina	3632ddc766	qemu_process: move qemuProcessStartValidateGraphics to correct place Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-10-12 17:46:46 +02:00
Peter Krempa	2c739866df	qemu: attach: Close monitor socket on connection failure If attaching to a qemu process fails after opening the monitor socket libvirt does not clean up the monitor. As the monitor also holds a reference to the domain object the qemu attach API basically leaks it. QEMU also does not interact on a second monitor connection and thus a further attempt to attach to it would lock up. Prevent libvirt from leaking the monitor by explicitly closing it. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1378401	2016-10-05 08:52:34 +02:00
Peter Krempa	6ff3e65012	qemu: process: Enforce 'vcpu' order range to <1,maxvcpus> The current code that validates duplicate vcpu order would not work properly if the order would exceed def->maxvcpus. Limit the order to the interval described.	2016-09-30 08:25:20 +02:00
Peter Krempa	8924f1b256	qemu: process: Don't use shifted indexes for vcpu order verification Allocate a one larger bitmap rather than shifting the indexes back to zero.	2016-09-30 08:25:20 +02:00
Peter Krempa	3d5dd28995	qemu: process: Fix off-by-one in vcpu order duplicate error message The bitmap indexes for the order duplicate check are shifted to 0 since vcpu order 0 is not allowed. The error message doesn't need such treating though. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1370360	2016-09-30 08:25:20 +02:00
Jiri Denemark	7ce711a30e	qemu: Update guest CPU def in live XML Storing the updated CPU definition in the live domain definition saves us from having to update it over and over when we need it. Not to mention that we will soon further update the CPU definition according to QEMU once it's started. A highly wanted side effect of this patch, libvirt will pass all CPU features explicitly specified in domain XML to QEMU, even those that are already included in the host model. This patch should fix the following bugs: https://bugzilla.redhat.com/show_bug.cgi?id=1207095 https://bugzilla.redhat.com/show_bug.cgi?id=1339680 https://bugzilla.redhat.com/show_bug.cgi?id=1371039 https://bugzilla.redhat.com/show_bug.cgi?id=1373849 https://bugzilla.redhat.com/show_bug.cgi?id=1375524 https://bugzilla.redhat.com/show_bug.cgi?id=1377913 Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-09-22 15:40:09 +02:00
Jiri Denemark	46c49a3004	cpu: Rename cpuHasFeature to virCPUDataCheckFeature Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-09-22 15:40:09 +02:00
Jiri Denemark	3b6be3c0c5	cpu: Rework cpuUpdate The reworked API is now called virCPUUpdate and it should change the provided CPU definition into a one which can be consumed by the QEMU command line builder: - host-passthrough remains unchanged - host-model is turned into custom CPU with a model and features copied from host - custom CPU with minimum match is converted similarly to host-model - optional features are updated according to host's CPU Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-09-22 15:40:09 +02:00
Jiri Denemark	b27adaed37	qemu: Propagate virCapsPtr to virQEMUCapsNewForBinaryInternal Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-09-22 15:40:08 +02:00
Jiri Denemark	e9634933ea	qemu: Separate guest CPU validation from command line creation qemu_command.c should deal with translating our domain definition into a QEMU command line and nothing else. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-09-22 15:40:08 +02:00
Pavel Hrdina	53e3f69b3c	qemu_process: move graphics validation into separate function Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-09-21 23:14:10 +02:00
Chen Hanxiao	5853ea85dc	qemu_process: show shutoff reasons when debug log disabled We have a few scenarios where libvirtd would invoke qemuProcessStop and leave a "shutting down" in /var/log/libvirt/qemu/$DOMAIN.log. The shutoff reason showing in debug log is also very important for us to know why VM shutting down in domain log, as we seldom enable debug log of libvirtd. Signed-off-by: Chen Hanxiao <chenhanxiao@gmail.com>	2016-09-21 17:03:24 -04:00
Chen Hanxiao	fb360df4b0	qemu_process: fix a typo s/unitl/until Signed-off-by: Chen Hanxiao <chenhanxiao@gmail.com>	2016-09-20 10:48:58 +02:00
Peter Krempa	f428ff8ad4	qemu: Add missing 'p' to qemuCgrouEmulatorAllNodesRestore	2016-09-13 12:24:02 +02:00
Jiri Denemark	97a87333a0	Add helper for removing transient definition The code for replacing domain's transient definition with the persistent one is repeated in several places and we'll need to add one more. Let's make a nice helper for it. Signed-off-by: Jiri Denemark <jdenemar@redhat.com>	2016-09-08 22:25:22 +02:00
Peter Krempa	68115fe0ab	qemu: process: Fix start with unpluggable vcpus with NUMA pinning Similarly to vcpu hotplug the emulator thread cgroup numa mapping needs to be relaxed while hot-adding vcpus so that the threads can allocate data in the DMA zone. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1370084	2016-09-07 16:05:01 +02:00
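
The three vcpu order commits above (6ff3e65012, 8924f1b256, 3d5dd28995) describe a duplicate check that limits each order to <1,maxvcpus> and uses a bitmap one slot larger than maxvcpus instead of shifting indexes to zero. The following is only a rough sketch of that idea, not libvirt's code: the function name `check_vcpu_order`, its return codes, and the plain `bool` array standing in for a bitmap are all assumptions made for illustration.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>
#include <stdlib.h>

/*
 * Hypothetical helper (not a libvirt function): validate explicit vcpu
 * order values.
 * Returns 0 on success, -1 if an order is outside <1,maxvcpus>,
 * -2 if an order is used twice, -3 on allocation failure.
 */
int
check_vcpu_order(const unsigned int *order, size_t nvcpus,
                 unsigned int maxvcpus)
{
    /* One extra slot so that order N maps to index N without shifting. */
    bool *seen = calloc((size_t)maxvcpus + 1, sizeof(*seen));
    int ret = 0;
    size_t i;

    if (!seen)
        return -3;

    for (i = 0; i < nvcpus; i++) {
        /* Range check first, so the array is never indexed out of bounds. */
        if (order[i] < 1 || order[i] > maxvcpus) {
            fprintf(stderr, "vcpu order '%u' exceeds the range <1,%u>\n",
                    order[i], maxvcpus);
            ret = -1;
            break;
        }

        /* The message reports the order as given, with no index shift. */
        if (seen[order[i]]) {
            fprintf(stderr, "duplicate vcpu order '%u'\n", order[i]);
            ret = -2;
            break;
        }

        seen[order[i]] = true;
    }

    free(seen);
    return ret;
}
```

In this sketch, orders {1, 2, 3} with maxvcpus 4 pass, {1, 2, 2} is rejected as a duplicate, and both 0 and 5 are rejected as out of range. Checking the range before touching the array is what makes the one-larger allocation safe.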


822 Commits