libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-11-01 10:51:12 +00:00

Author	SHA1	Message	Date
Michal Privoznik	a70f3b1c77	includes: Install libvirt-common.h The libvirt-common.h is a build-time generated file from .in. Obviously, it's generated into builddir and not srcdir. The problem is that the list of header files to install, virinc_HEADERS, contains only $(srcdir)/*.h and this misses libvirt-common.h. This problem is pretty obvious when doing a VPATH build. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-30 11:55:45 +01:00
John Ferlan	63e15ad5e0	logical: Create helper virStorageBackendLogicalParseVolExtents Create a helper routine in order to parse any extents information including the extent size, length, and the device string contained within the generated 'lvs' output string. A future patch would then be able to avoid the code more cleanly Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-01-29 14:13:14 -05:00
Wido den Hollander	84678267e4	rbd: Open in Read-Only mode when refreshing a volume By opening an RBD volume in Read-Only mode we do not register a watcher on the header object inside the Ceph cluster. Refreshing a volume only calls rbd_stat(), which is an operation that does not write to an RBD image. This allows us to use a cephx user which has no write permissions if we want to use the libvirt storage pool for informational purposes only. It also saves us a write into the Ceph cluster, which should speed up refreshing an RBD pool. rbd_open_read_only() is available in all librbd versions which also support rbd_open(). Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 14:09:34 -05:00
Wido den Hollander	0b15f92032	rbd: Implement buildVolFrom using RBD cloning RBD supports cloning by creating a snapshot, protecting it and creating a child image based on that snapshot afterwards. The RBD storage driver will try to find a snapshot with zero deltas between the current state of the original volume and the snapshot. If such a snapshot is found, a clone/child image will be created using the rbd_clone2() function from librbd. rbd_clone2() is available in librbd since Ceph version Dumpling (0.67), which dates back to August 2013. It will use the same features, stripe size and stripe count as the parent image. This implementation will only create a single snapshot on the parent image if it never changes. This reduces the number of snapshots created for that RBD image, which benefits the performance of the Ceph cluster. During build the decision will be made to use either rbd_diff_iterate() or rbd_diff_iterate2(). The latter is faster, but only available on Ceph versions after 0.94 (Hammer). Cloning is only supported if RBD format 2 is used. All images created by libvirt are already format 2. If an RBD format 1 image is used as the original volume, the backend will report a VIR_ERR_OPERATION_UNSUPPORTED error. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 11:11:51 -05:00
Wido den Hollander	34872ca461	rbd: Add support for wiping RBD volumes using TRIM. Using VIR_STORAGE_VOL_WIPE_ALG_TRIM a RBD volume can be trimmed down to 0 bytes using rbd_discard() Effectively all the data on the volume will be lost/gone, but the volume remains available for use afterwards. Starting at offset 0 the storage pool will call rbd_discard() in stripe size * count increments which is usually 4MB. Stripe size being 4MB and count 1. rbd_discard() is available since Ceph version Dumpling (0.67) which dates back to August 2013. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 11:11:32 -05:00
Wido den Hollander	63cdc92f04	storage: Add TRIM algorithm to storage volume API This new algorithm adds support for wiping volumes using TRIM. It does not overwrite all the data in a volume, but it tells the backing storage pool/driver that all bytes in a volume can be discarded. It depends on the backing storage pool how this is handled. A SCSI backend might send UNMAP commands to remove all data present on a LUN. A Ceph backend might use rbd_discard() to instruct the Ceph cluster that all data on that RBD volume can be discarded. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 11:09:14 -05:00
Wido den Hollander	f226ecbfbb	rbd: Add support for wiping RBD volumes When wiping, the RBD image will be filled with zeros starting at offset 0 and continuing until the end of the volume. This will result in the RBD volume growing to its full allocation on the Ceph cluster. All data on the volume will be overwritten, however, making it unavailable. It does NOT take any RBD snapshots into account. The original data might still be in a snapshot of that RBD volume. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 10:42:36 -05:00
Wido den Hollander	69535c6124	storage: Adjust fix virStorageBackendVolWipeLocal switch Use the cast of (virStorageVolWipeAlgorithm) adding the missing case:'s (VIR_STORAGE_VOL_WIPE_ALG_ZERO and VIR_STORAGE_VOL_WIPE_ALG_LAST). Additionally, the old code would also still run the SCRUB command on default since it didn't go to cleanup when an invalid flag was supplied. We now go to cleanup and exit if an invalid flag is provided. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 10:24:20 -05:00
John Ferlan	680030c42b	logical: Fix comment examples for virStorageBackendLogicalFindLVs When commit id '82c1740a' made changes to the output format (changing from using a ',' separator to '#'), the examples in the lvs output from the comments weren't changed. Additionally, the two new fields added ('segtype' and 'stripes') were not included in the output, leaving it well confusing. This patch fixes the sample output, adds a 'striped' example, and makes other comment related adjustments for long line and spacing between followup 'NB' remarks (while I'm there). Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-01-28 16:50:46 -05:00
Andrea Bolognani	11ef5869fb	pci: Use bool return type for some virPCIDeviceGet() functions The affected functions are: virPCIDeviceGetManaged() virPCIDeviceGetUnbindFromStub() virPCIDeviceGetRemoveSlot() virPCIDeviceGetReprobe() Change their return type from unsigned int to bool: the corresponding members in struct _virPCIDevice are defined as bool, and even the corresponding virPCIDeviceSet() functions take a bool value as input so there's no point in these functions having unsigned int as return type. Suggested-by: John Ferlan <jferlan@redhat.com>	2016-01-28 17:27:58 +01:00
Michal Privoznik	3f3f7a824c	gendispatch: Don't output spaces on empty line In our generator for some code we put empty lines in the output to separate blocks of code. However, in some cases we put couple of spaces on the empty line too. It's not bug, it just isn't nice. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-28 17:10:54 +01:00
Andrea Bolognani	171607296d	pci: Add debug messages when unbinding from stub driver Unbinding a PCI device from the stub driver can require several steps, and it can be useful for debugging to be able to trace which of these steps are performed and which are skipped for each device.	2016-01-28 12:20:53 +01:00
Andrea Bolognani	771eaeb2b3	pci: Phase out virPCIDeviceReattachInit() The name is confusing, and there are just two uses: one is a test case, and the other will be removed as part of an upcoming refactoring of the hostdev code.	2016-01-28 11:31:28 +01:00
Peter Krempa	d773b57d22	qemu: don't iterate vcpus using priv->nvcpupids in qemuProcessSetSchedParams This should be the last offender.	2016-01-28 09:58:24 +01:00
Peter Krempa	763941749e	conf: disallow empty cpuset for emulatorpin It's disallowed in the API.	2016-01-27 17:27:54 +01:00
Peter Krempa	31b782a147	conf: disallow empty cpusets for vcpu pinning when parsing XML They are disallowed in the pinning API and as default cpuset. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1293241	2016-01-27 17:27:54 +01:00
Peter Krempa	414b7eeae9	qemu: Don't use priv->ncpus to iterate cgroup setting Iterate over all cpus skipping inactive ones.	2016-01-27 17:27:54 +01:00
Andrea Bolognani	d87f0c0052	virnetdevopenvswitch: Don't call strlen() twice on the same string Commit `871e10f` fixed a memory corruption error, but called strlen() twice on the same string to do so. Even though the compiler is probably smart enough to optimize the second call away, having a single invocation makes the code slightly cleaner. Suggested-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-27 13:01:24 +01:00
Michal Privoznik	720bc953f8	virnetdevmacvlan: Provide stubs for build without macvtap In `370608b4c7` we have introduced two new internal APIs. However, there are no stubs for build without macvtap. Therefore build on systems lacking macvtap support (e.g. mingw or freebsd) fails when trying to link. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-27 10:07:46 +01:00
Jason J. Herne	871e10fc95	Fix libvirtd free() segfault when migrating guest with deleted open vswitch port libvirtd crashes on free()ing portData for an open vswitch port if that port was deleted. To reproduce: ovs-vsctl del-port vnet0 virsh migrate --live kvm1 qemu+ssh://dstHost/system Error message: libvirtd: * Error in `/usr/sbin/libvirtd': free(): invalid pointer: 0x000003ff90001e20 * The problem is that virCommandRun can return an empty string in the event that the port being queried does not exist. When this happens then we are unconditionally overwriting a newline character at position strlen()-1. When strlen is 0, we overwrite memory that does not belong to the string. The fix: Only overwrite the newline if the string is not empty. Reviewed-by: Bjoern Walk <bwalk@linux.vnet.ibm.com> Signed-off-by: Jason J. Herne <jjherne@linux.vnet.ibm.com>	2016-01-27 10:01:58 +01:00
Laine Stump	370608b4c7	util: keep/use a bitmap of in-use macvtap devices This patch creates two bitmaps, one for macvlan device names and one for macvtap. The bitmap position is used to indicate that libvirt is currently using a device with the name macvtap%d/macvlan%d, where %d is the position in the bitmap. When requested to create a new macvtap/macvlan device, libvirt will now look for the first clear bit in the appropriate bitmap and derive the device name from that rather than just starting at 0 and counting up until one works. When libvirtd is restarted, the qemu driver code that reattaches to active domains calls the appropriate function to "re-reserve" the device names as it is scanning the status of running domains. Note that it may seem strange that the retry counter now starts at 8191 instead of 5. This is because we now don't do a "pre-check" for the existence of a device once we've reserved it in the bitmap - we move straight to creating it; although very unlikely, it's possible that someone has a running system where they have a large number of network devices created outside libvirt named "macvtap%d" or "macvlan%d" - such a setup would still allow creating more devices with the old code, while a low retry max in the new code would cause a failure. Since the objective of the retry max is just to prevent an infinite loop, and it's highly unlikely to do more than 1 iteration anyway, having a high max is a reasonable concession in order to prevent lots of new failures.	2016-01-26 12:20:04 -05:00
Leno Hou	8c70d04bab	util: increase libnl buffer size In the following cases nl_recv() was returning the error "No buffer space available": * When switching CPUs to offline/online in a system more than 128 cpus * When using virsh to destroy domain in a system with many interfaces This patch sets the buffer size for all netlink sockets created by libnl to 128K and turns on message peeking for nl_recv(). This eliminates the "No buffer space available" errors seen in the cases above, and also preempts other future errors the smaller buffers could have caused. Signed-off-by: Leno Hou <houqy@linux.vnet.ibm.com> Signed-off-by: Laine Stump <laine@laine.org>	2016-01-26 12:20:04 -05:00
Pavel Hrdina	36785c7e77	device: cleanup input device code The current code was a little bit odd. At first we removed all possible implicit input devices from the domain definition, only to add them back later if there was any graphics device defined while parsing the XML description. That's not all: while formatting the domain definition to an XML description we at first ignore any input devices with a bus other than USB and VIRTIO, and a few lines later we add implicit input devices to the XML. This seems to me like a lot of code for nothing. This patch may look more complicated than the original approach, but this is the preferred way: modify/add driver specific stuff only in those drivers and do not deal with it in common parsing/formatting functions. The update is to add those implicit input devices into the config XML to follow the real HW configuration visible to the guest OS. There was also an inconsistency between our behavior and QEMU's, in that in QEMU there is no way to disable those implicit input devices for the x86 architecture and they are always available, even without a graphics device. This also applies to the XEN hypervisor. The VZ driver already does its part by putting the correct implicit devices into the live XML. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-01-26 17:53:33 +01:00
Pavel Hrdina	2686e44e05	tests: add some missing tests to qemuxml2xmltest Those tests are in qemuargv2xmltest and it makes sense to include them also in qemuxml2xmltest and qemuxml2argvtest. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-01-26 17:53:33 +01:00
Pavel Hrdina	2d446b6eeb	tests: use virtTestDifferenceFull in tests where we have output file This will enable regenerate functionality for those tests to make developer lives easier while updating tests. Signed-off-by: Pavel Hrdina <phrdina@redhat.com>	2016-01-26 17:53:33 +01:00
Michal Privoznik	c7f5e26b5f	vircgroup: Finish renaming of virCgroupIsolateMount In `dc576025c3` we renamed virCgroupIsolateMount function to virCgroupBindMount. However, we forgot about one occurrence in section of the code which provides stubs for platforms without support for CGroups like *BSD for instance. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-26 17:39:47 +01:00
Daniel P. Berrange	dc576025c3	lxc: don't try to hide parent cgroups inside container On the host when we start a container, it will be placed in a cgroup path of /machine.slice/machine-lxc\x2ddemo.scope under /sys/fs/cgroup/* Inside the container's namespace we need to set up /sys/fs/cgroup mounts, and currently will bind mount /machine.slice/machine-lxc\x2ddemo.scope on the host to appear as / in the container. While this may sound nice, it confuses applications dealing with cgroups, because /proc/$PID/cgroup now does not match the directory in /sys/fs/cgroup This particularly causes problems for systemd and will make it create repeated path components in the cgroup for apps run in the container, e.g. /machine.slice/machine-lxc\x2ddemo.scope/machine.slice/machine-lxc\x2ddemo.scope/user.slice/user-0.slice/session-61.scope This also causes any systemd service that uses sd-notify to fail to start, because when systemd receives the notification it won't be able to identify the corresponding unit it came from. In particular this breaks rabbitmq-server startup. Future kernels will provide proper cgroup namespacing which will handle this problem, but until that time we should not try to play games with hiding parent cgroups. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2016-01-26 16:11:32 +00:00
Daniel P. Berrange	511e7c5bba	qemu: add reporting of vCPU wait time The VIR_DOMAIN_STATS_VCPU flag to virDomainListGetStats enables reporting of stats about vCPUs. Currently we only report the cumulative CPU running time and the execution state. This adds reporting of the wait time - time the vCPU wants to run, but the host scheduler has something else running ahead of it. The data is reported per-vCPU eg $ virsh domstats --vcpu demo Domain: 'demo' vcpu.current=4 vcpu.maximum=4 vcpu.0.state=1 vcpu.0.time=1420000000 vcpu.0.wait=18403928 vcpu.1.state=1 vcpu.1.time=130000000 vcpu.1.wait=10612111 vcpu.2.state=1 vcpu.2.time=110000000 vcpu.2.wait=12759501 vcpu.3.state=1 vcpu.3.time=90000000 vcpu.3.wait=21825087 In implementing this I notice our reporting of CPU execute time has very poor granularity, since we are getting it from /proc/$PID/stat. As a future enhancement we should prefer to get CPU execute time from /proc/$PID/schedstat or /proc/$PID/sched (if either exist on the running kernel) Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2016-01-26 14:34:23 +00:00
Luyao Huang	985f01a65f	virsh: fix cpu-stats command output format issue After commit `57177f1`, the cpu-stats command format changed to: CPU0: cpu_time 14401.507878990 seconds vcpu_time 14378732785511 This vcpu_time format is not user friendly. After this patch, it will change back to: CPU0: cpu_time 14401.507878990 seconds vcpu_time 14378.732785511 seconds https://bugzilla.redhat.com/show_bug.cgi?id=1301807 Signed-off-by: Luyao Huang <lhuang@redhat.com>	2016-01-26 09:23:49 +01:00
Peter Krempa	356e28b35e	util: buffer: Sanitize comment for virBufferAddBuffer Idioms are usually weird and obscure when translated literally.	2016-01-25 17:53:08 +01:00
Peter Krempa	7141fc7a27	test: Touch up error message when attempting to pin invalid vCPU Report error: invalid argument: requested vcpu '100' is not present in the domain instead of error: invalid argument: requested vcpu is higher than allocated vcpus	2016-01-25 17:53:08 +01:00
Peter Krempa	f82a8014c0	tests: qemuxml2xml: Order pinning information numerically A future patch will refactor the storage of the pinning information in a way where the ordering will be lost. Order them numerically to avoid changing the tests later.	2016-01-25 17:53:07 +01:00
Peter Krempa	a2e80549a2	virsh: cpu-stats: Remove unneeded flags virDomainGetCPUStats doesn't support flags so there's no need to carry the 'flags' variable around. Additionally since the API is poorly designed I doubt that it will be extended.	2016-01-25 17:45:09 +01:00
Peter Krempa	57177f1abd	virsh: cpu-stats: Extract common printing code into a function Simplify the code by extracting a common code path.	2016-01-25 17:45:09 +01:00
Peter Krempa	51f07d8f0f	(qemu\|lxc)DomainGetCPUStats: Clean up Remove unnecessary condition and variable.	2016-01-25 17:45:09 +01:00
Peter Krempa	68ee703bfe	vz: Fix invalid iteration of def->cputune.vcpupin The array doesn't necessarily have the same cardinality as the count of vCPUs for a domain. Iterating it can cause access beyond the end of the array.	2016-01-25 17:45:09 +01:00
Peter Krempa	b3c91b8a50	qemu: process: Disallow VMs with 0 vcpus Counterintuitively the user would end up with a VM with maximum number of vCPUs available. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1290324	2016-01-25 17:45:09 +01:00
Peter Krempa	adca15cf15	qemu: process: refactor and rename qemuValidateCpuMax to qemuValidateCpuCount Next patch will add minimum checking, so use a more generic name. Refactor return values to the commonly used semantics.	2016-01-25 17:45:09 +01:00
Michal Privoznik	99f8fb4c55	virt-host-validate: Fix error level for user namespace check From the code it seems to me that we need user namespace if configured in domain XML. Otherwise we don't use it at all. However our tool is more strict about that. Fix this discrepancy. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-25 16:53:23 +01:00
Michal Privoznik	d55e11a302	virt-host-validate: Check those CGroups that we actually use Since the introduction of virt-host-validate tool the set of cgroup controllers we use has changed so the tool is checking for some cgroups that we don't need (e.g. net_cls, although I doubt we have ever used that one) and is not checking for those we actually use (e.g. cpuset). Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-25 16:53:18 +01:00
Michal Privoznik	9cbd1ecc3e	virsh: Correctly detect inserted media in change-media command https://bugzilla.redhat.com/show_bug.cgi?id=1250331 It all works like this. The change-media command dumps domain XML, finds the corresponding cdrom device we want to change media in and returns it in the xmlNodePtr form. This way we don't have to bother with keeping all the subelements or attributes that we don't care about in the XML that is fed back to libvirt for the update API. Now, the problem is we try to be clever here and detect if disk already has a source (indicated by <source/> subelement). However, bare fact that the element is there does not mean disk has source. Make our clever check better. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-25 15:57:23 +01:00
Michal Privoznik	35c3aab44d	vmx: Adapt to emptyBackingString for cdrom-image https://bugzilla.redhat.com/show_bug.cgi?id=1266088 We are missing this value for cdrom-image device. It seems like there's no added value to extend this to other types of disk devices [1]. 1: https://www.redhat.com/archives/libvir-list/2016-January/msg01038.html Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-25 08:34:23 +01:00
Peter Krempa	4ac14cde9a	qemu: snapshot: Correctly report qemu error on 'savevm' Since 'savevm' was not converted to QMP libvirt has to parse for error strings in the text monitor output. One of the unhandled errors is produced when qemu treats a device as unmigratable. As current qemu actually does support AHCI migration this bug is applicable only to older versions of qemu. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1293899	2016-01-25 07:21:25 +01:00
Peter Krempa	0c1b0d83bb	qemu: monitor: Refactor error handling for 'savevm' Unify few error conditions into a single error reporting case.	2016-01-25 07:21:25 +01:00
Roman Bogorodskiy	ef01addb38	bhyve: bhyveload: respect boot dev and boot order Make bhyveload respect boot order as specified by os.boot section of the domain XML or by "boot order" for specific devices. As bhyve does not support a real boot order specification right now, it's just about choosing a single device to boot from.	2016-01-25 04:19:33 +03:00
Roman Bogorodskiy	318ae9f3be	conf: expose virDomainBootType(From\|To)String These functions are going to be used by the Bhyve driver.	2016-01-25 03:54:07 +03:00
Laine Stump	29cc45cb79	util: reset MAC address of macvtap passthrough physdev after disassociate libvirt always resets the MAC address of the physdev used for macvtap passthrough when the guest is finished with it. This was happening prior to the 802.1Qb[gh] DISASSOCIATE command, and was quite often failing, presumably because the driver wouldn't allow the MAC address to be reset while the association was still active, with a log message like this: virNetDevSetMAC:168 : Cannot set interface MAC to 00:00:00:00:00:00 on 'eth13': Cannot assign requested address This patch changes the order - we now do the 802.1Qb[gh] disassociate and delete the macvtap interface first, and then reset the MAC address.	2016-01-22 13:16:24 -05:00
Cole Robinson	81da8bc73b	lxc: fuse: Stub out Slab bits in /proc/meminfo 'free' on Fedora 23 wants to use the Slab field for calculating used memory. The equation is: used = MemTotal - MemFree - (Cached + Slab) - Buffers We already set Cached and Buffers to 0, do the same for Slab and its related values https://bugzilla.redhat.com/show_bug.cgi?id=1300781	2016-01-22 08:32:00 -05:00
Cole Robinson	c7be484d11	lxc: fuse: Fill in MemAvailable for /proc/meminfo 'free' on Fedora 23 will use MemAvailable to calculate its 'available' field, but we are passing through the host's value. Set it to match MemFree, which is what 'free' will do for older linux that don't have MemAvailable https://bugzilla.redhat.com/show_bug.cgi?id=1300781	2016-01-22 08:32:00 -05:00
Cole Robinson	8418245a7e	lxc: fuse: Fix /proc/meminfo size calculation We virtualize bits of /proc/meminfo by replacing host values with values specific to the container. However for calculating the final size of the returned data, we are using the size of the original file and not the altered copy, which could give garbled output.	2016-01-22 08:32:00 -05:00
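
Commits 871e10fc95 and d87f0c0052 above describe an off-by-one hazard: overwriting the character at position strlen()-1 when the string may be empty. The following is a minimal sketch of the guarded pattern, not libvirt's actual code. The helper name strip_trailing_newline is hypothetical, and the extra check that the last character really is a newline is an assumption added here for safety.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not taken from libvirt): removes one trailing
 * newline from buf and leaves an empty string untouched. */
static void strip_trailing_newline(char *buf)
{
    assert(buf != NULL);

    /* Call strlen() once and reuse the result. */
    size_t len = strlen(buf);

    /* With len == 0, buf[len - 1] would index memory before the string,
     * so only touch the last character when the string is non-empty. */
    if (len > 0 && buf[len - 1] == '\n')
        buf[len - 1] = '\0';
}
```

Calling the helper on "vnet0\n" leaves "vnet0", while calling it on an empty string (the case the commit describes for a deleted port) writes nothing.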


21382 Commits