libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-12-29 00:55:22 +00:00

Author	SHA1	Message	Date
Joao Martins	a040ba9ed4	libxl: set net device prefix Use the newly added virCapabilitiesSetNetPrefix to set the network prefix for the driver. This in return will be use by NetDefFormat() and NetDefParseXML() routines to free any interface name that start with the registered prefix. Acked-by: Daniel P. Berrange <berrange@redhat.com> Signed-off-by: Joao Martins <joao.m.martins@oracle.com>	2016-02-04 12:47:42 +00:00
Joao Martins	cd57b7c742	conf: add caps to virDomainSaveConfig virDomainSaveConfig calls virDomainDefFormat which was setting the caps to NULL, thus keeping the old behaviour (i.e. not looking at netprefix). This patch adds the virCapsPtr to the function and allows the configuration to be saved and skipping interface names that were registered with virCapabilitiesSetNetPrefix(). Signed-off-by: Joao Martins <joao.m.martins@oracle.com>	2016-02-04 12:38:27 +00:00
Joao Martins	d239a5427f	conf: add caps to virDomainDefFormat* And use the newly added caps->host.netprefix (if it exists) for interface names that match the autogenerated target names. Signed-off-by: Joao Martins <joao.m.martins@oracle.com>	2016-02-04 12:38:26 +00:00
Joao Martins	481e9bd0f6	conf: add prefix in virDomainNetDefParseXML And use the newly added caps->host.netprefix for free interface names that match the autogenerated target names. Signed-off-by: Joao Martins <joao.m.martins@oracle.com>	2016-02-04 11:15:51 +00:00
Joao Martins	819d1d9438	conf: add net device prefix to capabilities In the reverted commit `d2e5538b1`, the libxl driver was changed to copy interface names autogenerated by libxl to the corresponding network def in the domain's virDomainDef object. The copied name is freed when the domain transitions to the shutoff state. But when migrating a domain, the autogenerated name is included in the XML sent to the destination host. It is possible an interface with the same name already exists on the destination host, causing migration to fail. This patch defines a new capability for setting the network device prefix that will be used in the driver. Valid prefixes are VIR_NET_GENERATED_PREFIX or the one announced by the driver. Signed-off-by: Joao Martins <joao.m.martins@oracle.com>	2016-02-04 11:15:51 +00:00
Roman Bogorodskiy	c94f6d4dff	storage: zfs: flexible use of 'volmode' option There are slight differences in various ZFS implementations. Specifically, ZFS on FreeBSD requires to set value of 'volmode' option to 'dev' to expose volumes as raw disk device (that's what we need) rather than geom provides, for example. With ZFS on Linux, however, such option is not available and volumes exposed like we need by default. To make our implementation more flexible, only pass 'volmode' when it's supported. Support is checked by parsing usage information of the 'zfs get' command.	2016-02-04 03:16:50 +03:00
Erik Skultety	8cd1d546e6	util: Export remoteSerializeTypedParameters internally via util Same as for deserializer, this method might get handy for admin one day. The major reason for this patch is to stay consistent with idea, i.e. when deserializer can be shared, why not serializer as well. The only problem to be solved was that the daemon side serializer uses a code snippet which handles sparse arrays returned by some APIs as well as removes any string parameters that can't be returned to older clients. This patch makes of the new virTypedParameterRemote datatype introduced by one of the pvious patches.	2016-02-03 15:46:45 +01:00
Erik Skultety	9afc115f73	util: Export remoteFreeTypedParameters internally via util Since the method is static to remote_driver, it can't even be used by our daemon. Other than that, it would be useful to be able to use it with admin as well. This patch uses the new virTypedParameterRemote datatype introduced in one of previous patches.	2016-02-03 15:46:45 +01:00
Erik Skultety	0472cef685	util: Export remoteDeserializeTypedParameters internally via util Currently, the deserializer is hardcoded into remote_driver which makes it impossible for admin to use it. One way to achieve a shared implementation (besides moving the code to another module) would be pass @ret_params_val as a void pointer as opposed to the remote_typed_param pointer and add a new extra argument specifying which of those two protocols is being used and typecast the pointer at the function entry. An example from remote_protocol: struct remote_typed_param_value { int type; union { int i; u_int ui; int64_t l; uint64_t ul; double d; int b; remote_nonnull_string s; } remote_typed_param_value_u; }; typedef struct remote_typed_param_value remote_typed_param_value; struct remote_typed_param { remote_nonnull_string field; remote_typed_param_value value; }; That would leave us with a bunch of if-then-elses that needed to be used across the method. This patch takes the other approach using the new datatype introduced in one of earlier commits.	2016-02-03 15:46:45 +01:00
Erik Skultety	41a459947f	util: Introduce virTypedParameterRemote datatype Both admin and remote protocols define their own types (remote_typed_param vs admin_typed_param). Because of the naming convention, admin typed params wouldn't be able to reuse the serialization/deserialization methods, which are tailored for use by remote protocol, even if those method were exported properly. In that case, introduce a new internal data type structurally copying both admin and remote protocols which, eventually, would allow serializer and deserializer to be used in a more generic way.	2016-02-03 15:46:45 +01:00
Nikolay Shirokovskiy	1e93470df0	qemu: qemuDomainRename and virDomainObjListNumOfDomains ABBA deadlock fix A pretty nasty deadlock occurs while trying to rename a VM in parallel with virDomainObjListNumOfDomains. The short description of the problem is as follows: Thread #1: qemuDomainRename: ------> aquires domain lock by qemuDomObjFromDomain ---------> waits for domain list lock in any of the listed functions: - virDomainObjListFindByName - virDomainObjListRenameAddNew - virDomainObjListRenameRemove Thread #2: virDomainObjListNumOfDomains: ------> aquires domain list lock ---------> waits for domain lock in virDomainObjListCount Introduce generic virDomainObjListRename function for renaming domains. It aquires list lock in right order to avoid deadlock. Callback is used to make driver specific domain updates. Signed-off-by: Nikolay Shirokovskiy <nshirokovskiy@virtuozzo.com> Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-02-03 15:20:11 +01:00
Martin Kletzander	92757d4d2d	systemd: Add virSystemdGetMachineNameByPID Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2016-02-03 14:57:43 +01:00
Martin Kletzander	9ba2646291	Revert "systemd: Escape only needed characters for machined" This reverts commit `0e0149ce91`. That commit was added to comply with systemd rules that were changed in the meantime, so this patch is pointless.	2016-02-03 14:44:37 +01:00
Ján Tomko	e1d7273f24	Simplify virDomainParseMemory Do not store the return value of virDomainParseScaledValue, it was overwritten anyway. Delete the cleanup label, there is nothing to clean up.	2016-02-03 13:22:43 +01:00
Peter Krempa	598927a5bc	conf: Split out logic to determine whether cpupin was provided	2016-02-03 13:10:04 +01:00
Peter Krempa	451b955d62	qemu: domain: Prepare qemuDomainDetectVcpuPids for reuse Free the old vcpupids array in case when this function is called again during the run of the VM. It will be later reused in the vCPU hotplug code. The function now returns the number of detected VCPUs.	2016-02-03 13:10:04 +01:00
Peter Krempa	e97d1d20b1	qemu: Move and rename qemuProcessDetectVcpuPIDs to qemuDomainDetectVcpuPids Future patches will tweak and reuse the function in different places so move it separately first.	2016-02-03 13:10:04 +01:00
Peter Krempa	a190744aa9	qemu: cpu hotplug: Set vcpu state directly in the new structure Avoid using virDomainDefSetVcpus when we can set it directly in the structure.	2016-02-03 13:10:04 +01:00
Peter Krempa	9bf284daa9	conf: Add helper to retrieve bitmap of active vcpus for a definition In some cases it may be better to have a bitmap representing state of individual vcpus rather than iterating the definition. The new helper creates a bitmap representing the state from the domain definition.	2016-02-03 13:10:04 +01:00
Peter Krempa	58578f83bc	cgroup: Clean up virCgroupGetPercpuStats Use 'ret' for return variable name, clarify use of 'param_idx' and avoid unnecessary 'success' label. No functional changes. Also document the function.	2016-02-03 13:10:04 +01:00
Martin Kletzander	1794a0103a	qemu: Don't crash when create fails early Since commit `7140807917` we are generating socket path later than before -- when starting a domain. That makes one particular inconsistent state of a chardev, which was not possible before, currently valid. However, SELinux security driver forgot to guard the main restoring function by a check for NULL-paths. So make it no-op for NULL paths, as in the DAC driver. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1300532 Signed-off-by: Martin Kletzander <mkletzan@redhat.com>	2016-02-03 11:01:42 +01:00
Dmitry Andreev	d2dabff3a0	qemuDomainResume: allow to resume domain with guest panicked In case of guest panicked, preserved crashed domain has stopped CPUs. It's not possible to use tools like WinDbg for the problem investigation until we start CPUs back.	2016-02-03 10:33:48 +01:00
Nikolay Shirokovskiy	4a67b044fb	qemu: return -1 on error paths in qemuDomainSaveImageStartVM Error paths after sending the event that domain is started written as if ret = -1 which is set at the beginning of the function. It's common idioma to keep 'ret' equal to -1 until the end of function where it is set to 0. But here we use ret to keep result of restore operation too and thus breaks the idioma and its users :) Let's use different variable to hold restore result. Signed-off-by: Nikolay Shirokovskiy <nshirokovskiy@virtuozzo.com>	2016-02-03 10:27:35 +01:00
John Ferlan	6ec319b84f	logical: Clean up allocation when building regex on the fly Rather than a loop reallocating space to build the regex, just allocate it once up front, then if there's more than 1 nextent, append a comma and another regex_unit string. Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-02-02 13:13:05 -05:00
John Ferlan	c6d526f33f	logical: Use 'stripes' value for mirror/raid segtype The 'stripes' value is described as the "Number of stripes or mirrors in a logical volume". So add "mirror" and anything that starts with "raid" to the list of segtypes that can have an 'nextents' value greater than one. Use of raid segtypes (raid1, raid4, raid5, raid6, and raid10) is favored over mirror in more recent lvm code. Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-02-02 13:13:01 -05:00
John Ferlan	69267756d0	logical: Use VIR_APPEND_ELEMENT instead of VIR_REALLOC_N Rather than preallocating a set number of elements, then walking through the extents and adjusting the specific element in place, use the APPEND macros to handle that chore. Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-02-02 13:12:57 -05:00
Cole Robinson	92549b3b8a	qemu: Mark some functions as static	2016-02-01 10:33:25 -05:00
Michal Privoznik	c779bf8f62	fdstream: Realign Some lines in this file are misaligned which fires up my OCD. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-02-01 16:12:22 +01:00
Boris Fiuczynski	f73ad5d47e	qemu: Align dump options for watchdog and on_crash events Having on_crash set to either coredump-destroy or coredump-restart creates core dumps with option memory-only in the directory specified by auto_dump_path. When a watchdog is triggered with the action dump the core dump is also placed into the directory specified by auto_dump_path but is created without the option memory-only. This patch sets the option memory-only also for core dumps created by the watchdog event. Signed-off-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com> Reviewed-by: Bjoern Walk <bwalk@linux.vnet.ibm.com> Reviewed-by: Stefan Zimmermann <stzi@linux.vnet.ibm.com>	2016-02-01 13:47:56 +01:00
John Ferlan	63e15ad5e0	logical: Create helper virStorageBackendLogicalParseVolExtents Create a helper routine in order to parse any extents information including the extent size, length, and the device string contained within the generated 'lvs' output string. A future patch would then be able to avoid the code more cleanly Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-01-29 14:13:14 -05:00
Wido den Hollander	84678267e4	rbd: Open in Read-Only mode when refreshing a volume By opening a RBD volume in Read-Only we do not register a watcher on the header object inside the Ceph cluster. Refreshing a volume only calls rbd_stat() which is a operation which does not write to a RBD image. This allows us to use a cephx user which has no write permissions if we would want to use the libvirt storage pool for informational purposes only. It also saves us a write into the Ceph cluster which should speed up refreshing a RBD pool. rbd_open_read_only() is available in all librbd versions which also support rbd_open(). Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 14:09:34 -05:00
Wido den Hollander	0b15f92032	rbd: Implement buildVolFrom using RBD cloning RBD supports cloning by creating a snapshot, protecting it and create a child image based on that snapshot afterwards. The RBD storage driver will try to find a snapshot with zero deltas between the current state of the original volume and the snapshot. If such a snapshot is found a clone/child image will be created using the rbd_clone2() function from librbd. rbd_clone2() is available in librbd since Ceph version Dumpling (0.67) which dates back to August 2013. It will use the same features, strip size and stripe count as the parent image. This implementation will only create a single snapshot on the parent image if never changes. This reduces the amount of snapshots created for that RBD image which benefits the performance of the Ceph cluster. During build the decision will be made to use either rbd_diff_iterate() or rbd_diff_iterate2(). The latter is faster, but only available on Ceph versions after 0.94 (Hammer). Cloning is only supported if RBD format 2 is used. All images created by libvirt are already format 2. If a RBD format 1 image is used as the original volume the backend will report a VIR_ERR_OPERATION_UNSUPPORTED error. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 11:11:51 -05:00
Wido den Hollander	34872ca461	rbd: Add support for wiping RBD volumes using TRIM. Using VIR_STORAGE_VOL_WIPE_ALG_TRIM a RBD volume can be trimmed down to 0 bytes using rbd_discard() Effectively all the data on the volume will be lost/gone, but the volume remains available for use afterwards. Starting at offset 0 the storage pool will call rbd_discard() in stripe size * count increments which is usually 4MB. Stripe size being 4MB and count 1. rbd_discard() is available since Ceph version Dumpling (0.67) which dates back to August 2013. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 11:11:32 -05:00
Wido den Hollander	63cdc92f04	storage: Add TRIM algorithm to storage volume API This new algorithm adds support for wiping volumes using TRIM. It does not overwrite all the data in a volume, but it tells the backing storage pool/driver that all bytes in a volume can be discarded. It depends on the backing storage pool how this is handled. A SCSI backend might send UNMAP commands to remove all data present on a LUN. A Ceph backend might use rbd_discard() to instruct the Ceph cluster that all data on that RBD volume can be discarded. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 11:09:14 -05:00
Wido den Hollander	f226ecbfbb	rbd: Add support for wiping RBD volumes When wiping the RBD image will be filled with zeros started at offset 0 and until the end of the volume. This will result in the RBD volume growing to it's full allocation on the Ceph cluster. All data on the volume will be overwritten however, making it unavailable. It does NOT take any RBD snapshots into account. The original data might still be in a snapshot of that RBD volume. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 10:42:36 -05:00
Wido den Hollander	69535c6124	storage: Adjust fix virStorageBackendVolWipeLocal switch Use the cast of (virStorageVolWipeAlgorithm) adding the missing case:'s (VIR_STORAGE_VOL_WIPE_ALG_ZERO and VIR_STORAGE_VOL_WIPE_ALG_LAST). Additionally, the old code would also still run the SCRUB command on default since it didn't go to cleanup when a invalid flag was supplied. We now go to cleanup and exit if a invalid flag would be provided. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2016-01-29 10:24:20 -05:00
John Ferlan	680030c42b	logical: Fix comment examples for virStorageBackendLogicalFindLVs When commit id '82c1740a' made changes to the output format (changing from using a ',' separator to '#'), the examples in the lvs output from the comments weren't changed. Additionally, the two new fields added ('segtype' and 'stripes') were not included in the output, leaving it well confusing. This patch fixes the sample output, adds a 'striped' example, and makes other comment related adjustments for long line and spacing between followup 'NB' remarks (while I'm there). Signed-off-by: John Ferlan <jferlan@redhat.com>	2016-01-28 16:50:46 -05:00
Andrea Bolognani	11ef5869fb	pci: Use bool return type for some virPCIDeviceGet() functions The affected functions are: virPCIDeviceGetManaged() virPCIDeviceGetUnbindFromStub() virPCIDeviceGetRemoveSlot() virPCIDeviceGetReprobe() Change their return type from unsigned int to bool: the corresponding members in struct _virPCIDevice are defined as bool, and even the corresponding virPCIDeviceSet() functions take a bool value as input so there's no point in these functions having unsigned int as return type. Suggested-by: John Ferlan <jferlan@redhat.com>	2016-01-28 17:27:58 +01:00
Michal Privoznik	3f3f7a824c	gendispatch: Don't output spaces on empty line In our generator for some code we put empty lines in the output to separate blocks of code. However, in some cases we put couple of spaces on the empty line too. It's not bug, it just isn't nice. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-28 17:10:54 +01:00
Andrea Bolognani	171607296d	pci: Add debug messages when unbinding from stub driver Unbinding a PCI device from the stub driver can require several steps, and it can be useful for debugging to be able to trace which of these steps are performed and which are skipped for each device.	2016-01-28 12:20:53 +01:00
Andrea Bolognani	771eaeb2b3	pci: Phase out virPCIDeviceReattachInit() The name is confusing, and there are just two uses: one is a test case, and the other will be removed as part of an upcoming refactoring of the hostdev code.	2016-01-28 11:31:28 +01:00
Peter Krempa	d773b57d22	qemu: don't iterate vcpus using priv->nvcpupids in qemuProcessSetSchedParams This should be the last offender.	2016-01-28 09:58:24 +01:00
Peter Krempa	763941749e	conf: disallow empty cpuset for emulatorpin It's disallowed in the API.	2016-01-27 17:27:54 +01:00
Peter Krempa	31b782a147	conf: disallow empty cpusets for vcpu pinning when parsing XML They are disallowed in the pinning API and as default cpuset. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1293241	2016-01-27 17:27:54 +01:00
Peter Krempa	414b7eeae9	qemu: Don't use priv->ncpus to iterate cgroup setting Iterate over all cpus skipping inactive ones.	2016-01-27 17:27:54 +01:00
Andrea Bolognani	d87f0c0052	virnetdevopenvswitch: Don't call strlen() twice on the same string Commit `871e10f` fixed a memory corruption error, but called strlen() twice on the same string to do so. Even though the compiler is probably smart enough to optimize the second call away, having a single invocation makes the code slightly cleaner. Suggested-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-27 13:01:24 +01:00
Michal Privoznik	720bc953f8	virnetdevmacvlan: Provide stubs for build without macvtap In `370608b4c7` we have introduced two new internal APIs. However, there are no stubs for build without macvtap. Therefore build on systems lacking macvtap support (e.g. mingw or freebds) fails when trying to link. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2016-01-27 10:07:46 +01:00
Jason J. Herne	871e10fc95	Fix libvirtd free() segfault when migrating guest with deleted open vswitch port libvirtd crashes on free()ing portData for an open vswitch port if that port was deleted. To reproduce: ovs-vsctl del-port vnet0 virsh migrate --live kvm1 qemu+ssh://dstHost/system Error message: libvirtd: * Error in `/usr/sbin/libvirtd': free(): invalid pointer: 0x000003ff90001e20 * The problem is that virCommandRun can return an empty string in the event that the port being queried does not exist. When this happens then we are unconditionally overwriting a newline character at position strlen()-1. When strlen is 0, we overwrite memory that does not belong to the string. The fix: Only overwrite the newline if the string is not empty. Reviewed-by: Bjoern Walk <bwalk@linux.vnet.ibm.com> Signed-off-by: Jason J. Herne <jjherne@linux.vnet.ibm.com>	2016-01-27 10:01:58 +01:00
Laine Stump	370608b4c7	util: keep/use a bitmap of in-use macvtap devices This patch creates two bitmaps, one for macvlan device names and one for macvtap. The bitmap position is used to indicate that libvirt is currently using a device with the name macvtap%d/macvlan%d, where %d is the position in the bitmap. When requested to create a new macvtap/macvlan device, libvirt will now look for the first clear bit in the appropriate bitmap and derive the device name from that rather than just starting at 0 and counting up until one works. When libvirtd is restarted, the qemu driver code that reattaches to active domains calls the appropriate function to "re-reserve" the device names as it is scanning the status of running domains. Note that it may seem strange that the retry counter now starts at 8191 instead of 5. This is because we now don't do a "pre-check" for the existence of a device once we've reserved it in the bitmap - we move straight to creating it; although very unlikely, it's possible that someone has a running system where they have a large number of network devices created outside libvirt named "macvtap%d" or "macvlan%d" - such a setup would still allow creating more devices with the old code, while a low retry max in the new code would cause a failure. Since the objective of the retry max is just to prevent an infinite loop, and it's highly unlikely to do more than 1 iteration anyway, having a high max is a reasonable concession in order to prevent lots of new failures.	2016-01-26 12:20:04 -05:00
Leno Hou	8c70d04bab	util: increase libnl buffer size In the following cases nl_recv() was returning the error "No buffer space available": * When switching CPUs to offline/online in a system more than 128 cpus * When using virsh to destroy domain in a system with many interfaces This patch sets the buffer size for all netlink sockets created by libnl to 128K and turns on message peeking for nl_recv(). This eliminates the "No buffer space available" errors seen in the cases above, and also preempts other future errors the smaller buffers could have caused. Signed-off-by: Leno Hou <houqy@linux.vnet.ibm.com> Signed-off-by: Laine Stump <laine@laine.org>	2016-01-26 12:20:04 -05:00

1 2 3 4 5 ...

15821 Commits