libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2025-03-07 17:28:15 +00:00

Author	SHA1	Message	Date
Pavel Hrdina	c359cb9aee	vircgroup: workaround devices in hybrid mode So the issue here is that you can end up with configuration where you have cgroup v1 and v2 enabled at the same time and the devices controllers is enabled for cgroup v1. In cgroup v2 there is no devices controller, the device access is controlled using BPF and since it is not a cgroup controller both of them can exists at the same time and both of them are applied while resolving access to devices. In order to avoid configuring both BPF and cgroup v1 devices we will use BPF if possible and otherwise fallback to cgroup v1 devices. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:43 +01:00
Pavel Hrdina	884479b42b	vircgroup: introduce virCgroupV2DenyAllDevices If we want to deny all devices we just need to replace any existing program with new program with empty map. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:42 +01:00
Pavel Hrdina	285aefb31c	vircgroup: introduce virCgroupV2AllowAllDevices If we want to allow all devices with all permissions we need to replace any existing program that has any rule configured, otherwise we just need to add new rule which will for example allow read access to all devices. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:41 +01:00
Pavel Hrdina	d5b09ce5d9	vircgroup: introduce virCgroupV2DenyDevice In order to deny device we need to check if there is any entry in BPF map and we need to load the current value from map if there is already entry for that device. If both values are same we can remove that entry but if they are different we need to update the entry because we don't have to deny all access, but for example only write access. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:40 +01:00
Pavel Hrdina	5d49651912	vircgroup: introduce virCgroupV2AllowDevice In order to allow device we need to create key and value which will be used to update BPF map. virBPFUpdateElem() can override existing entries in BPF map so we need to check if that entry exists in order to track number of entries in our map. This can add rule for specific device but major and minor can be both -1 which follows the same behavior as in cgroup v1. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:39 +01:00
Pavel Hrdina	b18b0ce609	vircgroup: introduce virCgroupV2DevicesGetKey Device rules are stored in BPF map that is a hash type, this function will create a key based on major and minor id of device. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:38 +01:00
Pavel Hrdina	63cfe7b84d	vircgroup: introduce virCgroupV2DeviceGetPerms Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:37 +01:00
Pavel Hrdina	6a24bd75ed	vircgroup: introduce virCgroupV2DevicesRemoveProg We need to close our FD that we have for BPF program and map in order to let kernel remove all resources once the cgroup is removed as well. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:34 +01:00
Pavel Hrdina	ef747499a5	vircgroup: introduce virCgroupV2DevicesPrepareProg This function will be called for every virCgroup(Allow\|Deny)* API in order to prepare BPF program for guest. Since libvirtd can be restarted at any point we will first try to detect existing progam, if there is none we will create a new empty BPF program and lastly if we don't have any space left in the existing BPF map we will create a new copy of the BPF map with more space and attach a new program with that map into the guest cgroup. This solution allows us to start with reasonably small BPF map consuming only small amount of memory and if needed we can easily extend the BPF map if there is a lot of host devices used in guest or if user wants to hot-plug a lot of devices once the guest is running. Since there is no way how to reallocate existing BPF map we need to create a new copy if we run out of space in current BPF map. This overcomes all the limitations in BPF: - map used in program has to be created before the program is loaded into kernel - once map is created you cannot change its size - you cannot replace map in existing program - you cannot use an array of maps because it can store FD to maps of one specific size so we would not be able to use it to overcome the second issue Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:33 +01:00
Pavel Hrdina	afa2788662	vircgroup: introduce virCgroupV2DevicesCreateProg This function creates new BPF program with new empty BPF map with the default size and attaches it to the guest cgroup. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:32 +01:00
Pavel Hrdina	ce11a5c59f	vircgroup: introduce virCgroupV2DevicesDetectProg This function will be called if libvirtd was restarted while some domains were running. It will try to detect existing programs attached to the guest cgroup. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:31 +01:00
Pavel Hrdina	48423a0b5d	vircgroup: introduce virCgroupV2DevicesAttachProg This function loads the BPF prog with prepared map into kernel and attaches it into guest cgroup. It can be also used to replace existing program in the cgroup if we need to resize BPF map to store more rules for devices. The old program will be closed and removed from kernel. There are two possible ways how to create BPF program: - One way is to write simple C-like code which can by compiled into BPF object file which can be loaded into kernel using elfutils. - The second way is to define macros which look like assembler instructions and can be used directly to create BPF program that can be directly loaded into kernel. Since the program is not too complex we can use the second option. If there is no program, all devices are allowed, if there is some program it is executed and based on the exit status the access is denied for 0 and allowed for 1. Our program will follow these rules: - first it will try to look for the specific key using major and minor to see if there is any rule for that specific device - if there is no specific rule it will try to look for any rule that matches only major of the device - if there is no match with major it will try the same but with minor of the device - as the last attempt it will try to look for rule for all devices and if there is no match it will return 0 to deny that access Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:05 +01:00
Pavel Hrdina	30b6ddc44c	vircgroup: introduce virCgroupV2DevicesAvailable There is no exact way how to figure out whether BPF devices support is compiled into kernel. One way is to check kernel configure options but this is not reliable as it may not be available. Let's try to do syscall to which will list BPF cgroup device programs. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:04 +01:00
Pavel Hrdina	07946d6e39	util: introduce virbpf helpers In order to implement devices controller with cgroup v2 we need to add support for BPF programs, cgroup v2 doesn't have devices controller. This introduces required helpers wrapping linux syscalls. Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-15 12:58:00 +01:00
Ján Tomko	4ac4773040	Use g_mkstemp_full instead of mkostemp(s) With g_mkstemp_full, there is no need to distinguish between mkostemp and mkostemps (no suffix vs. a suffix of a fixed length), because the GLib function looks for the XXXXXX pattern everywhere in the string. Use S_IRUSR \| S_IWUSR for the permissions and do not pass O_RDWR in flags since it's implied. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 19:02:31 +01:00
Ján Tomko	b96e0dbba9	util: use GRegex in virStringMatch Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 17:45:40 +01:00
Ján Tomko	9c76dd3a2e	util: use GRegex in virStringSearch Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 17:45:40 +01:00
Ján Tomko	514b2b272b	util: use GRegex for virLogRegex Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 17:45:40 +01:00
Ján Tomko	039d26fcb0	util: use GRegex in virCommandRunRegex This saves us from allocating vars upfront, since GLib deals with that for us. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 17:45:40 +01:00
Ján Tomko	5c89468ff2	remove unused regex.h includes The code using regexes got moved, but the include stayed. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 17:45:40 +01:00
Michal Privoznik	1b0de07f41	virhostuptime: Wrap virHostGetBootTimeProcfs() call in an ifdef The virHostGetBootTimeProcfs() function is defined only for Linux and therefore it's only call should also be done if we're on Linux. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-11-14 16:42:51 +01:00
Peter Krempa	38bc2e8c1b	util: file: Replace use of 'strsep' with virStringSplit Use our helper instead of the gnulib one. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 15:50:43 +01:00
Peter Krempa	4bb2c51a9c	util: file: Use more obvious logic in virFindFileInPath Make it more obvious that the function will return NULL if the file is not executable and stop reusing variables. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 15:50:43 +01:00
Peter Krempa	6eac0c5436	util: file: Use g_autofree in virFindFileInPath Simplify the final lookup loop by freeing memory automatically and thus being able to directly return the result. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 15:50:43 +01:00
Michal Privoznik	070d6969fe	virhostuptime: Add linux stub for musl When we want to know the boot timestamp of the host, we can call virHostGetBootTime(). Under the hood, it uses getutxid() which is defined by POSIX and properly check for in configure. However, musl took a path where it declares the function but instead of providing any useful implementation it returns NULL meaning "no record found". If that's the case, use our second best option - /proc/uptime and a bit of maths. https://bugzilla.redhat.com/show_bug.cgi?id=1760885 Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Cole Robinson <crobinso@redhat.com>	2019-11-14 15:20:38 +01:00
Ján Tomko	ec07893a5f	util: use g_vsnprintf Instead of vsnprintf from gnulib, use g_vsnprintf from GLib. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com> Reviewed-by: Peter Krempa <pkrempa@redhat.com>	2019-11-14 14:06:49 +01:00
Peter Krempa	5ff6eb5dc7	util: pidfile: Replace 'areadlink' by 'g_file_read_link' Use the glib function rather than gnulib. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 12:42:10 +01:00
Peter Krempa	f95ef9248a	util: pidfile: Sanitize return values of virPidFileReadPathIfAlive The callers don't actually use the returned errno for reporting errors. Additionally virFileResolveAllLinks returns -1 rather than -errno on error thus you'd get a spurious EPERM even on other errors. Don't try to return errno in this case. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 12:42:10 +01:00
Peter Krempa	b13e45911d	util: pidfile: Sanitize return values of virPidFileReadIfAlive Return -1 on failure rather than -errno since none of the callers actually cares about the return value. This specifically fixes returns of -ENOMEM in cases of bad usage, which would report wrong error anyways. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 12:42:09 +01:00
Peter Krempa	1900936fe6	util: file: Remove virFileReadLink The function is unused so we can remove it. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-14 12:42:09 +01:00
Ján Tomko	fa061c92ec	Remove VIR_STRNDUP usage that subtracts from a non-NULL pointer Use g_strndup in all the cases where we check upfront whether a pointer is non-NULL and then use it to calculate the copied length. Signed-off-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-11-13 17:01:38 +01:00
Peter Krempa	48b68470c8	util: buffer: remove virBufferSetChildIndent Promote usage of separate buffers for separate formatting passes by removing the now unused virBufferSetChildIndent. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-13 09:10:30 +01:00
Peter Krempa	186e247b4c	util: sysinfo: Convert child buffers to use VIR_BUFFER_INIT_CHILD Use the new helper to initialize child XML element buffers. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-13 09:10:30 +01:00
Peter Krempa	15dc77082d	util: buffer: Add init macro for automatically setting child XML indent Add a new macro which initializes a virBuffer on the stack and also sets the indent level to be used for child XML element formatting. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-13 09:10:29 +01:00
Daniel Henrique Barboza	93af79fba3	util: remove unneeded cleanup labels Signed-off-by: Daniel Henrique Barboza <danielhb413@gmail.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-11-12 17:54:01 +01:00
Michal Privoznik	33ed622106	Drop virVasprintf() Now that function is no longer used, it can be dropped. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-11-12 16:15:59 +01:00
Michal Privoznik	a067916975	virstring: Drop virVasprintfQuiet() This macro is no longer used and therefore can be dropped. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-11-12 16:15:59 +01:00
Michal Privoznik	dd98a6edb9	Drop virAsprintf() Now that function is no longer used, it can be dropped. Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-11-12 16:15:59 +01:00
Michal Privoznik	a028630620	virstring: Drop virAsprintfQuiet() This macro is no longer used and therefore can be dropped. Signed-off-by: Michal Privoznik <mprivozn@redhat.com>	2019-11-12 16:15:59 +01:00
Michal Privoznik	8eaa708991	Use g_strdup_vprintf() instead of virVasprintf() everywhere Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-11-12 16:15:59 +01:00
Michal Privoznik	91d88aaf23	util: Use g_strdup_printf() instead of virAsprintf() Signed-off-by: Michal Privoznik <mprivozn@redhat.com> Reviewed-by: Daniel Henrique Barboza <danielhb413@gmail.com>	2019-11-12 16:15:58 +01:00
Peter Krempa	e39d3424e3	util: pci: Remove always-false condition Commit d19c21429fd modified the condition so that it checks whether the value is more than 0xFFFFFFFF. Since addr->domain is an unsigned int, it will never be more than that. Remove the whole check src/util/virpci.c:1291:22: error: result of comparison 'unsigned int' > 4294967295 is always false [-Werror,-Wtautological-type-limit-compare] if (addr->domain > 0xFFFFFFFF) { ~~~~~~~~~~~~ ^ ~~~~~~~~~~ Signed-off-by: Peter Krempa <pkrempa@redhat.com>	2019-11-11 16:48:26 +01:00
Pavel Hrdina	0985a9597b	src: stop distributing generated source files Signed-off-by: Pavel Hrdina <phrdina@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com> Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>	2019-11-08 17:07:57 +01:00
Wang Yechao	ebd004299a	util: Set SIGPIPE to a no-op handler in virFork Libvirtd has set SIGPIPE to ignored, and virFork resets all signal handlers to the defaults. But child process may write logs to stderr/stdout, that may generate SIGPIPE if journald has stopped. So set SIGPIPE to a dummy no-op handler before unmask signals in virFork(), and the handler will get reset to SIG_DFL when execve() runs. Now we can delete sigaction() call entirely in virExec(). Reviewed-by: Daniel P. Berrangé <berrange@redhat.com> Signed-off-by: Wang Yechao <wang.yechao255@zte.com.cn>	2019-11-08 10:53:30 +00:00
Laine Stump	13ec827052	util: set bridge device MAC address explicitly during virNetDevBridgeCreate When libvirt first implemented a stable and configurable MAC address for the bridges created for libvirt virtual networks (commit 5754dbd56d, in libvirt v0.8.8) most distro stable releases didn't support explicitly setting the MAC address of a bridge; the bridge just always assumed the lowest numbered MAC of all attached interfaces. Because of this, we stabilized the bridge MAC address by creating a "dummy" tap interface with a MAC address guaranteed to be lower than any of the guest tap devices' MACs (which all started with 0xFE, so it's not difficult to do) and attached it to the bridge - this was the inception of the "virbr0-nic" device that has confused so many people over the years. Even though the linux kernel had recently gained support for explicitly setting a bridge MAC, we deemed it unnecessary to set the MAC that way, because the other (indirect) method worked everywhere. But recently there have been reports that the bridge MAC address was not following the setting in the network config, and mismatched the MAC of the dummy tap device (which was still correct). It turns out that this is due to a change in systemd-242 that persists whatever MAC address is set for a bridge when it's initially started. According to the systemd NEWS file entry for version 242 (https://github.com/systemd/systemd/blob/master/NEWS): "if a bridge interface is created without any slaves, and gains a slave later, then now the bridge does not inherit slave's MAC." This change was the result of: https://github.com/systemd/systemd/issues/3374 (apparently if there is no MAC saved for a bridge by the name of a bridge being created, the random MAC generated during creation is saved, and then that same MAC is used to explicitly set the MAC each time it is created). Once a bridge has an explicitly set MAC, the "use the lowest numbered MAC of attached devices" rule is ignored, so our dummy tap device is like the goggles - it does nothing! (well, almost). We could whine about changes in default behavior, etc. etc., but because the change was in response to actual user problems, that seems likely a fruitless task. Fortunately, time has marched on, and even distro releases that are old enough that they are no longer supported by upstream libvirt (e.g. RHEL6) have support for explicitly setting a bridge device MAC address, either during creation or with a separate ioctl after creation, so we can now do that. To enable explicitly setting the mac during bridge creation, we add a mac arg to virNetDevBridgeCreate(). In the case of platforms where the bridge is created with a netlink RTM_NEWLINK message, we just add that mac to the message. For platforms that still use an ioctl (either SIOCBRADDBR or SIOCIFCREATE2), we make a separate call to virNetDevSetMAC() after creating the bridge. (NB: I was unable to test the calling of virNetDevSetMAC() from the SIOCIFCREATE2 (BSD) version of virNetDevBridgeCreate(); even though I managed to get a FreeBSD system setup and libvirt built there, when I tried to start the default network the SIOCIFCREATE2 ioctl itself failed, so it never even got to the virNetDevSetMAC(). That leaves the FreeBSD implementation untested.) This makes the dummy tap pointless for purposes of setting the MAC address, but it is still useful for IPv6 DAD initialization (which apparently requires at least one interface to be attached to the bridge and online), as well as for setting an initial MTU for the bridge, so it hasn't been removed. (NB: we can safely always call virNetDevBridgeCreate() with &def->mac from the network driver because, in spite of the existence of a "mac_specified" bool in the config suggesting that it may not always be present, in reality a mac address will always be added to any network that doesn't have one - this is guaranteed in all cases by commit a47ae7c004) https://bugzilla.redhat.com/show_bug.cgi?id=1760851 Signed-off-by: Laine Stump <laine@redhat.com> Reviewed-by: Jiri Denemark <jdenemar@redhat.com>	2019-11-07 22:09:19 -05:00
Laine Stump	b596d6c106	util: allow sending mac addr to virNetNewLink without ifindex Although until now, any use of the extra_args argument (a pointer to a struct containing extra attributes to add the the RTM_NEWLINK message) would always have the ifindex and mac set, so the code could assume it was safe to add both to the message if extra_args != NULL. There is now a use for setting a MAC address in the RTM_NEWLINK without setting the ifindex, so we should check each of these separately. Signed-off-by: Laine Stump <laine@redhat.com> Reviewed-by: Jiri Denemark <jdenemar@redhat.com>	2019-11-07 22:08:55 -05:00
Peter Krempa	bf0e7bdeeb	util: xml: Make virXMLFormatElement void Now that we don't have to deal with errors of virBuffer we can also make this function void. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-10-24 19:35:34 +02:00
Peter Krempa	0967708b81	util: buffer: Remove virBufferCheckError The function now does not return an error so we can drop it fully. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-10-24 19:35:34 +02:00
Peter Krempa	205d6a2af7	util: buffer: Remove virBufferError The function now does not return an error so we can drop it fully. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-10-24 19:35:34 +02:00
Peter Krempa	418aa809fd	util: buffer: Remove error handling internals Now that there are no errors reported and tracked in virBuffer, remove all the internals which were used to track them. Signed-off-by: Peter Krempa <pkrempa@redhat.com> Reviewed-by: Ján Tomko <jtomko@redhat.com>	2019-10-24 19:35:34 +02:00

1 2 3 4 5 ...

4369 Commits