libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-11-02 19:31:18 +00:00

Author	SHA1	Message	Date
Laine Stump	9395894585	qemu: set qemu process' RLIMIT_MEMLOCK when VFIO is used VFIO requires all of the guest's memory and IO space to be lockable in RAM. The domain's max_balloon is the maximum amount of memory the domain can have (in KiB). We add a generous 1GiB to that for IO space (still much better than KVM device assignment, where the KVM module actually ignores the process limits and locks everything anyway), and convert from KiB to bytes. In the case of hotplug, we are changing the limit for the already existing qemu process (prlimit() is used under the hood), and for regular commandline additions of vfio devices, we schedule a call to setrlimit() that will happen after the qemu process is forked.	2013-04-26 10:23:46 -04:00
Laine Stump	7bdf459d2c	qemu: use new virCommandSetMax(Processes\|Files) These were previously being set in a custom hook function, but now that virCommand directly supports setting them, we can eliminate that part of the hook and call the APIs directly.	2013-04-26 10:23:46 -04:00
Laine Stump	776d49f492	util: new virCommandSetMax(MemLock\|Processes\|Files) This patch adds two sets of functions: 1) lower level virProcessSet() functions that will immediately set the RLIMIT_MEMLOCK. RLIMIT_NPROC, or RLIMIT_NOFILE of either the current process (using setrlimit()) or any other process (using prlimit()). "current process" is indicated by passing a 0 for pid. 2) functions for virCommand that will setup a virCommand object to set those limits at a later time just after it has forked a new process, but before it execs the new program. configure.ac has prlimit and setrlimit added to the list of functions to check for, and the low level functions log an "unsupported" error) on platforms that don't support those functions.	2013-04-26 10:23:46 -04:00
Daniel P. Berrange	f3662737b1	Do proper escaping of cgroup resource partitions If a user cgroup name begins with "cgroup.", "_" or with any of the controllers from /proc/cgroups followed by a dot, then they need to be prefixed with a single underscore. eg if there is an object "cpu.service", then this would end up as "_cpu.service" in the cgroup filesystem tree, however, "waldo.service" would stay "waldo.service", at least as long as nobody comes up with a cgroup controller called "waldo". Since we require a '.XXXX' suffix on all partitions, there is no scope for clashing with the kernel 'tasks' and 'release_agent' files. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-26 13:52:02 +01:00
Daniel P. Berrange	9ddfe7eea6	Ensure all cgroup partitions have a suffix of ".partition" If the partition named passed in the XML does not already have a suffix, ensure it gets a '.partition' added to each component. The exceptions are /machine, /user and /system which do not need to have a suffix, since they are fixed partitions at the top level. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-26 13:52:02 +01:00
Daniel P. Berrange	824e86e723	Change VM cgroup suffix from '{lxc,qemu}.libvirt' to 'libvirt-{lxc,qemu}' Recently we changed to create VM cgroups with the naming pattern $VMNAME.$DRIVER.libvirt. Following discussions with the systemd community it was decided that only having a single '.' in the names is preferrable. So this changes the naming scheme to be $VMNAME.libvirt-$DRIVER. eg for LXC 'mycontainer.libvirt-lxc' or for KVM 'myvm.libvirt-qemu'. Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-26 13:52:02 +01:00
Stefan Berger	7e77f252bd	test: Add JSON test for query-tpm-types Add a test case for query-tpm-models QMP command. Signed-off-by: Stefan Berger <stefanb@linux.vnet.ibm.com>	2013-04-26 08:33:44 -04:00
Eric Blake	117dc4cc8a	virsh: suppress aliases in group help 'virsh help \| grep nodedev-det' shows only nodedev-detach, but 'virsh help nodedev \| grep nodedev-det' also shows the old alias nodedev-dettach that we intentionally hid in commit `af3f9aab`. See also commit `787f4fe` and this bug report: https://bugzilla.redhat.com/show_bug.cgi?id=956966 * tools/virsh.c (vshCmdGrpHelp): Copy suppression of vshCmdHelp. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-04-26 05:45:11 -06:00
Laine Stump	f0bd70a940	security: update hostdev labelling functions for VFIO Legacy kvm style pci device assignment requires changes to the labelling of several sysfs files for each device, but for vfio device assignment, the only thing that needs to be relabelled/chowned is the "group" device for the group that contains the device to be assigned.	2013-04-25 21:28:43 -04:00
Laine Stump	b210208f97	util: new function virPCIDeviceGetVFIOGroupDev Given a virPCIDevice, this function returns the path for the device that controls the vfio group the device belongs to, e.g. "/dev/vfio/15".	2013-04-25 21:28:43 -04:00
Laine Stump	d923f6c882	virsh: use new virNodeDeviceDetachFlags The virsh nodedev-detach command has a new --driver option. If it's given virsh will attempt to use the new virNodeDeviceDetachFlags API instead of virNodeDeviceDettach. Validation of the driver name string is left to the hypervisor (qemu accepts "kvm" or "vfio". The only other hypervisor that implements these functions is xen, and it only accepts NULL).	2013-04-25 21:28:43 -04:00
Laine Stump	cad14a52ca	xen: implement virNodeDeviceDetachFlags backend This was the only hypervisor driver other than qemu that implemented virNodeDeviceDettach. It doesn't currently support multiple pci device assignment driver backends, but it is simple to plug in this new API, which will make it easier for Xen people to fill it in later when they decide to support VFIO (or whatever other) device assignment. Also it means that management applications will have the same API available to them for both hypervisors on any given version of libvirt. The only acceptable value for driverName in this case is NULL, since there is no alternate, and I'm not willing to pick a name for the default driver used by Xen.	2013-04-25 21:28:43 -04:00
Laine Stump	eaff16113a	qemu: implement virNodeDeviceDetachFlags backend The differences from virNodeDeviceDettach are very minor: 1) Check that the flags are 0. 2) Set the virPCIDevice's stubDriver according to the driverName that is passed in. 3) Call virPCIDeviceDetach with a NULL stubDriver, indicating it should get the name of the stub driver from the virPCIDevice object.	2013-04-25 21:28:10 -04:00
Laine Stump	cc875b8372	hypervisor api: implement RPC calls for virNodeDeviceDetachFlags This requires a custom function for remoteNodeDeviceDetachFlags, because it is named *NodeDevice, but it goes through the hypervisor driver rather than nodedevice driver, and so it uses privateData instead of nodeDevicePrivateData. (It has to go through the hypervisor driver, because that is the driver that knows about the backend drivers that will perform the pci device assignment).	2013-04-25 21:28:10 -04:00
Laine Stump	353941961a	hypervisor api: new virNodeDeviceDetachFlags The existing virNodeDeviceDettach() assumes that there is only a single PCI device assignment backend driver appropriate for any hypervisor. This is no longer true, as the qemu driver is getting support for PCI device assignment via VFIO. The new API virNodeDeviceDetachFlags adds a driverName arg that should be set to the exact same string set in a domain <hostdev>'s <driver name='x'/> element (i.e. "vfio", "kvm", or NULL for default). It also adds a flags arg for good measure (and because it's possible we may need it when we start dealing with VFIO's "device groups").	2013-04-25 21:28:10 -04:00
Laine Stump	cc0a918872	qemu: bind/unbind stub driver according to config <driver name='x'/> If the config for a device has specified <driver name='vfio'/>, "backend" in the pci part of the hostdev object will be set to ..._VFIO. In this case, when creating a virPCIDevice set the stubDriver to "vfio-pci", otherwise set it to "pci-stub". We will rely on the lower levels to report an error if the vfio driver isn't loaded. The detach/attach functions in virpci.c will pay attention to the stubDriver setting in the device, and bind/unbind the appropriate driver when preparing hostdevs for the domain. Note that we don't yet attempt to do anything to mark active any other devices in the same vfio "group" as a single device that is being marked active. We do need to do that, but in order to get basic VFIO functionality testing sooner rather than later, initially we'll just live with more cryptic errors when someone tries to do that.	2013-04-25 21:28:10 -04:00
Laine Stump	be64199e17	pci: keep a stubDriver in each virPCIDevice This can be set when the virPCIDevice is created and placed on a list, then used later when traversing the list to determine which stub driver to bind/unbind for managed devices. The existing Detach and Attach functions' signatures haven't been changed (they still accept a stub driver name in the arg list), but if the arg list has NULL for stub driver and one is available in the device's object, that will be used. (we may later deprecate and remove the arg from those functions).	2013-04-25 21:28:10 -04:00
Laine Stump	731b0f36f1	qemu: use vfio-pci on commandline when appropriate The device option for vfio-pci is nearly identical to that for pci-assign - only the configfd parameter isn't supported (or needed). Checking for presence of the bootindex parameter is done separately from constructing the commandline, similar to how it is done for pci-assign. This patch contains tests to check for proper commandline construction. It also includes tests for parser-formatter-parser roundtrips (xml2xml), because those tests use the same data files, and would have failed had they been included before now. qemu: xml/args tests for VFIO hostdev and <interface type='hostdev'/> These should be squashed in with the patch that adds commandline handling of vfio (they would fail at any earlier time).	2013-04-25 21:28:10 -04:00
Laine Stump	c4f63ef080	conf: formatter/parser/RNG/docs for hostdev <driver name='kvm\|vfio'/> A domain's <interface> or <hostdev>, as well as a <network>'s <forward>, can now have an optional <driver name='kvm\|vfio'/> element. As of this patch, there is no functionality behind this new knob - this patch adds support to the domain and network formatter/parser, and to the RNG and documentation. When the backend is added, legacy KVM PCI device assignment will continue to be used when no driver name is specified (or if <driver name='kvm'/> is specified), but if driver name is 'vfio', the new UEFI Secure Boot compatible VFIO device assignment will be used. Note that the parser doesn't automatically insert the current default value of this setting. This is done on purpose because the two possibilities are functionally equivalent from the guest's point of view, and we want to be able to automatically start using vfio as the default (even for existing domains) at some time in the future. This is similar to what was done with the "vhost" driver option in <interface>.	2013-04-25 21:23:38 -04:00
Laine Stump	9f80fc1bd5	conf: put hostdev pci address in a struct There will soon be other items related to pci hostdevs that need to be in the same part of the hostdevsubsys union as the pci address (which is currently a single member called "pci". This patch replaces the single member named pci with a struct named pci that contains a single member named "addr".	2013-04-25 21:23:38 -04:00
Laine Stump	5b90ef0847	qemu: detect vfio-pci device and its bootindex parameter QEMU_CAPS_DEVICE_VFIO_PCI is set if the device named "vfio-pci" is supported in the qemu binary. QEMU_CAPS_VFIO_PCI_BOOTINDEX is set if the vfio-pci device supports the "bootindex" parameter; for some reason, the bootindex parameter wasn't included in early versions of vfio support (qemu 1.4) so we have to check for it separately from vfio itself.	2013-04-25 21:23:38 -04:00
Eric Blake	1fbf190554	build: avoid unsafe functions in libgen.h POSIX says that both basename() and dirname() may return static storage (aka they need not be thread-safe); and that they may but not must modify their input argument. Furthermore, <libgen.h> is not available on all platforms. For these reasons, you should never use these functions in a multi-threaded library. Gnulib instead recommends a way to avoid the portability nightmare: gnulib's "dirname.h" provides useful thread-safe counterparts. The obvious dir_name() and base_name() are GPL (because they malloc(), but call exit() on failure) so we can't use them; but the LGPL variants mdir_name() (malloc's or returns NULL) and last_component (always points into the incoming string without modifying it, differing from basename semantics only on corner cases like the empty string that we shouldn't be hitting in the first place) are already in use in libvirt. This finishes the swap over to the safe functions. * cfg.mk (sc_prohibit_libgen): New rule. * src/util/vircgroup.c: Fix offenders. * src/parallels/parallels_storage.c (parallelsPoolAddByDomain): Likewise. * src/parallels/parallels_network.c (parallelsGetBridgedNetInfo): Likewise. * src/node_device/node_device_udev.c (udevProcessSCSIHost) (udevProcessSCSIDevice): Likewise. * src/storage/storage_backend_disk.c (virStorageBackendDiskDeleteVol): Likewise. * src/util/virpci.c (virPCIGetDeviceAddressFromSysfsLink): Likewise. * src/util/virstoragefile.h (_virStorageFileMetadata): Avoid false positive. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-04-25 14:47:01 -06:00
Christophe Fergeau	09c9395a59	Fix VIR_DOMAIN_EVENT_ID_PMSUSPEND capitalization in API doc It was written VIR_DOMAIN_EVENT_ID_PMSuspend	2013-04-25 21:57:19 +02:00
Christophe Fergeau	cc6d19f313	Improve /domainsnapshot/disks/disk@snapshot doc The previous description was a bit confusing.	2013-04-25 21:57:19 +02:00
Eric Blake	b121584f58	qemu: fix build error with older platforms Jim Fehlig reported on IRC that older gcc/glibc triggers this warning: cc1: warnings being treated as errors qemu/qemu_domain.c: In function 'qemuDomainDefFormatBuf': qemu/qemu_domain.c:1297: error: declaration of 'remove' shadows a global declaration [-Wshadow] /usr/include/stdio.h:157: error: shadowed declaration is here [-Wshadow] make[3]: *** [libvirt_driver_qemu_impl_la-qemu_domain.lo] Error 1 Fix it like we have done in the past (such as commit `2e6322a`). * src/qemu/qemu_domain.c (qemuDomainDefFormatBuf): Avoid shadowing a function name. Signed-off-by: Eric Blake <eblake@redhat.com>	2013-04-25 11:26:58 -06:00
Ján Tomko	caf659a868	docs: fix memballoon examples Use a pair of 'memballoon' tags instead of single 'watchdog' one. Add a few missing colons.	2013-04-25 18:56:33 +02:00
Ján Tomko	2bbbf0beb8	conf: reject controllers with duplicate indexes Reject multiple controllers with the same index, except for USB controllers. Multi-function USB controllers can have the same index.	2013-04-25 17:54:32 +02:00
Ján Tomko	5c9cffea23	qemu: auto-add pci-root to 'pc-i440*' machines too Commit `b33eb0d` missed this machine type.	2013-04-25 17:29:27 +02:00
Bamvor Jian Zhang	0aeae5df5e	fix typo introduced by `90430791` Signed-off-by: Bamvor Jian Zhang <bjzhang@suse.com>	2013-04-25 08:21:13 -06:00
Martin Kletzander	a553dbedb8	Fix usb master startport parsing When all usb controllers connected to the same bus have <master startport='x'/> specified, none of them have 'id=usb' assigned and thus qemu fails due to invalid masterport specification (we use 'usb' for that purpose). Adding a check that at least one of the controllers is specified without <master startport='x'/> and in case this happens, error out due to invalid configuration.	2013-04-25 14:54:46 +02:00
Michal Privoznik	01d5a97210	qemu_command.c: Fix whitespacing within for() After `9d6e56db` the syntax-check was unhappy due to wrong whitespacing: src/qemu/qemu_command.c:1637: for ( ; a.slot < QEMU_PCI_ADDRESS_SLOT_LAST; a.slot++) { maint.mk: incorrect whitespace around brackets, see HACKING for rules make: *** [bracket-spacing-check] Error 1	2013-04-25 13:52:49 +02:00
Michal Privoznik	6ddbabf938	qemu_conf: Don't discard strdup OOM error After `78d7c3c5` we are strdup()-ing path to qemu-bridge-helper. However, the check for its return value is missing. So it is possible we've ignored the OOM error silently.	2013-04-25 13:45:37 +02:00
Ján Tomko	9d6e56dbce	qemu: auto-add bridges and allow using them Add a "dry run" address allocation to figure out how many bridges will be needed for all the devices without explicit addresses. Auto-add just enough bridges to put all the devices on, or up to the bridge with the largest specified index.	2013-04-25 13:19:40 +02:00
Ján Tomko	b33eb0dca1	qemu: auto-add pci-root controller for pc machine types <controller type='pci' index='0' model='pci-root'/> is auto-added to pc* machine types. Without this controller PCI bus 0 is not available and no PCI addresses are assigned by default. Since older libvirt supported PCI bus 0 even without this controller, it is removed from the XML when migrating.	2013-04-25 13:05:10 +02:00
liguang	d350a34caf	qemu: build command line for pci-bridge device Signed-off-by: Ján Tomko <jtomko@redhat.com>	2013-04-25 12:54:59 +02:00
Ján Tomko	df0ebf6b38	conf: add PCI controllers Add new controller type 'pci' with models 'pci-root' and 'pci-bridge'.	2013-04-25 12:54:38 +02:00
Ján Tomko	024e9af3e5	qemu: call post-parse callbacks when parsing command line too Now we set the default disk driver name when parsing the qemu command line too, hence all the test changes. Assume format type is 'auto' when none is specified on qemu command line.	2013-04-25 12:10:22 +02:00
Osier Yang	48f43940e9	qemu: Fix the indention Pushed under trivial rule.	2013-04-25 17:13:33 +08:00
Li Zhang	dfd0e4f7f2	qemu: Add command line builder and parser for NVRAM. This patch is to add command line builder and parser for NVRAM device, and add test cases. Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2013-04-25 16:50:45 +08:00
Li Zhang	bf1888738b	Add NVRAM device For pSeries guest in QEMU, NVRAM is one kind of spapr-vio device. Users are allowed to specify spapr-vio devices'address. But NVRAM is not supported in libvirt. So this patch is to add NVRAM device to allow users to specify its address. In QEMU, NVRAM device's address is specified by "-global spapr-nvram.reg=xxxxx". In libvirt, XML file is defined as the following: <nvram> <address type='spapr-vio' reg='0x3000'/> </nvram> Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>	2013-04-25 16:50:11 +08:00
Michal Privoznik	19c345f2fe	qemuBuildCommandLine: Don't overwrite errors with NWFilter's one Currently, if there has been an error in building command line process after virtual interfaces has been created, the flow jumps to 'error' label, where virDomainConfNWFilterTeardown() is called. This may report an error as well, but should not overwrite the original cause why we jumped to 'error' label.	2013-04-25 08:59:49 +02:00
Peter Krempa	278a833922	doc: Clarify usage of SELinux baselabel State what fields are used when generating SELinux labels from a baselabel.	2013-04-25 08:33:52 +02:00
Osier Yang	45d6c67143	Introduce a sub-element <driver> for controller Like what we did for "disk", "filesystem" and "interface", this introduces sub-element <driver> for "controller", and put the "queues" into it.	2013-04-25 12:10:58 +08:00
Wido den Hollander	e3e866aee0	qemu: Don't require a block or file when looking for an alias This for example prohibits you to use iotune for Ceph or Sheepdog devices. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2013-04-24 16:29:26 -06:00
Osier Yang	18b428980f	Change the tag name "num_queues" into "queues" Instead of making a choice between the underscore and camelCase, this simply changes "num_queues" into "queues", which is also consistent with Michal's multiple queue support for interface.	2013-04-24 23:36:07 +08:00
Osier Yang	148edcce66	util: Error out if the numa nodeset is out of range Instead of a silent warning, it's better to error out if the numa nodeset is out of range. Just like for numa node larger than NUMA_NUM_NODES.	2013-04-24 23:23:31 +08:00
Peter Krempa	4165ea9663	selinux: Don't mask errors of virSecuritySELinuxGenNewContext Since `cbe67ff9b0` virSecuritySELinuxGenNewContext reports good error messages. Fix callers that mask the errors by generic error message.	2013-04-24 16:03:35 +02:00
Peter Krempa	20cb7f3a41	qemu: Improve handling of channels when generating SPICE command line Improve error reporting and generating of SPICE command line arguments according to the need to enable TLS. If TLS is disabled, there's no need to pass the certificate dir to qemu. This patch resolves: https://bugzilla.redhat.com/show_bug.cgi?id=953126	2013-04-24 14:37:57 +02:00
Peter Krempa	7b4a630484	qemu: Do sensible auto allocation of SPICE port numbers With this patch, if the autoport attribute is used, the code will sensibly auto allocate the ports only if needed.	2013-04-24 14:37:20 +02:00
Daniel P. Berrange	90430791ae	Make driver method names consistent with public APIs Ensure that all drivers implementing public APIs use a naming convention for their implementation that matches the public API name. eg for the public API virDomainCreate make sure QEMU uses qemuDomainCreate and not qemuDomainStart Signed-off-by: Daniel P. Berrange <berrange@redhat.com>	2013-04-24 11:00:18 +01:00

... 3 4 5 6 7 ...

13136 Commits