libvirt

mirror of https://gitlab.com/libvirt/libvirt.git synced 2024-11-02 11:21:12 +00:00

Author	SHA1	Message	Date
Eric Blake	0d0b409863	cpustats: collect VM user and sys times As documented in linux.git/Documentation/cgroups/cpuacct.txt, cpuacct.stat returns user and system time in ticks (the same unit used in times(2)). It would be a bit nicer if it were like getrusage(2) and reported timeval contents, or like cpuacct.usage and in nanoseconds, but we can't be picky. * src/util/cgroup.h (virCgroupGetCpuacctStat): New function. * src/util/cgroup.c (virCgroupGetCpuacctStat): Implement it. (virCgroupGetValueStr): Allow for multi-line files. * src/libvirt_private.syms (cgroup.h): Export it.	2012-03-12 08:46:56 -06:00
KAMEZAWA Hiroyuki	44b0a53a7c	qemu driver for virDomainGetCPUstats using cpuacct cgroup. * For now, only "cpu_time" is supported. * cpuacct cgroup is used for providing percpu cputime information. * src/qemu/qemu.conf - take care of cpuacct cgroup. * src/qemu/qemu_conf.c - take care of cpuacct cgroup. * src/qemu/qemu_driver.c - added an interface * src/util/cgroup.c/h - added interface for getting percpu cputime Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> Signed-off-by: Lai Jiangshan <laijs@cn.fujitsu.com>	2012-03-06 21:54:48 -07:00
Eric Blake	3e2c3d8f6d	build: use correct type for pid and similar types No thanks to 64-bit windows, with 64-bit pid_t, we have to avoid constructs like 'int pid'. Our API in libvirt-qemu cannot be changed without breaking ABI; but then again, libvirt-qemu can only be used on systems that support UNIX sockets, which rules out Windows (even if qemu could be compiled there) - so for all points on the call chain that interact with this API decision, we require a different variable name to make it clear that we audited the use for safety. Adding a syntax-check rule only solves half the battle; anywhere that uses printf on a pid_t still needs to be converted, but that will be a separate patch. * cfg.mk (sc_correct_id_types): New syntax check. * src/libvirt-qemu.c (virDomainQemuAttach): Document why we didn't use pid_t for pid, and validate for overflow. * include/libvirt/libvirt-qemu.h (virDomainQemuAttach): Tweak name for syntax check. * src/vmware/vmware_conf.c (vmwareExtractPid): Likewise. * src/driver.h (virDrvDomainQemuAttach): Likewise. * tools/virsh.c (cmdQemuAttach): Likewise. * src/remote/qemu_protocol.x (qemu_domain_attach_args): Likewise. * src/qemu_protocol-structs (qemu_domain_attach_args): Likewise. * src/util/cgroup.c (virCgroupPidCode, virCgroupKillInternal): Likewise. * src/qemu/qemu_command.c(qemuParseProcFileStrings): Likewise. (qemuParseCommandLinePid): Use pid_t for pid. * daemon/libvirtd.c (daemonForkIntoBackground): Likewise. * src/conf/domain_conf.h (_virDomainObj): Likewise. * src/probes.d (rpc_socket_new): Likewise. * src/qemu/qemu_command.h (qemuParseCommandLinePid): Likewise. * src/qemu/qemu_driver.c (qemudGetProcessInfo, qemuDomainAttach): Likewise. * src/qemu/qemu_process.c (qemuProcessAttach): Likewise. * src/qemu/qemu_process.h (qemuProcessAttach): Likewise. * src/uml/uml_driver.c (umlGetProcessInfo): Likewise. * src/util/virnetdev.h (virNetDevSetNamespace): Likewise. * src/util/virnetdev.c (virNetDevSetNamespace): Likewise. * tests/testutils.c (virtTestCaptureProgramOutput): Likewise. * src/conf/storage_conf.h (_virStoragePerms): Use mode_t, uid_t, and gid_t rather than int. * src/security/security_dac.c (virSecurityDACSetOwnership): Likewise. * src/conf/storage_conf.c (virStorageDefParsePerms): Avoid compiler warning.	2012-03-02 06:57:43 -07:00
Eric Blake	19896423f7	hash: minor touchups On RHEL5, I got: util/virrandom.c:66: warning: nested extern declaration of '_gl_verify_function66' [-Wnested-externs] The fix is to hoist the verify earlier. Also some other hodge-podge fixes I noticed while reviewing Dan's recent series. * .gitignore: Ignore new test. * src/util/cgroup.c: Bump copyright year. * src/util/virhash.c: Fix typo in description. * src/util/virrandom.c (virRandomBits): Mark doc comment, and hoist assert to silence older gcc.	2012-01-26 15:27:10 -07:00
Daniel P. Berrange	72b4139700	Replace hashing algorithm with murmurhash Recent discussions have illustrated the potential for DOS attacks with the hash table implementations used by most languages and libraries. https://lwn.net/Articles/474912/ libvirt has an internal hash table impl, and uses hash tables for a variety of purposes. The hash key generation code is pretty simple and thus not strongly collision resistant. This patch replaces the current libvirt hash key generator with the (public domain) Murmurhash3 code. In addition every hash table now gets a random seed value which is used to perturb the hashing code. This should make it impossible to mount any practical attack against libvirt hashing code. * bootstrap.conf: Import bitrotate module * src/Makefile.am: Add virhashcode.[ch] * src/util/util.c: Make virRandom() return a fixed 32 bit integer value. * src/util/hash.c, src/util/hash.h, src/util/cgroup.c: Replace hash code generation with a call to virHashCodeGen() * src/util/virhashcode.h, src/util/virhashcode.c: Add a new virHashCodeGen() API using the Murmurhash3 algorithm.	2012-01-26 14:18:53 +00:00
Daniel P. Berrange	1d5c7a9fdf	Rename hash.h and hash.c to virhash.h and virhash.c In preparation for the patch to include Murmurhash3, which introduces a virhashcode.h and virhashcode.c files, rename the existing hash.h and hash.c to virhash.h and virhash.c respectively.	2012-01-26 14:11:13 +00:00
Daniel P. Berrange	9f2bf8fd03	Convert various virHash functions to use size_t / uint32 In preparation for conversion over to use the Murmurhash3 algorithm, convert various virHash APIs to use size_t or uint32 for their return values/parameters, instead of the variable size 'unsigned long' or 'int' types	2012-01-26 14:09:21 +00:00
Hu Tao	059425ae45	Add functions to set/get cgroup cpuset parameters	2011-12-20 09:13:36 -07:00
Hu Tao	93ab58595d	blkiotune: add qemu support for blkiotune.device_weight Implement setting/getting per-device blkio weights in qemu, using the cgroups blkio.weight_device tunable.	2011-11-29 12:26:21 -07:00
Jiri Denemark	54bf875aa6	lxc: Fix suspend/resume with freezer cgroup	2011-11-29 14:16:42 +01:00
Daniel P. Berrange	c32536e7da	Don't leak memory if a cgroup is mounted multiple times It is possible (expected/likely in Fedora 15) for a cgroup controller to be mounted in multiple locations at the same time, due to bind mounts. Currently we leak memory if this happens, because we overwrite the previous 'mountPoint' string. Instead just accept the first match we find. * src/util/cgroup.c: Only accept first match for a cgroup controller mount	2011-08-31 17:51:09 +01:00
Eric Blake	8e22e08935	build: rename files.h to virfile.h In preparation for a future patch adding new virFile APIs. * src/util/files.h, src/util/files.c: Move... * src/util/virfile.h, src/util/virfile.c: ...here, and rename functions to virFile prefix. Macro names are intentionally left alone. * .c: All '#include "files.h"' uses changed. src/Makefile.am (UTIL_SOURCES): Reflect rename. * cfg.mk (exclude_file_name_regexp--sc_prohibit_close): Likewise. * src/libvirt_private.syms: Likewise. * docs/hacking.html.in: Likewise. * HACKING: Regenerate.	2011-07-21 10:34:51 -06:00
Wen Congyang	fd7c172340	cgroup: Implement cpu.cfs_period_us and cpu.cfs_quota_us tuning API This patch provides 4 APIs to get and set cpu.cfs_period_us and cpu.cfs_quota_us.	2011-07-21 17:11:12 +08:00
Wen Congyang	8e64f87306	Introduce the function virCgroupForVcpu Introduce the function virCgroupForVcpu() to create sub directory for each vcpu.	2011-07-21 17:11:12 +08:00
Eric Blake	3f81f8e4c1	cgroup: silence coverity warning Coverity noted that most clients reacted to failure to hash; but in a best-effort kill loop, we can ignore failure. * src/util/cgroup.c (virCgroupKillInternal): Ignore hash failure.	2011-07-04 10:28:27 +08:00
Lai Jiangshan	b65f37a4a1	libvirt,logging: cleanup VIR_XXX0() These VIR_XXXX0 APIs make us confused, use the non-0-suffix APIs instead. How do these coversions works? The magic is using the gcc extension of ##. When __VA_ARGS__ is empty, "##" will swallow the "," in "fmt," to avoid compile error. example: origin after CPP high_level_api("%d", a_int) low_level_api("%d", a_int) high_level_api("a string") low_level_api("a string") About 400 conversions. 8 special conversions: VIR_XXXX0("") -> VIR_XXXX("msg") (avoid empty format) 2 conversions VIR_XXXX0(string_literal_with_%) -> VIR_XXXX(%->%%) 0 conversions VIR_XXXX0(non_string_literal) -> VIR_XXXX("%s", non_string_literal) (for security) 6 conversions Signed-off-by: Lai Jiangshan <laijs@cn.fujitsu.com>	2011-05-11 12:41:14 -06:00
Eric Blake	ead2b43357	cgroup: avoid leaking a file Clang detected a dead store to rc. It turns out that in fixing this, I also found a FILE* leak. This is a subtle change in behavior, although unlikely to hit. The pidfile is a kernel file, so we've probably got more serious problems under foot if we fail to parse one. However, the previous behavior was that even if one pid file failed to parse, we tried others, whereas now we give up on the first failure. Either way, though, the function returns -1, so the caller will know that something is going wrong, and that not all pids were necessarily reaped. Besides, there were other instances already in the code where failure in the inner loop aborted the outer loop. * src/util/cgroup.c (virCgroupKillInternal): Abort rather than resuming loop on fscanf failure, and cleanup file on error.	2011-05-04 08:38:27 -06:00
Hu Tao	ae5155768f	Don't return an error on failure to create blkio controller This patch enables cgroup controllers as much as possible by skipping the creation of blkio controller when running with old kernels that doesn't support multi-level directory for blkio controller. Signed-off-by: Hu Tao <hutao@cn.fujitsu.com> Signed-off-by: Eric Blake <eblake@redhat.com>	2011-03-18 16:59:03 -06:00
Nikunj A. Dadhania	78ba748ef1	virsh: fix memtune's help message for swap_hard_limit * Correct the documentation for cgroup: the swap_hard_limit indicates mem+swap_hard_limit. * Change cgroup private apis to: virCgroupGet/SetMemSwapHardLimit Signed-off-by: Nikunj A. Dadhania <nikunj@linux.vnet.ibm.com>	2011-03-17 16:45:06 -06:00
Eric Blake	5564c57528	cgroup: allow fine-tuning of device ACL permissions Adding audit points showed that we were granting too much privilege to qemu; it should not need any mknod rights to recreate any devices. On the other hand, lxc should have all device privileges. The solution is adding a flag parameter. This also lets us restrict write access to read-only disks. * src/util/cgroup.h (virCgroupDevice): Adjust prototypes. * src/util/cgroup.c (virCgroupAllowDevice) (virCgroupAllowDeviceMajor, virCgroupAllowDevicePath) (virCgroupDenyDevice, virCgroupDenyDeviceMajor) (virCgroupDenyDevicePath): Add parameter. * src/qemu/qemu_driver.c (qemudDomainSaveFlag): Update clients. * src/lxc/lxc_controller.c (lxcSetContainerResources): Likewise. * src/qemu/qemu_cgroup.c: Likewise. (qemuSetupDiskPathAllow): Also, honor read-only disks.	2011-03-09 11:35:36 -07:00
Eric Blake	f2512684ad	audit: also audit cgroup controller path Although the cgroup device ACL controller path can be worked out by researching the code, it is more efficient to include that information directly in the audit message. * src/util/cgroup.h (virCgroupPathOfController): New prototype. * src/util/cgroup.c (virCgroupPathOfController): Export. * src/libvirt_private.syms: Likewise. * src/qemu/qemu_audit.c (qemuAuditCgroup): Use it.	2011-03-09 10:19:17 -07:00
Eric Blake	b1a5aefcee	build: fix build on cygwin On cygwin: CC libvirt_util_la-cgroup.lo util/cgroup.c: In function 'virCgroupKillRecursiveInternal': util/cgroup.c:1458: warning: implicit declaration of function 'virCgroupNew' [-Wimplicit-function-declaration] * src/util/cgroup.c (virCgroupKill): Don't build on platforms where virCgroupNew is unsupported.	2011-03-08 21:44:24 -07:00
Daniel P. Berrange	3c37a171a2	Add check for kill() to fix build of cgroups on win32 The kill() function doesn't exist on Win32, so it needs to be checked for at build time & code disabled in cgroups * configure.ac: Check for kill() * src/util/cgroup.c: Stub out virCGroupKill* functions when kill() isn't available	2011-02-28 14:13:58 +00:00
Daniel P. Berrange	33191b419c	Add APIs for killing off processes inside a cgroup The virCgroupKill method kills all PIDs found in a cgroup The virCgroupKillRecursively method does this recursively for child cgroups. The virCgroupKillPainfully method does a recursive kill several times in a row until everything has really died	2011-02-25 14:21:30 +00:00
Eric Blake	061738764d	cgroup: determine when skipping non-devices * src/util/cgroup.c (virCgroupAllowDevicePath) (virCgroupDenyDevicePath): Don't fail with EINVAL for non-devices. * src/qemu/qemu_driver.c (qemudDomainSaveFlag): Update caller. * src/qemu/qemu_cgroup.c (qemuSetupDiskPathAllow) (qemuSetupChardevCgroup, qemuSetupHostUsbDeviceCgroup) (qemuSetupCgroup, qemuTeardownDiskPathDeny): Likewise.	2011-02-24 13:31:05 -07:00
Daniel P. Berrange	35416720c2	Put <stdbool.h> into internal.h so it is available everywhere Remove the <stdbool.h> header from all source files / headers and just put it into internal.h * src/internal.h: Add <stdbool.h>	2011-02-24 12:04:06 +00:00
Eric Blake	76c57a7c1d	cgroup: preserve correct errno on failure * src/util/cgroup.c (virCgroupSetValueStr, virCgroupGetValueStr) (virCgroupRemoveRecursively): VIR_DEBUG can clobber errno. (virCgroupRemove): Use VIR_DEBUG rather than DEBUG.	2011-02-16 08:10:30 -07:00
Eric Blake	bd6ea30384	build: silence false positive clang report clang complained that STREQ(group->controllers[i].mountPoint,...) was a NULL dereference when i==VIR_CGROUP_CONTROLLER_CPUSET, because it assumes the worst about virCgroupPathOfController. Marking the argument const doesn't yet have an effect, per this clang bug: http://llvm.org/bugs/show_bug.cgi?id=7758 So, we use sa_assert, which was designed to shut up false positives from tools like clang. * src/util/cgroup.c (virCgroupMakeGroup): Teach clang that there is no NULL dereference.	2011-02-14 15:37:32 -07:00
Gui Jianfeng	c3658ab543	cgroup: Implement blkio.weight tuning API. Implement blkio.weight tuning API. Acked-by: Daniel P. Berrange <berrange@redhat.com> Signed-off-by: Gui Jianfeng <guijianfeng@cn.fujitsu.com>	2011-02-08 11:25:33 -07:00
Gui Jianfeng	b58241a690	cgroup: Enable cgroup hierarchy for blkio cgroup Enable cgroup hierarchy for blkio cgroup Acked-by: Daniel P. Berrange <berrange@redhat.com> Signed-off-by: Gui Jianfeng <guijianfeng@cn.fujitsu.com>	2011-02-08 10:42:14 -07:00
Nikunj A. Dadhania	d94a14f89d	memtune: Let virsh know the unlimited value for memory tunables Display or set unlimited values for memory parameters. Unlimited is represented by INT64_MAX in memory cgroup. Signed-off-by: Nikunj A. Dadhania <nikunj@linux.vnet.ibm.com> Reported-by: Justin Clift <jclift@redhat.com>	2011-01-14 17:17:27 -07:00
Jean-Baptiste Rouault	966a1bfe22	Create file in virFileWriteStr() if it doesn't exist This patch adds a mode_t parameter to virFileWriteStr(). If mode is different from 0, virFileWriteStr() will try to create the file if it doesn't exist. * src/util/util.h (virFileWriteStr): Alter signature. * src/util/util.c (virFileWriteStr): Allow file creation. * src/network/bridge_driver.c (networkEnableIpForwarding) (networkDisableIPV6): Adjust clients. * src/node_device/node_device_driver.c (nodeDeviceVportCreateDelete): Likewise. * src/util/cgroup.c (virCgroupSetValueStr): Likewise. * src/util/pci.c (pciBindDeviceToStub, pciUnBindDeviceFromStub): Likewise.	2010-12-03 08:08:22 -07:00
Stefan Berger	7b7cb1ecc9	deprecate fclose() and introduce VIR_{FORCE_}FCLOSE() Similarly to deprecating close(), I am now deprecating fclose() and introduce VIR_FORCE_FCLOSE() and VIR_FCLOSE(). Also, fdopen() is replaced with VIR_FDOPEN(). Most of the files are opened in read-only mode, so usage of VIR_FORCE_CLOSE() seemed appropriate. Others that are opened in write mode already had the fclose()< 0 check and I converted those to VIR_FCLOSE()< 0. I did not find occurrences of possible double-closed files on the way.	2010-11-16 21:13:29 -05:00
Lai Jiangshan	41b2cee2a8	qemu_driver: add virCgroupMounted When we mount any cgroup without "-o devices", we will fail to start vms: error: Failed to start domain vm1 error: Unable to deny all devices for vm1: No such file or directory When we mount any cgroup without "-o cpu", we will fail to get schedinfo: Scheduler : posix error: unable to get cpu shares tunable: No such file or directory We should only use the cgroup controllers which are mounted on host. So I add virCgroupMounted() for qemuCgroupControllerActive() Signed-off-by: Lai Jiangshan <laijs@cn.fujitsu.com>	2010-10-29 09:46:25 -06:00
Nikunj A. Dadhania	5f481e4df1	Implement cgroup memory controller tunables Provides interfaces for setting/getting memory tunables like hard_limit, soft_limit and swap_hard_limit	2010-10-12 19:26:09 +02:00
Ryota Ozaki	29da015aac	cgroup: Fix compilation broken on MinGW due to dirent->d_type As pointed out by Eric Blake, using dirent->d_type breaks compilation on MinGW. This patch addresses this by using '#if defined' as same as doing for virCgroupForDriver.	2010-06-30 08:32:23 -06:00
Ryota Ozaki	adc796c8eb	cgroup: Add missing errno == ENOENT check in virCgroupRemoveRecursively ENOENT happens normally when a subsystem is enabled with any other subsystems and the directory of the target group has already removed in a prior loop. In that case, the function should just return without leaving an error message. NB this is the same behavior as before introducing virCgroupRemoveRecursively.	2010-06-29 12:16:51 -06:00
Daniel P. Berrange	2bad82f71e	Set labelling for character devices in security drivers When configuring serial, parallel, console or channel devices with a file, dev or pipe backend type, it is necessary to label the file path in the security drivers. For char devices of type file, it is neccessary to pre-create (touch) the file if it does not already exist since QEMU won't be allowed todo so itself. dev/pipe configs already require the admin to pre-create before starting the guest. * src/qemu/qemu_security_dac.c: set file ownership for character devices * src/security/security_selinux.c: Set file labeling for character devices * src/qemu/qemu_driver.c: Add character devices to cgroup ACL	2010-06-25 14:39:54 +01:00
Ryota Ozaki	4a4eb13e7a	cgroup: Enable memory.use_hierarchy of cgroup for domain Through conversation with Kumar L Srikanth-B22348, I found that the function of getting memory usage (e.g., virsh dominfo) doesn't work for lxc with ns subsystem of cgroup enabled. This is because of features of ns and memory subsystems. Ns creates child cgroup on every process fork and as a result processes in a container are not assigned in a cgroup for domain (e.g., libvirt/lxc/test1/). For example, libvirt_lxc and init (or somewhat specified in XML) are assigned into libvirt/lxc/test1/8839/ and libvirt/lxc/test1/8839/8849/, respectively. On the other hand, memory subsystem accounts memory usage within a group of processes by default, i.e., it does not take any child (and descendant) groups into account. With the two features, virsh dominfo which just checks memory usage of a cgroup for domain always returns zero because the cgroup has no process. Setting memory.use_hierarchy of a group allows to account (and limit) memory usage of every descendant groups of the group. By setting it of a cgroup for domain, we can get proper memory usage of lxc with ns subsystem enabled. (To be exact, the setting is required only when memory and ns subsystems are enabled at the same time, e.g., mount -t cgroup none /cgroup.)	2010-06-23 14:31:38 -06:00
Ryota Ozaki	842b51ff5d	cgroup: Change virCgroupRemove to remove all descendant groups at first As same as normal directories, a cgroup cannot be removed if it contains sub groups. This patch changes virCgroupRemove to remove all descendant groups (subdirectories) of a target group before removing the target group. The handling is required when we run lxc with ns subsystem of cgroup. Ns subsystem automatically creates child cgroups on every process forks, but unfortunately the groups are not removed on process exits, so we have to remove them by ourselves. With this patch, such child (and descendant) groups are surely removed at lxc shutdown, i.e., lxcVmCleanup which calls virCgroupRemove.	2010-06-23 14:30:19 -06:00
Jim Meyering	2d3208029b	maint: mark translatable string args of VIR_ERROR Run this: git grep -l 'VIR_ERROR\s("'\|xargs perl -pi -e \ 's/(VIR_ERROR)\s\((".*?"),/$1(_($2),/'	2010-05-20 21:36:25 +02:00
Jim Meyering	8d63d82e5c	maint: mark translatable string args of VIR_ERROR0 Run this: git grep -l 'VIR_ERROR0\s("'\|xargs perl -pi -e \ 's/(VIR_ERROR0)\s$(".*?")$/$1(_($2))/'	2010-05-20 21:36:25 +02:00
Ryota Ozaki	c4157e5272	cgroup: Fix possible memory leak in virCgroupMakeGroup * src/util/cgroup.c: free temporal path string before breaking loop	2010-05-03 15:01:12 -06:00
Matthias Bolte	40648b156b	cygwin: Check explicitly for getmntent_r Cygwin has mntent.h but lacks getmntent_r. Update preprocessor checks to catch this combination.	2010-04-23 20:15:53 +02:00
Matthias Bolte	73b45bfbff	cgroup: Replace sscanf with virStrToLong_ll The switch from %lli to %lld in virCgroupGetValueI64 is intended, as virCgroupGetValueU64 uses base 10 too, and virCgroupSetValueI64 uses %lld to format the number to string. Parsing is stricter now and doesn't accept trailing characters after the actual value anymore.	2010-04-01 12:53:41 +02:00
Jim Fehlig	09fafa1e21	Avoid libvirtd crash when cgroups is not configured on host Invoking virDomainSetMemory() on lxc driver results in libvirtd segfault when cgroups has not been configured on the host. Ensure driver->cgroup is non-null before invoking virCgroupForDomain(). To prevent similar segfaults in the future, ensure driver parameter to virCgroupForDomain() is non-null before dereferencing.	2010-03-22 09:42:14 -06:00
Eric Blake	36d8e7d8d7	build: consistently indent preprocessor directives * global: patch created by running: for f in $(git ls-files '*.[ch]') ; do cppi $f > $f.t && mv $f.t $f done	2010-03-09 19:22:28 +01:00
Daniel P. Berrange	ede3bc1128	Avoid creating top level cgroups if just querying for existance When getting the driver/domain cgroup it is possible to specify whether it should be auto created. If auto-creation was turned off, libvirt still mistakenly created its own top level cgroup * src/util/cgroup.c: Honour autocreate flag for top level cgroup	2010-03-05 15:00:58 +00:00
Matthias Bolte	f972dc2d5c	Remove conn parameter from util functions It was used for error reporting only.	2010-02-09 01:04:54 +01:00
Jim Meyering	fd10c4e1ee	cgroup.c: don't leak mem+FD upon OOM * src/util/cgroup.c (virCgroupDetectPlacement): Close the mapping FILE* also upon error.	2010-02-04 20:00:07 +01:00

1 2

58 Commits