When migrating a shutoff domain (i.e., offline migration), we have no
statistics to report and thus jobInfo will be NULL in
qemuMigrationFinish.
Broken by me in v3.10.0-183-ge8784e7868.
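In other words, the statistics may only be copied when jobInfo is
non-NULL. A minimal stand-alone sketch of that guard, with hypothetical
names rather than the actual libvirt code:

  /* Hypothetical stand-in for the real job statistics structure. */
  struct jobInfo {
      unsigned long long timeElapsed;
      /* ... other statistics ... */
  };

  static void
  finishReportStats(struct jobInfo *completed, const struct jobInfo *jobInfo)
  {
      /* Offline migration: QEMU never ran, so there is nothing to copy. */
      if (!jobInfo)
          return;

      *completed = *jobInfo;
  }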
https://bugzilla.redhat.com/show_bug.cgi?id=1536351
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
Libvirt 3.7.0 and earlier reported a migration job as completed
immediately after QEMU finished sending migration data, at which point
the migration was not really complete yet. Commit v3.7.0-29-g3f2d6d829e
fixed this, but introduced a regression in reporting statistics for
completed jobs: the job started to be reported as still running. This
happened because the completed job statistics, including the job status,
are copied from the running job before we finally mark it as completed.
Let's make sure QEMU_DOMAIN_JOB_STATUS_COMPLETED is always set in the
completed job info even when the job has not finished yet.
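A self-contained sketch of what this means, with hypothetical types
standing in for libvirt's internal job info structure:

  /* Hypothetical job states. */
  typedef enum {
      JOB_STATUS_ACTIVE,
      JOB_STATUS_QEMU_COMPLETED,   /* QEMU finished, libvirt has not */
      JOB_STATUS_COMPLETED,
  } jobStatus;

  typedef struct {
      jobStatus status;
      unsigned long long timeElapsed;
      /* ... other statistics ... */
  } jobInfo;

  /* Copy the running job's statistics into the "completed" slot, but never
   * let the copied status claim the job is still running. */
  static void
  jobInfoStoreCompleted(jobInfo *completed, const jobInfo *current)
  {
      *completed = *current;
      completed->status = JOB_STATUS_COMPLETED;
  }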
https://bugzilla.redhat.com/show_bug.cgi?id=1523036
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
Right-aligning backslashes when defining macros or using complex
commands in Makefiles looks cute, but as soon as any change is
required you end up with either distractingly broken alignment or
unnecessarily big diffs where most of the changes are just pushing
all backslashes a few characters to one side.
Generated using
  $ git grep -El '[[:blank:]][[:blank:]]\\$' | \
      grep -E '\.([chx]|am|mk)$' | \
      while read f; do \
        sed -Ei 's/[[:blank:]]*[[:blank:]]\\$/ \\/g' "$f"; \
      done
Signed-off-by: Andrea Bolognani <abologna@redhat.com>
The parameters used a "migrate" prefix, which is pretty redundant: the
qemuMonitorMigrationParams structure is our internal representation of
QEMU migration parameters and it is supposed to use names which match
the QEMU ones.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
QEMU identified a race condition between the device state serialization
and the end of storage migration. Both QEMU and libvirt need to be
updated to fix this.
Our migration workflow is modified so that after starting the migration
we wait for QEMU to enter the "pre-switchover", "postcopy-active", or
"completed" state. Once there, we cancel all block jobs as usual. But if
QEMU is in "pre-switchover", we need to resume the migration afterwards
and wait again for the real end (either "postcopy-active" or
"completed" state).
Old QEMU will just enter either "postcopy-active" or "completed"
directly, which is still correctly handled even by new libvirt. The
"pre-switchover" state will only be entered if QEMU supports it and the
pause-before-switchover capability was enabled. Thus all combinations of
libvirt and QEMU will work, but only new QEMU with new libvirt will
avoid the race condition.
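A rough, self-contained sketch of the adjusted flow; the enum and the
callbacks are hypothetical stand-ins for libvirt's internal helpers, not
the actual code:

  typedef enum {
      MIG_PRE_SWITCHOVER,    /* QEMU paused before device serialization */
      MIG_POSTCOPY_ACTIVE,
      MIG_COMPLETED,
      MIG_FAILED,
  } migState;

  /* waitForState blocks until QEMU reaches one of the interesting states,
   * cancelBlockJobs finishes the storage migration, migrateContinue resumes
   * a migration paused in "pre-switchover" (the QMP migrate-continue
   * command). */
  static int
  finishMigration(migState (*waitForState)(void),
                  int (*cancelBlockJobs)(void),
                  int (*migrateContinue)(void))
  {
      migState state = waitForState();

      if (state == MIG_FAILED)
          return -1;

      /* Device serialization cannot race with storage migration here, so
       * cancelling the block jobs is safe. */
      if (cancelBlockJobs() < 0)
          return -1;

      if (state == MIG_PRE_SWITCHOVER) {
          /* QEMU paused before switchover: let it proceed and wait for the
           * real end of migration. */
          if (migrateContinue() < 0)
              return -1;
          state = waitForState();
      }

      return (state == MIG_COMPLETED || state == MIG_POSTCOPY_ACTIVE) ? 0 : -1;
  }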
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
This new capability enables a pause before device state serialization
so that we can finish all block jobs without racing with the end of the
migration. The pause is indicated by the "pre-switchover" state. Once
we're done, QEMU enters the "device" migration state.
This patch just defines the new capability and QEMU migration states and
their mapping to our job states.
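For reference, this is roughly the QMP request the capability setup
boils down to (shown as a plain string for illustration only; libvirt
builds and sends it through its monitor helpers):

  static const char *enablePauseBeforeSwitchover =
      "{ \"execute\": \"migrate-set-capabilities\","
      "  \"arguments\": { \"capabilities\": ["
      "    { \"capability\": \"pause-before-switchover\", \"state\": true }"
      "  ] } }";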
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Instead of enumerating all states which need to be turned into
QEMU_DOMAIN_JOB_STATUS_FAILED (and failing to add all of them), it's
better to mention just the one which needs to be left alone.
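In code terms, something like the following hypothetical switch, where
only the completed state is left untouched:

  typedef enum {
      JOB_STATUS_ACTIVE,
      JOB_STATUS_MIGRATING,
      JOB_STATUS_COMPLETED,
      JOB_STATUS_FAILED,
  } jobStatus;

  /* On error, everything except COMPLETED becomes FAILED; states added
   * later automatically get the right treatment via the default label. */
  static jobStatus
  statusOnError(jobStatus status)
  {
      switch (status) {
      case JOB_STATUS_COMPLETED:
          return status;
      default:
          return JOB_STATUS_FAILED;
      }
  }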
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
Almost every failure in qemuMigrationRun while we are talking to the
QEMU monitor results in a jump to the exit_monitor label. The only
exception is removed by this patch.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
The "ret" variable is used for storing the return value of a function
and should not be used as a temporary variable.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
Merge cancel and cancelPostCopy sections with the generic error section,
where we can easily decide whether canceling the ongoing migration is
required.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
Let cleanup only do things common to both failure and success paths and
move error handling code inside the new "error" section.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
Some code which was supposed to be executed only when migration
succeeded was buried inside the cleanup code.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
When adding a new job state it's useful to let the compiler complain
about places where we need to think about what to do with the new
state.
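A stand-alone illustration of the technique: with no default label and
-Wswitch (part of -Wall) enabled, adding a new enum value makes the
compiler flag every switch that does not handle it:

  typedef enum {
      JOB_STATUS_NONE,
      JOB_STATUS_ACTIVE,
      JOB_STATUS_COMPLETED,
      /* Adding JOB_STATUS_PAUSED here makes GCC/Clang warn about the
       * switch in jobIsFinished() below. */
  } jobStatus;

  static int
  jobIsFinished(jobStatus status)
  {
      switch (status) {
      case JOB_STATUS_NONE:
      case JOB_STATUS_ACTIVE:
          return 0;
      case JOB_STATUS_COMPLETED:
          return 1;
      }
      return 0;
  }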
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
All calls to qemuMonitorGetMigrationCapability in the QEMU driver are
replaced with qemuMigrationCapsGet.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
We need to send allowReboot in the migration cookie to ensure the same
behavior of the virDomainSetLifecycleAction() API on the destination.
Consider this scenario:
1. On the source the domain is started with:
<on_poweroff>destroy</on_poweroff>
<on_reboot>restart</on_reboot>
<on_crash>destroy</on_crash>
2. User calls an API to set "destroy" for <on_reboot>:
<on_poweroff>destroy</on_poweroff>
<on_reboot>destroy</on_reboot>
<on_crash>destroy</on_crash>
3. The guest is migrated to a different host
4a. Without allowReboot in the migration cookie, the QEMU process on
the destination would be started with -no-reboot, which would prevent
using the virDomainSetLifecycleAction() API for the rest of the guest's
lifetime.
4b. With allowReboot in the migration cookie, the QEMU process on the
destination is started without -no-reboot, just like on the source host,
and the virDomainSetLifecycleAction() API continues to work.
The following patch adds a QEMU implementation of the
virDomainSetLifecycleAction() API; that implementation disallows using
the API if all actions are set to "destroy", because in that case we add
"-no-reboot" to the QEMU command line. Changing the lifecycle action
would then be pointless, since the QEMU process is always terminated.
Reviewed-by: John Ferlan <jferlan@redhat.com>
Signed-off-by: Pavel Hrdina <phrdina@redhat.com>
When migration fails, QEMU may provide a description of the error in
the reply to the query-migrate QMP command. We can fetch this error and
use it instead of the generic "unexpectedly failed" message.
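For illustration, this is the shape of a failed query-migrate reply; the
error text below is made up, but "error-desc" is the field the message
comes from:

  /* Example reply only; the actual text depends on why migration failed. */
  static const char *exampleQueryMigrateReply =
      "{ \"return\": {"
      "    \"status\": \"failed\","
      "    \"error-desc\": \"Unable to write to socket: Broken pipe\""
      "} }";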
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
Pass flags to the function rather than just whether we have incoming
migration. This also enforces correct startup policy for USB devices
when reverting from a snapshot.
qemuMigrationPrepareAny calls several of the functions which start the
QEMU process for incoming migration, passing the flags explicitly.
Extract them into a variable so that they can easily be used for other
calls or changed in the future.
Seeing a log message saying 'flags=93' is ambiguous & confusing unless
you happen to know that libvirt always prints flags as hex. Change our
debug messages so that they always add a '0x' prefix when printing flags,
and '0' prefix when printing mode. A few other misc places gain a '0x'
prefix in error messages too.
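A tiny stand-alone example of the difference (libvirt's own messages go
through VIR_DEBUG and friends, but plain printf shows the same point):

  #include <stdio.h>

  int main(void)
  {
      unsigned int flags = 0x93;      /* e.g. a bitmask of VIR_* flags */

      printf("flags=%x\n", flags);    /* prints "flags=93" - looks decimal */
      printf("flags=0x%x\n", flags);  /* prints "flags=0x93" - unambiguous */
      printf("mode=0%o\n", 0644);     /* octal mode with an explicit 0 */
      return 0;
  }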
Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
In the case of a real migration (i.e., not migrating to a file on save,
dump, etc.) the migration info is not complete at the time QEMU finishes
the migration in normal (non-postcopy) mode. We still need to update
disk stats, downtime info, etc. Thus let's not expose this job status as
completed.
To achieve this, let's set the status to 'qemu completed' after QEMU
reports the migration is finished. This is not visible as a completed
job to clients. The cookie code in the confirm phase will finally turn
the job into a completed one. As there is nothing more to do when
migrating to a file, the status is set to 'completed' right away in that
case, as before.
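A small sketch of that distinction, with hypothetical names standing in
for the real job states:

  typedef enum {
      STATUS_MIGRATING,
      STATUS_QEMU_COMPLETED,   /* QEMU is done, cookie exchange pending */
      STATUS_COMPLETED,        /* what clients see as a finished job */
  } jobStatus;

  static jobStatus
  statusAfterQemuFinished(int migratingToFile)
  {
      /* Disk stats, downtime, etc. still need updating for a real
       * migration, so don't expose the job as completed yet. */
      return migratingToFile ? STATUS_COMPLETED : STATUS_QEMU_COMPLETED;
  }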
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
When getting job info, fetch the mirror stats from QEMU if the mirror
has not reached the ready phase yet. Otherwise the mirror stats are
already saved in the current job.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Instead of checking stats.status, let's set the status to migrating as
soon as the migrate command is sent (waiting for completion is a good
place to do it too).
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Setting the status to none has little value - getting the job status
will then not return even the elapsed time.
After this patch getting job stats stays correct in the sense that it
will not fetch migration stats, because it consults stats.status before
doing the fetch.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
qemuMigrationFetchJobStatus is rather inconvenient. Some of its callers
don't need the status to be updated, some don't need to update the
elapsed time right away. So let's update the status or elapsed time in
the callers instead.
This patch drops updating the job status when a client requests job
stats. This way we will not report the status as 'completed' while it
has not been updated by the migration routine yet.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
This way we fetch the stats in only one place. The former code basically
just waits for the completed/postcopy status and doesn't need to mess
with the stats.
The patch also drops raising an error when updating the stats fails,
which did not make much sense anyway.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Let's introduce the QEMU_DOMAIN_JOB_STATUS_POSTCOPY state for
job.current->status instead of checking job.current->stats.status. The
latter can change whenever migration statistics are fetched. Keeping the
job state out of the stats variable and using it purely as a store for
statistics seems more manageable.
This patch removes all state-checking usage of stats except for
qemuDomainGetJobStatsInternal, which will be handled separately.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
This patch simply switches the code from using VIR_DOMAIN_JOB_* to the
newly introduced QEMU_DOMAIN_JOB_STATUS_*. Later this gives us the
freedom to introduce states for the postcopy and mirroring phases.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
At some places we either already have a synchronous job or we have just
released it. Also, some APIs might want to use this code without having
to release their job. Therefore, the job acquiring code is moved out to
qemuDomainRemoveInactiveJob so that qemuDomainRemoveInactive does just
what it promises.
Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
At present, shared disks can be migrated only with readonly or
cache=none. But cache=directsync should be safe for migration as well,
because both cache=directsync and cache=none bypass the host page cache,
and cache=directsync additionally writes through the QEMU block layer
cache.
Signed-off-by: Peng Hao <peng.hao2@zte.com.cn>
Reviewed-by: Wang Yechao <wang.yechao255@zte.com.cn>
While qemuProcessIncomingDefNew takes an fd argument and stores it in
the qemuProcessIncomingDef structure, the caller is still responsible
for closing the file descriptor.
Introduced by commit v1.2.21-140-ge7c6f4575.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: John Ferlan <jferlan@redhat.com>
Most places which want to check ABI stability for an active domain need
to call this API rather than the original
qemuDomainDefCheckABIStability. The only exception is in snapshots where
we need to decide what to do depending on the saved image data.
https://bugzilla.redhat.com/show_bug.cgi?id=1460952
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
Use ATTRIBUTE_FALLTHROUGH, introduced by commit
5d84f5961b8e28e802f600bb2d2c6903e219092e, instead of comments to
indicate that the fall through is an intentional behavior.
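A self-contained illustration of the change; ATTRIBUTE_FALLTHROUGH comes
from libvirt's internal.h, and the local definition below exists only so
the example compiles on its own:

  #if defined(__GNUC__) && __GNUC__ >= 7
  # define ATTRIBUTE_FALLTHROUGH __attribute__((fallthrough))
  #else
  # define ATTRIBUTE_FALLTHROUGH do {} while (0)
  #endif

  /* Counts the bits set in values 0..3. */
  static int
  bitsSet(int value)
  {
      int bits = 0;

      switch (value) {
      case 3:
          bits++;
          ATTRIBUTE_FALLTHROUGH;   /* replaces a plain "fall through" comment */
      case 1:
      case 2:
          bits++;
          break;
      default:
          break;
      }
      return bits;
  }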
Signed-off-by: Marc Hartmayer <mhartmay@linux.vnet.ibm.com>
Reviewed-by: Boris Fiuczynski <fiuczy@linux.vnet.ibm.com>
Reviewed-by: Bjoern Walk <bwalk@linux.vnet.ibm.com>
If QEMU is new enough and we have the live updated CPU definition in
either save or migration cookie, we can use it to enforce ABI. The
original guest CPU from domain XML will be stored in private data.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
Since the domain XML sent during migration uses the original guest CPU
definition but we still want the destination to enforce the ABI if it is
new enough, we send the live updated CPU definition in a migration
cookie.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
When persistent migration of a transient domain is requested but no
custom XML is passed to the migration API we would just let the
destination daemon make a persistent definition from the live definition
itself. This is not a problem now, but once the destination daemon
starts replacing the original CPU definition with the one from migration
cookie before starting a domain, it would need to add more ugly hacks to
reverse the operation. Let's just always send the persistent definition
in the cookie to make things a bit cleaner.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
The destination host may not be able to start a domain using the live
updated CPU definition because either libvirt or QEMU may not be new
enough. Thus we need to send the original guest CPU definition.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Reviewed-by: Pavel Hrdina <phrdina@redhat.com>
While fixing a bug with incorrectly freed memory in commit
v3.1.0-399-g5498aa29a, I accidentally broke persistent migration of
transient domains. Before adding qemuDomainDefCopy in the path, the code
just took NULL from vm->newDef and used it as the persistent def, which
resulted in no persistent XML being sent in the migration cookie. This
scenario is perfectly valid and the destination correctly handles it by
using the incoming live definition and storing it as the persistent one.
After the mentioned commit libvirtd would just segfault in the described
scenario.
https://bugzilla.redhat.com/show_bug.cgi?id=1446205
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
When creating commit v3.2.0-77-g8be3ccd04, I completely forgot that one
migration capability is very special. It's the "events" capability which
tells QEMU to report "MIGRATION" events. Since libvirt always wants the
events, it is enabled in qemuConnectMonitor and the rest of the code
should not touch it.
https://bugzilla.redhat.com/show_bug.cgi?id=1439841
https://bugzilla.redhat.com/show_bug.cgi?id=1441165
Messed-up-by: Jiri Denemark <jdenemar@redhat.com>
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Not all async jobs are visible via virDomainGetJobStats (either they are
too fast or getting the stats is not allowed during the job), but
forcing all of them to advertise the operation is easier than hunting
the jobs for which fetching statistics is allowed. And we won't need to
think about this when we add support for getting stats for more jobs.
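A sketch of how a client can read the advertised operation through the
public API (error handling trimmed; the connection URI and domain name
are just examples):

  #include <stdio.h>
  #include <libvirt/libvirt.h>

  int main(void)
  {
      virConnectPtr conn = virConnectOpen("qemu:///system");
      virDomainPtr dom = conn ? virDomainLookupByName(conn, "example") : NULL;
      virTypedParameterPtr params = NULL;
      int nparams = 0;
      int type;
      int op;

      if (dom &&
          virDomainGetJobStats(dom, &type, &params, &nparams, 0) == 0 &&
          virTypedParamsGetInt(params, nparams,
                               VIR_DOMAIN_JOB_OPERATION, &op) == 1)
          printf("job operation: %d\n", op);   /* virDomainJobOperation value */

      virTypedParamsFree(params, nparams);
      if (dom)
          virDomainFree(dom);
      if (conn)
          virConnectClose(conn);
      return 0;
  }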
https://bugzilla.redhat.com/show_bug.cgi?id=1441563
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Commit 0feebab2 added a call to qemuBlockNodeNamesDetect for completed
jobs when updating block jobs. This affects the logic for cancelling the
drive mirror, as this function drops the vm lock. We now have to recheck
all disks preceding the disk with the completed block job before going
back to waiting for block job events.
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
While peer-to-peer migration enters the Confirm phase even if the
Perform phase fails, the client which initiated a non-p2p migration will
never call the virDomainMigrateConfirm* API if the Perform phase failed.
Thus we need to explicitly reset the migration before reporting a
failure from the Perform phase API.
https://bugzilla.redhat.com/show_bug.cgi?id=1425003
Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
Since the disks are copied by qemu, there's no need to enforce
cache=none. Thankfully the code that added qemuMigrateDisk did not break
existing configs, since if you don't select any disk to migrate
explicitly the code behaves sanely.
The logic for determining whether a disk should be migrated is
open-coded since using qemuMigrateDisk twice would be semantically
incorrect.