womax/nova - nova - Gitea: Git with a cup of tea

womax/nova

Author	SHA1	Message	Date
Kamil Sambor	f6314d9027	Switch nova-conductor to use global executor Change-Id: I0700a907746712965860455909d976f6eab0d9bd Signed-off-by: Kamil Sambor <kamil.sambor@gmail.com>	2025-10-13 13:26:26 +02:00
Balazs Gibizer	ec426532c3	Run nova-conductor in native threading mode Previous patches removed direct eventlet usage from nova-conductor so now we can run it with native threading as well. This patch documents the possibility and switches both nova-conductor process to native threading mode in the nova-next job. Change-Id: If26c0c7199cbda157f24b99a419697ecb6618fa6 Signed-off-by: Balazs Gibizer <gibi@redhat.com>	2025-09-22 10:17:39 +00:00
Kamil Sambor	9f58f596db	Switch nova-conductor to use ThreadPoolExecutor This is a pure refactor so not having any unit test change actually signals that the refactor did not change the existing behavior which is good. The unit test run on this patch only covers the eventlet mode but higher in the series we run unit test with native threading mode in a separate job that will complement the coverage for this patch. Change-Id: Iafc96c93a0d4c406b77902942b2940653441fe38 Signed-off-by: Kamil Sambor <kamil.sambor@gmail.com>	2025-09-17 11:35:35 +02:00
Zuul	640782207c	Merge "Remove eventlet timer from multi_cell_list"	2025-09-15 17:15:03 +00:00
Zuul	87bf7700b8	Merge "reno: Update master for unmaintained/2023.1"	2025-09-12 10:55:00 +00:00
OpenStack Release Bot	71607ef8a5	Update master for stable/2025.2 Add file to the reno documentation build to show release notes for stable/2025.2. Use pbr instruction to increment the minor version number automatically so that master versions are higher than the versions on stable/2025.2. Sem-Ver: feature Change-Id: I7d967c1d5b1ac7fa2e601acfa25c3b5c3880056e Signed-off-by: OpenStack Release Bot <infra-root@openstack.org> Generated-By: openstack/project-config:roles/copy-release-tools-scripts/files/release-tools/add_release_note_page.sh	2025-09-12 08:54:07 +00:00
Zuul	759e03c35d	Merge "Add Flamingo prelude section"	2025-09-11 09:03:15 +00:00
Zuul	7e1d86bdff	Merge "Fix bug 2114951"	2025-09-10 14:38:54 +00:00
Zuul	ee0cb67782	Merge "Update Debian qemu/libvirt/libguestfs versions"	2025-09-10 11:37:14 +00:00
Zuul	36c63f1664	Merge "hypervisors: Optimize uptime retrieval for better performance"	2025-09-10 11:36:59 +00:00
Thomas Goirand	187ffa120f	Update Debian qemu/libvirt/libguestfs versions Change-Id: I99b742bd527672cb32dd7cf8e80c20aeb8b7a5b0 Signed-off-by: Takashi Kajinami <kajinamit@oss.nttdata.com>	2025-09-10 18:44:57 +09:00
René Ribaud	45ddbc2569	Add Flamingo prelude section Shamelessly copied from the cycle highlights Signed-off-by: René Ribaud <rribaud@redhat.com> Change-Id: Ib9de63fe4ccce24921326ef3bcfc690fd4481687	2025-09-10 10:39:44 +02:00
Takashi Kajinami	51aceec3ab	docs: Update libvirt version support matrix for Flamingo Change-Id: I05ac8ec870e75d58095b9f34a63ce786a47c3922 Signed-off-by: Takashi Kajinami <kajinamit@oss.nttdata.com>	2025-09-10 02:25:08 +09:00
Zuul	2952c10948	Merge "Fix fast8 tox target"	2025-09-08 19:43:32 +00:00
Zuul	74673be235	Merge "Update compute rpc alias for epoxy"	2025-09-08 19:17:46 +00:00
Zuul	9efabaa993	Merge "Reproducer for bug 2114951"	2025-09-08 17:48:28 +00:00
Dan Smith	2bf2814add	Fix fast8 tox target This is a major timesaver for repos the size of nova. This broke recently due to changes in flake8 itself. This removes some needless complexity to make it work again. It also removes the suggestion to use pre-commit which has nothing to do with this target, which also stings more when pre-commit is breaking things which is why you're using this in the first place. Change-Id: Ieb150bf0931ad8031ca83bae1f206075a9f505e2 Signed-off-by: Dan Smith <dansmith@redhat.com>	2025-09-08 09:35:47 -07:00
Sean Mooney	567dbe1867	hypervisors: Optimize uptime retrieval for better performance The /os-hypervisors/detail API endpoint was experiencing significant performance issues in environments with many compute nodes when using microversion 2.88 or higher, as it made sequential RPC calls to gather uptime information from each compute node. This change optimizes uptime retrieval by: * Adding uptime to periodic resource updates sent by nova-compute to the database, eliminating synchronous RPC calls during API requests * Restricting RPC-based uptime retrieval to hypervisor types that support it (libvirt and z/VM), avoiding unnecessary calls that would always fail * Preferring cached database uptime data over RPC calls when available Closes-Bug: #2122036 Assisted-By: Claude <noreply@anthropic.com> Change-Id: I5723320f578192f7e0beead7d5df5d7e47d54d2b Co-Authored-By: Sylvain Bauza <sbauza@redhat.com> Signed-off-by: Sean Mooney <work@seanmooney.info>	2025-09-05 19:03:38 +01:00
Zuul	0dd7cb1fb0	Merge "libvirt: Disable VMCoreInfo device for SEV-encrypted instances"	2025-09-05 16:32:24 +00:00
Zuul	7d5521ac84	Merge "[pci]Keep used dev in Placement regardless of dev_spec"	2025-09-05 15:36:30 +00:00
Zuul	ab744e5040	Merge "[PCI tracker]Remove non configured devs when freed"	2025-09-05 15:36:13 +00:00
Zuul	cb6ed0e3c0	Merge "Reproduce bug/2115905"	2025-09-05 14:40:57 +00:00
Balazs Gibizer	4495f1f019	[pci]Keep used dev in Placement regardless of dev_spec This changes the PCI Placement translator edge case handling logic to resolve a bug preventing VM deletion. If a device is allocated but removed from the dev_spec then we need to keep the device in Placement otherwise the Placement update will be rejected as we are trying to delete an RP that has allocations. This prevent the deletion of a VM that is using this removed device. The alternative would be to not allow the nova-compute service to start if it detects this situation. However this situation can happen in at least two very different cases: 1. The admin removed a dev_spec. In this case adding the dev_spec back, removing the VM, then removing the dev_spec is the right course of action and nova-compute failing to start would be OK to enforce this. 2. A device disappeared as the HW is died. In this case not allowing the nova-compute to start up would prevent the admin to migrate the other VMs away from the host before doing a HW replacement. Note that this is fairly complex change due to the fact that based on purely the PciDevice object we cannot differentiate between the two cases: 1. A PciDevice object is being removed as the related device spec is removed from the configuration or the device is disappeared from the hypervisor. 2. A PciDevice object was held back for a while as the device spec is removed (or the device disappeared from the hypervisor) while the device was allocated to a VM. And now that VM is undergoing deletion. In both case the PCI in Placement logic sees a PciDevice object in dev.status.REMOVED and dev.instance_uuid = None. However the two cases require different handling. 1. The related inventory can be removed from Placement 2. The related inventory cannot be removed from Placement as it is still being allocated to the VM that is undergoing deletion. The second case is due to the sequence of events during a VM deletion being: * We destroy the VM on the hypervisor * We update the PCI tracker to free the device. As the device was held back the tracker not just frees the device but removes it as well as it is not configured any more in the dev_spec so it should not go to AVAILABLE state. * When the PCI tracker is updated it calls the PCI in Placement logic to update Placement inventories as well. At this point the VM deletion still in progress and the VM's allocation hasn't been deleted in Placement, so the Placement inventory cannot be removed as it is still allocated. * After the resource tracker update is finished the compute manager deletes the VM's allocation in Placement. So in this edge case we temporarily keep the Placement inventory and only remove that in a subsequent periodic run where we are sure the VM's allocation is gone. This means there is a time window when the Placement inventory shows an extra resource even though that resource has already been removed from the PCI tracker. During this window the scheduler might select a host based on this ghost inventory and the compute resource tracker will reject the boot request forcing a normal re-schedule. Closes-Bug: #2115905 Change-Id: Ie9d311ea9f59ff49593003e3773b690dd36fdeb2 Signed-off-by: Balazs Gibizer <gibi@redhat.com>	2025-09-04 10:05:20 +00:00
Balazs Gibizer	f37cdf0c41	[PCI tracker]Remove non configured devs when freed The PCI tracker handles the case when a device spec is removed from the configuration while a device is still being allocated. It keeps the device until the VM is deleted to avoid inconsistencies. However the full removal of such a device needs not just the VM deletion, but also a nova-compute restart. The device tracker just frees the device during VM deletion but does not removed them until the next nova-compute startup. This allows the device to be re-allocated by another VM even though the device is not allowed by a device_spec. This change adds yet another in memory dict to the pci tracker to track these devices that are only kept until they are freed. Then during free() this list is consulted and if the device is in the list then the device is marked for removal as well. This kills two birds with one stone: * We prevent the re-allocation of the device as the state of the device will be set to REMOVED not AVAILABLE during VM deletion. * As PCI in Placement relies on the state of the device to decide what to track in placement, this change makes sure that a device that needs to be removed, is now removed from placement too. Note that we have another bug that prevents this removal for now. But at least the reproducers of that bug now starts to behave the same regardless of how many device belongs to the same RP in placement. Related-Bug: #2115905 Change-Id: I63c8fb2669a3c6b3adb77d210c0f9b39d3657c80 Signed-off-by: Balazs Gibizer <gibi@redhat.com>	2025-09-04 10:30:19 +02:00
Balazs Gibizer	d86aa2d15a	Reproduce bug/2115905 Both the PCI tracker and the PCI in Placement logic handles the case when a device spec is removed from the configuration while a device is still being allocated. However there are edge cases in PCI in Placement that it not handled well. Namely that if the VM with this allocation is deleted, then depending on the amount of VFs the PF had originally, the logic might try to delete the RP before the allocation is removed. That is rejected by Placement. This prevent the deletion of such a VM and therefore prevents one of the ways the original inconsistency can be Note that with this patch we see two additional behaviors worth mentioning: * When the VM is successfully deleted (in a single VF or PF case) the PCI tracker still keeps the now free device in the DB and therefore PCI in Placement also keeps the RP. This keeps the non whitelisted device available for allocations until the next nova-compute restart. * The PCI in Placement logic is different between the case where the last device is removed from an RP and the case where there are other devices on the RP, some that can be removed and some that cannot due to allocation. Related-Bug: #2115905 Change-Id: Ib3febb77299da65ada24ed49849c04cbf3c41af1 Signed-off-by: Balazs Gibizer <gibi@redhat.com>	2025-09-04 10:19:31 +02:00
Zuul	9f156aa954	Merge "Fix 'nova-manage image_property set' command"	2025-09-03 17:29:24 +00:00
Zuul	74e4ff46db	Merge "Do not yield in threading mode"	2025-09-03 16:59:21 +00:00
René Ribaud	1593dddfaf	Fix bug 2114951 This patch fixes an ambiguous regexp prevent using device_filename like 'mkwinimage-cdrom'. The schema in Ib8857d9f725e277f27ccfab83335142b05e2cdf1 mistakenly included r, which caused _-r to be interpreted as a range of items from - to r. The intent was to align the schema with what is allowed in block_device.pyi -> validate_device_name()[1]. Since this brings the schema in line with existing validation logic, it can be treated as a bug fix and does not require a microversion bump. [1] https://github.com/openstack/nova/blob/73724fef9a66c4df3d018e7368067f883b1ed9e2/nova/block_device.py#L378 Closes-Bug: #2114951 Change-Id: Ie852e6dbd005ce46b41a486d3121d4d34d1fc1a2 Signed-off-by: René Ribaud <rribaud@redhat.com>	2025-09-03 18:38:56 +02:00
René Ribaud	aa59133626	Reproducer for bug 2114951 Ambiguous regexp prevent using device_filename like 'mkwinimage-cdrom'. The regexp matches a single character in the range between _ (index 94) and r (index 114) (case sensitive) Related-Bug: #2114951 Change-Id: I5c7ce18eb635a75d5aadc889e730ed77c9a10dc3 Signed-off-by: René Ribaud <rribaud@redhat.com>	2025-09-03 18:38:52 +02:00
Zuul	fa31983299	Merge "[CI]Make nova-tox-py312-threading voting"	2025-09-03 10:40:39 +00:00
Zuul	6fa7f807ad	Merge "Fix duplicate words"	2025-09-03 10:24:05 +00:00
Zuul	a4df1dea8c	Merge "Fix pci_tracker.save to delete all removed devs"	2025-09-02 20:20:45 +00:00
Zuul	ba2d41e463	Merge "Add service version for Falmingo"	2025-09-02 20:20:10 +00:00
René Ribaud	60ba6afc49	Add service version for Falmingo We agreed by I2dd906f34118da02783bb7755e0d6c2a2b88eb5d on the support envelope. Pre-RC1, we need to add a service version in the object. Post-RC1, depending on whether it's SLURP or not SLURP, we need to bump the minimum version or not. This patch only focuses on pre-RC1 stage. Given Gazpacho won't be skippable, we won't need a post-RC1 patch for updating the min that will continue to support Epoxy. HTH. Signed-off-by: René Ribaud <rribaud@redhat.com> Change-Id: I5bf6ad1077fe62e6ff628d211b745857167280fb	2025-09-02 15:51:00 +02:00
René Ribaud	2c87ab7573	Update compute rpc alias for epoxy This adds an alias for Flamingo Signed-off-by: René Ribaud <rribaud@redhat.com> Change-Id: I2b1e76eee52dc9a15d1938a3384ebe30786ceb43	2025-09-02 15:40:03 +02:00
René Ribaud	73724fef9a	doc: mark the maximum microversion for 2025.2 Flamingo Change-Id: I4158fc072ebeda7709bc08eb7d0b924cbc99ca5a Signed-off-by: René Ribaud <rribaud@redhat.com>	2025-09-02 15:37:02 +02:00
Rajesh Tailor	68fbace8af	Fix duplicate words This change fixes duplicate consecutive words from docs as well as code. Signed-off-by: Rajesh Tailor <ratailor@redhat.com> Change-Id: I236ff41fccf831023b6f85840097148a30e84743	2025-09-02 18:06:31 +05:30
Zuul	9c1d971f01	Merge "Reproduce that only half of the PCI devs are removed"	2025-09-02 11:08:42 +00:00
Rajesh Tailor	19f206f58c	Fix 'nova-manage image_property set' command As of now, if operator wants to set traits using 'nova-manage image_property set' command, it fails with below error, because in ImageMetaProps traits are not stored as individual fields, but stored in 'traits_required' field which is of type list. 'Invalid image property name trait:CUSTOM_XYZ' The setting of traits are handled by _set_attr_from_trait_names method here [1]. This change handles the issue by continue the loop, if the property startswith 'traits' string. [1] https://opendev.org/openstack/nova/src/commit/725a307693806e6e32834198e23be75f771bebc1/nova/objects/image_meta.py#L708-L714 Closes-Bug: #2096341 Change-Id: Ifc20894801f723627726e3c9bed7076144542660 Signed-off-by: Rajesh Tailor <ratailor@redhat.com>	2025-09-02 12:22:55 +05:30
Zuul	539e971126	Merge "Follow-up of AMD SEV-ES support"	2025-09-01 11:59:27 +00:00
Zuul	aed238c064	Merge "Drop CentOS 8 Stream"	2025-09-01 11:30:40 +00:00
Zuul	e700b18f2b	Merge "Replace remaining usage of Ubuntu Jammy"	2025-09-01 11:30:28 +00:00
Zuul	8ddf918a0b	Merge "[test]RPC using threading or eventlet selectively"	2025-09-01 10:11:38 +00:00
Zuul	023c1eab47	Merge "Run unit test with threading mode"	2025-09-01 10:11:11 +00:00
Zuul	29eaf28acc	Merge "Update min support for Flamingo"	2025-08-31 18:13:06 +00:00
Zuul	4301fc390e	Merge "api: Fix validators for hw:cpu_max_* extra specs"	2025-08-31 18:12:45 +00:00
Takashi Kajinami	583d88308f	Replace remaining usage of Ubuntu Jammy Ubuntu Jammy is no longer supported since 2025.2 . Replace it by Ubuntu Noble which is used in the other jobs. Change-Id: I790fb06ede2c41cb80b3d2e8ff7faa7315c84016 Signed-off-by: Takashi Kajinami <kajinamit@oss.nttdata.com>	2025-08-31 16:36:44 +09:00
Zuul	7b8e054bd2	Merge "api: Correct expected errors"	2025-08-29 21:12:29 +00:00
Takashi Kajinami	79846eb0d0	libvirt: Disable VMCoreInfo device for SEV-encrypted instances When VMCoreInfo device is enabled, the QEMU fw_cfg device in guest OS requires DMA between host OS and guest OS through the device. However DMA is prohibited when guest memory is encrypted using SEV, and the attempt results in kernel crash. Do not add VMCoreInfo when memory encryption is enabled. Closes-Bug: #2117170 Change-Id: I05c7b1ae46ccd8d9aa42456b493ac6ee7ddd8bae Signed-off-by: Takashi Kajinami <kajinamit@oss.nttdata.com>	2025-08-29 21:19:10 +09:00
Zuul	07ab08aa69	Merge "Allow to start unit test without eventlet"	2025-08-29 04:57:32 +00:00

1 2 3 4 5 ...

61682 Commits