CVE-2024-31076
Severity CVSS v4.0:
Pending analysis
Type:
Unavailable / Other
Publication date:
21/06/2024
Last modified:
15/07/2024
Description
In the Linux kernel, the following vulnerability has been resolved:<br />
<br />
genirq/cpuhotplug, x86/vector: Prevent vector leak during CPU offline<br />
<br />
The absence of IRQD_MOVE_PCNTXT prevents immediate effectiveness of<br />
interrupt affinity reconfiguration via procfs. Instead, the change is<br />
deferred until the next instance of the interrupt being triggered on the<br />
original CPU.<br />
<br />
When the interrupt next triggers on the original CPU, the new affinity is<br />
enforced within __irq_move_irq(). A vector is allocated from the new CPU,<br />
but the old vector on the original CPU remains and is not immediately<br />
reclaimed. Instead, apicd->move_in_progress is flagged, and the reclaiming<br />
process is delayed until the next trigger of the interrupt on the new CPU.<br />
<br />
Upon the subsequent triggering of the interrupt on the new CPU,<br />
irq_complete_move() adds a task to the old CPU&#39;s vector_cleanup list if it<br />
remains online. Subsequently, the timer on the old CPU iterates over its<br />
vector_cleanup list, reclaiming old vectors.<br />
<br />
However, a rare scenario arises if the old CPU is outgoing before the<br />
interrupt triggers again on the new CPU.<br />
<br />
In that case irq_force_complete_move() is not invoked on the outgoing CPU<br />
to reclaim the old apicd->prev_vector because the interrupt isn&#39;t currently<br />
affine to the outgoing CPU, and irq_needs_fixup() returns false. Even<br />
though __vector_schedule_cleanup() is later called on the new CPU, it<br />
doesn&#39;t reclaim apicd->prev_vector; instead, it simply resets both<br />
apicd->move_in_progress and apicd->prev_vector to 0.<br />
<br />
As a result, the vector remains unreclaimed in vector_matrix, leading to a<br />
CPU vector leak.<br />
<br />
To address this issue, move the invocation of irq_force_complete_move()<br />
before the irq_needs_fixup() call to reclaim apicd->prev_vector, if the<br />
interrupt is currently or used to be affine to the outgoing CPU.<br />
<br />
Additionally, reclaim the vector in __vector_schedule_cleanup() as well,<br />
following a warning message, although theoretically it should never see<br />
apicd->move_in_progress with apicd->prev_cpu pointing to an offline CPU.
Impact
References to Advisories, Solutions, and Tools
- https://git.kernel.org/stable/c/59f86a2908380d09cdc726461c0fbb8d8579c99f
- https://git.kernel.org/stable/c/6752dfcfff3ac3e16625ebd3f0ad9630900e7e76
- https://git.kernel.org/stable/c/9eeda3e0071a329af1eba15f4e57dc39576bb420
- https://git.kernel.org/stable/c/a40209d355afe4ed6d533507838c9e5cd70a76d8
- https://git.kernel.org/stable/c/a6c11c0a5235fb144a65e0cb2ffd360ddc1f6c32
- https://git.kernel.org/stable/c/e9c96d01d520498b169ce734a8ad1142bef86a30
- https://git.kernel.org/stable/c/ebfb16fc057a016abb46a9720a54abf0d4f6abe1
- https://git.kernel.org/stable/c/f5f4675960609d8c5ee95f027fbf6ce380f98372