Impact
A specially crafted 1x1 pixel image can be supplied to the Idefics3 vision model implementation within vLLM, causing a tensor dimension mismatch that triggers an unhandled runtime error. The error results in the termination of the entire server process, denying all legitimate requests. The flaw is classified under CWE-770, indicating a denial‑of‑service vulnerability due to improper resource management.
Affected Systems
vLLM, released by the vllm-project, is affected in versions ranging from 0.6.4 up to but not including 0.12.0. The vulnerability applies to the multimodal model handling pathways that engage the Idefics3 vision component.
Risk and Exploitability
The CVSS score of 6.5 classifies the issue as a moderate severity vulnerability. The EPSS score of less than 1% indicates a low likelihood of exploitation at the current time, and the vulnerability is not listed in the CISA KEV catalog. The attack vector is inferred to be remote, where an attacker can send a crafted image via the exposed API endpoints to trigger the crash. An unpatched server could experience repeated service outages until the patch is applied.
OpenCVE Enrichment
Github GHSA