CVE-2026-5497

Modified Modified - Updated After Analysis

Out-of-Memory DoS in vLLM via Unbounded Frame Processing

Publication date: 2026-06-11

Last updated on: 2026-07-15

Assigner: huntr.dev

Description

vLLM versions 0.8.0 and later are vulnerable to an Out-of-Memory (OOM) Denial of Service (DoS) attack due to unbounded frame count processing in the `VideoMediaIO.load_base64()` method. When processing `video/jpeg` data URLs, the method splits the base64 data string on commas to extract individual JPEG frames without enforcing a frame count limit. An attacker can exploit this by crafting a single API request containing thousands of comma-separated base64-encoded JPEG frames in a data URL, causing the server to decode all frames into memory and crash due to excessive memory consumption. This vulnerability is reachable via the OpenAI-compatible chat completions API and does not require authentication.

CVSS Scores

EPSS Scores

Probability:
Percentile:

Meta Information

Published

2026-06-11

Last Modified

2026-07-15

Generated

2026-07-22

AI Q&A

2026-06-11

EPSS Evaluated

2026-07-20

NVD

CVE-2026-5497

EUVD

EUVD-2026-36217

Affected Vendors & Products

Vendor	Product	Version / Range
vllm	vllm	From 0.8.0 (inc) to 0.19.0 (exc)

Helpful Resources

Exploitability

CWE

KEV

CWE ID	Description
CWE-400	The product does not properly control the allocation and maintenance of a limited resource.
CWE-770	The product allocates a reusable resource or group of resources on behalf of an actor without imposing any intended restrictions on the size or number of resources that can be allocated.

Attack-Flow Graph

Executive Summary

vLLM versions 0.8.0 and later have a vulnerability in the VideoMediaIO.load_base64() method that processes video/jpeg data URLs. This method splits the base64 data string on commas to extract individual JPEG frames but does not limit the number of frames it processes.

An attacker can exploit this by sending a single API request containing thousands of comma-separated base64-encoded JPEG frames. The server attempts to decode all these frames into memory, which can cause it to run out of memory and crash, resulting in a Denial of Service (DoS) condition.

This vulnerability is reachable via the OpenAI-compatible chat completions API and does not require authentication.

Detection Guidance

This vulnerability can be detected by monitoring for unusually large or suspicious API requests to the OpenAI-compatible chat completions API that contain data URLs with a very high number of comma-separated base64-encoded JPEG frames.

Specifically, detection involves inspecting incoming requests for the presence of 'video/jpeg' data URLs with an excessive number of frames, which can be identified by counting the commas in the base64 data string.

While no explicit commands are provided in the resources, a possible approach is to use network traffic inspection tools or log analysis to filter and count frames in the base64 data URLs.

Use tools like tcpdump or Wireshark to capture API request traffic.
Extract and analyze the payloads containing 'video/jpeg' data URLs.
Count the number of commas in the base64 string to estimate the number of frames.
Example command snippet (Linux shell) to count frames in a captured payload: `echo '<base64_data_string>' | grep -o ',' | wc -l`

Impact Analysis

This vulnerability can cause a Denial of Service (DoS) by exhausting the server's memory resources. An attacker can crash the server by sending a specially crafted request with a large number of base64-encoded JPEG frames, causing the server to consume excessive memory and become unresponsive.

As a result, legitimate users may be unable to access the service while the server is down or recovering from the crash.

Compliance Impact

The provided information does not specify any direct impact of this vulnerability on compliance with common standards and regulations such as GDPR or HIPAA.

Mitigation Strategies

Immediate mitigation involves applying the security fix introduced in the vLLM project that enforces a configurable frame limit on the number of base64-encoded JPEG frames processed by the VideoMediaIO.load_base64() method.

Specifically, update to a version of vLLM that includes the patch which adds the `num_frames` parameter to limit the number of frames processed, preventing unbounded memory consumption.

If updating immediately is not possible, consider implementing request filtering or rate limiting on the API endpoint to block or throttle requests containing excessively large or numerous frames.

Upgrade vLLM to a version including the patch that enforces frame limits (see commit 58ee61422169ce17e08248f8efa1e9df434fe395).
Configure the `num_frames` parameter to a safe positive value to limit frame processing.
Implement API request validation to reject or limit requests with large numbers of frames.
Monitor and rate-limit incoming requests to prevent abuse.

Hi! I’m here to help you understand CVE-2026-5497. Ask me anything about the vulnerability, its impact, or mitigation strategies.

0/70