CVE-2026-24146
Published: 07 April 2026
Summary
CVE-2026-24146 is a high-severity Memory Allocation with Excessive Size Value (CWE-789) vulnerability in Nvidia Triton Inference Server. Its CVSS base score is 7.5 (High).
Operationally, exploitation aligns with the MITRE ATT&CK technique Application or System Exploitation (T1499.004); ranked at the 20.8th percentile by exploit likelihood (below the median); it is not currently listed in the CISA KEV catalog.
The strongest mitigations our analysis identified are NIST 800-53 SC-5 (Denial-of-service Protection) and SI-10 (Information Input Validation).
Threat & Defense at a Glance
Threat & Defense Details
Mitigating Controls (NIST 800-53 r5)AI
Directly addresses the insufficient input validation that allows large numbers of outputs to trigger uncontrolled memory allocation and server crash.
Protects against denial-of-service attacks by limiting the effects of resource exhaustion from malformed inputs causing server crashes.
Ensures timely identification, reporting, and patching of the specific flaw in NVIDIA Triton Inference Server leading to DoS.
MITRE ATT&CK Enterprise TechniquesAI
Why these techniques?
CVE directly describes remote unauthenticated exploitation of an input validation flaw (uncontrolled memory allocation) to crash the Triton server, mapping to application/system exploitation for endpoint DoS.
NVD Description
NVIDIA Triton Inference Server contains a vulnerability where insufficient input validation and a large number of outputs could cause a server crash. A successful exploit of this vulnerability might lead to denial of service.
Deeper analysisAI
CVE-2026-24146 is a vulnerability in NVIDIA Triton Inference Server stemming from insufficient input validation combined with a large number of outputs, which can trigger a server crash. This issue, classified under CWE-789 (Uncontrolled Memory Allocation), carries a CVSS v3.1 base score of 7.5 (AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H), indicating high severity primarily due to its impact on availability.
A remote, unauthenticated attacker can exploit this vulnerability over the network with low complexity and no user interaction required. Successful exploitation results in denial-of-service, causing the server to crash and potentially disrupting inference services until restart.
Official advisories, including NVIDIA's security bulletin at https://nvidia.custhelp.com/app/answers/detail/a_id/5816 and NVD details at https://nvd.nist.gov/vuln/detail/CVE-2026-24146, provide guidance on mitigation; security practitioners should consult these for patching instructions and workarounds specific to affected Triton Inference Server versions.
Details
- CWE(s)