CVE-2025-23311

Critical

Published: 06 August 2025

Published

06 August 2025

Modified

12 August 2025

KEV Added

—

Patch

—

CVSS Score v3.1 9.8 CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H

EPSS Score 0.0167 82.5th percentile

Risk Priority 21 60% EPSS · 20% KEV · 20% CVSS

Summary

CVE-2025-23311 is a critical-severity Stack-based Buffer Overflow (CWE-121) vulnerability in Nvidia Triton Inference Server. Its CVSS base score is 9.8 (Critical).

Operationally, exploitation aligns with the MITRE ATT&CK technique Exploit Public-Facing Application (T1190); ranked in the top 17.5% of CVEs by exploit likelihood; it is not currently listed in the CISA KEV catalog.

The strongest mitigations our analysis identified are NIST 800-53 SI-2 (Flaw Remediation) and SI-10 (Information Input Validation).

Deeper analysis

NVIDIA Triton Inference Server is affected by CVE-2025-23311, a stack-based buffer overflow vulnerability (CWE-121) that can be triggered by specially crafted HTTP requests. The flaw carries a CVSS 3.1 score of 9.8 and may result in remote code execution, denial of service, information disclosure, or data tampering.

An unauthenticated attacker with network access can exploit the issue by sending malicious HTTP requests, potentially compromising the inference server without any user interaction or credentials.

The referenced NVIDIA security advisory at nvidia.custhelp.com provides official guidance on the vulnerability. EPSS remains low and flat at 0.0167 with no observed increase after disclosure.

EU & UK References

🇪🇺 ENISA EUVD: EUVD-2025-23840

Vulnerability details

NVIDIA Triton Inference Server contains a vulnerability where an attacker could cause a stack overflow through specially crafted HTTP requests. A successful exploit of this vulnerability might lead to remote code execution, denial of service, information disclosure, or data tampering.

CWE(s): CWE-121

Related Threats

MITRE ATT&CK Enterprise TechniquesAI

T1190 Exploit Public-Facing Application Initial Access

Adversaries may attempt to exploit a weakness in an Internet-facing host or system to initially access a network.

attack.mitre.org →

Why these techniques?

Stack-based buffer overflow in public-facing Triton Inference Server triggered by crafted HTTP requests directly enables remote code execution via T1190 Exploit Public-Facing Application.

Confidence: HIGH · MITRE ATT&CK Enterprise v18.1

CVEs Like This One

CVE-2025-23310Same product: Linux Linux Kernel

CVE-2025-23319Same product: Linux Linux Kernel

CVE-2025-23318Same product: Linux Linux Kernel

CVE-2025-23317Same product: Linux Linux Kernel

CVE-2025-23316Same product: Linux Linux Kernel

CVE-2026-24208Same product: Linux Linux Kernel

CVE-2026-24206Same product: Linux Linux Kernel

CVE-2026-24207Same product: Linux Linux Kernel

CVE-2026-24209Same product: Linux Linux Kernel

CVE-2026-28710Same product: Linux Linux Kernel

Affected Assets

nvidia

triton inference server

≤ 25.07

Mitigating Controls

Mitigating Controls (NIST 800-53 r5) AI

SI-2 Flaw Remediation good match

prevent

Directly requires identification, reporting, and timely remediation of the stack-based buffer overflow flaw in NVIDIA Triton Inference Server via patching.

SI-10 Information Input Validation partial match

prevent

Mandates validation of HTTP request inputs to block specially crafted requests that trigger the stack overflow vulnerability.

SI-16 Memory Protection partial match

prevent

Implements memory protection mechanisms such as stack canaries, ASLR, and non-executable stacks to mitigate exploitation of the stack overflow for RCE or data tampering.

References

https://nvd.nist.gov/vuln/detail/CVE-2025-23311
US Government Resource · psirt@nvidia.com
https://nvidia.custhelp.com/app/answers/detail/a_id/5687
Vendor Advisory · psirt@nvidia.com
https://www.cve.org/CVERecord?id=CVE-2025-23311
Third Party Advisory · psirt@nvidia.com