Report: CVE-2026-34760 - vLLM: Downmix Implementation Differences as Attack Vectors Against Audio AI Models - 2025 Update

Report: CVE-2026-34760 - vLLM: Downmix Implementation Differences as Attack Vectors Against Audio AI Models - 2025 Update

CVE ID :CVE-2026-34760 Published : April 2, 2026, 8:16 p.m. | 1 hour, 3 minutes ago Description :vLLM is an inference and serving engine for large language models (LLMs). From version 0.5.5 to before version 0.18.0, Librosa defaults to using numpy.mean for mono downmixing (to_mono), while the international standard ITU-R BS.775-4 specifies a weighted downmixing algorithm. This discrepancy results in inconsistency between audio heard by humans (e.g., through headphones/regular speakers) and audio processed by AI models (Which infra via Librosa, such as vllm, transformer). This issue has been patched in version 0.18.0. Severity: 5.9 | MEDIUM Visit the link for more details, such as CVSS details, affected products, timeline, and more...

CVE Details

Severity
MEDIUM
Published
April 2, 2026