CVE-2026-45311 - Vulnerability Details

- CodeWhale: run_tests Tool Enables RCE via Malicious Repository Without Approval

Description

CodeWhale is a DeepSeek + MiMo coding agent in terminal. From 0.3.0 to 0.8.23, the run_tests tool executes cargo test in the workspace with ApprovalRequirement::Auto, meaning it runs without any user approval prompt. cargo test compiles and executes arbitrary code: test binaries, build.rs build scripts, and proc macros. While auto-approving test execution is a deliberate design choice, it creates an inconsistency in the security boundary. However, in a malicious repository, test code can execute arbitrary shell commands, exfiltrate credentials, or establish persistence with zero approval. The attack is amplified by AGENTS.md (auto-loaded into the system prompt), which can instruct the model to run tests proactively at session start. This vulnerability is fixed in 0.8.23.

Published: 2026-05-28

Score: 9.6 Critical

EPSS: < 1% Very Low

KEV: No

Impact:

Action:

Analysis

Impact

CodeWhale's run_tests tool automatically runs cargo test without prompting the user. Cargo test compiles and executes test binaries, build.rs scripts, and proc macros, which allow test code to run arbitrary shell commands. A repository containing malicious test code can thus cause an attacker to execute commands, exfiltrate credentials, or establish persistence on the host. This flaw is an example of CWE‑94. The consequence is that a compromised repository can give an attacker full code execution on the system running CodeWhale, compromising confidentiality, integrity, and availability.

Affected Systems

The vulnerability affects Hmbown:CodeWhale versions from 0.3.0 up to but not including 0.8.23. It has been fixed in release 0.8.23 and all subsequent versions. Users should verify the version in use and upgrade if necessary.

Risk and Exploitability

The CVSS score of 9.6 reflects a highly severe risk. An attacker can supply a malicious repository that contains exploit code in its tests; because run_tests auto‑approves execution, no user interaction is required. The EPSS score is not available and the vulnerability is not in the CISA KEV catalog, indicating that widespread exploitation has not yet been observed. Nonetheless, the design of the tool makes exploitation straightforward for anyone who can place a repository into CodeWhale’s workspace, so the risk is considered high.

Default status is the baseline for the product, each version can override it (e.g. patched versions marked unaffected).

Vendor Product Default status Versions

Hmbown

CodeWhale

affected

Version	Status	Constraints
`>= 0.3.0, < 0.8.23`	affected	—

No data.

Vendor Product Confidence Versions

Hmbown

Codewhale

100%

Version	Status	Scheme	Platform
`[0.3.0,0.8.23)`	affected	semver	—

Found an issue or want to improve our Enrichment? You can suggest it directly by opening an issue on our dedicated GitHub repository .

Remediation

No vendor fix or workaround currently provided.

OpenCVE Recommended Actions

Upgrade CodeWhale to version 0.8.23 or later.
If upgrading is not immediately possible, disable the automatic approval setting for run_tests so that tests require manual confirmation before execution.
Remove or restrict AGENTS.md to prevent the model from automatically invoking run_tests at session start.
Validate any repository before running tests, ensuring it does not contain malicious test code.

Generated by OpenCVE AI on May 28, 2026 at 19:21 UTC.

Tracking

Sign in to view the affected projects.

Advisories

Source	ID	Title
Github GHSA	GHSA-wx44-2q6h-j6p8	DeepSeek TUI: run_tests Tool Enables RCE via Malicious Repository Without Approval

No CVSS v4.0

Attack Vector Network

Attack Complexity Low

Privileges Required None

Scope Changed

Confidentiality Impact High

Integrity Impact High

Availability Impact High

User Interaction Required

No CVSS v3.0

No CVSS v2

This CVE is not in the KEV list.

The EPSS score is 0.00375.

Exploitation poc

Automatable no

Technical Impact total

References

Link	Providers
https://github.com/Hmbown/CodeWhale/security/advisories/GHSA-wx44-2q6h-j6p8

History

Mon, 01 Jun 2026 21:30:00 +0000

Type	Values Removed	Values Added
Metrics		ssvc `{'options': {'Automatable': 'no', 'Exploitation': 'poc', 'Technical Impact': 'total'}, 'version': '2.0.3'}`

Fri, 29 May 2026 16:00:00 +0000

Type	Values Removed	Values Added
First Time appeared		Hmbown Hmbown codewhale
Vendors & Products		Hmbown Hmbown codewhale

Thu, 28 May 2026 18:00:00 +0000

Type	Values Removed	Values Added
Description		CodeWhale is a DeepSeek + MiMo coding agent in terminal. From 0.3.0 to 0.8.23, the run_tests tool executes cargo test in the workspace with ApprovalRequirement::Auto, meaning it runs without any user approval prompt. cargo test compiles and executes arbitrary code: test binaries, build.rs build scripts, and proc macros. While auto-approving test execution is a deliberate design choice, it creates an inconsistency in the security boundary. However, in a malicious repository, test code can execute arbitrary shell commands, exfiltrate credentials, or establish persistence with zero approval. The attack is amplified by AGENTS.md (auto-loaded into the system prompt), which can instruct the model to run tests proactively at session start. This vulnerability is fixed in 0.8.23.
Title		CodeWhale: run_tests Tool Enables RCE via Malicious Repository Without Approval
Weaknesses		CWE-94
References		https://github.com/Hmbown/CodeWhale/security/advisories/GHSA-wx44-2q6h-j6p8
Metrics		cvssV3_1 `{'score': 9.6, 'vector': 'CVSS:3.1/AV:N/AC:L/PR:N/UI:R/S:C/C:H/I:H/A:H'}`

Subscriptions

Hmbown Codewhale

MITRE

Status: PUBLISHED

Assigner: GitHub_M

Published: 2026-05-28T17:32:26.950Z

Updated: 2026-06-01T19:09:50.099Z

Reserved: 2026-05-11T20:50:30.538Z

Link: CVE-2026-45311

Vulnrichment

Updated: 2026-06-01T19:09:14.092Z

NVD

Status : Deferred

Published: 2026-05-28T18:16:35.170

Modified: 2026-06-01T21:16:45.620

Link: CVE-2026-45311

Redhat

No data.

OpenCVE Enrichment

Updated: 2026-05-29T15:48:09Z

Weaknesses

CWE-94
Improper Control of Generation of Code ('Code Injection')

Impact

Affected Systems

Risk and Exploitability

Tracking

Attack Vector Network

Attack Complexity Low

Privileges Required None

Scope Changed

Confidentiality Impact High

Integrity Impact High

Availability Impact High

User Interaction Required

Exploitation poc

Automatable no

Technical Impact total

Subscriptions

JSON object

JSON object

JSON object

JSON object

JSON object