Description
CodeWhale is a DeepSeek + MiMo coding agent in terminal. From 0.3.0 to 0.8.23, the run_tests tool executes cargo test in the workspace with ApprovalRequirement::Auto, meaning it runs without any user approval prompt. cargo test compiles and executes arbitrary code: test binaries, build.rs build scripts, and proc macros. While auto-approving test execution is a deliberate design choice, it creates an inconsistency in the security boundary. However, in a malicious repository, test code can execute arbitrary shell commands, exfiltrate credentials, or establish persistence with zero approval. The attack is amplified by AGENTS.md (auto-loaded into the system prompt), which can instruct the model to run tests proactively at session start. This vulnerability is fixed in 0.8.23.
Published: 2026-05-28
Score: 9.6 Critical
EPSS: n/a
KEV: No
Impact: n/a
Action: n/a
AI Analysis

Impact

CodeWhale's run_tests tool automatically runs cargo test without prompting the user. Cargo test compiles and executes test binaries, build.rs scripts, and proc macros, which allow test code to run arbitrary shell commands. A repository containing malicious test code can thus cause an attacker to execute commands, exfiltrate credentials, or establish persistence on the host. This flaw is an example of CWE‑94. The consequence is that a compromised repository can give an attacker full code execution on the system running CodeWhale, compromising confidentiality, integrity, and availability.

Affected Systems

The vulnerability affects Hmbown:CodeWhale versions from 0.3.0 up to but not including 0.8.23. It has been fixed in release 0.8.23 and all subsequent versions. Users should verify the version in use and upgrade if necessary.

Risk and Exploitability

The CVSS score of 9.6 reflects a highly severe risk. An attacker can supply a malicious repository that contains exploit code in its tests; because run_tests auto‑approves execution, no user interaction is required. The EPSS score is not available and the vulnerability is not in the CISA KEV catalog, indicating that widespread exploitation has not yet been observed. Nonetheless, the design of the tool makes exploitation straightforward for anyone who can place a repository into CodeWhale’s workspace, so the risk is considered high.

Generated by OpenCVE AI on May 28, 2026 at 19:21 UTC.

Remediation

No vendor fix or workaround currently provided.

OpenCVE Recommended Actions

  • Upgrade CodeWhale to version 0.8.23 or later.
  • If upgrading is not immediately possible, disable the automatic approval setting for run_tests so that tests require manual confirmation before execution.
  • Remove or restrict AGENTS.md to prevent the model from automatically invoking run_tests at session start.
  • Validate any repository before running tests, ensuring it does not contain malicious test code.

Generated by OpenCVE AI on May 28, 2026 at 19:21 UTC.

Tracking

Sign in to view the affected projects.

Advisories
Source ID Title
Github GHSA Github GHSA GHSA-wx44-2q6h-j6p8 DeepSeek TUI: run_tests Tool Enables RCE via Malicious Repository Without Approval
History

Thu, 28 May 2026 18:00:00 +0000

Type Values Removed Values Added
Description CodeWhale is a DeepSeek + MiMo coding agent in terminal. From 0.3.0 to 0.8.23, the run_tests tool executes cargo test in the workspace with ApprovalRequirement::Auto, meaning it runs without any user approval prompt. cargo test compiles and executes arbitrary code: test binaries, build.rs build scripts, and proc macros. While auto-approving test execution is a deliberate design choice, it creates an inconsistency in the security boundary. However, in a malicious repository, test code can execute arbitrary shell commands, exfiltrate credentials, or establish persistence with zero approval. The attack is amplified by AGENTS.md (auto-loaded into the system prompt), which can instruct the model to run tests proactively at session start. This vulnerability is fixed in 0.8.23.
Title CodeWhale: run_tests Tool Enables RCE via Malicious Repository Without Approval
Weaknesses CWE-94
References
Metrics cvssV3_1

{'score': 9.6, 'vector': 'CVSS:3.1/AV:N/AC:L/PR:N/UI:R/S:C/C:H/I:H/A:H'}


Subscriptions

No data.

cve-icon MITRE

Status: PUBLISHED

Assigner: GitHub_M

Published:

Updated: 2026-05-28T17:32:26.950Z

Reserved: 2026-05-11T20:50:30.538Z

Link: CVE-2026-45311

cve-icon Vulnrichment

No data.

cve-icon NVD

Status : Deferred

Published: 2026-05-28T18:16:35.170

Modified: 2026-05-28T18:40:37.990

Link: CVE-2026-45311

cve-icon Redhat

No data.

cve-icon OpenCVE Enrichment

Updated: 2026-05-28T19:30:16Z

Weaknesses