Description
html-sanitizer is an allowlist-based HTML cleaner. If using `keep_typographic_whitespace=False` (which is the default), the sanitizer normalizes unicode to the NFKC form at the end. Some unicode characters normalize to chevrons; this allows specially crafted HTML to escape sanitization. The problem has been fixed in 2.4.2.
Published: 2024-05-06
Score: 6.1 Medium
EPSS: < 1% Very Low
KEV: No
Impact: n/a
Action: n/a
AI Analysis

No analysis available yet.

Remediation

No remediation available yet.

Tracking

Sign in to view the affected projects.

Advisories
Source ID Title
Debian DLA Debian DLA DLA-3856-1 python-html-sanitizer security update
EUVD EUVD EUVD-2024-1840 html-sanitizer is an allowlist-based HTML cleaner. If using `keep_typographic_whitespace=False` (which is the default), the sanitizer normalizes unicode to the NFKC form at the end. Some unicode characters normalize to chevrons; this allows specially crafted HTML to escape sanitization. The problem has been fixed in 2.4.2.
Github GHSA Github GHSA GHSA-wvhx-q427-fgh3 Arbitrary HTML present after sanitization because of unicode normalization
History

Mon, 26 Aug 2024 18:30:00 +0000

Type Values Removed Values Added
References

Subscriptions

No data.

cve-icon MITRE

Status: PUBLISHED

Assigner: GitHub_M

Published:

Updated: 2024-08-26T18:03:11.753Z

Reserved: 2024-04-30T06:56:33.383Z

Link: CVE-2024-34078

cve-icon Vulnrichment

Updated: 2024-08-26T18:03:11.753Z

cve-icon NVD

Status : Deferred

Published: 2024-05-06T15:15:24.187

Modified: 2026-06-17T07:32:51.770

Link: CVE-2024-34078

cve-icon Redhat

No data.

cve-icon OpenCVE Enrichment

No data.

Weaknesses
  • CWE-79

    Improper Neutralization of Input During Web Page Generation ('Cross-site Scripting')