html-sanitizer is an allowlist-based HTML cleaner. If using `keep_typographic_whitespace=False` (which is the default), the sanitizer normalizes unicode to the NFKC form at the end. Some unicode characters normalize to chevrons; this allows specially crafted HTML to escape sanitization. The problem has been fixed in 2.4.2.
Advisories
Source ID Title
Debian DLA Debian DLA DLA-3856-1 python-html-sanitizer security update
EUVD EUVD EUVD-2024-1840 html-sanitizer is an allowlist-based HTML cleaner. If using `keep_typographic_whitespace=False` (which is the default), the sanitizer normalizes unicode to the NFKC form at the end. Some unicode characters normalize to chevrons; this allows specially crafted HTML to escape sanitization. The problem has been fixed in 2.4.2.
Github GHSA Github GHSA GHSA-wvhx-q427-fgh3 Arbitrary HTML present after sanitization because of unicode normalization
Fixes

Solution

No solution given by the vendor.


Workaround

No workaround given by the vendor.

History

Mon, 26 Aug 2024 18:30:00 +0000

Type Values Removed Values Added
References

Projects

Sign in to view the affected projects.

cve-icon MITRE

Status: PUBLISHED

Assigner: GitHub_M

Published:

Updated: 2024-08-26T18:03:11.753Z

Reserved: 2024-04-30T06:56:33.383Z

Link: CVE-2024-34078

cve-icon Vulnrichment

Updated: 2024-08-26T18:03:11.753Z

cve-icon NVD

Status : Awaiting Analysis

Published: 2024-05-06T15:15:24.187

Modified: 2024-11-21T09:18:02.690

Link: CVE-2024-34078

cve-icon Redhat

No data.

cve-icon OpenCVE Enrichment

No data.

Weaknesses