| purl | pkg:pypi/lxml-html-clean@0.3.0 |
| Vulnerability | Summary | Fixed by |
|---|---|---|
| VCID-cawv-npps-aba1 (aliases: CVE-2024-52595, GHSA-5jfw-gq64-q45f, PYSEC-2024-160) | lxml_html_clean is a project for HTML cleaning functionalities copied from `lxml.html.clean`. Prior to version 0.4.0, the HTML parser in lxml does not properly handle context switching for special HTML tags such as `<svg>`, `<math>` and `<noscript>`. This behavior deviates from how web browsers parse and interpret such tags. Specifically, content in CSS comments is ignored by lxml_html_clean but may be interpreted differently by web browsers, enabling malicious scripts to bypass the cleaning process. This vulnerability could lead to Cross-Site Scripting (XSS) attacks, compromising the security of users who rely on lxml_html_clean in its default configuration to sanitize untrusted HTML content. Users employing the HTML cleaner in a security-sensitive context should upgrade to lxml_html_clean 0.4.0, which addresses this issue. As a temporary mitigation, users can configure lxml_html_clean with the following settings to prevent exploitation of this vulnerability: via `remove_tags`, one may specify tags to remove, with their content moved to their parents' tags; via `kill_tags`, one may specify tags to be removed completely; via `allow_tags`, one may restrict the set of permissible tags, excluding context-switching tags like `<svg>`, `<math>` and `<noscript>`. | Affected by 2 other vulnerabilities. |
| VCID-kdjc-yc24-kbdu (aliases: CVE-2026-28350, GHSA-xvp8-3mhv-424c) | lxml_html_clean is a project for HTML cleaning functionalities copied from `lxml.html.clean`. Prior to version 0.4.4, the `<base>` tag passes through the default Cleaner configuration. While `page_structure=True` removes `html`, `head`, and `title` tags, there is no specific handling for `<base>`, allowing an attacker to inject it and hijack relative links on the page. This issue has been patched in version 0.4.4. | Affected by 0 other vulnerabilities. |
| VCID-x913-sjr8-qydu (aliases: CVE-2026-28348, GHSA-hw26-mmpg-fqfg) | lxml_html_clean is a project for HTML cleaning functionalities copied from `lxml.html.clean`. Prior to version 0.4.4, the `_has_sneaky_javascript()` method strips backslashes before checking for dangerous CSS keywords. This causes CSS Unicode escape sequences to bypass the `@import` and `expression()` filters, allowing external CSS loading or XSS in older browsers. This issue has been patched in version 0.4.4. | Affected by 0 other vulnerabilities. |
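The temporary mitigation described for CVE-2024-52595 in the table above could be sketched roughly as follows. This is an illustrative sketch, not an official recommendation: `CONTEXT_SWITCHING_TAGS`, `MITIGATION_KWARGS` and `make_mitigated_cleaner` are hypothetical names, and picking `kill_tags` over `remove_tags` or `allow_tags` is an assumption made here for brevity. It also assumes `lxml_html_clean` is installed.

```python
# Tags the advisory names as context-switching. Removing them together with
# their content (kill_tags) is one of the three mitigations it lists.
CONTEXT_SWITCHING_TAGS = ("svg", "math", "noscript")

# Hypothetical settings dict; only the kill_tags keyword comes from the advisory.
MITIGATION_KWARGS = {"kill_tags": list(CONTEXT_SWITCHING_TAGS)}


def make_mitigated_cleaner():
    """Build a Cleaner that drops the risky tags and everything inside them."""
    # Imported lazily so this module still loads where the package is absent.
    from lxml_html_clean import Cleaner

    return Cleaner(**MITIGATION_KWARGS)
```

A caller would then run something like `make_mitigated_cleaner().clean_html(untrusted_html)`. Upgrading to a fixed release remains the actual remedy; the settings above only narrow the attack surface.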
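To see why an injected `<base>` tag (CVE-2026-28350 above) matters, the following sketch approximates how a browser resolves a relative link with and without a `<base href>`. It uses only the standard library; the URLs and the `resolve` helper are hypothetical and exist only for illustration.

```python
from urllib.parse import urljoin

# Hypothetical URLs, used only for illustration.
PAGE_URL = "https://site.example/account/"
INJECTED_BASE = "https://attacker.example/"


def resolve(link, page_url, base_href=None):
    """Resolve a relative link roughly as a browser would, honouring <base href>."""
    # A <base href> may itself be relative, so it is resolved against the page URL first.
    base = urljoin(page_url, base_href) if base_href else page_url
    return urljoin(base, link)


print(resolve("login", PAGE_URL))                 # https://site.example/account/login
print(resolve("login", PAGE_URL, INJECTED_BASE))  # https://attacker.example/login
```

The same relative `login` link ends up on the attacker's host once the injected `<base>` survives cleaning, which is the hijacking the advisory describes.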
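The backslash problem behind CVE-2026-28348 can be illustrated with a small standard-library sketch. It is not the project's code or its patch: `naive_check` is a simplified stand-in for a filter that strips backslashes before matching keywords, and `decode_css_escapes` and `decoded_check` are hypothetical helpers showing one way to decode CSS escapes before matching.

```python
import re

KEYWORDS = ("@import", "expression(")

# A CSS escape is a backslash followed by 1-6 hex digits (plus one optional
# whitespace character), or a backslash followed by any other single character.
_CSS_ESCAPE = re.compile(r"\\(?:([0-9a-fA-F]{1,6})[ \t\n\r\f]?|(.))", re.DOTALL)


def decode_css_escapes(css):
    """Replace CSS escape sequences with the characters they stand for."""
    def _replace(match):
        hex_digits, literal = match.groups()
        if hex_digits is None:
            return literal
        code_point = int(hex_digits, 16)
        # Zero, surrogates and out-of-range values map to U+FFFD.
        if code_point == 0 or code_point > 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
            return "\ufffd"
        return chr(code_point)

    return _CSS_ESCAPE.sub(_replace, css)


def naive_check(css):
    """Simplified stand-in: drop backslashes, then look for keywords."""
    stripped = css.replace("\\", "").lower()
    return any(keyword in stripped for keyword in KEYWORDS)


def decoded_check(css):
    """Decode escapes first, then look for keywords."""
    decoded = decode_css_escapes(css).lower()
    return any(keyword in decoded for keyword in KEYWORDS)


# \69 is "i" and \65 is "e", so a browser reads these as @import and expression(.
IMPORT_PAYLOAD = r"@\69mport url(https://attacker.example/x.css);"
EXPRESSION_PAYLOAD = r"width: \65xpression(alert(1))"

print(naive_check(IMPORT_PAYLOAD), decoded_check(IMPORT_PAYLOAD))          # False True
print(naive_check(EXPRESSION_PAYLOAD), decoded_check(EXPRESSION_PAYLOAD))  # False True
```

Stripping the backslash leaves the hex digits behind (`@69mport`), so the keyword never matches, whereas decoding the escape recovers the keyword a browser would actually see.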
| Vulnerability | Summary | Aliases |
|---|---|---|
| This package is not known to fix vulnerabilities. | | |
| Date | Actor | Action | Vulnerability | Source | VulnerableCode Version |
|---|---|---|---|---|---|
| 2026-06-12T21:12:30.211413+00:00 | GitLab Importer | Affected by | VCID-x913-sjr8-qydu | https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/lxml-html-clean/CVE-2026-28348.yml | 38.6.0 |
| 2026-06-12T21:11:54.241207+00:00 | GitLab Importer | Affected by | VCID-kdjc-yc24-kbdu | https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/lxml-html-clean/CVE-2026-28350.yml | 38.6.0 |
| 2026-06-12T19:47:10.144547+00:00 | GitLab Importer | Affected by | VCID-cawv-npps-aba1 | https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/lxml-html-clean/CVE-2024-52595.yml | 38.6.0 |
| 2026-06-12T04:19:41.908942+00:00 | Pypa Importer | Affected by | VCID-cawv-npps-aba1 | https://github.com/pypa/advisory-database/blob/main/vulns/lxml-html-clean/PYSEC-2024-160.yaml | 38.6.0 |
| 2026-06-11T21:03:36.831421+00:00 | PyPI Importer | Affected by | VCID-cawv-npps-aba1 | https://osv-vulnerabilities.storage.googleapis.com/PyPI/all.zip | 38.6.0 |