VulnerableCode Package Details - pkg:pypi/nltk@3.2.1

Vulnerabilities affecting this package (12)

Vulnerability	Summary	Fixed by
VCID-1n1s-amsg-83aa Aliases: CVE-2024-39705 GHSA-cgvx-9447-vcch PYSEC-2024-167	NLTK through 3.8.1 allows remote code execution if untrusted packages have pickled Python code, and the integrated data package download functionality is used. This affects, for example, averaged_perceptron_tagger and punkt.	3.9 Affected by 7 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-48uj-cw5e-mucw Aliases: CVE-2021-3828 GHSA-2ww3-fxvq-293j PYSEC-2021-356	nltk is vulnerable to Inefficient Regular Expression Complexity	3.6.4 Affected by 10 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-5skj-ygwz-73e6 Aliases: CVE-2026-33231 GHSA-jm6w-m3j8-898g	nltk: NLTK: Denial of Service via unauthenticated remote shutdown	3.9.4 Affected by 0 other vulnerabilities.
VCID-924g-fe71-9uhp Aliases: CVE-2026-33236 GHSA-469j-vmhf-r6v7	nltk: NLTK: Arbitrary file overwrite and creation via path traversal in XML index files	3.9.3 Affected by 3 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-94me-p193-vfb8 Aliases: CVE-2025-14009 GHSA-7p94-766c-hgjp PYSEC-2026-96	A critical vulnerability exists in the NLTK downloader component of nltk/nltk, affecting all versions. The _unzip_iter function in nltk/downloader.py uses zipfile.extractall() without performing path validation or security checks. This allows attackers to craft malicious zip packages that, when downloaded and extracted by NLTK, can execute arbitrary code. The vulnerability arises because NLTK assumes all downloaded packages are trusted and extracts them without validation. If a malicious package contains Python files, such as __init__.py, these files are executed automatically upon import, leading to remote code execution. This issue can result in full system compromise, including file system access, network access, and potential persistence mechanisms.	3.9.3 Affected by 3 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-ajve-q4uj-qffv Aliases: CVE-2021-3842 GHSA-rqjh-jp2r-59cj PYSEC-2022-5	nltk is vulnerable to Inefficient Regular Expression Complexity	3.6.6 Affected by 8 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-c8bp-rz92-53g8 Aliases: GHSA-rf74-v2fm-23pw	Natural Language Toolkit (NLTK) has unbounded recursion in JSONTaggedDecoder.decode_obj() may cause DoS ### Summary `JSONTaggedDecoder.decode_obj()` in `nltk/jsontags.py` calls itself recursively without any depth limit. A deeply nested JSON structure exceeding `sys.getrecursionlimit()` (default: 1000) will raise an unhandled `RecursionError`, crashing the Python process. ### Affected code File: `nltk/jsontags.py`, lines 47–52 ```python @classmethod def decode_obj(cls, obj): if isinstance(obj, dict): obj = {key: cls.decode_obj(val) for (key, val) in obj.items()} elif isinstance(obj, list): obj = list(cls.decode_obj(val) for val in obj) ``` ### Proof of Concept ```python import sys, json from nltk.jsontags import JSONTaggedDecoder depth = sys.getrecursionlimit() + 50 # e.g. 1050 payload = '{"x":' * depth + "null" + "}" * depth # Raises RecursionError, crashing the process json.loads(payload, cls=JSONTaggedDecoder) ``` ### Impact Any code path that passes externally-supplied JSON to `JSONTaggedDecoder` is vulnerable to denial of service. The severity depends on whether such a path exists in the calling code (e.g. `nltk/data.py`). ### Suggested Fix Add a depth parameter with a hard limit: ```python @classmethod def decode_obj(cls, obj, _depth=0): if _depth > 100: raise ValueError("JSON nesting too deep") if isinstance(obj, dict): obj = {key: cls.decode_obj(val, _depth + 1) for (key, val) in obj.items()} elif isinstance(obj, list): obj = list(cls.decode_obj(val, _depth + 1) for val in obj) ```	There are no reported fixed by versions.
VCID-esfz-42mm-x3ad Aliases: CVE-2019-14751 GHSA-mr7p-25v2-35wr PYSEC-2019-106	NLTK Downloader before 3.4.5 is vulnerable to a directory traversal, allowing attackers to write arbitrary files via a ../ (dot dot slash) in an NLTK package (ZIP archive) that is mishandled during extraction.	3.4.5 Affected by 11 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-g2jr-e9d2-qqgz Aliases: CVE-2026-33230 GHSA-gfwx-w7gr-fvh7	nltk: NLTK: Script execution via reflected cross-site scripting in WordNet Browser	3.9.4 Affected by 0 other vulnerabilities.
VCID-muw6-dqdh-u3fb Aliases: CVE-2021-43854 GHSA-f8m6-h2c7-8h9x PYSEC-2021-859	NLTK (Natural Language Toolkit) is a suite of open source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing. Versions prior to 3.6.5 are vulnerable to regular expression denial of service (ReDoS) attacks. The vulnerability is present in PunktSentenceTokenizer, sent_tokenize and word_tokenize. Any users of this class, or these two functions, are vulnerable to the ReDoS attack. In short, a specifically crafted long input to any of these vulnerable functions will cause them to take a significant amount of execution time. If your program relies on any of the vulnerable functions for tokenizing unpredictable user input, then we would strongly recommend upgrading to a version of NLTK without the vulnerability. For users unable to upgrade the execution time can be bounded by limiting the maximum length of an input to any of the vulnerable functions. Our recommendation is to implement such a limit.	3.6.5 Affected by 10 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} 3.6.6 Affected by 8 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-rkj9-d4q7-aqhv Aliases: CVE-2026-0846 GHSA-h8wq-7xc4-p3qx PYSEC-2026-97	A vulnerability in the `filestring()` function of the `nltk.util` module in nltk version 3.9.2 allows arbitrary file read due to improper validation of input paths. The function directly opens files specified by user input without sanitization, enabling attackers to access sensitive system files by providing absolute paths or traversal paths. This vulnerability can be exploited locally or remotely, particularly in scenarios where the function is used in web APIs or other interfaces that accept user-supplied input.	3.9.3 Affected by 3 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}
VCID-un8t-2sde-ekc3 Aliases: CVE-2026-0847 GHSA-68j8-pq59-fqgm PYSEC-2026-98	A vulnerability in NLTK versions up to and including 3.9.2 allows arbitrary file read via path traversal in multiple CorpusReader classes, including WordListCorpusReader, TaggedCorpusReader, and BracketParseCorpusReader. These classes fail to properly sanitize or validate file paths, enabling attackers to traverse directories and access sensitive files on the server. This issue is particularly critical in scenarios where user-controlled file inputs are processed, such as in machine learning APIs, chatbots, or NLP pipelines. Exploitation of this vulnerability can lead to unauthorized access to sensitive files, including system files, SSH private keys, and API tokens, and may potentially escalate to remote code execution when combined with other vulnerabilities.	3.9.3 Affected by 3 other vulnerabilities. This version is affected by these other vulnerabilities: {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }} {{ fixed_by_vuln.vulnerability_id }}

Date	Actor	Action	Vulnerability	Source	VulnerableCode Version
2026-06-06T07:31:25.379859+00:00	GitLab Importer	Affected by	VCID-5skj-ygwz-73e6	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2026-33231.yml	38.6.0
2026-06-06T07:31:18.231708+00:00	GitLab Importer	Affected by	VCID-924g-fe71-9uhp	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2026-33236.yml	38.6.0
2026-06-06T07:30:34.860376+00:00	GitLab Importer	Affected by	VCID-c8bp-rz92-53g8	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/GHSA-rf74-v2fm-23pw.yml	38.6.0
2026-06-06T07:30:07.928429+00:00	GitLab Importer	Affected by	VCID-g2jr-e9d2-qqgz	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2026-33230.yml	38.6.0
2026-06-06T07:14:56.776411+00:00	GitLab Importer	Affected by	VCID-rkj9-d4q7-aqhv	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2026-0846.yml	38.6.0
2026-06-06T07:13:01.808245+00:00	GitLab Importer	Affected by	VCID-un8t-2sde-ekc3	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2026-0847.yml	38.6.0
2026-06-06T06:54:02.541199+00:00	GitLab Importer	Affected by	VCID-94me-p193-vfb8	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2025-14009.yml	38.6.0
2026-06-06T05:09:52.683937+00:00	GitLab Importer	Affected by	VCID-1n1s-amsg-83aa	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2024-39705.yml	38.6.0
2026-06-06T01:16:19.552767+00:00	GitLab Importer	Affected by	VCID-muw6-dqdh-u3fb	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2021-43854.yml	38.6.0
2026-06-06T01:16:03.386517+00:00	GitLab Importer	Affected by	VCID-ajve-q4uj-qffv	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2021-3842.yml	38.6.0
2026-06-06T01:00:56.237285+00:00	GitLab Importer	Affected by	VCID-48uj-cw5e-mucw	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2021-3828.yml	38.6.0
2026-06-05T16:37:07.175479+00:00	GHSA Importer	Affected by	VCID-ajve-q4uj-qffv	https://github.com/advisories/GHSA-rqjh-jp2r-59cj	38.6.0
2026-06-05T16:34:48.992941+00:00	GHSA Importer	Affected by	VCID-48uj-cw5e-mucw	https://github.com/advisories/GHSA-2ww3-fxvq-293j	38.6.0
2026-06-05T16:22:33.367312+00:00	GHSA Importer	Affected by	VCID-esfz-42mm-x3ad	https://github.com/advisories/GHSA-mr7p-25v2-35wr	38.6.0
2026-06-04T20:24:16.813964+00:00	GitLab Importer	Affected by	VCID-esfz-42mm-x3ad	https://gitlab.com/gitlab-org/advisories-community/-/blob/main/pypi/nltk/CVE-2019-14751.yml	38.6.0

Next non-vulnerable version	3.9.4
Latest non-vulnerable version	3.9.4
Risk	4.5