CVE-2021-43854 - Vulnerability Details

NLTK (Natural Language Toolkit) is a suite of open source Python modules, data sets, and tutorials supporting research and development in natural language processing. Versions prior to 3.6.5 are vulnerable to regular expression denial of service (ReDoS) attacks. The vulnerability is present in the PunktSentenceTokenizer class and in the sent_tokenize and word_tokenize functions; any code that uses this class or these two functions is exposed. In short, a specially crafted long input to any of them causes a significant amount of execution time. If your program relies on any of the vulnerable functions to tokenize unpredictable user input, upgrading to NLTK 3.6.5 or later is strongly recommended. For users unable to upgrade, the execution time can be bounded by limiting the maximum length of any input passed to the vulnerable functions; implementing such a limit is the recommended workaround.
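The length limit described above could be sketched roughly as follows. This is a minimal illustration, not the NLTK maintainers' fix: the wrapper name `bounded_tokenize` and the `MAX_INPUT_CHARS` value are hypothetical, and a suitable limit depends on the application. The tokenizer is passed in as a parameter so the sketch does not depend on a particular NLTK version.

```python
from typing import Callable, List

# Hypothetical cap; choose a value that fits the inputs your application expects.
MAX_INPUT_CHARS = 10_000


def bounded_tokenize(
    text: str,
    tokenizer: Callable[[str], List[str]],
    max_chars: int = MAX_INPUT_CHARS,
) -> List[str]:
    """Reject over-long input before it reaches a potentially slow tokenizer."""
    if len(text) > max_chars:
        raise ValueError(
            f"input of {len(text)} characters exceeds the limit of {max_chars}"
        )
    return tokenizer(text)
```

With NLTK installed, a call might look like `bounded_tokenize(user_text, nltk.sent_tokenize)`. Rejecting the input outright is one design choice; truncating or splitting it into chunks would also bound execution time, at the cost of changing what gets tokenized.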

No CVSS v4.0

CVSS v3.1

Metric	Value
Attack Vector	Network
Attack Complexity	Low
Privileges Required	None
Scope	Unchanged
Confidentiality Impact	None
Integrity Impact	None
Availability Impact	High
User Interaction	None
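The base metrics listed above (Attack Vector through User Interaction) belong to the CVSS v3.x metric set, and a base score can be derived from them. The sketch below applies the scope-unchanged base score formula and metric weights from the CVSS v3.1 specification; reading the record as v3.1 is an assumption based on the page reporting no v3.0 score, and the function names are illustrative only.

```python
import math

# Metric weights from the CVSS v3.1 specification (scope-unchanged case).
AV = {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2}
AC = {"L": 0.77, "H": 0.44}
PR_UNCHANGED = {"N": 0.85, "L": 0.62, "H": 0.27}
UI = {"N": 0.85, "R": 0.62}
CIA = {"H": 0.56, "L": 0.22, "N": 0.0}


def roundup(value: float) -> float:
    """Round up to one decimal place, as defined in the v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score_scope_unchanged(av, ac, pr, ui, c, i, a) -> float:
    """Compute the CVSS v3.1 base score for a vector with Scope: Unchanged."""
    iss = 1 - (1 - CIA[c]) * (1 - CIA[i]) * (1 - CIA[a])
    impact = 6.42 * iss
    exploitability = 8.22 * AV[av] * AC[ac] * PR_UNCHANGED[pr] * UI[ui]
    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


# AV:N / AC:L / PR:N / UI:N / S:U / C:N / I:N / A:H
score = base_score_scope_unchanged("N", "L", "N", "N", "N", "N", "H")
```

For the metric values in this record, the formula yields a base score of 7.5.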

No CVSS v3.0

CVSS v2

Metric	Value
Access Vector	Network
Access Complexity	Low
Authentication	None
Confidentiality Impact	None
Integrity Impact	None
Availability Impact	Partial

This CVE is not in the KEV list.

The EPSS score is 0.00836.

Key SSVC decision points have not yet been added.

Default status is the baseline for the product; each version can override it (e.g. patched versions are marked unaffected).

Vendor	Product	Default status
nltk	nltk	affected

Versions

Version	Status	Constraints
`< 3.6.5`	affected	—
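A quick way to see whether an environment falls in the affected range is to compare the installed NLTK version against 3.6.5. The following is a rough sketch under stated assumptions: the helper names are hypothetical, only the leading numeric part of the version string is compared, and pre-release or local suffixes are ignored, so a string such as `3.6.5rc1` is treated like `3.6.5`.

```python
import re
from importlib import metadata
from typing import Optional, Tuple

FIRST_FIXED = (3, 6, 5)  # first unaffected release per this record


def parse_release(version: str) -> Tuple[int, ...]:
    """Extract the leading dotted-integer part of a version string."""
    match = re.match(r"\d+(?:\.\d+)*", version.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def is_affected(version: str) -> bool:
    """Return True if the version is below 3.6.5."""
    return parse_release(version) < FIRST_FIXED


def installed_nltk_version() -> Optional[str]:
    """Return the installed NLTK version, or None if NLTK is not installed."""
    try:
        return metadata.version("nltk")
    except metadata.PackageNotFoundError:
        return None


current = installed_nltk_version()
if current is not None and is_affected(current):
    print(f"nltk {current} is in the affected range (< 3.6.5)")
```

For stricter comparisons that account for pre-releases, a dedicated version-parsing library would be a better fit than this tuple comparison.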

Configuration 1

cpe:2.3:a:nltk:nltk:*:*:*:*:*:*:*:*

Advisories

Source	ID	Title
EUVD	EUVD-2021-0148	NLTK (Natural Language Toolkit) is a suite of open source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing. Versions prior to 3.6.5 are vulnerable to regular expression denial of service (ReDoS) attacks. The vulnerability is present in PunktSentenceTokenizer, sent_tokenize and word_tokenize. Any users of this class, or these two functions, are vulnerable to the ReDoS attack. In short, a specifically crafted long input to any of these vulnerable functions will cause them to take a significant amount of execution time. If your program relies on any of the vulnerable functions for tokenizing unpredictable user input, then we would strongly recommend upgrading to a version of NLTK without the vulnerability. For users unable to upgrade the execution time can be bounded by limiting the maximum length of an input to any of the vulnerable functions. Our recommendation is to implement such a limit.
Github GHSA	GHSA-f8m6-h2c7-8h9x	Inefficient Regular Expression Complexity in nltk (word_tokenize, sent_tokenize)
Ubuntu USN	USN-7365-1	NLTK vulnerabilities

Fixes

Solution

No solution given by the vendor.

Workaround

No workaround given by the vendor.

References

Link	Providers
https://github.com/nltk/nltk/commit/1405aad979c6b8080dbbc8e0858f89b2e3690341
https://github.com/nltk/nltk/issues/2866
https://github.com/nltk/nltk/pull/2869
https://github.com/nltk/nltk/security/advisories/GHSA-f8m6-h2c7-8h9x

History

No history.

MITRE

Status: PUBLISHED

Assigner: GitHub_M

Published: 2021-12-23T17:55:11.000Z

Updated: 2024-08-04T04:10:16.281Z

Reserved: 2021-11-16T00:00:00.000Z

Link: CVE-2021-43854

Vulnrichment

No data.

NVD

Status: Modified

Published: 2021-12-23T18:15:07.327

Modified: 2024-11-21T06:29:56.200

Link: CVE-2021-43854

Redhat

No data.

OpenCVE Enrichment

No data.

Weaknesses

CWE-400
