CVE-2024-5206

Sensitive Data Leakage in sklearn.feature_extraction.text.TfidfVectorizer in scikit-learn/scikit-learn

Description

A sensitive data leakage vulnerability was identified in scikit-learn's TfidfVectorizer, specifically in versions up to and including 1.4.1.post1, which was fixed in version 1.5.0. The vulnerability arises from the unexpected storage of all tokens present in the training data within the `stop_words_` attribute, rather than only storing the subset of tokens required for the TF-IDF technique to function. This behavior leads to the potential leakage of sensitive information, as the `stop_words_` attribute could contain tokens that were meant to be discarded and not stored, such as passwords or keys. The impact of this vulnerability varies based on the nature of the data being processed by the vectorizer.

Categories

4.7
CVSS
Severity: Medium
CVSS 3.1 •
CVSS 3.0 •
EPSS 0.03%
Third-Party Advisory huntr.com
Affected: scikit-learn scikit-learn/scikit-learn
Published at:
Updated at:

References

Frequently Asked Questions

What is the severity of CVE-2024-5206?
CVE-2024-5206 has been scored as a medium severity vulnerability.
How to fix CVE-2024-5206?
To fix CVE-2024-5206, make sure you are using an up-to-date version of the affected component(s) by checking the vendor release notes. As for now, there are no other specific guidelines available.
Is CVE-2024-5206 being actively exploited in the wild?
As for now, there are no information to confirm that CVE-2024-5206 is being actively exploited. According to its EPSS score, there is a ~0% probability that this vulnerability will be exploited by malicious actors in the next 30 days.
What software or system is affected by CVE-2024-5206?
CVE-2024-5206 affects scikit-learn scikit-learn/scikit-learn.
This platform uses data from the NIST NVD, MITRE CVE, MITRE CWE, First.org and CISA KEV but is not endorsed or certified by these entities. CVE is a registred trademark of the MITRE Corporation and the authoritative source of CVE content is MITRE's CVE web site. CWE is a registred trademark of the MITRE Corporation and the authoritative source of CWE content is MITRE's CWE web site.
© 2025 Under My Watch. All Rights Reserved.