TY - GEN
T1 - Evaluating the Robustness of Fake News Detectors to Adversarial Attacks with Real User Comments (Extended Abstract)
AU - Koren, Annat
AU - Underwood, Chandler
AU - Serra, Edoardo
AU - Spezzano, Francesca
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The widespread use of social media has led to an increase in false and misleading information presented as legitimate news, also known as fake news. This poses a threat to societal stability and has led to the development of fake news de-tectors that use machine learning to flag suspicious information. However, existing fake news detection models are vulnerable to attacks by malicious actors who can manipulate data to change predictions. Research on attacks on news comments is limited, and current attack models are easily detectable. We propose two new attack strategies that instead use real, pre-existing comments from the same dataset as the news article to fool fake news detectors. Our experimental results show that fake news detectors are less robust to our proposed attack strategies than existing methods using pre-existing human-written comments, as well as a malicious synthetic comment generator.
AB - The widespread use of social media has led to an increase in false and misleading information presented as legitimate news, also known as fake news. This poses a threat to societal stability and has led to the development of fake news de-tectors that use machine learning to flag suspicious information. However, existing fake news detection models are vulnerable to attacks by malicious actors who can manipulate data to change predictions. Research on attacks on news comments is limited, and current attack models are easily detectable. We propose two new attack strategies that instead use real, pre-existing comments from the same dataset as the news article to fool fake news detectors. Our experimental results show that fake news detectors are less robust to our proposed attack strategies than existing methods using pre-existing human-written comments, as well as a malicious synthetic comment generator.
KW - adversarial machine learning
KW - machine learning robustness
KW - misinformation
UR - http://www.scopus.com/inward/record.url?scp=85209407450&partnerID=8YFLogxK
U2 - 10.1109/DSAA61799.2024.10722837
DO - 10.1109/DSAA61799.2024.10722837
M3 - Conference contribution
AN - SCOPUS:85209407450
T3 - 2024 IEEE 11th International Conference on Data Science and Advanced Analytics, DSAA 2024
BT - 2024 IEEE 11th International Conference on Data Science and Advanced Analytics, DSAA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2024
Y2 - 6 October 2024 through 10 October 2024
ER -