N°24-59: Battle of Transformers: Adversarial Attacks on Financial Sentiment Models

AutorM. Leippold, Aysun Can Turetken

Datum14. Nov. 2024

KategorieWorking Papers

Linkhttps://papers.ssrn.com/sol3/papers.cfm?abstract_id=4977483

Financial sentiment analysis models, which extract meaning from vast amounts of unstructured data, play a crucial role in sentiment-driven financial decisions. However, the complex and domain-specific language used in finance poses unique challenges for adversarial attacks. To address these challenges, we propose a novel, white-box attack methodology leveraging a pre-trained general-purpose language model (GPT-4o). We employ carefully designed instructions and incorporate a new loss function based on embedding similarity to ensure semantic coherence while producing syntactically diverse samples. Our experimental results demonstrate that both FinBERT and Fin-GPT, leading models in financial sentiment analysis, exhibit significant susceptibility to our proposed adversarial attacks. Specifically, the sentiment predictions of these models were successfully altered for a substantial proportion of the samples across three public datasets, including Financial Phrase Bank (FPB), Twitter Financial News Sentiment (TFNS), and Sentimence and Entity Annotated Financial News (SEntFiN). Our findings emphasize the need for enhanced robustness in financial classification models against adversarially targeted attacks. By understanding and addressing these vulnerabilities, it is possible to improve the reliability and security of automated financial systems.

Vorherige Publikation

Die Rolle von KI bei der Trans...

Nächste Publikation

Ein Leitfaden durch die Welt d...

Swiss Finance Institute - Genf

Universität Genf
42, Bd du Pont d`Arve
CH-1211 Genf 4

Tel:	+41 22 379 84 71
Fax:	+41 22 379 84 77
E-Mail:	research@sfi.ch
	phd@sfi.ch

Swiss Finance Institute - Zürich

Walchestrasse 9
CH-8006 Zürich

Tel:	+41 44 254 30 80
Fax:	+41 44 254 30 85
E-Mail:	info@sfi.ch

N°24-59: Battle of Transformers: Adversarial Attacks on Financial Sentiment Models