SOTAVerified

Financial Sentiment Analysis: Leveraging Actual and Synthetic Data for Supervised Fine-tuning

2024-12-13Code Available0· sign in to hype

Abraham Atsiwo

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

The Efficient Market Hypothesis (EMH) highlights the essence of financial news in stock price movement. Financial news comes in the form of corporate announcements, news titles, and other forms of digital text. The generation of insights from financial news can be done with sentiment analysis. General-purpose language models are too general for sentiment analysis in finance. Curated labeled data for fine-tuning general-purpose language models are scare, and existing fine-tuned models for sentiment analysis in finance do not capture the maximum context width. We hypothesize that using actual and synthetic data can improve performance. We introduce BertNSP-finance to concatenate shorter financial sentences into longer financial sentences, and finbert-lc to determine sentiment from digital text. The results show improved performance on the accuracy and the f1 score for the financial phrasebank data with 50\% and 100\% agreement levels.

Tasks

Reproductions