Improving Sentiment Analysis in Persian text based on combination of Stacked Auto-Encoder and Transformer-BiLSTM-CNN
Subject Areas: AI and Robotics
Sina Dami 1 *, MohammadAli Sanagooye Moharrer 2
1 - Department of Computer Engineering, West Tehran Branch, Islamic Azad University, Tehran, Iran
2 - Department of Computer Engineering, West Tehran Branch, Islamic Azad University, Tehran, Iran
Keywords: Sentiment Analysis, Feature Extraction, Transformer, Stacked Auto-Encoder
Abstract:
The expansion of the internet and the growing volume of user-generated textual opinions on a wide range of topics have made sentiment analysis a crucial tool for understanding public attitudes toward different subjects. These insights are invaluable for businesses, policymakers, and society as a whole, but manually analyzing such a volume of data is costly and impractical. This study takes an automated, deep learning approach by combining a Stacked Autoencoder (SAE) for feature extraction with a Transformer-BiLSTM-CNN model for sentiment classification, designed specifically for the Persian language. ParsBERT, the Persian variant of BERT, was used for data preprocessing. The combined approach improved performance on key evaluation metrics, including accuracy, precision, recall, and F1 score, outperforming comparative models such as Transformer-BiLSTM-CNN, SAE-LSTM, and CNN. Results on datasets comprising user reviews from the Taghcheh and Digikala platforms, as well as Persian tweets, confirm the effectiveness of this hybrid model.
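The abstract outlines the pipeline at a high level: ParsBERT-derived features pass through an SAE encoder, then through a Transformer encoder, a BiLSTM, and a CNN before classification. The paper's exact layer sizes and hyperparameters are not given here, so the following PyTorch sketch uses illustrative, assumed dimensions (768-d input features as in BERT-base, and hypothetical hidden sizes) purely to clarify how the stages compose; it is not the authors' implementation.

```python
import torch
import torch.nn as nn

class SAEEncoder(nn.Module):
    """Encoder half of a stacked autoencoder for feature extraction.
    Hidden sizes (512, 256) are illustrative assumptions, not from the paper."""
    def __init__(self, in_dim=768, hidden_dims=(512, 256)):
        super().__init__()
        layers, d = [], in_dim
        for h in hidden_dims:
            layers += [nn.Linear(d, h), nn.ReLU()]
            d = h
        self.encoder = nn.Sequential(*layers)

    def forward(self, x):          # x: (batch, seq_len, in_dim)
        return self.encoder(x)     # -> (batch, seq_len, hidden_dims[-1])

class TransformerBiLSTMCNN(nn.Module):
    """Transformer encoder -> BiLSTM -> 1-D CNN -> classifier head.
    All sizes below are hypothetical placeholders."""
    def __init__(self, feat_dim=256, n_heads=4, lstm_hidden=128,
                 n_filters=64, n_classes=2):
        super().__init__()
        enc_layer = nn.TransformerEncoderLayer(
            d_model=feat_dim, nhead=n_heads, batch_first=True)
        self.transformer = nn.TransformerEncoder(enc_layer, num_layers=1)
        self.bilstm = nn.LSTM(feat_dim, lstm_hidden,
                              batch_first=True, bidirectional=True)
        self.conv = nn.Conv1d(2 * lstm_hidden, n_filters,
                              kernel_size=3, padding=1)
        self.fc = nn.Linear(n_filters, n_classes)

    def forward(self, x):                  # x: (batch, seq_len, feat_dim)
        h = self.transformer(x)            # contextualize the sequence
        h, _ = self.bilstm(h)              # (batch, seq_len, 2*lstm_hidden)
        h = self.conv(h.transpose(1, 2))   # (batch, n_filters, seq_len)
        h = torch.amax(h, dim=2)           # global max pooling over time
        return self.fc(h)                  # sentiment logits

# Toy forward pass: 8 reviews, 32 tokens each, 768-d embedded features.
sae = SAEEncoder()
clf = TransformerBiLSTMCNN()
feats = sae(torch.randn(8, 32, 768))
logits = clf(feats)                        # shape (8, 2)
```

The sketch treats the SAE as a per-token dimensionality reducer feeding the sequence model, which matches the abstract's "SAE for feature extraction, Transformer-BiLSTM-CNN for classification" ordering; how the original work wires these stages in detail is specified in the full paper, not in this abstract.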