0% Complete
فارسی
Home
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PersianRAG A Retrieval Augmented Generation System for Persian Language
Authors :
Hossein Hosseini
1
Mohammad Sobhan Zare
2
Amir Hossein Mohammadi
3
Arefeh Kazemi
4
Zahra Zojaji
5
Mohammad Ali Nematbakhsh
6
1- دانشگاه اصفهان
2- دانشگاه اصفهان
3- دانشگاه اصفهان
4- دانشگاه اصفهان
5- دانشگاه اصفهان
6- دانشگاه اصفهان
Keywords :
Retrieval Augmented Generation،Large Language Models،Persian،PersianRAG
Abstract :
Retrieval augmented generation (RAG) models, which integrate large-scale pre-trained generative models with external retrieval mechanisms, have shown significant success in various natural language processing (NLP) tasks. However, applying RAG models in Persian language as a low-resource language, poses distinct challenges. These challenges primarily involve the preprocessing, embedding, retrieval, prompt construction, language modeling, and response evaluation of the system. In this paper, we address the challenges towards implementing a real-world RAG system for Persian language called PersianRAG. We propose novel solutions to overcome these obstacles and evaluate our approach using several Persian benchmark datasets. Our experimental results demonstrate the capability of the PersianRAG framework to enhance question answering task in Persian.
Papers List
List of archived papers
A New Method Based on Deep Learning and Time Stabilization of the Propagation Path for Fake News Detection
Fatemeh Torgheh - Dr Mohammad Reza Keyvanpour - Dr Behrooz Masoumi
UltraLearn: Next-Generation CyberSecurity Learning Platform
Saeed Raisi - Saeid Ghasemshirazi - Ghazaleh Shirvani
Presenting an Edge-based Air Quality Management System for Smart City Scenarios
Tina Samizadeh Nikoui - Ali Balador - Amir Masoud Rahmani - Hooman Tabarsaied
Human Resource Allocation to the Credit Requirement Process, A Process Mining Approach
Omid Mahdi Ebadati - Mohammad Mehrabioun - Shokoofeh Sadat Hosseini
A U-Net architecture with graph attention networks to accurately define tooth boundaries
Ehsan Akefi - Hassan Khotanlou
A Deep Neural Network-based Method for MmWave Time-varying Channel Estimation
Amirhossein Molazadeh - Zahra Maroufi - Mehrdad Ardebilipour
Mode Selection and Resource Allocation in D2D-Enabled MC-NOMA using Matching Theory
Alireza Gholamrezaee - Hamid Farrokhi - Javad Zeraatkar Moghaddam
ارائه تکنیک یادگیری چندهسته ای مبتنی بر روش بهینه سازی برای مسئله دسته بندی سیگنال های EEG مبتنی بر تصور حرکتی
یوکابد امیری - حسام عمرانپور
A Hybrid Crow Search and Penguin Optimization Algorithm (CPMM) for Efficient Cloud Workflow Scheduling
Reza Akraminejad - Farhad Kazemipour - Mozhdeh Koreh Davoodi
بکارگیری الگوریتم بهینه سازی فاخته و منطق فازی به منظور بهبود زمانبندی وظایف در محیط محاسبات مه
فاطمه دوامی - حمید جلیلوند - فاطمه نجفی
more
Samin Hamayesh - Version 43.8.0