0% Complete
English
صفحه اصلی
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PersianRAG A Retrieval Augmented Generation System for Persian Language
نویسندگان :
Hossein Hosseini
1
Mohammad Sobhan Zare
2
Amir Hossein Mohammadi
3
Arefeh Kazemi
4
Zahra Zojaji
5
Mohammad Ali Nematbakhsh
6
1- دانشگاه اصفهان
2- دانشگاه اصفهان
3- دانشگاه اصفهان
4- دانشگاه اصفهان
5- دانشگاه اصفهان
6- دانشگاه اصفهان
کلمات کلیدی :
Retrieval Augmented Generation،Large Language Models،Persian،PersianRAG
چکیده :
Retrieval augmented generation (RAG) models, which integrate large-scale pre-trained generative models with external retrieval mechanisms, have shown significant success in various natural language processing (NLP) tasks. However, applying RAG models in Persian language as a low-resource language, poses distinct challenges. These challenges primarily involve the preprocessing, embedding, retrieval, prompt construction, language modeling, and response evaluation of the system. In this paper, we address the challenges towards implementing a real-world RAG system for Persian language called PersianRAG. We propose novel solutions to overcome these obstacles and evaluate our approach using several Persian benchmark datasets. Our experimental results demonstrate the capability of the PersianRAG framework to enhance question answering task in Persian.
لیست مقالات
لیست مقالات بایگانی شده
Particle Swarm Optimization-Based Framework for 3D Swarm Robotic Navigation Using Artificial Potential Field Dynamics
Samim Kamyab - Masoud Shirzadeh - Ghoncheh Zand
A Blockchain-Based Smart Contract Framework for Peer-to-Peer Energy Trading in Smart Grids
Hossein Shahinzadeh - Farshad Ebrahimi - S. Mohammadali Zanjani - Amirafshin Zamani - Saiedeh Mehrabani-Najafabadi - Gevork B. Gharehpetian
طبقه بندی روش های شناسایی داده های تکراری در جهت تسهیل فرایند پاکسازی داده ها
مهدی جعفری - احمد عبدالله زاده بار فروش
FiReT: A Neural Radiance Fields Framework for Wireless Field Reconstruction and Transmitter Placement
Negar Pouya - Armin Soleymani - Gholamreza Moradi - Farzaneh Abdollahi
Adaptive Stopping Criteria-based A-RANSAC algorithm in Copy Move Image Forgery detection
ZAHRA HOSEINNEJAD - Dr MEHDI NASRI
A Model-Driven Approach for Automatic Generation of Android Tourism Applications
Sara Adib - Bahman Zamani
Sentiment Analysis of the Amazon Customers Using the BiGRU Neural Network Enhanced by Attention Mechanism
Sara Sinan Salman al-Abedi - Keyvan Mohebbi
Presenting an Edge-based Air Quality Management System for Smart City Scenarios
Tina Samizadeh Nikoui - Ali Balador - Amir Masoud Rahmani - Hooman Tabarsaied
Targeted Vaccination for COVID-19 Using Mobile Communication Networks
Mohammadmohsen Jadidi - Pegah Moslemi - Saeed Jamshidiha - Iman Masroori - Abbas Mohammadi - Vahid Pourahmadi
A Novel Resource Allocation Scheme for Underlaying NOMA-Based Multi-Channel Cognitive D2D Communications
Anahita Akbari - Dr Javad Zeraatkar Moghaddam - Dr Mehrdad Ardebilipour
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0