0% Complete
فارسی
Home
/
پانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Leveraging Retrieval-Augmented Generation for Persian University Knowledge Retrieval
Authors :
Arshia Hemmat
1
Mohammad Hassan Heydari
2
Kianoosh Vadaei
3
Afsaneh Fatemi
4
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
4- University of Isfahan
Keywords :
Large Language Models،Natural Language Processing،Retrieval Augmented Generation،Dataset Generation،QuestionAnswering System
Abstract :
This paper introduces an innovative approach using Retrieval-Augmented Generation (RAG) pipelines with Large Language Models (LLMs) to enhance information retrieval and query response systems for university-related question answering. By systematically extracting data from the university's official website, primarily in Persian, and employing advanced prompt engineering techniques, we generate accurate and contextually relevant responses to user queries. We developed a comprehensive university benchmark, UniversityQuestionBench (UQB), to rigorously evaluate our system’s performance. UQB focuses on Persian-language data, assessing accuracy and reliability through various metrics and real-world scenarios. Our experimental results demonstrate significant improvements in the precision and relevance of generated responses, enhancing user experiences, and reducing the time required to obtain relevant answers. In summary, this paper presents a novel application of RAG pipelines and LLMs for Persian-language data retrieval, supported by a meticulously prepared university benchmark, offering valuable insights into advanced AI techniques for academic data retrieval and setting the stage for future research in this domain.\footnote{Dataset is publicly available at \url{https://huggingface.co/datasets/UIAIC/UQB}}
Papers List
List of archived papers
Non-Linear Control of Cancer Model, Considering the Drug Resistance Using Feedback Based Chemotherapy Approach
Danial Kiaei - Hami Tourajizadeh
NFV-Based Distributed Service Function Chaining with Imperfect Information
Mahsa Alikhani - Marzieh Sheikhi - Dr Vesal Hakami
انتخاب ویژگی با استفاده از الگوریتم بهینه سازی ذرات مبتنی بر استراتژی خود تطبیقی دودویی جهت تشخیص بیماری
الهام صالحی - دکتر محمدرضا کرمی ملایی - دکتر حسام عمرانپور الهام صالحی - محمدرضا کرمی ملایی - حسام عمرانپور -
پیشنهادات کالیبره شده براساس احساسات استخراج شده از متون مرتبط با آیتم ها
شیوا پارساراد - دکتر سامان هراتی زاده شیوا پارساراد - سامان هراتی زاده -
Identifying Children's Personality Styles through Drawing Analysis using Machine Learning
Maedeh Mosharraf - Faezeh Banabazi
Detection and Identification of Cyber-Attacks in Cyber-Physical Systems Based on Machine Learning Methods
Zohre Nasiri Zarandi
Persian Language Understanding in Task-oriented Dialogue System for Online Shopping
Zeinab Borhanifard - Hossein Basafa - Seyedeh Zahra Razavi - Heshaam Faili
Improving Personalized Federated Learning-based QoE Assessment using Clustering
Skokufe Motaharipour - Behrouz Shahgholi Ghahfarokhi - Saeid Afshari
Context Awareness Gate for Retrieval Augmented Generation
Mohammad Hassan Heydari - Arshia Hemmat - Erfan Naman - Afsaneh Fatemi
Using Deconvolutional Variational Autoencoder for Answer Selection in Community Question Answering
Golshan Afzali Boroujeni - Heshaam Faili
more
Samin Hamayesh - Version 41.3.1