0% Complete
English
صفحه اصلی
/
یازدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PeCoQ: A Dataset for Persian Complex Question Answering over Knowledge Graph
نویسندگان :
Romina Etezadi
1
Mehrnoush Shamsfard
2
1- دانشگاه شهید بهشتی
2- دانشگاه شهید بهشتی
کلمات کلیدی :
question answering, complex question, knowledge graph
چکیده :
Question answering systems may find the answers to users' questions from either unstructured texts or structured data such as knowledge graphs. Answering questions using supervised learning approaches including deep learning models need large training datasets. In recent years, some datasets have been presented for the task of Question answering over knowledge graphs, which is the focus of this paper. Although many datasets in English were proposed, there have been a few question answering datasets in Persian. This paper introduces PeCoQ, a dataset for Persian question answering. This dataset contains 10,000 complex questions and answers extracted from the Persian knowledge graph, FarsBase. For each question, the SPARQL query and two paraphrases that were written by linguists are provided as well. There are different types of complexities in the dataset, such as multi-relation, multi-entity, ordinal, and temporal constraints. In this paper, we discuss the dataset's characteristics and describe our methodology for building it.
لیست مقالات
لیست مقالات بایگانی شده
Hardware Imperfection Effects in Wireless Virtual Reality System with Hybrid Beamforming
Nasim Alikhani - Abbas Mohammadi
Advanced SMS Spam Detection using Deep Complex Models and Sine-Cosine Algorithm
Sepehr Rezaei - Mohammadreza Shams - Mohsen Alambardar Meybodi
Classification of Personality Traits on Facebook Using Key Phrase Extraction, Language Models and Machine Learning
Faezeh Safari - Abdolah Chalechale
طراحی سیستم پشتیبانی تجاری با استفاده از فناوری هوش مصنوعی
سجاد قطعی - زهره عربی - محمد روحی
GanjNet: Leveraging Network Modeling with Large Language Models for Persian Word Sense Induction
Amir Mohammad Kouyeshpour - Hadi Veisi - Saman Haratizadeh
Listening with Precision: ASR-Guided Method and Fusion Strategy for Text-Dependent Speaker Verification
Mohammad Reza Molavi - Reza Khodadadi - Hossein Zeinali
A Fuzzy Cluster-Based Routing Algorithm to Extend Wireless Sensor Network Lifetime
Mostafa Mirzaie - Armin Mazinani - Dr Sayyed Majid Mazinani
Leveraging Retrieval-Augmented Generation for Persian University Knowledge Retrieval
Arshia Hemmat - Mohammad Hassan Heydari - Kianoosh Vadaei - Afsaneh Fatemi
Benchmarking Embedding Models for Persian-Language Semantic Information Retrieval
Mahmood Kalantari - Mehdi Feghhi - Nasser Mozayani
معماری مبتنی بر مدلهای زبانی بزرگ برای تخصیص وظایف پویا و خودکار در سامانه رباتیک ازدحامی چندالگوریتمی
حمید هوشمند - سینا میرخانی - محمد حسین وارث وزیریان
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.5.2