12th International Conference on Information and Knowledge Technology
Conceptual Intelligent Model for Visual Question Answering using Attention Mechanism and Relational Reasoning
Authors:
Elham Alighardash
1
Hassan Khotanlou
2
Vahid Pour Amin
3
1- Bu-Ali Sina University
2- Bu-Ali Sina University
3- Sayyed Jamaleddin Asadabadi University
Keywords:
visual question answering, attention mechanism, visual reasoning, zero-shot learning
Abstract:
In recent years, strong research interest in Visual Question Answering (VQA) has made it a hot topic in computer vision. Many sub-problems have been raised in this area, and reasonable efforts have been made to solve them. Examples include attending to the salient elements of each modality, discovering inter- and intra-modality correlations, choosing a proper information fusion method, using supplementary information from external knowledge bases, visual reasoning, and accepting correct answers that were not seen in the training set. This paper focuses on strengthening the model by reasoning about complex questions, applying an attention mechanism, and leveraging knowledge graphs (KGs) to improve the generated answers. Moreover, the proposed conceptual model includes a zero-shot learning method that accepts unlabeled correct answers through a semantic space mapping approach. The paper also suggests using the fact-based VQA knowledge base to enrich the scene graph with additional information. Implementing the proposed framework is expected to improve both the accuracy and the efficiency of predicting appropriate answers.
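The semantic space mapping idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the abstract does not specify the embedding method or the similarity measure, so the cosine similarity, the function names (cosine, predict_answer) and the toy vectors below are all assumptions made for illustration. The point is only that, once the fused image-question representation and the candidate answers live in the same vector space, an answer with no training label can still be selected by nearest-neighbour search.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def predict_answer(joint_embedding, answer_embeddings):
    """Return the candidate answer whose embedding is closest to the joint embedding.

    joint_embedding: hypothetical fused image-question vector, assumed to be
    already projected into the same semantic space as the answers.
    answer_embeddings: dict mapping answer text to its vector.
    """
    return max(
        answer_embeddings,
        key=lambda answer: cosine(joint_embedding, answer_embeddings[answer]),
    )


# Toy, made-up vectors; a real system would use learned embeddings.
answer_embeddings = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.0, 1.0, 0.0],
    "zebra": [0.0, 0.0, 1.0],  # treated here as an answer never seen in training
}
joint = [0.1, 0.2, 0.9]

print(predict_answer(joint, answer_embeddings))  # prints "zebra"
```

Because the prediction is a similarity search rather than a classification over a fixed label set, new candidate answers can be added to answer_embeddings without retraining a classifier head, which is the property a zero-shot setting relies on.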