16th International Conference on Information and Knowledge Technology
Integrating Wasserstein GANs for High-Speed Transformer-Based Neural Machine Translation
Authors:
Parisa Nekoogol
1
Mostafa Salehi
2
1- University of Tehran
2- University of Tehran
Keywords:
Neural Machine Translation, Generative Adversarial Networks, Reinforcement Learning, Transformer
Abstract:
Neural machine translation (NMT), a key achievement in natural language processing (NLP), still faces challenges such as low-quality output for complex sentences and a lack of natural fluency. This study aims to improve translation quality by integrating Generative Adversarial Networks (GANs) with an NMT model. First, the baseline NMT model, taken from previous research and based on recurrent neural networks (RNNs), was reconstructed and implemented. The RNN architecture was then replaced with the Transformer architecture, and the system was developed using a Wasserstein Generative Adversarial Network (WGAN). Because text is discrete, sampling output tokens is non-differentiable and the adversarial training signal cannot be backpropagated to the generator directly; to overcome this, Self-Critical Sequence Training (SCST), a reinforcement learning (RL) algorithm, was employed. A core objective was to analyze how much adversarial training benefits an already strong Transformer-based generator. The study concludes that although adversarial training helps the model generate more fluent translations, the improvement is more substantial for RNN-based models than for the Transformer architecture.
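The abstract gives no implementation details, so the following is only a minimal sketch of the general SCST idea, not the authors' code. It assumes that the reward for a translation is a scalar score (for example, one produced by the WGAN critic) and that the baseline is the reward of the greedily decoded translation. The function names and the numbers are hypothetical.

```python
import math
from typing import Sequence


def sequence_log_prob(token_probs: Sequence[float]) -> float:
    """Sum of the log-probabilities the generator gave to each sampled token."""
    return sum(math.log(p) for p in token_probs)


def scst_loss(
    sampled_token_probs: Sequence[float],
    sampled_reward: float,
    greedy_reward: float,
) -> float:
    """Self-critical loss for one sentence.

    loss = -(r_sampled - r_greedy) * log p(sampled sentence)

    The rewards are treated as constants, so no gradient has to pass
    through the discrete sampling step.
    """
    advantage = sampled_reward - greedy_reward
    return -advantage * sequence_log_prob(sampled_token_probs)


# Hypothetical values: per-token probabilities of a sampled translation,
# and assumed scalar rewards for the sampled and the greedy translation.
probs = [0.5, 0.25, 0.5]
loss = scst_loss(probs, sampled_reward=0.8, greedy_reward=0.6)
print(round(loss, 4))
```

When the sampled translation scores higher than the greedy one, minimizing this loss raises the probability of the sampled tokens; when it scores lower, the probability is pushed down. In a real system the log-probabilities would come from the generator's output layer inside an automatic-differentiation framework.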