16th International Conference on Information and Knowledge Technology
Listening with Precision: ASR-Guided Method and Fusion Strategy for Text-Dependent Speaker Verification
Authors:
Mohammad Reza Molavi (1), Reza Khodadadi (2), Hossein Zeinali (3)
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Sharif University of Technology
3- Amirkabir University of Technology (Tehran Polytechnic)
Keywords:
speaker verification, text-dependent, ASR, speaker embedding fusion
Abstract:
This paper proposes a text-dependent speaker verification (TD-SV) approach that improves accuracy and robustness by using automatic speech recognition (ASR) to guide both the verification process and the final fused score. Our system integrates a Fast-Conformer-based ASR module to validate speech content, effectively filtering out target-wrong and impostor-wrong trials, that is, trials in which the spoken phrase does not match the expected one. We propose a feature fusion method for speaker verification that combines speaker embeddings from Wav2Vec-BERT and ReDimNet, pairing self-supervised and task-specific representations. This fusion significantly improves verification accuracy compared to the individual embeddings. Our approach achieves a competitive normalized minDCF of 0.045 on the Iranian division of the TD-SV 2024 Challenge test set, indicating a favorable balance between performance and computational efficiency. Our best submission secured the second rank in the challenge.
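The two ideas in the abstract, an ASR content check that gates each trial and a fusion of two speaker embeddings, can be illustrated with a minimal sketch. The abstract does not give the actual fusion rule, scoring back end, or rejection score, so everything below is an assumption made for illustration: fusion is taken to be concatenation of L2-normalized embeddings, scoring is cosine similarity, the content check is an exact match on whitespace- and case-normalized text, and REJECT_SCORE is a hypothetical constant. The embeddings and the transcript are plain inputs here; in a real system they would come from the speaker models and the ASR module.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical score assigned when the spoken content fails the ASR check.
REJECT_SCORE = -1.0


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def fuse_embeddings(emb_a: Sequence[float], emb_b: Sequence[float]) -> List[float]:
    """Assumed fusion rule: concatenate the two L2-normalized embeddings."""
    return l2_normalize(emb_a) + l2_normalize(emb_b)


def cosine_score(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    u_n, v_n = l2_normalize(u), l2_normalize(v)
    return sum(a * b for a, b in zip(u_n, v_n))


def normalize_text(text: str) -> str:
    """Lowercase and collapse whitespace before comparing phrases."""
    return " ".join(text.lower().split())


def verify_trial(
    enroll_embs: Tuple[Sequence[float], Sequence[float]],
    test_embs: Tuple[Sequence[float], Sequence[float]],
    asr_transcript: str,
    expected_phrase: str,
) -> float:
    """Return a trial score, rejecting trials whose content is wrong."""
    # ASR gate: a phrase mismatch marks the trial as target-wrong or
    # impostor-wrong, so it is rejected without comparing speakers.
    if normalize_text(asr_transcript) != normalize_text(expected_phrase):
        return REJECT_SCORE
    enroll = fuse_embeddings(*enroll_embs)
    test = fuse_embeddings(*test_embs)
    return cosine_score(enroll, test)


# Toy usage with made-up 2-dimensional embeddings.
same = verify_trial(([1.0, 0.0], [0.0, 2.0]), ([1.0, 0.0], [0.0, 2.0]),
                    "My voice is  my password", "my voice is my password")
wrong_phrase = verify_trial(([1.0, 0.0], [0.0, 2.0]), ([1.0, 0.0], [0.0, 2.0]),
                            "open the door", "my voice is my password")
```

Normalizing each embedding before concatenation keeps either model from dominating the fused vector purely because of its scale; a learned weighting or score-level fusion would be equally plausible readings of the abstract.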