0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PC-MCLD: Pose-Constrained and Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
نویسندگان :
Hanieh Fazli
1
Reza Azmi
2
1- دانشگاه الزهرا(س)
2- دانشگاه الزهرا(س)
کلمات کلیدی :
pose-guided person image synthesis،latent diffusion model،texture consistency،adaptive feature fusion،fashion image generation
چکیده :
Pose-guided person image synthesis (PGPIS) aims to generate a person in a target pose while preserving identity and garment details, yet large pose variations often cause texture misalignment and loss of facial fidelity in existing diffusion models. We propose PC-MCLD, a latent diffusion framework that introduces (i) a pose-aware texture transfer constraint ensuring anatomically consistent correspondence between source and target regions, and (ii) an adaptive weighting mechanism that balances global appearance, garment texture, and facial identity cues during generation. Experiments on the DeepFashion In-Shop benchmark show clear improvements over a reproduced MCLD baseline. At 176×256, PC-MCLD reduces FID by 1.39% and LPIPS by 8.24%; at 352×512, the gains increase to 2.53% in FID and 19.48% in LPIPS. These results demonstrate that PC-MCLD enhances both perceptual quality and structural fidelity under challenging pose changes.
لیست مقالات
لیست مقالات بایگانی شده
AOV-IDS: Arithmetic Optimizer with Voting classifier for Intrusion Detection System
Amir Soltany Mahboob - Mohammad Reza Ostadi Moghaddam - Shima Yousefi
Scattering Wavelet-Based Image Quality Assessment Metric for Medical Images
Sina Omidvar - Jamshid Shanbehzadeh
Effective Design of Reversible 2×2 Vedic Multiplier With Low Cost
Mojtaba Noorallahzadeh - Mohammad Mosleh - Ali Shahidikia
Investigating the impact of management information systems (MIS) on organizational transparency with an emphasis on work ethics
Sadegh Balouch - Omid mehdi Ebadati
Multi-Modal Longitudinal Tooth Labeling with Temporal Graph–Transformer Integration
Maral Mirza mohammadi - Mahdi Tarom
DRL-Based Phase Optimization for O-RIS in Dual-Hop Hard Switching FSO/RIS-aided RF and UWOC Systems
Aboozar Heydaribeni - Hamzeh Beyranvand - Sahar Eslami
Predictive Maintenance using LSTM and Adaptive Windowing
Aien Ghanbari Adivi - Behrouz Shahgholi Ghahfarokhi
Automatic identification and reconstruction of Tuberculosis in microscopic images using convolutional auto-encoder network
Ahmad Reza Nadafi - Farahnaz Mohanna
AI-Driven Approach to Detect Equivalent Elements within Domain Models
Mohammad-Sajad Kasaei - Mohammadreza Sharbaf - Afsaneh Fatemi - Bahman Zamani
A High-Speed Quantum Reversible Controlled Adder/Subtractor Circuit
Negin Mashayekhi - Mohammad Reza Reshadinezhad - Shekoofeh Moghimi
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0