0% Complete
فارسی
Home
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
PC-MCLD: Pose-Constrained and Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Authors :
Hanieh Fazli
1
Reza Azmi
2
1- دانشگاه الزهرا(س)
2- دانشگاه الزهرا(س)
Keywords :
pose-guided person image synthesis،latent diffusion model،texture consistency،adaptive feature fusion،fashion image generation
Abstract :
Pose-guided person image synthesis (PGPIS) aims to generate a person in a target pose while preserving identity and garment details, yet large pose variations often cause texture misalignment and loss of facial fidelity in existing diffusion models. We propose PC-MCLD, a latent diffusion framework that introduces (i) a pose-aware texture transfer constraint ensuring anatomically consistent correspondence between source and target regions, and (ii) an adaptive weighting mechanism that balances global appearance, garment texture, and facial identity cues during generation. Experiments on the DeepFashion In-Shop benchmark show clear improvements over a reproduced MCLD baseline. At 176×256, PC-MCLD reduces FID by 1.39% and LPIPS by 8.24%; at 352×512, the gains increase to 2.53% in FID and 19.48% in LPIPS. These results demonstrate that PC-MCLD enhances both perceptual quality and structural fidelity under challenging pose changes.
Papers List
List of archived papers
IoT-Based Model in Smart Urban Traffic Control: Graph theory and Genetic Algorithm
Saeed Doostali - Seyed Morteza Babamir - Mohammad Shiralizadeh Dezfoli - Behzad Soleimani Neysiani
توسعه مدل مفهومی طراحی فرآیند مدیریت بحران سیلاب از طریق بهینه سازی استفاده از دستگاه های اینترنت اشیاء (IoT Devices) در تصمیم گیری
محمود رسولی - سید احسان ملیحی
AI-Powered Beauty Insights: Sentiment Analysis in a Low-Resource Language
Sajedeh Talebi - Neda Abdolvand - Fatemeh Mahdian
Advanced SMS Spam Detection using Deep Complex Models and Sine-Cosine Algorithm
Sepehr Rezaei - Mohammadreza Shams - Mohsen Alambardar Meybodi
Task Scheduling for Real-time Object Detection: Methods and Performance Comparison in ADAS Applications
Mahdi Seyfipoor - Sayyed Muhammad Jaffry - Siamak Mohamadi
A Graph Attention-Based Autoencoder for Critical Path Anomaly Detection in Microservices
Mahdi Naderi - Hossein Momeni - Shayan Shahini
Design and modeling of a waiter robot
Amin Mohammadnejad - Hami Tourajizadeh
PersianRAG A Retrieval Augmented Generation System for Persian Language
Hossein Hosseini - Mohammad Sobhan Zare - Amir Hossein Mohammadi - Arefeh Kazemi - Zahra Zojaji - Mohammad Ali Nematbakhsh
Improved Weighting in the Automated Texts Classification using Fuzzy Method
Hamidreza Sadrarhami - S. Mohammadali Zanjani - Ghazanfar Shahgholian
Design and Simulation of a New Multiplexer with Energy Analysis in Quantum Cellular Automata Technology
- - -
more
Samin Hamayesh - Version 43.8.0