ReMaskable: Controllable Facial Attribute Editing Using Segmentation-Guided Latent Diffusion
DOI: https://doi.org/10.47392/IRJAEH.2026.0171

Keywords:
Controllable generation; Diffusion models; Facial attribute editing; Identity preservation; Semantic segmentation

Abstract
Facial attribute editing demands both spatial precision and visual fidelity, yet existing approaches fall short on one or both counts. Generative Adversarial Networks achieve photorealistic synthesis but suffer from attribute entanglement, where modifying one feature inadvertently alters unrelated regions. Diffusion models produce high-quality text-guided edits but lack spatial control, causing changes to propagate beyond the intended area. This paper presents ReMaskable, a framework that decouples the spatial localization problem (where to edit) from the semantic generation problem (what to generate). ReMaskable combines a multi-source segmentation system integrating DeepLabv3+ for 19-class face parsing, SAM for promptable region selection, and DINOv2 for boundary refinement, with a CLIP-conditioned latent diffusion inpainting model that operates exclusively within the masked region. Identity preservation is enforced through ArcFace cosine embedding loss and LPIPS perceptual consistency on unmasked regions. We describe the complete architecture, mathematical formulation, and training methodology. Evaluation metrics are projected from published baselines of each component rather than from completed end-to-end experimental runs, and this distinction is stated throughout. The modular architecture is designed for extensibility to video editing and 3D avatar generation.
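To make the identity-preservation objective stated above concrete, the following PyTorch-style sketch combines an ArcFace cosine embedding term with an LPIPS term restricted to unmasked pixels. This is a minimal illustration, not the paper's reference implementation: the function name `identity_preservation_loss`, the pretrained callables `arcface` and `lpips_fn`, the loss weights, and the mask convention (1 marks the editable region) are all assumptions.

```python
import torch
import torch.nn.functional as F

def identity_preservation_loss(edited, original, mask, arcface, lpips_fn,
                               w_id=1.0, w_lpips=1.0):
    """Hedged sketch of the objective described in the abstract:
    an ArcFace cosine embedding loss between the edited and original
    faces, plus LPIPS perceptual consistency on the unmasked region.
    `arcface` and `lpips_fn` are assumed pretrained callables; the
    weights and mask convention are illustrative, not from the paper.
    """
    # Identity term: 1 - cosine similarity of L2-normalized ArcFace embeddings.
    emb_edit = F.normalize(arcface(edited), dim=-1)
    emb_orig = F.normalize(arcface(original), dim=-1)
    id_loss = 1.0 - (emb_edit * emb_orig).sum(dim=-1).mean()

    # Perceptual consistency restricted to the unmasked region
    # (mask == 1 marks the editable area, so 1 - mask keeps the rest).
    keep = 1.0 - mask
    lpips_loss = lpips_fn(edited * keep, original * keep).mean()

    return w_id * id_loss + w_lpips * lpips_loss
```

Restricting the LPIPS term to `1 - mask` is what confines the consistency penalty to regions the edit should leave untouched, while the ArcFace term constrains the whole face, including the edited region, to remain the same identity.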
License
Copyright (c) 2026 International Research Journal on Advanced Engineering Hub (IRJAEH)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.