
Share
Researchers have developed a diffusion model that enhances the creation of realistic shadows in composite images, overcoming previous limitations through innovative intensity modulation techniques.
In the world of computer vision, generating realistic shadows for composite images has long been a challenging task. Traditional image-to-image translation models often struggle with this due to data scarcity and the inherent complexity of shadow generation. However, a recent paper titled "Shadow Generation for Composite Image Using Diffusion Model" by Qingyang Liu, Junqi You, Jianting Wang, Xinhao Tao, Bo Zhang, and Li Niu offers a promising solution. This work leverages foundation models and introduces novel intensity modulation techniques to improve shadow realism.
The core of this research lies in adapting ControlNet, a popular framework for conditional image generation, to the specific task of shadow generation. Here are the key technical advancements:
The model's architecture is built on top of a pre-trained diffusion model, specifically fine-tuned for shadow generation. Here are the key components:

The researchers evaluated their model on both the original DESOBA dataset and the new DESOBAv2 dataset, as well as on real composite images. The results showed significant improvements in shadow realism compared to previous methods:
To facilitate further research and practical applications, the authors have released the DESOBAv2 dataset, code, and pre-trained models. You can access these resources at this GitHub link.
This paper marks a significant step forward in the field of image composition by addressing one of its most challenging aspects: realistic shadow generation. By leveraging foundation models and introducing intensity modulation techniques, the researchers have created a powerful tool for generating high-quality shadows. This work not only advances the state-of-the-art but also provides valuable resources for the research community.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
26 March 2024
133 articles
Related Articles
Related Articles
More Stories