Leffa: Learning Flow Fields in Attention for Controllable Person Image Generation

Star ⭐ us if you like it!

News

09/Jan/2025: Inference now defaults to float16 and generates an image in 6 seconds on an A100.

More news can be found in the GitHub repository.

Leffa is a unified framework for controllable person image generation that enables precise manipulation of both appearance (i.e., virtual try-on) and pose (i.e., pose transfer).

Demo interface: upload a Person Image and a Garment Image (examples are provided for each) to produce the Generated Image.

Note: The models used in the demo are trained solely on academic datasets. Virtual try-on uses VITON-HD/DressCode, and pose transfer uses DeepFashion.