Hunyuan · @TencentHunyuan

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model. The new model de...

View this X/Twitter post from @TencentHunyuan published on 9 de septiembre de 2025, 15:14. This post contains 1 video.

Published
9 de septiembre de 2025, 15:14
Thread Items
1
Media Items
1
Hunyuan avatar
Hunyuan
@TencentHunyuan
9 de septiembre de 2025, 15:14

Tweet Overview

View this X/Twitter post from @TencentHunyuan published on 9 de septiembre de 2025, 15:14. This post contains 1 video.

We've officially released and open-sourced HunyuanImage 2.1, our latest text-to-image model.

The new model delivers on our commitment to balancing performance and quality. With native 2K image generation, HunyuanImage 2.1 is an advanced open-source text-to-image model.🎨

✨ New in 2.1:
🔹Advanced Semantics: Supports ultra-long and complex prompts of up to 1000 tokens, and precisely controls the generation of multiple subjects in a single image.
🔹Precise Chinese and English Text Rendering with seamless image–text integration:
The model naturally integrates text into images, making it suitable for a wide range of applications such as product covers, illustrations, and poster design to meet the needs of various fields.
🔹Rich Styles and High Aesthetic: Capable of generating images in various styles—including photorealistic portraits, comics, and vinyl figures—it delivers outstanding visual appeal and artistic quality.
🔹High-Quality Generation: Efficiently produces ultra-high-definition (2K) images in the same time other models take to generate a 1K image.

HunyuanImage 2.1 uses two text encoders: a multimodal large language model (MLLM) to improve the model's image and text alignment capabilities, and a multi-language character-aware encoder to improve text rendering capabilities. The model is a single- and double-stream diffusion transformer with 17B parameters.

We've also open-sourced the weights of the  the accelerated version with meanflow which reduces inference steps from 100 to just 8, and PromptEnhancer, the first industrial-grade rewriting model that enhances your prompts for more nuanced and expressive image generation. Now, creators turn complex ideas—like posters with slogans or multi-panel comics—into visuals faster than ever.

We’re just getting started. Stay tuned for our native multimodal image generation model coming soon.

🌐Website: 
🔗Github: 
🤗Hugging Face: 
✨Hugging Face Demo:

More from @TencentHunyuan

Archived posts from Hunyuan

Ver todo

Related Creators

TwitFast

v1.4.88

Free Twitter video downloader. Top Twitter trends and hashtags list, Monitor, track hottest trending topics, hashtags.

© 2024 TwitFast Reservados todos los derechos.