Stability AI and its multimodal AI research lab, DeepFloyd, have announced the research release of DeepFloyd IF, a cutting-edge text-to-image cascaded pixel diffusion model. The model is initially released under a non-commercial, research-permissible license, but an open-source release is planned for the future.
DeepFloyd IF boasts several remarkable features, including:
- Deep text prompt understanding: The model uses T5-XXL-1.1 as a text encoder, with numerous text-image cross-attention…
Read the full article here