Understanding Taming Transformers for High-Resolution Image Synthesis

Overview Introduction This approach enables transformers to synthesize high-resolution images like this one! Transformers are on the rise and are taking over as the de-facto state-of-the-art architecture in all language-related tasks and other domains such as audio and vision. CNN’s have shown to be vital but have been designed to exploit prior knowledge about strong … Continue reading Understanding Taming Transformers for High-Resolution Image Synthesis