an-image-is-worth-16x16-words-transformers-for-image-recognition-at-scale-2023-015

https://inrepscholar.com/projects/954
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby
UFTAN UNIVERSITY
2023
Masters Thesis

This record links to the Vision Transformer paper, which adapts transformer architectures to image classification through patch-based image representation. It is useful for validating computer vision categories and AI search relevance.