<?xml version="1.0" encoding="UTF-8"?>
<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>an-image-is-worth-16x16-words-transformers-for-image-recognition-at-scale-2023-015</dc:title>
  <dc:creator>Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby</dc:creator>
  <dc:date>2023</dc:date>
  <dc:description>This record links to the Vision Transformer paper, which adapts transformer architectures to image classification through patch-based image representation. It is useful for validating computer vision categories and AI search relevance.</dc:description>
  <dc:identifier>https://inrepscholar.com/projects/954</dc:identifier>
</oai_dc:dc>