This post is a brief summary about the paper that I read for my study and curiosity, so I shortly arrange the content of the paper, titled An Image is Worth 16 X 16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al. arXiv 2021), that I read and studied.

Dosovitskiy et al. arXiv 2021

For detailed experiment and explanation, refer to the paper, titled An Image is Worth 16 X 16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al. arXiv 2021)

Reference