This repo is the official implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" as well as the follow-ups. It currently includes code and models for the following tasks: Image Classification: Included in this repo. See get_started.md for a quick start. Object Detection and Instance Segmentation: See Swin Transformer for Object Detection. Semantic Segmentation: