Swin-Transformer transform the image to tokens to input to transformer.

Is each token (before-embedding) value an integer?

In practice, where is this done? https://github.com/microsoft/Swin-Transformer

The code `self.head = nn.Linear(self.num_features, num_classes)` seems not output integers?

More Tong Guo's questions See All
Similar questions and discussions