MViTv2: Improved Multiscale Vision Transformers for Classification and Detection (Review)

2023. 9. 26. 18:27

Abstract