Are Pre-Trained Convolutions Better Than Pre-Trained Transformers? (2021) arxiv.org 1 points by fzliu 3 hours ago