Res2Net: A New Multi-scale Backbone Architecture
Mixed Precision Training
Adversarial NLI: A New Benchmark for Natural Language Understanding
CE-Net: Context Encoder Network for 2D Medical Image Segmentation
BitTrain: Sparse Bitmap Compression for Memory-Efficient Training on the Edge
Massively Multilingual Neural Machine Translation
Adversarial Representation Learning for Robust Privacy Preservation in\n Audio
Gaussian Error Linear Units (GELUs)
Deep Fragment Embeddings for Bidirectional Image Sentence Mapping
Robust Optimization for Multilingual Translation with Imbalanced Data
Making Pre-trained Language Models Better Few-shot Learners
TextBoxes++: A Single-Shot Oriented Scene Text Detector
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Neural Natural Language Inference Models Enhanced with External Knowledge
K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
Distilling the Knowledge in a Neural Network
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Denoising Diffusion Implicit Models
Ecological Consequences of Trophic Cascades: A Global Perspective