Adversarial NLI: A New Benchmark for Natural Language Understanding
Mixed Precision Training
Massively Multilingual Neural Machine Translation
Res2Net: A New Multi-scale Backbone Architecture
CE-Net: Context Encoder Network for 2D Medical Image Segmentation
BitTrain: Sparse Bitmap Compression for Memory-Efficient Training on the Edge
Adversarial Representation Learning for Robust Privacy Preservation in\n Audio
Making Pre-trained Language Models Better Few-shot Learners
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Deep Directed Generative Autoencoders
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
Denoising Diffusion Implicit Models
Gaussian Error Linear Units (GELUs)
Deep Fragment Embeddings for Bidirectional Image Sentence Mapping
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
TextBoxes++: A Single-Shot Oriented Scene Text Detector
Robust Optimization for Multilingual Translation with Imbalanced Data
Neural Natural Language Inference Models Enhanced with External Knowledge
Ecological Consequences of Trophic Cascades: A Global Perspective
Visual Relationship Detection with Language Priors