Adversarial NLI: A New Benchmark for Natural Language Understanding
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
Massively Multilingual Neural Machine Translation
Mixed Precision Training
Res2Net: A New Multi-scale Backbone Architecture
Adversarial Representation Learning for Robust Privacy Preservation in\n Audio
BitTrain: Sparse Bitmap Compression for Memory-Efficient Training on the Edge
Whitening Sentence Representations for Better Semantics and Faster Retrieval
Deep learning for cardiac image segmentation: A review
Gaussian Error Linear Units (GELUs)
Deep Directed Generative Autoencoders
Neural Natural Language Inference Models Enhanced with External Knowledge
Deep Fragment Embeddings for Bidirectional Image Sentence Mapping
TextBoxes++: A Single-Shot Oriented Scene Text Detector
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
Denoising Diffusion Implicit Models
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Making Pre-trained Language Models Better Few-shot Learners
Ecological Consequences of Trophic Cascades: A Global Perspective
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval