Note: The reference list may be incomplete. This list contains all references that BIP software was able to retrieve.
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics · 2019
       
Self-Attentive Sequential Recommendation
2018 IEEE International Conference on Data Mining (ICDM) · 2018
       
Towards Accurate Multi-person Pose Estimation in the Wild
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) · 2017
       
Effective Approaches to Attention-based Neural Machine Translation
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing · 2015
       
On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation · 2014
       
Glove: Global Vectors for Word Representation
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) · 2014
       
Exploring the Limits of Weakly Supervised Pretraining
Computer Vision – ECCV 2018 · 2018
       
Listen, attend and spell: A neural network for large vocabulary conversational speech recognition
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) · 2016
       
Gradient-based learning applied to document recognition
Proceedings of the IEEE · 1998
       
Universal Language Model Fine-tuning for Text Classification
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) · 2018
       
Using Multi-Sense Vector Embeddings for Reverse Dictionaries
Proceedings of the 13th International Conference on Computational Semantics - Long Papers · 2019
       
Long Short-Term Memory
Neural Computation · 1997
       
Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
2015 IEEE International Conference on Computer Vision (ICCV) · 2015
       
A Decomposable Attention Model for Natural Language Inference
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing · 2016
       
HellaSwag: Can a Machine Really Finish Your Sentence?
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics · 2019
       
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics · 2019
       
TinyBERT: Distilling BERT for Natural Language Understanding
Findings of the Association for Computational Linguistics: EMNLP 2020 · 2020
       
Finding Structure in Time
Cognitive Science · 1990
       
A Neural Architecture for Generating Natural Language Descriptions from Source Code Changes
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) · 2017
       
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) · 2014
       
Neural Speech Synthesis with Transformer Network
Proceedings of the AAAI Conference on Artificial Intelligence · 2019
       
A Character-level Decoder without Explicit Segmentation for Neural Machine Translation
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) · 2016
       
Neural Text Generation from Structured Data with Application to the Biography Domain
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing · 2016
       
Bidirectional recurrent neural networks
IEEE Transactions on Signal Processing · 1997
       
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies · 2015
       
Probing Neural Network Comprehension of Natural Language Arguments
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics · 2019
       
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Bioinformatics · 2019
       
Transfer Learning in Natural Language Processing
Proceedings of the 2019 Conference of the North · 2019
       
Show and tell: A neural image caption generator
2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) · 2015
       
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
NAACL · 2018
       
Semi-supervised sequence tagging with bidirectional language models
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) · 2017
       
Deep Contextualized Word Representations
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers) · 2018
       
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP · 2018
       
Mask R-CNN
2017 IEEE International Conference on Computer Vision (ICCV) · 2017
       
Solving Arithmetic Word Problems Automatically Using Transformer and Unambiguous Representations
2019 International Conference on Computational Science and Computational Intelligence (CSCI) · 2019
       
Multi-Task Deep Neural Networks for Natural Language Understanding
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics · 2019
       
A Neural Attention Model for Abstractive Sentence Summarization
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing · 2015
       
“Cloze Procedure”: A New Tool for Measuring Readability
Journalism Quarterly · 1953
       
ImageNet: A large-scale hierarchical image database
2009 IEEE Conference on Computer Vision and Pattern Recognition · 2009