Joël Grus
YOU?
Author Swipe
View article: RichJackson/pytorch-transformers: supporting ablation paper v3
RichJackson/pytorch-transformers: supporting ablation paper v3 Open
Code to support the paper Ablations over Transformer Models for Biomedical Relationship Extraction
View article: huggingface/transformers: CTRL, DistilGPT-2, Pytorch TPU, tokenizer enhancements, guideline requirements
huggingface/transformers: CTRL, DistilGPT-2, Pytorch TPU, tokenizer enhancements, guideline requirements Open
New model architectures: CTRL, DistilGPT-2 Two new models have been added since release 2.0. CTRL (from Salesforce) released with the paper CTRL: A Conditional Transformer Language Model for Controllable Generation, by Nitish Shirish Keska…
View article: huggingface/pytorch-transformers: DistilBERT, GPT-2 Large, XLM multilingual models, bug fixes
huggingface/pytorch-transformers: DistilBERT, GPT-2 Large, XLM multilingual models, bug fixes Open
New model architecture: DistilBERT Adding Huggingface's new transformer architecture, DistilBERT described in Smaller, faster, cheaper, lighter: Introducing DistilBERT, a distilled version of BERT by Victor Sanh, Lysandre Debut and Thomas …
Reasoning about Actions and State Changes by Injecting Commonsense Knowledge Open
Comprehending procedural text, e.g., a paragraph describing photosynthesis, requires modeling actions and the state changes they produce, so that questions about entities at different timepoints can be answered. Although several recent sys…
Reasoning about Actions and State Changes by Injecting Commonsense\n Knowledge Open
Comprehending procedural text, e.g., a paragraph describing photosynthesis,\nrequires modeling actions and the state changes they produce, so that questions\nabout entities at different timepoints can be answered. Although several recent\n…
Reasoning about Actions and State Changes by Injecting Commonsense Knowledge Open
Comprehending procedural text, e.g., a paragraph describing photosynthesis, requires modeling actions and the state changes they produce, so that questions about entities at different timepoints can be answered. Although several recent sys…
AllenNLP: A Deep Semantic Natural Language Processing Platform Open
This paper describes AllenNLP, a platform for research on deep learning methods in natural language understanding. AllenNLP is designed to support researchers who want to build novel language understanding models quickly and easily. It is …