ISC Publications

Publisher

Association for Computational Linguistics (ACL) / Workshop on Arabic Natural Language Processing (WANLP)

Authors (1)

Paul McNamee

2019

JHU System Description for the MADAR Arabic Dialect Identification Shared Task

Abstract

Our submission to the MADAR shared task on Arabic dialect identification employed a language modeling technique called Prediction by Partial Matching, an ensemble of neural architectures, and sources of additional data for training word embeddings and auxiliary language models. We found several of these techniques provided small boosts in performance, though a simple character-level language model was a strong baseline, and a lower-order LM achieved best performance on Subtask 2. Interestingly, word embeddings provided no consistent benefit, and ensembling struggled to outperform the best component submodel. This suggests the variety of architectures are learning redundant information, and future work may focus on encouraging decorrelated learning.

Publisher

Association for Computational Linguistics (ACL) / Workshop on Arabic Natural Language Processing (WANLP)

Authors (1)

Paul McNamee

2019

JHU System Description for the MADAR Arabic Dialect Identification Shared Task

Abstract

ISC

Bart Paulhamus, Chief
Bart.Paulhamus@jhuapl.edu
240-228-8514

Doh Youn Hong, Operations Manager
Doh.Hong@jhuapl.edu
240-592-2560

Intelligent Systems Center
7701 Montpelier Road
Laurel, MD 20723

Contact Us

Publisher

Association for Computational Linguistics (ACL) / Workshop on Arabic Natural Language Processing (WANLP)

Authors (1)

Paul McNamee

2019

JHU System Description for the MADAR Arabic Dialect Identification Shared Task

Abstract

Bart Paulhamus, Chief Bart.Paulhamus@jhuapl.edu 240-228-8514

Doh Youn Hong, Operations Manager Doh.Hong@jhuapl.edu 240-592-2560

Intelligent Systems Center 7701 Montpelier Road Laurel, MD 20723

Bart Paulhamus, Chief
Bart.Paulhamus@jhuapl.edu
240-228-8514

Doh Youn Hong, Operations Manager
Doh.Hong@jhuapl.edu
240-592-2560

Intelligent Systems Center
7701 Montpelier Road
Laurel, MD 20723