TY - GEN
T1 - NUIG-DSI’s submission to The GEM Benchmark 2021
AU - Pasricha, Nivranshu
AU - Arcan, Mihael
AU - Buitelaar, Paul
N1 - Publisher Copyright:
© 2021 Association for Computational Linguistics
PY - 2021
Y1 - 2021
N2 - This paper describes the submission by NUIG-DSI to the GEM benchmark 2021. We participate in the modeling shared task where we submit outputs on four datasets for data-to-text generation, namely, DART, WebNLG (en), E2E and CommonGen. We follow an approach similar to the one described in the GEM benchmark paper where we use the pre-trained T5-base model for our submission. We train this model on additional monolingual data where we experiment with different masking strategies specifically focused on masking entities, predicates and concepts as well as a random masking strategy for pre-training. In our results we find that random masking performs the best in terms of automatic evaluation metrics, though the results are not statistically significantly different compared to other masking strategies.
AB - This paper describes the submission by NUIG-DSI to the GEM benchmark 2021. We participate in the modeling shared task where we submit outputs on four datasets for data-to-text generation, namely, DART, WebNLG (en), E2E and CommonGen. We follow an approach similar to the one described in the GEM benchmark paper where we use the pre-trained T5-base model for our submission. We train this model on additional monolingual data where we experiment with different masking strategies specifically focused on masking entities, predicates and concepts as well as a random masking strategy for pre-training. In our results we find that random masking performs the best in terms of automatic evaluation metrics, though the results are not statistically significantly different compared to other masking strategies.
UR - https://www.scopus.com/pages/publications/85123737319
M3 - Conference Publication
AN - SCOPUS:85123737319
T3 - GEM 2021 - 1st Workshop on Natural Language Generation, Evaluation, and Metrics, Proceedings
SP - 148
EP - 154
BT - GEM 2021 - 1st Workshop on Natural Language Generation, Evaluation, and Metrics, Proceedings
A2 - Bosselut, Antoine
A2 - Durmus, Esin
A2 - Gangal, Varun Prashant
A2 - Gehrmann, Sebastian
A2 - Jernite, Yacine
A2 - Perez-Beltrachini, Laura
A2 - Shaikh, Samira
A2 - Xu, Wei
PB - Association for Computational Linguistics (ACL)
T2 - 1st Workshop on Natural Language Generation, Evaluation, and Metrics, GEM 2021
Y2 - 5 August 2021 through 6 August 2021
ER -