[2023-05-13] Today's Natural Language Processing Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction Continual few-shot relation extraction (RE) aims to continuously train a model for new relations with few labeled training data, of which the major challenges are the catastrophic forgetting of old relations and the overfitting caused by data sparsity. In this paper, we propose a new model, namely SCKD, to accom.. 2023. 5. 13.
[2023-05-12] Today's Natural Language Processing Context-Aware Document Simplification To date, most work on text simplification has focused on sentence-level inputs. Early attempts at document simplification merely applied these approaches iteratively over the sentences of a document. However, this fails to coherently preserve the discourse structure, leading to suboptimal output quality. Recently, strategies from controllable simplification .. 2023. 5. 12.
[2023-05-11] Today's Natural Language Processing WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset Webpages have been a rich resource for language and vision-language tasks. Yet only pieces of webpages are kept: image-caption pairs, long text articles, or raw HTML, never all in one place. Webpage tasks have resultingly received little attention and structured image-text data underused. To study multimodal webpage understanding, we introduce.. 2023. 5. 11.
[2023-05-10] Today's Natural Language Processing XAI in Computational Linguistics: Understanding Political Leanings in the Slovenian Parliament The work covers the development and explainability of machine learning models for predicting political leanings through parliamentary transcriptions. We concentrate on the Slovenian parliament and the heated debate on the European migrant crisis, with transcriptions from 2014 to 2020. We develop both c.. 2023. 5. 10.