본문 바로가기
반응형

논문572

[2023-01-02] 오늘의 자연어처리 The URW-KG: a Resource for Tackling the Underrepresentation of non-Western Writers Digital media have enabled the access to unprecedented literary knowledge. Authors, readers, and scholars are now able to discover and share an increasing amount of information about books and their authors. Notwithstanding, digital archives are still unbalanced: writers from non-Western countries are less represe.. 2023. 1. 2.
[2023-01-01] 오늘의 자연어처리 TextBox 2.0: A Text Generation Library with Pre-trained Language Models To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs). To be comprehensive, our library covers $13$ common text generation tasks and their corresponding $83$ datasets and further incorporates $45$ PLMs coverin.. 2023. 1. 1.
[2022-12-31] 오늘의 자연어처리 Page Layout Analysis of Text-heavy Historical Documents: a Comparison of Textual and Visual Approaches Page layout analysis is a fundamental step in document processing which enables to segment a page into regions of interest. With highly complex layouts and mixed scripts, scholarly commentaries are text-heavy documents which remain challenging for state-of-the-art models. Their layout considera.. 2022. 12. 31.
[2022-12-30] 오늘의 자연어처리 Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation Automatic Speech Recognition (ASR) systems frequently use a search-based decoding strategy aiming to find the best attainable transcript by considering multiple candidates. One prominent speech recognition decoding heuristic is beam search, which seeks the transcript with the greatest likelihood computed using the predicted distri.. 2022. 12. 30.
반응형