All Categories (599)

[2022-11-29] Today's NLP
Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt
Automatic International Classification of Diseases (ICD) coding aims to assign multiple ICD codes to a medical note with an average of 3,000+ tokens. This task is challenging due to the high-dimensional space of multi-label assignment (155,000+ ICD code candidates) and the long-tail challenge - Many ICD codes are infrequent..
2022. 11. 29.

[2022-11-28] Today's NLP
Embedding Compression for Text Classification Using Dictionary Screening
In this paper, we propose a dictionary screening method for embedding compression in text classification tasks. The key purpose of this method is to evaluate the importance of each keyword in the dictionary. To this end, we first train a pre-specified recurrent neural network-based model using a full dictionary. This leads ..
2022. 11. 28.

[2022-11-27] Today's NLP
Sarcasm Detection Framework Using Emotion and Sentiment Features
Sarcasm detection is an essential task that can help identify the actual sentiment in user-generated data, such as discussion forums or tweets. Sarcasm is a sophisticated form of linguistic expression because its surface meaning usually contradicts its inner, deeper meaning. Such incongruity is the essential component of sarcasm, h..
2022. 11. 27.

[2022-11-26] Today's NLP
Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning
To overcome the data sparsity issue in short text topic modeling, existing methods commonly rely on data augmentation or the data characteristic of short texts to introduce more word co-occurrence information. However, most of them do not make full use of the augmented data or the data characteristic: t..
2022. 11. 26.
