본문 바로가기
반응형

논문572

[2023-05-28] 오늘의 자연어처리 IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages India has a rich linguistic landscape with languages from 4 major language families spoken by over a billion people. 22 of these languages are listed in the Constitution of India (referred to as scheduled languages) are the focus of this work. Given the linguistic diversity, high-qua.. 2023. 5. 28.
[2023-05-27] 오늘의 자연어처리 UNITE: A Unified Benchmark for Text-to-SQL Evaluation A practical text-to-SQL system should generalize well on a wide variety of natural language questions, unseen database schemas, and novel SQL query structures. To comprehensively evaluate text-to-SQL systems, we introduce a \textbf{UNI}fied benchmark for \textbf{T}ext-to-SQL \textbf{E}valuation (UNITE). It is composed of publicly available te.. 2023. 5. 27.
[2023-05-26] 오늘의 자연어처리 Sentiment Analysis Using Aligned Word Embeddings for Uralic Languages In this paper, we present an approach for translating word embeddings from a majority language into 4 minority languages: Erzya, Moksha, Udmurt and Komi-Zyrian. Furthermore, we align these word embeddings and present a novel neural network model that is trained on English data to conduct sentiment analysis and then applied on .. 2023. 5. 26.
[2023-05-25] 오늘의 자연어처리 TalkUp: A Novel Dataset Paving the Way for Understanding Empowering Language Empowering language is important in many real-world contexts, from education to workplace dynamics to healthcare. Though language technologies are growing more prevalent in these contexts, empowerment has not been studied in NLP, and moreover, it is inherently challenging to operationalize because of its subtle, implici.. 2023. 5. 25.
반응형