본문 바로가기

전체 글599

[2023-09-03] 오늘의 자연어처리 The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the evaluation of text models in high-, medium-, and low-resource language.. 2023. 9. 3.

[2023-09-02] 오늘의 자연어처리 SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models Current speech large language models build upon discrete speech representations, which can be categorized into semantic tokens and acoustic tokens. However, existing speech tokens are not specifically designed for speech language modeling. To assess the suitability of speech tokens for building speech language models, we .. 2023. 9. 2.

[2023-09-01] 오늘의 자연어처리 Proceedings 39th International Conference on Logic Programming This volume contains the Technical Communications presented at the 39th International Conference on Logic Programming (ICLP 2023), held at Imperial College London, UK from July 9 to July 15, 2023. Technical Communications included here concern the Main Track, the Doctoral Consortium, the Application and Systems/Demo track, the Recent.. 2023. 9. 1.

[2023-08-31] 오늘의 자연어처리 TaskLAMA: Probing the Complex Task Understanding of Language Models Structured Complex Task Decomposition (SCTD) is the problem of breaking down a complex real-world task (such as planning a wedding) into a directed acyclic graph over individual steps that contribute to achieving the task, with edges specifying temporal dependencies between them. SCTD is an important component of assistive plann.. 2023. 8. 31.

이전 1 ··· 30 31 32 33 34 35 36 ··· 150 다음

티스토리툴바