본문 바로가기
반응형

arxiv572

[2023-10-13] 오늘의 자연어처리 Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity Abstract:This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability of LLMs to produce content inconsistent with establ.. 2023. 10. 13.
[2023-10-12] 오늘의 자연어처리 TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models Abstract:Aligned large language models (LLMs) demonstrate exceptional capabilities in task-solving, following instructions, and ensuring safety. However, the continual learning aspect of these aligned LLMs has been largely overlooked. Existing continual learning benchmarks lack sufficient challenge for leading align.. 2023. 10. 12.
[2023-10-11] 오늘의 자연어처리 The Program Testing Ability of Large Language Models for Code Abstract:Recent development of large language models (LLMs) for code like CodeX and CodeT5+ demonstrates tremendous promise in achieving code intelligence. Their ability of synthesizing code that completes a program for performing a pre-defined task has been intensively tested and verified on benchmark datasets including HumanEval and.. 2023. 10. 11.
[2023-10-10] 오늘의 자연어처리 Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models Abstract:Recent advancements in Large Language Models (LLMs) have demonstrated impressive capabilities across a range of natural language processing tasks, especially in reasoning, a cornerstone for achieving Artificial General Intelligence (AGI). However, commonly used benchmarks may not fully encapsu.. 2023. 10. 10.
반응형