본문 바로가기

논문572

[2024-01-07] 오늘의 자연어처리 A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity Abstract:While alignment algorithms are now commonly used to tune pre-trained language models towards a user's preferences, we lack explanations for the underlying mechanisms in which models become ``aligned'', thus making it difficult to explain phenomena like jailbreaks. In this work we study a popular algori.. 2024. 1. 7.

[2024-01-06] 오늘의 자연어처리 Joint Multi-Facts Reasoning Network For Complex Temporal Question Answering Over Knowledge Graph Abstract:Temporal Knowledge Graph (TKG) is an extension of regular knowledge graph by attaching the time scope. Existing temporal knowledge graph question answering (TKGQA) models solely approach simple questions, owing to the prior assumption that each question only contains a single temporal fact w.. 2024. 1. 6.

[2024-01-05] 오늘의 자연어처리 PLLaMa: An Open-source Large Language Model for Plant Science Abstract:Large Language Models (LLMs) have exhibited remarkable capabilities in understanding and interacting with natural language across various sectors. However, their effectiveness is limited in specialized areas requiring high accuracy, such as plant science, due to a lack of specific expertise in these fields. This paper introdu.. 2024. 1. 5.

[2024-01-04] 오늘의 자연어처리 Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models Abstract:Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP). Although convenient for research and practical applications, open-source LLMs with fewer parameters often suffer from severe hallucinations compared to their larger counterparts. This paper focuses on measuring and.. 2024. 1. 4.

이전 1 2 3 4 5 ··· 143 다음

티스토리툴바