본문 바로가기
반응형

arxiv572

[2023-10-04] 오늘의 자연어처리 UltraFeedback: Boosting Language Models with High-quality Feedback Abstract:Reinforcement learning from human feedback (RLHF) has become a pivot technique in aligning large language models (LLMs) with human preferences. In RLHF practice, preference data plays a crucial role in bridging human proclivity and LLMs. However, the scarcity of diverse, naturalistic datasets of human preferences on LLM .. 2023. 10. 4.
[2023-10-03] 오늘의 자연어처리 Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles Abstract:This paper presents the results of the shared task on Lay Summarisation of Biomedical Research Articles (BioLaySumm), hosted at the BioNLP Workshop at ACL 2023. The goal of this shared task is to develop abstractive summarisation models capable of generating "lay summaries" (i.e., summaries .. 2023. 10. 3.
[2023-10-02] 오늘의 자연어처리 Prompt-and-Align: Prompt-Based Social Alignment for Few-Shot Fake News Detection Abstract:Despite considerable advances in automated fake news detection, due to the timely nature of news, it remains a critical open question how to effectively predict the veracity of news articles based on limited fact-checks. Existing approaches typically follow a "Train-from-Scratch" paradigm, which is fundamen.. 2023. 10. 2.
[2023-10-01] 오늘의 자연어처리 Augmenting transformers with recursively composed multi-grained representations Abstract:We present ReCAT, a recursive composition augmented Transformer that is able to explicitly model hierarchical syntactic structures of raw texts without relying on gold trees during both learning and inference. Existing research along this line restricts data to follow a hierarchical tree structure and thus l.. 2023. 10. 1.
반응형