본문 바로가기

arxiv572

[2022-12-08] 오늘의 자연어처리 Semantic-Conditional Diffusion Networks for Image Captioning Recent advances on text-to-image generation have witnessed the rise of diffusion models which act as powerful generative models. Nevertheless, it is not trivial to exploit such latent variable models to capture the dependency among discrete words and meanwhile pursue complex visual-language alignment in image captioning. In this paper,.. 2022. 12. 8.

[2022-12-07] 오늘의 자연어처리 Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single Transformer Systems for knowledge-intensive tasks such as open-domain question answering (QA) usually consist of two stages: efficient retrieval of relevant documents from a large corpus and detailed reading of the selected documents to generate answers. Retrievers and readers are usually modeled separately, whi.. 2022. 12. 7.

[2022-12-06] 오늘의 자연어처리 Tackling Low-Resourced Sign Language Translation: UPC at WMT-SLT 22 This paper describes the system developed at the Universitat Politècnica de Catalunya for the Workshop on Machine Translation 2022 Sign Language Translation Task, in particular, for the sign-to-text direction. We use a Transformer model implemented with the Fairseq modeling toolkit. We have experimented with the vocabulary size,.. 2022. 12. 6.

[2022-12-05] 오늘의 자연어처리 Noisy Label Detection for Speaker Recognition The success of deep neural networks requires both high annotation quality and massive data. However, the size and the quality of a dataset are usually a trade-off in practice, as data collection and cleaning are expensive and time-consuming. Therefore, automatic noisy label detection (NLD) techniques are critical to real-world applications, especiall.. 2022. 12. 5.

이전 1 ··· 102 103 104 105 106 107 108 ··· 143 다음

티스토리툴바