Verifying the Effectiveness of Sentence Embedding Learning in Japanese based on Contrastive Learning with Non-linguistic Data / 非言語データを用いた対照学習による文埋め込み学習の日本語における効果検証

Resource Type: Journal Article
Authors: Daisuke KAWAHARA; Hirofumi SHIMIZU; 河原大輔; 清水博文
Source: Proceedings of the Annual Conference of JSAI. 2023, :3
Subject: Contrastive Learning; NLP; Sentence Embedding; 対照学習; 文埋め込み; 自然言語処理
Language: Japanese
ISSN: 2758-7347

Abstract

Sentence embeddings learned from text are widely used for tasks such as semantic textual similarity and the automatic evaluation of text generation. Among sentence embedding learning methods, SimCSE, which is based on contrastive learning, achieves high accuracy on the semantic textual similarity task. VisualCSE and AudioCSE, derivatives of SimCSE, add training on image and audio data, respectively, to the text-based training and have been shown to further improve accuracy in English. However, these methods using non-linguistic data have not been validated in Japanese. This study examines the effectiveness of VisualCSE in Japanese. In our experiments, VisualCSE in Japanese did not show the significant improvement in accuracy seen in the English experiments. We also analyze how sentence embedding learning is affected when noise data is used in place of image data.
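
As background for readers unfamiliar with the contrastive objective mentioned in the abstract, the following is a minimal sketch of an InfoNCE-style loss with in-batch negatives, the general form commonly associated with SimCSE. It is not the authors' implementation: the function names (`cosine`, `info_nce_loss`), the toy embeddings, and the temperature value are all assumptions chosen for illustration, and a real setup would obtain the embeddings from an encoder model rather than hard-coding them.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchors, positives, temperature=0.05):
    """Mean InfoNCE loss with in-batch negatives (hypothetical helper).

    anchors[i] and positives[i] are two embeddings of the same sentence;
    positives[j] with j != i serve as negatives for anchors[i].
    The temperature default is an illustrative assumption.
    """
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [cosine(anchor, p) / temperature for p in positives]
        # Numerically stable log-sum-exp over all candidates in the batch.
        max_logit = max(logits)
        log_denominator = max_logit + math.log(
            sum(math.exp(x - max_logit) for x in logits)
        )
        # Negative log-probability of picking the true positive.
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Toy 2-D embeddings (made-up values, for illustration only).
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]   # each positive is close to its anchor
swapped = [[0.1, 0.9], [0.9, 0.1]]   # each positive is close to the wrong anchor

loss_aligned = info_nce_loss(anchors, aligned)
loss_swapped = info_nce_loss(anchors, swapped)
print(loss_aligned < loss_swapped)
```

The loss is small when each anchor is most similar to its own positive and large when another item in the batch is more similar, which is the behavior contrastive training tries to exploit.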
