학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

FSD: An Initial Chinese Dataset for Fake Song Detection

Resource Type: Working Paper
Authors: Xie, Yuankun; Zhou, Jingjing; Lu, Xiaolin; Jiang, Zhenghao; Yang, Yuxin; Cheng, Haonan; Ye, Long
Source
Subject: Computer Science - Sound
Computer Science - Artificial Intelligence
Electrical Engineering and Systems Science - Audio and Speech Processing
Language

Online Access

초록

Singing voice synthesis and singing voice conversion have significantly advanced, revolutionizing musical experiences. However, the rise of "Deepfake Songs" generated by these technologies raises concerns about authenticity. Unlike Audio DeepFake Detection (ADD), the field of song deepfake detection lacks specialized datasets or methods for song authenticity verification. In this paper, we initially construct a Chinese Fake Song Detection (FSD) dataset to investigate the field of song deepfake detection. The fake songs in the FSD dataset are generated by five state-of-the-art singing voice synthesis and singing voice conversion methods. Our initial experiments on FSD revealed the ineffectiveness of existing speech-trained ADD models for the task of song deepFake detection. Thus, we employ the FSD dataset for the training of ADD models. We subsequently evaluate these models under two scenarios: one with the original songs and another with separated vocal tracks. Experiment results show that song-trained ADD models exhibit a 38.58% reduction in average equal error rate compared to speech-trained ADD models on the FSD test set.
Comment: Submitted to ICASSP 2024

공지

DAU Library

학술논문

요약정보

FSD: An Initial Chinese Dataset for Fake Song Detection

Online Access

초록