Research on the Construction of Multimodal Datasets for Digital Libraries
- Resource Type
- Conference
- Authors
- Zeng, Yi; Zhou, Juxiang; Xu, Tianwei
- Source
- 2023 5th International Conference on Computer Science and Technologies in Education (CSTE), pp. 330-334, Apr. 2023
- Subject
- Computing and Processing
- Computer science
- Statistical analysis
- Filtering
- Annotations
- Soft sensors
- Education
- Crawlers
- digital library
- dataset construction
- multimodal
- cross-modal retrieval
- Language
To apply cross-modal retrieval to digital library retrieval services, this paper first collects image and text data from digital libraries using crawlers and filters out unqualified data, then writes text descriptions for images that lack them, and finally labels the images using annotation tools. The resulting multimodal dataset contains 4400 image-text pairs and is validated experimentally with several cross-modal retrieval methods. The experiments show that the dataset is suitable for cross-modal retrieval research.
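The filtering and description steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Record` fields, the `MIN_SIDE` size threshold, and the `split_records` helper are all hypothetical, chosen only to show how crawled items might be split into usable image-text pairs and images that still need a description written for them.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical quality threshold: the record does not state the actual criteria.
MIN_SIDE = 64  # minimum image width/height in pixels (assumption)


@dataclass
class Record:
    """One crawled item; the field names are illustrative assumptions."""
    image_path: str
    width: int
    height: int
    text: Optional[str] = None   # description crawled alongside the image, if any
    label: Optional[str] = None  # category assigned later with an annotation tool


def is_qualified(rec: Record) -> bool:
    """Keep only images whose shorter side meets the assumed size threshold."""
    return min(rec.width, rec.height) >= MIN_SIDE


def has_text(rec: Record) -> bool:
    """True when the record carries a non-empty description."""
    return bool(rec.text and rec.text.strip())


def split_records(records: List[Record]) -> Tuple[List[Record], List[Record]]:
    """Drop unqualified records, then separate complete image-text pairs
    from images that still need a description."""
    kept = [r for r in records if is_qualified(r)]
    pairs = [r for r in kept if has_text(r)]
    needs_text = [r for r in kept if not has_text(r)]
    return pairs, needs_text


if __name__ == "__main__":
    crawled = [
        Record("a.jpg", 200, 300, "An old city map"),
        Record("b.jpg", 10, 10, "Too small to use"),
        Record("c.jpg", 128, 128),
    ]
    pairs, needs_text = split_records(crawled)
    print(len(pairs), "complete pairs;", len(needs_text), "images need a description")
```

Records returned in `needs_text` would then receive a manually written description and rejoin the pair list before the labeling step.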