In this paper, we construct a multimodal classification model to predict pineapple quality. We compile a dataset of 500 pineapples covering two modalities: tapping each pineapple to record the resulting sound, and photographing it with a camera. Three classification models, an audio model, a visual model, and an audio-visual model, are built on deep learning architectures over the corresponding feature representations. The experimental evaluation demonstrates that the audio-visual model, which combines both audio and visual representations, achieves higher accuracy than either unimodal model.
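To make the audio-visual design concrete, the following is a minimal sketch of one common way to combine the two modalities: each modality is passed through its own encoder and the resulting embeddings are concatenated before a shared classification head. The encoder structure, feature dimensions, and class count here are illustrative assumptions, not the architecture actually used in this work.

```python
import torch
import torch.nn as nn

class AudioVisualClassifier(nn.Module):
    """Sketch of a feature-level (concatenation) fusion model for
    pineapple quality classification. All dimensions are hypothetical."""

    def __init__(self, audio_dim=128, visual_dim=512, num_classes=2):
        super().__init__()
        # Hypothetical unimodal encoders; in practice these would be
        # deep backbones over tapping-sound spectrograms and images.
        self.audio_encoder = nn.Sequential(nn.Linear(audio_dim, 64), nn.ReLU())
        self.visual_encoder = nn.Sequential(nn.Linear(visual_dim, 64), nn.ReLU())
        # Shared head classifies the fused (concatenated) embedding.
        self.classifier = nn.Linear(64 + 64, num_classes)

    def forward(self, audio_feat, visual_feat):
        a = self.audio_encoder(audio_feat)
        v = self.visual_encoder(visual_feat)
        fused = torch.cat([a, v], dim=1)  # simple concatenation fusion
        return self.classifier(fused)

# Usage with random tensors standing in for real audio/visual features.
model = AudioVisualClassifier()
audio = torch.randn(8, 128)    # batch of 8 audio feature vectors
visual = torch.randn(8, 512)   # batch of 8 visual feature vectors
logits = model(audio, visual)  # shape: (8, 2)
```

Dropping either encoder from this sketch recovers a unimodal audio or visual model, which mirrors the three-model comparison described above.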