Library Catalog

000 nam5i
001 2210080933726
003 DE-He213
005 20250321105239
007 cr nn 008mamaa
008 240214s2024 si | s |||| 0|eng d
020 $a9789819706013 $9978-981-97-0601-3
024 $a10.1007/978-981-97-0601-3 $2doi
040 $a221008
050 $aTA1634
072 $aUYQV $2bicssc
072 $aCOM016000 $2bisacsh
072 $aUYQV $2thema
082 $a006.37 $223
245 00 $aMan-Machine Speech Communication $h[electronic resource] : $b18th National Conference, NCMMSC 2023, Suzhou, China, December 8–10, 2023, Proceedings / $cedited by Jia Jia, Zhenhua Ling, Xie Chen, Ya Li, Zixing Zhang.
250 $a1st ed. 2024.
264 $aSingapore : $bSpringer Nature Singapore : $bImprint: Springer, $c2024.
300 $aXIV, 368 p. 108 illus., 86 illus. in color. $bonline resource.
336 $atext $btxt $2rdacontent
337 $acomputer $bc $2rdamedia
338 $aonline resource $bcr $2rdacarrier
347 $atext file $bPDF $2rda
490 $aCommunications in Computer and Information Science, $x1865-0937 ; $v2006
505 $aUltra-Low Complexity Residue Echo and Noise Suppression Based on Recurrent Neural Network -- Semi-End-to-End Nested Named Entity Recognition from Speech -- A Lightweight Music Source Separation Model with Graph Convolution Network -- Joint time-domain and frequency-domain progressive learning for single-channel speech enhancement and recognition -- A Study on Domain Adaptation for Audio-visual Speech Enhancement -- APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra -- Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification -- Joint speech and noise estimation using SNR-adaptive target learning for deep-learning-based speech enhancement -- Data Augmentation By Finite Element Analysis for Enhanced Machine Anomalous Sound Detection -- A Fast Sampling Method in Diffusion-based Dance Generation Models -- End-to-end Streaming Customizable Keyword Spotting based on text-adaptive neural search -- The Production of Successive Addition Boundary Tone in Mandarin Preschoolers -- Emotional Support Dialog System Through Recursive Interactions Among Large Language Models -- Task-Adaptive Generative Adversarial Network based Speech Dereverberation for Robust Speech Recognition -- Real-time Automotive Engine Sound Simulation with Deep Neural Network -- A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement -- Accent-VITS: accent transfer for end-to-end TTS -- Multi-branch Network with Cross-Domain Feature Fusion for Anomalous Sound Detection -- A Packet Loss Concealment Method Based on the Demucs Network Structure -- Improving Speech Perceptual Quality and Intelligibility through Sub-band Temporal Envelope Characteristics -- Adaptive Deep Graph Convolutional Network For Dialogical Speech Emotion Recognition -- Iterative Noisy-target Approach: Speech Enhancement without Clean Speech -- Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization -- Zero-shot Singing Voice Conversion Method Based on Timbre Space Modeling and Excitation Signal Control -- A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound Detection -- CAM-GUI: A Conversational Assistant on Mobile GUI -- A Pilot Study on the Prosodic Factors Influencing Voice Attractiveness of AI Speech -- The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023 -- Chinese EFL Learners’ Auditory and Visual Perception of English Statement and Question Intonation: The Effect of Stress -- An Improved System for Partially Fake Audio Detection Using Pre-trained Model -- Leveraging Synthetic Speech for CIF-based Customized Keyword Spotting.
520 $aThis book constitutes the refereed proceedings of the 18th National Conference on Man-Machine Speech Communication, NCMMSC 2023, held in Suzhou, China, during December 8–11, 2023. The 20 full papers and 11 short papers included in this book were carefully reviewed and selected from 117 submissions. They deal with topics such as speech recognition, synthesis, enhancement and coding, audio/music/singing synthesis, avatar, speaker recognition and verification, human–computer dialogue systems, large language models as well as phonetic and linguistic topics such as speech prosody analysis, pathological speech analysis, experimental phonetics, acoustic scene classification.
650 $aComputer vision.
650 $aNatural language processing (Computer science).
650 $aSignal processing.
650 $aArtificial intelligence.
650 $aUser interfaces (Computer systems).
650 $aHuman-computer interaction.
650 $aComputer Vision.
650 $aNatural Language Processing (NLP).
650 $aSignal, Speech and Image Processing.
650 $aArtificial Intelligence.
650 $aUser Interfaces and Human Computer Interaction.
700 $aJia, Jia. $eeditor. $4edt $4http://id.loc.gov/vocabulary/relators/edt
700 $aLing, Zhenhua. $eeditor. $4edt $4http://id.loc.gov/vocabulary/relators/edt
700 $aChen, Xie. $eeditor. $4edt $4http://id.loc.gov/vocabulary/relators/edt
700 $aLi, Ya. $eeditor. $4edt $4http://id.loc.gov/vocabulary/relators/edt
700 $aZhang, Zixing. $eeditor. $4edt $4http://id.loc.gov/vocabulary/relators/edt
710 $aSpringerLink (Online service)
773 $tSpringer Nature eBook
776 $iPrinted edition: $z9789819706006
776 $iPrinted edition: $z9789819706020
830 $aCommunications in Computer and Information Science, $x1865-0937 ; $v2006
856 $uhttps://doi.org/10.1007/978-981-97-0601-3
912 $aZDB-2-SCS
912 $aZDB-2-SXCS
950 $aComputer Science (SpringerNature-11645)
950 $aComputer Science (R0) (SpringerNature-43710)
Material type
E-book
Title
Man-Machine Speech Communication [electronic resource] : 18th National Conference, NCMMSC 2023, Suzhou, China, December 8–10, 2023, Proceedings / edited by Jia Jia, Zhenhua Ling, Xie Chen, Ya Li, Zixing Zhang
Author's Name
Jia, Jia, editor; Ling, Zhenhua, editor; Chen, Xie, editor; Li, Ya, editor; Zhang, Zixing, editor
Edition
1st ed. 2024.
Physical Description
XIV, 368 p., 108 illus., 86 illus. in color; online resource.
Keyword
This book constitutes the refereed proceedings of the 18th National Conference on Man-Machine Speech Communication, NCMMSC 2023, held in Suzhou, China, during December 8–11, 2023. The 20 full papers and 11 short papers included in this book were carefully reviewed and selected from 117 submissions. They deal with topics such as speech recognition, synthesis, enhancement and coding, audio/music/singing synthesis, avatar, speaker recognition and verification, human–computer dialogue systems, large language models as well as phonetic and linguistic topics such as speech prosody analysis, pathological speech analysis, experimental phonetics, acoustic scene classification.
Contents note
Ultra-Low Complexity Residue Echo and Noise Suppression Based on Recurrent Neural Network / Semi-End-to-End Nested Named Entity Recognition from Speech / A Lightweight Music Source Separation Model with Graph Convolution Network / Joint time-domain and frequency-domain progressive learning for single-channel speech enhancement and recognition / A Study on Domain Adaptation for Audio-visual Speech Enhancement / APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra / Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification / Joint speech and noise estimation using SNR-adaptive target learning for deep-learning-based speech enhancement / Data Augmentation By Finite Element Analysis for Enhanced Machine Anomalous Sound Detection / A Fast Sampling Method in Diffusion-based Dance Generation Models / End-to-end Streaming Customizable Keyword Spotting based on text-adaptive neural search / The Production of Successive Addition Boundary Tone in Mandarin Preschoolers / Emotional Support Dialog System Through Recursive Interactions Among Large Language Models / Task-Adaptive Generative Adversarial Network based Speech Dereverberation for Robust Speech Recognition / Real-time Automotive Engine Sound Simulation with Deep Neural Network / A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement / Accent-VITS: accent transfer for end-to-end TTS / Multi-branch Network with Cross-Domain Feature Fusion for Anomalous Sound Detection / A Packet Loss Concealment Method Based on the Demucs Network Structure / Improving Speech Perceptual Quality and Intelligibility through Sub-band Temporal Envelope Characteristics / Adaptive Deep Graph Convolutional Network For Dialogical Speech Emotion Recognition / Iterative Noisy-target Approach: Speech Enhancement without Clean Speech / Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization / Zero-shot Singing Voice Conversion Method Based on Timbre Space Modeling and Excitation Signal Control / A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound Detection / CAM-GUI: A Conversational Assistant on Mobile GUI / A Pilot Study on the Prosodic Factors Influencing Voice Attractiveness of AI Speech / The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023 / Chinese EFL Learners’ Auditory and Visual Perception of English Statement and Question Intonation: The Effect of Stress / An Improved System for Partially Fake Audio Detection Using Pre-trained Model / Leveraging Synthetic Speech for CIF-based Customized Keyword Spotting.
Related URL

Holdings Information

No holdings information exists for electronic resources.
