Text, speech, and dialogue = 27th In...
TSD (Conference) (2024 :)

FindBook      Google Book      Amazon      博客來     
  • Text, speech, and dialogue = 27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024 : proceedingss.. Part II /
  • 紀錄類型: 書目-電子資源 : Monograph/item
    正題名/作者: Text, speech, and dialogue/ edited by Elmar Nöth, Aleš Horák, Petr Sojka.
    其他題名: 27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024 : proceedingss.
    其他題名: TSD 2024
    其他作者: Nöth, Elmar.
    團體作者: TSD (Conference)
    出版者: Cham :Springer Nature Switzerland : : 2024.,
    面頁冊數: xvii, 326 p. :ill. (some col.), digital ;24 cm.
    內容註: Speech. -- Retrieval Augmented Spoken Language Generation for Transport Domain. -- Adapting Audiovisual Speech Synthesis to Estonian. -- Dysphonia Diagnosis Using Self-Supervised Speech Models in Mono- and Cross-Lingual Settings. -- Sentences vs Phrases in Neural Speech Synthesis. -- Zero-Shot vs. Few-Shot Multi-Speaker TTS Using Pre-trained Czech SpeechT5 Model. -- Deep Speaker Embeddings for Speaker Verification of Children. -- Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding. -- Attention to Phonetics: A Visually Informed Explanation of Speech Transformers. -- Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis. -- Stream-Based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning. -- Data Alignment and Duration Modelling in VITS. -- Multiword Expressions Resources for Italian: Presenting a Manually Annotated Spoken Corpus. -- Generating High-Quality F0 Embeddings Using the Vector-Quantized Variational Autoencoder. -- Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation. -- X-vector-based Speaker Diarization Using Bi-LSTM and Interim Voting-driven Post-processing. -- A Paradigm for Interpreting Metrics and Measuring Error Severity in Automatic Speech Recognition. -- Enhancing Speech Emotion Recognition Using Transfer Learning From Speaker Embeddings. -- Dialogue. -- Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets. -- PiCo-VITS: Leveraging Pitch Contours for Fine-grained Emotional Speech Synthesis. -- Improving and Understanding Clarifying Question Generation in Conversational Search. -- Explainable Multimodal Fusion for Dementia Detection From Text and Speech. -- Robust Classification of Parkinson's Speech: an Approximation to a Scenario With Non-controlled Acoustic Conditions. -- Leveraging Conceptual Similarities to Enhance Modeling of Factors Affecting Adolescents' Well-Being. -- Joint-Average Mean and Variance Feature Matching (JAMVFM) Semi-supervised GAN with Additional-Objective Training Function for Intent Detection. -- Capturing Task-Related Information for Text-Based Grasp Classification Using Fine-Tuned Embeddings. -- StepDP: A Step Towards Expressive and Pervasive Dialogue Platforms. -- Automatic Classification of Parkinson's Disease Using Wav2vec Embeddings at Phoneme, Syllable, and Word Levels.
    Contained By: Springer Nature eBook
    標題: Natural language processing (Computer science) - Congresses. -
    電子資源: https://doi.org/10.1007/978-3-031-70566-3
    ISBN: 9783031705663
館藏地:  出版年:  卷號: 
館藏
  • 1 筆 • 頁數 1 •
 
W9498793 電子資源 11.線上閱覽_V 電子書 EB QA76.9.N38 T73 2024 一般使用(Normal) 在架 0
  • 1 筆 • 頁數 1 •
多媒體
評論
Export
取書館
 
 
變更密碼
登入