Speech and computer = 25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023 : proceedings.. Part II /
Record type:
Bibliographic - electronic resource : Monograph/item
Title/Author:
Speech and computer / edited by Alexey Karpov ... [et al.].
Other title:
25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023 : proceedings.
Other title:
SPECOM 2023
Other author:
Karpov, Alexey.
Corporate author:
International Conference Speech and Computer
Publisher:
Cham : Springer Nature Switzerland : Imprint: Springer, 2023.
Physical description:
xxvi, 568 p. : illustrations (some col.), digital ; 24 cm.
Contents note:
Industrial Speech and Language Technology -- Analysing Breathing Patterns in Reading and Spontaneous Speech -- Audio-Visual Speaker Verification via Joint Cross Attention -- A Novel Scheme to Classify Read and Spontaneous Speech -- Analysis of a Hinglish ASR System's Performance for Fraud Detection -- Anomaly Detection in Speech: A Comprehensive Approach for Enhanced Speech Analysis -- CAPTuring Accents: An Approach to Personalize Pronunciation Training for Learners with Different L1 Backgrounds -- Speech Technology for Under-Resourced Languages -- Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages -- Phone Durations Modeling for Livvi-Karelian ASR -- Significance of Indic Self-Supervised Speech Representations for Indic Under-Resourced ASR -- Study of Various End-to-End Keyword Spotting Systems on the Bengali language under Low-Resource Condition -- Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language -- Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children's ASR -- Code-Mixed Text-to-Speech Synthesis under Low-Resource Constraints -- An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language -- An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language -- Cross Lingual Style Transfer using Multiscale Loss Function for Soliga: A Low Resource Tribal Language -- Preliminary Analysis of Lambani Vowels and Vowel Classification using Acoustic Feature -- Curriculum Learning based Approach for Faster Convergence of TTS Model -- Rhythm Measures and Language Endangerment: the Case of Deori -- Konkani Phonetic Transcription System 1.0 -- Speech Analysis and Synthesis -- E-TTS: Expressive Text-to-Speech Synthesis for Hindi using Data Augmentation -- Direct vs Cascaded Speech-to-Speech Translation using Transformer -- Deep Learning based Speech Quality Assessment Focusing on Noise Effects -- Quantifying the 
Emotional Landscape of Music with Three Dimensions -- Analysis of Mandarin vs. English Language for Emotional Voice Conversion -- Audio DeepFake Detection Employing Multiple Parametric Exponential Linear Units -- A Comparison of Learned Representations with Jointly Optimized VAE and DNN for Syllable Stress Detection -- On the Asymptotic Behaviour of the Speech Signal -- Improvement of Audio-Visual Keyword Spotting System Accuracy using Excitation Source Feature -- Developing a Question Answering System on the material of Holocaust survivors' testimonies in Russian -- Enhancing Children's Short Utterance based ASV using Data Augmentation Techniques and Feature Concatenation Approach -- Studying the Effectiveness of Data Augmentation and Frequency-Domain Linear Prediction Coefficients in Children's Speaker Verification under Low-Resource Conditions -- Constant-Q based Harmonic and Pitch Features for Normal vs Pathological Infant Cry Classification -- Robustness of Whisper Features for Infant Cry Classification -- Speaker and Language Identification, Verification, and Diarization -- I-MSV 2022: Indic-Multilingual and Multi-Sensor Speaker Verification Challenge -- Multi-Task Learning over Mixup Variants for the Speaker Verification Task -- Exploring the Impact of Different Approaches for Spoken Dialect Identification of Konkani Language -- Adversarially Trained Hierarchical Attention Network for Domain-Invariant Spoken Language Identification -- Ensemble of Incremental System Enhancements for Robust Speaker Diarization in Code-Switched Real-Life Audios -- Enhancing Language Identification in Indian Context through Exploiting Learned Features with Wav2Vec2.0 -- Design and Development of Voice OTP Authentication System -- End-to-End Native Language Identification using a Modified Vision Transformer(ViT) from L2 English Speech -- Dialect Identification in Ao using Modulation-based Representation -- Self-Supervised Speaker Verification Employing Augmentation Mix and 
Self-Augmented Training-based Clustering.
Contained By:
Springer Nature eBook
Subject:
Natural language processing (Computer science) -- Congresses.
Electronic resource:
https://doi.org/10.1007/978-3-031-48312-7
ISBN:
9783031483127
Speech and computer
25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023 : proceedings. Part II / [electronic resource] : SPECOM 2023 / edited by Alexey Karpov ... [et al.]. - Cham : Springer Nature Switzerland, 2023. - xxvi, 568 p. : illustrations (some col.), digital ; 24 cm. - Lecture notes in computer science, 0302-9743 ; 14339.
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29-December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
ISBN: 9783031483127
Standard No.: 10.1007/978-3-031-48312-7 (doi)
Subjects--Topical Terms:
Natural language processing (Computer science)--Congresses.
LC Class. No.: QA76.9.N38
Dewey Class. No.: 006.35
LDR
:06041nmm a2200361 a 4500
001
2336013
003
DE-He213
005
20231121204311.0
006
m d
007
cr nn 008maaau
008
240402s2023 sz s 0 eng d
020
$a
9783031483127
$q
(electronic bk.)
020
$a
9783031483110
$q
(paper)
024
7
$a
10.1007/978-3-031-48312-7
$2
doi
035
$a
978-3-031-48312-7
040
$a
GP
$c
GP
041
0
$a
eng
050
4
$a
QA76.9.N38
072
7
$a
UYQ
$2
bicssc
072
7
$a
COM004000
$2
bisacsh
072
7
$a
UYQ
$2
thema
082
0 4
$a
006.35
$2
23
090
$a
QA76.9.N38
$b
I61 2023
111
2
$a
International Conference Speech and Computer
$n
(25th :
$d
2023 :
$c
Dharwad, India ; Online)
$3
3668815
245
1 0
$a
Speech and computer
$h
[electronic resource] :
$b
25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023 : proceedings.
$n
Part II /
$c
edited by Alexey Karpov ... [et al.].
246
3
$a
SPECOM 2023
260
$a
Cham :
$b
Springer Nature Switzerland :
$b
Imprint: Springer,
$c
2023.
300
$a
xxvi, 568 p. :
$b
illustrations (some col.), digital ;
$c
24 cm.
490
1
$a
Lecture notes in computer science,
$x
0302-9743 ;
$v
14339
490
1
$a
Lecture notes in artificial intelligence
505
0
$a
Industrial Speech and Language Technology -- Analysing Breathing Patterns in Reading and Spontaneous Speech -- Audio-Visual Speaker Verification via Joint Cross Attention -- A Novel Scheme to Classify Read and Spontaneous Speech -- Analysis of a Hinglish ASR System's Performance for Fraud Detection -- Anomaly Detection in Speech: A Comprehensive Approach for Enhanced Speech Analysis -- CAPTuring Accents: An Approach to Personalize Pronunciation Training for Learners with Different L1 Backgrounds -- Speech Technology for Under-Resourced Languages -- Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages -- Phone Durations Modeling for Livvi-Karelian ASR -- Significance of Indic Self-Supervised Speech Representations for Indic Under-Resourced ASR -- Study of Various End-to-End Keyword Spotting Systems on the Bengali language under Low-Resource Condition -- Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language -- Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children's ASR -- Code-Mixed Text-to-Speech Synthesis under Low-Resource Constraints -- An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language -- An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language -- Cross Lingual Style Transfer using Multiscale Loss Function for Soliga: A Low Resource Tribal Language -- Preliminary Analysis of Lambani Vowels and Vowel Classification using Acoustic Feature -- Curriculum Learning based Approach for Faster Convergence of TTS Model -- Rhythm Measures and Language Endangerment: the Case of Deori -- Konkani Phonetic Transcription System 1.0 -- Speech Analysis and Synthesis -- E-TTS: Expressive Text-to-Speech Synthesis for Hindi using Data Augmentation -- Direct vs Cascaded Speech-to-Speech Translation using Transformer -- Deep Learning based Speech Quality Assessment Focusing on Noise Effects -- Quantifying the 
Emotional Landscape of Music with Three Dimensions -- Analysis of Mandarin vs. English Language for Emotional Voice Conversion -- Audio DeepFake Detection Employing Multiple Parametric Exponential Linear Units -- A Comparison of Learned Representations with Jointly Optimized VAE and DNN for Syllable Stress Detection -- On the Asymptotic Behaviour of the Speech Signal -- Improvement of Audio-Visual Keyword Spotting System Accuracy using Excitation Source Feature -- Developing a Question Answering System on the material of Holocaust survivors' testimonies in Russian -- Enhancing Children's Short Utterance based ASV using Data Augmentation Techniques and Feature Concatenation Approach -- Studying the Effectiveness of Data Augmentation and Frequency-Domain Linear Prediction Coefficients in Children's Speaker Verification under Low-Resource Conditions -- Constant-Q based Harmonic and Pitch Features for Normal vs Pathological Infant Cry Classification -- Robustness of Whisper Features for Infant Cry Classification -- Speaker and Language Identification, Verification, and Diarization -- I-MSV 2022: Indic-Multilingual and Multi-Sensor Speaker Verification Challenge -- Multi-Task Learning over Mixup Variants for the Speaker Verification Task -- Exploring the Impact of Different Approaches for Spoken Dialect Identification of Konkani Language -- Adversarially Trained Hierarchical Attention Network for Domain-Invariant Spoken Language Identification -- Ensemble of Incremental System Enhancements for Robust Speaker Diarization in Code-Switched Real-Life Audios -- Enhancing Language Identification in Indian Context through Exploiting Learned Features with Wav2Vec2.0 -- Design and Development of Voice OTP Authentication System -- End-to-End Native Language Identification using a Modified Vision Transformer(ViT) from L2 English Speech -- Dialect Identification in Ao using Modulation-based Representation -- Self-Supervised Speaker Verification Employing Augmentation Mix and 
Self-Augmented Training-based Clustering. .
520
$a
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29-December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
650
0
$a
Natural language processing (Computer science)
$v
Congresses.
$3
752585
650
0
$a
Automatic speech recognition
$v
Congresses.
$3
840482
650
0
$a
Speech processing systems
$x
Congresses.
$3
678615
650
0
$a
Human-computer interaction
$x
Congresses.
$3
705966
650
0
$a
Linguistics
$v
Congresses.
$3
792572
650
1 4
$a
Artificial Intelligence.
$3
769149
650
2 4
$a
Computer Imaging, Vision, Pattern Recognition and Graphics.
$3
890871
650
2 4
$a
Computer Engineering and Networks.
$3
3538504
650
2 4
$a
Computer and Information Systems Applications.
$3
3538505
700
1
$a
Karpov, Alexey.
$3
3251780
710
2
$a
SpringerLink (Online service)
$3
836513
773
0
$t
Springer Nature eBook
830
0
$a
Lecture notes in computer science ;
$v
14339.
$3
3668817
830
0
$a
Lecture notes in artificial intelligence.
$3
3382562
856
4 0
$u
https://doi.org/10.1007/978-3-031-48312-7
950
$a
Computer Science (SpringerNature-11645)
Holdings
Barcode:
W9462218
Location:
Electronic resource
Circulation category:
11. Online viewing_V
Material type:
E-book
Call number:
EB QA76.9.N38
Use type:
Normal
Loan status:
On shelf
Hold status:
0