Monaural speech segregation in reverberant environments.
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Monaural speech segregation in reverberant environments.
Author:
Jin, Zhaozhang.
Description:
155 p.
Notes:
Source: Dissertation Abstracts International, Volume: 71-12, Section: B, page: 7598.
Contained By:
Dissertation Abstracts International 71-12B.
Subject:
Engineering, Computer.
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3429645
ISBN:
9781124292212
Monaural speech segregation in reverberant environments.
Jin, Zhaozhang.
Monaural speech segregation in reverberant environments.
- 155 p.
Source: Dissertation Abstracts International, Volume: 71-12, Section: B, page: 7598.
Thesis (Ph.D.)--The Ohio State University, 2010.
Room reverberation is a major source of signal degradation in real environments. While listeners excel in "hearing out" a target source from sound mixtures in noisy and reverberant conditions, simulating this perceptual ability remains a fundamental challenge. The goal of this dissertation is to build a computational auditory scene analysis (CASA) system that separates target voiced speech from its acoustic background in reverberant environments. A supervised learning approach to pitch-based grouping of reverberant speech is proposed, followed by a robust multipitch tracking algorithm based on a hidden Markov model (HMM) framework. Finally, a monaural CASA system for reverberant speech segregation is designed by combining the supervised learning approach and the multipitch tracker.
ISBN: 9781124292212
Subjects--Topical Terms:
1669061
Engineering, Computer.
Monaural speech segregation in reverberant environments.
LDR
:04341nam 2200301 4500
001
1400154
005
20111005095604.5
008
130515s2010 ||||||||||||||||| ||eng d
020
$a
9781124292212
035
$a
(UMI)AAI3429645
035
$a
AAI3429645
040
$a
UMI
$c
UMI
100
1
$a
Jin, Zhaozhang.
$3
1679178
245
1 0
$a
Monaural speech segregation in reverberant environments.
300
$a
155 p.
500
$a
Source: Dissertation Abstracts International, Volume: 71-12, Section: B, page: 7598.
500
$a
Adviser: DeLiang Wang.
502
$a
Thesis (Ph.D.)--The Ohio State University, 2010.
520
$a
Room reverberation is a major source of signal degradation in real environments. While listeners excel in "hearing out" a target source from sound mixtures in noisy and reverberant conditions, simulating this perceptual ability remains a fundamental challenge. The goal of this dissertation is to build a computational auditory scene analysis (CASA) system that separates target voiced speech from its acoustic background in reverberant environments. A supervised learning approach to pitch-based grouping of reverberant speech is proposed, followed by a robust multipitch tracking algorithm based on a hidden Markov model (HMM) framework. Finally, a monaural CASA system for reverberant speech segregation is designed by combining the supervised learning approach and the multipitch tracker.
520
$a
Monaural speech segregation in reverberant environments is a particularly challenging problem. Although inverse filtering has been proposed to partially restore the harmonicity of reverberant speech before segregation, this approach is sensitive to specific source/receiver and room configurations. Assuming that the true target pitch is known, our first study leads to a novel supervised learning approach to monaural segregation of reverberant voiced speech, which learns to map a set of pitch-based auditory features to a grouping cue encoding the posterior probability of a time-frequency (T-F) unit being target dominant given observed features. We devise a novel objective function for the learning process, which directly relates to the goal of maximizing signal-to-noise ratio. The model trained using this objective function yields significantly better T-F unit labeling. A segmentation and grouping framework is utilized to form reliable segments under reverberant conditions and organize them into streams. Systematic evaluations show that our approach produces very promising results under various reverberant conditions and generalizes well to new utterances and new speakers.
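As a rough illustration of T-F unit labeling from posterior probabilities, the sketch below thresholds a grid of posteriors into a binary mask. The function name `label_tf_units`, the 0.5 threshold, and the example values are assumptions made for illustration; the abstract does not specify the decision rule, the features, or the classifier used in the dissertation.

```python
def label_tf_units(posteriors, threshold=0.5):
    """Label each time-frequency unit as target dominant (1) or not (0).

    posteriors[t][c] is an assumed posterior probability that the unit at
    frame t and frequency channel c is target dominant. The 0.5 threshold
    is an illustrative choice, not taken from the dissertation.
    """
    return [[1 if p > threshold else 0 for p in frame] for frame in posteriors]


# Hypothetical posteriors for 2 frames x 3 channels.
example_posteriors = [
    [0.9, 0.2, 0.6],
    [0.4, 0.7, 0.1],
]
example_mask = label_tf_units(example_posteriors)
print(example_mask)
```

The resulting mask marks which units would be retained for the target stream under these assumptions.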
520
$a
Multipitch tracking in real environments is critical for speech signal processing. Determining pitch in both reverberant and noisy conditions is another difficult task. In the second study, we propose a robust algorithm for multipitch tracking in the presence of background noise and room reverberation. A new channel selection method is utilized to extract periodicity features. We derive pitch scores for each pitch state, which estimate the likelihoods of the observed periodicity features given pitch candidates. An HMM integrates these pitch scores and searches for the best pitch state sequence. Our algorithm can reliably detect single and double pitch contours in noisy and reverberant conditions.
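The search for the best pitch state sequence in an HMM is commonly done with the Viterbi algorithm. The sketch below is a generic log-domain Viterbi decoder, not the dissertation's tracker: the two-state space, the transition probabilities, and the per-frame scores are all hypothetical values chosen for illustration, and the abstract does not state which decoding procedure was used.

```python
import math


def viterbi(log_scores, log_trans, log_prior):
    """Return the most likely state sequence under a generic HMM.

    log_scores[t][s]: log score of state s at frame t (assumed input).
    log_trans[p][s]:  log probability of moving from state p to state s.
    log_prior[s]:     log probability of starting in state s.
    """
    n_states = len(log_prior)
    delta = [log_prior[s] + log_scores[0][s] for s in range(n_states)]
    backptrs = []
    for t in range(1, len(log_scores)):
        new_delta, ptr = [], []
        for s in range(n_states):
            # Best predecessor for state s at frame t.
            best = max(range(n_states), key=lambda p: delta[p] + log_trans[p][s])
            ptr.append(best)
            new_delta.append(delta[best] + log_trans[best][s] + log_scores[t][s])
        delta = new_delta
        backptrs.append(ptr)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: delta[s])
    path = [state]
    for ptr in reversed(backptrs):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


# Hypothetical two-state example with "sticky" transitions.
log = math.log
example_prior = [log(0.5), log(0.5)]
example_trans = [[log(0.9), log(0.1)], [log(0.1), log(0.9)]]
example_scores = [
    [log(0.8), log(0.2)],
    [log(0.4), log(0.6)],
    [log(0.8), log(0.2)],
]
example_path = viterbi(example_scores, example_trans, example_prior)
print(example_path)  # [0, 0, 0]
```

In this toy example the middle frame locally favors state 1, but the transition model keeps the decoded path in state 0, which shows how sequence-level decoding can smooth over unreliable frame-level scores.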
520
$a
Building on the first two studies, we propose a CASA approach to monaural segregation of reverberant voiced speech, which performs multipitch tracking of reverberant mixtures and supervised classification. Speech and nonspeech models are separately trained, and each learns to map pitch-based features to the posterior probability of a T-F unit being dominated by the source with the given pitch estimate. Because interference can be either speech or nonspeech, a likelihood ratio test is introduced to select the correct model for labeling corresponding T-F units. Experimental results show that the proposed system performs robustly in different types of interference and various reverberant conditions, and has a significant advantage over existing systems.
590
$a
School code: 0168.
650
4
$a
Engineering, Computer.
$3
1669061
690
$a
0464
710
2
$a
The Ohio State University.
$3
718944
773
0
$t
Dissertation Abstracts International
$g
71-12B.
790
1 0
$a
Wang, DeLiang,
$e
advisor
790
$a
0168
791
$a
Ph.D.
792
$a
2010
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3429645
Items
Inventory Number:
W9163293
Location Name:
Electronic resources
Item Class:
11. Online viewing_V
Material type:
E-book
Call number:
EB
Usage Class:
Normal
Loan Status:
On shelf
No. of reservations:
0