語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Nonstationary time series modeling w...
~
Rudoy, Daniel.
FindBook
Google Book
Amazon
博客來
Nonstationary time series modeling with applications to speech signal processing.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Nonstationary time series modeling with applications to speech signal processing./
作者:
Rudoy, Daniel.
面頁冊數:
349 p.
附註:
Source: Dissertation Abstracts International, Volume: 72-01, Section: B, page: .
Contained By:
Dissertation Abstracts International72-01B.
標題:
Applied Mathematics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3435443
ISBN:
9781124339221
Nonstationary time series modeling with applications to speech signal processing.
Rudoy, Daniel.
Nonstationary time series modeling with applications to speech signal processing.
- 349 p.
Source: Dissertation Abstracts International, Volume: 72-01, Section: B, page: .
Thesis (Ph.D.)--Harvard University, 2010.
We develop statistical methods for the analysis of nonstationary time series and apply them to a variety of problems arising in speech signal processing. Information-carrying natural sound signals such as speech exhibit a degree of controlled nonstationarity in that their statistical properties vary slowly over time. Faithfully modeling these temporal variations is extremely valuable for a wide range of applications and can be accomplished by relying on well-understood acoustic models of speech production, which motivate many of the methods developed in this thesis.
ISBN: 9781124339221Subjects--Topical Terms:
1669109
Applied Mathematics.
Nonstationary time series modeling with applications to speech signal processing.
LDR
:03438nam 2200325 4500
001
1396640
005
20110610084504.5
008
130515s2010 ||||||||||||||||| ||eng d
020
$a
9781124339221
035
$a
(UMI)AAI3435443
035
$a
AAI3435443
040
$a
UMI
$c
UMI
100
1
$a
Rudoy, Daniel.
$3
1675428
245
1 0
$a
Nonstationary time series modeling with applications to speech signal processing.
300
$a
349 p.
500
$a
Source: Dissertation Abstracts International, Volume: 72-01, Section: B, page: .
500
$a
Adviser: Patrick J. Wolfe.
502
$a
Thesis (Ph.D.)--Harvard University, 2010.
520
$a
We develop statistical methods for the analysis of nonstationary time series and apply them to a variety of problems arising in speech signal processing. Information-carrying natural sound signals such as speech exhibit a degree of controlled nonstationarity in that their statistical properties vary slowly over time. Faithfully modeling these temporal variations is extremely valuable for a wide range of applications and can be accomplished by relying on well-understood acoustic models of speech production, which motivate many of the methods developed in this thesis.
520
$a
First, we make a number of contributions to the classical problem of formant tracking, in which vocal tract resonances are estimated under the assumption of their invariance on the 15-30 ms scale. Next, we relax this piecewise-stationarity constraint and model the temporal dynamics of the vocal tract using time-varying autoregressive (TVAR) models. We develop their algebraic and geometric properties, introduce several new estimators, and use TVAR models to develop a hypothesis test to detect the presence of vocal tract variation in speech waveform data. We study its asymptotic properties, and illustrate its practical efficacy by detecting vocal tract changes across different timescales of speech dynamics.
520
$a
Next, we explore how standard fixed-resolution short-time Fourier representations may be generalized in order to adapt to the time-frequency structure of a speech signal. To this end, we introduce a family of adaptive, linear time-frequency representations termed superposition frames and show that they are invertible, numerically-stable, and admit fast overlap-add reconstruction akin to standard short-time Fourier techniques. The general construction proceeds via a local signal-adaptive modification of a Gabor frame. Two signal-dependent schemes for selecting an appropriate superposition frame for signal analysis are given, and the framework is illustrated in the context of speech enhancement.
520
$a
Finally, we introduce a joint model of the vocal tract and the source waveform in order to take into account its quasi-periodic temporal variations during voicing. We incorporate an estimate of the source waveform into the traditional linear prediction framework via nonparametric wavelet regression; the resultant semi-parametric model is applied to various speech analysis problems including formant and source-harmonics-to-noise ratio estimation, inverse filtering, and voicing detection.
590
$a
School code: 0084.
650
4
$a
Applied Mathematics.
$3
1669109
650
4
$a
Statistics.
$3
517247
650
4
$a
Engineering, Electronics and Electrical.
$3
626636
690
$a
0364
690
$a
0463
690
$a
0544
710
2
$a
Harvard University.
$3
528741
773
0
$t
Dissertation Abstracts International
$g
72-01B.
790
1 0
$a
Wolfe, Patrick J.,
$e
advisor
790
$a
0084
791
$a
Ph.D.
792
$a
2010
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3435443
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9159779
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入