National Dong Hwa University Library

Representation, classification and information fusion for robust and efficient multimodal human states recognition.

Record type:	Bibliographic - electronic resource : Monograph/item
Title proper/Author:	Representation, classification and information fusion for robust and efficient multimodal human states recognition./
Author:	Li, Ming.
Extent:	166 p.
Notes:	Source: Dissertation Abstracts International, Volume: 75-02(E), Section: B.
Contained By:	Dissertation Abstracts International 75-02B(E).
Subject:	Engineering, Electronics and Electrical.
Electronic resource:	http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3598274
ISBN:	9781303467950

Thesis (Ph.D.)--University of Southern California, 2013.

This item must not be sold to any third party vendors.

The goal of this work is to enhance the robustness and efficiency of the multimodal human states recognition task. Human states recognition can be considered a joint term for identifying/verifying various kinds of human-related states, such as biometric identity, language spoken, age, gender, emotion, intoxication level, physical activity, vocal tract patterns, ECG QT intervals and so on. I performed research on the aforementioned states recognition problems, and my focus is to increase the performance while reducing the computational cost.

ISBN: 9781303467950

Subjects--Topical Terms:
626636 Engineering, Electronics and Electrical.

LDR:04819nmm a2200349 4500
001 2057354
005 20150610074909.5
008 170521s2013 ||||||||||||||||| ||eng d
020 $a 9781303467950
035 $a (MiAaPQ)AAI3598274
035 $a AAI3598274
040 $a MiAaPQ $c MiAaPQ
100 1 $a Li, Ming. $3 559294
245 1 0 $a Representation, classification and information fusion for robust and efficient multimodal human states recognition.
300 $a 166 p.
500 $a Source: Dissertation Abstracts International, Volume: 75-02(E), Section: B.
500 $a Adviser: Shrikanth Narayanan.
502 $a Thesis (Ph.D.)--University of Southern California, 2013.
506 $a This item must not be sold to any third party vendors.
520 $a The goal of this work is to enhance the robustness and efficiency of the multimodal human states recognition task. Human states recognition can be considered a joint term for identifying/verifying various kinds of human-related states, such as biometric identity, language spoken, age, gender, emotion, intoxication level, physical activity, vocal tract patterns, ECG QT intervals and so on. I performed research on the aforementioned states recognition problems, and my focus is to increase the performance while reducing the computational cost.
520 $a I start by extending the well-known total variability i-vector modeling (a factor analysis on the concatenated GMM mean supervectors) to simplified supervised i-vector modeling to enhance robustness and efficiency. First, by concatenating the label vector and the linear classifier matrix at the end of the mean supervector and the i-vector factor loading matrix, respectively, the traditional i-vectors are extended to label-regularized supervised i-vectors. These supervised i-vectors are optimized not only to reconstruct the mean supervectors well but also to minimize the mean square error between the original and the reconstructed label vectors, which makes them more discriminative with respect to the regularizing label information. Second, I perform the factor analysis (FA) on the pre-normalized GMM first-order statistics supervector to ensure that each Gaussian component's statistics sub-vector is treated equally in the FA, which reduces the computational cost by a factor of 25.
520 $a Inspired by the recent success of sparse representation in face recognition, I explored adopting sparse representation for both representation and classification in this multimodal human states recognition problem. First, for classification, a sparse representation computed by l1-minimization (to approximate the l0 minimization) with quadratic constraints was proposed to replace the SVM on the GMM mean supervectors; fusing the sparse representation based classification (SRC) method with SVM improved the overall system performance. Second, by adding a redundant identity matrix at the end of the original over-complete dictionary, the sparse representation is made more robust to variability and noise. Third, both the l1 norm ratio and the background normalized (BNorm) l2 residual ratio are used and shown to outperform the conventional l2 residual ratio in the verification task.
520 $a I also present an automatic speaker affective state recognition approach that models the factor vectors in the latent factor analysis framework, improving upon the Gaussian Mixture Model (GMM) baseline performance. I consider the affective speech signal as the original normal average speech signal corrupted by affective channel effects. Rather than reducing the channel variability to enhance robustness, as in the speaker verification task, I directly model the speaker state on the channel factors under the factor analysis framework. Experimental results show that the proposed speaker state factor vector modeling system achieved unweighted and weighted accuracy improvements over the GMM baseline on the intoxicated speech detection task and the emotion recognition task, respectively.
520 $a To summarize the methods for representation, I propose a general optimization framework. The aforementioned methods, such as traditional factor analysis, i-vector, supervised i-vector, simplified i-vector and s-vectors, are all special cases of this general optimization problem. In the future, I plan to investigate other kinds of distance measures, cost functions and constraints in this unified general optimization problem. (Abstract shortened by UMI.).
590 $a School code: 0208.
650 4 $a Engineering, Electronics and Electrical. $3 626636
650 4 $a Information Science. $3 1017528
650 4 $a Computer Science. $3 626642
690 $a 0544
690 $a 0723
690 $a 0984
710 2 $a University of Southern California. $b Electrical Engineering. $3 1020963
773 0 $t Dissertation Abstracts International $g 75-02B(E).
790 $a 0208
791 $a Ph.D.
792 $a 2013
793 $a English
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3598274
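
The sparse representation based classification (SRC) step described in the abstract, with an identity matrix appended to the dictionary, can be illustrated with a small NumPy sketch. This is not the dissertation's implementation. The abstract specifies l1-minimization with quadratic constraints; the sketch swaps in the simpler Lagrangian (lasso) form solved by ISTA, and it decides by the smallest class-wise l2 residual rather than by the l1 norm ratio or BNorm ratio. All function names, the toy data, and the parameter values (`lam`, `n_iter`, the dimensions) are assumptions made for illustration.

```python
import numpy as np

def soft_threshold(v, t):
    """Element-wise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def ista_l1(D, y, lam=0.05, n_iter=500):
    """Approximately minimize 0.5 * ||y - D a||_2^2 + lam * ||a||_1 with ISTA."""
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)  # 1 / Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - y)
        a = soft_threshold(a - step * grad, step * lam)
    return a

def src_classify(D, labels, y, lam=0.05):
    """Classify y by class-wise residuals of its sparse code over [D, I]."""
    n_dim, n_atoms = D.shape
    D_aug = np.hstack([D, np.eye(n_dim)])      # identity block absorbs noise
    a = ista_l1(D_aug, y, lam)
    coef, err = a[:n_atoms], a[n_atoms:]       # dictionary part, error part
    residuals = {}
    for c in np.unique(labels):
        mask = labels == c
        # Reconstruct using only the atoms of class c (plus the error term).
        residuals[int(c)] = float(np.linalg.norm(y - err - D[:, mask] @ coef[mask]))
    return min(residuals, key=residuals.get), residuals

def toy_problem(seed=0, n_dim=30, n_per_class=5, noise=0.05):
    """Hypothetical toy data: two classes clustered around random unit vectors."""
    rng = np.random.default_rng(seed)
    centers = rng.standard_normal((2, n_dim))
    centers /= np.linalg.norm(centers, axis=1, keepdims=True)
    atoms, labels = [], []
    for c in range(2):
        for _ in range(n_per_class):
            v = centers[c] + noise * rng.standard_normal(n_dim)
            atoms.append(v / np.linalg.norm(v))
            labels.append(c)
    y = centers[1] + noise * rng.standard_normal(n_dim)  # test sample from class 1
    return np.column_stack(atoms), np.array(labels), y / np.linalg.norm(y)

if __name__ == "__main__":
    D, labels, y = toy_problem()
    predicted, residuals = src_classify(D, labels, y)
    print(predicted, residuals)
```

The identity block lets part of the test vector be explained by a sparse error term instead of by atoms of the wrong class, which is the robustness idea the abstract attributes to the redundant identity matrix.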