東華大學圖書館 |

語系: 繁體中文

說明(常見問題)

回圖書館首頁

手機版館藏查詢

登入

回首頁

切換: 標籤 | MARC模式 | ISBD

Adaptive and multimodal approach to ...

Liu, Zhu.

FindBook

Google Book

Amazon

博客來

Adaptive and multimodal approach to multimedia content analysis.

紀錄類型:	書目-語言資料,印刷品 : Monograph/item
正題名/作者:	Adaptive and multimodal approach to multimedia content analysis./
作者:	Liu, Zhu.
面頁冊數:	133 p.
附註:	Adviser: Yao Wang.
Contained By:	Dissertation Abstracts International61-09B.
標題:	Engineering, Electronics and Electrical. -
電子資源:	http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=9988124
ISBN:	9780599950085

Adaptive and multimodal approach to multimedia content analysis.
Liu, Zhu.

Adaptive and multimodal approach to multimedia content analysis. - 133 p.

Adviser: Yao Wang.

Thesis (Ph.D.)--Polytechnic University, 2001.

The volume of multimedia data generated nowadays is exploding. To efficiently access and retrieve desired information, tools that enable automated analysis based on content are becoming indispensable. Multimedia content is defined at both perceptual and conceptual levels. The former refers to the content characterized purely by intrinsic perception properties such as color, motion, or acoustic features. The latter refers to the content that is specified based on concepts or semantics such as sunset, anchors, or news headline stories. At both levels, the content is embedded in multiple forms that are usually complimentary to each other. The main objective of this thesis is to adaptively analyze the multimedia content by integrating cues from multiple modalities, including audio, video, and text, mainly in the scope of news broadcast.

ISBN: 9780599950085Subjects--Topical Terms:

626636
Engineering, Electronics and Electrical.

Adaptive and multimodal approach to multimedia content analysis.
LDR:03257nam 2200289 a 45 001 965909
005 20110908
008 110908s2001 eng d
020 $a 9780599950085
035 $a (UnM)AAI9988124
035 $a AAI9988124
040 $a UnM $c UnM
100 1 $a Liu, Zhu. $3 1057868
245 1 0 $a Adaptive and multimodal approach to multimedia content analysis.
300 $a 133 p.
500 $a Adviser: Yao Wang.
500 $a Source: Dissertation Abstracts International, Volume: 61-09, Section: B, page: 4890.
502 $a Thesis (Ph.D.)--Polytechnic University, 2001.
520 $a The volume of multimedia data generated nowadays is exploding. To efficiently access and retrieve desired information, tools that enable automated analysis based on content are becoming indispensable. Multimedia content is defined at both perceptual and conceptual levels. The former refers to the content characterized purely by intrinsic perception properties such as color, motion, or acoustic features. The latter refers to the content that is specified based on concepts or semantics such as sunset, anchors, or news headline stories. At both levels, the content is embedded in multiple forms that are usually complimentary to each other. The main objective of this thesis is to adaptively analyze the multimedia content by integrating cues from multiple modalities, including audio, video, and text, mainly in the scope of news broadcast.
520 $a At the perceptual level, news broadcast data is segmented and classified into different video events such as news reporting and commercials. Audio and visual features are developed and integrated, aiming at discriminating different events effectively. Various classification mechanisms, including linear fuzzy threshold, maximum likelihood using Gaussian Mixture Model and Hidden Markov Model, Neural Network, as well as Support Vector Machine, are benchmarked.
520 $a At the conceptual level, algorithms and demonstration systems for three applications are developed. In News Broadcast Browsing System, recovering and presentation of the embedded hierarchy structure of news broadcast are addressed. Important semantic objects such as hosting characters and headline news stories are adaptively extracted using the audio/visual models that are bootstrapped from on-line data. The problem of efficient search and retrieval of segmented multimedia objects based on audio is discussed in Query-by-example in Audio System. A distance metric framework is proposed to determine the difference of mixture type Probability Density Functions, and is applied in measuring the dissimilarity of audio segments based on their model parameters. In Major Cast Detection System, we developed an algorithm to detect the major casts in video, for example, anchor persons in news broadcasts and major characters in movies. The algorithm integrates both speaker and face information and constructs a ranked list of major casts based on their temporal and spacial presence.
590 $a School code: 0179.
650 4 $a Engineering, Electronics and Electrical. $3 626636
690 $a 0544
710 2 0 $a Polytechnic University. $3 1249856
773 0 $t Dissertation Abstracts International $g 61-09B.
790 $a 0179
790 1 0 $a Wang, Yao, $e advisor
791 $a Ph.D.
792 $a 2001
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=9988124