Co-clustering algorithms: Extensions and applications.
Record type: Bibliographic record, electronic resource: Monograph/item
Title/Author: Co-clustering algorithms: Extensions and applications.
Author: Cho, Hyuk.
Extent: 272 p.
Notes: Adviser: Inderjit S. Dhillon.
Contained By: Dissertation Abstracts International 69-08B.
Subject: Biology, Bioinformatics.
Electronic resource: http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3320678
ISBN: 9780549739920
Cho, Hyuk.
Co-clustering algorithms: Extensions and applications. - 272 p.
Adviser: Inderjit S. Dhillon.
Thesis (Ph.D.)--The University of Texas at Austin, 2008.
In summary, we present co-clustering algorithms to discover latent local patterns, propose their algorithmic extensions to incorporate specific requirements, and provide their applications to a wide range of practical domains.
ISBN: 9780549739920
Subjects--Topical Terms: Biology, Bioinformatics. (1018415)
LDR 04997nmm 2200325 a 45
001 891420
005 20101111
008 101111s2008 ||||||||||||||||| ||eng d
020 $a 9780549739920
035 $a (UMI)AAI3320678
035 $a AAI3320678
040 $a UMI $c UMI
100 1 $a Cho, Hyuk. $3 1065417
245 1 0 $a Co-clustering algorithms: Extensions and applications.
300 $a 272 p.
500 $a Adviser: Inderjit S. Dhillon.
500 $a Source: Dissertation Abstracts International, Volume: 69-08, Section: B, page: 4836.
502 $a Thesis (Ph.D.)--The University of Texas at Austin, 2008.
520 $a In summary, we present co-clustering algorithms to discover latent local patterns, propose their algorithmic extensions to incorporate specific requirements, and provide their applications to a wide range of practical domains.
520 $a Co-clustering is a relatively recent paradigm for unsupervised data analysis, but it has become increasingly popular because of its potential to discover latent local patterns that usual unsupervised algorithms such as k-means do not reveal. Wide deployment of co-clustering, however, requires addressing a number of practical challenges, such as data transformation, cluster initialization, and scalability. This thesis therefore focuses on developing sophisticated co-clustering methodologies to maturity, and its ultimate goal is to promote co-clustering as an invaluable and indispensable unsupervised analysis tool for varied practical applications. To achieve this goal, we explore three specific tasks: (1) development of co-clustering algorithms that are functional, adaptable, and scalable (co-clustering algorithms); (2) extension of co-clustering algorithms to incorporate application-specific requirements (extensions); and (3) broad application of co-clustering algorithms to existing and emerging problems in practical application domains (applications).
520 $a As for co-clustering algorithms, we develop two fast Minimum Sum-Squared Residue Co-clustering (MSSRCC) algorithms [CDGS04], which simultaneously cluster data points and features via an alternating minimization scheme and generate co-clusters in a "checkerboard" structure. The first captures co-clusters with constant values, while the second discovers co-clusters with coherent "trends" as well as constant values. We note that the proposed algorithms are two special cases (bases 2 and 6 with Euclidean distance, respectively) of the general co-clustering framework, Bregman Co-clustering (BCC) [BDG+07], which contains six Euclidean BCC and six I-divergence BCC algorithms. We then substantially enhance the performance of the two MSSRCC algorithms by escaping from poor local minima and by resolving the degeneracy problem of generating empty clusters in partitional clustering algorithms, using three specific strategies: (1) data transformation; (2) deterministic spectral initialization; and (3) a local search strategy.
520 $a Concerning co-clustering extensions, we investigate general algorithmic strategies for the BCC framework, since it is applicable to a large class of distance measures and data types. We first formalize various data transformations for datasets with varied scaling and shifting factors, mathematically justify their effects on the six Euclidean BCC algorithms, and empirically validate the analysis results. We also adapt the local search strategy, initially developed for the two MSSRCC algorithms, to all twelve BCC algorithms. Moreover, we consider variations of cluster assignments and cluster updates, including greedy vs. non-greedy cluster assignment and online vs. batch cluster update. Furthermore, to provide better scalability and usability, we parallelize all twelve BCC algorithms so that they can co-cluster large-scale datasets over multiple processors.
520 $a Regarding co-clustering applications, we extend the functionality of BCC to incorporate application-specific requirements: (1) discovery of inverted patterns, whose goal is to find anti-correlation; (2) discovery of coherent co-clusters from noisy data, whose purpose is to perform dimensionality reduction and feature selection; and (3) discovery of patterns from time-series data, whose motive is to guarantee critical time-locality. Furthermore, we apply co-clustering to pervasive computing for mobile devices, where the task is to extract latent patterns from usage logs as well as to recognize specific situations of mobile-device users. Finally, we demonstrate the applicability of our proposed algorithms to the aforementioned applications through empirical results on various synthetic and real-world datasets.
590 $a School code: 0227.
650 4 $a Biology, Bioinformatics. $3 1018415
650 4 $a Computer Science. $3 626642
690 $a 0715
690 $a 0984
710 2 $a The University of Texas at Austin. $b Computer Sciences. $3 1037904
773 0 $t Dissertation Abstracts International $g 69-08B.
790 $a 0227
790 1 0 $a Dhillon, Inderjit S., $e advisor
791 $a Ph.D.
792 $a 2008
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3320678
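
The abstract in the record above describes MSSRCC as clustering rows and columns simultaneously through alternating minimization of a sum-squared residue, producing a "checkerboard" of co-clusters. As a rough illustration only, and not the thesis's implementation, the constant-value case might be sketched as follows. The sketch assumes that the residue of an entry is its difference from the mean of the co-cluster containing it. The function names, the random initialization (the abstract instead mentions deterministic spectral initialization), the fixed iteration count, and the handling of empty blocks are all assumptions made for this example.

```python
import numpy as np

def block_means(A, rows, cols, k, l):
    """Mean of every (row-cluster, column-cluster) block; empty blocks stay 0."""
    M = np.zeros((k, l))
    for r in range(k):
        for c in range(l):
            block = A[np.ix_(rows == r, cols == c)]
            if block.size:
                M[r, c] = block.mean()
    return M

def objective(A, rows, cols, M):
    """Sum of squared residues a_ij - (mean of the co-cluster holding a_ij)."""
    return float(((A - M[rows][:, cols]) ** 2).sum())

def cocluster(A, k, l, n_iter=20, seed=0):
    """Alternate row and column reassignment (hypothetical helper, sketch only)."""
    rng = np.random.default_rng(seed)
    rows = rng.integers(k, size=A.shape[0])   # random start: an assumption
    cols = rng.integers(l, size=A.shape[1])
    history = []
    for _ in range(n_iter):
        M = block_means(A, rows, cols, k, l)
        history.append(objective(A, rows, cols, M))
        # Row step: each row joins the row cluster whose block means fit it best.
        proto = M[:, cols]                                        # shape (k, n)
        d = ((A[:, None, :] - proto[None, :, :]) ** 2).sum(axis=2)
        rows = d.argmin(axis=1)
        # Column step: same idea, using block means refreshed after the row step.
        M = block_means(A, rows, cols, k, l)
        proto = M[rows, :].T                                      # shape (l, m)
        d = ((A.T[:, None, :] - proto[None, :, :]) ** 2).sum(axis=2)
        cols = d.argmin(axis=1)
    M = block_means(A, rows, cols, k, l)
    history.append(objective(A, rows, cols, M))
    return rows, cols, history
```

Calling `cocluster(A, k=2, l=2)` on a small matrix returns row labels, column labels, and the objective value recorded at each iteration. Each step can only keep or lower the objective, but a random start may still settle in a poor local minimum or leave a cluster empty, which are the issues the abstract says the thesis addresses with data transformation, spectral initialization, and local search.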
Holdings:
Barcode: W9083548
Location: Electronic resource
Circulation category: 11. Online viewing_V
Material type: E-book
Call number: EB W9083548
Use type: Normal
Loan status: On shelf
Hold status: 0