語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Clustering time-course gene-expressi...
~
Rice University.
FindBook
Google Book
Amazon
博客來
Clustering time-course gene-expression array data.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Clustering time-course gene-expression array data./
作者:
Gershman, Jason Andrew.
面頁冊數:
207 p.
附註:
Adviser: Rudy Guerra.
Contained By:
Dissertation Abstracts International69-04B.
標題:
Biology, Bioinformatics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3309873
ISBN:
9780549579397
Clustering time-course gene-expression array data.
Gershman, Jason Andrew.
Clustering time-course gene-expression array data.
- 207 p.
Adviser: Rudy Guerra.
Thesis (Ph.D.)--Rice University, 2008.
This thesis examines methods used to cluster time-course gene expression array data. In the past decade, various model-based methods have been published and advocated for clustering this type of data in place of classic non-parametric techniques like K-means and hierarchical clustering. On simulated data, where the variance between clusters is large, I show that the model-based MCLUST outperforms model-based SSClust and non-model-based K-means clustering. I also show that the number of genes or the number of clusters has no significant effect on the performance of these model-based clustering techniques. On two real data sets, where the variance between clusters is smaller, I show that model-based SSClust outperforms both MCLUST and K-means clustering. Since the "truth" is often not known for real data sets, I use the clustered data as "truth" and then perturb the data by adding pointwise noise to cluster this noisy data. Throughout my analysis of real and simulated expression data, I use the misclassification rate and the overall success rate as measures of success of the clustering algorithm. Overall, the model-based methods appear to cluster the data better than the non-model-based methods.
ISBN: 9780549579397Subjects--Topical Terms:
1018415
Biology, Bioinformatics.
Clustering time-course gene-expression array data.
LDR
:03009nam 2200289 a 45
001
854325
005
20100702
008
100702s2008 ||||||||||||||||| ||eng d
020
$a
9780549579397
035
$a
(UMI)AAI3309873
035
$a
AAI3309873
040
$a
UMI
$c
UMI
100
1
$a
Gershman, Jason Andrew.
$3
1020668
245
1 0
$a
Clustering time-course gene-expression array data.
300
$a
207 p.
500
$a
Adviser: Rudy Guerra.
500
$a
Source: Dissertation Abstracts International, Volume: 69-04, Section: B, page: 2396.
502
$a
Thesis (Ph.D.)--Rice University, 2008.
520
$a
This thesis examines methods used to cluster time-course gene expression array data. In the past decade, various model-based methods have been published and advocated for clustering this type of data in place of classic non-parametric techniques like K-means and hierarchical clustering. On simulated data, where the variance between clusters is large, I show that the model-based MCLUST outperforms model-based SSClust and non-model-based K-means clustering. I also show that the number of genes or the number of clusters has no significant effect on the performance of these model-based clustering techniques. On two real data sets, where the variance between clusters is smaller, I show that model-based SSClust outperforms both MCLUST and K-means clustering. Since the "truth" is often not known for real data sets, I use the clustered data as "truth" and then perturb the data by adding pointwise noise to cluster this noisy data. Throughout my analysis of real and simulated expression data, I use the misclassification rate and the overall success rate as measures of success of the clustering algorithm. Overall, the model-based methods appear to cluster the data better than the non-model-based methods.
520
$a
Later, I examine the role of gene ontology (GO) and using gene ontology data to cluster gene expression data. I find that clustering expression data, using a synthesis of gene expression and gene ontology not only provides clustering that has a biologic meaning but also clusters the data well. I also introduce an algorithm for clustering expression profiles on both gene expression and gene ontology data when some of the genes are missing the ontology data. Instead of some other methods which ignore the missing data or lump it all into a miscellaneous cluster, I use classification and inferential techniques to cluster using all of the available data and this method shows promising results. I also examine which ontology, among molecular function, biological process, and cellular component, is best in clustering expression data. This analysis shows that biological process is the preferred ontology for clustering expression data.
590
$a
School code: 0187.
650
4
$a
Biology, Bioinformatics.
$3
1018415
650
4
$a
Statistics.
$3
517247
690
$a
0463
690
$a
0715
710
2
$a
Rice University.
$3
960124
773
0
$t
Dissertation Abstracts International
$g
69-04B.
790
$a
0187
790
1 0
$a
Guerra, Rudy,
$e
advisor
791
$a
Ph.D.
792
$a
2008
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3309873
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9070245
電子資源
11.線上閱覽_V
電子書
EB W9070245
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入