語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Using the C -index to measure predic...
~
Huang, Yunda.
FindBook
Google Book
Amazon
博客來
Using the C -index to measure prediction accuracy and variable importance of random forests, with application to tissue microarray data.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Using the C -index to measure prediction accuracy and variable importance of random forests, with application to tissue microarray data./
作者:
Huang, Yunda.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2004,
面頁冊數:
98 p.
附註:
Source: Dissertations Abstracts International, Volume: 65-11, Section: B.
Contained By:
Dissertations Abstracts International65-11B.
標題:
Biostatistics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3117700
Using the C -index to measure prediction accuracy and variable importance of random forests, with application to tissue microarray data.
Huang, Yunda.
Using the C -index to measure prediction accuracy and variable importance of random forests, with application to tissue microarray data.
- Ann Arbor : ProQuest Dissertations & Theses, 2004 - 98 p.
Source: Dissertations Abstracts International, Volume: 65-11, Section: B.
Thesis (Ph.D.)--University of California, Los Angeles, 2004.
This item must not be sold to any third party vendors.
Tissue microarray (TMA) is a state-of-art technique for high throughput molecular analysis of large number of tumor samples in a single staining reaction. TMAs allow one to evaluate highly specialized tumor marker expression patterns which may lead to improved diagnostic, prognostic and therapeutic applications in the clinic. Typically, TMA data have relatively few observations yet many highly skewed and correlated covariates with weak marginal effects. Random forests (RF) predictors [Bre01] are known to produce improved accuracy with such data. In this thesis, we propose to use the C-index as an alternative prediction accuracy measure to the error rate for RF predictors. Unlike the error rate, the C-index compares the overall distribution of the posterior predictions and sidesteps the need to specify the cost function and the classification threshold. We prove that the C-index is far superior to the error rate in determining the prediction accuracy and variable importance of RF predictors in certain situations. We also introduce a C-margin to measure the prediction strength of individual observations. Based on these C-margins, we propose new measures of variable importance. We show that the C-margin based importance measures are superior to the current E-margin based importance measures and the Gini index especially when the class prevalence is unbalanced. We apply our proposed methods to benchmark data from the UCI repository and to our simulated data. We extend the use of the C-index and C-margins to other important data areas such as those with continuous outcomes and censored outcomes, We find that the C-margin based variable importance measures often outperform existing measures. Furthermore, we extend the local full likelihood method proposed by LeBlance and Crowley [LC92] for the construction of residual-based survival random forest predictors, Employing the C-index and the C-margins, we find that our residual-based survival random forests predictor outperforms Breiman's survival random forest predictor (2001) especially in finding important covariates.Subjects--Topical Terms:
1002712
Biostatistics.
Using the C -index to measure prediction accuracy and variable importance of random forests, with application to tissue microarray data.
LDR
:03223nmm a2200301 4500
001
2206342
005
20190829083250.5
008
201008s2004 ||||||||||||||||| ||eng d
035
$a
(MiAaPQ)AAI3117700
035
$a
AAI3117700
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Huang, Yunda.
$3
3433239
245
1 0
$a
Using the C -index to measure prediction accuracy and variable importance of random forests, with application to tissue microarray data.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2004
300
$a
98 p.
500
$a
Source: Dissertations Abstracts International, Volume: 65-11, Section: B.
500
$a
Publisher info.: Dissertation/Thesis.
500
$a
Horvath, Steve.
502
$a
Thesis (Ph.D.)--University of California, Los Angeles, 2004.
506
$a
This item must not be sold to any third party vendors.
506
$a
This item must not be added to any third party search indexes.
520
$a
Tissue microarray (TMA) is a state-of-art technique for high throughput molecular analysis of large number of tumor samples in a single staining reaction. TMAs allow one to evaluate highly specialized tumor marker expression patterns which may lead to improved diagnostic, prognostic and therapeutic applications in the clinic. Typically, TMA data have relatively few observations yet many highly skewed and correlated covariates with weak marginal effects. Random forests (RF) predictors [Bre01] are known to produce improved accuracy with such data. In this thesis, we propose to use the C-index as an alternative prediction accuracy measure to the error rate for RF predictors. Unlike the error rate, the C-index compares the overall distribution of the posterior predictions and sidesteps the need to specify the cost function and the classification threshold. We prove that the C-index is far superior to the error rate in determining the prediction accuracy and variable importance of RF predictors in certain situations. We also introduce a C-margin to measure the prediction strength of individual observations. Based on these C-margins, we propose new measures of variable importance. We show that the C-margin based importance measures are superior to the current E-margin based importance measures and the Gini index especially when the class prevalence is unbalanced. We apply our proposed methods to benchmark data from the UCI repository and to our simulated data. We extend the use of the C-index and C-margins to other important data areas such as those with continuous outcomes and censored outcomes, We find that the C-margin based variable importance measures often outperform existing measures. Furthermore, we extend the local full likelihood method proposed by LeBlance and Crowley [LC92] for the construction of residual-based survival random forest predictors, Employing the C-index and the C-margins, we find that our residual-based survival random forests predictor outperforms Breiman's survival random forest predictor (2001) especially in finding important covariates.
590
$a
School code: 0031.
650
4
$a
Biostatistics.
$3
1002712
690
$a
0308
710
2
$a
University of California, Los Angeles.
$3
626622
773
0
$t
Dissertations Abstracts International
$g
65-11B.
790
$a
0031
791
$a
Ph.D.
792
$a
2004
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3117700
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9382891
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入