東華大學圖書館 |

語系: 繁體中文

說明(常見問題)

回圖書館首頁

手機版館藏查詢

登入

回首頁

切換: 標籤 | MARC模式 | ISBD

Statistical Learning Methods for Per...

Liu, Ying.

FindBook

Google Book

Amazon

博客來

Statistical Learning Methods for Personalized Medical Decision Making.

紀錄類型:	書目-電子資源 : Monograph/item
正題名/作者:	Statistical Learning Methods for Personalized Medical Decision Making./
作者:	Liu, Ying.
出版者:	Ann Arbor : ProQuest Dissertations & Theses, : 2016,
面頁冊數:	148 p.
附註:	Source: Dissertation Abstracts International, Volume: 77-09(E), Section: B.
Contained By:	Dissertation Abstracts International77-09B(E).
標題:	Biostatistics. -
電子資源:	http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10100777
ISBN:	9781339646633

Statistical Learning Methods for Personalized Medical Decision Making.
Liu, Ying.

Statistical Learning Methods for Personalized Medical Decision Making. - Ann Arbor : ProQuest Dissertations & Theses, 2016 - 148 p.

Source: Dissertation Abstracts International, Volume: 77-09(E), Section: B.

Thesis (Ph.D.)--Columbia University, 2016.

This item is not available from ProQuest Dissertations & Theses.

The theme of my dissertation is on merging statistical modeling with medical domain knowledge and machine learning algorithms to assist in making personalized medical decisions. In its simplest form, making personalized medical decisions for treatment choices and disease diagnosis modality choices can be transformed into classification or prediction problems in machine learning, where the optimal decision for an individual is a decision rule that yields the best future clinical outcome or maximizes diagnosis accuracy. However, challenges emerge when analyzing complex medical data. On one hand, statistical modeling is needed to deal with inherent practical complications such as missing data, patients' loss to follow-up, ethical and resource constraints in randomized controlled clinical trials. On the other hand, new data types and larger scale of data call for innovations combining statistical modeling, domain knowledge and information technologies. This dissertation contains three parts addressing the estimation of optimal personalized rule for choosing treatment, the estimation of optimal individualized rule for choosing disease diagnosis modality, and methods for variable selection if there are missing data.

ISBN: 9781339646633Subjects--Topical Terms:

1002712
Biostatistics.

Statistical Learning Methods for Personalized Medical Decision Making.
LDR:06674nmm a2200349 4500 001 2154109
005 20180330130620.5
008 190424s2016 ||||||||||||||||| ||eng d
020 $a 9781339646633
035 $a (MiAaPQ)AAI10100777
035 $a (MiAaPQ)columbia:13262
035 $a AAI10100777
040 $a MiAaPQ $c MiAaPQ
100 1 $a Liu, Ying. $3 898465
245 1 0 $a Statistical Learning Methods for Personalized Medical Decision Making.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2016
300 $a 148 p.
500 $a Source: Dissertation Abstracts International, Volume: 77-09(E), Section: B.
500 $a Adviser: Yuanjia Wang.
502 $a Thesis (Ph.D.)--Columbia University, 2016.
506 $a This item is not available from ProQuest Dissertations & Theses.
520 $a The theme of my dissertation is on merging statistical modeling with medical domain knowledge and machine learning algorithms to assist in making personalized medical decisions. In its simplest form, making personalized medical decisions for treatment choices and disease diagnosis modality choices can be transformed into classification or prediction problems in machine learning, where the optimal decision for an individual is a decision rule that yields the best future clinical outcome or maximizes diagnosis accuracy. However, challenges emerge when analyzing complex medical data. On one hand, statistical modeling is needed to deal with inherent practical complications such as missing data, patients' loss to follow-up, ethical and resource constraints in randomized controlled clinical trials. On the other hand, new data types and larger scale of data call for innovations combining statistical modeling, domain knowledge and information technologies. This dissertation contains three parts addressing the estimation of optimal personalized rule for choosing treatment, the estimation of optimal individualized rule for choosing disease diagnosis modality, and methods for variable selection if there are missing data.
520 $a In the first part of this dissertation, we propose a method to find optimal Dynamic treatment regimens (DTRs) in Sequential Multiple Assignment Randomized Trial (SMART) data.Dynamic treatment regimens (DTRs) are sequential decision rules tailored at each stage of treatment by potentially time-varying patient features and intermediate outcomes observed in previous stages. The complexity, patient heterogeneity, and chronicity of many diseases and disorders call for learning optimal DTRs that best dynamically tailor treatment to each individual's response over time.
520 $a We propose a robust and efficient approach referred to as Augmented Multistage Outcome-Weighted Learning (AMOL) to identify optimal DTRs from sequential multiple assignment randomized trials. We improve outcome-weighted learning (Zhao et al.~2012) to allow for negative outcomes; we propose methods to reduce variability of weights to achieve numeric stability and higher efficiency; and finally, for multiple-stage trials, we introduce robust augmentation to improve efficiency by drawing information from Q-function regression models at each stage. The proposed AMOL remains valid even if the regression model is misspecified. We formally justify that proper choice of augmentation guarantees smaller stochastic errors in value function estimation for AMOL; we then establish the convergence rates for AMOL. The comparative advantage of AMOL over existing methods is demonstrated in extensive simulation studies and applications to two SMART data sets: a two-stage trial for attention deficit hyperactivity disorder and the STAR*D trial for major depressive disorder.
520 $a The second part of the dissertation introduced a machine learning algorithm to estimate personalized decision rules for medical diagnosis/screening to maximize a weighted combination of sensitivity and specificity. Using subject-specific risk factors and feature variables, such rules administer screening tests with balanced sensitivity and specificity, and thus protect low-risk subjects from unnecessary pain and stress caused by false positive tests, while achieving high sensitivity for subjects at high risk. We conducted simulation study mimicking a real breast cancer study, and we found significant improvements on sensitivity and specificity comparing our personalized screening strategy (assigning mammography+MRI to high-risk patients and mammography alone to low-risk subjects based on a composite score of their risk factors) to one-size-fits-all strategy (assigning mammography+MRI or mammography alone to all subjects). When applying to a Parkinson's disease(PD) FDG-PET and fMRI data, we showed that the method provided individualized modality selection that can improve AUC, and it can provide interpretable decision rules for choosing brain imaging modality for early detection of PD. To the best of our knowledge, this is the first time in the literature to propose automatic data-driven methods and learning algorithm for personalized diagnosis/screening strategy.
520 $a In the last part of the dissertation, we propose a method, Multiple Imputation Random Lasso (MIRL), to select important variables and to predict the outcome for an epidemiological study of Eating and Activity in Teens. % in the presence of missing data. In this study, 80% of individuals have at least one variable missing. Therefore, using variable selection methods developed for complete data after list-wise deletion substantially reduces prediction power. Recent work on prediction models in the presence of incomplete data cannot adequately account for large numbers of variables with arbitrary missing patterns. We propose MIRL to combine penalized regression techniques with multiple imputation and stability selection. Extensive simulation studies are conducted to compare MIRL with several alternatives. MIRL outperforms other methods in high-dimensional scenarios in terms of both reduced prediction error and improved variable selection performance, and it has greater advantage when the correlation among variables is high and missing proportion is high. MIRL is shown to have improved performance when comparing with other applicable methods when applied to the study of Eating and Activity in Teens for the boys and girls separately, and to a subgroup of low social economic status (SES) Asian boys who are at high risk of developing obesity.
590 $a School code: 0054.
650 4 $a Biostatistics. $3 1002712
690 $a 0308
710 2 $a Columbia University. $b Biostatistics. $3 2105522
773 0 $t Dissertation Abstracts International $g 77-09B(E).
790 $a 0054
791 $a Ph.D.
792 $a 2016
793 $a English
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10100777