語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Improving GWAS Phenotypes Through Ba...
~
Shafquat, Afrah .
FindBook
Google Book
Amazon
博客來
Improving GWAS Phenotypes Through Bayesian and Machine Learning Approaches.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Improving GWAS Phenotypes Through Bayesian and Machine Learning Approaches./
作者:
Shafquat, Afrah .
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2020,
面頁冊數:
168 p.
附註:
Source: Dissertations Abstracts International, Volume: 81-12, Section: B.
Contained By:
Dissertations Abstracts International81-12B.
標題:
Bioinformatics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=27738692
ISBN:
9798645486631
Improving GWAS Phenotypes Through Bayesian and Machine Learning Approaches.
Shafquat, Afrah .
Improving GWAS Phenotypes Through Bayesian and Machine Learning Approaches.
- Ann Arbor : ProQuest Dissertations & Theses, 2020 - 168 p.
Source: Dissertations Abstracts International, Volume: 81-12, Section: B.
Thesis (Ph.D.)--Cornell University, 2020.
This item must not be sold to any third party vendors.
Large-scale genome-wide association studies (GWAS) have enabled detection of numerous candidate genetic loci that may impact human diseases, expanded our knowledge about the underlying biological mechanisms and pathways for complex disease phenotypes, and provided tangible biological targets for treatment and drug development. Moreover, GWAS summary statistics have improved disease risk assessment for complex disease phenotypes paving the way for early disease detection, intervention, and mitigation strategies with the added potential of risk-based stratification for differential treatments. However, existing statistical models and inference suffer from limitations of disease phenotypes, including (i) ambiguity in disease definition (where genetically distinct disease phenotypes are defined as the same disease/disease complex) and (ii) disease misdiagnosis/misclassification (where errors in phenotype misclassify a disease case as a control and vice versa). Current methods addressing these issues show reasonable performance, however, these methods may be improved by exploring alternative methodologies and incorporating constraints for a typical GWAS dataset. In this dissertation, I propose a Bayesian hierarchical latent variable model, PheLEx (Phenotype Latent variable Extraction of disease misdiagnosis) and a bootstrapping approach PheBEs (Phenotype Bootstrapping Estimation method) for the extraction of misclassification errors in GWAS phenotypes. Performance of both methods is evaluated using simulated GWAS datasets and real GWAS phenotypes from the UK Biobank dataset. Improved performance of proposed methods over existing methods in identifying misclassified individuals in simulated GWAS phenotypes indicates potential for improved GWAS statistical power and candidate loci discovery through use of these methods. Finally, I propose an alternate disease risk assessment method for computation of disease risk scores and phenotype prediction. The proposed alternate disease risk assessment methodology showed comparable (and in some cases, improved) performance for prediction of risk assessment for UK Biobank phenotypes and subtypes indicating potential for improved disease risk assessment.
ISBN: 9798645486631Subjects--Topical Terms:
553671
Bioinformatics.
Subjects--Index Terms:
Bayesian
Improving GWAS Phenotypes Through Bayesian and Machine Learning Approaches.
LDR
:03439nmm a2200397 4500
001
2273049
005
20201105110330.5
008
220629s2020 ||||||||||||||||| ||eng d
020
$a
9798645486631
035
$a
(MiAaPQ)AAI27738692
035
$a
AAI27738692
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Shafquat, Afrah .
$3
3550479
245
1 0
$a
Improving GWAS Phenotypes Through Bayesian and Machine Learning Approaches.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2020
300
$a
168 p.
500
$a
Source: Dissertations Abstracts International, Volume: 81-12, Section: B.
500
$a
Includes supplementary digital materials.
500
$a
Advisor: Mezey, Jason.
502
$a
Thesis (Ph.D.)--Cornell University, 2020.
506
$a
This item must not be sold to any third party vendors.
520
$a
Large-scale genome-wide association studies (GWAS) have enabled detection of numerous candidate genetic loci that may impact human diseases, expanded our knowledge about the underlying biological mechanisms and pathways for complex disease phenotypes, and provided tangible biological targets for treatment and drug development. Moreover, GWAS summary statistics have improved disease risk assessment for complex disease phenotypes paving the way for early disease detection, intervention, and mitigation strategies with the added potential of risk-based stratification for differential treatments. However, existing statistical models and inference suffer from limitations of disease phenotypes, including (i) ambiguity in disease definition (where genetically distinct disease phenotypes are defined as the same disease/disease complex) and (ii) disease misdiagnosis/misclassification (where errors in phenotype misclassify a disease case as a control and vice versa). Current methods addressing these issues show reasonable performance, however, these methods may be improved by exploring alternative methodologies and incorporating constraints for a typical GWAS dataset. In this dissertation, I propose a Bayesian hierarchical latent variable model, PheLEx (Phenotype Latent variable Extraction of disease misdiagnosis) and a bootstrapping approach PheBEs (Phenotype Bootstrapping Estimation method) for the extraction of misclassification errors in GWAS phenotypes. Performance of both methods is evaluated using simulated GWAS datasets and real GWAS phenotypes from the UK Biobank dataset. Improved performance of proposed methods over existing methods in identifying misclassified individuals in simulated GWAS phenotypes indicates potential for improved GWAS statistical power and candidate loci discovery through use of these methods. Finally, I propose an alternate disease risk assessment method for computation of disease risk scores and phenotype prediction. The proposed alternate disease risk assessment methodology showed comparable (and in some cases, improved) performance for prediction of risk assessment for UK Biobank phenotypes and subtypes indicating potential for improved disease risk assessment.
590
$a
School code: 0058.
650
4
$a
Bioinformatics.
$3
553671
650
4
$a
Genetics.
$3
530508
650
4
$a
Biostatistics.
$3
1002712
653
$a
Bayesian
653
$a
Disease risk estimation
653
$a
GWAS
653
$a
MCMC
653
$a
Misclassification
653
$a
Phenotypes
690
$a
0715
690
$a
0369
690
$a
0308
710
2
$a
Cornell University.
$b
Computational Biology.
$3
3286038
773
0
$t
Dissertations Abstracts International
$g
81-12B.
790
$a
0058
791
$a
Ph.D.
792
$a
2020
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=27738692
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9425283
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入