語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Predicting the Effects of Protein Va...
~
Baugh, Evan H.
FindBook
Google Book
Amazon
博客來
Predicting the Effects of Protein Variants using Structural Modeling, Large-Scale Data Integration, and Machine Learning.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Predicting the Effects of Protein Variants using Structural Modeling, Large-Scale Data Integration, and Machine Learning./
作者:
Baugh, Evan H.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2017,
面頁冊數:
150 p.
附註:
Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
Contained By:
Dissertation Abstracts International78-08B(E).
標題:
Bioinformatics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10247644
ISBN:
9781369630343
Predicting the Effects of Protein Variants using Structural Modeling, Large-Scale Data Integration, and Machine Learning.
Baugh, Evan H.
Predicting the Effects of Protein Variants using Structural Modeling, Large-Scale Data Integration, and Machine Learning.
- Ann Arbor : ProQuest Dissertations & Theses, 2017 - 150 p.
Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
Thesis (Ph.D.)--New York University, 2017.
High-throughput sequencing technologies and new computational techniques for analyzing population genetics data are rapidly improving our understanding of disease susceptibility in humans and adaptation in a wide variety of organisms. These studies often discover nonsynonymous variation with large effects as even a single amino acid change can disrupt the folding, catalytic activity, and physical interactions of proteins. Current estimates predict that every human genome contains 10,000-11,000 nonsynonymous variations and, while we cannot currently characterize all this diversity experimentally, many variants that alter protein function can be identified computationally from destabilization of structural models or amino acid conservation. Methods for annotating variant effects in genome-wide association studies and exome sequencing studies use conservation and other sequence-based features to identify damaging variants but cannot predict the effect these variants have on protein function. Recent studies of de novo variants have demonstrated the power of these methods but also the need for additional information, such as physical models from the Protein Data Bank, to identify causal variants in disease association studies.
ISBN: 9781369630343Subjects--Topical Terms:
553671
Bioinformatics.
Predicting the Effects of Protein Variants using Structural Modeling, Large-Scale Data Integration, and Machine Learning.
LDR
:05180nmm a2200337 4500
001
2124047
005
20171023101706.5
008
180830s2017 ||||||||||||||||| ||eng d
020
$a
9781369630343
035
$a
(MiAaPQ)AAI10247644
035
$a
AAI10247644
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Baugh, Evan H.
$3
3286016
245
1 0
$a
Predicting the Effects of Protein Variants using Structural Modeling, Large-Scale Data Integration, and Machine Learning.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2017
300
$a
150 p.
500
$a
Source: Dissertation Abstracts International, Volume: 78-08(E), Section: B.
500
$a
Adviser: Richard A. Bonneau.
502
$a
Thesis (Ph.D.)--New York University, 2017.
520
$a
High-throughput sequencing technologies and new computational techniques for analyzing population genetics data are rapidly improving our understanding of disease susceptibility in humans and adaptation in a wide variety of organisms. These studies often discover nonsynonymous variation with large effects as even a single amino acid change can disrupt the folding, catalytic activity, and physical interactions of proteins. Current estimates predict that every human genome contains 10,000-11,000 nonsynonymous variations and, while we cannot currently characterize all this diversity experimentally, many variants that alter protein function can be identified computationally from destabilization of structural models or amino acid conservation. Methods for annotating variant effects in genome-wide association studies and exome sequencing studies use conservation and other sequence-based features to identify damaging variants but cannot predict the effect these variants have on protein function. Recent studies of de novo variants have demonstrated the power of these methods but also the need for additional information, such as physical models from the Protein Data Bank, to identify causal variants in disease association studies.
520
$a
I present VIPUR, a computational framework that integrates sequence analysis and structural modeling using the Rosetta protein modeling suite to identify and interpret deleterious protein variants. To train VIPUR, I collected 9,477 protein variants with known effects on protein function from multiple organisms and curated structural models for each variant from crystal structures and homology models. VIPUR can be applied to variants in any organism's proteome with improved generalized accuracy (AUROC .83) and interpretability (AUPR .87) compared to other methods. I show that VIPUR's predictions of deleteriousness match the biological phenotypes for pathogenicity in ClinVar despite being trained on a different label. I use VIPUR to interpret mutations associated with inflammation and diabetes, demonstrating the structural diversity of disrupted functional sites and improved interpretation functional effects.
520
$a
Generalizable tools for interpreting genetic variants are especially needed with individualized exome sequencing, where clear indications of confident predictions are necessary to identify causal variation. I demonstrate VIPUR's ability to select candidate variants associated with human diseases by predicting the effects of de novo variants associated with Autism Spectrum Disorders (ASD) in the Simons Simplex Collection. Compared to existing methods, VIPUR deleterious predictions have the greatest enrichment for mutations found in children with ASD. VIPUR's predictions of deleterious effects are easily combined with other protein functional data to produce a small set of candidate genes and variants with specific mechanistic predictions.
520
$a
Although designed to aid in the discovery of causal variants, VIPUR can also simulate mutations to better understand specific protein functions. The distribution of VIPUR scores across all positions in a protein can be used to highlight conserved residues and provides an overall measure of protein conservation. When applied to levoglucosan kinase, a bacterial enzyme of interest for biofuel processing, VIPUR neutral predictions have a five fold enrichment for beneficial growth mutations. While VIPUR is not designed to detect gain-of-function mutations, this enrichment suggests VIPUR scores can identify potentially beneficial mutations by removing clearly deleterious ones. When applied to TP53, a human protein that is mutated in nearly half of all cancers, VIPUR score trends highlight the most common mutations in the COSMIC database, suggesting other variants that may have similar effects on tumor growth. VIPUR and the large-scale data analysis empowering it will aid in the interpretation of protein variation by providing a detailed feature space to characterize protein functional effects and confident predictions of deleterious variation in Genome-Wide Association Studies, exome sequencing initiatives, and protein engineering.
590
$a
School code: 0146.
650
4
$a
Bioinformatics.
$3
553671
650
4
$a
Biophysics.
$3
518360
650
4
$a
Genetics.
$3
530508
690
$a
0715
690
$a
0786
690
$a
0369
710
2
$a
New York University.
$b
Biology.
$3
1029435
773
0
$t
Dissertation Abstracts International
$g
78-08B(E).
790
$a
0146
791
$a
Ph.D.
792
$a
2017
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10247644
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9334659
電子資源
01.外借(書)_YB
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入