語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
An information theoretic framework f...
~
Chanda, Pritam.
FindBook
Google Book
Amazon
博客來
An information theoretic framework for identification and modeling of gene-gene and gene-environment interactions.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
An information theoretic framework for identification and modeling of gene-gene and gene-environment interactions./
作者:
Chanda, Pritam.
面頁冊數:
247 p.
附註:
Source: Dissertation Abstracts International, Volume: 71-02, Section: B, page: 1101.
Contained By:
Dissertation Abstracts International71-02B.
標題:
Biology, Bioinformatics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3391030
ISBN:
9781109625134
An information theoretic framework for identification and modeling of gene-gene and gene-environment interactions.
Chanda, Pritam.
An information theoretic framework for identification and modeling of gene-gene and gene-environment interactions.
- 247 p.
Source: Dissertation Abstracts International, Volume: 71-02, Section: B, page: 1101.
Thesis (Ph.D.)--State University of New York at Buffalo, 2010.
In this dissertation, we develop, extend, validate and apply information theoretic metrics for identification and characterization of interactions among genetic variations in the epidemiological studies as studies have linked the complex epidemiological associations between genetic variations with the risk of developing many diseases. We investigate interactions between genes (referred to as gene-gene interactions or GGI) and between genes and non-genetic factors or environmental variables (referred to as gene-environment interactions or GEI) and systematically investigate the dependence of our metrics on genetic and study-design factors to identify the GGI/GEI and enable a visual presentation of the results. We also develop several simulation strategies to be used extensively for performance evaluation because the underlying structure and true relationships between genetic and environmental factors in experimental data sets are rarely known with certainty.
ISBN: 9781109625134Subjects--Topical Terms:
1018415
Biology, Bioinformatics.
An information theoretic framework for identification and modeling of gene-gene and gene-environment interactions.
LDR
:05877nam 2200349 4500
001
1399799
005
20110930095832.5
008
130515s2010 ||||||||||||||||| ||eng d
020
$a
9781109625134
035
$a
(UMI)AAI3391030
035
$a
AAI3391030
040
$a
UMI
$c
UMI
100
1
$a
Chanda, Pritam.
$3
1678810
245
1 3
$a
An information theoretic framework for identification and modeling of gene-gene and gene-environment interactions.
300
$a
247 p.
500
$a
Source: Dissertation Abstracts International, Volume: 71-02, Section: B, page: 1101.
500
$a
Adviser: Aidong Zhang.
502
$a
Thesis (Ph.D.)--State University of New York at Buffalo, 2010.
520
$a
In this dissertation, we develop, extend, validate and apply information theoretic metrics for identification and characterization of interactions among genetic variations in the epidemiological studies as studies have linked the complex epidemiological associations between genetic variations with the risk of developing many diseases. We investigate interactions between genes (referred to as gene-gene interactions or GGI) and between genes and non-genetic factors or environmental variables (referred to as gene-environment interactions or GEI) and systematically investigate the dependence of our metrics on genetic and study-design factors to identify the GGI/GEI and enable a visual presentation of the results. We also develop several simulation strategies to be used extensively for performance evaluation because the underlying structure and true relationships between genetic and environmental factors in experimental data sets are rarely known with certainty.
520
$a
Also, the high dimensionality of large data sets (e.g. from genome-wide studies) and presence of confounding factors like multiple correlations (or linkage disequilibrium among genes) and genetic heterogeneity results in combinatorial explosion of the number of possible interactions present in the data. This combinatorial growth makes it computationally difficult, if not impossible, to exhaustively assess the full range of predictor variables for potential interactions associated with the trait or phenotype variables and diseases in epidemiological studies. Therefore, we develop and evaluate a set of algorithms capable of efficiently searching the combinatorial space for mining significant and non-redundant interactions for both discrete and quantitative phenotypes and conduct detailed power, false-discovery rate and sample size analysis for epidemiological studies.
520
$a
In GEI analysis, the presence of high degree of linkage disequilibrium among the genetic variables results in several interactions to contain redundant information regarding the phenotype variable. Therefore it is essential to prune a set of GEI using a modeling step which we define as the process of identifying a parsimonious set of combinations or variables capable of explaining the disease phenotype/trait variable that will avoid over- and under-fitted models. We develop a novel algorithm that uses information theoretic metrics and their properties to efficiently perform the model synthesis task.
520
$a
Another principal challenge in GEI analyses is to develop metrics for prioritization of genetic variables for sequencing studies that incorporates knowledge from interactions between the genes. The gene-environment associations identified from large scale genotyping studies require large follow-on studies to comprehensively sequence the disease-associated regions to enable discovery of less common genetic variations that may be contributing to disease. Such comprehensive follow up studies are resource intensive and require large sample sizes so that it is essential to leverage the available information from existing genotyping studies to identify the most promising disease-associated regions and the possible environmental factors. Prioritizing genetic regions involved in GGI or GEI for sequencing studies can be difficult because the number of interactions, the order of interactions and their magnitudes can vary considerably making it difficult to make decisions regarding the relative importance of, e.g., a few large magnitude interactions vis--vis numerous interactions of moderate magnitude. In this research, we develop a novel metric for effectively visualizing and ranking the genetic and environmental variables involved in numerous statistical interactions.
520
$a
Finally, often in genetic data sets, the phenotype or trait variable is absent and it is useful to mine statistical interactions among the genetic variables in an unsupervised fashion that can highlight the underlying biological interactions among the genes and proteins present in pathways. To address such analyses, in this dissertation, we study the problem of mining statistically significant correlation patterns and interaction information in genetic data. We develop novel concepts of combinations of variables containing highly significant, moderately significant and non-significant correlation information and present some bounds on correlation information and develop several pruning strategies utilizing these bounds to efficiently prune the combinatorial search space. Using the bounds and pruning strategies, we develop efficient search algorithms to mine such associations in an efficient and effective manner and also critically examine the performance of our proposed mining algorithms. (Abstract shortened by UMI.)
590
$a
School code: 0656.
650
4
$a
Biology, Bioinformatics.
$3
1018415
650
4
$a
Computer Science.
$3
626642
690
$a
0715
690
$a
0984
710
2
$a
State University of New York at Buffalo.
$b
Computer Science and Engineering.
$3
1035503
773
0
$t
Dissertation Abstracts International
$g
71-02B.
790
1 0
$a
Zhang, Aidong,
$e
advisor
790
1 0
$a
Ramanathan, Murali
$e
committee member
790
1 0
$a
Govindaraju, Venu
$e
committee member
790
$a
0656
791
$a
Ph.D.
792
$a
2010
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3391030
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9162938
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入