語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Statistical models of sequencing err...
~
Li, Ming.
FindBook
Google Book
Amazon
博客來
Statistical models of sequencing error and algorithms of polymorphism detection.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Statistical models of sequencing error and algorithms of polymorphism detection./
作者:
Li, Ming.
面頁冊數:
145 p.
附註:
Adviser: Lei Li.
Contained By:
Dissertation Abstracts International67-10B.
標題:
Biology, Bioinformatics. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3237138
ISBN:
9780542911866
Statistical models of sequencing error and algorithms of polymorphism detection.
Li, Ming.
Statistical models of sequencing error and algorithms of polymorphism detection.
- 145 p.
Adviser: Lei Li.
Thesis (Ph.D.)--University of Southern California, 2006.
Estimation of the sequencing error patterns is the key to accuracy improvement in DNA sequencing. In chapter 2, with probabilistic interpretation of the quality value, we propose a conditional sequencing error model. In chapter 3, we model the sequencing errors by a mixture of multinomials and a mixture of logistic regressions. In these models, the underlying DNA target is unknown and treated as missing data. The redundancy in the assembly allows us to impute the missing data by an EM algorithm. In the mixture of logistic regressions, we use piecewise linear functions of the quality value to deal with nonlinear effects. We evaluate the models with different knots by AIC and single base discrepancy and use a backward elimination algorithm for model selection. We apply these methods to a whole genome assembly, C. jejuni , and improve the accuracy of consensus sequence by a great deal. Based on the predicted sequencing error patterns, we correct the bias in the quality value and assign to base-calls new quality values with probabilistic interpretation. We also make an effort to expand this model by including more covariates, such as GC content. These statistical models provide a framework for the analysis of sequence assembly and can directly be applied to the daily practice of DNA sequencing.
ISBN: 9780542911866Subjects--Topical Terms:
1018415
Biology, Bioinformatics.
Statistical models of sequencing error and algorithms of polymorphism detection.
LDR
:03072nam 2200301 a 45
001
965491
005
20110906
008
110906s2006 eng d
020
$a
9780542911866
035
$a
(UMI)AAI3237138
035
$a
AAI3237138
040
$a
UMI
$c
UMI
100
1
$a
Li, Ming.
$3
559294
245
1 0
$a
Statistical models of sequencing error and algorithms of polymorphism detection.
300
$a
145 p.
500
$a
Adviser: Lei Li.
500
$a
Source: Dissertation Abstracts International, Volume: 67-10, Section: B, page: 5792.
502
$a
Thesis (Ph.D.)--University of Southern California, 2006.
520
$a
Estimation of the sequencing error patterns is the key to accuracy improvement in DNA sequencing. In chapter 2, with probabilistic interpretation of the quality value, we propose a conditional sequencing error model. In chapter 3, we model the sequencing errors by a mixture of multinomials and a mixture of logistic regressions. In these models, the underlying DNA target is unknown and treated as missing data. The redundancy in the assembly allows us to impute the missing data by an EM algorithm. In the mixture of logistic regressions, we use piecewise linear functions of the quality value to deal with nonlinear effects. We evaluate the models with different knots by AIC and single base discrepancy and use a backward elimination algorithm for model selection. We apply these methods to a whole genome assembly, C. jejuni , and improve the accuracy of consensus sequence by a great deal. Based on the predicted sequencing error patterns, we correct the bias in the quality value and assign to base-calls new quality values with probabilistic interpretation. We also make an effort to expand this model by including more covariates, such as GC content. These statistical models provide a framework for the analysis of sequence assembly and can directly be applied to the daily practice of DNA sequencing.
520
$a
Polymorphism detection in the resequencing data is especially important for linkage analysis and association study. We propose a novel method for trace preprocessing and spike alignment to identify polymorphism accurately. In chapter 4, we introduce a procedure to preprocess DNA sequencing traces and recover signal spike trains in five steps: color correction, normalization, baseline subtraction, width estimation, and deconvolution. In chapter 5, we describe the dynamic programming algorithm for spike alignment, and demonstrate the polymorphism detection from the alignment. Based on this method, we develop a software with graphical user interface for resequencing data and made it available to the public. Our software offers a new perspective for polymorphism detection, especially insertion-deletion polymorphism in mononucleotide runs.
590
$a
School code: 0208.
650
4
$a
Biology, Bioinformatics.
$3
1018415
650
4
$a
Biology, Molecular.
$3
1017719
650
4
$a
Mathematics.
$3
515831
690
$a
0307
690
$a
0405
690
$a
0715
710
2 0
$a
University of Southern California.
$3
700129
773
0
$t
Dissertation Abstracts International
$g
67-10B.
790
$a
0208
790
1 0
$a
Li, Lei,
$e
advisor
791
$a
Ph.D.
792
$a
2006
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3237138
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9125092
電子資源
11.線上閱覽_V
電子書
EB W9125092
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入