Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Data Mining Framework for Metagenome...
~
Rasheed, Zeehasham.
Linked to FindBook
Google Book
Amazon
博客來
Data Mining Framework for Metagenome Analysis.
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Data Mining Framework for Metagenome Analysis./
Author:
Rasheed, Zeehasham.
Description:
143 p.
Notes:
Source: Dissertation Abstracts International, Volume: 74-11(E), Section: B.
Contained By:
Dissertation Abstracts International74-11B(E).
Subject:
Computer Science. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3589152
ISBN:
9781303276316
Data Mining Framework for Metagenome Analysis.
Rasheed, Zeehasham.
Data Mining Framework for Metagenome Analysis.
- 143 p.
Source: Dissertation Abstracts International, Volume: 74-11(E), Section: B.
Thesis (Ph.D.)--George Mason University, 2013.
Advances in biotechnology have dramatically changed the manner of characterizing large populations of microbial communities that are ubiquitous across several environments. The process of "metagenomics" involves sequencing of the genetic material of organisms co-existing within ecosystems ranging from ocean, soil and human body. Researchers are trying to determine the collective microbial community or population of microbes that co-exist across different environmental and clinical samples. Several researchers and clinicians have embarked on studying the pathogenic role played by the microbiome (i.e., the collection of microbial organisms within the human body) with respect to human health and disease conditions.
ISBN: 9781303276316Subjects--Topical Terms:
626642
Computer Science.
Data Mining Framework for Metagenome Analysis.
LDR
:04667nam a2200337 4500
001
1964264
005
20141015113823.5
008
150210s2013 ||||||||||||||||| ||eng d
020
$a
9781303276316
035
$a
(MiAaPQ)AAI3589152
035
$a
AAI3589152
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Rasheed, Zeehasham.
$3
2100680
245
1 0
$a
Data Mining Framework for Metagenome Analysis.
300
$a
143 p.
500
$a
Source: Dissertation Abstracts International, Volume: 74-11(E), Section: B.
500
$a
Adviser: Huzefa Rangwala.
502
$a
Thesis (Ph.D.)--George Mason University, 2013.
520
$a
Advances in biotechnology have dramatically changed the manner of characterizing large populations of microbial communities that are ubiquitous across several environments. The process of "metagenomics" involves sequencing of the genetic material of organisms co-existing within ecosystems ranging from ocean, soil and human body. Researchers are trying to determine the collective microbial community or population of microbes that co-exist across different environmental and clinical samples. Several researchers and clinicians have embarked on studying the pathogenic role played by the microbiome (i.e., the collection of microbial organisms within the human body) with respect to human health and disease conditions.
520
$a
There is a critical need to develop new methods that can analyze metagenomes and correlate heterogeneous microbiome data to clinical metadata. Lack of such methods is an impediment for the identification of the function and presence of microbial organism within different samples, reducing our ability to elucidate the microbial-host interactions and discover novel therapeutics. From another perspective, comparing metagenomes across different ecological samples allows for the characterization of biodiversity across the planet. The goals of this dissertation are to present novel data mining algorithms that allow for the accurate and efficient analysis of metagenome data obtained from different environments.
520
$a
Specific contributions include the development of a suite of clustering algorithms for handling large-scale targeted and whole metagenome sequences. We developed a novel locality sensitive hashing (LSH) based method for clustering metagenome sequence reads. Our method achieves efficiency by approximating the pairwise sequence comparison operations using randomized hashing technique. We incorporate this clustering approach within a computational pipeline (LSH-Div) to estimate the species diversity within an ecological sample. We also developed an algorithm called MC-MinH that uses the min-wise hashing approach along with a greedy clustering algorithm to group 16S and whole metagenome sequences. We represent unequal length sequences using contiguous subsequences or k-mers, and then approximate the computation of pairwise similarity using independent min-wise hashing. Further, MC-MinH is extended as a distributed algorithm implemented within the Map-Reduce based Hadoop platform. The distributed clustering algorithm can perform a greedy iterative clustering as well as an agglomerative hierarchical clustering and can handle large volumes of input sequences.
520
$a
We also developed a novel sequence composition-based taxonomic classifier using extreme learning machines referred to as TAC-ELM. This algorithm uses the framework of extreme learning machines to quickly and accurately learn the weights for a neural network model. TAC-ELM when combined with BLAST (Basic Local Alignment Search Tool) has shown improved taxonomy classification results.
520
$a
In order to make these developed computational tools accessible to a broad group of researchers, we developed a web portal for scientific analysis. The portal implements a LIMS database using Drupal content management system to store and retrieve the multi-modal microbiome data. For analysis and development of workflows, we use the Galaxy platform which provides a web-based interface for integrating the computational tools to create user-customized pipelines and a batch-based job submission system.
520
$a
To summarize, this dissertation has contributions in the area of metagenome sequence clustering and classification which can be easily integrated within computational workflows for species diversity estimation and large-scale microbiome analysis.
590
$a
School code: 0883.
650
4
$a
Computer Science.
$3
626642
650
4
$a
Biology, Bioinformatics.
$3
1018415
690
$a
0984
690
$a
0715
710
2
$a
George Mason University.
$b
Computer Science.
$3
2097804
773
0
$t
Dissertation Abstracts International
$g
74-11B(E).
790
$a
0883
791
$a
Ph.D.
792
$a
2013
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3589152
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9259263
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login