Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Scalable data clustering using GPUs.
~
Pangborn, Andrew D.
Linked to FindBook
Google Book
Amazon
博客來
Scalable data clustering using GPUs.
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Scalable data clustering using GPUs./
Author:
Pangborn, Andrew D.
Description:
127 p.
Notes:
Source: Masters Abstracts International, Volume: 49-01, page: 0561.
Contained By:
Masters Abstracts International49-01.
Subject:
Engineering, Computer. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=1480233
ISBN:
9781124177205
Scalable data clustering using GPUs.
Pangborn, Andrew D.
Scalable data clustering using GPUs.
- 127 p.
Source: Masters Abstracts International, Volume: 49-01, page: 0561.
Thesis (M.S.)--Rochester Institute of Technology, 2010.
Flow cytometry is a mainstay technology used by biologists and immunologists for counting, sorting, and analyzing cells suspended in a fluid. Like many modern scientific applications, flow cytometry produces massive amounts of data, which must be clustered in order to be useful. Conventional analysis of flow cytometry data uses manual sequential bivariate gating. However, this technique is limited in the quantity, quality, and speed of analyses produced. Unsupervised multivariate clustering techniques have shown promise for producing sound statistical analyses of flow cytometry data in previous research.
ISBN: 9781124177205Subjects--Topical Terms:
1669061
Engineering, Computer.
Scalable data clustering using GPUs.
LDR
:03310nam 2200361 4500
001
1393974
005
20110415112032.5
008
130515s2010 ||||||||||||||||| ||eng d
020
$a
9781124177205
035
$a
(UMI)AAI1480233
035
$a
AAI1480233
040
$a
UMI
$c
UMI
100
1
$a
Pangborn, Andrew D.
$3
1672559
245
1 0
$a
Scalable data clustering using GPUs.
300
$a
127 p.
500
$a
Source: Masters Abstracts International, Volume: 49-01, page: 0561.
500
$a
Adviser: Muhammad E. Shaaban.
502
$a
Thesis (M.S.)--Rochester Institute of Technology, 2010.
520
$a
Flow cytometry is a mainstay technology used by biologists and immunologists for counting, sorting, and analyzing cells suspended in a fluid. Like many modern scientific applications, flow cytometry produces massive amounts of data, which must be clustered in order to be useful. Conventional analysis of flow cytometry data uses manual sequential bivariate gating. However, this technique is limited in the quantity, quality, and speed of analyses produced. Unsupervised multivariate clustering techniques have shown promise for producing sound statistical analyses of flow cytometry data in previous research.
520
$a
The computational demands of multivariate clustering grow rapidly, and therefore processing large data sets, like those found in flow cytometry data, is very time consuming on a single CPU. Fortunately these techniques lend themselves naturally to large scale parallel processing. To address the computational demands, graphics processing units, specifically NVIDIA's CUDA framework and Tesla architecture, were investigated as a low-cost, high performance solution to a number of clustering algorithms.
520
$a
C-means and Expectation Maximization with Gaussian mixture models were implemented using the CUDA framework. The algorithm implementations use a hybrid of CUDA, OpenMP, and MPI to scale to many GPUs on multiple nodes in a high performance computing environment. This framework is envisioned as part of a larger cloud-based workflow service where biologists can apply multiple algorithms and parameter sweeps to their data sets and quickly receive a thorough set of results that can be further analyzed by experts.
520
$a
Improvements over previous GPU-accelerated implementations range from 1.42x to 21x for C-means and 3.72x to 5.65x for the Gaussian mixture model on non-trivial data sets. Using a single NVIDIA GTX 260 speedups are on average 90x for C-means and 74x for Gaussians with flow cytometry files compared to optimized C code running on a single core of a modern Intel CPU. Using the TeraGrid "Lincoln" high performance cluster at NCSA C-means achieves 42% parallel efficiency and a CPU speedup of 4794x with 128 Tesla C1060 GPUs. The Gaussian mixture model achieves 72% parallel efficiency and a CPU speedup of 6286x.
590
$a
School code: 0465.
650
4
$a
Engineering, Computer.
$3
1669061
650
4
$a
Biology, Bioinformatics.
$3
1018415
650
4
$a
Computer Science.
$3
626642
690
$a
0464
690
$a
0715
690
$a
0984
710
2
$a
Rochester Institute of Technology.
$b
Computer Engineering.
$3
1018788
773
0
$t
Masters Abstracts International
$g
49-01.
790
1 0
$a
Shaaban, Muhammad E.,
$e
advisor
790
1 0
$a
von Laszewski, Gregor
$e
committee member
790
1 0
$a
Cavenaugh, James S.
$e
committee member
790
1 0
$a
Melton, Roy W.
$e
committee member
790
$a
0465
791
$a
M.S.
792
$a
2010
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=1480233
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9157113
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login