Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Comparison of clustering algorithms ...
~
Chen, Jie.
Linked to FindBook
Google Book
Amazon
博客來
Comparison of clustering algorithms and its application to document clustering.
Record Type:
Electronic resources : Monograph/item
Title/Author:
Comparison of clustering algorithms and its application to document clustering./
Author:
Chen, Jie.
Description:
214 p.
Notes:
Source: Dissertation Abstracts International, Volume: 66-03, Section: B, page: 1543.
Contained By:
Dissertation Abstracts International66-03B.
Subject:
Computer Science. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3169789
ISBN:
0542059452
Comparison of clustering algorithms and its application to document clustering.
Chen, Jie.
Comparison of clustering algorithms and its application to document clustering.
- 214 p.
Source: Dissertation Abstracts International, Volume: 66-03, Section: B, page: 1543.
Thesis (Ph.D.)--Princeton University, 2005.
We investigate the methodology to evaluate and compare the quality of clustering algorithms. We study the issues raised in evaluation, such as data generation and choice of evaluation metrics. We give head-to-head comparison of six important clustering algorithms from different research communities.
ISBN: 0542059452Subjects--Topical Terms:
626642
Computer Science.
Comparison of clustering algorithms and its application to document clustering.
LDR
:03388nmm 2200337 4500
001
1848446
005
20051202072552.5
008
130614s2005 eng d
020
$a
0542059452
035
$a
(UnM)AAI3169789
035
$a
AAI3169789
040
$a
UnM
$c
UnM
100
1
$a
Chen, Jie.
$3
623762
245
1 0
$a
Comparison of clustering algorithms and its application to document clustering.
300
$a
214 p.
500
$a
Source: Dissertation Abstracts International, Volume: 66-03, Section: B, page: 1543.
500
$a
Adviser: Andrea S. LaPaugh.
502
$a
Thesis (Ph.D.)--Princeton University, 2005.
520
$a
We investigate the methodology to evaluate and compare the quality of clustering algorithms. We study the issues raised in evaluation, such as data generation and choice of evaluation metrics. We give head-to-head comparison of six important clustering algorithms from different research communities.
520
$a
We generate data using an extended mixture of Gaussian model that controls data characteristics such as shape, volume and size of a class. The added control facilitates more thorough exploration into the parameter space of the generative model and therefore makes the algorithmic evaluation more comprehensive.
520
$a
We summarize datasets by their data characteristics. Previous comparison conclusions based on specific datasets are hard to be applied to other datasets. In contrast, conclusions based on data characteristics can be easily generalized. Therefore, we can predict the performance of an algorithm. The prediction is a significant step forward for comparison studies.
520
$a
We compare three evaluation indices from different research fields. We recommend the f-score measurement if a single evaluation index is used. However, it is more desirable to apply multiple indices and vote among them to determine the ranking of algorithms.
520
$a
We isolate the objective function (sum-of-squares) and the optimization approach (greedy) used in the popular k-means algorithm. We investigate the effectiveness and efficiency in the four algorithms resulting from substituting another objective function (min-max cut) and another optimization approach (Kernighan-Lin). We illustrate by a quantitative study that contrary to conventional wisdom, k-means is not only fast but also produces quality clusters. It achieves 95% of the best quality among the candidate algorithms running in time an order of magnitude faster.
520
$a
We present detailed studies of the algorithmic behavior in response to data characteristics. We give an alternative definition of cluster separation and show that the new definition measures the degree of cluster difficulty for spherical and ellipsoidal clusters more consistently.
520
$a
We develop and systematically evaluate a practical version of a spectral clustering algorithm originally specified for provable guarantees of correctness. We observe that the modified algorithm can find perfect solution when the clusters are well separated, where iterative algorithms such as k-means tend to miss the perfect solution.
590
$a
School code: 0181.
650
4
$a
Computer Science.
$3
626642
690
$a
0984
710
2 0
$a
Princeton University.
$3
645579
773
0
$t
Dissertation Abstracts International
$g
66-03B.
790
1 0
$a
LaPaugh, Andrea S.,
$e
advisor
790
$a
0181
791
$a
Ph.D.
792
$a
2005
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3169789
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9197960
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login