Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Techniques for improved LSI text ret...
~
Yan, Hua.
Linked to FindBook
Google Book
Amazon
博客來
Techniques for improved LSI text retrieval.
Record Type:
Electronic resources : Monograph/item
Title/Author:
Techniques for improved LSI text retrieval./
Author:
Yan, Hua.
Description:
190 p.
Notes:
Source: Dissertation Abstracts International, Volume: 67-03, Section: B, page: 1535.
Contained By:
Dissertation Abstracts International67-03B.
Subject:
Computer Science. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3210996
ISBN:
9780542595486
Techniques for improved LSI text retrieval.
Yan, Hua.
Techniques for improved LSI text retrieval.
- 190 p.
Source: Dissertation Abstracts International, Volume: 67-03, Section: B, page: 1535.
Thesis (Ph.D.)--Wayne State University, 2006.
This work identifies and studies four major issues in LSI (Latent Semantic Indexing) text retrieval: a multiplicity of standard query methods, alternative non-standard query methods, the issue of Generic Terms, and the lacking of Structural Data.
ISBN: 9780542595486Subjects--Topical Terms:
626642
Computer Science.
Techniques for improved LSI text retrieval.
LDR
:03431nmm 2200325 4500
001
1835073
005
20071204070610.5
008
130610s2006 eng d
020
$a
9780542595486
035
$a
(UMI)AAI3210996
035
$a
AAI3210996
040
$a
UMI
$c
UMI
100
1
$a
Yan, Hua.
$3
1923707
245
1 0
$a
Techniques for improved LSI text retrieval.
300
$a
190 p.
500
$a
Source: Dissertation Abstracts International, Volume: 67-03, Section: B, page: 1535.
500
$a
Advisers: William Grosky; Farshad Fotouhi.
502
$a
Thesis (Ph.D.)--Wayne State University, 2006.
520
$a
This work identifies and studies four major issues in LSI (Latent Semantic Indexing) text retrieval: a multiplicity of standard query methods, alternative non-standard query methods, the issue of Generic Terms, and the lacking of Structural Data.
520
$a
Firstly, three commonly-used standard query methods (versions A, B and B') are identified, compared, analyzed, and tested. Both mathematical analysis and experimental results reveal that version B is a better choice than version A, and that versions B and B' are essentially equivalent provided that the Equivalency Principle is satisfied. This finding shall eliminate the confusion and randomness of applying possibly incompatible query methods among LSI researchers and help restore the comparability of their works.
520
$a
Secondly, some novel non-standard versions of query methods using the discovered technique of singular value rescaling (SVR) are proposed and studied. Testing results in the prototyping experimental environments and the standardized TREC data sets both confirmed the effectiveness of SVR. This finding bears the practical significance that the current information retrieval techniques may be significantly improved by simply adopting a novel query method which is computationally as efficient as the best standard query method.
520
$a
Thirdly, this work studies the effects of Generic Terms, a minority group of terms that have relatively uniform distribution pattern among all topics of documents, on the LSI models. Characterization and definition of Generic Terms are achieved and an iterative algorithm is designed and implemented to identify these special terms. Experimental results strongly suggest that identification and exclusion of Generic Terms helps improve LSI text retrieval performance.
520
$a
Fourthly, this work also studies how to integrate Structural Data (loosely defined as sentence structure) into the LSI models. Four major characteristics of Structural Data are identified: derivativity, maneuverability, language dependency, and updatability/downdatability. Qualifications of two candidate forms of Structural Data, i.e., word order and non-word-order syntax (both in English language), are carefully studied. A complete series of procedures are developed to fully integrate Structural Data (in its most qualified form of word order data) into the LSI models. Experimental results strongly suggest that acquisition and integration of Structural Data helps improve LSI text retrieval performance.
590
$a
School code: 0254.
650
4
$a
Computer Science.
$3
626642
690
$a
0984
710
2 0
$a
Wayne State University.
$3
975058
773
0
$t
Dissertation Abstracts International
$g
67-03B.
790
1 0
$a
Grosky, William,
$e
advisor
790
1 0
$a
Fotouhi, Farshad,
$e
advisor
790
$a
0254
791
$a
Ph.D.
792
$a
2006
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3210996
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9226093
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login