語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Text Classification: Exploiting the ...
~
Alkhereyf, Sakhar.
FindBook
Google Book
Amazon
博客來
Text Classification: Exploiting the Social Network.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Text Classification: Exploiting the Social Network./
作者:
Alkhereyf, Sakhar.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2021,
面頁冊數:
228 p.
附註:
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
Contained By:
Dissertations Abstracts International82-06B.
標題:
Computer science. -
電子資源:
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28258554
ISBN:
9798557004442
Text Classification: Exploiting the Social Network.
Alkhereyf, Sakhar.
Text Classification: Exploiting the Social Network.
- Ann Arbor : ProQuest Dissertations & Theses, 2021 - 228 p.
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
Thesis (Ph.D.)--Columbia University, 2021.
This item must not be sold to any third party vendors.
Within the context of social networks, existing methods for document classification tasks typically only capture textual semantics while ignoring the text's metadata, e.g., the users who exchange emails and the communication networks they form. However, some work has shown that incorporating the social network information in addition to information from language is useful for various NLP applications, including sentiment analysis, inferring user attributes, and predicting interpersonal relations.In this thesis, we present empirical studies of incorporating social network information from the underlying communication graphs for various text classification tasks. We show different graph representations for different problems. Also, we introduce social network features extracted from these graphs. We use and extend graph embedding models for text classification.Our contributions are as follows. First, we have annotated large datasets of emails with fine-grained business and personal labels. Second, we propose graph representations for the social networks induced from documents and users and apply them on different text classification tasks. Third, we propose social network features extracted from these structures for documents and users. Fourth, we exploit different methods for modeling the social network of communication for four tasks: email classification into business and personal, overt display of power detection in emails, hierarchical power detection in emails, and Reddit post classification.Our main findings are: incorporating the social network information using our proposed methods improves the classification performance for all of the four tasks, and we beat the state-of-the-art graph embedding based model on the three tasks on email; additionally, for the fourth task (Reddit post classification), we argue that simple methods with the proper representation for the task can outperform a state-of-the-art generic model.
ISBN: 9798557004442Subjects--Topical Terms:
523869
Computer science.
Subjects--Index Terms:
Natural language processing
Text Classification: Exploiting the Social Network.
LDR
:03261nmm a2200445 4500
001
2275878
005
20210401103759.5
008
220723s2021 ||||||||||||||||| ||eng d
020
$a
9798557004442
035
$a
(MiAaPQ)AAI28258554
035
$a
AAI28258554
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Alkhereyf, Sakhar.
$3
3554124
245
1 0
$a
Text Classification: Exploiting the Social Network.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2021
300
$a
228 p.
500
$a
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
500
$a
Advisor: Rambow, Owen C.
502
$a
Thesis (Ph.D.)--Columbia University, 2021.
506
$a
This item must not be sold to any third party vendors.
520
$a
Within the context of social networks, existing methods for document classification tasks typically only capture textual semantics while ignoring the text's metadata, e.g., the users who exchange emails and the communication networks they form. However, some work has shown that incorporating the social network information in addition to information from language is useful for various NLP applications, including sentiment analysis, inferring user attributes, and predicting interpersonal relations.In this thesis, we present empirical studies of incorporating social network information from the underlying communication graphs for various text classification tasks. We show different graph representations for different problems. Also, we introduce social network features extracted from these graphs. We use and extend graph embedding models for text classification.Our contributions are as follows. First, we have annotated large datasets of emails with fine-grained business and personal labels. Second, we propose graph representations for the social networks induced from documents and users and apply them on different text classification tasks. Third, we propose social network features extracted from these structures for documents and users. Fourth, we exploit different methods for modeling the social network of communication for four tasks: email classification into business and personal, overt display of power detection in emails, hierarchical power detection in emails, and Reddit post classification.Our main findings are: incorporating the social network information using our proposed methods improves the classification performance for all of the four tasks, and we beat the state-of-the-art graph embedding based model on the three tasks on email; additionally, for the fourth task (Reddit post classification), we argue that simple methods with the proper representation for the task can outperform a state-of-the-art generic model.
590
$a
School code: 0054.
650
4
$a
Computer science.
$3
523869
650
4
$a
Mass communications.
$3
3422380
650
4
$a
Web studies.
$3
2122754
650
4
$a
Technical communication.
$3
3172863
650
4
$a
Information technology.
$3
532993
650
4
$a
Social research.
$3
2122687
653
$a
Natural language processing
653
$a
Social network
653
$a
Text classification
653
$a
Semantics
653
$a
Communication networks
653
$a
Email exchange
653
$a
Interpersonal relations
653
$a
Reddit
690
$a
0984
690
$a
0643
690
$a
0489
690
$a
0344
690
$a
0646
690
$a
0708
710
2
$a
Columbia University.
$b
Computer Science.
$3
1679268
773
0
$t
Dissertations Abstracts International
$g
82-06B.
790
$a
0054
791
$a
Ph.D.
792
$a
2021
793
$a
English
856
4 0
$u
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28258554
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9427612
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入