語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Applications in Sentiment Analysis a...
~
Clark, Eric M.
FindBook
Google Book
Amazon
博客來
Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media./
作者:
Clark, Eric M.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2019,
面頁冊數:
136 p.
附註:
Source: Dissertation Abstracts International, Volume: 80-04(E), Section: B.
Contained By:
Dissertation Abstracts International80-04B(E).
標題:
Computer science. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=13419619
ISBN:
9780438725409
Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
Clark, Eric M.
Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
- Ann Arbor : ProQuest Dissertations & Theses, 2019 - 136 p.
Source: Dissertation Abstracts International, Volume: 80-04(E), Section: B.
Thesis (Ph.D.)--The University of Vermont and State Agricultural College, 2019.
Twitter, a popular social media outlet, has evolved into a vast source of linguistic data, rich with opinion, sentiment, and discussion. We mined data from several public Twitter endpoints to identify content relevant to healthcare providers and public health regulatory professionals. We began by compiling content related to electronic nicotine delivery systems (or e-cigarettes) as these had become popular alternatives to tobacco products. There was an apparent need to remove high frequency tweeting entities, called bots, that would spam messages, advertisements, and fabricate testimonials. Algorithms were constructed using natural language processing and machine learning to sift human responses from automated accounts with high degrees of accuracy. We found the average hyperlink per tweet, the average character dissimilarity between each individual's content, as well as the rate of introduction of unique words were valuable attributes in identifying automated accounts. We performed a 10-fold Cross Validation and measured performance of each set of tweet features, at various bin sizes, the best of which performed with 97% accuracy. These methods were used to isolate automated content related to the advertising of electronic cigarettes. A rich taxonomy of automated entities, including robots, cyborgs, and spammers, each with different measurable linguistic features were categorized.
ISBN: 9780438725409Subjects--Topical Terms:
523869
Computer science.
Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
LDR
:05002nmm a2200337 4500
001
2204460
005
20190716100707.5
008
201008s2019 ||||||||||||||||| ||eng d
020
$a
9780438725409
035
$a
(MiAaPQ)AAI13419619
035
$a
(MiAaPQ)uvm:10805
035
$a
AAI13419619
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Clark, Eric M.
$3
3431329
245
1 0
$a
Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2019
300
$a
136 p.
500
$a
Source: Dissertation Abstracts International, Volume: 80-04(E), Section: B.
500
$a
Advisers: Peter S. Dodds; Chris M. Danforth.
502
$a
Thesis (Ph.D.)--The University of Vermont and State Agricultural College, 2019.
520
$a
Twitter, a popular social media outlet, has evolved into a vast source of linguistic data, rich with opinion, sentiment, and discussion. We mined data from several public Twitter endpoints to identify content relevant to healthcare providers and public health regulatory professionals. We began by compiling content related to electronic nicotine delivery systems (or e-cigarettes) as these had become popular alternatives to tobacco products. There was an apparent need to remove high frequency tweeting entities, called bots, that would spam messages, advertisements, and fabricate testimonials. Algorithms were constructed using natural language processing and machine learning to sift human responses from automated accounts with high degrees of accuracy. We found the average hyperlink per tweet, the average character dissimilarity between each individual's content, as well as the rate of introduction of unique words were valuable attributes in identifying automated accounts. We performed a 10-fold Cross Validation and measured performance of each set of tweet features, at various bin sizes, the best of which performed with 97% accuracy. These methods were used to isolate automated content related to the advertising of electronic cigarettes. A rich taxonomy of automated entities, including robots, cyborgs, and spammers, each with different measurable linguistic features were categorized.
520
$a
Electronic cigarette related posts were classified as automated or organic and content was investigated with a hedonometric sentiment analysis. The overwhelming majority (≈ 80%) were automated, many of which were commercial in nature. Others used false testimonials that were sent directly to individuals as a personalized form of targeted marketing. Many tweets advertised nicotine vaporizer fluid (or e-liquid) in various "kid-friendly" flavors including 'Fudge Brownie', 'Hot Chocolate', 'Circus Cotton Candy' along with every imaginable flavor of fruit, which were long ago banned for traditional tobacco products. Others offered free trials, as well as incentives to retweet and spread the post among their own network. Free prize giveaways were also hosted whose raffle tickets were issued for sharing their tweet. Due to the large youth presence on the public social media platform, this was evidence that the marketing of electronic cigarettes needed considerable regulation. Twitter has since officially banned all electronic cigarette advertising on their platform.
520
$a
Social media has the capacity to afford the healthcare industry with valuable feedback from patients who reveal and express their medical decision-making process, as well as self-reported quality of life indicators both during and post treatment. We have studied several active cancer patient populations, discussing their experiences with the disease as well as survivor-ship. We experimented with a Convolutional Neural Network (CNN) as well as logistic regression to classify tweets as patient related. This led to a sample of 845 breast cancer survivor accounts to study, over 16 months. We found positive sentiments regarding patient treatment, raising support, and spreading awareness. A large portion of negative sentiments were shared regarding political legislation that could result in loss of coverage of their healthcare. We refer to these online public testimonies as "Invisible Patient Reported Outcomes" (iPROs), because they carry relevant indicators, yet are difficult to capture by conventional means of self-reporting. Our methods can be readily applied interdisciplinary to obtain insights into a particular group of public opinions. Capturing iPROs and public sentiments from online communication can help inform healthcare professionals and regulators, leading to more connected and personalized treatment regimens. Social listening can provide valuable insights into public health surveillance strategies.
590
$a
School code: 0243.
650
4
$a
Computer science.
$3
523869
650
4
$a
Social research.
$3
2122687
650
4
$a
Information technology.
$3
532993
690
$a
0984
690
$a
0344
690
$a
0489
710
2
$a
The University of Vermont and State Agricultural College.
$b
Complex Systems.
$3
3431330
773
0
$t
Dissertation Abstracts International
$g
80-04B(E).
790
$a
0243
791
$a
Ph.D.
792
$a
2019
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=13419619
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9381009
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入