東華大學圖書館 |

語系: 繁體中文

說明(常見問題)

回圖書館首頁

手機版館藏查詢

登入

回首頁

切換: 標籤 | MARC模式 | ISBD

Applications in Sentiment Analysis a...

Clark, Eric M.

FindBook

Google Book

Amazon

博客來

Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.

紀錄類型:	書目-電子資源 : Monograph/item
正題名/作者:	Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media./
作者:	Clark, Eric M.
出版者:	Ann Arbor : ProQuest Dissertations & Theses, : 2019,
面頁冊數:	136 p.
附註:	Source: Dissertation Abstracts International, Volume: 80-04(E), Section: B.
Contained By:	Dissertation Abstracts International80-04B(E).
標題:	Computer science. -
電子資源:	http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=13419619
ISBN:	9780438725409

Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
Clark, Eric M.

Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media. - Ann Arbor : ProQuest Dissertations & Theses, 2019 - 136 p.

Source: Dissertation Abstracts International, Volume: 80-04(E), Section: B.

Thesis (Ph.D.)--The University of Vermont and State Agricultural College, 2019.

Twitter, a popular social media outlet, has evolved into a vast source of linguistic data, rich with opinion, sentiment, and discussion. We mined data from several public Twitter endpoints to identify content relevant to healthcare providers and public health regulatory professionals. We began by compiling content related to electronic nicotine delivery systems (or e-cigarettes) as these had become popular alternatives to tobacco products. There was an apparent need to remove high frequency tweeting entities, called bots, that would spam messages, advertisements, and fabricate testimonials. Algorithms were constructed using natural language processing and machine learning to sift human responses from automated accounts with high degrees of accuracy. We found the average hyperlink per tweet, the average character dissimilarity between each individual's content, as well as the rate of introduction of unique words were valuable attributes in identifying automated accounts. We performed a 10-fold Cross Validation and measured performance of each set of tweet features, at various bin sizes, the best of which performed with 97% accuracy. These methods were used to isolate automated content related to the advertising of electronic cigarettes. A rich taxonomy of automated entities, including robots, cyborgs, and spammers, each with different measurable linguistic features were categorized.

ISBN: 9780438725409Subjects--Topical Terms:

523869
Computer science.

Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
LDR:05002nmm a2200337 4500 001 2204460
005 20190716100707.5
008 201008s2019 ||||||||||||||||| ||eng d
020 $a 9780438725409
035 $a (MiAaPQ)AAI13419619
035 $a (MiAaPQ)uvm:10805
035 $a AAI13419619
040 $a MiAaPQ $c MiAaPQ
100 1 $a Clark, Eric M. $3 3431329
245 1 0 $a Applications in Sentiment Analysis and Machine Learning for Identifying Public Health Variables across Social Media.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2019
300 $a 136 p.
500 $a Source: Dissertation Abstracts International, Volume: 80-04(E), Section: B.
500 $a Advisers: Peter S. Dodds; Chris M. Danforth.
502 $a Thesis (Ph.D.)--The University of Vermont and State Agricultural College, 2019.
520 $a Twitter, a popular social media outlet, has evolved into a vast source of linguistic data, rich with opinion, sentiment, and discussion. We mined data from several public Twitter endpoints to identify content relevant to healthcare providers and public health regulatory professionals. We began by compiling content related to electronic nicotine delivery systems (or e-cigarettes) as these had become popular alternatives to tobacco products. There was an apparent need to remove high frequency tweeting entities, called bots, that would spam messages, advertisements, and fabricate testimonials. Algorithms were constructed using natural language processing and machine learning to sift human responses from automated accounts with high degrees of accuracy. We found the average hyperlink per tweet, the average character dissimilarity between each individual's content, as well as the rate of introduction of unique words were valuable attributes in identifying automated accounts. We performed a 10-fold Cross Validation and measured performance of each set of tweet features, at various bin sizes, the best of which performed with 97% accuracy. These methods were used to isolate automated content related to the advertising of electronic cigarettes. A rich taxonomy of automated entities, including robots, cyborgs, and spammers, each with different measurable linguistic features were categorized.
520 $a Electronic cigarette related posts were classified as automated or organic and content was investigated with a hedonometric sentiment analysis. The overwhelming majority (≈ 80%) were automated, many of which were commercial in nature. Others used false testimonials that were sent directly to individuals as a personalized form of targeted marketing. Many tweets advertised nicotine vaporizer fluid (or e-liquid) in various "kid-friendly" flavors including 'Fudge Brownie', 'Hot Chocolate', 'Circus Cotton Candy' along with every imaginable flavor of fruit, which were long ago banned for traditional tobacco products. Others offered free trials, as well as incentives to retweet and spread the post among their own network. Free prize giveaways were also hosted whose raffle tickets were issued for sharing their tweet. Due to the large youth presence on the public social media platform, this was evidence that the marketing of electronic cigarettes needed considerable regulation. Twitter has since officially banned all electronic cigarette advertising on their platform.
520 $a Social media has the capacity to afford the healthcare industry with valuable feedback from patients who reveal and express their medical decision-making process, as well as self-reported quality of life indicators both during and post treatment. We have studied several active cancer patient populations, discussing their experiences with the disease as well as survivor-ship. We experimented with a Convolutional Neural Network (CNN) as well as logistic regression to classify tweets as patient related. This led to a sample of 845 breast cancer survivor accounts to study, over 16 months. We found positive sentiments regarding patient treatment, raising support, and spreading awareness. A large portion of negative sentiments were shared regarding political legislation that could result in loss of coverage of their healthcare. We refer to these online public testimonies as "Invisible Patient Reported Outcomes" (iPROs), because they carry relevant indicators, yet are difficult to capture by conventional means of self-reporting. Our methods can be readily applied interdisciplinary to obtain insights into a particular group of public opinions. Capturing iPROs and public sentiments from online communication can help inform healthcare professionals and regulators, leading to more connected and personalized treatment regimens. Social listening can provide valuable insights into public health surveillance strategies.
590 $a School code: 0243.
650 4 $a Computer science. $3 523869
650 4 $a Social research. $3 2122687
650 4 $a Information technology. $3 532993
690 $a 0984
690 $a 0344
690 $a 0489
710 2 $a The University of Vermont and State Agricultural College. $b Complex Systems. $3 3431330
773 0 $t Dissertation Abstracts International $g 80-04B(E).
790 $a 0243
791 $a Ph.D.
792 $a 2019
793 $a English
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=13419619