Willkommen bei Scribd!

Individual Assignment #1: Data Source and Libraries

Hochgeladen von

0% fanden dieses Dokument nützlich (0 Abstimmungen)

11 Ansichten2 Seiten

This document discusses extracting Twitter data about people's opinions on dockless bike sharing programs. It explains that Twitter provides a way to understand public perceptions at low cost and with anonymity. The author plans to search Twitter for tweets containing "dock-less bike" and analyze the sentiment in the text towards the new bike sharing model. Several R packages are installed to facilitate extracting the Twitter data, cleaning the text, visualizing word frequencies and sentiments, and allowing future text analysis.

Originalbeschreibung:

text mining

Originaltitel

week1-JXiao

Copyright

Verfügbare Formate

DOCX, PDF, TXT oder online auf Scribd lesen

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Dieses Dokument melden

Copyright:

Verfügbare Formate

Als DOCX, PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

0% fanden dieses Dokument nützlich (0 Abstimmungen)

11 Ansichten2 Seiten

Individual Assignment #1: Data Source and Libraries

Hochgeladen von

Xiao Jiewen

Copyright:

Verfügbare Formate

Als DOCX, PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

Zu Seite

Sie sind auf Seite 1von 2

Im Dokument suchen

Individual Assignment #1

Data Source and Libraries

Jiewen Xiao
U07648520

When we are trying to understand people’s perception of a particular issue, social media

mining has two advantages over the traditional ways of the survey. The cost and efforts of

reaching a broad audience are low, and the anonymity of social media makes people more likely

to express their real opinion without self-imposed censor (Das, Sun, and Dutta, 2015).

Twitter, as one of the largest social platforms, provides us with vast resources for

conducting social research as well as understanding the target market where a company plans to

operate. The data I’m planning to work with are those individual tweets that contain keywords of

interest, and particularly for my case, it’s “dock-less bike.” The tweets include introductory

information about reformed bike-sharing programs that are relatively new to the market. Most

importantly, the unstructured text area contains people’s opinion towards this new form of the

bike-sharing program. Some people may think they are convenient and environmental friendly;

others may dislike it because the bikes take up public space and could be potentially dangerous.

A company that provides bike-sharing services can utilize these positive or negative sentiments

to decide when and where to initiate the program, how to improve the service and how to address

the common concerns.

I installed several packages to facilitate the twitter data extraction process. I followed the

tutorial of Roy(2017) to use the R package “twitteR”, “ROAuth”, and “RCurl” to set up a search-

and-extract mechanism to get the data from Twitter (Roy, 2017). Using the function

“searchTwitter”, I can customize the keywords, length, language, location, and other
characteristics for the search results (Gentry, 2016). I also installed the “tm” package for further

text cleaning and analysis, the “wordcloud” package for presenting word frequency in a keyword

cloud, the “ggplot2” package for elegantly visualize the data, the “XML” package for parsing

XML and HTML documents, the “stringr” package for making string functions simpler and

easier to use, and the “RTextTools” package to use the machine learning to simplify data

processing. There might be more packages and tools that I will find useful in the future.

References:

1. Das, S., Sun, X., & Dutta, A. (2015). Investigating user ridership sentiments for bike

sharing programs. Journal of Transportation Technologies, 5(02), 69.

2. Jia, Z., Xie, G., Gao, J., & Yu, S. (2016, December). Bike-Sharing System: A Big-Data

Perspective. International Conference on Smart Computing and Communication, 548-

557. Springer, Cham.

3. Tweets. Retrieved from Twitter: https://www.twitter.com

4. Developer Website for Twitter. Retrieved from https://dev.twitter.com/docs/auth/oauth

5. Roy S. (2017). Tutorial on how to extract tweets using R. Retrieved from

https://www.researchgate.net/post/How_do_I_extract_tweets_using_R

6. Gentry, J. (2016). Package ‘twitteR’. R package version, 1(9).

Das könnte Ihnen auch gefallen

Research Paper
Dokument7 Seiten
Research Paper
Neelam Kumari B
Noch keine Bewertungen
Big Data J
Dokument3 Seiten
Big Data J
shanmugaraja85
Noch keine Bewertungen
2 Cesare-PromisesPitfallsUsing-2018
Dokument22 Seiten
2 Cesare-PromisesPitfallsUsing-2018
Fahmi Burhanuddin
Noch keine Bewertungen
IEEASMD00
Dokument12 Seiten
IEEASMD00
music2850
Noch keine Bewertungen
05-B Felt 2014 (Twitter)
Dokument15 Seiten
05-B Felt 2014 (Twitter)
100111
Noch keine Bewertungen
20 On Scalable and Robust Truth Discovery in Big Data Social Media Sensing Applications
Dokument3 Seiten
20 On Scalable and Robust Truth Discovery in Big Data Social Media Sensing Applications
Baranishankar
Noch keine Bewertungen
Compusoft, 3 (4), 738-742 PDF
Dokument5 Seiten
Compusoft, 3 (4), 738-742 PDF
Ijact Editor
Noch keine Bewertungen
Social Media
Dokument17 Seiten
Social Media
vira8384
Noch keine Bewertungen
BRM Project-Abhishek's MacBook Air
Dokument24 Seiten
BRM Project-Abhishek's MacBook Air
ac64113
Noch keine Bewertungen
Joseph 2017
Dokument14 Seiten
Joseph 2017
RAHUL SINGH RAJPUT
Noch keine Bewertungen
Impact of Big Data and Social Media On Society: March 2016
Dokument3 Seiten
Impact of Big Data and Social Media On Society: March 2016
Robby Walsen Pasaribu
Noch keine Bewertungen
Backchanneling Conversations
Dokument11 Seiten
Backchanneling Conversations
Sahba Sadeghian
Noch keine Bewertungen
Research Challenge On Opinion Mining and Sentiment Analysis: Background
Dokument9 Seiten
Research Challenge On Opinion Mining and Sentiment Analysis: Background
Venkata Kiran Kumar Sathikela
Noch keine Bewertungen
CCL MiniProject
Dokument8 Seiten
CCL MiniProject
Sakshi Pawar
Noch keine Bewertungen
New Research
Dokument4 Seiten
New Research
Priyanshi Goyal
Noch keine Bewertungen
Big Data Analytics Meets Social Media A Systematic Review of Tech - Compressed
Dokument38 Seiten
Big Data Analytics Meets Social Media A Systematic Review of Tech - Compressed
saal wow
Noch keine Bewertungen
Twitter Data Mining For Sentiment Analysis On Peoples Feedback Against Government Public Policy
Dokument13 Seiten
Twitter Data Mining For Sentiment Analysis On Peoples Feedback Against Government Public Policy
Global Research and Development Services
100% (2)
The Emerging Role of Artificial Intelligence in Modern Society
Dokument7 Seiten
The Emerging Role of Artificial Intelligence in Modern Society
James Wilson
Noch keine Bewertungen
A Review On Twitter Sentiment Analysis Using ML
Dokument6 Seiten
A Review On Twitter Sentiment Analysis Using ML
IJRASETPublications
Noch keine Bewertungen
Volume 6 Issue 2
Dokument74 Seiten
Volume 6 Issue 2
drrajput
Noch keine Bewertungen
07516129
Dokument6 Seiten
07516129
Govind Upadhyay
Noch keine Bewertungen
Critical Questions For Big Data
Dokument6 Seiten
Critical Questions For Big Data
Sueli Sousa
0% (1)
Undp-Gpn-Sdgi-Data Philanthropy International Organizations and Development Policy
Dokument18 Seiten
Undp-Gpn-Sdgi-Data Philanthropy International Organizations and Development Policy
Amina Daoud
Noch keine Bewertungen
Social Media Sentiment Analysis for Detecting Racist Tweets
Dokument5 Seiten
Social Media Sentiment Analysis for Detecting Racist Tweets
Tanya Gupta
100% (1)
IJETR042461
Dokument5 Seiten
IJETR042461
erpublication
Noch keine Bewertungen
Big Data Analytics Meets Social Media, Systematic Review of Techniques
Dokument39 Seiten
Big Data Analytics Meets Social Media, Systematic Review of Techniques
JAGA ADHI
Noch keine Bewertungen
Perception of Students on Social Media as Data Storage Sites
Dokument13 Seiten
Perception of Students on Social Media as Data Storage Sites
Alysa May Mejorada
Noch keine Bewertungen
S U R J S S: Indh Niversity Esearch Ournal (Cience Eries)
Dokument6 Seiten
S U R J S S: Indh Niversity Esearch Ournal (Cience Eries)
ads
Noch keine Bewertungen
1 s2.0 S2214785320391501 Main
Dokument6 Seiten
1 s2.0 S2214785320391501 Main
rose rise
Noch keine Bewertungen
A Thematic Review On Digital Storytelling (DST) in Social Media
Dokument33 Seiten
A Thematic Review On Digital Storytelling (DST) in Social Media
JOVINER LACTAM
Noch keine Bewertungen
Global Digital Activism Data Set User's Manual and Codebook
Dokument46 Seiten
Global Digital Activism Data Set User's Manual and Codebook
Akshay Aggarwal
Noch keine Bewertungen
Case Stud1
Dokument5 Seiten
Case Stud1
john paul demeterio
Noch keine Bewertungen
Big Data
Dokument5 Seiten
Big Data
mahmood
Noch keine Bewertungen
Chapter 2
Dokument13 Seiten
Chapter 2
Shible Sheikh
Noch keine Bewertungen
Case Study 1
Dokument32 Seiten
Case Study 1
Joanne Louise Camonias Birondo
Noch keine Bewertungen
Increasing The Investment's Opportunities in Kingdom of Saudi Arabia by Studying and Analyzing The Social Media Data
Dokument16 Seiten
Increasing The Investment's Opportunities in Kingdom of Saudi Arabia by Studying and Analyzing The Social Media Data
Anonymous Gl4IRRjzN
Noch keine Bewertungen
Increasing The Investment's Opportunities in Kingdom of Saudi Arabia by Studying and Analyzing The Social Media Data
Dokument16 Seiten
Increasing The Investment's Opportunities in Kingdom of Saudi Arabia by Studying and Analyzing The Social Media Data
Anonymous Gl4IRRjzN
Noch keine Bewertungen
Social Media
Dokument3 Seiten
Social Media
HARIS E SALIM
Noch keine Bewertungen
SMM ch1 PDF
Dokument11 Seiten
SMM ch1 PDF
Raul Andres Camacho Cruz
Noch keine Bewertungen
Mittelstadt, The Ethics of Algorithims Mapping The Debate
Dokument17 Seiten
Mittelstadt, The Ethics of Algorithims Mapping The Debate
ZAM ARTISAN CHOCOLATES
Noch keine Bewertungen
Towards A Standard Sampling Methodology On Online Social Networks: Collecting Global Trends On Twitter
Dokument19 Seiten
Towards A Standard Sampling Methodology On Online Social Networks: Collecting Global Trends On Twitter
reyhshs731
Noch keine Bewertungen
The Challenges of Social Media Research in Online Learning Environments
Dokument4 Seiten
The Challenges of Social Media Research in Online Learning Environments
IJAR JOURNAL
Noch keine Bewertungen
Methods To Investigate Concept Drift in Big Data Streams
Dokument25 Seiten
Methods To Investigate Concept Drift in Big Data Streams
Renata Lins
Noch keine Bewertungen
Sentiment Analysis Tool Using Machine Learning Algorithms
Dokument5 Seiten
Sentiment Analysis Tool Using Machine Learning Algorithms
International Journal of Application or Innovation in Engineering & Management
Noch keine Bewertungen
Mastering Social Media Mining With R - Sample Chapter
Dokument27 Seiten
Mastering Social Media Mining With R - Sample Chapter
Packt Publishing
Noch keine Bewertungen
BIG DATA ANALYTICS SURVEY APPLICATIONS SOCIAL MEDIA
Dokument21 Seiten
BIG DATA ANALYTICS SURVEY APPLICATIONS SOCIAL MEDIA
mail
Noch keine Bewertungen
Data Justice: Connecting Digital Rights Globally
Dokument14 Seiten
Data Justice: Connecting Digital Rights Globally
Suraj Prakash
Noch keine Bewertungen
Result Analysis of User Review For Sentiment Classification
Dokument8 Seiten
Result Analysis of User Review For Sentiment Classification
IJRASETPublications
Noch keine Bewertungen
Social Media Mining With R Sample Chapter
Dokument18 Seiten
Social Media Mining With R Sample Chapter
Packt Publishing
Noch keine Bewertungen
Wurood Shaher. Synopsis Social Media History Assignment
Dokument5 Seiten
Wurood Shaher. Synopsis Social Media History Assignment
Wurood Shaher
Noch keine Bewertungen
A Survey On Bigdata Analytics Using Social Media Data
Dokument4 Seiten
A Survey On Bigdata Analytics Using Social Media Data
kritheedevi
Noch keine Bewertungen
Sentiment Analysis On Twitter
Dokument7 Seiten
Sentiment Analysis On Twitter
armanghouri
Noch keine Bewertungen
Survey On Big Data Mining Platforms, Algorithms and Challenges
Dokument9 Seiten
Survey On Big Data Mining Platforms, Algorithms and Challenges
ASMA SHABBIR
Noch keine Bewertungen
Why Map Issues? On Controversy Analysis As A Digital Method: Noortje Marres
Dokument32 Seiten
Why Map Issues? On Controversy Analysis As A Digital Method: Noortje Marres
Ta Cont
Noch keine Bewertungen
Emerging Technologies and Law
Dokument11 Seiten
Emerging Technologies and Law
Aditya Pratap Singh
Noch keine Bewertungen
American Elections 2012
Dokument29 Seiten
American Elections 2012
tiziano pacifico
Noch keine Bewertungen
Timeline Analysis of Twitter User Timeline Analysis of Twitter User
Dokument10 Seiten
Timeline Analysis of Twitter User Timeline Analysis of Twitter User
rathan_cage6181
Noch keine Bewertungen
Evolution of Big Data and Tools For Big Data
Dokument9 Seiten
Evolution of Big Data and Tools For Big Data
lu09
Noch keine Bewertungen
Making Sense of Tweets Using Sentiment Analysis On Closely Related Topics
Dokument11 Seiten
Making Sense of Tweets Using Sentiment Analysis On Closely Related Topics
sinung19002
Noch keine Bewertungen
Information and Recommender Systems
Von Everand
Information and Recommender Systems
Elsa Nègre
Noch keine Bewertungen
Obtaining Workplace Information
Dokument4 Seiten
Obtaining Workplace Information
Jessica Carisma
Noch keine Bewertungen
Communication Tourism PDF
Dokument2 Seiten
Communication Tourism PDF
Shane
0% (1)
Factors Affecting English Speaking Skills of Students
Dokument18 Seiten
Factors Affecting English Speaking Skills of Students
Rona Jane Miranda
Noch keine Bewertungen
AI Capstone Project Report for Image Captioning and Digital Assistant
Dokument28 Seiten
AI Capstone Project Report for Image Captioning and Digital Assistant
akg299
50% (2)
Bianchi Size Chart for Mountain Bikes
Dokument1 Seite
Bianchi Size Chart for Mountain Bikes
Syafiq Ishak
Noch keine Bewertungen
Heidegger - Nietzsches Word God Is Dead
Dokument31 Seiten
Heidegger - Nietzsches Word God Is Dead
Soumyadeep
Noch keine Bewertungen
12 Preliminary Conference Brief
Dokument7 Seiten
12 Preliminary Conference Brief
kaizen shinichi
Noch keine Bewertungen
Compro Russindo Group Tahun 2018 Update
Dokument44 Seiten
Compro Russindo Group Tahun 2018 Update
Elyza Farah Fadhillah
Noch keine Bewertungen
SLE Case Report on 15-Year-Old Girl
Dokument38 Seiten
SLE Case Report on 15-Year-Old Girl
DiLa NandaRi
Noch keine Bewertungen
Set up pfSense transparent Web proxy with multi-WAN failover
Dokument8 Seiten
Set up pfSense transparent Web proxy with multi-WAN failover
Alicia Smith
Noch keine Bewertungen
ISA standards, materials, and control room concepts
Dokument8 Seiten
ISA standards, materials, and control room concepts
Giovanni
Noch keine Bewertungen
Year 11 Economics Introduction Notes
Dokument9 Seiten
Year 11 Economics Introduction Notes
anon_315466406
0% (1)
Brain Chip Report
Dokument30 Seiten
Brain Chip Report
srikanthkalemla
100% (3)
Supreme Court declares Pork Barrel System unconstitutional
Dokument3 Seiten
Supreme Court declares Pork Barrel System unconstitutional
Dom Robinson Baggayan
Noch keine Bewertungen
What Is Love? - Osho: Sat Sangha Salon
Dokument7 Seiten
What Is Love? - Osho: Sat Sangha Salon
Michael Vladislav
Noch keine Bewertungen
Research Proposal by Efe Onomake Updated.
Dokument18 Seiten
Research Proposal by Efe Onomake Updated.
efe west
Noch keine Bewertungen
Organisation Study of KAMCO
Dokument62 Seiten
Organisation Study of KAMCO
Robin Thomas
100% (11)
HERBAL SHAMPOO PPT by SAILI RAJPUT
Dokument24 Seiten
HERBAL SHAMPOO PPT by SAILI RAJPUT
Saili Rajput
100% (1)
2C Syllable Division: Candid Can/d
Dokument32 Seiten
2C Syllable Division: Candid Can/d
Rawats002
Noch keine Bewertungen
Secondary Sources Works Cited
Dokument7 Seiten
Secondary Sources Works Cited
Jacqueline
Noch keine Bewertungen
An Analysis of Students Pronounciation Errors Made by Ninth Grade of Junior High School 1 Tengaran
Dokument22 Seiten
An Analysis of Students Pronounciation Errors Made by Ninth Grade of Junior High School 1 Tengaran
Octa Wibawa
Noch keine Bewertungen
Marrickville DCP 2011 - 2.3 Site and Context Analysis
Dokument9 Seiten
Marrickville DCP 2011 - 2.3 Site and Context Analysis
kiranji
Noch keine Bewertungen
Battery Genset Usage 06-08pelj0910
Dokument4 Seiten
Battery Genset Usage 06-08pelj0910
b400013
Noch keine Bewertungen
Political Philosophy and Political Science: Complex Relationships
Dokument15 Seiten
Political Philosophy and Political Science: Complex Relationships
Vane Valiente
Noch keine Bewertungen
Improve Your Social Skills With Soft And Hard Techniques
Dokument26 Seiten
Improve Your Social Skills With Soft And Hard Techniques
Earlkenneth Navarro
Noch keine Bewertungen
Toxicology: General Aspects, Types, Routes of Exposure & Analysis
Dokument76 Seiten
Toxicology: General Aspects, Types, Routes of Exposure & Analysis
Asma Sikander
Noch keine Bewertungen
Lesson 1 Reviewer in Pmls
Dokument10 Seiten
Lesson 1 Reviewer in Pmls
Charisa Joyce Agbon
Noch keine Bewertungen
Markle 1999 Shield Veria
Dokument37 Seiten
Markle 1999 Shield Veria
Mads Sondre Prøitz
Noch keine Bewertungen
IJAKADI: A Stage Play About Spiritual Warfare
Dokument9 Seiten
IJAKADI: A Stage Play About Spiritual Warfare
obiji marvelous Chibuzo
Noch keine Bewertungen
Purposive Communication Module 1
Dokument18 Seiten
Purposive Communication Module 1
daphne pejo
100% (4)