SlideShare uma empresa Scribd logo
1 de 24
What is the link and text doing here: A Case Study of Cyworld Minihompies in Korea Steven Sams and Han Woo Park
Background This study analyses user-generated comments posted to Korean politicians on SNS Cyworld that contain a URL The study examines the type of service being linked to    through the URL and determines the frequency of           services A developed program captures all comments given to a   selected set of politicians within a predefined timeframe The text component of messages is analyzed using two    separate machine-learning mechanisms
Types of Hyperlinks Five social functions that hyperlinks can be said to perform Information Provision Network Strengthening Identity Building Audience Sharing Message Amplification Ackland et al. (2010)
Online Korean Political Sphere As in other countries, Korean politicians are increasingly  turning to social networks as a means to engage with      their electorate In 2007 Cyworld commanded a penetration rate of one third of the total population of South Korea, and since then all indications are that this proportion has increased.
Sample One hundred and thirty Korean National Assembly        Members’ Cyworld Minihomies. The date parameters of the study were April 2008 – June 2009 One hundred and fifty three thousand six hundred and    two comments were collected for period chosen for the     study.  One thousand two hundred and seventy six comments    contained links
Data Collection Method A program was developed that performs HTTP call to     request one page of comments from the politician’s         visitor board The content and date are isolated and held in temporary storage.  The process repeats until the target date parameters have been met.
Data Analysis Method: Links The links are checked to determine the number of unique URLs and corresponding number of unique domains.       These links / domains are then manually categorised into website type, such as portals, media, parties, homepages  of politicians, petition sites, online fan clubs, and NGOs) Location of service found using network query tool to determine the proportion of domestic and international     websites
Data Analysis Method: Text To analyse a large body of text, Natural Language            Processing (NLP) is one approach to categorisation that  can mitigate the problem of obtaining accurate results     that is unfeasible to perform manually A rudimentary Java class was developed that wrapped a small subset of the methods provided in the LingPipe API so that they could be called on the extracted text comments. The developed Java class enabled two forms of analysis: Sentiment Analysis and Collocation
Sentiment Analysis A polarity analyser was developed that is able to locate   significant word combinations and, using the developed    corpus model as a training dataset, determine if the         combination is generally positive or negative An accessible corpus of positive and negative sentiment composed in Korean has yet to be realized. A sample body of 2000 Korean text statements were      coded into objective,  subjective - positive and subjective - negative categories
Collocation 	Collocation analysis can determine which tokens are      more frequently found together than would normally be          expected. Collocation can identify proper nouns in this   way (such as the names or persons, places, or events) that would be lost if the frequency of each token were           analysed in isolation.
Results - Links 153,602 comments were collected for period chosen for the study 1,276 comments contained hyperlinks Total link count was 1,920 as it was common to have     more than one hyperlink contained within an individual   posting	 762 were unique full URLs and 259 were unique domains 1,849 URLs encountered in the sample were found to belong to services based in Korea and 71 from international service Performing message amplification and network building were prominent causes of link posting
Table 1: LexiURL Unique / Full hosts Based on the top 10 domains (24.5%) by occurrence out of 259
Table 2: LexiURL Unique / Full URLs
Table 3: Total links to each domain (Korea) Based on 1,078 (58.3%) of 1,849 links to Korean services
Table 4: Total links to each domain (Overseas) Based on 51 (71.8%) of 71 links to overseas services
Table 5: poster-gender and politician background
Table 6: Comments categorized by link type from the six groups of gender and political affiliation Table 6: Comments categorized by link type from the six groups of gender and political affiliation Based on 206 comments agreed on by both coders from the initial set of 300
Results - Text May and June 2008 were found to have high numbers of comments containing links that showed negative sentiment, and this date corresponds with the period of the candlelight protest May 2009 also shows large numbers of comments containing  hyperlinks that indicate negative sentiment, coinciding with the suicide of ex-president Roh Moo-Hyun The name of Korean President Lee Myung-bak was found to   occur two hundred and twenty nine times Terms pertaining to the candlelight protests, such as Mad Cow disease, beef, American goods, and candlelight protest occurred frequently Gini coefficient and a less formal term describing a similar measurement of wealth occurred frequently
Figure 1. Positive and negative sentiment from comments containing links
Confidence Levels 	To determine the effectiveness of the classification          approach, 10% of training data was removed from the      training set and used to evaluate the developed model.    This approach allows testing the classification based on   known human-classified data. The Average Conditional     Probability score provides a basis for determining the      ability of the classifier to correctly identify positive and    negative sentiment.  Based on the training set used, the    Average Conditional Probability was found to be 87%.
Limitations Less than 1% of all comments posted to the sample of politicians and indicates that although previous studies have shown how links can support communication in SNSs, their frequency in the Korean online political environment remains rare Comments deleted over the period of the study may omit the full extent of negative sentiment towards politicians The practice of deleting content in Korea has been found to be less constrained by social norms than found in Western SNSs, such as Facebook Legal mechanisms also exist in Korea to encourage the removal of negative content during election periods
Conclusion Links are almost solely targeted to Korean domestic services,  and   the few that do point to overseas sites are usually related in some  way to domestic issues in Korea Males are marginally more likely to comment on Cyworld              Minihompies using links than females, and those Minihompies        managed by ruling politicians were found to be of greater               prominence than those of the opposition parties Message Amplification and Network Building were found to be the            dominant purpose for submitting links within user-generated           comments.  Using two forms of machine-based learning algorithms, sentiment analysis   and collocation of significant phrases, revealed primarily        negative sentiment towards President Lee and his role in the           reintroduction of American beef  imports.  Issues surrounding the    suicide of ex-President Roh suggested anger towards those who    were seen to be harassing him prior to his death
Acknowledgement 	Research for this paper has been supported by the World Class University (WCU) program through the National Research Foundation of Korea, which is funded by the Ministry of Education, Science and Technology (No. 515-82-06574).
Thank you

Mais conteúdo relacionado

Mais procurados

INFO4990_Hossain
INFO4990_HossainINFO4990_Hossain
INFO4990_Hossainwebuploader
 
18th home blog_twitter_English (12OCT2010)
18th home blog_twitter_English (12OCT2010) 18th home blog_twitter_English (12OCT2010)
18th home blog_twitter_English (12OCT2010) Han Woo PARK
 
An evolutionary approach to comparative analysis of detecting Bangla abusive ...
An evolutionary approach to comparative analysis of detecting Bangla abusive ...An evolutionary approach to comparative analysis of detecting Bangla abusive ...
An evolutionary approach to comparative analysis of detecting Bangla abusive ...journalBEEI
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collectiondnac
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measuresdnac
 
Liao and petzold opensym berlin wikipedia geolinguistic normalization
Liao and petzold opensym berlin wikipedia geolinguistic normalizationLiao and petzold opensym berlin wikipedia geolinguistic normalization
Liao and petzold opensym berlin wikipedia geolinguistic normalizationHanteng Liao
 
Mapping big data science
Mapping big data scienceMapping big data science
Mapping big data scienceHan Woo PARK
 
09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memorydnac
 
11 Network Experiments and Interventions
11 Network Experiments and Interventions11 Network Experiments and Interventions
11 Network Experiments and Interventionsdnac
 
TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’
TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’
TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’TCI Network
 
Social listening: how to do it and how to use (SNA Perspective)
Social listening: how to do it and how to use (SNA Perspective)Social listening: how to do it and how to use (SNA Perspective)
Social listening: how to do it and how to use (SNA Perspective)Toronto Metropolitan University
 
Power Of Online Conversation 2009.05.01
Power Of Online Conversation 2009.05.01Power Of Online Conversation 2009.05.01
Power Of Online Conversation 2009.05.01nextgenweb
 
Data collection thru social media
Data collection thru social mediaData collection thru social media
Data collection thru social mediai4box Anon
 
IJSRED-V2I2P09
IJSRED-V2I2P09IJSRED-V2I2P09
IJSRED-V2I2P09IJSRED
 
12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...dnac
 
“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)
“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)
“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)WeGov project
 

Mais procurados (18)

INFO4990_Hossain
INFO4990_HossainINFO4990_Hossain
INFO4990_Hossain
 
18th home blog_twitter_English (12OCT2010)
18th home blog_twitter_English (12OCT2010) 18th home blog_twitter_English (12OCT2010)
18th home blog_twitter_English (12OCT2010)
 
An evolutionary approach to comparative analysis of detecting Bangla abusive ...
An evolutionary approach to comparative analysis of detecting Bangla abusive ...An evolutionary approach to comparative analysis of detecting Bangla abusive ...
An evolutionary approach to comparative analysis of detecting Bangla abusive ...
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 
Liao and petzold opensym berlin wikipedia geolinguistic normalization
Liao and petzold opensym berlin wikipedia geolinguistic normalizationLiao and petzold opensym berlin wikipedia geolinguistic normalization
Liao and petzold opensym berlin wikipedia geolinguistic normalization
 
Mapping big data science
Mapping big data scienceMapping big data science
Mapping big data science
 
09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory09 Respondent Driven Sampling and Network Sampling with Memory
09 Respondent Driven Sampling and Network Sampling with Memory
 
11 Network Experiments and Interventions
11 Network Experiments and Interventions11 Network Experiments and Interventions
11 Network Experiments and Interventions
 
presentation29
presentation29presentation29
presentation29
 
TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’
TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’
TCI 2015 What Do Links Mean in Innovation Clusters? ‘Relational Dialectics’
 
Social listening: how to do it and how to use (SNA Perspective)
Social listening: how to do it and how to use (SNA Perspective)Social listening: how to do it and how to use (SNA Perspective)
Social listening: how to do it and how to use (SNA Perspective)
 
Power Of Online Conversation 2009.05.01
Power Of Online Conversation 2009.05.01Power Of Online Conversation 2009.05.01
Power Of Online Conversation 2009.05.01
 
Data collection thru social media
Data collection thru social mediaData collection thru social media
Data collection thru social media
 
04 Ego Network Analysis
04 Ego Network Analysis04 Ego Network Analysis
04 Ego Network Analysis
 
IJSRED-V2I2P09
IJSRED-V2I2P09IJSRED-V2I2P09
IJSRED-V2I2P09
 
12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...12 Network Experiments and Interventions: Studying Information Diffusion and ...
12 Network Experiments and Interventions: Studying Information Diffusion and ...
 
“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)
“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)
“What is WeGov” - User Guide for the Phase 2 Evaluation (in English)
 

Destaque

Image text duke_political_conference(25_may2010)presentation
Image text duke_political_conference(25_may2010)presentationImage text duke_political_conference(25_may2010)presentation
Image text duke_political_conference(25_may2010)presentationHan Woo PARK
 
웹보메트릭스와 계량정보학11 1
웹보메트릭스와 계량정보학11 1웹보메트릭스와 계량정보학11 1
웹보메트릭스와 계량정보학11 1Han Woo PARK
 
웹보메트릭스와 계량정보학11 2
웹보메트릭스와 계량정보학11 2웹보메트릭스와 계량정보학11 2
웹보메트릭스와 계량정보학11 2Han Woo PARK
 
대구경북언론사(21 march2013)
대구경북언론사(21 march2013)대구경북언론사(21 march2013)
대구경북언론사(21 march2013)Han Woo PARK
 
웹보메트릭스와 계량정보학 강의소개
웹보메트릭스와 계량정보학 강의소개웹보메트릭스와 계량정보학 강의소개
웹보메트릭스와 계량정보학 강의소개Han Woo PARK
 

Destaque (7)

Image text duke_political_conference(25_may2010)presentation
Image text duke_political_conference(25_may2010)presentationImage text duke_political_conference(25_may2010)presentation
Image text duke_political_conference(25_may2010)presentation
 
웹보메트릭스와 계량정보학11 1
웹보메트릭스와 계량정보학11 1웹보메트릭스와 계량정보학11 1
웹보메트릭스와 계량정보학11 1
 
웹보메트릭스와 계량정보학11 2
웹보메트릭스와 계량정보학11 2웹보메트릭스와 계량정보학11 2
웹보메트릭스와 계량정보학11 2
 
Jiwon disc
Jiwon discJiwon disc
Jiwon disc
 
대구경북언론사(21 march2013)
대구경북언론사(21 march2013)대구경북언론사(21 march2013)
대구경북언론사(21 march2013)
 
웹보메트릭스와 계량정보학 강의소개
웹보메트릭스와 계량정보학 강의소개웹보메트릭스와 계량정보학 강의소개
웹보메트릭스와 계량정보학 강의소개
 
Толерантность
ТолерантностьТолерантность
Толерантность
 

Semelhante a Target link presentation

How to social scientists use link data (11 june2010)
How to social scientists use link data (11 june2010)How to social scientists use link data (11 june2010)
How to social scientists use link data (11 june2010)Han Woo PARK
 
Triple helix 2012 president
Triple helix 2012 presidentTriple helix 2012 president
Triple helix 2012 presidentHan Woo PARK
 
Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)
Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)
Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)SangMe Nam
 
MLA Members' Social Software Use and Beliefs
MLA Members' Social Software Use and BeliefsMLA Members' Social Software Use and Beliefs
MLA Members' Social Software Use and BeliefsMelissa Rethlefsen
 
Final Poster for Engineering Showcase
Final Poster for Engineering ShowcaseFinal Poster for Engineering Showcase
Final Poster for Engineering ShowcaseTucker Truesdale
 
Neso nuffic presentation in Seoul
Neso nuffic presentation in SeoulNeso nuffic presentation in Seoul
Neso nuffic presentation in SeoulMaurice Vergeer
 
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1crore projects
 
The ‘BBK scandal’ in the 2007 presidential election of South Korea
The ‘BBK scandal’ in the 2007 presidential election of South KoreaThe ‘BBK scandal’ in the 2007 presidential election of South Korea
The ‘BBK scandal’ in the 2007 presidential election of South KoreaHan Woo PARK
 
A large-scale sentiment analysis using political tweets
A large-scale sentiment analysis using political tweetsA large-scale sentiment analysis using political tweets
A large-scale sentiment analysis using political tweetsIJECEIAES
 
Investigating Internet-based Korean politics using e-research tools Kaist Cu...
Investigating Internet-based Korean politics using e-research tools Kaist Cu...Investigating Internet-based Korean politics using e-research tools Kaist Cu...
Investigating Internet-based Korean politics using e-research tools Kaist Cu...Han Woo PARK
 
Mapping online social networks among Korean politicians: Homepage, blog, and ...
Mapping online social networks among Korean politicians: Homepage, blog, and ...Mapping online social networks among Korean politicians: Homepage, blog, and ...
Mapping online social networks among Korean politicians: Homepage, blog, and ...Han Woo PARK
 
An Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep LearningAn Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep LearningIRJET Journal
 
How to utilize ‘big data’ on SNS for academic purpose?
How to utilize ‘big data’ on SNS  for academic purpose?How to utilize ‘big data’ on SNS  for academic purpose?
How to utilize ‘big data’ on SNS for academic purpose?Han Woo PARK
 
Trust, online social_networks,_communication_nia_conference_(27_oct2010)final
Trust, online social_networks,_communication_nia_conference_(27_oct2010)finalTrust, online social_networks,_communication_nia_conference_(27_oct2010)final
Trust, online social_networks,_communication_nia_conference_(27_oct2010)finalHan Woo PARK
 
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYFRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYcscpconf
 

Semelhante a Target link presentation (20)

Networked politics(31may2010)
Networked politics(31may2010)Networked politics(31may2010)
Networked politics(31may2010)
 
Networked politics(3june2010)
Networked politics(3june2010)Networked politics(3june2010)
Networked politics(3june2010)
 
How to social scientists use link data (11 june2010)
How to social scientists use link data (11 june2010)How to social scientists use link data (11 june2010)
How to social scientists use link data (11 june2010)
 
Triple helix 2012 president
Triple helix 2012 presidentTriple helix 2012 president
Triple helix 2012 president
 
Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)
Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)
Cyworld Jeju 2009 Conference(10 Aug2009)No2(2)
 
Studying Social Science Using E Tools
Studying Social Science Using E ToolsStudying Social Science Using E Tools
Studying Social Science Using E Tools
 
MLA Members' Social Software Use and Beliefs
MLA Members' Social Software Use and BeliefsMLA Members' Social Software Use and Beliefs
MLA Members' Social Software Use and Beliefs
 
Final Poster for Engineering Showcase
Final Poster for Engineering ShowcaseFinal Poster for Engineering Showcase
Final Poster for Engineering Showcase
 
Neso nuffic presentation in Seoul
Neso nuffic presentation in SeoulNeso nuffic presentation in Seoul
Neso nuffic presentation in Seoul
 
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
 
The ‘BBK scandal’ in the 2007 presidential election of South Korea
The ‘BBK scandal’ in the 2007 presidential election of South KoreaThe ‘BBK scandal’ in the 2007 presidential election of South Korea
The ‘BBK scandal’ in the 2007 presidential election of South Korea
 
A large-scale sentiment analysis using political tweets
A large-scale sentiment analysis using political tweetsA large-scale sentiment analysis using political tweets
A large-scale sentiment analysis using political tweets
 
Investigating Internet-based Korean politics using e-research tools Kaist Cu...
Investigating Internet-based Korean politics using e-research tools Kaist Cu...Investigating Internet-based Korean politics using e-research tools Kaist Cu...
Investigating Internet-based Korean politics using e-research tools Kaist Cu...
 
Social media in the public sector south korea twitter
Social media in the public sector south korea twitterSocial media in the public sector south korea twitter
Social media in the public sector south korea twitter
 
Mapping online social networks among Korean politicians: Homepage, blog, and ...
Mapping online social networks among Korean politicians: Homepage, blog, and ...Mapping online social networks among Korean politicians: Homepage, blog, and ...
Mapping online social networks among Korean politicians: Homepage, blog, and ...
 
Overview Of Wcu Research (16 Dec2009)Sj
Overview Of Wcu Research (16 Dec2009)SjOverview Of Wcu Research (16 Dec2009)Sj
Overview Of Wcu Research (16 Dec2009)Sj
 
An Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep LearningAn Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
An Analytical Survey on Hate Speech Recognition through NLP and Deep Learning
 
How to utilize ‘big data’ on SNS for academic purpose?
How to utilize ‘big data’ on SNS  for academic purpose?How to utilize ‘big data’ on SNS  for academic purpose?
How to utilize ‘big data’ on SNS for academic purpose?
 
Trust, online social_networks,_communication_nia_conference_(27_oct2010)final
Trust, online social_networks,_communication_nia_conference_(27_oct2010)finalTrust, online social_networks,_communication_nia_conference_(27_oct2010)final
Trust, online social_networks,_communication_nia_conference_(27_oct2010)final
 
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYFRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
 

Mais de Han Woo PARK

소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석
소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석
소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석Han Woo PARK
 
페이스북 선도자 탄핵촛불에서 캠폐인 이동경로
페이스북 선도자 탄핵촛불에서 캠폐인 이동경로페이스북 선도자 탄핵촛불에서 캠폐인 이동경로
페이스북 선도자 탄핵촛불에서 캠폐인 이동경로Han Woo PARK
 
WATEF 2018 신년 세미나(수정)
WATEF 2018 신년 세미나(수정)WATEF 2018 신년 세미나(수정)
WATEF 2018 신년 세미나(수정)Han Woo PARK
 
세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나
세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나
세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나Han Woo PARK
 
Disc 2015 보도자료 (휴대폰번호 삭제-수정)
Disc 2015 보도자료 (휴대폰번호 삭제-수정)Disc 2015 보도자료 (휴대폰번호 삭제-수정)
Disc 2015 보도자료 (휴대폰번호 삭제-수정)Han Woo PARK
 
Another Interdisciplinary Transformation: Beyond an Area-studies Journal
Another Interdisciplinary Transformation: Beyond an Area-studies JournalAnother Interdisciplinary Transformation: Beyond an Area-studies Journal
Another Interdisciplinary Transformation: Beyond an Area-studies JournalHan Woo PARK
 
4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등
4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등
4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등Han Woo PARK
 
KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집
KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집
KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집Han Woo PARK
 
박한우 교수 프로파일 (31 oct2017)
박한우 교수 프로파일 (31 oct2017)박한우 교수 프로파일 (31 oct2017)
박한우 교수 프로파일 (31 oct2017)Han Woo PARK
 
Global mapping of artificial intelligence in Google and Google Scholar
Global mapping of artificial intelligence in Google and Google ScholarGlobal mapping of artificial intelligence in Google and Google Scholar
Global mapping of artificial intelligence in Google and Google ScholarHan Woo PARK
 
박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용
박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용
박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용Han Woo PARK
 
향기담은 하루찻집
향기담은 하루찻집향기담은 하루찻집
향기담은 하루찻집Han Woo PARK
 
Twitter network map of #ACPC2017 1st day using NodeXL
Twitter network map of #ACPC2017 1st day using NodeXLTwitter network map of #ACPC2017 1st day using NodeXL
Twitter network map of #ACPC2017 1st day using NodeXLHan Woo PARK
 
페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회
페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회
페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회Han Woo PARK
 
Facebook bigdata to understand regime change and migration patterns during ca...
Facebook bigdata to understand regime change and migration patterns during ca...Facebook bigdata to understand regime change and migration patterns during ca...
Facebook bigdata to understand regime change and migration patterns during ca...Han Woo PARK
 
세계산학관협력총회 Watef 패널을 공지합니다
세계산학관협력총회 Watef 패널을 공지합니다세계산학관협력총회 Watef 패널을 공지합니다
세계산학관협력총회 Watef 패널을 공지합니다Han Woo PARK
 
2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우
2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우
2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우Han Woo PARK
 
2017년 인포그래픽스 과제모음
2017년 인포그래픽스 과제모음2017년 인포그래픽스 과제모음
2017년 인포그래픽스 과제모음Han Woo PARK
 
SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로
SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로
SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로Han Woo PARK
 
2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상
2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상
2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상Han Woo PARK
 

Mais de Han Woo PARK (20)

소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석
소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석
소셜 빅데이터를 활용한_페이스북_이용자들의_반응과_관계_분석
 
페이스북 선도자 탄핵촛불에서 캠폐인 이동경로
페이스북 선도자 탄핵촛불에서 캠폐인 이동경로페이스북 선도자 탄핵촛불에서 캠폐인 이동경로
페이스북 선도자 탄핵촛불에서 캠폐인 이동경로
 
WATEF 2018 신년 세미나(수정)
WATEF 2018 신년 세미나(수정)WATEF 2018 신년 세미나(수정)
WATEF 2018 신년 세미나(수정)
 
세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나
세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나
세계트리플헬릭스미래전략학회 WATEF 2018 신년 세미나
 
Disc 2015 보도자료 (휴대폰번호 삭제-수정)
Disc 2015 보도자료 (휴대폰번호 삭제-수정)Disc 2015 보도자료 (휴대폰번호 삭제-수정)
Disc 2015 보도자료 (휴대폰번호 삭제-수정)
 
Another Interdisciplinary Transformation: Beyond an Area-studies Journal
Another Interdisciplinary Transformation: Beyond an Area-studies JournalAnother Interdisciplinary Transformation: Beyond an Area-studies Journal
Another Interdisciplinary Transformation: Beyond an Area-studies Journal
 
4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등
4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등
4차산업혁명 린든달러 비트코인 알트코인 암호화폐 가상화폐 등
 
KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집
KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집
KISTI-WATEF-BK21Plus-사이버감성연구소 2017 동계세미나 자료집
 
박한우 교수 프로파일 (31 oct2017)
박한우 교수 프로파일 (31 oct2017)박한우 교수 프로파일 (31 oct2017)
박한우 교수 프로파일 (31 oct2017)
 
Global mapping of artificial intelligence in Google and Google Scholar
Global mapping of artificial intelligence in Google and Google ScholarGlobal mapping of artificial intelligence in Google and Google Scholar
Global mapping of artificial intelligence in Google and Google Scholar
 
박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용
박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용
박한우 영어 이력서 Curriculum vitae 경희대 행사 제출용
 
향기담은 하루찻집
향기담은 하루찻집향기담은 하루찻집
향기담은 하루찻집
 
Twitter network map of #ACPC2017 1st day using NodeXL
Twitter network map of #ACPC2017 1st day using NodeXLTwitter network map of #ACPC2017 1st day using NodeXL
Twitter network map of #ACPC2017 1st day using NodeXL
 
페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회
페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회
페이스북 댓글을 통해 살펴본 대구·경북(TK) 촛불집회
 
Facebook bigdata to understand regime change and migration patterns during ca...
Facebook bigdata to understand regime change and migration patterns during ca...Facebook bigdata to understand regime change and migration patterns during ca...
Facebook bigdata to understand regime change and migration patterns during ca...
 
세계산학관협력총회 Watef 패널을 공지합니다
세계산학관협력총회 Watef 패널을 공지합니다세계산학관협력총회 Watef 패널을 공지합니다
세계산학관협력총회 Watef 패널을 공지합니다
 
2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우
2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우
2017 대통령선거 후보수락 유튜브 후보수락 동영상 김찬우 박효찬 박한우
 
2017년 인포그래픽스 과제모음
2017년 인포그래픽스 과제모음2017년 인포그래픽스 과제모음
2017년 인포그래픽스 과제모음
 
SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로
SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로
SNS 매개 학습공동체의 학습네트워크 탐색 : 페이스북 그룹을 중심으로
 
2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상
2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상
2016년 촛불집회의 페이스북 댓글 데이터를 통해 본 하이브리드 미디어 현상
 

Target link presentation

  • 1. What is the link and text doing here: A Case Study of Cyworld Minihompies in Korea Steven Sams and Han Woo Park
  • 2. Background This study analyses user-generated comments posted to Korean politicians on SNS Cyworld that contain a URL The study examines the type of service being linked to through the URL and determines the frequency of services A developed program captures all comments given to a selected set of politicians within a predefined timeframe The text component of messages is analyzed using two separate machine-learning mechanisms
  • 3. Types of Hyperlinks Five social functions that hyperlinks can be said to perform Information Provision Network Strengthening Identity Building Audience Sharing Message Amplification Ackland et al. (2010)
  • 4. Online Korean Political Sphere As in other countries, Korean politicians are increasingly turning to social networks as a means to engage with their electorate In 2007 Cyworld commanded a penetration rate of one third of the total population of South Korea, and since then all indications are that this proportion has increased.
  • 5. Sample One hundred and thirty Korean National Assembly Members’ Cyworld Minihomies. The date parameters of the study were April 2008 – June 2009 One hundred and fifty three thousand six hundred and two comments were collected for period chosen for the study. One thousand two hundred and seventy six comments contained links
  • 6. Data Collection Method A program was developed that performs HTTP call to request one page of comments from the politician’s visitor board The content and date are isolated and held in temporary storage. The process repeats until the target date parameters have been met.
  • 7. Data Analysis Method: Links The links are checked to determine the number of unique URLs and corresponding number of unique domains. These links / domains are then manually categorised into website type, such as portals, media, parties, homepages of politicians, petition sites, online fan clubs, and NGOs) Location of service found using network query tool to determine the proportion of domestic and international websites
  • 8. Data Analysis Method: Text To analyse a large body of text, Natural Language Processing (NLP) is one approach to categorisation that can mitigate the problem of obtaining accurate results that is unfeasible to perform manually A rudimentary Java class was developed that wrapped a small subset of the methods provided in the LingPipe API so that they could be called on the extracted text comments. The developed Java class enabled two forms of analysis: Sentiment Analysis and Collocation
  • 9. Sentiment Analysis A polarity analyser was developed that is able to locate significant word combinations and, using the developed corpus model as a training dataset, determine if the combination is generally positive or negative An accessible corpus of positive and negative sentiment composed in Korean has yet to be realized. A sample body of 2000 Korean text statements were coded into objective, subjective - positive and subjective - negative categories
  • 10. Collocation Collocation analysis can determine which tokens are more frequently found together than would normally be expected. Collocation can identify proper nouns in this way (such as the names or persons, places, or events) that would be lost if the frequency of each token were analysed in isolation.
  • 11. Results - Links 153,602 comments were collected for period chosen for the study 1,276 comments contained hyperlinks Total link count was 1,920 as it was common to have more than one hyperlink contained within an individual posting 762 were unique full URLs and 259 were unique domains 1,849 URLs encountered in the sample were found to belong to services based in Korea and 71 from international service Performing message amplification and network building were prominent causes of link posting
  • 12. Table 1: LexiURL Unique / Full hosts Based on the top 10 domains (24.5%) by occurrence out of 259
  • 13. Table 2: LexiURL Unique / Full URLs
  • 14. Table 3: Total links to each domain (Korea) Based on 1,078 (58.3%) of 1,849 links to Korean services
  • 15. Table 4: Total links to each domain (Overseas) Based on 51 (71.8%) of 71 links to overseas services
  • 16. Table 5: poster-gender and politician background
  • 17. Table 6: Comments categorized by link type from the six groups of gender and political affiliation Table 6: Comments categorized by link type from the six groups of gender and political affiliation Based on 206 comments agreed on by both coders from the initial set of 300
  • 18. Results - Text May and June 2008 were found to have high numbers of comments containing links that showed negative sentiment, and this date corresponds with the period of the candlelight protest May 2009 also shows large numbers of comments containing hyperlinks that indicate negative sentiment, coinciding with the suicide of ex-president Roh Moo-Hyun The name of Korean President Lee Myung-bak was found to occur two hundred and twenty nine times Terms pertaining to the candlelight protests, such as Mad Cow disease, beef, American goods, and candlelight protest occurred frequently Gini coefficient and a less formal term describing a similar measurement of wealth occurred frequently
  • 19. Figure 1. Positive and negative sentiment from comments containing links
  • 20. Confidence Levels To determine the effectiveness of the classification approach, 10% of training data was removed from the training set and used to evaluate the developed model. This approach allows testing the classification based on known human-classified data. The Average Conditional Probability score provides a basis for determining the ability of the classifier to correctly identify positive and negative sentiment. Based on the training set used, the Average Conditional Probability was found to be 87%.
  • 21. Limitations Less than 1% of all comments posted to the sample of politicians and indicates that although previous studies have shown how links can support communication in SNSs, their frequency in the Korean online political environment remains rare Comments deleted over the period of the study may omit the full extent of negative sentiment towards politicians The practice of deleting content in Korea has been found to be less constrained by social norms than found in Western SNSs, such as Facebook Legal mechanisms also exist in Korea to encourage the removal of negative content during election periods
  • 22. Conclusion Links are almost solely targeted to Korean domestic services, and the few that do point to overseas sites are usually related in some way to domestic issues in Korea Males are marginally more likely to comment on Cyworld Minihompies using links than females, and those Minihompies managed by ruling politicians were found to be of greater prominence than those of the opposition parties Message Amplification and Network Building were found to be the dominant purpose for submitting links within user-generated comments. Using two forms of machine-based learning algorithms, sentiment analysis and collocation of significant phrases, revealed primarily negative sentiment towards President Lee and his role in the reintroduction of American beef imports. Issues surrounding the suicide of ex-President Roh suggested anger towards those who were seen to be harassing him prior to his death
  • 23. Acknowledgement Research for this paper has been supported by the World Class University (WCU) program through the National Research Foundation of Korea, which is funded by the Ministry of Education, Science and Technology (No. 515-82-06574).

Notas do Editor

  1. this approach is largely descriptive and does not consider the accompanying text
  2. LingPipe is a comprehensive NLP toolkit and the methods used in the developed Java class enabled three forms of analysis: Sentiment Analysis, Collocation, and Language Identification
  3. LingPipe is a comprehensive NLP toolkit and the methods used in the developed Java class enabled three forms of analysis: Sentiment Analysis, Collocation, and Language Identification
  4. as wall-cleaning (Raynes–Goldie, 2010), occurs when the owner of a profile page periodically or reactively evaluates comments and deletes those that cast the owner in an unfavorable light. Howver, whilst the occurrence of this process on facebook is not in question, the degree to which this happens has been challenged. Walther, et al. (2008) explain that deleting content regardless of whether it is deemed to be negative or unflattering is avoided as this contravenes the spirit of open content. Smith and Kidder (2010) extend this concept to other forms of user generated content and explain that social norms deter users from deleting content once it is in the community.  The practice of deleting content in Korea however appears to not be restrained by the same unwritten rules as that which govern Facebook. Yoo (2009) explains that content that submissions to user message boards are routinely deleted if the SNS page owner judges them to be unflattering or negative.  In addition to the practice of cultural deleting of content, there also exists a legal motivation to remove that which is deemed to be incorrect or negative. The extent to which this deletion practice occurs remains unclear, although legal frameworks exist in Korea and elsewhere to encourage the deletion of content by either the service provider or owner of the SNS account.
  5. For example, the linking of a petition to call upon the governing president to be impeached combined with the name of the president occurring frequently and the negative sentiment recorded does point to….