SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Mining and Comparing Engagement Dynamics
Across Multiple Social Media Platforms
Matthew Rowe
Lancaster University, UK
@halani
harith-alani
@halani
ACM Web Science Conference (WebSci) 2014, Bloomington, IND
http://people.kmi.open.ac.uk/harith/
Harith Alani
Knowledge Media institute, UK
Engagement in Social Media
Moving on …
§  How can we move on
from these (micro)
studies?
§  Are results consistent
across datasets, and
platforms?
§  One way forward is:
§  Multiple platforms
§  Multiple topics
Publications on "social media analysis”
0
100
200
300
400
500
600
2006 2007 2008 2009 2010 2011 2012 2013
Publications on "social media analysis"
Papers studying single/multiple
social media platforms
Papers studying single/multiple
social media platforms
Papers studying single/multiple
social media platforms
Papers studying single/multiple
social media platforms
Apples and Oranges
§  We mix and compare
different features,
datasets, and platforms
§  Aim is to figure out their
similarities and
differences
Contributions
§  Examine replying dynamics as a modality of engagement
§  Define a framework of engagement analysis that fits multiple social platforms
§  Show the varying features at play in different platforms, and where the
similarities and differences are
§  Contrast the role of different features on engagement likelihood across five
social media platforms
§  Compare results to relevant literature on same or different platforms and
engagement indicators
7 datasets from 5 platforms
Platform Posts Users Seeds Non-seeds Replies
Boards.ie 6,120,008 65,528 398,508 81,273 5,640,227
Twitter Random 1,468,766 753,722 144,709 930,262 390,795
Twitter (Haiti
Earthquake)
65,022 45,238 1,835 60,686 2,501
Twitter (Obama
State of Union
Address)
81,458 67,417 11,298 56,135 14,025
SAP 427,221 32,926 87,542 7,276 332,403
Server Fault 234,790 33,285 65,515 6,447 162,828
Facebook 118,432 4,745 15,296 8,123 95,013
Seed posts are those that receive a reply
Non-seed posts are those with no replies
Data Balancing
Platform Seeds Non-seeds Instance Count
Boards.ie 398,508 81,273 162,546
Twitter Random 144,709 930,262 289,418
Twitter (Haiti
Earthquake)
1,835 60,686 3,670
Twitter (Obama State
of Union Address)
11,298 56,135 22,596
SAP 87,542 7,276 14,552
Server Fault 65,515 6,447 12,894
Facebook 15,296 8,123 16,246
Total 521,922
For each dataset, an equal number of seeds and non-seed
posts are used in the analysis.
Features
§  Post Length: number of words in
the post
§  Complexity: Measures the
cumulative entropy of terms in a
post
§  Readability: Gunning Fog index,
gauges how hard the post is to
parse by readers, and LIX
Readability metric to determine
complexity of words based on
number of letters
§  Referral Count: number of URLs
in the post
§  Informativeness: TF-IDF of the
post
§  Polarity: average sentiment
polarity of the post (using
SentiWordnet)
§  In-degree: number of in-coming
social connections (explicit or implicit)
§  Out-degree: number of out-going
social connections (explicit or implicit)
§  Post Count: number of posts made in
previous 6 months
§  User Age: length of membership in
community in days
§  Post Rate: number of posts by the
user per day
Social Features
Content Features
Classification of Posts
Seed Posts
Non-Seed
Posts
§  Binary classification model
§  Trained with social, content,
and combined features
§  80/20 training/testing
§  Compare results across
platforms, to see how a change
in each feature is associated
with likelihood of engagement
§  Compare engagement
dynamics from our platforms
against the literature
Classification Results
Feature P R F1
Social 0.592 0.591 0.591
Content 0.664 0.660 0.658
Social+Content 0.670 0.666 0.665
(Random) (Haiti Earthquake)
(Obama’s State Union Address)
P R F1
0.561 0.561 0.560
0.612 0.612 0.611
0.628 0.628 0.628
P R F1
0.968 0.966 0.966
0.752 0.747 0.747
0.974 0.973 0.973
Feature P R F1
Social 0.542 0.540 0.539
Content 0.650 0.642 0.639
Social+Content 0.656 0.649 0.646
P R F1
0.650 0.631 0.628
0.575 0.541 0.521
0.652 0.632 0.629
P R F1
0.528 0.380 0.319
0.626 0.380 0.275
0.568 0.407 0.359
Feature P R F1
Social 0.635 0.632 0.632
Content 0.641 0.641 0.641
Social+Content 0.660 0.660 0.660
§  Performance of the logistic regression
classifier trained over different feature
sets and applied to the test set.
Effect of features on engagement
Boards.ie
β
−2
−1
0
1
2
Twitter Random
β
−0.5
0.0
0.5
1.0
Twitter Haiti
−6e+16
−4e+16
−2e+16
0e+00
2e+16
4e+16
6e+16
Twitter Union
β
−0.8
−0.6
−0.4
−0.2
0.0
0.2
Server Fault
β
−1.0
−0.5
0.0
0.5
1.0
1.5
2.0
SAP
β
−10
−5
0
5
Facebook
β
−0.1
0.0
0.1
0.2
0.3
0.4
0.5
In−degree
Out−degree
Post Count
Age
Post Rate
Post Length
Referrals Count
Polarity
Complexity
Readability
Readability Fog
Informativeness
Logistic regression coefficients for each platform's features
Significance of regression coefficients
Boards.ie
p
0.0
0.2
0.4
0.6
0.8
1.0
Titter Random
p
0.0
0.2
0.4
0.6
0.8
1.0
Titter Haiti
p
0.0
0.2
0.4
0.6
0.8
1.0
Titter Union
p
0.0
0.2
0.4
0.6
0.8
1.0
Server Fault
p
0.0
0.2
0.4
0.6
0.8
1.0
SAP
p
0.0
0.2
0.4
0.6
0.8
1.0
Facebook
p
0.0
0.2
0.4
0.6
0.8
1.0
In−degree
Out−degree
Post Count
Age
Post Rate
Post Length
Referrals Count
Polarity
Complexity
Readability
Readability Fog
Informativeness
Comparison
to literature
§  How performance
of our feature
compare to other
studies on different
datasets and
platforms?
Positive impact
Negative impact
Mismatch
Match
Positive impact
Negative impact
Mismatch
Match
Summary
§  We tested the consistency and applicability of engagement
patterns across multiple platforms
§  Used 12 social/content features that map to 5 platforms
§  Studied the impact of those features on engagement across these
platforms
§  Compared the impact of our features against generally relevant
studies in the literature
§  Showed that same features could play a different roles in different
platforms, or different non-random datasets
So what’s Next!
§  LOTS!
§  Apply same study to more datasets from the same platforms, and from other
platforms
§  Expand from replies to other engagement indicators
§  Improve classification of seeds/non-seeds with more common features
§  Further study on impact of topics and non-randomness on engagement
dynamics
§  Take user type into account – e.g. posts from new agencies are more likely to
be tweeted than replied to
Questions!
1.  Why those specific datasets and platforms?
2.  What about platform-specific features?
3.  Could we ever get a full understanding of these dynamics
across all social platforms?
4.  Could these findings be used to increase engagement?
5.  Who’s right/wrong when the same feature appears to have
conflicting impact on the same platform?
6.  Couldn’t be the case that the same feature is used
differently in different platforms?
7.  How could we study event-specific engagement dynamics?
@halani
harith-alani
@halani
http://people.kmi.open.ac.uk/harith/
ACM Web Science Conference (WebSci) 2014, middle of nowhere!

Mais conteúdo relacionado

Mais procurados

CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)
Lora Aroyo
 
Data Cleaning for social media knowledge extraction
Data Cleaning for social media knowledge extractionData Cleaning for social media knowledge extraction
Data Cleaning for social media knowledge extraction
Marco Brambilla
 

Mais procurados (20)

Social Network Analysis (SNA) and its implications for knowledge discovery in...
Social Network Analysis (SNA) and its implications for knowledge discovery in...Social Network Analysis (SNA) and its implications for knowledge discovery in...
Social Network Analysis (SNA) and its implications for knowledge discovery in...
 
2015 pdf-marc smith-node xl-social media sna
2015 pdf-marc smith-node xl-social media sna2015 pdf-marc smith-node xl-social media sna
2015 pdf-marc smith-node xl-social media sna
 
2010 sept - mobile web africa - marc smith - says who - mapping social medi...
2010   sept - mobile web africa - marc smith - says who - mapping social medi...2010   sept - mobile web africa - marc smith - says who - mapping social medi...
2010 sept - mobile web africa - marc smith - says who - mapping social medi...
 
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
 
20151001 charles university prague - marc smith - node xl-picturing political...
20151001 charles university prague - marc smith - node xl-picturing political...20151001 charles university prague - marc smith - node xl-picturing political...
20151001 charles university prague - marc smith - node xl-picturing political...
 
CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)
 
Data Cleaning for social media knowledge extraction
Data Cleaning for social media knowledge extractionData Cleaning for social media knowledge extraction
Data Cleaning for social media knowledge extraction
 
2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis
 
Think Link: Network Insights with No Programming Skills
Think Link: Network Insights with No Programming SkillsThink Link: Network Insights with No Programming Skills
Think Link: Network Insights with No Programming Skills
 
2014 TheNextWeb-Mapping connections with NodeXL
2014 TheNextWeb-Mapping connections with NodeXL2014 TheNextWeb-Mapping connections with NodeXL
2014 TheNextWeb-Mapping connections with NodeXL
 
Ph.D. defense: semantic social network analysis
Ph.D. defense: semantic social network analysisPh.D. defense: semantic social network analysis
Ph.D. defense: semantic social network analysis
 
2016 SocialMedia.Org Marc Smith-NodeXL-Social Media SNA
2016 SocialMedia.Org Marc Smith-NodeXL-Social Media SNA2016 SocialMedia.Org Marc Smith-NodeXL-Social Media SNA
2016 SocialMedia.Org Marc Smith-NodeXL-Social Media SNA
 
Visualizing Big Data - Social Network Analysis
Visualizing Big Data - Social Network AnalysisVisualizing Big Data - Social Network Analysis
Visualizing Big Data - Social Network Analysis
 
Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Social Media Mining and Analytics
 
20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network Analysis
 
20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
Understanding Public Sentiment: Conducting a Related-Tags Content Network Ext...
Understanding Public Sentiment: Conducting a Related-Tags Content Network Ext...Understanding Public Sentiment: Conducting a Related-Tags Content Network Ext...
Understanding Public Sentiment: Conducting a Related-Tags Content Network Ext...
 

Semelhante a Mining and Comparing Engagement Dynamics Across Multiple Social Media Platforms #websci14

Predicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic WebPredicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic Web
Matthew Rowe
 
ISWC2023-McGuinnessTWC16x9FinalShort.pdf
ISWC2023-McGuinnessTWC16x9FinalShort.pdfISWC2023-McGuinnessTWC16x9FinalShort.pdf
ISWC2023-McGuinnessTWC16x9FinalShort.pdf
Deborah McGuinness
 
Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...
Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...
Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...
IJCSIS Research Publications
 
TruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social NetworkTruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social Network
Lora Aroyo
 
2006 www - lento welser gu smith - ties thatblog
2006   www - lento welser gu smith - ties thatblog2006   www - lento welser gu smith - ties thatblog
2006 www - lento welser gu smith - ties thatblog
Marc Smith
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
Kristi Holmes
 

Semelhante a Mining and Comparing Engagement Dynamics Across Multiple Social Media Platforms #websci14 (20)

Predicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic WebPredicting Discussions on the Social Semantic Web
Predicting Discussions on the Social Semantic Web
 
Alamw15 VIVO
Alamw15 VIVOAlamw15 VIVO
Alamw15 VIVO
 
ISWC2023-McGuinnessTWC16x9FinalShort.pdf
ISWC2023-McGuinnessTWC16x9FinalShort.pdfISWC2023-McGuinnessTWC16x9FinalShort.pdf
ISWC2023-McGuinnessTWC16x9FinalShort.pdf
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Essay Revision Online.pdf
Essay Revision Online.pdfEssay Revision Online.pdf
Essay Revision Online.pdf
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Social Network Analysis based on MOOC's (Massive Open Online Classes)
Social Network Analysis based on MOOC's (Massive Open Online Classes)Social Network Analysis based on MOOC's (Massive Open Online Classes)
Social Network Analysis based on MOOC's (Massive Open Online Classes)
 
Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...
Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...
Birds of a Feather Flock Together? A Study of Developers’ Flocking and Migrat...
 
ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4
 
TruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social NetworkTruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social Network
 
2006 www - lento welser gu smith - ties thatblog
2006   www - lento welser gu smith - ties thatblog2006   www - lento welser gu smith - ties thatblog
2006 www - lento welser gu smith - ties thatblog
 
Show me the data! Actionable insight from open courses
Show me the data! Actionable insight from open coursesShow me the data! Actionable insight from open courses
Show me the data! Actionable insight from open courses
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHIBig Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
Big Data Analytics- USE CASES SOLVED USING NETWORK ANALYSIS TECHNIQUES IN GEPHI
 
Enabling Citizen-empowered Apps over Linked Data
Enabling Citizen-empowered Apps over Linked DataEnabling Citizen-empowered Apps over Linked Data
Enabling Citizen-empowered Apps over Linked Data
 
A network based model for predicting a hashtag break out in twitter
A network based model for predicting a hashtag break out in twitter A network based model for predicting a hashtag break out in twitter
A network based model for predicting a hashtag break out in twitter
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
 
Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?
 
How to Ask for Technical Help? Evidence-based Guidelines for Writing Question...
How to Ask for Technical Help? Evidence-based Guidelines for Writing Question...How to Ask for Technical Help? Evidence-based Guidelines for Writing Question...
How to Ask for Technical Help? Evidence-based Guidelines for Writing Question...
 
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
 

Mais de The Open University

Mais de The Open University (15)

Misinformation vs Fact-Checks: The Ongoing Battle
Misinformation vs Fact-Checks: The Ongoing BattleMisinformation vs Fact-Checks: The Ongoing Battle
Misinformation vs Fact-Checks: The Ongoing Battle
 
knod22-Alani.pdf
knod22-Alani.pdfknod22-Alani.pdf
knod22-Alani.pdf
 
Co-Creating Misinformation Resilient Societies
Co-Creating Misinformation Resilient Societies Co-Creating Misinformation Resilient Societies
Co-Creating Misinformation Resilient Societies
 
SASIG Workshop on “Improving the digital landscape for our children”
SASIG Workshop on “Improving the digital landscape for our children”SASIG Workshop on “Improving the digital landscape for our children”
SASIG Workshop on “Improving the digital landscape for our children”
 
COMRADES summary
COMRADES summaryCOMRADES summary
COMRADES summary
 
COMRADES project introduction
COMRADES project introduction COMRADES project introduction
COMRADES project introduction
 
Co-Inform (Co-Creating Misinformation Resilient Societies)
Co-Inform (Co-Creating Misinformation Resilient Societies)Co-Inform (Co-Creating Misinformation Resilient Societies)
Co-Inform (Co-Creating Misinformation Resilient Societies)
 
COMRADES ICT2018
COMRADES ICT2018COMRADES ICT2018
COMRADES ICT2018
 
Crisis Information Processing - with the power of A.I.
Crisis Information Processing - with the power of A.I.Crisis Information Processing - with the power of A.I.
Crisis Information Processing - with the power of A.I.
 
H2020 COMRADES project introduction
H2020 COMRADES project introduction H2020 COMRADES project introduction
H2020 COMRADES project introduction
 
Radicalisation detection on social media
Radicalisation detection on social mediaRadicalisation detection on social media
Radicalisation detection on social media
 
Analysing the dark side of Social Media
Analysing the dark side of Social MediaAnalysing the dark side of Social Media
Analysing the dark side of Social Media
 
Detecting online grooming and radicalisation
Detecting online grooming and radicalisationDetecting online grooming and radicalisation
Detecting online grooming and radicalisation
 
Detecting Grooming Behaviour on Social Media
Detecting Grooming Behaviour on Social MediaDetecting Grooming Behaviour on Social Media
Detecting Grooming Behaviour on Social Media
 
Semantics, Sensors, and the Social Web
Semantics, Sensors, and the Social WebSemantics, Sensors, and the Social Web
Semantics, Sensors, and the Social Web
 

Último

Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...
Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...
Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...
ZurliaSoop
 
Capstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdfCapstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdf
eliklein8
 
Sociocosmos empowers you to go trendy on social media with a few clicks..pdf
Sociocosmos empowers you to go trendy on social media with a few clicks..pdfSociocosmos empowers you to go trendy on social media with a few clicks..pdf
Sociocosmos empowers you to go trendy on social media with a few clicks..pdf
SocioCosmos
 
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
ZurliaSoop
 
💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG
💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG
💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG
Cara Menggugurkan Kandungan 087776558899
 
+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<
+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<
+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<
Health
 
Capstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdfCapstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdf
eliklein8
 
JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...
JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...
JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...
Cara Menggugurkan Kandungan 087776558899
 
Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...
Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...
Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...
Heena Escort Service
 

Último (17)

Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...
Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...
Jual Obat Aborsi Kudus ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cy...
 
Capstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdfCapstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdf
 
BVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIR
BVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIRBVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIR
BVG BEACH CLEANING PROJECTS- ORISSA , ANDAMAN, PORT BLAIR
 
Sociocosmos empowers you to go trendy on social media with a few clicks..pdf
Sociocosmos empowers you to go trendy on social media with a few clicks..pdfSociocosmos empowers you to go trendy on social media with a few clicks..pdf
Sociocosmos empowers you to go trendy on social media with a few clicks..pdf
 
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
Jual Obat Aborsi Palu ( Taiwan No.1 ) 085657271886 Obat Penggugur Kandungan C...
 
💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG
💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG
💊💊 OBAT PENGGUGUR KANDUNGAN SEMARANG 087776-558899 ABORSI KLINIK SEMARANG
 
Enhancing Consumer Trust Through Strategic Content Marketing
Enhancing Consumer Trust Through Strategic Content MarketingEnhancing Consumer Trust Through Strategic Content Marketing
Enhancing Consumer Trust Through Strategic Content Marketing
 
+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<
+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<
+971565801893>> ORIGINAL CYTOTEC ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI<<
 
Marketing Plan - Social Media. The Sparks Foundation
Marketing Plan -  Social Media. The Sparks FoundationMarketing Plan -  Social Media. The Sparks Foundation
Marketing Plan - Social Media. The Sparks Foundation
 
Capstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdfCapstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdf
 
The Butterfly Effect
The Butterfly EffectThe Butterfly Effect
The Butterfly Effect
 
Capstone slide deck on the TikTok revolution
Capstone slide deck on the TikTok revolutionCapstone slide deck on the TikTok revolution
Capstone slide deck on the TikTok revolution
 
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic HappensIgnite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
 
Content strategy : Content empire and cash in
Content strategy : Content empire and cash inContent strategy : Content empire and cash in
Content strategy : Content empire and cash in
 
SEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdf
SEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdfSEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdf
SEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdf
 
JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...
JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...
JUAL PILL CYTOTEC PALOPO SULAWESI 087776558899 OBAT PENGGUGUR KANDUNGAN PALOP...
 
Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...
Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...
Meet Incall & Out Escort Service in D -9634446618 | #escort Service in GTB Na...
 

Mining and Comparing Engagement Dynamics Across Multiple Social Media Platforms #websci14

  • 1. Mining and Comparing Engagement Dynamics Across Multiple Social Media Platforms Matthew Rowe Lancaster University, UK @halani harith-alani @halani ACM Web Science Conference (WebSci) 2014, Bloomington, IND http://people.kmi.open.ac.uk/harith/ Harith Alani Knowledge Media institute, UK
  • 3. Moving on … §  How can we move on from these (micro) studies? §  Are results consistent across datasets, and platforms? §  One way forward is: §  Multiple platforms §  Multiple topics
  • 4. Publications on "social media analysis” 0 100 200 300 400 500 600 2006 2007 2008 2009 2010 2011 2012 2013 Publications on "social media analysis"
  • 9. Apples and Oranges §  We mix and compare different features, datasets, and platforms §  Aim is to figure out their similarities and differences
  • 10. Contributions §  Examine replying dynamics as a modality of engagement §  Define a framework of engagement analysis that fits multiple social platforms §  Show the varying features at play in different platforms, and where the similarities and differences are §  Contrast the role of different features on engagement likelihood across five social media platforms §  Compare results to relevant literature on same or different platforms and engagement indicators
  • 11. 7 datasets from 5 platforms Platform Posts Users Seeds Non-seeds Replies Boards.ie 6,120,008 65,528 398,508 81,273 5,640,227 Twitter Random 1,468,766 753,722 144,709 930,262 390,795 Twitter (Haiti Earthquake) 65,022 45,238 1,835 60,686 2,501 Twitter (Obama State of Union Address) 81,458 67,417 11,298 56,135 14,025 SAP 427,221 32,926 87,542 7,276 332,403 Server Fault 234,790 33,285 65,515 6,447 162,828 Facebook 118,432 4,745 15,296 8,123 95,013 Seed posts are those that receive a reply Non-seed posts are those with no replies
  • 12. Data Balancing Platform Seeds Non-seeds Instance Count Boards.ie 398,508 81,273 162,546 Twitter Random 144,709 930,262 289,418 Twitter (Haiti Earthquake) 1,835 60,686 3,670 Twitter (Obama State of Union Address) 11,298 56,135 22,596 SAP 87,542 7,276 14,552 Server Fault 65,515 6,447 12,894 Facebook 15,296 8,123 16,246 Total 521,922 For each dataset, an equal number of seeds and non-seed posts are used in the analysis.
  • 13. Features §  Post Length: number of words in the post §  Complexity: Measures the cumulative entropy of terms in a post §  Readability: Gunning Fog index, gauges how hard the post is to parse by readers, and LIX Readability metric to determine complexity of words based on number of letters §  Referral Count: number of URLs in the post §  Informativeness: TF-IDF of the post §  Polarity: average sentiment polarity of the post (using SentiWordnet) §  In-degree: number of in-coming social connections (explicit or implicit) §  Out-degree: number of out-going social connections (explicit or implicit) §  Post Count: number of posts made in previous 6 months §  User Age: length of membership in community in days §  Post Rate: number of posts by the user per day Social Features Content Features
  • 14. Classification of Posts Seed Posts Non-Seed Posts §  Binary classification model §  Trained with social, content, and combined features §  80/20 training/testing §  Compare results across platforms, to see how a change in each feature is associated with likelihood of engagement §  Compare engagement dynamics from our platforms against the literature
  • 15. Classification Results Feature P R F1 Social 0.592 0.591 0.591 Content 0.664 0.660 0.658 Social+Content 0.670 0.666 0.665 (Random) (Haiti Earthquake) (Obama’s State Union Address) P R F1 0.561 0.561 0.560 0.612 0.612 0.611 0.628 0.628 0.628 P R F1 0.968 0.966 0.966 0.752 0.747 0.747 0.974 0.973 0.973 Feature P R F1 Social 0.542 0.540 0.539 Content 0.650 0.642 0.639 Social+Content 0.656 0.649 0.646 P R F1 0.650 0.631 0.628 0.575 0.541 0.521 0.652 0.632 0.629 P R F1 0.528 0.380 0.319 0.626 0.380 0.275 0.568 0.407 0.359 Feature P R F1 Social 0.635 0.632 0.632 Content 0.641 0.641 0.641 Social+Content 0.660 0.660 0.660 §  Performance of the logistic regression classifier trained over different feature sets and applied to the test set.
  • 16. Effect of features on engagement Boards.ie β −2 −1 0 1 2 Twitter Random β −0.5 0.0 0.5 1.0 Twitter Haiti −6e+16 −4e+16 −2e+16 0e+00 2e+16 4e+16 6e+16 Twitter Union β −0.8 −0.6 −0.4 −0.2 0.0 0.2 Server Fault β −1.0 −0.5 0.0 0.5 1.0 1.5 2.0 SAP β −10 −5 0 5 Facebook β −0.1 0.0 0.1 0.2 0.3 0.4 0.5 In−degree Out−degree Post Count Age Post Rate Post Length Referrals Count Polarity Complexity Readability Readability Fog Informativeness Logistic regression coefficients for each platform's features
  • 17. Significance of regression coefficients Boards.ie p 0.0 0.2 0.4 0.6 0.8 1.0 Titter Random p 0.0 0.2 0.4 0.6 0.8 1.0 Titter Haiti p 0.0 0.2 0.4 0.6 0.8 1.0 Titter Union p 0.0 0.2 0.4 0.6 0.8 1.0 Server Fault p 0.0 0.2 0.4 0.6 0.8 1.0 SAP p 0.0 0.2 0.4 0.6 0.8 1.0 Facebook p 0.0 0.2 0.4 0.6 0.8 1.0 In−degree Out−degree Post Count Age Post Rate Post Length Referrals Count Polarity Complexity Readability Readability Fog Informativeness
  • 18. Comparison to literature §  How performance of our feature compare to other studies on different datasets and platforms?
  • 21. Summary §  We tested the consistency and applicability of engagement patterns across multiple platforms §  Used 12 social/content features that map to 5 platforms §  Studied the impact of those features on engagement across these platforms §  Compared the impact of our features against generally relevant studies in the literature §  Showed that same features could play a different roles in different platforms, or different non-random datasets
  • 22. So what’s Next! §  LOTS! §  Apply same study to more datasets from the same platforms, and from other platforms §  Expand from replies to other engagement indicators §  Improve classification of seeds/non-seeds with more common features §  Further study on impact of topics and non-randomness on engagement dynamics §  Take user type into account – e.g. posts from new agencies are more likely to be tweeted than replied to
  • 23. Questions! 1.  Why those specific datasets and platforms? 2.  What about platform-specific features? 3.  Could we ever get a full understanding of these dynamics across all social platforms? 4.  Could these findings be used to increase engagement? 5.  Who’s right/wrong when the same feature appears to have conflicting impact on the same platform? 6.  Couldn’t be the case that the same feature is used differently in different platforms? 7.  How could we study event-specific engagement dynamics?