SlideShare a Scribd company logo
1 of 24
Predicting Discussions on the Social Semantic Web Matthew Rowe, Sofia Angeletou and HarithAlani Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom
Mass of Social Data Social content is now published at a staggering rate…. Predicting Discussions on the Social Semantic Web 1
Social Data Publication Rates ~600 Tweets per second [1] ~700 Facebook status updates per second [1] Spinn3r dataset collected from Jan – Feb 2011 [2] 133 million blog posts 5.7 million forum posts 231 million social media posts [1] http://searchengineland.com/by-the-numbers-twitter-vs-facebook-vs-google-buzz-36709 [2] http://icwsm.org/data/index.php Predicting Discussions on the Social Semantic Web 2
The New Information Era Predicting Discussions on the Social Semantic Web 3
But…. Analysis is Limited Market Analysts What are people saying about my products? Opinion Mining How are people perceiving a given subject or topic? eGovernment Policy Makers How is a policy or law received by the public? How can I maximise feedback to my content? Predicting Discussions on the Social Semantic Web 4
Attention Economics Given all this data… How do we decide on what information  to focus on?  How do we know what posts will  evolve into discussions? Attention Economics (Goldhaber, 1997) Need to understand key indicators of high-attention discussions 5 Predicting Discussions on the Social Semantic Web
Discussions on Twitter Twitter is used as medium to: Share opinions and ideas Engage in discussions  Discussing events  Debating topics Identifying online discussions enables: Up-to-date public opinion Observation of topics of interest Gauging the popularity of government policies Fine-grained customer support 6 Predicting Discussions on the Social Semantic Web
Predicting Discussions Pre-empt discussions on the Social Web: Identifying seed posts  i.e. posts that start a discussion Will a given post start a discussion? What are the key features of seed posts? Predicting discussion activity levels What is the level of discussion that a seed post will generate? What are the key factors of lengthy discussions? 7 Predicting Discussions on the Social Semantic Web
The Need for Semantics For predictions we require statistical features User features Content features Features provided using differing schemas by different platforms How to overcome heterogeneity? Currently, no ontologies capture such features Predicting Discussions on the Social Semantic Web 8
Behaviour Ontology Predicting Discussions on the Social Semantic Web 9 www.purl.org/NET/oubo/0.23/
Features 10 Predicting Discussions on the Social Semantic Web
Identifying Seed Posts Experiments Haiti and Union Address Datasets Divided each dataset up using 70/20/10 split for training/validation/testing Evaluated a binary classification task Is this post a seed post or not? Precision, Recall, F1 and Area under ROC Tested: user, content, user+content features Tested Perceptron, SVM, Naïve Bayes and J48 11 Predicting Discussions on the Social Semantic Web
Identifying Seed Posts 12 Predicting Discussions on the Social Semantic Web
Identifying Seed Posts What are the most important features? 13 Predicting Discussions on the Social Semantic Web
Identifying Seed Posts What is the correlation between seed posts and features? 14 Predicting Discussions on the Social Semantic Web
Identifying Seed Posts 15 Can we identify seed posts using the top-k features? Predicting Discussions on the Social Semantic Web
Predicting Discussion Activity From identified seed posts: Can we predict the level of discussion activity? How much activity will a post generate? [Wang & Groth, 2010] learns a regression model, and reports on coefficients Identifying relationship between features We do something different: Predict the volume of the discussion 16 Predicting Discussions on the Social Semantic Web
Predicting Discussion Activity 17 Predicting Discussions on the Social Semantic Web
Predicting Discussion Activity Compare rankings Ground truth vs predicted Experiments Using Haiti and Union Address datasets Evaluation measure: NormalisedDiscounted Cumulative Gain Assessing nDCG@k where k={1,5,10,20,50,100) TestedSupport Vector Regression with:  user, content, user+content features 18 Predicting Discussions on the Social Semantic Web
Predicting Discussion Activity 19 Predicting Discussions on the Social Semantic Web
Findings User reputation and standing is crucial eliciting a response starting a discussion Greater broadcast capability = greater likelihood of response More listeners = more discussion Activity levels influenced by out-degree Allow the poster to see response from ‘respected’ peers 20 Predicting Discussions on the Social Semantic Web
Conclusions Pre-empt discussions to empower Market analysts Opinion mining eGovernment policy makers Behaviour ontology Captures impact across platforms Approach accurately predicts: Which posts will yield a reply, and; The level of discussion activity Predicting Discussions on the Social Semantic Web 21
Current and Future Work Experiments over a forum dataset Content features >> user features Different platform dynamics Extend experiments to a random Twitter dataset Extension to behaviour ontology Captures concentration i.e. focus of a user on specific topics Categorising users by role Based on observed behaviour Predicting Discussions on the Social Semantic Web 22
Questions Questions? people.kmi.open.ac.uk/rowe m.c.rowe@open.ac.uk @mattroweshow Predicting Discussions on the Social Semantic Web 23

More Related Content

What's hot

Research on collaborative information sharing systems
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systems
Davide Eynard
 
int.ere.st: SCOT-based Tag Sharing Services
int.ere.st: SCOT-based Tag Sharing Servicesint.ere.st: SCOT-based Tag Sharing Services
int.ere.st: SCOT-based Tag Sharing Services
Haklae Kim
 
A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...
A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...
A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...
annehelmond
 

What's hot (20)

Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...
 
Tutorial: Social Semantics
Tutorial: Social SemanticsTutorial: Social Semantics
Tutorial: Social Semantics
 
The “use” of an electronic resource from a social network analysis perspective
The “use” of an electronic resource from a social network analysis perspectiveThe “use” of an electronic resource from a social network analysis perspective
The “use” of an electronic resource from a social network analysis perspective
 
DataPortability and Me: Introducing SIOC, FOAF and the Semantic Web
DataPortability and Me: Introducing SIOC, FOAF and the Semantic WebDataPortability and Me: Introducing SIOC, FOAF and the Semantic Web
DataPortability and Me: Introducing SIOC, FOAF and the Semantic Web
 
Data Portability with SIOC and FOAF
Data Portability with SIOC and FOAFData Portability with SIOC and FOAF
Data Portability with SIOC and FOAF
 
Graph Structure In The Web
Graph Structure In The WebGraph Structure In The Web
Graph Structure In The Web
 
Web 2.0 2006: Implications for the LMS
Web 2.0 2006: Implications for the LMSWeb 2.0 2006: Implications for the LMS
Web 2.0 2006: Implications for the LMS
 
The Social Semantic Web: New York Times Edition
The Social Semantic Web: New York Times EditionThe Social Semantic Web: New York Times Edition
The Social Semantic Web: New York Times Edition
 
Data Accessibility and Me: Introducing SIOC, FOAF and the Linked Data Web
Data Accessibility and Me: Introducing SIOC, FOAF and the Linked Data WebData Accessibility and Me: Introducing SIOC, FOAF and the Linked Data Web
Data Accessibility and Me: Introducing SIOC, FOAF and the Linked Data Web
 
Web 2.0 and the LMS
Web 2.0 and the LMSWeb 2.0 and the LMS
Web 2.0 and the LMS
 
SDoW2010 keynote
SDoW2010 keynoteSDoW2010 keynote
SDoW2010 keynote
 
The Social Semantic Web
The Social Semantic Web The Social Semantic Web
The Social Semantic Web
 
Research on collaborative information sharing systems
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systems
 
2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social Semantics
 
Linking In-Game Events and Entities to Social Data on the Web
Linking In-Game Events and Entities to Social Data on the WebLinking In-Game Events and Entities to Social Data on the Web
Linking In-Game Events and Entities to Social Data on the Web
 
int.ere.st: SCOT-based Tag Sharing Services
int.ere.st: SCOT-based Tag Sharing Servicesint.ere.st: SCOT-based Tag Sharing Services
int.ere.st: SCOT-based Tag Sharing Services
 
A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...
A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...
A Protest’s Web: The Cross-Syndication Practices of G20 Toronto Summit Online...
 
Semantic Web 2.0: Creating Social Semantic Information Spaces
Semantic Web 2.0: Creating Social Semantic Information SpacesSemantic Web 2.0: Creating Social Semantic Information Spaces
Semantic Web 2.0: Creating Social Semantic Information Spaces
 
Personal Digital Archiving 2011 - Charting Collections of Connections in Soci...
Personal Digital Archiving 2011 - Charting Collections of Connections in Soci...Personal Digital Archiving 2011 - Charting Collections of Connections in Soci...
Personal Digital Archiving 2011 - Charting Collections of Connections in Soci...
 

Viewers also liked

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Tomek Pluskiewicz
 
Social Networks, Dominance And Interoperability
Social Networks, Dominance And InteroperabilitySocial Networks, Dominance And Interoperability
Social Networks, Dominance And Interoperability
blogzilla
 

Viewers also liked (17)

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
PLNs, CoPs, and Connectivism
PLNs, CoPs, and ConnectivismPLNs, CoPs, and Connectivism
PLNs, CoPs, and Connectivism
 
Using narratives in enterprise gamification for sales, training, service and ...
Using narratives in enterprise gamification for sales, training, service and ...Using narratives in enterprise gamification for sales, training, service and ...
Using narratives in enterprise gamification for sales, training, service and ...
 
The digital traces of user generated content
The digital traces of user generated contentThe digital traces of user generated content
The digital traces of user generated content
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
 
Increase your college’s visibility with content curation
Increase your college’s visibility with content curationIncrease your college’s visibility with content curation
Increase your college’s visibility with content curation
 
Social Networks, Dominance And Interoperability
Social Networks, Dominance And InteroperabilitySocial Networks, Dominance And Interoperability
Social Networks, Dominance And Interoperability
 
Social Media and Scholarly Communication
Social Media and Scholarly CommunicationSocial Media and Scholarly Communication
Social Media and Scholarly Communication
 
Gamification: How it can be used to Engage Library Users
Gamification: How it can be used to Engage Library UsersGamification: How it can be used to Engage Library Users
Gamification: How it can be used to Engage Library Users
 
Global inspiration, local action #ili2014
Global inspiration, local action #ili2014Global inspiration, local action #ili2014
Global inspiration, local action #ili2014
 
Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Twitter as a First Draft of the Present – and the Challenges of Preserving It...Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Twitter as a First Draft of the Present – and the Challenges of Preserving It...
 
Effective Content Curation in Higher Ed
Effective Content Curation in Higher EdEffective Content Curation in Higher Ed
Effective Content Curation in Higher Ed
 
Why Semantic Knowledge Graphs matter
Why Semantic Knowledge Graphs matterWhy Semantic Knowledge Graphs matter
Why Semantic Knowledge Graphs matter
 
Gamification in Libraries
Gamification in LibrariesGamification in Libraries
Gamification in Libraries
 
Semantic Web and Web 3.0 - Web Technologies (1019888BNR)
Semantic Web and Web 3.0 - Web Technologies (1019888BNR)Semantic Web and Web 3.0 - Web Technologies (1019888BNR)
Semantic Web and Web 3.0 - Web Technologies (1019888BNR)
 
How to pass a coding interview as an automation developer talk - Oct 17 2016
How to pass a coding interview as an automation developer talk - Oct 17 2016How to pass a coding interview as an automation developer talk - Oct 17 2016
How to pass a coding interview as an automation developer talk - Oct 17 2016
 

Similar to Predicting Discussions on the Social Semantic Web

Anticipating Discussion Activity on Community Forums
Anticipating Discussion Activity on Community ForumsAnticipating Discussion Activity on Community Forums
Anticipating Discussion Activity on Community Forums
Matthew Rowe
 
Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...
Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...
Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...
Anna De Liddo
 
Jawad Haqbeen from Nagoya Institute of Technology
Jawad Haqbeen from Nagoya Institute of Technology	Jawad Haqbeen from Nagoya Institute of Technology
Jawad Haqbeen from Nagoya Institute of Technology
Jawad Haqbeen
 

Similar to Predicting Discussions on the Social Semantic Web (20)

Anticipating Discussion Activity on Community Forums
Anticipating Discussion Activity on Community ForumsAnticipating Discussion Activity on Community Forums
Anticipating Discussion Activity on Community Forums
 
Extension 2.0
Extension 2.0Extension 2.0
Extension 2.0
 
Social Web .20 Class Week 6: Lightweight Authoring, Blogs, Wikis
Social Web .20 Class Week 6: Lightweight Authoring, Blogs, WikisSocial Web .20 Class Week 6: Lightweight Authoring, Blogs, Wikis
Social Web .20 Class Week 6: Lightweight Authoring, Blogs, Wikis
 
Tn T Horizons April 28 2008
Tn T Horizons April 28 2008Tn T Horizons April 28 2008
Tn T Horizons April 28 2008
 
Understanding User-Community Engagement by Multi-faceted Features: A Case ...
Understanding User-Community Engagement by Multi-faceted Features: A Case ...Understanding User-Community Engagement by Multi-faceted Features: A Case ...
Understanding User-Community Engagement by Multi-faceted Features: A Case ...
 
Using ICTs to Promote Cultural Change: A Study from a Higher Education Context
Using ICTs to Promote Cultural Change: A Study from a Higher Education ContextUsing ICTs to Promote Cultural Change: A Study from a Higher Education Context
Using ICTs to Promote Cultural Change: A Study from a Higher Education Context
 
Mining and Comparing Engagement Dynamics Across Multiple Social Media Platfor...
Mining and Comparing Engagement Dynamics Across Multiple Social Media Platfor...Mining and Comparing Engagement Dynamics Across Multiple Social Media Platfor...
Mining and Comparing Engagement Dynamics Across Multiple Social Media Platfor...
 
Engagement, Impact, Value: Measuring and Maximising Impact Using the Social Web
Engagement, Impact, Value: Measuring and Maximising Impact Using the Social WebEngagement, Impact, Value: Measuring and Maximising Impact Using the Social Web
Engagement, Impact, Value: Measuring and Maximising Impact Using the Social Web
 
Horizon_INACAP
Horizon_INACAPHorizon_INACAP
Horizon_INACAP
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Yahoo! Engagement Study
Yahoo! Engagement StudyYahoo! Engagement Study
Yahoo! Engagement Study
 
Building and Communicating Evidence of Effectiveness in OER through Collectiv...
Building and Communicating Evidence of Effectiveness in OER through Collectiv...Building and Communicating Evidence of Effectiveness in OER through Collectiv...
Building and Communicating Evidence of Effectiveness in OER through Collectiv...
 
Metrics for Understanding Personal and Institutional Use of the Social Web
Metrics for Understanding Personal and Institutional Use of the Social WebMetrics for Understanding Personal and Institutional Use of the Social Web
Metrics for Understanding Personal and Institutional Use of the Social Web
 
Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...
Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...
Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...
 
An Introduction To The Knowledge Hub
An Introduction To The Knowledge HubAn Introduction To The Knowledge Hub
An Introduction To The Knowledge Hub
 
Social smarts presenation marshall sponder - updated 9-16-12
Social smarts presenation   marshall sponder - updated 9-16-12Social smarts presenation   marshall sponder - updated 9-16-12
Social smarts presenation marshall sponder - updated 9-16-12
 
Jawad Haqbeen from Nagoya Institute of Technology
Jawad Haqbeen from Nagoya Institute of Technology	Jawad Haqbeen from Nagoya Institute of Technology
Jawad Haqbeen from Nagoya Institute of Technology
 
Measuring All of YouTube
Measuring All of YouTubeMeasuring All of YouTube
Measuring All of YouTube
 
Lifecasting Microblogging
Lifecasting MicrobloggingLifecasting Microblogging
Lifecasting Microblogging
 
Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?Who are the top influencers and what characterizes them?
Who are the top influencers and what characterizes them?
 

More from Matthew Rowe

Existing Research and Future Research Agenda
Existing Research and Future Research AgendaExisting Research and Future Research Agenda
Existing Research and Future Research Agenda
Matthew Rowe
 
Modelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online CommunitiesModelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online Communities
Matthew Rowe
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
Matthew Rowe
 
Forecasting Audience Increase on Youtube
Forecasting Audience Increase on YoutubeForecasting Audience Increase on Youtube
Forecasting Audience Increase on Youtube
Matthew Rowe
 
PhD Viva - Disambiguating Identity Web References using Social Data
PhD Viva - Disambiguating Identity Web References using Social DataPhD Viva - Disambiguating Identity Web References using Social Data
PhD Viva - Disambiguating Identity Web References using Social Data
Matthew Rowe
 

More from Matthew Rowe (20)

Social Computing Research with Apache Spark
Social Computing Research with Apache SparkSocial Computing Research with Apache Spark
Social Computing Research with Apache Spark
 
Predicting Online Community Churners using Gaussian Sequences
Predicting Online Community Churners using Gaussian SequencesPredicting Online Community Churners using Gaussian Sequences
Predicting Online Community Churners using Gaussian Sequences
 
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
Transferring Semantic Categories with Vertex Kernels: Recommendations with Se...
 
SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings
SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting RatingsSemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings
SemanticSVD++: Incorporating Semantic Taste Evolution for Predicting Ratings
 
The Semantic Evolution of Online Communities
The Semantic Evolution of Online CommunitiesThe Semantic Evolution of Online Communities
The Semantic Evolution of Online Communities
 
From Mining to Understanding: The Evolution of Social Web Users
From Mining to Understanding: The Evolution of Social Web UsersFrom Mining to Understanding: The Evolution of Social Web Users
From Mining to Understanding: The Evolution of Social Web Users
 
Mining User Lifecycles from Online Community Platforms and their Application ...
Mining User Lifecycles from Online Community Platforms and their Application ...Mining User Lifecycles from Online Community Platforms and their Application ...
Mining User Lifecycles from Online Community Platforms and their Application ...
 
From User Needs to Community Health: Mining User Behaviour to Analyse Online ...
From User Needs to Community Health: Mining User Behaviour to Analyse Online ...From User Needs to Community Health: Mining User Behaviour to Analyse Online ...
From User Needs to Community Health: Mining User Behaviour to Analyse Online ...
 
Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...
Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...
Changing with Time: Modelling and Detecting User Lifecycle Periods in Online ...
 
Identity: Physical, Cyber, Future
Identity: Physical, Cyber, FutureIdentity: Physical, Cyber, Future
Identity: Physical, Cyber, Future
 
Measuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMeasuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online Communities
 
Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...
Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...
Who will follow whom? Exploiting Semantics for Link Prediction in Attention-I...
 
Attention Economics in Social Web Systems
Attention Economics in Social Web SystemsAttention Economics in Social Web Systems
Attention Economics in Social Web Systems
 
What makes communities tick? Community health analysis using role compositions
What makes communities tick? Community health analysis using role compositionsWhat makes communities tick? Community health analysis using role compositions
What makes communities tick? Community health analysis using role compositions
 
Existing Research and Future Research Agenda
Existing Research and Future Research AgendaExisting Research and Future Research Agenda
Existing Research and Future Research Agenda
 
Modelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online CommunitiesModelling and Analysis of User Behaviour in Online Communities
Modelling and Analysis of User Behaviour in Online Communities
 
Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in Social Web SystemsUsing Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
Using Behaviour Analysis to Detect Cultural Aspects in Social Web Systems
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
 
Forecasting Audience Increase on Youtube
Forecasting Audience Increase on YoutubeForecasting Audience Increase on Youtube
Forecasting Audience Increase on Youtube
 
PhD Viva - Disambiguating Identity Web References using Social Data
PhD Viva - Disambiguating Identity Web References using Social DataPhD Viva - Disambiguating Identity Web References using Social Data
PhD Viva - Disambiguating Identity Web References using Social Data
 

Recently uploaded

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 

Predicting Discussions on the Social Semantic Web

  • 1. Predicting Discussions on the Social Semantic Web Matthew Rowe, Sofia Angeletou and HarithAlani Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom
  • 2. Mass of Social Data Social content is now published at a staggering rate…. Predicting Discussions on the Social Semantic Web 1
  • 3. Social Data Publication Rates ~600 Tweets per second [1] ~700 Facebook status updates per second [1] Spinn3r dataset collected from Jan – Feb 2011 [2] 133 million blog posts 5.7 million forum posts 231 million social media posts [1] http://searchengineland.com/by-the-numbers-twitter-vs-facebook-vs-google-buzz-36709 [2] http://icwsm.org/data/index.php Predicting Discussions on the Social Semantic Web 2
  • 4. The New Information Era Predicting Discussions on the Social Semantic Web 3
  • 5. But…. Analysis is Limited Market Analysts What are people saying about my products? Opinion Mining How are people perceiving a given subject or topic? eGovernment Policy Makers How is a policy or law received by the public? How can I maximise feedback to my content? Predicting Discussions on the Social Semantic Web 4
  • 6. Attention Economics Given all this data… How do we decide on what information to focus on? How do we know what posts will evolve into discussions? Attention Economics (Goldhaber, 1997) Need to understand key indicators of high-attention discussions 5 Predicting Discussions on the Social Semantic Web
  • 7. Discussions on Twitter Twitter is used as medium to: Share opinions and ideas Engage in discussions Discussing events Debating topics Identifying online discussions enables: Up-to-date public opinion Observation of topics of interest Gauging the popularity of government policies Fine-grained customer support 6 Predicting Discussions on the Social Semantic Web
  • 8. Predicting Discussions Pre-empt discussions on the Social Web: Identifying seed posts i.e. posts that start a discussion Will a given post start a discussion? What are the key features of seed posts? Predicting discussion activity levels What is the level of discussion that a seed post will generate? What are the key factors of lengthy discussions? 7 Predicting Discussions on the Social Semantic Web
  • 9. The Need for Semantics For predictions we require statistical features User features Content features Features provided using differing schemas by different platforms How to overcome heterogeneity? Currently, no ontologies capture such features Predicting Discussions on the Social Semantic Web 8
  • 10. Behaviour Ontology Predicting Discussions on the Social Semantic Web 9 www.purl.org/NET/oubo/0.23/
  • 11. Features 10 Predicting Discussions on the Social Semantic Web
  • 12. Identifying Seed Posts Experiments Haiti and Union Address Datasets Divided each dataset up using 70/20/10 split for training/validation/testing Evaluated a binary classification task Is this post a seed post or not? Precision, Recall, F1 and Area under ROC Tested: user, content, user+content features Tested Perceptron, SVM, Naïve Bayes and J48 11 Predicting Discussions on the Social Semantic Web
  • 13. Identifying Seed Posts 12 Predicting Discussions on the Social Semantic Web
  • 14. Identifying Seed Posts What are the most important features? 13 Predicting Discussions on the Social Semantic Web
  • 15. Identifying Seed Posts What is the correlation between seed posts and features? 14 Predicting Discussions on the Social Semantic Web
  • 16. Identifying Seed Posts 15 Can we identify seed posts using the top-k features? Predicting Discussions on the Social Semantic Web
  • 17. Predicting Discussion Activity From identified seed posts: Can we predict the level of discussion activity? How much activity will a post generate? [Wang & Groth, 2010] learns a regression model, and reports on coefficients Identifying relationship between features We do something different: Predict the volume of the discussion 16 Predicting Discussions on the Social Semantic Web
  • 18. Predicting Discussion Activity 17 Predicting Discussions on the Social Semantic Web
  • 19. Predicting Discussion Activity Compare rankings Ground truth vs predicted Experiments Using Haiti and Union Address datasets Evaluation measure: NormalisedDiscounted Cumulative Gain Assessing nDCG@k where k={1,5,10,20,50,100) TestedSupport Vector Regression with: user, content, user+content features 18 Predicting Discussions on the Social Semantic Web
  • 20. Predicting Discussion Activity 19 Predicting Discussions on the Social Semantic Web
  • 21. Findings User reputation and standing is crucial eliciting a response starting a discussion Greater broadcast capability = greater likelihood of response More listeners = more discussion Activity levels influenced by out-degree Allow the poster to see response from ‘respected’ peers 20 Predicting Discussions on the Social Semantic Web
  • 22. Conclusions Pre-empt discussions to empower Market analysts Opinion mining eGovernment policy makers Behaviour ontology Captures impact across platforms Approach accurately predicts: Which posts will yield a reply, and; The level of discussion activity Predicting Discussions on the Social Semantic Web 21
  • 23. Current and Future Work Experiments over a forum dataset Content features >> user features Different platform dynamics Extend experiments to a random Twitter dataset Extension to behaviour ontology Captures concentration i.e. focus of a user on specific topics Categorising users by role Based on observed behaviour Predicting Discussions on the Social Semantic Web 22
  • 24. Questions Questions? people.kmi.open.ac.uk/rowe m.c.rowe@open.ac.uk @mattroweshow Predicting Discussions on the Social Semantic Web 23

Editor's Notes

  1. Web Science talk!!!
  2. Provision of a range of applicationsLarge-scale uptake of smart phones
  3. How can I track discussions?
  4. What features are correlated with discussions?
  5. User features more important than content featuresSignificant at alpha = 0.01User reputation therefore more important than the quality of content!
  6. Over training split!Pearsoncorrelation: 0.674 = Fair agreement
  7. Ran experiments using j48 with all features, testing on the test split
  8. Fitted Kolmogorov-Smirnov goodness of fit test
  9. For haitiContent features more important than user features at higher ranksUser features show greater performance as ranks are increasedUnion:User features outperform content for all ranks apart from topIndicates the differences in the dynamics of the data
  10. Attention Economics!!